Adaptive Languages : : An Information-Theoretic Account of Linguistic Diversity / / Christian Bentz.
Languages carry information. To fulfil this purpose, they employ a multitude of coding strategies. This book explores a core property of linguistic coding – called lexical diversity. Parallel text corpora of overall more than 1800 texts written in more than 1200 languages are the basis for computati...
Saved in:
Superior document: | Title is part of eBook package: De Gruyter DG Plus DeG Package 2018 Part 1 |
---|---|
VerfasserIn: | |
Place / Publishing House: | Berlin ;, Boston : : De Gruyter Mouton, , [2018] ©2018 |
Year of Publication: | 2018 |
Language: | English |
Series: | Trends in Linguistics. Studies and Monographs [TiLSM] ,
316 |
Online Access: | |
Physical Description: | 1 online resource (XV, 218 p.) |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Other title: | Frontmatter -- Preface -- Contents -- Abbreviations -- List of Tables -- List of Figures -- 1. Introduction -- 2. Languages as Adaptive Systems -- 3. Language Change and Population Structure -- 4. Lexical Diversity across Languages of the World -- 5. Descriptive Factors: Language “Internal” Effects -- 6. Explanatory Factors: Language “External” Effects -- 7. Grouping Factors: Language Families and Areas -- 8. Predicting Lexical Diversity: Statistical Models -- 9. Explaining Diversity: Multiple Factors Interacting -- 10. Further Problems and Caveats -- 11. Conclusions: Universality and Diversity -- 12. Appendix A: Advanced Entropy Estimators -- 13. Appendix B: Multiple Regression Assumptions -- 14. Appendix C: Mixed-effects Regression Assumptions -- Bibliography -- Index |
---|---|
Summary: | Languages carry information. To fulfil this purpose, they employ a multitude of coding strategies. This book explores a core property of linguistic coding – called lexical diversity. Parallel text corpora of overall more than 1800 texts written in more than 1200 languages are the basis for computational analyses. Different measures of lexical diversity are discussed and tested, and Shannon’s measure of uncertainty – the entropy – is chosen to assess differences in the distributions of words. To further explain this variation, a range of descriptive, explanatory, and grouping factors are considered in a series of statistical models. The first category includes writing systems, word-formation patterns, registers and styles. The second category includes population size, non-native speaker proportions and language status. Grouping factors further elicit whether the results extrapolate across – or are limited to – specific language families and areas. This account marries information-theoretic methods with a complex systems framework, illustrating how languages adapt to the varying needs of their users. It sheds light on the puzzling diversity of human languages in a quantitative, data driven and reproducible manner. |
Format: | Mode of access: Internet via World Wide Web. |
ISBN: | 9783110560107 9783110762488 9783110719550 9783110742978 9783110604252 9783110603255 9783110604078 9783110603170 |
ISSN: | 1861-4302 ; |
DOI: | 10.1515/9783110560107 |
Access: | restricted access |
Hierarchical level: | Monograph |
Statement of Responsibility: | Christian Bentz. |