Cluster Analysis for Corpus Linguistics / / Hermann Moisl.

The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics t...

Full description

Saved in:
Bibliographic Details
Superior document:Title is part of eBook package: De Gruyter DG Plus DeG Package 2015 Part 1
VerfasserIn:
Place / Publishing House:Berlin ;, Boston : : De Gruyter Mouton, , [2015]
©2015
Year of Publication:2015
Language:English
Series:Quantitative Linguistics [QL] , 66
Online Access:
Physical Description:1 online resource (381 p.)
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Other title:Frontmatter --
Preface --
Contents --
List of figures --
1. Introduction --
2. Motivation --
3. Data --
4. Cluster --
5. Hypothesis generation --
6. Literature Review --
7. Conclusion --
8. Appendix --
References --
Subject index
Summary:The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.
Format:Mode of access: Internet via World Wide Web.
ISBN:9783110363814
9783110762518
9783110700985
9783110742961
9783110439687
9783110438710
ISSN:0179-3616 ;
DOI:10.1515/9783110363814
Access:restricted access
Hierarchical level:Monograph
Statement of Responsibility: Hermann Moisl.