Analyzing Linguistic Data: A Practical Introduction to Statistics using RStatistical analysis is a useful skill for linguists and psycholinguists, allowing them to understand the quantitative structure of their data. This textbook provides a straightforward introduction to the statistical analysis of language. Designed for linguists with a non-mathematical background, it clearly introduces the basic principles and methods of statistical analysis, using 'R', the leading computational statistics programme. The reader is guided step-by-step through a range of real data sets, allowing them to analyse acoustic data, construct grammatical trees for a variety of languages, quantify register variation in corpus linguistics, and measure experimental data using state-of-the-art models. The visualization of data plays a key role, both in the initial stages of data exploration and later on when the reader is encouraged to criticize various models. Containing over 40 exercises with model answers, this book will be welcomed by all linguists wishing to learn more about working with and presenting quantitative data. |
Contents
Section 1 | 306 |
Section 2 | 313 |
Section 3 | 314 |
Section 4 | 317 |
Section 5 | 322 |
Section 6 | 323 |
Section 7 | 326 |
Section 8 | 335 |
Section 9 | 337 |
Other editions - View all
Analyzing Linguistic Data: A Practical Introduction to Statistics Using R R. Harald Baayen No preview available - 2008 |
Analyzing Linguistic Data: A Practical Introduction to Statistics Using R R. H. Baayen No preview available - 2008 |
Common terms and phrases
1me4 package AgeSubject alice anova BehavioralScore boxplot chi-squared test Chisq Chi Df cLength Coefficients Cook's distance countOfAlice countOfHare.tab countOfVery cross-validation data frame data set Df Pr(>Chisq distribution Dxy R2 English Error t value Estimate Std EtymAge F-statistic Figure A.6 finalDevoicing finalDevoicing.1rm finalDevoicing.rpl Goodness-of-fit multivariate chi-squared InflectionalEntropy Intercept Kolmogorov-Smirnov test lambda lexdec3 lexdec3.1merE2 lexical linear model lmer log frequency LogFrequency logistic regression mean countOfHare MeanFamiliarity meanWeight mfrow mixed-effects moby Multiple R-Squared multivariate chi-squared test naming.ols NcountStem nessdemog.fzm nessdemog.spc nessw.gigp nessw.lnre.spc nessw.spc Nonlinear Nsyll NVratio Obstruent Onset2Type outliers overfitting p-value p(regular p(voiceless PastBreakPoint plot density Poisson distribution ppois predictors PubDate pvals Q-Q Plot quantile-quantile plots R-squared random effects rcs WrittenFrequency Residual standard error RTnaming sample quantiles scatterplot adds scatterplot matrix ShiftedLogDistance Slope spanishFunctionWords.t spanishMeta Subject Theoretical Quantiles value Pr(>|t vector Vocabulary Growth voiced voiceless VowelType warlpiri Word WrittenFrequency WrittenSpokenRatio writtenVariationLijk X2 df xlim xtabs ylab ylim