Stefan Th. Gries
Contact information
Last updated: 01 August 2016

Teaching at the University of California, Santa Barbara

Ling 201: Research methods and statistics (Fall 2017)

Syllabus and overview

This course is a hands-on introduction to fundamentals of quantitative/statistical methodology in linguistics. It is based on the second edition of my textbook Statistics for linguistics with R: a practical introduction (2013). We begin by looking at a few basic notions such as variables and hypotheses. We then discuss the logic of quantitative studies using the null-hypothesis falsification approach and familiarized ourselves with how to set up factorial experiments and how data from experiments and corpora should be set up for subsequent statistical evaluation. Then, we are concerned with a variety of descriptive graphs and statistics for frequency data, averages, dispersions, and correlations. The largest part is concerned with a variety of statistical tests: distribution fitting tests, tests for independence, and tests for differences for frequencies, means, dispersions, and correlations. We end with a small primer for the kind of multifactorial methods that are the subject of Ling 202. We use the open source software tool R .

Downloads for class sessions
(files will be available as appropriate)

Session 01: slides
Session 02: exercise code, exercise data (must be unzipped), the answer key, and the experimental design spreadsheet
Session 03-04: exercise code, exercise data (must be unzipped), and the answer key
Session 05: exercise code and the answer key
Session 06: exercise code and the answer key
Session 07: exercise code, data for question 2, data for question 4 and the answer key
Session 08: exercise code on functions and plotting examples
Session 09: exercise code, the answer key, and the Berkeley admissions data


assignment 1: code (.r) and data (.csv)
assignment 2: code (.r)
assignment 3: code (.r) and data (.csv)
assignment 'final': code (.r) and data (must be unzipped)

Links to relevant software and sites

R (current version: 3.4.2)
RStudio (current stable version: 1.0.153)

LibreOffice (current stable version:
my 2013 statistics textbook, its companion website, and its StatForLing with R newsgroup, which I moderate.