UCSB Linguistics Department main page Susanna Cumming's department page Susanna Cumming's department page
University of California at Santa Barbara main page

 

Database programs (or DBMSs, "Database Management Systems") are extremely useful tools for the linguist; they allow efficient storage and rapid manipulation of large quantities of data, and have no competition when it comes to testing complex quantitative hypotheses quickly and easily. With a corpus properly coded in a well-designed relational database, you can get an answer to a question like "how many relative clauses have heads which are definite patients" in seconds.

However, they are rather different from other kinds of software most users are familiar with, such as word processors and spreadsheets, and they can be intimidating at first. This guide is intended as an introduction to database concepts and principles.

This is not a step-by-step "how to" guide to any particular piece of database software. The paradigm described here is shared by many different products. Some of those currently on the market, with which people are having success doing linguistic analysis, include:

A piece of advice that applies to these or any other large software products that you plan to use for research:

Learning a complex piece of software while also trying to do research and writing under a deadline is a recipe for frustration. Software can be your friend, but like any other friendship it requires time and commitment to flourish.