The following is a complete listing of the 60 discourse segments published in Parts 1-4 of the Santa Barbara Corpus of American English. For each segment identification number (e.g. SBC001), the corresponding transcription file includes the extension “.trn,” yielding SBC001.trn. The corresponding audio file includes the extension “.wav,” yielding SBC001.wav. Transcription files are in plain text format (ASCII), while audio files are in WAV format (for details, see Recordings).
List of discourse segments:
[1] Numerically by filename
[2] Alphabetically by title |