Therefore, many of the computational methods described in this book are applicable.
First, the corpus contains two layers of annotation, at the phonetic and orthographic levels.
A third property is that there is a sharp division between the original linguistic event captured as an audio recording, and the annotations of that event.
The same holds true of text corpora, in the sense that the original text usually has an external source, and is considered to be an immutable artifact.
It was designed to provide data for the acquisition of acoustic-phonetic knowledge and to support the development and evaluation of automatic speech recognition systems.
Like the Brown Corpus, which displays a balanced selection of text genres and sources, TIMIT includes a balanced selection of dialects, speakers, and materials.