Signal Processing and Speech Communication Laboratory

TIMIT

Acronym
TIMIT
Type
Database
Sampling frequency
16 kHz
Condition
Clean
Segmentation method
Manual
Segmentation level
Word and phone
Language
en
Number of speakers
630
Number of utterances
5040
Number of channels
1

The TIMIT corpus of read speech has been designed to provide speech data for the acquisition of acoustic-phonetic knowledge and for the development and evaluation of automatic speech recognition systems.  TIMIT has resulted from the joint efforts of several sites under sponsorship from the Defense Advanced Research Projects Agency - Information Science and Technology Office (DARPA-ISTO).  Text corpus design was a joint effort among the Massachusetts Institute of Technology (MIT), Stanford Research Institute (SRI), and Texas Instruments (TI).  The speech was recorded at TI, transcribed at MIT, and has been maintained, verified, and prepared for CD-ROM production by the National Institute of Standards and Technology (NIST). 

TIMIT contains phonetically balanced sentences read by 630 speakers (70% of whom were male) from eight major dialects of American English. The corpus is divided into a training set and a test set, and no sentence or speaker appears in both. The training set consists of 3,696 utterances and the test set of 1,344 utterances. The files are available in  The TIMIT database was hand-segmented and hand-labeled at the word level and at the phone level using 59 Arpabet symbols.
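
The split property described above (no speaker and no sentence shared between training and test) can be checked mechanically. The following is a minimal sketch, not part of the TIMIT distribution: the `Utterance` record, the `split_is_disjoint` helper, and the toy speaker and sentence IDs are all hypothetical names chosen for illustration. It also confirms that the quoted set sizes add up to the 5,040 utterances listed in the summary.

```python
from typing import Iterable, NamedTuple


class Utterance(NamedTuple):
    """Hypothetical record: one utterance identified by speaker and sentence."""
    speaker: str
    sentence: str


def split_is_disjoint(train: Iterable[Utterance], test: Iterable[Utterance]) -> bool:
    """Return True if no speaker and no sentence occurs in both sets."""
    train = list(train)
    test = list(test)
    shared_speakers = {u.speaker for u in train} & {u.speaker for u in test}
    shared_sentences = {u.sentence for u in train} & {u.sentence for u in test}
    return not shared_speakers and not shared_sentences


# Counts quoted in the description.
N_TRAIN, N_TEST, N_SPEAKERS = 3696, 1344, 630
TOTAL = N_TRAIN + N_TEST            # total number of utterances
PER_SPEAKER = TOTAL / N_SPEAKERS    # average utterances per speaker

# Toy data with made-up IDs (not real TIMIT identifiers).
toy_train = [Utterance("spk_a", "sent_1"), Utterance("spk_b", "sent_2")]
toy_test = [Utterance("spk_c", "sent_3")]
toy_leaky_test = [Utterance("spk_a", "sent_4")]  # reuses a training speaker

print(TOTAL, PER_SPEAKER)
print(split_is_disjoint(toy_train, toy_test))
print(split_is_disjoint(toy_train, toy_leaky_test))
```

In practice the records would be built from whatever file listing a local copy of the corpus provides; the check itself does not depend on the directory layout.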