Corpus Information

Corpus Information - Speech Recognition Final Project...

Info iconThis preview shows pages 1–9. Sign up to view the full content.

View Full Document Right Arrow Icon
Speech Recognition Final Project Resources Professor: Dr. Veton Kepuska Class: ECE5526 Speech Recognition Student: Chih-Ti Shih
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
FTP Server Information Host: 163.118.203.219 User ID: student Password: student Port:21
Background image of page 2
Callhome English Speech Corpus The Callhome English Speech Corpus, produced by the Linguistic Data Consortium. The CALLHOME English corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of English.
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Callhome English Speech Corpus - directory callhome/doc: directory of documentation for Callhome English speech. callhome/english: path to the speech data files, divided into train, devtest and evltest. 0README.1 st : Corpus information file.
Background image of page 4
TIMIT Acoustic-Phonetic Continuous Speech Corpus The TIMIT corpus of read speech has been designed to provide speech data for the acquisition of acoustic-phonetic knowledge and for the development and evaluation of automatic speech recognition systems. TIMIT contains a total of 6300 sentences, 10 sentences spoken by each of 630 speakers from 8 major dialect regions of the United States.
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
TIMIT Acoustic-Phonetic Continuous Speech Corpus
Background image of page 6
TIMIT Acoustic-Phonetic Continuous Speech Corpus
Background image of page 7

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
FFM TIMIT The FFMTIMIT corpus contains the previously unreleased secondary microphone recordings of the TIMIT corpus. FFMTIMIT contains a total of 6130 sentences, 10 sentences spoken by each of 613 speakers from 8 major dialect regions of the United States.
Background image of page 8
Image of page 9
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 02/11/2012 for the course ECE 5526 taught by Professor Staff during the Summer '09 term at FIT.

Page1 / 26

Corpus Information - Speech Recognition Final Project...

This preview shows document pages 1 - 9. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online