224s.09.lec4

224s.09.lec4 - CS 224S LINGUIST 281 Speech Recognition Synthesis and Dialogue Dan Jurafsky Lecture 4 Intro to Festival rest of Text Normalization

Info iconThis preview shows pages 1–11. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: CS 224S / LINGUIST 281 Speech Recognition, Synthesis, and Dialogue Dan Jurafsky 1/5/07 Lecture 4: Intro to Festival; rest of Text Normalization; Letter-to-Sound IP Notice: lots of info, text, and diagrams on these slides comes (thanks!) from Alan Black’s excellent lecture notes and from Richard Sproat’s great new slides. Outline • Overview of Festival Where it lives, its components Its scripting language: Scheme • Finishing up Part of Speech Tagging • Phonetic Analysis Dictionaries Names Letter-to-Sound Rules (or “Grapheme-to-Phoneme Conversion”) 1/5/07 Festival • Open source speech synthesis system • Designed for development and runtime use Use in many commercial and academic systems Distributed with RedHat 9.x, etc Hundreds of thousands of users Multilingual No built-in language Designed to allow addition of new languages Additional tools for rapid voice development Statistical learning tools Scripts for building models 1/5/07 Text from Richard Sproat Festival as software • http://festvox.org/festival/ • General system for multi-lingual TTS • C/C++ code with Scheme scripting language • General replaceable modules: Lexicons, LTS, duration, intonation, phrasing, POS tagging, tokenizing, diphone/unit selection, signal processing • General tools Intonation analysis (f0, Tilt), signal processing, CART building, N-gram, SCFG, WFST 1/5/07 Text from Richard Sproat Festival as software • http://festvox.org/festival/ • No fixed theories • New languages without new C++ code • Multiplatform (Unix/Windows) • Full sources in distribution • Free software 1/5/07 Text from Richard Sproat CMU FestVox project • Festival is an engine, how do you make voices? • Festvox: building synthetic voices: Tools, scripts, documentation Discussion and examples for building voices Example voice databases Step by step walkthroughs of processes • Support for English and other languages • Support for different waveform synthesis methods Diphone Unit selection Limited domain 1/5/07 Text from Richard Sproat Synthesis tools • I want my computer to talk Festival Speech Synthesis • I want my computer to talk in my voice FestVox Project • I want it to be fast and efficient Flite 1/5/07 Text from Richard Sproat Using Festival • How to get Festival to talk • Scheme (Festival’s scripting language) • Basic Festival commands 1/5/07 Text from Richard Sproat Getting it to talk • Say a file festival --tts file.txt • From Emacs say region, say buffer • Command line interprete r festival> (SayText “hello”) 1/5/07 Text from Richard Sproat Scheme: the scripting lg • Advantages of a scripting lg...
View Full Document

This note was uploaded on 04/21/2011 for the course CS 224 taught by Professor De during the Spring '11 term at Kentucky.

Page1 / 63

224s.09.lec4 - CS 224S LINGUIST 281 Speech Recognition Synthesis and Dialogue Dan Jurafsky Lecture 4 Intro to Festival rest of Text Normalization

This preview shows document pages 1 - 11. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online