SpeechProcessingFinal

SpeechProcessingFinal - Problem Statement The study of...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Problem Statement The study of speech processing and recognition has led to many useful applications due to the natural feel of speech communication. We, as people, are trained from the beginning of our lives to communicate with speech. As technology advances it is no surprise that speech will be used as a means of simplifying our lives in various applications. Imagine you were to leave your house in a hurry because you were running late to work, had to pick up your kids from school or for any other reason and you forgot to turn off the stove. You remember three blocks away and you do not have the time to turn around. You then dial a phone number that is assigned to your house and ask your house to turn the stove off for you. Your house confirms the stove will be turned off and you are now relived you won’t come home to a potential fire. This is the essence of the Phoning Home system. Phoning Home allows you to make a phone call to your house and control your different devices/appliances. This could be useful for controlling the temperature of your air conditioner, turning on and off lights, open and close your garage door, etc. Therefore, phoning home is not only a convenient way to control your household appliances but can also be used to manage your energy use more efficiently. Figure 1: Phoning Home System
Background image of page 2
How it works Phoning Home is broken into four main components. The Voice over IP (VOIP) server which forwards the phone speech and uses text to speech (TTS) to speak to the user. The Phoning Home web services which handles the communication for all Phoning Home households. The speech recognition server which is responsible for recognizing the user’s phone speech. The client services which handle an individual households devices. Figure 2: System Overview
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
System Architecture Each device that can be controlled via the Phoning Home system is known as an A Wireless Appliance Reducing Energy (AWARE). Every device is wirelessly controlled via the Phoning Home Master Control which is ultimately an Atmel Atmega16 microcontroller connected via USB to the Client Services. Figure 3: Phoning Home Concept
Background image of page 4
The communication between each component is described in more detail in the following sequence diagram: Figure 4: Phoning Home System Sequence Diagram
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Speech Recognition Engine The Carnegie Melon University (CMU) Sphinx speech recognition engine is being used by the Phoning Home system. This is an open source engine that is highly configurable. The Sphinx-4 framework consists of three primary modules: the FrontEnd, the Decoder, and the Linguist. The FrontEnd takes input signals and parameterizes them into a sequence of features. The Linguist translates any type of standard language model along with information from the Dictionary and structural information from one or more sets of Acoustic models into a SearchGraph. The Decoder uses the features from the FrontEnd, and the SearchGraph from the Linguist to perform the actual decoding and produce the Results. Figure 5 - Sphinx4 framework. The main blocks are the frontend, the decoder and the linguist.
Background image of page 6
Image of page 7
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 02/11/2012 for the course ECE 5525 taught by Professor Staff during the Fall '10 term at FIT.

Page1 / 15

SpeechProcessingFinal - Problem Statement The study of...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online