final_speech_paper_1_2008

final_speech_paper_1_2008 - Foundations and Trends R in...

Info iconThis preview shows pages 1–4. Sign up to view the full content.

View Full Document Right Arrow Icon
Foundations and Trends R ± in Signal Processing Vol. 1, Nos. 1–2 (2007) 1–194 c ± 2007 L. R. Rabiner and R. W. Schafer DOI: 10.1561/2000000001 Introduction to Digital Speech Processing Lawrence R. Rabiner 1 and Ronald W. Schafer 2 1 Rutgers University and University of California, Santa Barbara, USA, rabiner@ece.ucsb.edu 2 Hewlett-Packard Laboratories, Palo Alto, CA, USA Abstract Since even before the time of Alexander Graham Bell’s revolution- ary invention, engineers and scientists have studied the phenomenon of speech communication with an eye on creating more efficient and eFective systems of human-to-human and human-to-machine communi- cation. Starting in the 1960s, digital signal processing (DSP), assumed a central role in speech studies, and today DSP is the key to realizing the fruits of the knowledge that has been gained through decades of research. Concomitant advances in integrated circuit technology and computer architecture have aligned to create a technological environ- ment with virtually limitless opportunities for innovation in speech communication applications. In this text, we highlight the central role of DSP techniques in modern speech communication research and appli- cations. We present a comprehensive overview of digital speech process- ing that ranges from the basic nature of the speech signal, through a variety of methods of representing speech in digital form, to applica- tions in voice communication and automatic synthesis and recognition of speech. The breadth of this subject does not allow us to discuss any
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
aspect of speech processing to great depth; hence our goal is to pro- vide a useful introduction to the wide range of important concepts that comprise the Feld of digital speech processing. A more comprehensive treatment will appear in the forthcoming book, Theory and Application of Digital Speech Processing [101].
Background image of page 2
1 Introduction The fundamental purpose of speech is communication, i.e., the trans- mission of messages. According to Shannon’s information theory [116], a message represented as a sequence of discrete symbols can be quanti- Fed by its information content in bits, and the rate of transmission of information is measured in bits/second (bps). In speech production, as well as in many human-engineered electronic communication systems, the information to be transmitted is encoded in the form of a contin- uously varying (analog) waveform that can be transmitted, recorded, manipulated, and ultimately decoded by a human listener. In the case of speech, the fundamental analog form of the message is an acous- tic waveform, which we call the speech signal . Speech signals, as illus- trated in ±igure 1.1, can be converted to an electrical waveform by a microphone, further manipulated by both analog and digital signal processing, and then converted back to acoustic form by a loudspeaker, a telephone handset or headphone, as desired. This form of speech pro- cessing is, of course, the basis for Bell’s telephone invention as well as today’s multitude of devices for recording, transmitting, and manip- ulating speech and audio signals. Although Bell made his invention without knowing the fundamentals of information theory, these ideas 3
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 4
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 12/29/2011 for the course ECE 259 taught by Professor Rabiner,l during the Fall '08 term at UCSB.

Page1 / 194

final_speech_paper_1_2008 - Foundations and Trends R in...

This preview shows document pages 1 - 4. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online