Fundamental and Technological Limitations of Immersive Audio Systems

CHRIS KYRIAKAKIS, MEMBER, IEEE

Numerous applications are currently envisioned for immersive audio systems. The principal function of such systems is to synthesize, manipulate, and render sound fields in real time. In this paper, we examine several fundamental and technological limitations that impede the development of seamless immersive audio systems. Such limitations stem from signal-processing requirements, acoustical considerations, human listening characteristics, and listener movement. We present a brief historical overview to outline the development of immersive audio technologies and discuss the performance and future research directions of immersive audio systems with respect to such limits. Last, we present a novel desktop audio system with integrated listener-tracking capability that circumvents several of the technological limitations faced by today's digital audio workstations.

Keywords: Acoustic signal processing, audio systems, auditory system, multimedia systems, signal processing.

I. INTRODUCTION

Emerging integrated media systems seamlessly combine digital video, digital audio, computer animation, text, and graphics into common displays that allow for mixed media creation, dissemination, and interactive access in real time. Immersive audio and video environments based on such systems can be envisioned for applications that include teleconferencing and telepresence; augmented and virtual reality for manufacturing and entertainment; air-traffic control, pilot warning, and guidance systems; displays for the visually or aurally impaired; home entertainment; distance learning; and professional sound and picture editing for television and film. The principal function of immersive systems is to synthesize multimodal perceptions that do not exist in the current physical environment, thus immersing users in a seamless blend of visual and aural information.
(Manuscript received September 8, 1997; revised December 4, 1997. The Guest Editor coordinating the review of this paper and approving it for publication was T. Chen. The author is with the Integrated Media Systems Center, University of Southern California, Los Angeles, CA 90089-2564 USA (e-mail: ckyriak@imsc.usc.edu). Publisher Item Identifier S 0018-9219(98)03283-6.)

Significant resources have been allocated over the past 20 years to promote research in the area of image and video processing, resulting in important advances in these fields. On the other hand, audio signal processing, and particularly immersive audio, have been largely neglected.

Accurate spatial reproduction of sound can significantly enhance the visualization of three-dimensional (3-D) information for applications in which it is important to achieve sound localization relative to visual images. The human ear-brain interface is uniquely capable of localizing and identifying sounds in a 3-D environment with remarkable accuracy. For example, human listeners can detect time-of-arrival differences of about 7 µs. Sound perception is based on a multiplicity of cues that include level and time differences and direction-dependent frequency-response effects caused by sound reflection in the outer ear, head, and torso, cumulatively referred to as the head-related transfer function (HRTF). In addition to such directional cues, human listeners use a multiplicity of other cues in the perception of timbre, frequency response, and dynamic range. Furthermore, there are numerous subjective sound qualities that vary from listener to listener but are equally important in achieving the "suspension of disbelief" desired in an immersive audio system. These include attributes such as the apparent source width, listener envelopment, clarity, and warmth [1], [2]. Vision also plays an important role in localization and can overwhelm the aural impression.
In fact, a mismatch between the aurally perceived and visually observed positions of a particular sound causes a cognitive dissonance that can seriously limit the visualization enhancement provided by immersive sound. The amount of mismatch required to cause such a dissonance is subjective and can vary in both the level of perception and annoyance. For professional sound designers, a mere 4° offset in the horizontal plane between the visual and aural image is perceptible, whereas it takes a 15° offset before the average layperson will notice [3].

In this paper, we discuss several issues that pertain to immersive audio system requirements that arise from fundamental physical limitations as well as current technological drawbacks. We will address these issues from three complementary perspectives: identification of fundamental physical limitations that affect the performance of immersive audio systems, evaluation of the current status of immersive audio system development with respect to such fundamental limits, and delineation of technological considerations that affect present and future system design and development. In the final sections, we will present a novel sound-reproduction system that addresses several of the technological limitations that currently affect the quality of audio at the desktop. This system incorporates a video-based tracking method that allows real-time processing of the audio signal in response to listener movement.

0018-9219/98$10.00 © 1998 IEEE. PROCEEDINGS OF THE IEEE, VOL. 86, NO. 5, MAY 1998.

II. THE NATURE OF LIMITATIONS IN IMMERSIVE AUDIO SYSTEMS

There are two classes of limitations that impede the implementation of immersive audio systems. The first class encompasses fundamental limitations that arise from physical laws, and its understanding is essential for determining the feasibility of a particular technology with respect to the absolute physical limits. Many such fundamental limitations are not directly dependent on the choice of systems but instead pertain to the actual process of sound propagation and attenuation in irregularly shaped rooms. The physical properties of the acoustic environment are encoded in the sound field and must be decoded by an immersive audio system in order to accurately simulate the original environment. The influence of the local acoustic environment is reflected in the perception of spatial attributes such as direction and distance, as well as in the perception of room spaciousness and source size [1], [2]. The situation is further complicated by the fact that the decoding process must include the transformations associated with human hearing. These include the conversion of spatial sound cues into level and time differences and direction-dependent frequency-response effects caused by the pinna, head, and torso through a set of amplitude and phase transformations known as the HRTFs. The seamless incorporation of such cues in immersive audio systems is a very active area of research that, if successful, will give rise to systems that begin to approach performance near the fundamental limits.

The second class of limitations consists of constraints that arise purely from technological considerations. These are equally useful in understanding the potential applications of a given system and are imposed by the particular technology chosen for system implementation. For example, the process of encoding parameters associated with room acoustics into sound fields can be modeled using numerical methods. In theory, this would involve the solution of the wave equation for sound subject to the boundary conditions dictated by the complex (absorptive, reflective, and diffractive) room surfaces. The computational complexity of this problem is very high and involves the calculation of an estimated 10^9 normal modes that fall within the range of human hearing (20 Hz–20 kHz) for a large hall [4]. More recent methods have been developed for rendering sound fields through a process called auralization. Such methods utilize a combination of scaled models, digital filtering, and special-purpose hardware for real-time convolution to predict and render the sound field [5]. As the processing power of digital signal processing (DSP) hardware increases, the capability of auralization systems to render complex sound fields will increase proportionally.

III. BRIEF HISTORICAL OVERVIEW

A. Two-Channel Stereo

Although many of the principles of stereophonic sound were developed through research efforts in the early 1930s, there still remains a misconception as to the meaning of the word "stereo" itself. While it is generally associated with sound reproduction from two loudspeakers, the word originates from the Greek stereos, meaning solid or three-dimensional. The two-channel association came about in the 1950s because of technological limitations imposed by the phonograph record, which had only two groove walls for encoding information.

Stereophony started with the work of Blumlein [6] in the United Kingdom, who recognized early on that it was possible to locate a sound within a range of azimuth angles by using an appropriate combination of delay and level differences. His work focused on accurate reproduction of the sound field at each ear of the listener and on the development of microphone techniques that would allow the recording of the amplitude and phase differences necessary for stereo reproduction.

Fletcher, Steinberg, and Snow at Bell Laboratories in the United States [7]–[9] took a different approach. They considered a "wall of sound" in which an infinite number of microphones is used to reproduce a sound field through an infinite number of loudspeakers, similar to the Huygens principle of secondary wavelets. While this made for an interesting theoretical result, the Bell Labs researchers realized that practical implementations would require a significantly smaller number of channels. They showed that a three-channel system consisting of left, center, and right channels in the azimuth plane could represent the lateralization and depth of the desired sound field with acceptable accuracy [Fig. 1(a)]. The first such stereophonic three-channel system was demonstrated in 1934 with the Philadelphia Orchestra performing remotely for an audience in Washington, DC, over wide-band telephone lines.

Fig. 1. (a) Stereo was originally invented based on three loudspeakers. Sound images in the center are rendered by a real loudspeaker that delivers the same direct sound to both ears. (b) In the two-loudspeaker stereo configuration with a phantom center, the cross-talk terms give rise to a less stable center image as well as a loss of clarity.

B. Four-Channel Matrixed Quadraphonic System

While stereophonic methods can be a powerful tool in the reproduction of the spatial attributes of a sound field, they fall short of true three-dimensional reproduction. The quadraphonic system attempted to circumvent such limitations by capturing and transmitting information about the direct sound and the reverberant sound field [10], [11]. To deliver the four channels required by quadraphonic recordings over a two-channel medium (e.g., the phonograph record), it was necessary to develop an appropriate encoding and decoding scheme. Several such schemes were proposed based on 4 : 2 : 4 matrix encoding/decoding that relied on phase manipulation of the original stereo signals [12]. Quadraphonic systems were capable of reproducing sound images fairly accurately in the front and rear sectors of the azimuthal plane, but they exhibited serious limitations when attempting to reproduce sound images to the side of the listener. Experiments showed [13], [14] that this was a limitation associated as much with sound-field synthesis using only four channels as with human psychoacoustic mechanisms. These technical limitations and the presence of two competing formats in the consumer marketplace contributed to the early demise of quadraphonic systems.

C. Multichannel Surround Sound

In the early 1950s, the first multichannel sound format was developed by 20th Century Fox. The combination of wide-screen formats such as CinemaScope (35 mm) and six-track Todd-AO (70 mm) with multichannel sound was the film industry's response to the growing threat of television. Stereophonic film sound was typically reproduced over three front loudspeakers, but these new formats included an additional monophonic channel that was reproduced over two loudspeakers behind the audience and was known as the effects channel. This channel increased the sense of space for the audience, but it also suffered from a serious technological limitation. Listeners seated on-center with respect to the rear loudspeakers perceived inside-the-head localization similar to the effect of stereo images reproduced over headphones. Listeners seated off-center localized the channel to the effects loudspeaker that was closest to them, as dictated by the law of the first-arriving wavefront, thus destroying the sense of envelopment desired [14] [Fig. 2(a)]. The solution to these problems was found by introducing a second channel reproduced over an array of loudspeakers along the sides of the theater to create a more diffuse sound field [Fig. 2(b)].

In the mid-1970s, a new sound technology called Dolby Stereo was introduced by Dolby Laboratories. It was based on the optical technology that had been used for sound on film since the 1930s, and it circumvented the problems associated with magnetic multitrack recording.
Dolby developed a matrix method for encoding four channels (left, center, right, and mono surround) into two channels using a technique that was derived from the matrix methods used in quadraphonic systems but that also ensured mono and stereo backward compatibility. In 1992, further enhancements by Dolby were introduced through a new format called Dolby Stereo Digital (SR-D). This format eliminated matrix-based encoding and decoding and provided five discrete channels (left, center, right, and independent left and right surround) in a configuration known as stereo surround. A sixth, low-frequency-enhancement (LFE) channel was introduced to add more headroom and prevent the main speakers from overloading at low frequencies. The bandwidth of the LFE channel is limited to between 0 and 120 Hz, a frequency regime that is outside the localization range for human listeners in a reverberant room, thus simplifying the placement requirements for the subwoofer used for LFE reproduction (Fig. 3).

Fig. 2. (a) In early surround-sound systems with a mono surround channel, listeners seated off-center perceived the sound as if it originated from the effects loudspeaker that was closest to them, thus destroying the desired sense of envelopment. (b) Current systems use stereo surrounds reproduced over an array of loudspeakers along the sides of the theater to create a more diffuse sound field.

Fig. 3. Current commercial multichannel systems encode the LFE, three front, and two surround channels into a bit stream that is decoded at the user's end. With proper loudspeaker selection and placement, it is possible to simulate the experience of a movie theater. Dipole surround loudspeakers that do not radiate sound directly in the direction of the listener's ears produce the best envelopment.
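The idea of folding four channels into two and recovering them with a matrix, as described above, can be illustrated with a deliberately simplified sketch. The coefficients below (a 3-dB attenuation for center and surround, with the surround added in antiphase) are assumptions chosen for illustration; they are not taken from the paper, and real matrix systems additionally use phase-shift networks and active steering logic. All function names are hypothetical.

```python
import math

G = 1.0 / math.sqrt(2.0)  # assumed -3 dB gain for the shared channels


def encode_4_to_2(left, center, right, surround):
    """Fold one sample of L, C, R, S into a two-channel pair (Lt, Rt)."""
    lt = left + G * center + G * surround   # surround in phase on Lt
    rt = right + G * center - G * surround  # surround in antiphase on Rt
    return lt, rt


def passive_decode_2_to_4(lt, rt):
    """Recover L', C', R', S' with a fixed sum/difference matrix."""
    center = G * (lt + rt)    # in-phase content goes to the center
    surround = G * (lt - rt)  # antiphase content goes to the surround
    return lt, center, rt, surround


# A center-only input decodes to full center, no surround,
# but leaks into left and right at -3 dB (limited separation).
center_only = passive_decode_2_to_4(*encode_4_to_2(0.0, 1.0, 0.0, 0.0))

# A surround-only input decodes to full surround and no center.
surround_only = passive_decode_2_to_4(*encode_4_to_2(0.0, 0.0, 0.0, 1.0))

# Mono compatibility: summing Lt and Rt cancels the surround signal.
lt, rt = encode_4_to_2(0.0, 0.0, 0.0, 1.0)
mono_sum = lt + rt
```

The leakage of the center signal into the left and right outputs in this sketch is one way to see why a passive matrix cannot match the channel separation of a discrete format.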
Recent advances in digital audio compression and optical storage have made it possible to deliver up to six discrete audio channels in a consumer format centered around the Dolby AC-3 compression scheme (see footnote 1). With exciting new formats such as an audio-only digital video disc just around the corner, the number of channels could easily increase to ten or more. While there are several million consumer systems capable of reproducing more than two channels, the majority of users (particularly those with desktop computer systems) would find the use of multiple loudspeakers impractical. In the sections that follow, we examine the requirements of systems that allow delivery of multiple channels over two loudspeakers using DSP to simulate certain characteristics of human listening.

IV. SPATIAL (3-D) AUDIO

A. Physiological Signal Processing

The human hearing process is based on the analysis of input signals to the two ears for differences in intensity, time of arrival, and directional filtering by the outer ear. Several theories were proposed as early as 1882 [15] that identified two basic mechanisms as being responsible for source localization: 1) interaural time differences (ITDs) and 2) interaural level differences (ILDs). A later theory by Lord Rayleigh [16] was based on a combination of ITD and ILD cues that operated in different wavelength regimes. For short wavelengths (corresponding to frequencies in the range of about 4–20 kHz), the listener's head casts an acoustical shadow, giving rise to a lower sound level at the ear farthest from the sound source (ILD) [Fig. 4(b)]. At long wavelengths (corresponding to frequencies in the range of about 20 Hz–1 kHz), the head is very small compared to the wavelength, and localization is based on perceived differences in the time of arrival of sound at the two ears (ITD) [Fig. 4(a)]. The two mechanisms of interaural time and level differences formed the basis of what became known as the duplex theory of sound localization.
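The wavelength regimes of the duplex theory can be checked with a few lines of arithmetic. The sketch below assumes a speed of sound of 343 m/s and a spherical head of radius 8.75 cm; neither value appears in the text. The time-difference estimate uses Woodworth's spherical-head approximation, ITD = (a/c)(θ + sin θ), which is a standard textbook model brought in here for illustration rather than a formula from this paper.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed (air at about 20 degrees C)
HEAD_RADIUS = 0.0875    # m, assumed average head radius


def wavelength(freq_hz):
    """Acoustic wavelength in meters at the given frequency."""
    return SPEED_OF_SOUND / freq_hz


def woodworth_itd(azimuth_rad):
    """Interaural time difference (s) for a far-field source.

    Woodworth's rigid-sphere approximation, valid for azimuths
    between 0 (straight ahead) and pi/2 (directly to one side).
    """
    return HEAD_RADIUS / SPEED_OF_SOUND * (azimuth_rad + math.sin(azimuth_rad))


head_diameter = 2.0 * HEAD_RADIUS
for freq in (20.0, 1000.0, 4000.0, 20000.0):
    ratio = wavelength(freq) / head_diameter
    print(f"{freq:8.0f} Hz: wavelength {wavelength(freq):7.4f} m, "
          f"{ratio:6.2f} head diameters")

max_itd = woodworth_itd(math.pi / 2.0)
print(f"largest ITD under these assumptions: {max_itd * 1e6:.0f} microseconds")
```

Under these assumptions the wavelength at 1 kHz is roughly twice the head diameter, while at 4 kHz it is about half of it, which is consistent with time differences dominating at low frequencies and head shadowing dominating at high frequencies.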
In the frequency range between approximately 1 and 4 kHz, both of these mechanisms are active, which results in several conflicting cues that tend to cause localization errors. While time or intensity differences provide source direction information in the horizontal (azimuthal) plane, in the median plane, time differences are constant and localization is based on spectral filtering. The reflection and diffraction of sound waves from the head, torso, shoulders, and pinnae, combined with resonances caused by the ear canal, form the physical basis for the HRTF. The outer ear can be modeled (in the static case) as a linear time-invariant system that is fully characterized by the HRTF in the frequency domain. As Blauert [17] describes it, the role of the outer ear is to superimpose angle- and distance-specific linear distortions on the incident sound signal. Spatial information is thus encoded onto the signals received by the eardrums through a combination of direction-dependent and direction-independent filters [18], [19]. The magnitude and phase of these head-related transfer functions vary significantly for each sound direction but also from person to person.

Fig. 4. (a) In the low-frequency regime, sound is localized based on differences in the time of arrival at each ear. (b) At higher frequencies, the wavelength of sound is short relative to the size of the head, and localization is based on perceived level differences caused by head shadowing. In the intermediate-frequency region, both mechanisms are active, and this can give rise to conflicting cues.

Footnote 1: See Dolby Laboratories at http://www.dolby.com.

The emerging field of 3-D audio is based on digital implementations of such HRTFs. In principle, it is possible to achieve excellent reproduction of three-dimensional sound fields using such methods; however, it has been demonstrated that this requires precise measurement of each listener's individual HRTFs [20].
This seemingly fundamental requirement, which derives from inherent physiological and cognitive characteristics of the human ear-brain interface, has rendered such systems impractical for widespread use. Current research in this area is focused on achieving good localization performance while using synthetic (nonindividualized) HRTFs derived through averaging or modeling or based on the HRTFs of subjects that have been determined to be "good localizers" [21], [22]. In his review of the challenges in 3-D audio implementations, Begault [23] points out that there are currently three major barriers to successful implementation of such systems: 1) psychoacoustic errors such as front-back reversals typical in headphone-based systems, 2) large amounts of data required to represent measured HRTFs accurately, and 3) frequency- and phase-response errors that arise from mismatches between nonindividualized and measured HRTFs. It should be noted that front-back reversals can be reduced if the listener is allowed to move his head and that the lack of externalization experienced with headphone listening can be alleviated with appropriate use of reverberation.

A fourth challenge arises from technological limitations of current computing systems. One capability that we envision for immersive audio systems is the simulation of room acoustics and listener characteristics for interactive, virtual-, and augmented-reality applications. In addition to the computational requirements for photorealistic rendering of visual images, the synthesis of such acoustical environments requires computation of the binaural room response and subsequent convolution with the HRTFs of the listener in real time as the listener moves around the room. Typical impulse response duration is 3 s, which, when sampled at 48 kHz, requires a processor capable of operating at more than 13 Gflops/channel [24].
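The processing figure quoted above follows from simple arithmetic if direct (time-domain) convolution is assumed and each filter tap is counted as one multiplication plus one addition per output sample; that counting convention is an assumption, not something stated in the text. The sketch below reproduces the number and, for comparison, checks that FFT-based fast convolution (a standard technique, written here with NumPy and a hypothetical helper name) gives the same result as direct convolution on a short random test signal.

```python
import numpy as np


def direct_convolution_flops(duration_s, sample_rate_hz):
    """Operations per second for direct FIR filtering of one channel.

    Assumes one multiply and one add per tap per output sample.
    """
    taps = int(round(duration_s * sample_rate_hz))
    return 2 * taps * sample_rate_hz


flops = direct_convolution_flops(3.0, 48_000)
print(f"{flops / 1e9:.2f} Gflops per channel")


def fft_convolve(x, h):
    """Linear convolution of x and h computed in the frequency domain."""
    n = len(x) + len(h) - 1
    nfft = 1 << (n - 1).bit_length()  # next power of two >= n
    spectrum = np.fft.rfft(x, nfft) * np.fft.rfft(h, nfft)
    return np.fft.irfft(spectrum, nfft)[:n]


# Short random signals stand in for audio and for a room response.
rng = np.random.default_rng(0)
x = rng.standard_normal(2000)
h = rng.standard_normal(300)
results_match = np.allclose(fft_convolve(x, h), np.convolve(x, h))
```

A 3-s response at 48 kHz is 144 000 taps, so the direct form needs 2 × 144 000 × 48 000 operations per second, which is where the figure of more than 13 Gflops per channel comes from under this counting.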
This problem can be circumvented using special-purpose hardware or hybrid block fast Fourier transform/direct convolution methods [25] (see footnote 2). The main goal is to reduce the number of operations for such computations, thus making them suitable for real-time interactive applications.

B. Spatial Audio Rendering

A critical issue in the implementation of immersive audio is the reproduction of 3-D sound fields that preserve the desired spatial location, frequency response, and dynamic range. There are two general methods for 3-D audio rendering that can be categorized as "head related," based on headphone reproduction, and "non-head related," based on loudspeaker reproduction [19]. A hybrid category, called transaural stereo, also exists that allows loudspeaker rendering of head-related signals. It should be noted that there are other methods for three-dimensional sound-field capture and synthesis such as ambisonics [26] and wave-field synthesis [27], [28], but these will not be examined in this paper.

Non-head-related methods typically use multiple loudspeakers to reproduce multiple matrixed or discrete channels. Such systems can convey precisely localized sound images that are primarily confined to the horizontal plane and diffuse (ambient) sound to the sides of and behind the listener. In addition to the left and right loudspeakers, they make use of a center loudspeaker that helps create a solidly anchored center-stage sound image, as well as two loudspeakers for the ambient surround sound field. The most prevalent systems currently available to consumers are based on the formats developed by Dolby for film sound, including Pro Logic (four-channel matrixed encoded on two channels) and Dolby Digital (5.1-channel discrete based on the AC-3 compression scheme; see footnote 1). Other 5.1-channel schemes include DTS Digital Surround (see footnote 3) and MPEG-2.
Multichannel systems were designed primarily for authentic reproduction of sound associated with movies but have recently started to be used for music recordings and games on CD-ROM. The 5.1-channel Dolby Digital system was adopted in the U.S. standard of the upcoming advanced (high-definition) television system [29]. The design requirements for such loudspeaker-based systems include uniform audience coverage, accurate localization relative to visual images on the screen, diffuse rendering of ambient sounds, and capability for reproduction of the wide (up to 105 dB) dynamic range present in film soundtracks.

Footnote 2: See Lake DSP at http://www.lakedsp.com.
Footnote 3: See http://www.dtstech.com/.

Head-related binaural recording, or dummy-head stereophony, methods attempt to accurately reproduce at each eardrum of the listener the sound pressure generated by a set of sources and their interactions with the acoustic environment [30]. Such recordings can be made with specially designed probe microphones that are inserted in the listener's ear canal or by using a dummy-head microphone system that is based on average human characteristics. Sound recorded using binaural methods is then reproduced through headphones that deliver the desired sound to each ear. It was concluded from early experiments that in order to achieve the desired degree of realism using binaural methods, the required frequency-response accuracy of the transfer function was 1 dB [31]. Other related work [32] compared direct listening and binaural recordings for the same subject and concluded that directional hearing was accurately preserved using binaural recording.

While there are several commercially available dummy-head systems, binaural recordings are not widely used, primarily due to limitations that are associated with headphone listening [20], [33], [34]. These drawbacks can be summarized as follows.

1) Individualized HRTF information does not exist for each listener, and the averaged HRTFs that are used make it impossible to match each individual's perception of sound.
2) There are large errors in sound position perception associated with headphones, especially for the most important visual direction, out in front.
3) Headphones are uncomfortable for extended periods of time.
4) It is very difficult to externalize sounds and avoid the "inside-the-head" sensation.

In many applications, however, such as in aircraft cockpits or multiuser environments, the use of headphones is required for practical reasons.

The use of loudspeakers for reproduction can circumvent the limitations associated with headphone reproduction of binaural recordings. To deliver the appropriate binaural sound field to each ear, however, it is necessary to eliminate the cross talk that is inherent in all loudspeaker-based systems. This is a technological limitation of all loudspeaker systems, and it arises from the fact that while each ear receives the desired sound from the same-side (ipsilateral) loudspeaker, it also receives undesired sound from the opposite-side (contralateral) loudspeaker.

Several schemes have been proposed to address cross-talk cancellation. The basic principle of such schemes relies on preconditioning the signal into each loudspeaker such that the output sound generates the desired binaural sound pressure at each ear. If we denote the sound pressures that must be delivered to each ear as $p_L^{\text{ear}}$ and $p_R^{\text{ear}}$ and the transfer functions from each loudspeaker to each ear as $H_{LL}$, $H_{LR}$, $H_{RL}$, and $H_{RR}$ (first subscript: loudspeaker; second subscript: ear), then we can write (Fig. 5)

$$
\begin{bmatrix} p_L^{\text{ear}} \\ p_R^{\text{ear}} \end{bmatrix}
=
\begin{bmatrix} H_{LL} & H_{RL} \\ H_{LR} & H_{RR} \end{bmatrix}
\begin{bmatrix} p_L^{\text{speaker}} \\ p_R^{\text{speaker}} \end{bmatrix}
\tag{1}
$$

in which we denote by $p_L^{\text{speaker}}$ and $p_R^{\text{speaker}}$ the sound pressure delivered by each loudspeaker. To accurately reproduce the desired binaural signal at each ear, the input signals to the loudspeakers must be chosen such that

$$
\begin{bmatrix} p_L^{\text{speaker}} \\ p_R^{\text{speaker}} \end{bmatrix}
=
\begin{bmatrix} H_{LL} & H_{RL} \\ H_{LR} & H_{RR} \end{bmatrix}^{-1}
\begin{bmatrix} p_L^{\text{ear}} \\ p_R^{\text{ear}} \end{bmatrix}
\tag{2}
$$

The desired loudspeaker input signals are then found from

$$
p_L^{\text{speaker}} = \frac{H_{RR}\, p_L^{\text{ear}} - H_{RL}\, p_R^{\text{ear}}}{H_{LL} H_{RR} - H_{RL} H_{LR}},
\qquad
p_R^{\text{speaker}} = \frac{H_{LL}\, p_R^{\text{ear}} - H_{LR}\, p_L^{\text{ear}}}{H_{LL} H_{RR} - H_{RL} H_{LR}}
\tag{3}
$$

The only requirement is that the resulting filter responses must be realizable.

Fig. 5. Transfer functions associated with a loudspeaker sound-rendering system. To deliver the correct binaural sound, it is necessary to prefilter the signal to the loudspeakers so that the cross-talk terms $H_{LR}$ and $H_{RL}$ are cancelled during reproduction.

The first such cross-talk cancellation scheme was proposed by Bauer [35], later by Atal and Schroeder [4], and by Damaske and Mellert [36], [37] using a system called True Reproduction of All Directional Information by Stereophony (TRADIS). The main limitation of these early systems was the fact that any listener movement that exceeded 75–100 mm completely destroyed the spatial effect. Cooper and Bauck [38], [39] showed that under the assumption of left-right symmetry, a much simpler shuffler filter can be used to implement cross-talk cancellation as well as synthesize virtual loudspeakers in arbitrary positions. They went on to use results from Mehrgard and Mellert [40], who showed that the head-related transfer function is minimum phase to within a frequency-independent delay that is a function of the angle of incidence. This new transaural system significantly reduced the computational requirements by allowing implementations that use simple finite-duration impulse response filters.

The functionality and practical use of immersive audio systems based on such transaural cross-talk cancellation methods can be greatly enhanced by eliminating the requirement that the user remain stationary with a fixed head position. This increased functionality requires the capability to implement the requisite filters (and associated algorithms) in real time. A further requirement is precise information about the location of the listener's ears relative to the loudspeakers. This is achieved, with reasonable accuracy, in desktop-based audio systems in which the listener is seated at the keyboard at a fixed distance from the loudspeakers. Ultimately, the listener's head (and ear) location must be tracked in order to allow for head rotation and translation. Several issues related to both the desktop and the tracking implementations are discussed below.

V. IMMERSIVE AUDIO RENDERING FOR DESKTOP APPLICATIONS

For desktop applications, in addition to the user-imposed limitation of (typically) two loudspeakers, there exists an entirely different set of design requirements specific to applications such as professional sound editing for film and television, teleconferencing and telepresence, augmented and virtual reality, and home personal-computer (PC) entertainment. Such applications require high-quality audio for a single listener in a desktop environment. Issues that must be addressed include the optimization of the frequency response over a given frequency range, the dynamic range, and stereo imaging subject to constraints imposed by room acoustics and human listening characteristics. Several problems are particular to the desktop environment, including frequency-response anomalies that arise due to the local acoustical environment, the proximity of the listener to the loudspeakers, and the acoustics associated with small rooms.

A. Acoustical Limitations

In a typical desktop sound-monitoring environment, delivery of stereophonic sound is achieved through two loudspeakers that are placed on either side of a video or computer monitor. This environment, combined with the acoustical problems of small rooms, causes severe problems that contribute to audible distortion of the reproduced sound. Among these problems, one of the most important is the effect of discrete early reflections [41]–[43]. It has been shown [43] that these reflections are the dominant source of monitoring nonuniformities. These nonuniformities appear in the form of frequency-response anomalies in rooms where the difference between the direct and reflected sound level for the first 15 ms is less than 15 dB [44], [45] (Fig. 6). High levels of reflected sound cause comb filtering in the frequency domain, which in turn gives rise to severe changes in timbre. The perceived effects of such distortions were quantified with psychoacoustic experiments [41], [46] that demonstrated their importance.

Fig. 6. The time-domain response of a loudspeaker system includes the direct sound as well as the sound due to multiple reflections from the local acoustical environment. Psychoacoustic evidence indicates that in order for these reflections not to be perceived, their spectrum level should be 15 dB below the level of the direct sound.

Fig. 7. Frequency-response problems that arise in the low frequencies due to standing-wave buildup in small rooms and in higher frequencies due to interactions with elements in the local acoustical environment (e.g., CRT screen, table top, untreated walls).

A solution that has been proposed to alleviate the problems of early reflections is near-field monitoring. In theory, the direct sound is dominant when the listener is very close to the loudspeakers, thus reducing the room effects to below audibility. In practice, however, there are several issues that must be addressed in order to provide high-quality sound [47]. One such issue relates to the large reflecting surfaces that are typically present near the loudspeakers. Strong reflections from a console or a video/computer monitor act as baffle extensions for the loudspeaker, resulting in a boost of midbass frequencies. Furthermore, even if it were possible to place the loudspeakers far away from large reflecting surfaces, this would only solve the problem for middle and high frequencies. Low-frequency room modes do not depend on surfaces in the local acoustical environment but rather on the physical size of the room. These modes produce standing waves that give rise to large variations in frequency response (Fig. 7).
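To see why the cross-talk cancellation of Section IV-B is sensitive to response anomalies of this kind, the following sketch builds a canceller for a hypothetical loudspeaker-to-ear transfer matrix at a single frequency and then applies it to a room whose paths differ from the design assumption. All numerical values are invented for illustration, and the convention that rows index ears and columns index loudspeakers is a choice made for this sketch.

```python
import numpy as np

# Hypothetical complex gains at a single frequency (illustrative values only).
# H[i, j] is the path from loudspeaker j to ear i; index 0 = left, 1 = right.
cross = 0.5 * np.exp(-1j * 0.8)  # contralateral path: weaker and delayed
H_design = np.array([[1.0, cross],
                     [cross, 1.0]])

# Cross-talk canceller: the inverse of the assumed acoustic transfer matrix.
canceller = np.linalg.inv(H_design)

# Desired binaural signal: sound at the left ear only.
desired_ears = np.array([1.0, 0.0], dtype=complex)
speaker_inputs = canceller @ desired_ears

# Case 1: the acoustic paths match the design assumption exactly.
ears_ideal = H_design @ speaker_inputs

# Case 2: reflections perturb every path (hypothetical error terms).
reflections = np.array([[0.20 * np.exp(-1j * 2.0), 0.10 * np.exp(-1j * 1.0)],
                        [0.15 * np.exp(-1j * 2.5), 0.20 * np.exp(-1j * 0.5)]])
ears_room = (H_design + reflections) @ speaker_inputs

# Level of the unwanted right-ear signal relative to the left-ear signal.
leak_db = 20.0 * np.log10(abs(ears_room[1]) / abs(ears_room[0]))
print(f"residual cross talk with reflections: {leak_db:.1f} dB")
```

With matched paths the right ear receives essentially nothing, whereas the perturbed paths leave a clearly audible residual at the right ear; a practical system would repeat this inversion at every frequency and would need filters that remain realizable.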
Such amplitude and phase distortions can completely destroy carefully designed 3-D audio reproduction that relies on the transaural techniques described above.

KYRIAKAKIS: LIMITATIONS OF IMMERSIVE AUDIO SYSTEMS

B. Design Requirements

To circumvent these limitations, a set of solutions has been developed for single-listener desktop reproduction that delivers sound quality equivalent to a calibrated dubbing stage [43]. These solutions include direct-path dominant design and correct low-frequency response. Based on our current understanding of psychoacoustic principles, it is possible to combine such cues to place the listener in a direct sound field that is dominant over the reflected and reverberant sound. The design considerations for this direct-path dominant design include compensation of the physical (reflection and diffraction) effects of the video/computer monitor that extends the loudspeaker baffle as well as the large reflecting surface on which the computer keyboard typically rests. The distortions that arise from amplitude and phase anomalies are eliminated, and this results in a listening experience that is dramatically different than what is achievable through traditional near-field monitoring methods (Fig. 8).

Fig. 8. A properly designed direct-path dominant system that compensates for frequency anomalies produces a much flatter frequency response. Frequencies below 100 Hz are reproduced with a separate subwoofer (response not shown) that is placed at a known distance from the listener to alleviate anomalies from standing waves.

Standing waves associated with the acoustics of small rooms give rise to fundamental limitations in the quality of reproduced sound, particularly in the uniformity of low-frequency response. Variations in this frequency regime can be as large as 15 dB for different listening locations in a typical room.
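To make the dependence of standing waves on room size concrete, the following sketch lists the axial mode frequencies n·c/(2L) of a rectangular room. The room dimensions, the 200 Hz cutoff, and the 343 m/s speed of sound are hypothetical values chosen for illustration; tangential and oblique modes are left out to keep the example short.

```python
def axial_mode_frequencies(dimension_m, max_hz, speed_of_sound=343.0):
    """Axial standing-wave frequencies n*c/(2*L) of one room dimension, up to max_hz."""
    fundamental = speed_of_sound / (2.0 * dimension_m)
    modes = []
    n = 1
    while n * fundamental <= max_hz:
        modes.append(n * fundamental)
        n += 1
    return modes


if __name__ == "__main__":
    # Hypothetical small-room dimensions in metres; not taken from the paper.
    room = {"length": 4.0, "width": 3.0, "height": 2.5}
    for name, size in room.items():
        freqs = axial_mode_frequencies(size, max_hz=200.0)
        print(name, [round(f, 1) for f in freqs])
```

For the assumed 4 m dimension the axial modes fall at about 43, 86, 129, and 172 Hz, so only a handful of widely spaced resonances shape the response in this range.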
The advantage of immersive audio rendering on desktop systems lies in the fact that the positions of the loudspeakers and the listener are known a priori. It is therefore possible to use signal processing (equalization) to correct the low-frequency response. This smooth response, however, can only be achieved for a relatively small region around the listener. To correct over a larger region and compensate for listener movement, it is necessary to track the listener's position and use adaptive signal-processing methods that allow real-time correction of spatial as well as frequency-response attributes.

C. Listener-Location Considerations

In large rooms, multichannel sound systems are used to convey sound images that are primarily confined to the horizontal plane and are uniformly distributed over the audience area. Typical systems used for cinema reproduction use three front channels (left, center, right), two surround channels (left and right surround), and a separate low-frequency channel. Such 5.1-channel systems (a term coined by Holman to represent five full-spectrum channels and a low-frequency-only channel) are designed to provide accurate sound localization relative to visual images in front of the listener and diffuse (ambient) sound to the sides and behind the listener. The use of a center loudspeaker helps create a solid sound image between the left and right loudspeakers and anchors the sound to the center of the stage. For desktop applications, in which a single user is located in front of a CRT display, we no longer have the luxury of a center loudspeaker because that position is occupied by the display. Size limitations prevent the front loudspeakers from being capable of reproducing the entire spectrum; thus, a separate subwoofer loudspeaker is used to reproduce the low frequencies.
Fig. 9. Desktop sound system with vision-based head tracking. In this early prototype, the time difference of arrival at the two ears is adjusted in real time as the listener moves in the plane parallel to the loudspeakers. Current research is focused on tracking, pose estimation (for head rotations), and pinna shape recognition for real-time cross-talk cancellation and individualized HRTF synthesis.

The two front loudspeakers can create a virtual (phantom) image that appears to originate from the exact center of the display, provided that the listener is seated symmetrically with respect to the loudspeakers. With proper head and loudspeaker placement, it is possible to recreate a spatially accurate sound field with the correct frequency response in one exact position, the sweet spot. Even in this static case, however, the sound originating from each loudspeaker arrives at each ear at different times (about 200 µs apart), thereby giving rise to acoustic cross talk [Fig. 1(b)]. These time differences, combined with reflection and diffraction effects caused by the head, lead to frequency-response anomalies that are perceived as a lack of clarity [48]. This problem can be solved by adding a cross-talk cancellation filter (as described above in the description of transaural methods) to the signal of each loudspeaker. While this solution may be satisfactory for the static case, as soon as the listener moves even slightly, the conditions for cancellation are no longer met, and the phantom image moves toward the closest loudspeaker because of the precedence effect. Therefore, to achieve the highest possible quality of sound for a nonstationary listener and preserve the spatial information in the original material, it is necessary to know the precise location of the listener relative to the loudspeakers [47], [49], [50].
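The cross-talk cancellation filter mentioned above can be sketched at a single frequency for the left-right symmetric case, where the shuffler structure applies: the sum and difference of the binaural signals are equalized separately and then recombined. This is a minimal sketch under stated assumptions, not the filters used in the systems cited here. The function names are hypothetical, and the acoustic paths are a toy model (unit ipsilateral path, contralateral path delayed by 200 µs with an assumed attenuation of 0.7) rather than measured HRTFs.

```python
import cmath
import math


def ears_from_speakers(speaker_left, speaker_right, h_ipsi, h_contra):
    """Ear signals at one frequency for a left-right symmetric acoustic path."""
    ear_left = h_ipsi * speaker_left + h_contra * speaker_right
    ear_right = h_contra * speaker_left + h_ipsi * speaker_right
    return ear_left, ear_right


def shuffler_canceller(binaural_left, binaural_right, h_ipsi, h_contra):
    """Speaker signals that deliver the binaural pair to the ears (one frequency).

    Sum and difference channels are equalised separately, which is equivalent
    to inverting the symmetric 2x2 matrix [[h_ipsi, h_contra], [h_contra, h_ipsi]].
    """
    sum_channel = (binaural_left + binaural_right) / (h_ipsi + h_contra)
    diff_channel = (binaural_left - binaural_right) / (h_ipsi - h_contra)
    return 0.5 * (sum_channel + diff_channel), 0.5 * (sum_channel - diff_channel)


if __name__ == "__main__":
    freq_hz = 1000.0
    # Assumed toy model: unit ipsilateral path, contralateral path delayed by
    # 200 microseconds and attenuated to 0.7 (the attenuation is hypothetical).
    h_ipsi = 1.0 + 0.0j
    h_contra = 0.7 * cmath.exp(-2j * math.pi * freq_hz * 200e-6)
    speakers = shuffler_canceller(1.0 + 0.0j, 0.0j, h_ipsi, h_contra)
    # The left ear should receive the full signal and the right ear almost nothing.
    print(ears_from_speakers(*speakers, h_ipsi, h_contra))
```

The division becomes ill-conditioned at frequencies where the ipsilateral and contralateral responses are nearly equal in magnitude, so a practical design would need some form of regularization, and the paths change as soon as the listener moves.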
In the section below, we describe an experimental system that incorporates a novel listener-tracking method in order to overcome the difficulties associated with two-ear listening as well as the technological limitations imposed by loudspeaker-based desktop audio systems.

VI. FUTURE RESEARCH DIRECTIONS

A. Vision-Based Methods for Listener Tracking

Computer vision has historically been considered problematic, particularly for tasks that require object recognition. Up to now, the complexity of vision-based approaches has prevented their being incorporated into desktop-based integrated media systems. Recently, however, von der Malsburg's Laboratory of Computational and Biological Vision at the University of Southern California (USC) has developed a vision architecture that is capable of recognizing the identity, spatial position (pose), facial expression, gestures, and movement of a human subject in real time. This highly versatile architecture integrates a broad variety of visual cues in order to identify the location of a person's head within the image. Object recognition is achieved through pattern-based analysis that identifies convex regions with skin color that are usually associated with the human face and through a stereo algorithm that determines the disparities among pixels that have been moving [51]. This pattern-recognition approach is based on the elastic graph matching method that places graph nodes at appropriate fiducial points of the pattern [52]. A set of features is extracted at each graph node corresponding to the amplitudes of complex Gabor wavelets. The key advantage of this method is that a new pattern (face or ear) can be recognized on the basis of a small number of example images (10–100).
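The feature extraction step described above can be illustrated with a small sketch that computes the magnitudes of complex Gabor responses at a single node, over a few scales and orientations. The kernel here is a plain Gaussian envelope times a complex sinusoid. The envelope width, window radius, wavelengths, orientations, and function names are all assumptions made for illustration, since the text does not give the parameters of the cited method, and the graph matching itself is not shown.

```python
import cmath
import math


def gabor_magnitude(image, row, col, wavelength, theta, sigma=4.0, radius=12):
    """Magnitude of a complex Gabor response centred on (row, col).

    image is a list of rows of floats. The kernel is a Gaussian envelope times
    a complex sinusoid; sigma and radius are assumed values, in pixels.
    """
    kx = 2.0 * math.pi / wavelength * math.cos(theta)
    ky = 2.0 * math.pi / wavelength * math.sin(theta)
    response = 0j
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            r, c = row + dy, col + dx
            if 0 <= r < len(image) and 0 <= c < len(image[0]):
                envelope = math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
                response += image[r][c] * envelope * cmath.exp(1j * (kx * dx + ky * dy))
    return abs(response)


def gabor_jet(image, row, col, wavelengths, orientations):
    """Feature vector of Gabor magnitudes at one node, over scales and orientations."""
    return [gabor_magnitude(image, row, col, w, t) for w in wavelengths for t in orientations]


if __name__ == "__main__":
    # Synthetic test image: vertical stripes with an 8-pixel period.
    image = [[math.cos(2.0 * math.pi * c / 8.0) for c in range(32)] for _ in range(32)]
    jet = gabor_jet(image, 16, 16, wavelengths=[8.0, 16.0], orientations=[0.0, math.pi / 2.0])
    print([round(v, 3) for v in jet])
```

On the striped test image, the kernel whose wavelength and orientation match the stripes responds far more strongly than the one rotated by 90 degrees, which is the property that makes such magnitudes useful as local features.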
For audio applications, in which the system must remember the last position of a listener that may have stopped moving, a hysteresis mechanism is used to estimate the current position and velocity of the head with a linear predictive filter. While there are several alternative methods for tracking humans (e.g., magnetic, ultrasound, infrared, laser), they typically are based on tethered operation or require artificial fiducial markings (e.g., colored dots, earrings) to be worn by the user. Furthermore, these methods do not offer any additional functionality to match what can be achieved with vision-based methods (e.g., face and expression recognition, ear classification).

B. Desktop Audio System with Head Tracking

A novel multichannel desktop audio system that meets all the design requirements and acoustical considerations described above has been developed by Holman of TMH Corporation (see footnote 4) in collaboration with the Immersive Audio Laboratory at USC's Integrated Media Systems Center (IMSC) (see footnote 5). This system uses two loudspeakers that are positioned on the sides of a video monitor at a distance of 45 cm from each other and 50 cm from the listener's ears (Fig. 9). The seating position height is adjusted so that the listener's ears are at the tweeter level of the loudspeakers (117 cm from the floor), thus eliminating any colorations in the sound due to off-axis lobing. We have also incorporated the vision-based tracking algorithm described above using a standard video camera connected to an SGI Indy workstation. This tracking system provides us with the coordinates of the center of the listener's head relative to the loudspeakers and is currently capable of operating at 10 frames/s with a 3% accuracy. In this single-camera system, it is possible to track listener movement that is confined to a plane parallel to the loudspeakers and at a fixed distance from them.
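Given this geometry, a tracked lateral head position can be converted into the arrival-time mismatch between the two loudspeakers, and from there into a delay for the nearer loudspeaker. The sketch below is one way to do that and is not the DSP code of the prototype. It assumes a 343 m/s speed of sound, treats the 50 cm figure as the distance from each loudspeaker to the head center at the sweet spot, and ignores the separation between the ears; the function and constant names are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0   # m/s, assumed
SPEAKER_SPACING = 0.45   # m, loudspeakers 45 cm apart (from the text)
SPEAKER_TO_HEAD = 0.50   # m, assumed to be speaker-to-head-centre distance at the sweet spot


def compensation_delays(head_x_m):
    """Delays (left, right) in seconds that re-align the two loudspeaker arrivals.

    head_x_m is the lateral offset of the head centre from the midline, positive
    toward the right loudspeaker, within the plane parallel to the loudspeakers.
    """
    half = SPEAKER_SPACING / 2.0
    depth = math.sqrt(SPEAKER_TO_HEAD ** 2 - half ** 2)  # distance of the listening plane
    dist_left = math.hypot(head_x_m + half, depth)
    dist_right = math.hypot(head_x_m - half, depth)
    difference_s = (dist_left - dist_right) / SPEED_OF_SOUND
    # Delay only the nearer loudspeaker so both arrivals line up again.
    if difference_s >= 0.0:
        return 0.0, difference_s
    return -difference_s, 0.0


if __name__ == "__main__":
    for offset in (-0.10, 0.0, 0.10):
        left, right = compensation_delays(offset)
        print(f"offset {offset:+.2f} m: left {left * 1e6:.0f} us, right {right * 1e6:.0f} us")
```

Under these assumptions, a 10 cm move toward the right loudspeaker calls for roughly 0.26 ms of delay on that loudspeaker, and the centered position needs none.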
When the listener is located at the exact center position (the sweet spot), sound from each loudspeaker arrives at the corresponding ear at the exact same time (i.e., with zero ipsilateral time delay). At any other position of the listener in this plane, there is a relative time difference of arrival between the sound signals from each loudspeaker. To maintain proper stereophonic perspective, the ipsilateral time delay must be adjusted as the listener moves relative to the loudspeakers. The head coordinates provided by the tracking algorithm are used to determine the necessary time-delay adjustment. This information is processed by a 32-bit DSP board (ADSP-2106x SHARC) resident in a Pentium-II PC. In this early version of our system, the DSP board is used to delay the sound from the loudspeaker that is closest to the listener so that sound arrives with the same time difference as if the listener were positioned in the exact center between the loudspeakers. In other words, we have demonstrated stereophonic reproduction with an adaptively optimized sweet spot. We are currently in the process of identifying the bottlenecks of both the tracking and the audio signal-processing algorithms and integrating both into a single, PC-based platform for real-time operation. Furthermore, we are expanding the capability of the current single-camera system to include a second camera in a stereoscopic configuration that will provide distance (depth) information.

Footnote 4: See http://www.tmhlabs.com.
Footnote 5: See http://imsc.usc.edu.

C. Pinna Classification for Enhanced Sound Localization

Immersive audio systems based on averaged HRTFs suffer from serious drawbacks. To map the entire three-dimensional auditory space requires a large number of tedious and time-consuming measurements, which is very difficult to do with human subjects.
A further, and perhaps insurmountable, complication arises from the fact that this process must be repeated for every listener in order to produce accurate results. Last, discrete point measurements represent a quantization of 3-D space that is inherently continuous, thus requiring sophisticated interpolation algorithms that are computationally intensive and can give rise to errors [53]. Several methods have been proposed to overcome such limitations. Functional HRTF representations that make use of models to represent HRTFs have been proposed [54]–[56]; however, most are not suitable for real-time applications because they require significant computational resources. There is significant evidence to suggest that the identification and incorporation of pinna physical characteristics may be a key factor limiting the development of seamless immersive audio systems. The human pinna is a rather complicated structure that for many years was considered to be a degenerate remnant from past evolutionary forms. It was assumed to be a sound-collecting horn whose purpose was to direct sound into the ear canal. If this were true, then its physical dimensions would limit its role to a collector of high frequencies (short wavelengths). Experimental results, however, have shown that the pinna is a much more sophisticated instrument [17], [54], [57], [58]. The pin...
