did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9780470195369

Speech and Audio Signal Processing Processing and Perception of Speech and Music

by ; ;
  • ISBN13:

    9780470195369

  • ISBN10:

    0470195363

  • Edition: 2nd
  • Format: Hardcover
  • Copyright: 2011-08-23
  • Publisher: Wiley-Interscience
  • Purchase Benefits
List Price: $147.14 Save up to $0.74
  • Buy New
    $146.40
    Add to Cart Free Shipping Icon Free Shipping

    PRINT ON DEMAND: 2-4 WEEKS. THIS ITEM CANNOT BE CANCELLED OR RETURNED.

Supplemental Materials

What is included with this book?

Summary

When Speech and Audio Signal Processing published in 1999, it stood out from its competition in its breadth of coverage and its accessible, intutiont-based style. This book was aimed at individual students and engineers excited about the broad span of audio processing and curious to understand the available techniques. Since then, with the advent of the iPod in 2001, the field of digital audio and music has exploded, leading to a much greater interest in the technical aspects of audio processing. This Second Edition will update and revise the original book to augment it with new material describing both the enabling technologies of digital music distribution (most significantly the MP3) and a range of exciting new research areas in automatic music content processing (such as automatic transcription, music similarity, etc.) that have emerged in the past five years, driven by the digital music revolution. New chapter topics include: Psychoacoustic Audio Coding, describing MP3 and related audio coding schemes based on psychoacoustic masking of quantization noise Music Transcription, including automatically deriving notes, beats, and chords from music signals. Music Information Retrieval, primarily focusing on audio-based genre classification, artist/style identification, and similarity estimation. Audio Source Separation, including multi-microphone beamforming, blind source separation, and the perception-inspired techniques usually referred to as Computational Auditory Scene Analysis (CASA).

Author Biography

The late Ben Gold consulted at Massachusetts Institute of Technology and Lincoln Laboratory and taught at the University of California at Berkeley. He was the author of Digital Processing of Signals and the coauthor of Theory and Applications of Digital Signal Processing. Dr. Gold was an IEEE Fellow, member of the National Academy of Engineering, and recipient of several IEEE awards.

Nelson Morgan is the Director of the International Computer Science Institute, an independent, not-for profit research laboratory affiliated with the University of California at Berkeley. Dr. Morgan is also Professor-in-Residence in the Electrical Engineering and Computer Sciences Department at UC Berkeley. Dr. Morgan is an IEEE Fellow.

Dan Ellis is Associate Professor in the Electrical Engineering Department of Columbia University. Dr. Ellis's Laboratory for Recognition and Organization of Speech and Audio (LabROSA) investigates how to extract high-level information from audio, including speech recognition, music description, and environmental sound processing.

Table of Contents

Preface To The 2011 Editionp. xxi
Introductionp. 1
Historical Background
Synthetic A Udio: A Brief Historyp. 9
Speech Analysis And Synthesis Overviewp. 21
Brief History Of Automatic Speech Recognitionp. 40
Speech-Recognition Overviewp. 59
Mathematical Background
Digital Signal Processingp. 73
Digital Filtersand Discrete Fourier Transformp. 87
Pattern Classificationp. 105
Statistical Pattern Classificationp. 124
Acoustics
Wave Basicsp. 141
Acoustic Tube Modeling Of Speech Productionp. 152
Musical Instrument Acousticsp. 158
Room Acousticsp. 179
Auditory Perception
Ear Physiologyp. 193
Psychoacousticsp. 209
Models Of Pitch Perceptionp. 218
Speech Perceptionp. 232
Human Speech Recognitionp. 250
Speech Features
The Auditory System As A Filter Bankp. 263
The Cepstrum As A Spectral Analyzerp. 277
Linear Predictionp. 286
A Utomatic Speech Recognition
Feature Extraction For Asrp. 301
Linguistic Categories For Speech Recognitionp. 319
Deterministic Sequence Recognition For Asrp. 337
Statistical Sequence Recognitionp. 350
Statistical Model Trainingp. 364
Discriminant Acoustic Probability Estimationp. 381
Acoustic Model Training: Further Topicsp. 394
Speech Recognition And Understandingp. 416
Synthesis And Coding
Speech Synthesisp. 431
Pitch Detectionp. 455
Vocodersp. 473
Low-Rate Vocodersp. 493
Medium-Rate And High-Rate Vocodersp. 505
Perceptual A Udio Codingp. 531
Other Applications
Some Aspects Of Computer Music Synthesisp. 553
Music Signal Analysisp. 567
Music Retrievalp. 581
Source Separationp. 59
Speech Transformationsp. 617
Speaker Verificationp. 633
Speaker Diarizationp. 644
Table of Contents provided by Publisher. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program