When
Speech and Audio Signal Processing published in 1999, it stood out from its competition in its breadth of coverage and its accessible, intutiont-based style. This book was aimed at individual students and engineers excited about the broad span of audio processing and curious to understand the available techniques. Since then, with the advent of the iPod in 2001, the field of digital audio and music has exploded, leading to a much greater interest in the technical aspects of audio processing.
This Second Edition will update and revise the original book to augment it with new material describing both the enabling technologies of digital music distribution (most significantly the MP3) and a range of exciting new research areas in automatic music content processing (such as automatic transcription, music similarity, etc.) that have emerged in the past five years, driven by the digital music revolution.
New chapter topics include:
-
Psychoacoustic Audio Coding, describing MP3 and related audio coding schemes based on psychoacoustic masking of quantization noise
-
Music Transcription, including automatically deriving notes, beats, and chords from music signals.
-
Music Information Retrieval, primarily focusing on audio-based genre classification, artist/style identification, and similarity estimation.
-
Audio Source Separation, including multi-microphone beamforming, blind source separation, and the perception-inspired techniques usually referred to as Computational Auditory Scene Analysis (CASA).
Helps readers develop an intuitive understanding of audio signal processing Acclaimed for its breadth of coverage as well as its clear, accessible presentation, Speech and Audio Signal Processing examines how machines and humans process audio signals, with an emphasis on speech and music. It begins with basic principles and then explains how these principles set the foundation for a wide range of applications. Moreover, the book is organized into a series of short chapters, offering readers a succinct overview of the range of topics that together represent the current state of knowledge in the field.
This Second Edition brings the book fully up to date with the explosive growth in audio processing technology, including the latest advances in digital music processing and distribution. New topics include:
-
Psychoacoustic audio coding, examining MP3 and related audio coding schemes that are based on the psychoacoustic masking of quantization noise
-
Music transcription, explaining how notes, beats, and chords can be automatically derived from music signals
-
Music information retrieval, exploring audio-based genre classification, artist and style identification, and similarity estimation
-
Audio source separation, describing multi-microphone beamforming, blind source separation, and perception-inspired techniques
Throughout the book, the authors present both human and machine strategies for accomplishing audio processing tasks. Readers will discover that, in many cases, human strategies can provide the inspiration for the development of machine strategies.
Speech and Audio Signal Processing is recommended for anyone who needs to understand the technologies underlying some of today's most cutting-edge applications, including speech recognition, audio compression, music synthesis, and diarization.