Adaptive differential pulse-code modulation

Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio.

Typically, the adaptation to signal statistics in ADPCM consists simply of an adaptive scale factor before quantizing the difference in the DPCM encoder.[1]

ADPCM was developed in the early 1970s at Bell Labs for voice coding, by P. Cummiskey, N. S. Jayant and James L. Flanagan.[2]

In telephony

In telephony, a standard audio signal for a single phone call is encoded as 8000 analog samples per second, of 8 bits each, giving a 64 kbit/s digital signal known as DS0. The default signal compression encoding on a DS0 is either μ-law (mu-law) PCM (North America and Japan) or A-law PCM (Europe and most of the rest of the world). These are logarithmic compression systems where a 13 or 14 bit linear PCM sample number is mapped into an 8 bit value. This system is described by international standard G.711. Where circuit costs are high and loss of voice quality is acceptable, it sometimes makes sense to compress the voice signal even further. An ADPCM algorithm is used to map a series of 8 bit µ-law (or a-law) PCM samples into a series of 4 bit ADPCM samples. In this way, the capacity of the line is doubled. The technique is detailed in the G.726 standard.

Some ADPCM techniques are used in Voice over IP communications. ADPCM was also used by Interactive Multimedia Association for development of legacy audio codec known as ADPCM DVI, IMA ADPCM or DVI4, in the early 1990s.[3]

Split-band or subband ADPCM

G.722[4] is an ITU-T standard wideband speech codec operating at 48, 56 and 64 kbit/s, based on subband coding with two channels and ADPCM coding of each.[5] Before the digitization process, it catches the analog signal and divides it in frequency bands with QMF filters (quadrature mirror filters) to get two subbands of the signal. When the ADPCM bitstream of each subband is obtained, the results are multiplexed and the next step is storage or transmission of the data. The decoder has to perform the reverse process, that is, demultiplex and decode each subband of the bitstream and recombine them.

Referring to the coding process, in some applications as voice coding, the subband that includes the voice is coded with more bits than the others. It is a way to reduce the file size.

Software

The Windows Sound System supported ADPCM in .wav files.[6] The corresponding FFmpeg audio codecs are adpcm-ms and adpcm-ima-wav.[7]

References

  1. Ken C. Pohlmann (2005). Principles of Digital Audio. McGraw-Hill Professional. ISBN 978-0-07-144156-8.
  2. P. Cummiskey, N. S. Jayant, and J. L. Flanagan, "Adaptive quantization in differential PCM coding of speech," Bell Syst. Tech. J., vol. 52, pp. 1105—1118, Sept. 1973.
  3. Recommended Practices for Enhancing Digital Audio Compatibility in Multimedia Systems - legacy IMA ADPCM specification, Retrieved on 2009-07-06
  4. ITU-T G.722 page ITU-T Recommendation G.722 (11/88), "7 kHz audio-coding within 64 kbit/s"
  5. Jerry D. Gibson; Toby Berger; Tom Lookabaugh (1998). Digital Compression for Multimedia. Morgan Kaufmann. ISBN 978-1-55860-369-1.
  6. "Differences Between PCM/ADPCM Wave Files Explained". KB 89879 Revision 3.0. Microsoft Knowledge Base. 2011-09-24. Retrieved 2013-12-30.
  7. "FFmpeg General Documentation - Audio Codecs". FFmpeg.org. Retrieved 2013-12-30.
This article is issued from Wikipedia - version of the 10/27/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.