What is MPEG

MPEG is an acronym for Moving Picture Experts Group, which commonly refers to the international standard for digital video and audio compression. The official name of the MPEG-1 standard is: “Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Megabits per second.” It is sometimes referred to by its ISO/IEC project number, 11172 parts 1 through 5. However, this video standard is usually just called “MPEG”.

Just as Norton Utilities is a collection of tools for maintaining computer hard drives (and more), MPEG is a collection of tools for compressing audio and video. An important note about MPEG is that it does not specify how to perform compression. It does, however, describe a set of minimum requirements, which the MPEG decoder must live up to. (An MPEG decoder is the device, which plays back the compressed audio and video.) In particular, it defines a fictitious MPEG decoder, that incorporates the minimum requirements which determine whether something is MPEG or not.

 

There has been a lot of confusion in the media about the differences between MPEG-1 and MPEG-2. Contrary to what many people think, MPEG-1 and MPEG-2 are not competitors to one another, and MPEG-2 is not an improved version of MPEG-1. MPEG-1 was in fact designed specifically for delivering video from a single speed CD-ROM drive. *(PXR4 Recording format-16 bit MPEG 1 Audio Layer2 compressed, 32kHz)

MPEG-2 is a completely different standard from MPEG-1, and is designed for different purposes. MPEG-2 is specifically targeted at digital transmission or broadcast of video signals, and supports a much wider range of resolutions and bit-rates than the MPEG-1 standard. MPEG-2 has been chosen as the standard on which HDTV systems will be based.

 

MPEG Audio Codec Family ("Layer 1, 2, 3")

  • Q: Talking about MPEG audio coding, I heard a lot about "Layer 1, 2 and 3". What does it mean, exactly?

  • A: MPEG describes the compression of audio signals using high performance perceptual coding schemes.
    It specifies a family of three audio coding schemes, simply called Layer-1,-2,-3, with increasing encoder complexity and performance (sound quality per bitrate) from 1 to 3.
    The three codecs are compatible in a hierarchical way, i.e. a Layer-N decoder is able to decode bitstream data encoded in Layer-N and all Layers below N (e.g., a Layer-3 decoder may accept Layer-1,-2 and -3, whereas a Layer-2 decoder may accept only Layer-1 and -2.)

  • Q: So we have a family of three audio coding schemes. What does the MPEG standard define, exactly?

  • A: For each Layer, the standard specifies the bitstream format and the decoder.

  •  Q: What have the three audio Layers in common?

  • A: All Layers use the same basic structure. The coding scheme can be described as "perceptual noise shaping" or "perceptual subband / transform coding".

The encoder analyzes the spectral components of the audio signal by calculating a filterbank or transform and applies a psycho-acoustic model to estimate the just noticeable noise-level. In its quantization and coding stage, the encoder tries to allocate the available number of data bits in a way to meet both the bitrate and masking requirements.

The decoder is much less complex. Its only task is to synthesize an audio signal out of the coded spectral components.

All Layers use the same analysis filterbank (polyphase with 32 subbands). Layer-3 adds a MDCT transform to increase the frequency resolution.


All Layers use the same "header information" in their bitstream, to support the hierarchical structure of the standard.

All Layers have a similar sensitivity to biterrors. They use a bitstream structure that contains parts that are more sensitive to biterrors ("header", "bit allocation", "scalefactors", "side information") and parts that are less sensitive ("data of spectral components").

All Layers support the insertion of program-associated information ("ancillary data") into their audio data bitstream.

All Layers may use 32, 44.1 or 48 kHz sampling frequency.

All Layers are allowed to work with similar bitrates:

    • Layer-1: from 32 kbps to 448 kbps

    • Layer-2: from 32 kbps to 384 kbps

    • Layer-3: from 32 kbps to 320 kbps

Back to PXR4 Page

 

| Home | Marshall |VOX |

For KORG products: Keyboards, Dance, Recording, Tuners, ToneWorks