CSE 228F Week 2, Part 2
Digital Video

The bandwidth required for digital video is staggering. Uncompressed NTSC video requires a bandwidth of 20MByte/sec, HDTV requires 200MByte/sec! Various encoding techniques have been developed in order to make digital video feasible. Two classes of encoding techniques are Source Encoding and Entropy Encoding.

Source Encoding:

Source encoding is lossy and applies techniques based upon properties of the media. There are four types of source encoding:

Entropy Encoding:

Entropy Encoding techniques are lossless techniques which tend to be simpler than source encoding techniques. The three entropy encoding techniques are:

    Disadvantages of Hufferman coding.   (1)not constant bit coding,  error propagation. (2) need to know the percentage of symbols before encoding.

JPEG

JPEG is an acronym for Joint Photographic Experts Group, the group that created the standard. It provides fast, efficient compression for images. It is also the basis for the MPEG video compression standard.

JPEG Stages

Picture Preparation

The first step of the picture preparation phase converts the image into components up to 256 separate components. In JPEG, compression is never done across planes. The standard set of planes for JPEG is YUV.

Now, we can use some information about the source. We know that Y is more important to humans than U or V, so we can subsample the U and V planes by a factor of four in both the x and y direction, encoding a value for sixteen adjacent pixels as one meta-region.

Now, 8x8 blocks of values will be sent through the remaining stages. Note that for the subsampled U and V planes, 64 meta-regions comprise a block. Blocks pass through the stages one at a time and an entire region is completed before the next region begins. This is called Interleaving, and it means that block 1 will have its Y,U,V values encoded before block 2 begins. Due to the subsampling of U and V, one U and V encoding will be sent with 16 Y encodings.

Transform

There are many transforms, most of which are very slow. This is important to consider since video demands real-time encoding and decoding. The JPEG committee took suggestions and empirically studied the use of several different transforms. Although it is not optimal; of the transforms studied, DCT (Discrete Cosine Transform) proved superior.

In JPEG, DCT operates on one block at a time. Because there are 64 elements in an 8x8 block, this is called the 64-element or 64-coefficient DCT. The DCT transform operates on this block in a left-to-right, top-to-bottom manner.

We will examine DCT next time.