Dynamic Audio Compression

DYNAMIC AUDIO COMPRESSION

This note is intended to clarify the sometimes confusing terminology of dynamic Audio Compression.

THRESHOLD
This is the level, measured in dB below peak, below which no alteration of the audio will take place. If the threshold is -1dB then practically no compression will take place. If it is -40dB then pretty well everything will be compressed.

RATIO
Above the threshold any rise in peak level will cause the level of the audio to be reduced. With a 2:1 ratio a 2db increase will be reduced to become a 1 db increase and pro rata. With a 5:1 ratio a 10db increase will be reduced to be a 2 dB increase.

MAKE-UP GAIN
Since the compression reduces the level of the louder parts of the track it is normally necessary to increase the gain in order to bring the peaks back to where they were. This of course results in the quiet (uncompressed) sections being brought up. This is usually applied after the compression. If it is applied before it will be as if you had applied it afterwards but then reduced the threshold.

ATTACK TIME
If the audio is quiet and suddenly goes loud the compression can kick in so fast that the audio is ducked instantaneously, or can be slow so that the level will be reduced more gradually - this will let through sudden peaks. A low attack time is usual - a few ms (milliseconds - thousandths of a second).

RELEASE TIME (RECOVERY TIME)
If the audio has been loud and suddenly goes quiet so that compression is no longer taking place, then the level of this quiet section will be raised back to where it would be with no compression. This can be fast - which can be audible as it comes up suddenly and causes 'pumping' - or slow, so that the recovery is gradual.

Sophisticated compressors can behave differently over different frequency bands.

To get a visual idea of compression, look at this picture of a compressor in Cubase LE:

Look at the square graph in the top left-hand corner, labelled 'Characteristic'. The horizontal axis represents the level of the incoming audio: the vertical axis represents the output after compression.

Note that the line is at 45 degrees from low to -20: this means that up to -20dB the output matches the input. You will see a 'knee' in the graph at -20 and above this the line is flattened: at about -20 input the output is -10 and rises to -1.5 as the input rises to peak - a compression ratio of 2:1 with make-up gain of 8.5 dB.

A limiter is in effect a compressor with an infinity:1 ratio, so that above a specified threshold (usually maximum peak) there is no further gain no matter how much extra level you put in: it's nasty in effect but useful for preventing sudden accidental over-peaks.

Note that we are talking about compression of the dynamic (loud-to-soft) range of audio, not to be confused with digital compression as used in making MP3 files. This is entirely different and works by leaving out bits of the audio which it is calculated the ear will hardly miss.

COMPANDERS

Compressor-expanders are a special case (not relevant to normal audio processing) designed to compress the audio before recording and expand it on playback so as to minimize tape hiss. The simplest system is DBX, which applies 2:1 compression over most of the dynamic range: unfortunately this method can cause the hiss to be heard 'pumping'. The best known is Dolby B, ubiquitous on cassette recorders, which affects only the higher frequencies: unlike a conventional compressor it affects only audio below a certain threshold, compressing as the audio drops below this until there has been about a 10dB increase in the quietest levels. (The actual curve used is a little more complicated than a straight compressor). On playback the process is exactly reversed (requiring accurate level alignment to get the threshold in the right place) and reduces tape hiss by 10dB with remarkable effectiveness, having practicably no detectable effect on the actual audio. Later developments in the professional field - Dolby A and Dolby Spectral - work in a more complex fashion with more frequency bands but the basic principle is the same. Dolby C is the domestic version of Dolby Spectral and is remarkably effective on cassette recordings. However the entire technique has largely been rendered obsolete by digital recording.