Aliasing and Foldover: A Primer

May 28, 2014 Dr. AIX

Have you heard of “aliasing” or “foldover”? In a nutshell, these terms describe another potential problem area for PCM digital audio. We’ve talked about quantization noise and jitter in the past but I don’t think that I’ve explored the concept of aliasing. Consider today’s post an introduction to aliasing…or what happens when a frequency that is higher than half the sample rate reaches your analog to digital converter.

The catalyst for today’s post came from yesterday’s post. As you may recall (you can click here to read it), I did some spectra analysis of a DXD download and found the same sort of ultrasonic noise that is typical of 1-bit DSD type encoding…even though DXD is multibit PCM. The post I wrote about demo files also inspired today’s post (click here to revisit that post). The spectra associated with most of the music bundled with a new high-resolution audio smartphone were full of unwanted ultrasonic noise. It turns out that the “out of band” noise that is unavoidable when recording and releasing music in DSD (even music that is then transcoded to high-resolution PCM) can be a very bad thing for a playback device. We’ll get that story tomorrow.

First, we need to understand that there are some very rigid boundaries associated with digitizing audio. One of them is defined by the Nyquist frequency, which is equal to half of the sample rate. For example, if we take the 44.1 kHz sample rate of a Redbook CD, the Nyquist frequency is 22.050 kHz. That’s the absolute maximum frequency that can be captured by a PCM-based system…and that’s under ideal conditions. But what happens if a frequency of 25 kHz comes along. What does the AD converter do? It tries to digitize it. But it turns out that it doesn’t have enough samples available to do it correctly.

A PCM system needs at least 2 samples for each cycle of a signal. They don’t have to match the top and bottom of the periodic waveform (a common mistake), but the converter does need 2 samples. Let’s take a look at an illustration that shows this:

Figure 1 – A series of waveforms showing a 7 kHz and 13 kHz tone sampled in a 10 kHz system with the resulting “alias tones” that are generated and output. [Click to enlarge]

Any frequency presented to an AD converter that is higher than half of the sampling rate cannot be accurately captured and a false frequency called an “alias” is stored or output instead. The green waveform is what we want but the 7 kHz tone is higher than 5 kHz (the Nyquist frequency or half the sample rate of 10 kHz). So the red 3 kHz alias tone is output as is the 13 kHz blue tone. This is obviously a problem and results in foldover. We could filter out the 13 kHz tone at the DAC but the 3 kHz tone is in the middle of the sound we want! Here’s another illustration:

Figure 2 – An illustration showing how the 7 kHz frequency “folds over” around the Nyquist causing a tone that doesn’t belong. [Click to enlarge]

So there you have it. And that’s why AD converters have very precise Low Pass Filters to make sure that any frequency that is greater than the sample rate divided by 2 is never allowed to arrive at the digitizer.

Tomorrow, I’ll talk about the real world and how aliasing can be our friend or out enemy.

4 thoughts on “Aliasing and Foldover: A Primer”

bob

May 28, 2014 at 4:57 pm

Would i be correct to say that if we wanted to record a 1khz tone using the PCM based approach, it wouldn’t matter if we used 24bit/96khz , 16bit/44.1 , 8bit24khz (or much less) recording platforms. The sound wave wouldn’t be “measured”, characterized or analysed in any greater detail when using any spec pcm recording platform. It would either fit in the “bucket” or not. If it did then it can reproduced exactly as it came in. If not then you get all these problems like quantisation and aliasing.

If i am correct so far, then is there a standard “amount” or samples/sec or some other sampling standard that any pcm platform would sample a given wave such as the 1khz tone.

Sorry if i am totally confused. Thanks
- Admin
  
  May 28, 2014 at 5:24 pm
  
  Bob, you need to leave off the 16 or 24-bit part of the specification because it deals with dynamic range and we don’t know how loud the 1 kHz sine wave is (or the range of levels). Otherwise, you’re right a 1 kHz sine (which has no higher partials) can be completely captured and reproduced using any of the specs you outlined.
Holger Christiansen

May 30, 2014 at 12:54 am

About the LPF, it has to attenuate 24 * 6dB = 144dB over one octave. If the input in 24KHZ the attenuation is 0dB and if the input 48KHZ the attenuation has to be 144dB. The order of the filter must 144 / 6 = 24. Is this assumption correct ?
- Admin
  
  May 30, 2014 at 6:04 am
  
  These filters have to be very accurate and very steep. At lower sample rates they are much more difficult to design and implement. That’s one of the reasons why higher sample rates are a good thing. But remember that the input to an AD converter uses analog LPF of very high order.

Dr. AIX

4 thoughts on “Aliasing and Foldover: A Primer”

Leave a Reply Cancel reply