Creative Technology Explained

5 Tips for Optimizing Audio File Sizes

Joel Lee 25-07-2016

Everyone should, at some point or another, learn how audio files work. This knowledge may seem trivial or unimportant, but it can really come in handy — such as when recording music A Beginner's Guide to Making Music With Audacity at Home Audacity is a fantastic piece of software for all musicians. Here's how to use Audacity to make music at home! Read More , creating a podcast How to Start Your Own Successful Podcast Podcasts can be heard by millions of people worldwide. Now it's easier than ever to collaborate, record, edit, publish, and promote your own show. We show you how. Read More , or optimizing your music library The 4 Best Tools to Manage Your MP3 Music Collection These are the best tools to manage your MP3 collection, helping you cure your music library management headaches. Read More .


In this post, we’re going to explore the various factors that affect audio quality and audio file size. Striking the perfect balance between the two isn’t easy, of course, but by the end you should know enough to feel comfortable and experiment for yourself.

Note: To put this knowledge into practice, you’ll want to grab a free audio editor like Audacity 12 Creative Uses for Audacity: Podcasts, Voiceovers, Ringtones, and More Audacity isn't just useful for podcasts and music production. Here are some creative ways to use Audacity and how to get started. Read More or one of the many Audacity alternatives out there Need to Record & Edit Audio? 4 Audacity Alternatives to Try Audacity can be a fantastic audio recording and editing tool, especially because of its cross platform and open source nature. However, there may be a number of reasons why you do not wish to use... Read More . Learning those tools is beyond the scope of this piece.

1. Sample Rate

In real life, sound is a wave. When someone speaks or claps their hands, what you’re actually hearing is a change in pressure that travels through the air and eventually hits your eardrums.

But how do we capture that sound and convert it into digital data? We can’t just record the full sound wave as it is; instead, we have to take periodic “snapshots” of the sound over time. When you play it all back in sequence, you get an approximate recreation of the original sound.



Each snapshot is called a sample and the interval used between each snapshot is called the sample rate. The shorter the interval, the faster the frequency. Faster frequencies produce more accurate recordings, but also require more data to store each second of recorded sound.

For example, CD quality audio uses a sample frequency of 44.1 KHz (or 44,100 samples per second) whereas TV and DVD quality audio uses a sample freqency of 48 KHz. Given a 10-minute uncompressed mono audio recording, the former might be 51.7 MB while the latter would be 56.3 MB.

You can drop to 32 KHz for speech-only recordings and not experience much loss in quality, but stick to 44.1 KHz if music is involved or if you need utmost quality. Dropping to 22.05 KHz will sound closer to AM radio.

2. Bitrate

Bitrate is not the same thing as sample rate. A lot of people tend to conflate the two, but it’s important that you don’t. First of all, if sample rate is how often the snapshots of sound are taken, then bit depth is how much data is recorded during each snapshot.


To illustrate, imagine a sound wave as a stream of water and you’re trying to capture (i.e. record) that water with a bucket. Sample rate would be how often you dip your bucket into the stream while bit depth would be the size of your bucket.


The higher the bit depth, the more data is captured per sample. This leads to a more accurate recording at the expense of more space required to store that data. But if you reduce the bit depth too much, sound data gets lost.

Bitrate is how much actual sound data is processed per second; in this case, you multiply the sample rate by the bit depth. A CD audio file with a 44.1 KHz sample rate and a 16-bit depth would have an uncompressed bitrate of 705.6 kbps.


For more on optimal bitrates, read the last section in this article on File Formats.

Sometimes the full bitrate isn’t needed in a given snapshot, such as when there’s silence. In that case, you can use variable bitrate (VBR), which is supported by MP3, OGG, AAC, and WMA. In the past, VBR wasn’t widely supported, but nowadays isn’t much of an issue.

3. Stereo vs. Mono

This point is pretty straightforward so I’ll keep it brief. Mono means one channel while stereo means two channels. The two channels in a stereo audio file can be referred to as the “left” and “right” channels.

With a pair of headphones, you’ll be able to hear one of the stereo channels in one ear and the other stereo channel in the other ear. When listening to a mono audio file, you’ll hear the same exact channel in both ears.



In a sense, stereo audio files are essentially two mono audio files in one — which means that a stereo audio file is always twice as big as a mono audio file, assuming the sample rate, bit depth, source sound, etc. are the same between the two.

So the easiest way to instantly cut an audio file size in half is to convert it from stereo to mono. For voice-only recordings, mono is almost always preferred for this reason.

Note that stereo is what makes a lot of music sound more 3D, as if the music is playing around you rather than at you (i.e. mono sounds flatter). But a lot of people can’t tell the difference, so you might be fine with it. Only you can decide if it’s worth the cut.

4. Compression

If you’re working with WAV files, the only way to reduce file size is by tinkering with one of the above settings (sample rate, bit depth, or number of channels). For everything else, compression is the biggest factor in audio file size.

There are two kinds of compression:

  • Lossy compression removes “unnecessary” data from the audio, such as sounds that are beyond the hearing range of most people. Once compressed, this discarded data can’t be recovered.
  • Lossless compression takes an audio file and packs it down as much as possible using mathematical algorithms, but these must be decompressed at the time of playback, which requires more processing power. No actual data is lost.

Lossless compression results in the same quality as uncompressed audio, but even at its best, lossless compression results in file sizes that are at least twice as large as lossy compression. For optimal file sizes, go with lossy compression.

If you’ve never compressed an audio file before, or if you’re looking for a good tool to get the job done, consider using one of these easy and effective ways to compress audio How to Compress Large Audio Files: 5 Easy and Effective Ways Need to reduce the size of your audio files? Here are several ways to compress large audio files. Read More .

5. File Format

Once you’ve decided to go with lossy compression, you have to decide which file format is best for you. As of this writing, the three most popular options are MP3, OGG, and AAC. Learn more in our comparison of audio file formats The 10 Most Common Audio Formats: Which One Should You Use? You know about MP3, but what about AAC, FLAC, OGG, or WMA? Why do so many audio file formats exist and is there a best audio format? Read More .

MP3 is the most popular by far, mainly because it was the first of the three to arrive on the scene. AAC is technically better than MP3, but doesn’t have the same usage rate. OGG is good too, but not many devices support it, so stick with MP3 or AAC.

Regardless of which one you use, you’ll end up compressing to a target bitrate. If we assume you’re going to use the MP3 format, then these are the five most common bitrates currently used:

In terms of file size reduction, an MP3 compressed to 128 kbps loses approximately 90% of the original sound data, whereas an MP3 compressed to 320 kbps only loses about 60%.

Also, if you have an MP3 and an AAC both compressed to the same bitrate, the AAC will often sound better because it uses a more advanced compression algorithm. This means you can get more “quality per megabyte” with AAC than MP3.

Understanding these five factors will not only help you to decide the best way to record and compress music and/or podcasts 7 Audacity Tips for Better Audio Editing on a Budget Here are several useful Audacity tips that will make your life much easier when editing audio, especially if you're on a budget. Read More that you’ve created, but can also help you decide what kind of music formats to purchase or which streaming services to use.

As a listener, what’s your preferred file format and bitrate for music? As a creator, what settings do you use for your music or podcasts? Let us know with a comment below!

Image Credits: Sample Rate via Wikimedia, Bit Depth via Wikimedia, Stereo Track via Audacity Manual

Related topics: File Compression, MP3, Podcasts, Record Audio.

Affiliate Disclosure: By buying the products we recommend, you help keep the site alive. Read more.

Whatsapp Pinterest

Leave a Reply

Your email address will not be published. Required fields are marked *

    January 31, 2019 at 3:50 pm

    Bitrate of a CD = 44.4 KHz x 16 bit x 2 channels = 1411 kBps (1,411 kbps).

    Difference between 192 kbps and 320 is noticeable unless one hasn't an ear for detail. Difference between 320 kbps and 1411 kbps (and higher bitrates) is easily noticeable with high-end equipment - I speak from a lot of experience.

  2. Anonymous
    July 26, 2016 at 6:20 am

    1) You should also consider the difference between "normal stereo" vs "joint stereo". Joint stereo usually has smaller file sizes without any perceptible difference in quality. If in doubt, I would choose joint stereo.

    2) Most audiophiles can't even tell the difference between 192 kbps mp3 files from the original (using double blind testing), let alone 320 kbps. 320 kbps is just overkill in all cases.

    Also, the biggest factor to consider before bitrate and file format is the sound equipment used (speakers) and the listener's auditory senses. I encode my CDs and FLACs to 140~ kbps in AAC. The reason is that listening to heavy metal music has damaged my ears somewhat, and that more often than not I listen to music using the cheap earphones that come with my smartphone or on my laptop's basic speakers.

    3) Audio compression should only be used on lossless conversion. "Lossy compression" should be "lossy encoding/transcoding". Imagine an uncompressed PCM/WAV file. If you convert it to MP3, you are encoding it to a different, lossy format. But if you convert the file into a .zip file, you are merely compressing it. FLAC/APE/ALAC files are merely ZIP files for uncompressed audio, with better compression levels.

  3. Tahomid Borhan
    July 25, 2016 at 8:21 pm

    Very Nice