Compressing wideband signals with FLAC

# -*- coding: utf-8 -*- """ Created on Sun Feb 23 18:43:14 2014 @author: Patrick Mineault """ import getopt, sys, glob import os import os.path def do_plx(d): #check for a plx file files = glob.glob(d + '/*.plx') for f in files: fname_root = os.path.basename(os.path.splitext(f)[0]) dir_name = os.path.dirname(os.path.dirname(f)) mat_name = '%s/mat/%s.mat' % (dir_name,fname_root) if os.path.isfile(mat_name): print "plx file can be eliminated" os.remove(f) def do_flac(d): """Does the actual compression via flac""" files = glob.glob(d + '*_ch0*') print "found something to flac" for f in files: #only compress the files with no extensions if os.path.splitext(f)[1] == '': os.system("flac -f --endian=little --channels=1 --bps=16 --sample-rate=10000 --sign=signed %s" % f) #Make sure that the .flac file actually exists before removing the original! if os.path.isfile(f + ".flac"): print "Removing %s" % f os.remove(f) def recursive_flac(d): dirs = glob.glob(d + '*/') for d in dirs: dirname = os.path.basename(os.path.dirname(d)) if dirname == 'mat': print "found dir %s" % d do_flac(d) if dirname == "plx": do_plx(d) else: #recursify recursive_flac(d) def usage(): print """python great_compressor.py -d directory_name Recursively looks for /mat/*_ch* files and compresses them with flac, then deletes the original files. Also removes spurious plx files if any""" def main(): try: opts, args = getopt.getopt(sys.argv[1:], "d:") except getopt.GetoptError as err: # print help information and exit: print str(err) # will print something like "option -a not recognized" usage() sys.exit(2) thedir = "." for o, a in opts: if o == "-d": thedir = a else: assert False, "unhandled option" recursive_flac(thedir + '/') if __name__ == "__main__": main()

3 responses to “Compressing wideband signals with FLAC”

Martijn van Beurden says:

April 7, 2014 at 4:57 am

If I understood this correctly, the data is in fact 12 bit, which make me wonder: why store it as 16-bit? FLAC can store 12-bit data as well. If you want the ease of working with 16-bit, it might be an option to alter the padding: if you pad the data in a way FLAC detects it, FLAC automatically uses it’s wasted bits mechanism, which makes it output that padded 16-bit data, but stores it internally as 12-bit.

It might save you another 30%, something to consider I’d think!

Reply
- xcorr says:
  
  April 7, 2014 at 12:50 pm
  
  I’m pretty sure that doesn’t matter, although it might be worth trying. Plexon stores data as 16-bit unsigned integers while it actually only uses a dynamic range of 12 bits. FLAC uses a linear dynamic model to predict the coefficients, and it uses a compression scheme on the residuals to maximize their entropy; so it wouldn’t actually use 16 bits to encode the residuals if it needs less than 16 bits. Maybe there’s some internals that would be slightly more efficient if 12 bits was specified though. Hard to say, but I don’t think you’ll get 25% extra compression because of the entropy coding mechanism already in place.
  
  Reply
  - Martijn van Beurden says:
    
    April 9, 2014 at 3:17 am
    
    I can assure you it does: I tried. If I take some noisy music, make it 12-bits and pad it with zeros, the filesize shrinks with 32%. However, If I pad in a way that doens’t trigger the wasted bits mechanism (for example, I pad with 1111) I don’t get any compression benefit.
    
    The entropy coding stage assumes small random values, it doesn’t use a table approach or reordering like general purpose compressors to reduce the range of occuring values because it usually doesn’t happen in music signals, which FLAC was made to handle. The lower bits in music usually contain noise.
    
    Except OptimFROG, all lossless audio codecs I know of don’t look for a reduced number of used values (as they expect noise), but quite a few of them handle the special case in which the last x bits are zero, because certain systems, for example DVD-Audio and LossyWAV use this to store data that is not the usual 8, 16 or 24 bit without end users having to support for example playing 19-bit audio.
    
    So, in short, FLAC can only benefit if the last x bits are zero. If you’d like the extra 25%-30% space saving and are sure those last 4 bits don’t contain any information, just set them to zero and FLAC will do the rest.

Compressing wideband signals with FLAC

3 responses to “Compressing wideband signals with FLAC”

Leave a comment Cancel reply