data compression link collection


The Files label used here doesn’t refer to source files, but rather benchmark, test, and reference files.

x264 - a free h264/avc encoder

x264 is a free library for encoding H.264/AVC video streams. It is released under the terms of the GPL.

* * * *  

Posted in July 2nd, 2007

Silesia compression corpus

Published in Files, Benchmarks

Sebastian Deorowicz decided to create a compression corpus of his own, attempting to overcome some of the deficiencies he sees in the old guard.


Posted in June 23rd, 2003

Benchmark Images and Files

Published in Files, Links, Benchmarks

David Cary is a major link farmer. One of the sections of his massive Data Compression page has links to images and files that are used in various benchmarks.


Posted in May 1st, 2003

Image & Video Quality Assessment at LIVE

The folks at LIVE conducted a subjective test of images at various compression levels, and have made both the images and the results available here.


Posted in April 23rd, 2003

JPEG 2000 Part 4 Conformance Test Files

Published in Files, JPEG-2000

Part 4 of the standard relates to conformance. The files on this site are used in conformance testing. This web site implies that this part of the standard will soon be available for free, but as of March, 2003, this is not the case.


Posted in March 31st, 2003

Data Compression Corpora

Published in Files, Data Compression

Jürgen Abel has a great Data Compression site, and keeps a set of pointers to standard sets of files used for compression. He recently added a reference to the Protein Corpus, a set of difficult-to-compress files that were first published at the 1999 Data Compression Conference.

* * * * *

Posted in February 17th, 2003

JPEG 2000 Code and Test Data

This page contains links to sample code and test data for implementing the JPEG 2000 standard. Looking for conformance files? This is the place. Looking for working implementations? You can find several here.

* * * *  

Posted in January 19th, 2003

Google - Compression Test Images

Published in Files, Image Compression

Google’s directory entry point for test images used in data compression.


Posted in December 9th, 2002

VCEG ftp server

Published in Files, Standards, Video

This is described as the ftp server for the Video Coding Experts Group, which is working on H.26L and other video projects. A ton of stuff here, no guideposts or indices, have at it.


Posted in June 6th, 2002

Compression Codecs

Published in Files, Links, Video, Audio

This site has a nice collection of codecs, including DirectShow filters, MPEG-4, MJPEG, and other video codecs. It also has a few audio codecs and AVI test sequences.


Posted in June 2nd, 2002

Multimedia Test Sequences

Published in Files, Video

Welcome to, a repository for freely-redistributable test sets. We use these to test our codecs, and hope you will too. This site includes a partial mirror of the Video Quality Experts Group test sequences as well.

* * * * *

Posted in April 3rd, 2002

Huffman Compression Engine

This program can currently read and extract files made with LHA and other utilities that generate .lzh files, from -lh4- to -lh7-. Like ARJ, its algorithm is based on Haruhiko Okumura’s work on ar002, which was the foundation of LHA. Unlike Okumura’s work, however, the dictionary size is dynamic and currently allows for dictionary sizes of up to 64KB. On larger files, compression is usually 0.5% to 5% tighter than PKzip, and work in progress will likely yield even better results. The utility natively creates -lh7- signed archives, which on larger files yield slightly better compression than lha32 by Haruyasu Yoshizaki.
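As a rough illustration of what a "-lh7- signed" archive means, the sketch below reads the five-byte method ID that LHA-style headers store at byte offset 2 and maps it to a dictionary size. The function names and the size table are assumptions made for this example (the sizes are the ones commonly documented for these methods); nothing here is taken from the program described above.

```python
from typing import Optional

# Sliding-dictionary sizes commonly documented for these LHA methods.
# Assumption: taken from general descriptions of the .lzh format,
# not from the source of the program described in this entry.
LH_DICT_SIZES = {
    "-lh4-": 4 * 1024,
    "-lh5-": 8 * 1024,
    "-lh6-": 32 * 1024,
    "-lh7-": 64 * 1024,
}


def lzh_method(header: bytes) -> Optional[str]:
    """Return the method ID (e.g. '-lh5-') stored at bytes 2..6, or None."""
    if len(header) < 7:
        return None
    method = header[2:7]
    # A method ID is five ASCII characters wrapped in dashes.
    if not (method.startswith(b"-") and method.endswith(b"-")):
        return None
    try:
        return method.decode("ascii")
    except UnicodeDecodeError:
        return None


def dictionary_size(header: bytes) -> Optional[int]:
    """Return the dictionary size in bytes for a known method, else None."""
    method = lzh_method(header)
    return LH_DICT_SIZES.get(method) if method else None
```

To try it on a real archive, pass in the first few bytes of the file, for example `lzh_method(open(path, "rb").read(7))`, where `path` is whatever .lzh file you have on hand.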


Posted in January 1st, 2002

The New Canterbury Corpus

Published in Files, Data Compression

No details, but it appears to be a collection of files designed to represent a slightly wider range of modern applications.


Posted in December 25th, 2001

CCITT standard images (Bilevel)

Published in Files, Image Compression

Images commonly used in compression tests are stored here in Sun raster format. At this time I believe that all you will find here are gray scale images.


Posted in December 25th, 2001

HawkVoice Speech Samples

Published in Files, Speech

Some speech samples that have been encoded at various rates using various codecs. If you’ve never heard speech encoded at 1.4 Kbps, here’s a chance to check it out.


Posted in May 8th, 2001

Computer Vision Test Images

Published in Files, Image Compression

A list of links to test images. Utopia for the benchmark junkie.


Posted in September 24th, 2000

H.263 Video Coding

Peter Cherriman’s page on H.263 coding includes information, pointers, and a couple of demo sequences.


Posted in July 4th, 2000

H.261 Video Coding

Peter Cherriman’s page on H.261 coding includes information, pointers, and a couple of demo sequences.


Posted in July 4th, 2000

Data Compression Benchmark Suite

A set of links to files that are used to benchmark various data compression algorithms.


Posted in March 10th, 2000

SQAM - Sound Quality Assessment Material

Published in Files, Audio

This site apparently holds a set of files that were used to evaluate MPEG audio compression algorithms.


Posted in January 21st, 2000

MPEG Audio Resources and Software

Links to lots of info regarding the audio compression portions of the MPEG standards. This includes an overview, the MPEG Audio FAQ, pointers to resources, some free software, and test bitstreams.

* * * *  

Posted in November 19th, 1999

Waterloo BragZone test suite

In the BragZone you will find the following:

  • A suite of test images, the “Waterloo Repertoire”.
  • Rate-Distortion plots for various compression codecs.
  • The data from which the above plots are derived.
  • Sample images at selected compression ratios.

* * * * *

Posted in November 14th, 1999

Waterloo BragZone

Published in Files, Benchmarks

Comparing different image compression programs has always been difficult. As a suite of test images and a place for archiving results, the Waterloo BragZone hopes to overcome these problems. Central to the effort is the Waterloo Repertoire, a suite of 32 test images.

* * * * *

Posted in November 14th, 1999

PNG Suite from Willem van Schaik

Published in Files, Benchmarks, PNG

This is Willem van Schaik’s suite of PNG icons for testing PNG decoder engines, PNG viewers, and PNG browsers.

* * * * *

Posted in November 14th, 1999

yabbawhap - Y and AP compression filters

Public domain code by Daniel Bernstein. (Note that this ftp site has an excellent selection of compression programs and code.)

* * *    

Posted in November 13th, 1999

Project Runeberg

Published in Files, Swedish, Benchmarks

A huge collection of Swedish language text files.

* * *    

Posted in November 13th, 1999

Matsusaka University Anonymous FTP Server

Published in Files, Data Compression

An ftp site for compression software. This includes mirrors of some hard-to-find sites, such as those of D.J. Wheeler. Needs organization, and I often find it a bit slow from the US.

* * * * *

Posted in November 13th, 1999

The USC-SIPI Image Database

Published in Files, Image Compression

The USC-SIPI image database is a collection of digitized images. It is maintained primarily to support research in image processing, image analysis, and machine vision. Contains copies of the mystical goddess Lenna.

* * * * *

Posted in November 8th, 1999

CCITT standard fax images

TIFF versions of the CCITT images.

* * * * *

Posted in November 7th, 1999

Image set used in ACT

Published in Files, Image Compression

A set of 23 images totalling 27,133,146 bytes. The file type is RAS; does that mean that the files are a simple raster dump? I’m not sure. These files are listed in a few different places as being used in benchmarks.
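One way to check is to look at the first 32 bytes of a file. RAS is commonly the extension for Sun raster files, which start with a fixed header of eight big-endian 32-bit integers. The sketch below assumes that layout; whether the ACT images actually use it is not confirmed here, and the function name is made up for this example.

```python
import struct
from typing import Optional

# Magic number that opens a Sun raster file.
RAS_MAGIC = 0x59A66A95

# Sun raster "type" values: 0 (old) and 1 (standard) hold uncompressed
# pixel rows; 2 means the pixel data is run-length encoded.
RT_OLD, RT_STANDARD, RT_BYTE_ENCODED = 0, 1, 2


def parse_sun_raster_header(data: bytes) -> Optional[dict]:
    """Parse the 32-byte Sun raster header, or return None if it does not match."""
    if len(data) < 32:
        return None
    fields = struct.unpack(">8I", data[:32])
    magic, width, height, depth, length, ras_type, maptype, maplength = fields
    if magic != RAS_MAGIC:
        return None
    return {
        "width": width,
        "height": height,
        "depth": depth,            # bits per pixel
        "length": length,          # size of the image data in bytes
        "type": ras_type,
        "maptype": maptype,        # 0 means no colormap
        "maplength": maplength,    # colormap size in bytes
        "uncompressed": ras_type in (RT_OLD, RT_STANDARD),
    }
```

If the function returns a dictionary with `uncompressed` set to true, the file is close to a simple raster dump: a short header, an optional colormap, then the pixel rows. If it returns None, the file is something other than Sun raster.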

* * *    

Posted in October 28th, 1999