Dissecting the GZIP format
Describe the DEFLATE algorithm that GZIP implements and depends on. The DEFLATE algorithm uses a combination of LZ77, Huffman codes and run-length-encoding; this article describes each in detail by walking through an example and developing source code to implement the algorithm. My aim is to implement readable rather than efficient or extensible code. I’ll focus here on unzipping, rather than zipping, but by the end of the article, the zipping process should be clear.
Appears in lists (1)
More like this (3)
How to turn an ordinary gzip archive into a database This article demonstrates how specially crafted...How to turn an ordinary gzip archive into a database This article demonstrates how specially crafted but ordinary gzip archives can be used as a database like storage. The ideal is to sort the source files then gzip. When everything is combined, the complete search system takes less than 0.3 seconds to search in the 1TB of compressed data. It also introduces a Python...
How to make compressed file quines, step by step The goal here is to allow for...How to make compressed file quines, step by step The goal here is to allow for any curious, technically-minded newcomer to make sense of all the concepts involved in creating compression quines.