Codelet Parsing: Quadratic-time, Sequential, Adaptive Algorithms for Lossy Compression

Dharmendra S. Modha

doi:10.1109/DCC.2003.1194013

Data Compression Conference

Year: 2003, Pages: 223

Authors

Dharmendra S. Modha, IBM Almaden Research Center

Abstract

We propose new algorithms, collectively termed codelet parsing, for lossy compression. The algorithms sequentially parse a given source sequence into phrases, say, sourcelets, and map each sourcelet to a distorted phrase, say, a codelet, such that the per-letter distortion between the two phrases does not exceed the desired distortion. The algorithms adaptively maintain a codebook (a set of codewords) and do not require any a priori knowledge of the source statistics. The algorithms use approximate string matching and, as a key new idea, at each epoch carefully select one of the many approximately matching codewords to balance the code rate in the current epoch against the code rate from the resulting codebooks in future epochs. The algorithms are quadratic-time in the length of the source sequence and output a distorted sequence that can be naturally losslessly compressed using the Lempel-Ziv (LZ78) algorithm.
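The parsing loop described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it assumes Hamming distortion, starts from a hypothetical codebook of single letters, grows the codebook in an LZ78-like way, and replaces the paper's codeword selection rule (which balances current and future code rates) with a plain longest-match rule. The names `per_letter_distortion` and `parse_lossy` are invented for this illustration.

```python
def per_letter_distortion(a, b):
    """Per-letter Hamming distortion between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def parse_lossy(source, max_distortion):
    """Parse `source` into sourcelets and map each one to a codelet.

    Returns the list of codelets; joined together they form the distorted
    sequence, which has the same length as `source`.
    """
    # Assumption: the initial codebook holds every letter seen in the source,
    # so an exact single-letter match always exists at every position.
    codebook = sorted(set(source))
    known = set(codebook)
    codelets = []
    pos = 0
    while pos < len(source):
        # Approximate string matching: consider every codeword whose
        # per-letter distortion to the upcoming source segment is allowed.
        best = None
        for word in codebook:
            segment = source[pos:pos + len(word)]
            if len(segment) < len(word):
                continue  # codeword would run past the end of the source
            if per_letter_distortion(segment, word) <= max_distortion:
                # Stand-in selection rule: keep the longest match
                # (the paper uses a more careful rule).
                if best is None or len(word) > len(best):
                    best = word
        codelets.append(best)
        pos += len(best)
        # Adaptive codebook update (LZ78-like assumption): add the chosen
        # codelet extended by the next source letter.
        if pos < len(source):
            candidate = best + source[pos]
            if candidate not in known:
                known.add(candidate)
                codebook.append(candidate)
    return codelets


source = "abababbaabab" * 3
codelets = parse_lossy(source, max_distortion=0.25)
distorted = "".join(codelets)
print(codelets)
print(per_letter_distortion(source, distorted))
```

Because every codelet stays within the distortion budget of its own sourcelet, the distortion of the whole output also stays within the budget. With `max_distortion=0` only exact matches are accepted, so the sketch reproduces the source exactly.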
