Result: Compressed dictionaries : Space measures, data sets, and experiments

Title:
Compressed dictionaries : Space measures, data sets, and experiments
Source:
Experimental algorithms (5th international workshop, WEA 2006, Cala Galdana, Menorca, Spain, May 24-27, 2006)Lecture notes in computer science. :158-169
Publisher Information:
Berlin: Springer, 2006.
Publication Year:
2006
Physical Description:
print, 15 ref 1
Original Material:
INIST-CNRS
Document Type:
Conference Conference Paper
File Description:
text
Language:
English
Author Affiliations:
Department of Computer Sciences, Purdue University, West Lafayette, IN 47907-2066, United States
ISSN:
0302-9743
Rights:
Copyright 2007 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Notes:
Computer science; theoretical automation; systems
Accession Number:
edscal.19131151
Database:
PASCAL Archive

Further Information

In this paper, we present an experimental study of the space-time tradeoffs for the dictionary problem, where we design a data structure to represent set data, which consist of a subset S of n items out of a universe U = {0,1,..., u-1} supporting various queries on S. Our primary goal is to reduce the space required for such a dictionary data structure. Many compression schemes have been developed for dictionaries, which fall generally in the categories of combinatorial encodings and data-aware methods and still support queries efficiently. We show that for many (real-world) datasets, data-aware methods lead to a worthwhile compression over combinatorial methods. Additionally, we design a new data-aware building block structure called BSGAP that presents improvements over other data-aware methods.