Result: Balanced allocation and dictionaries with tightly packed constant size bins
Altova GmbH, Rudolfsplatz 13a, 1010 Wien, Austria
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Mathematics
Further Information
We study a particular aspect of the balanced allocation paradigm (also known as the two-choices paradigm): constant sized bins, packed as tightly as possible. Let d > 1 be fixed, and assume there are m bins of capacity d each. To each of n ≤ dm balls two possible bins are assigned at random. How close can dm/n = 1 + e be to 1 so that with high probability each ball can be put into one of the two bins assigned to it without any bin overflowing? We show that e > (2/e)d-1 is sufficient. If a new ball arrives with two new randomly assigned bins, we wish to rearrange some of the balls already present in order to accommodate the new ball. We show that on average it takes constant time to rearrange the balls to achieve this, for e > βd, for some constant β < 1. An alternative way to describe the problem is in data structure language. Generalizing cuckoo hashing [R. Pagh, F.F. Rodler, Cuckoo hashing, J. Algorithms 51 (2004) 122-144], we consider a hash table with m positions, each representing a bucket of capacity d ≥ 1. Keys are assigned to buckets by two fully random hash functions. How many keys can be placed in these bins, if key x may go to bin h1 (x) or to bin h2 (x)? We obtain an implementation of a dictionary that accommodates n keys in m = (1+ε)n/d buckets of size d = 0(log(1/ε)), so that key x resides in bucket h1 (x) or h2 (x). For a lookup operation, only two hash functions have to be evaluated and two segments of d contiguous memory cells have to be inspected. If d ≥ 1 + 3.26 ·ln(1/ε), a static arrangement exists with high probability. If d ≥ 16 ln(1/ε), a dynamic version of the dictionary exists so that the expected time for inserting a new key is log(1/ε)0(loglog(1/ε)).