Bloom Filter

Bloom Filter

1. Introduction

Bloom Filter is a data structure to check whether an element is in the set or not. A query returns either "possibly in set"(false positive) or "definitely not in set". It can be used if a certain rate of error is allowed. (e.g. check if an url has been visited in web crawling). Check here for more details.

Bloom Filter is helpful for

  • Checking if a key is duplicate

  • Retrieving the key from dictionary

  • Retrieving the intersection of data

2. False Positive Rate

False Positive Rate is related to:

(1) The number of bits (the more the better)

(2) The number of hash functions (the more the better)

(3) The number of elements inserted (the less the better)

3. Implementation

Last updated

Was this helpful?