Bloom Filter
Bloom Filter
1. Introduction
Bloom Filter is a data structure to check whether an element is in the set or not. A query returns either "possibly in set"(false positive) or "definitely not in set". It can be used if a certain rate of error is allowed. (e.g. check if an url has been visited in web crawling). Check here for more details.
Bloom Filter is helpful for
Checking if a key is duplicate
Retrieving the key from dictionary
Retrieving the intersection of data
2. False Positive Rate
False Positive Rate is related to:
(1) The number of bits (the more the better)
(2) The number of hash functions (the more the better)
(3) The number of elements inserted (the less the better)
3. Implementation
Last updated