OSEC

Neohapsis is currently accepting applications for employment. For more information, please visit our website www.neohapsis.com or email hr@neohapsis.com
 
Re: [Dailydave] approximate string matching - Bloom filters

From: Mateusz Berezecki (mateuszbgmail.com)
Date: Fri Sep 01 2006 - 13:19:23 CDT


On 9/1/06, Fausett, Mark (US SSA) <mark.fausettbaesystems.com> wrote:
> Bloom filters are approximate in a different sense though -- Think of
> them as space efficient, but lossy token sets; you put a bunch of tokens
> in, and subsequently can query whether a particular token was placed
> into the set; to some degree of confidence.
> Bloom filters are subject to false positives -- they'll sometimes
> incorrectly tell you that a token is in the set -- but not false
> negatives. Because hashing functions are used to insert tokens into the
> bloom filter, the false positives have nothing to do with approximate
> string matches.
>

By trial end error I already discovered this really unwanted behavior :-/

It's very good for representing what is not in the set rather than representing
the set itself.

Mateusz
_______________________________________________
Dailydave mailing list
Dailydavelists.immunitysec.com
http://lists.immunitysec.com/mailman/listinfo/dailydave