One more note: the Soundex and Double Metaphone algorithms may be useful for determining whether two words sound alike.
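
To make the idea concrete, here is a minimal sketch of the classic American Soundex rules in Python. The function names (soundex, sounds_alike) are my own for illustration and do not come from any particular library, and treating two words as "sounding alike" when their codes match is an assumption about how one might use it. Double Metaphone is considerably more involved and is not sketched here.

```python
# Minimal sketch of American Soundex. Names here are illustrative only.

# Map each consonant to its Soundex digit.
CODES = {}
for digit, letters in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for ch in letters:
        CODES[ch] = str(digit)


def soundex(word):
    """Return the 4-character Soundex code of word, or "" if it has no ASCII letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    prev = CODES.get(first, "")  # the first letter's code also suppresses duplicates
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated code after it is kept
    return result.ljust(4, "0")  # pad short codes with zeros


def sounds_alike(a, b):
    """Assumed usage: two words 'sound alike' when their Soundex codes match."""
    return soundex(a) == soundex(b)


print(soundex("Robert"), soundex("Rupert"))  # R163 R163
print(sounds_alike("Robert", "Rupert"))      # True
```

Soundex is coarse: it keys heavily on the first letter and on English consonant groups, so for a word list one would probably want to check candidate words pairwise and drop any that collide.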
And here is yet another attempt at something similar, from years ago, which handles only words, not grammatical sentences:
http://kenta.blogspot.com/2008/08/hash-of-words.html
Ken