microsoft/0xDeCA10B

[simulation] Add MurmurHash3 option

juharris opened this issue · 0 comments

  • Pick a library (will likely use https://pypi.org/project/mmh3/)
  • Add test cases to compare hashes to make sure that the library is equivalent to the one we use in JavaScript for a few words.
  • Use for word-based datasets (can't do for IMDB yet because words are already mapped to vocabulary indices)