eliortap/Trigrams
A map-reduce implementation in Apache Hadoop (AWS EMR) for calculating the probabilities of trigrams in the Hebrew language. This project utilizes the deleted estimation two-way cross validation method to calculate trigram probabilities. The Google Hebrew Trigram database serves as this project's corpus.
Java