Themer
Themer is a Java web application that tries to determine the topic of a given text (the text should be in Russian). It shows the top 10 suggested topics along with their probabilities.
Example
The current version of the application is running at http://themer.herokuapp.com/
How it works
The algorithm analyzes the frequency of words in the given text, then looks those words up in a dictionary. The dictionary is built with data mining methods from data provided by the Yandex.Catalog web service.
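The idea can be illustrated with a minimal sketch. It is not Themer's actual code: the class name `ThemerSketch`, the in-memory dictionary shape (word to per-topic weight), and the scoring rule (word count times weight, normalized to sum to 1) are all assumptions made for illustration. The real application keeps its dictionary in PostgreSQL and may score topics differently. The sketch requires Java 9 or later.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical illustration only; names and scoring rule are assumptions.
class ThemerSketch {

    // Count how often each word occurs; any non-letter character separates words.
    static Map<String, Integer> wordFrequencies(String text) {
        Map<String, Integer> freq = new HashMap<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!token.isEmpty()) {
                freq.merge(token, 1, Integer::sum);
            }
        }
        return freq;
    }

    // Assumed dictionary shape: word -> (topic -> weight).
    // Each topic's score is the sum of count * weight over the words of the text,
    // normalized so that all scores add up to 1.
    static List<Map.Entry<String, Double>> topTopics(
            String text, Map<String, Map<String, Double>> dictionary, int limit) {
        Map<String, Double> scores = new HashMap<>();
        wordFrequencies(text).forEach((word, count) ->
                dictionary.getOrDefault(word, Map.of()).forEach((topic, weight) ->
                        scores.merge(topic, count * weight, Double::sum)));

        double total = scores.values().stream().mapToDouble(Double::doubleValue).sum();
        if (total == 0) {
            return List.of(); // no known words, nothing to suggest
        }
        return scores.entrySet().stream()
                .map(e -> Map.entry(e.getKey(), e.getValue() / total))
                .sorted(Map.Entry.<String, Double>comparingByValue().reversed())
                .limit(limit)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Toy dictionary with made-up weights.
        Map<String, Map<String, Double>> dictionary = Map.of(
                "футбол", Map.of("Спорт", 1.0),
                "матч", Map.of("Спорт", 0.5),
                "кино", Map.of("Кино", 1.0));
        topTopics("Футбол, матч, футбол и кино", dictionary, 10)
                .forEach(e -> System.out.println(e.getKey() + ": " + e.getValue()));
    }
}
```

Words that are missing from the dictionary (such as "и" in the toy input) simply contribute nothing to any topic.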
Technologies used
The technology stack is minimal and as lightweight as possible. It is almost entirely plain Java:
- Java
- Jetty
- PostgreSQL
- Maven
Running the application
The application is built with Maven.
Let Maven do the work:
mvn clean install
Then run it:
java -cp target/classes:target/dependency/* com.art.themer.Main
Deploying
The project is ready to deploy to Heroku via git. Just create a Heroku app, set up the git remote, and push this repo.