-
Get from Internet html pages.
- Status - open.
-
Take text from html page.
- Status - open.
-
NOTE! Instead of 1 & 2 steps we use already existing data from the site
-
Transfer from text to normalize text.
- Status - done.
- Package/class/method - prepare_train_data.py.
-
Transfer from normalize text to vector.
- Bag of words
- Status - done.
- Package/class/method - prepare_train_data.py.
- TF-IDF
- Status - open
- Bag of words
-
Create neural network by vector
- lib Keras, softmax
- Status - done
- Package/class/method -
- lib Keras, softmax
TODO:
- Decrease bag_of_words_full.npy 12,3 ГБ O_o