Text Classification for newspaper
Project for "Introduction to Machine Learning and Data Mining" at HUST.
Built With
Getting Started
Make sure you have python3 and nodejs installed on your computer. Recommended to install on Ubuntu.
To get a local copy up and running follow these simple steps.
Prerequisites
This is an example of how to list things you need to use the software and how to install them.
-
Preprocessed Data
Download at: https://drive.google.com/drive/folders/1iX4VklC9gxY6DbFEcaKvuZAgx9mvntXo?usp=sharing
-
Raw Data
Download at: https://github.com/duyvuleo/VNTC
-
Model And Text Embedding
Download at: https://drive.google.com/file/d/17CeL37I9ldj4y445EHRFQuwRSf34oQb8/view?usp=sharing
Include: svm model, naive bayes model, neural-net model, tfidf vector, svd vector
Installation
- Clone the repo
git clone https://github.com/EdwardNgo/MLDM
- Install python packages
pip install -r requirements.txt
- Install frontend packages
cd react-complete-guide
npm install
- If you dont want train model yourself run script in ipynb file: MLDM.ipynb, TextClassification.ipynb
Link colab for train notebook:
https://colab.research.google.com/drive/1-LHQdwu69mvCKsP6pQFHi9EMuwiBPcGg?usp=sharing, https://colab.research.google.com/drive/1ELORh6xi9rbZmpDXHxTo6NOhKtI0kuaM?usp=sharing
- If you download my model
python app.py
cd react-complete-guide && npm run start