This repository is divided into four parts:
-
data - contains the cleaned and uncleaned data in one of four files:
- Name.txt, Name.csv - contains the data after being cleaned using cleaner program '../coursera_based_implementation/name_cleaner'
- Indian_Male_Names.csv, Indian_Female_Names.csv - contains the database downloaded from kaggle at 'https://www.kaggle.com/chaitanyapatil7/indian-names/version/1'
-
keras_based_implementation - contains the keras with tensorflow based implementation for the model. Using SimpleRNN
- name_predicter.py - program to predict the name given a trained model and parse the data to the trainer
- name_train.py - program to train the neural network on dataset
-
coursera_based_implementation - contains the implementation based on the coursera course on sequence models by Andrew Ng
- utils.py - contains the utilities to make the 'name_pred.py' predicter work
- name_pred.py - contains the predicter program
- name_cleaner.py - cleans the data and extracts only the first name from the data files and saves in Name.txt, Name.csv
-
models - contains keras models trained on different number of epochs