/Arabic-names-gender-classification

Arabic names gender classification with random forest

Primary LanguageJupyter Notebook

Arabic-names-gender-classification

Arabic names gender classification with Maching learning using python.

I used in this project a dataset contain +6000 name labled with gender (either male or female) you can check it here

I used Random Forest Classifier, the model treats only names longer than or equal to 2 characters

the features I used are:

  • the last letter of the name
  • the second last letter of the name
  • the third last letter of the name (in case of len(name)==2 the third last character will be "-")
  • boolean indicator of whether the name contains ابو
  • boolean indicator of whether the name contains عبد