/faker-dataset

📋 Fake Dataset generated via Faker Python Library 📋

Primary LanguageJupyter NotebookMIT LicenseMIT

📋 Faker Dataset 📋

Amazing library to generate fake datas


📝 Problem Description


Pandas Cartoon

Fig. 1 - Pandas Library Cartoon. ©Real Python



You have been hired for a Data Science job where you must explore the Faker Python Library to generate a dataset with fake datas.

Your dataset file must be named faker-dataset.csv and be stored into dataset folder located at this project directory, contain at least 5 features and 3,000 rows, and all features' values must be generated using the Faker Library.

After generating and saving the datas into a csv file, you must read it as utf-8 charset and show its first 5 rows in order to check out whether the dataset is ok to be used for other Data Scientists.

Good Luck!! 🍀 ☘️


📁 Files Description

faker-dataset.csv - contains at least 3,000 rows of fake datas generated by Faker Python Library.


❓ Library Features

region/locale - the person's locale;

name - the person's name;

email - the person's e-mail;

adress - the person's address;

license_plate - the person's automative's license plate;

company - the company where the generated person works at;

job - the job occupied by the person;

color_name - the person's fav color.


You can check out all the possible features here: Faker Library - Standard Providers


⚙️ Setup

Tools

Python Version 3.9.x+;

Jupyter Notebook.


Libraries

Faker;

Numpy, Pandas.


🎉 Acknowledgments

Fzaninotto and Faker Library team!


📫 Reach Me

Email: csfelix08@gmail.com

Linkedin: linkedin.com/in/csfelix/

Portfolio: CSFelix.io

Kaggle: DSFelix