/datasets

different datasets used in this repository

Machine Learning Datasets

The Machine Learning Datasets repository repository provides a curated collection of datasets specifically designed for use in various machine learning tasks, including regression and classification. These datasets are used in different projects in this github account.

Table of Contents

Introduction

The datasets included in this repository cover a wide range of machine learning problems and are suitable for use in educational projects, research, and development. Each dataset is accompanied by a brief description, format details, and potential use cases.

Datasets

Regression Datasets

This section contains datasets suitable for regression tasks, where the goal is to predict a continuous target variable.

  • Salary Dataset: This dataset contains information about "Years of Experience" and "Salary" for 30 employees in a company. The dataset was obtained from the MLPack dataset website.

Classification Datasets

This section contains datasets suitable for classification tasks, where the goal is to predict a categorical target variable.

  • loan default: Loan_default dataset contains historic data for loan defaultees, along with their associated financial background.

Usage

To use these datasets, simply download the desired dataset from the links provided. Each dataset is typically provided in CSV format for easy integration with most data analysis and machine learning tools.

# Example of how to download a dataset using curl
curl -o dataset_name.csv https://path_to_dataset

Author

This repository is maintained by:

Dr. Saad Laouadi
GitHub Profile
dr.saad.laouadi@gmail.com

License

This repository is licensed under the Creative Commons Attribution 4.0 International License. Feel free to use the datasets in your projects, but please give credit to the original authors where applicable.