/SIMPAC-2021-182

Data Science Toolkit

Primary LanguagePythonMIT LicenseMIT

Data Science Toolkit

readthedocs License Reproducible Capsule

Data Science Toolkit (DST) is a Python library that helps implement data science related project with ease.

Simple Demo

data = DataFrame()
data.load_dataset('iris')
y = data.get_column('target')
data.drop_column('target')

# decision tree model
model = Model(data_x=data.get_dataframe(), data_y=y, model_type='dt', training_percent=0.8)

# train the model
model.train()

# get all classification evaluation metrics
model.report()

#get the cross validation
model.cross_validation(5)

Documentation

More information can be found on the DST documentation site.

Contributing

Contrubution and suggestions are welcome via GitHub Pull Requests.

Maintainership

We're actively enhacing the repo with new algorithms.

How to cite