/OpenCompetition

A collection of algorithms for data science competition. Only focus on the tested and latest

Primary LanguagePythonApache License 2.0Apache-2.0

The Open Competition Toolkit

Overview

This is a library that collects the useful codes for data science competition, as well as some code for research purposes.

The library contains materials for Tabular Data Mining, NLP, CV and RL.

Most of the code are written in Python 3.7, except some part of Cython, C++ and R. We do, however, provide an unified interface for python.

Some reference for what will be implemented can be found here https://www.overleaf.com/read/xftrzgtcxkpd (registration needed). I will be hosting the details on the lecture on GeekBang (https://time.geekbang.org/).

Feature of The Library

Environment

It is suggested if you want to use PyTorch and related libraries, to download the docker using the command docker pull opencompetition/opencompetition:0.1. Note that one should use NVidia-docker instead of docker.

To use the tensorflow libraries, it is recommended to use the official google cloud vm's support TPU training. Similarly, if one wishes to use PyTorch XLA to run PyTorch code on GPU, use Google Cloud VM as well.