theovincent

PhD student at IAS - TU Darmstadt, interested in Reinforcement Learning. Graduated from MVA - ENS Paris-Saclay & Ponts ParisTech.

IAS TU DarmstadtDarmstadt

Pinned Repositories

Website
Dash website to display our results at https://www.multidimensionality-of-aging.net/
Language:Python5 4 00
ppo
Proximal Policy Optimization Algorithm implementation for the Deep Reinforcement Learning course @ MVA
Language:Python1 3 230
TrainingCenter
Trains machine learning algorithms to predict the age and the risk of dying for participants of NHANES dataset
Language:Python4 0 00
3DPointCloudClassification
Challenge to classify 3D point clouds of cities into Ground - Building - Poles - Pedestrians - Cars - Vegetation
Language:Jupyter Notebook6 1 01
coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Language:Python0 0 00
CPDE
Results of the research to reproduce
Language:Jupyter Notebook1 0 00
PBO
Projected Bellman Operator (PBO). An attempt to learn a parametric Bellman Operator.
Language:Python1 4 00
theovincent
Self presentation
2 1 00

theovincent's Repositories

theovincent/3DPointCloudClassification
Challenge to classify 3D point clouds of cities into Ground - Building - Poles - Pedestrians - Cars - Vegetation
Language:Jupyter Notebook6 1 01
theovincent/theovincent
Self presentation
2 1 00
theovincent/CPDE
Results of the research to reproduce
Language:Jupyter Notebook1 0 00
theovincent/PBO
Projected Bellman Operator (PBO). An attempt to learn a parametric Bellman Operator.
Language:Python1 4 00
theovincent/birdClassification
Kaggle class competition on bird classification. Dataset used: Caltech-UCSD Birds-200-2011 bird dataset.
Language:Jupyter Notebook0 1 00
theovincent/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Language:Python0 0 00
theovincent/DirectFuturePrediction
Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
Language:Python0 0 00
theovincent/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook0 0
theovincent/double_pendulum
Dual purpose Acrobot and Pendubot Platform
Language:Python
theovincent/HeightmapsGeneration
This project was made on GitLab. In reality, there are four contributors : Auriane Riou, Candice Van Den Bergh, Alex Fauduet and Théo Vincent
Language:Python1 0
theovincent/HouseNumberRecognition
Language:Python1 0
theovincent/lidar-bonnetal
Semantic and Instance Segmentation of LiDAR point clouds for autonomous driving
Language:Python0 0
theovincent/MOPSI
Language:Python1 0
theovincent/ReinforcementLearningWithDemonstration
Problem tackled: How to wisely use expert demonstrations in Reinforcement Learning
Language:Jupyter Notebook1 0
theovincent/ruptures
ruptures: change point detection in Python
Language:Python0 0
theovincent/s3-cp-action
GitHub Action for S3 cp
Language:Shell0 0
theovincent/SAG_vs_SDCA
The idea of this project is to study different machine learning algorithms
Language:Python1 0
theovincent/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
Language:Python
theovincent/TweetSentimentExtraction
Extract support phrases for sentiment labels : Kaggle Competition
Language:Jupyter Notebook1 0
theovincent/website_tutorial
Small website to learn the basics of Dash
Language:Python0 01
theovincent/xport
Python reader and writer for SAS XPORT data transport files.
Language:Python0 0

theovincent

Pinned Repositories

Website

ppo

TrainingCenter

3DPointCloudClassification

coach

CPDE

PBO

theovincent

theovincent's Repositories

theovincent/3DPointCloudClassification

theovincent/theovincent

theovincent/CPDE

theovincent/PBO

theovincent/birdClassification

theovincent/coach

theovincent/DirectFuturePrediction

theovincent/dopamine

theovincent/double_pendulum

theovincent/HeightmapsGeneration

theovincent/HouseNumberRecognition

theovincent/lidar-bonnetal

theovincent/MOPSI

theovincent/ReinforcementLearningWithDemonstration

theovincent/ruptures

theovincent/s3-cp-action

theovincent/SAG_vs_SDCA

theovincent/sbx

theovincent/TweetSentimentExtraction

theovincent/website_tutorial

theovincent/xport