luckeciano
Towards General Artificial Intelligence. Primary focus on Deep Reinforcement Learning. PhD Student at University of Oxford.
University of Oxford
Pinned Repositories
ab-test-RL
Using reinforcement learning for AB test
BAL-PM
Deep Bayesian Active Learning for Preference Modeling in Large Language Models (NeurIPS 2024)
bumps
Code for the paper "Bottom-Up Meta-Policy Search", presented at NeurIPS Deep RL Workshop 2019
deep-rl-humanoid-motions-masters
Repository for code from the Master's Thesis "Imitation Learning and Meta-Reinforcement Learning for Optimizing Humanoid Robot Motions".
deep-rl-undergrad-thesis
A Deep Reinforcement Learning Method for Humanoid Kick Motion - Bachelor's Thesis
humanoid-run-ppo
Code for the paper "Learning Humanoid Robot Running Skills through Proximal Policy Optimization"
kaggle-eef-house-prediction
Code from some models used in First Place's solution of Kaggle Data Science Challenge for EEF for ITA/Unifesp SJC students.
ml-room
Implementations of machine learning algorithms from scratch.
neural-networks-generate-lyrics
LSTM-based model for generate music lyrics
transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
luckeciano's Repositories
luckeciano/transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
luckeciano/deep-rl-humanoid-motions-masters
Repository for code from the Master's Thesis "Imitation Learning and Meta-Reinforcement Learning for Optimizing Humanoid Robot Motions".
luckeciano/BAL-PM
Deep Bayesian Active Learning for Preference Modeling in Large Language Models (NeurIPS 2024)
luckeciano/humanoid-run-ppo
Code for the paper "Learning Humanoid Robot Running Skills through Proximal Policy Optimization"
luckeciano/ml-room
Implementations of machine learning algorithms from scratch.
luckeciano/neural-networks-generate-lyrics
LSTM-based model for generate music lyrics
luckeciano/ab-test-RL
Using reinforcement learning for AB test
luckeciano/deep-rl-undergrad-thesis
A Deep Reinforcement Learning Method for Humanoid Kick Motion - Bachelor's Thesis
luckeciano/kaggle-eef-house-prediction
Code from some models used in First Place's solution of Kaggle Data Science Challenge for EEF for ITA/Unifesp SJC students.
luckeciano/bumps
Code for the paper "Bottom-Up Meta-Policy Search", presented at NeurIPS Deep RL Workshop 2019
luckeciano/distributed-tensorflow
training neural networks using distributed tensorflow
luckeciano/automata-to-regular-expression
Automata to Regular Expression Converter - Programming Assignment of "Automata Theory" course.
luckeciano/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
luckeciano/competitive-programming
My Solutions to UVA exercises from Competitive Programming 3 (Book).
luckeciano/cv
My CV
luckeciano/deep-rl-humanoid-kick
Repository for Research Work "A Deep Reinforcement Learning Method for Humanoid Kick"
luckeciano/discrete-structures-for-computing
Exercises for "Discrete Structures for Computing" course.
luckeciano/hackerrank
Solutions to HackerRank's exercises
luckeciano/ita-artificial-intelligence-PSO
Implementation of Particle Swarm Optimization for N-queens problem
luckeciano/kaggle-tensorflow-speech-recognition-challenge
Kaggle Tensorflow Speech Recognition Challenge
luckeciano/luckeciano.github.io
luckeciano/minesweeper-ruby
A simple implementation of minesweeper in Ruby
luckeciano/neural-architecture-search-mlp
A distributed MLP neural architecture search for TORQUE-based clusters
luckeciano/osim-rl
Reinforcement learning environments with musculoskeletal models
luckeciano/python-data-structures
Basic Python Data Structures.
luckeciano/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
luckeciano/Research-Agile-Methods
1600 hours of research in agile for safety critical systems and agile at scale
luckeciano/rl-room
Implementation of RL algorithms from scratch.
luckeciano/software-engineering-assignments-ita
Programming Assignments and Tests from "Software Engineering" course - ITA.
luckeciano/TD-VCL
Temporal-Difference Variational Continual Learning