giobin

giobin's Stars

pytorch/torchtitan
A native PyTorch Library for large model training
Language:Python2.5k182
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2.1k172
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.5k631
wouterkool/attention-learn-to-route
Attention based model for learning to solve different routing problems
Language:Jupyter Notebook1.1k340
ricsinaruto/Seq2seqChatbots
A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.
Language:Python47274
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook20.5k6k
marco-roberti/pytorch-e2e-dataset
The E2E Dataset, packed as a PyTorch DataSet subclass
Language:Python6
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python83.1k22.4k
grzegorzbrze/Conpartir0.2
A new conpartir project
Language:JavaScript1