giobin's Stars
pytorch/torchtitan
A native PyTorch Library for large model training
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
wouterkool/attention-learn-to-route
Attention based model for learning to solve different routing problems
ricsinaruto/Seq2seqChatbots
A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
marco-roberti/pytorch-e2e-dataset
The E2E Dataset, packed as a PyTorch DataSet subclass
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
grzegorzbrze/Conpartir0.2
A new conpartir project