jmribeiro's Stars
ollama/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
3b1b/manim
Animation engine for explanatory math videos
maybe-finance/maybe
The OS for your personal finances
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
microsoft/MS-DOS
The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes
meta-llama/llama3
The official Meta Llama 3 GitHub site
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
python/mypy
Optional static typing for Python
dmlc/dgl
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
huggingface/trl
Train transformer language models with reinforcement learning.
official-stockfish/Stockfish
A free and strong UCI chess engine
AI4Finance-Foundation/FinRL
FinRL: Financial Reinforcement Learning. 🔥
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
jonbarron/website
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
lcompilers/lpython
Python compiler
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
lcswillems/rl-starter-files
RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
dezoito/ollama-grid-search
A multi-platform desktop application to evaluate and compare LLMs, written in Rust and React.
google-research/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
facebookresearch/BenchMARL
A collection of MARL benchmarks based on TorchRL
Cognitive-AI-Systems/pogema
POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored to a variety of PO-MAPF settings.
andy-yang-1/DoubleSparse
16-fold memory access reduction with nearly no loss
sustcsonglin/gated_linear_attention_layer
bramgrooten/automatic-noise-filtering
[AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"
jmribeiro/UGP
Code for the paper "Multi-task learning without Catastrophic Forgetting in Deep Reinforcement Learning" (https://login.easychair.org/publications/paper/8RPq)
joemersonsouza/llm_speech
A Python speech recognition system that converts speech into text, then uses a Language Model (LM) to generate responses to user requests. It uses gTTS to convert the LM responses into MP3 audio and VLC to play them.