rasdani

Pinned Repositories

--headful
Make a web browser multimodal, give it eyes and ears.
Language:Python4 2 01
chat-your-code
Ask questions about your codebase using GPT
Language:Python1 1 00
cheatcode
your cheatcode for productivity
Language:Python1 1 01
distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Language:Python0 0 00
dotfiles-v1
1 1 00
germanrag
GermanRAG - a German dataset for finetuning Retrieval Augmented Generation
Language:Python6 1 00
inference-is-all-you-need
Language:Python6 1 01
mp-transformer
Learn latent primitives of human movement.
Language:Python0 1 01
smolR1
reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
Language:Python80

rasdani's Repositories

rasdani/smolR1
reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
Language:Python80
rasdani/atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
rasdani/build-smolGRPO
Language:Python
rasdani/dotfiles
managed by chezmoi
Language:Shell1 0
rasdani/evalchemy
Automatic evals for LLMs
Language:HTML
rasdani/genesys
rasdani/gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Language:Python
rasdani/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
rasdani/j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
Language:Python
rasdani/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python
rasdani/modal-examples
Examples of programs built using Modal
rasdani/nano-vllm
Nano vLLM
rasdani/open-instruct
AllenAI's post-training codebase
rasdani/OpenHands
🙌 OpenHands: Code Less, Make More
rasdani/prime
prime is a framework for efficient, globally distributed training of AI models over the internet.
rasdani/prime-cli
The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers
Language:Python
rasdani/prime-rl
Language:Python
rasdani/R2E-Gym
Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Language:Python
rasdani/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
rasdani/reasoning-gym
procedural reasoning datasets
Language:Python
rasdani/rllm
Democratizing Reinforcement Learning for LLMs
Language:Jupyter Notebook
rasdani/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Language:Python
rasdani/SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
rasdani/SWE-smith
Scaling Data for SWE-agents
Language:Python
rasdani/triton
Development repository for the Triton language and compiler
rasdani/unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
rasdani/verifiers
Verifiers for LLM Reinforcement Learning
Language:Python
rasdani/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language:Python1
rasdani/verl-internvl
rasdani/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs