Pinned Repositories
ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
h8907283's Repositories
h8907283/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
h8907283/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim