Pinned Repositories
.github
Awesome-Medical-LLM
Large language models for medical AI and General Medical AI (GMAI)
AIOS
AIOS: LLM Agent Operating System
AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
AutoKG
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities
awesome-rabbit-r1
A list of resources for hacking on the Rabbit r1
chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
colpali
The code used to train and run inference with the ColPali architecture.
HMC-SNUH
pytorch-template
PyTorch deep learning projects made easy.
Eruly's Repositories
Eruly/AIOS
AIOS: LLM Agent Operating System
Eruly/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Eruly/awesome-rabbit-r1
A list of resources for hacking on the Rabbit r1
Eruly/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Eruly/colpali
The code used to train and run inference with the ColPali architecture.
Eruly/CuMo
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Eruly/deep-learning-pytorch-huggingface
Eruly/EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
Eruly/GUICourse
GUICourse: From General Vision Language Models to Versatile GUI Agents
Eruly/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate, Groq (100+ LLMs)
Eruly/llama-agentic-system
Agentic components of the Llama Stack APIs
Eruly/LLaMA-Factory-ko
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Eruly/LLM-Workshop
LLM Workshop by Sourab Mangrulkar
Eruly/lmeval
Eruly/LOMO
LOMO: LOw-Memory Optimization
Eruly/MambaVision
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Eruly/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Eruly/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Eruly/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Eruly/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Eruly/outlines
Structured Text Generation
Eruly/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Eruly/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Eruly/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Eruly/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Eruly/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Eruly/vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark 👀. Evaluation code for the "ColPali: Efficient Document Retrieval with Vision Language Models" paper.
Eruly/vision-process-webui
💡💡💡 Awesome computer vision app in Gradio
Eruly/Visual-CoT
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
Eruly/xft
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts