hijkzzz's Stars
BIT-aerial-robotics/AquaML
Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama 3 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama 3 for WhatsApp & Messenger.
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
multimodal-art-projection/MAP-NEO
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
riiswa/kanrl
Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments
Johnshall/Shadowrocket-ADBlock-Rules-Forever
Provides multiple Shadowrocket rule sets with strong ad-filtering capabilities. Rules are rebuilt daily at 8:00.
GMOogway/shadowrocket-rules
Shadowrocket rules 🚀 and configurations: the most comprehensive direct-connection (DIRECT), proxy (PROXY), and blocking (REJECT) rules, automatically built and updated daily.
jondurbin/airoboros
Customizable implementation of the self-instruct paper.
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
jzhang38/LongMamba
Some preliminary explorations of Mamba's context scaling.
AntNLP/nope_head_scale
hsiehjackson/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
OpenNLPLab/TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
XuezheMax/megalodon
Reference implementation of Megalodon 7B model
open-compass/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, LLaMA 2, Qwen, GLM, Claude, etc.) over 100+ datasets.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
openai/simple-evals
THUDM/LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
ContextualAI/gritlm
Generative Representational Instruction Tuning
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.