Kou-Guandong's Stars
triton-lang/triton
Development repository for the Triton language and compiler
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
deepseek-ai/DeepSeek-R1
mamba-org/mamba
The Fast Cross-Platform Package Manager
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
AlibabaResearch/DAMO-ConvAI
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
3b1b/manim
Animation engine for explanatory math videos
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
huggingface/trl
Train transformer language models with reinforcement learning.
test-time-training/ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
DanielWarfield1/MLWritingAndResearch
Notebook Examples used in machine learning writing and research
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
ggml-org/llama.cpp
LLM inference in C/C++
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
OthersideAI/self-operating-computer
A framework to enable multimodal models to operate a computer.
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
xai-org/grok-1
Grok open release
chroma-core/chroma
the AI-native open-source embedding database
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
abetlen/llama-cpp-python
Python bindings for llama.cpp
BoltzmannEntropy/interviews.ai
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.