PancakeAwesome's Stars
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
01-ai/Yi-1.5
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
TheMistoAI/MistoLine
A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
lllyasviel/IC-Light
More relighting!
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
IDEA-CCNL/Fengshenbang-LM
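Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.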
airaria/TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
airaria/Visual-Chinese-LLaMA-Alpaca
Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)
ymcui/Chinese-LLaMA-Alpaca-3
Chinese Llama & Alpaca LLM project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
lllyasviel/ControlNet
Let us control diffusion models!
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
huggingface/jat
General multi-task deep RL Agent
EricGuo5513/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
SynaLinks/HybridAGI
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
CrazyBoyM/phi3-Chinese
Phi3 Chinese repository
dcharatan/flowmap
Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann
ml-explore/mlx-swift-examples
Examples using MLX Swift
Doriandarko/maestro
A framework for Claude Opus to intelligently orchestrate subagents.
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
McGill-NLP/webllama
Llama-3 agents that can browse the web by following instructions and talking to you
Shenzhi-Wang/Llama3-Chinese-Chat
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
OpenBMB/ProAgent
An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
nlpxucan/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath