SOLARleisu's Stars
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
zju3dv/EfficientLoFTR
Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Ucas-HaoranWei/Slow-Perception
Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step
getomni-ai/zerox
PDF to Markdown with vision models
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
LinXueyuanStdio/Data-for-LaTeX_OCR
LaTeX OCR 的数据仓库
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
allenai/open-instruct
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
meijieru/crnn.pytorch
Convolutional recurrent network in pytorch
LingyvKong/OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
ucaslcl/Fox
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
Ucas-HaoranWei/Vary-tiny-600k
Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
faustomorales/keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
google-research/omniglue
Code release for CVPR'24 submission 'OmniGlue'
hms-dbmi/CHIEF
Clinical Histopathology Imaging Evaluation Foundation Model
cvlab-epfl/disk
Disk code release
zju3dv/LoFTR
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
cvg/GlueStick
Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)