songge25

songge25's Stars

lucasjinreal/Namo-R1
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
Language:Python13314
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python2.6k348
hiyouga/EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Language:Python1k51
dyh/unbox_yolov5_deepsort_counting
yolov5 deepsort 行人车辆跟踪检测计数
Language:Python983249
codelion/optillm
Optimizing inference proxy for LLMs
Language:Python2.1k160
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language:Python7.1k619
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript28k2.3k
LLaVA-VL/LLaVA-NeXT
Language:Python3.5k322
open-compass/VLMEvalKit
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Language:Python2k284
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Language:Python18.8k1.3k
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML5.8k670
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell15.8k1.1k
sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Language:Python38947
OleehyO/TexTeller
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
Language:Python46352
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Language:Python27527
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML14.9k1.7k
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language:Python4.3k328
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python11.6k2.6k
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python8.8k637
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook47.4k5k
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language:Python3.2k281
kovzol/Java-Geometry-Expert
Java Geometry Expert
Language:Java3520
NVIDIA/NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
Language:Jupyter Notebook808112
megvii-research/NAFNet
The state-of-the-art image restoration model without nonlinear activation functions.
Language:Python2.4k302
SupritYoung/RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
Language:Python24819
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python11.1k1.1k
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.6k424
wgwang/awesome-LLMs-In-China
**大模型
6k505
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python43k5.2k
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language:Jupyter Notebook9.8k778