Lunamoon-flow

Lunamoon-flow's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python35.5k 214 5.4k4.4k
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Language:Python15.7k 132 6151.9k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python14.5k 109 1.1k1.2k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.8k 106 596894
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python5.9k 56 280530
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language:Python5.7k 67 128504
lyhue1991/eat_pytorch_in_20_days
Pytorch🍊🍉 is delicious, just eat it! 😋😋
Language:Jupyter Notebook5.4k 62 291.2k
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.1k 49 454387
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language:Python4.1k 41 395298
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language:Python3.1k 40 113246
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Language:Jupyter Notebook2.3k 33 109173
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
Language:Jupyter Notebook1.8k 21 98234
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Language:Python1.8k 29 80124
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Language:Python1.4k 13 89114
bytedance/MVDream
Multi-view Diffusion for 3D Generation
Language:Python834 21 3661
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
Language:Python562 4 2642
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
Language:Python442 6 1953
zcablii/SARDet_100K
[NeurIPS 2024 spotlight] Offical implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection
Language:Python404 3 4029
aceliuchanghong/FAQ_Of_LLM_Interview
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
Language:Jupyter Notebook303 1 117
zeke-xie/deep-learning-dynamics-paper-list
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
252 16 023
IAAR-Shanghai/UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Language:Python184 12 417
wenhuchen/TheoremQA
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
Language:Python154 5 28
FreedomIntelligence/CMB
CMB, A Comprehensive Medical Benchmark in Chinese
Language:Python136 8 2912
pnnl/chemreasoner
ChemReasoner - Catalyst Discovery via Large Language Model-driven Reasoning
Language:Python35 10 35
whyNLP/Conic10K
Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.
Language:Python23 1 22
FANGAreNotGnu/ControlAug
Official Implementation of WACV 2024 paper "Data Augmentation for Object Detection via Controllable Diffusion Models"
Language:Python18 3 33
noewangjy/csprd_dataset
This is the repository for the paper CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market
Language:Python16 2 24
yws-wxs/Vim-F
This project is based on Vim (paper, code) and we appreciate this excellent work.
Language:Python9 1 11
samlhuillier/tunetherag-train-scripts
Language:Jupyter Notebook3 1 00
Lunamoon-flow/learning-rate-decay
开源代码
Language:Python1 1 00