Lunamoon-flow's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
yangjianxin1/Firefly
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
lyhue1991/eat_pytorch_in_20_days
Pytorch🍊🍉 is delicious, just eat it! 😋😋
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
bytedance/MVDream
Multi-view Diffusion for 3D Generation
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
zcablii/SARDet_100K
[NeurIPS 2024 spotlight] Offical implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection
aceliuchanghong/FAQ_Of_LLM_Interview
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
zeke-xie/deep-learning-dynamics-paper-list
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essentially important role in theoretical foundation of deep learning.
IAAR-Shanghai/UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
wenhuchen/TheoremQA
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
FreedomIntelligence/CMB
CMB, A Comprehensive Medical Benchmark in Chinese
pnnl/chemreasoner
ChemReasoner - Catalyst Discovery via Large Language Model-driven Reasoning
whyNLP/Conic10K
Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.
FANGAreNotGnu/ControlAug
Official Implementation of WACV 2024 paper "Data Augmentation for Object Detection via Controllable Diffusion Models"
noewangjy/csprd_dataset
This is the repository for the paper CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market
yws-wxs/Vim-F
This project is based on Vim (paper, code) and we appreciate this excellent work.
samlhuillier/tunetherag-train-scripts
Lunamoon-flow/learning-rate-decay
开源代码