Xuanfang1121

Xuanfang1121's Stars

microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python11.1k 70 108699
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6.2k 94 11342
SophonPlus/ChineseNlpCorpus
搜集、整理、发布中文自然语言处理语料/数据集，与有志之士共同促进中文自然语言处理的发展。
Language:Jupyter Notebook6k 116 241.4k
microsoft/TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
Language:Python5.5k 65 218693
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python5k 51 215522
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python4.2k 30 484252
facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
Language:Python1.7k 28 10135
juand-r/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Language:Python1.5k 41 13247
MeetKai/functionary
Chat language model that can use tools and interpret the results
Language:Python1.5k 21 127116
hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
1.4k 30 494
Tencent/Tencent-Hunyuan-Large
Language:Python1.3k 25 1570
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Language:Python786 17 2839
shibing624/ChatPDF
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能，基于本地LLM、embedding模型、reranker模型实现，无须安装任何第三方agent库。
Language:Python655 6 32111
epfLLM/Megatron-LLM
distributed trainer for LLMs
Language:Python556 18 5978
CLUEbenchmark/FewCLUE
FewCLUE 小样本学习测评基准，中文版
Language:Python503 12 1473
zhilizju/Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and repositories.
Language:Python321 3 014
IAAR-Shanghai/Awesome-Attention-Heads
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
Language:TeX296 8 288
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
Language:Python268 13 2420
rail-berkeley/crossformer
Language:Python214 15 1022
OpenNLPLab/lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
Language:Python191 12 1516
alipay/financial_evaluation_dataset
Language:Python179 1 116
deepseek-ai/ESFT
Expert Specialized Fine-Tuning
Language:Python167 9 420
wangyuxinwhy/generate
A Python Package to Access World-Class Generative Models
Language:Python126 1 415
goombalab/hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
Language:Python118 6 159
LeapLabTHU/Deep-Incubation
Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)
Language:Python91 2 25
test-time-training/ttt-lm-kernels
Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Language:Cuda48 3 24
Cranial-XIX/longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
Language:Python45 2 23
DACUS1995/pytorch-mmap-dataset
A custom pytorch Dataset extension that provides a faster iteration and better RAM usage
Language:Python42 2 27
junkangwu/beta-DPO
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Language:Python38 2 21
RenzeLou/Muffin
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Language:Python14 2 03