yanyi74

yanyi74's Stars

CyC2018/CS-Notes
:books: Essential fundamentals for technical interviews, Leetcode, computer operating systems, computer networks, system design
177k 5.3k 57251.1k
krahets/hello-algo
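"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing
Language: Java 97.9k 537 22212.4k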
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Python 55.8k 456 1325.7k
Alvin9999/new-pac
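Firewall circumvention: unrestricted internet access, free circumvention, YouTube, fanqiang, software, VPN, one-click circumvention browser, one-click scripts/tutorials for setting up a circumvention server on a VPS, free shadowsocks/ss/ssr/v2ray/goflyway accounts/nodes, circumvention tools, circumvention for PC, mobile, iOS, Android, Windows, Mac, Linux, and routers, YouTube video download, shared US-region Apple ID accounts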
55.4k 1.5k 1.6k9.4k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27k 287 422.2k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language: Jupyter Notebook 14.5k 189 3652.1k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.4k 271 117793
liguodongiot/llm-action
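This project shares the technical principles behind large models along with hands-on experience (large-model engineering and real-world deployment of large-model applications)
Language: HTML 10.2k 82 211k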
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 9.9k 76 1.2k1.3k
DSXiangLi/DecryptPrompt
Summaries of Prompt & LLM papers, open-source data & models, and AIGC applications
2.7k 58 2271
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Language: Python 2k 24 92124
davidmrau/mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
Language: Python 972 5 26102
hellonlp/classifier-multi-label
Multi-label text classification, multi-label, classifier, text classification, BERT, seq2seq, attention, multi-label-classification
Language: Python 702 9 15142
NLPJCL/RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder
Language: Python 483 6 2744
BDBC-KG-NLP/IE-Survey
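A survey of the information extraction field by the natural language processing research team at Beihang University's Advanced Innovation Center for Big Data. It covers subtasks such as entity recognition, relation extraction, and attribute extraction, surveying both academia and industry for each subtask.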
457 13 269
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language: Python 405 18 2244
zhpmatrix/PaperReading
Brief notes on papers read each day
196 20 18
IAAR-Shanghai/xFinder
xFinder: Robust and Pinpoint Answer Extraction for Large Language Models
Language: Python 143 4 43
Mxoder/LLM-from-scratch
Some notes on reproducing LLM work from scratch
Language: Jupyter Notebook 130 4 319
Aligner2024/aligner
Achieving Efficient Alignment through Learned Correction
Language: Python 108 1 75
louieworth/awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (RLHF)
86 9 01
YoumiMa/dreeam
Source code for Document-level Relation Extraction with Evidence-guided Attention Mechanism (DREEAM)
Language: Python 75 1 2716
THUDM/SciGLM
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)
Language: Python 65 12 26
beccabai/Data-centric_multimodal_LLM
Survey on Data-centric Large Language Models
62 1 00
opendatalab/HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Language: Python 62 4 85
Iven-Wang/DocRE-reading-list
A paper reading list on document-level relation extraction
60 9 24
epfl-dlab/SynthIE
The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction".
Language: Python 58 4 35
ICT-GoKnow/KnowCoder
Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose KnowCoder, the most powerful large language model so far for universal information extraction.
Language: Python 55 1 34
wzhouad/WPO
Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
Language: Python 250
Tamiflu233/AutoGPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language: JavaScript 10