varuy322's Stars
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
MLNLP-World/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
EvanLi/Github-Ranking
:star:Github Ranking:star: Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically update daily. | Github仓库排名,每日自动更新
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
sail-sg/regmix
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
amusi/Deep-Learning-Interview-Book
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
justchenhao/ChatDailyPapers
Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is deployed on GitHub automated without the need for manual running locally.
amusi/AI-Job-Notes
AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)
hyperai/awesome-ai4s
AI for Science 论文解读合集(持续更新ing),论文/数据集/教程下载:hyper.ai
datawhalechina/hugging-llm
HuggingLLM, Hugging Future.
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
shengyp/doing_the_PhD
bilibili/Index-1.9B
A SOTA lightweight multilingual LLM
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
varuy322/ColossalAI
Making big AI models cheaper, easier, and more scalable
huytransformer/Awesome-Out-Of-Distribution-Detection
Out-of-distribution detection, robustness, and generalization resources. The repository contains a professionally curated list of papers, tutorials, books, videos, articles and open-source libraries etc
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
cxcscmu/MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models
googleinterns/localizing-paragraph-memorization
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
mlfoundations/dclm
DataComp for Language Models
Ivorforce/ECG-Viewer
A simple program to view MIT-BIH waveform data and annotations.
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
GanjinZero/awesome_Chinese_medical_NLP
中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
bigcode-project/bigcode-analysis
Repository for analysis and experiments in the BigCode project.
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4