lyuwen's Stars
abseil/abseil-py
Abseil Common Libraries (Python)
neelk07/neelkothari
My personal website, including my blog and projects, built in Django
mosaicml/llm-foundry
LLM training code for Databricks foundation models
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
state-spaces/mamba
Mamba SSM architecture
WMD-group/SMACT
Python package to aid materials design and informatics
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
PabloMK7/citra
A Nintendo 3DS Emulator
ChenghaoMou/text-dedup
All-in-one text de-duplication
MasterAI-EAM/Darwin
An open-source project dedicated to building foundational large language models for natural science, mainly in physics, chemistry, and materials science.
ThreeRiversAINexus/sample-agents
dqzboy/Docker-Proxy
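🔥 🔥 🔥 Self-hosted Docker image acceleration service: one-click deployment of image acceleration/management services for Docker, K8s, Quay, Ghcr, Mcr, Nvcr, and more, based on the official Docker Registry. Supports serverless deployment to Render/Koyeb.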
devicons/devicon
Set of icons representing programming languages, designing & development tools
OpenBMB/Eurus
WUWei20/BLTO_CDMFT_benchmark
This repository is for benchmarking the cluster dynamical mean-field theory (CDMFT) study on the bilayer two-orbital Hubbard model of La3Ni2O7 [YY Zheng and W Wú, arXiv:2312.03605 (2023)]
CompFUSE/DCA
DCA++
lbnlp/MatBERT
A pretrained BERT model on materials science literature
google-deepmind/language_modeling_is_compression
mCodingLLC/VideosSampleCode
Code from the mCoding sample videos
Helsinki-NLP/OpusTools
ExpressAI/DataLab
The unified platform for data-related resources.
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring cool arithmetic magic)
mattbierbaum/arxiv-public-datasets
A set of scripts to grab public datasets from resources related to arXiv
conda/constructor
tool for creating installers from conda packages
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
lmmlzn/Awesome-LLMs-Datasets
A summary of existing representative LLM text datasets.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
arcee-ai/mergekit
Tools for merging pretrained large language models.