lolisun's Stars
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
fxsjy/jieba
"Jieba" Chinese text segmentation
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
fishaudio/fish-speech
Brand new TTS solution
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
arcee-ai/mergekit
Tools for merging pretrained large language models.
tickstep/aliyunpan
Command-line client for Aliyun Drive (阿里云盘), with JavaScript plugin support and sync/backup functionality.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
eyurtsev/kor
LLM(😽)
MaaXYZ/MaaFramework
An automation black-box testing framework based on image recognition
foxofice/sub_share
Subtitle sharing project
openai/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
vastxie/Happy-ChatGPT
ChatGPT "national quintessence" edition: learn authentic ** speech together with GPT
AI-Hobbyist/Genshin_Datasets
Genshin Datasets For SVC/SVS/TTS
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
X-PLUG/CValues
Research on evaluating and aligning the values of Chinese large language models
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
Re-Align/URIAL