WangShaner's Stars
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
JeroenAdam/my-personal-knowledge-management-system
A Personal Knowledge Management System written in React. Try the demo: https://knowledge.adambahri.com / Java backend here: https://github.com/JeroenAdam/ta3lim-backend
minar09/awesome-virtual-try-on
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
sgrvinod/chess-transformers
Teaching transformers to play chess
RasaHQ/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
dessant/buster
Captcha solver extension for humans, available for Chrome, Edge and Firefox
Florian-Barthel/splatviz
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
hplt-project/OpusCleaner
OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.
multimodal-art-projection/MAP-NEO
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
google/or-tools
Google's Operations Research tools:
andreasfertig/cppinsights
C++ Insights - See your source code with the eyes of a compiler
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
RoyiRa/prompt-to-prompt-with-sdxl
An implementation of the Prompt-to-Prompt paper for the SDXL architecture
Wilfred/difftastic
a structural diff that understands syntax 🟥🟩
lllyasviel/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
google/orbit
C/C++ Performance Profiler
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
huggingface/text-generation-inference
Large Language Model Text Generation Inference
xinghaochen/TinySAM
Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
xx025/stable-video-diffusion-webui
stable-video-diffusion-webui, img to videos| 图片生成视频
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"