dilverse's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
xai-org/grok-1
Grok open release
triton-lang/triton
Development repository for the Triton language and compiler
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
dair-ai/ML-Papers-Explained
Explanation to key concepts in ML
LargeWorldModel/LWM
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
vikhyat/moondream
tiny vision language model
OpenInterpreter/01
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
huggingface/parler-tts
Inference and training library for high-quality TTS models.
quarto-dev/quarto-cli
Open-source scientific and technical publishing system built on Pandoc.
philz1337x/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
pipecat-ai/pipecat
Open Source framework for voice and multimodal conversational AI
Filimoa/open-parse
Improved file parsing for LLM’s
projectx-codehagen/Badget
Badget aims to simplify financial management with a user-friendly interface and robust backend
hustvl/4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
qnguyen3/chat-with-mlx
An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.
xlang-ai/OSWorld
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
amazon-science/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
AI4Bharat/Indic-TTS
Text-to-Speech for languages of India
liuff19/DreamReward
[ECCV 2024] DreamReward: Text-to-3D Generation with Human Preference
jiasenlu/LL3M
LL3M: Large Language and Multi-Modal Model in Jax
xiexh20/HDM
Official implementation for Hierarachical Diffusion Model in CVPR24 Template free reconstruction of human object interaction
Tencent-RoboticsX/NCP
The project page repository for Neural Categorical Priors for Physics-Based Character Control
pharaouk/dharma