neethanwu's Stars
coqui-ai/TTS
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
intel/intel-extension-for-transformers
β‘ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsβ‘
Picsart-AI-Research/MI-GAN
[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
yerfor/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
postlight/parser
π Extract meaningful content from the chaos of a web page
huggingface/chat-ui
Open source codebase powering the HuggingChat app
Infisical/infisical
βΎ Infisical is the open-source secret management platform: Sync secrets across your team/infrastructure, prevent secret leaks, and manage internal PKI
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
microsoft/autogen
A programming framework for agentic AI π€
Orillusion/orillusion
Orillusion is a pure Web3D rendering engine which is fully developed based on the WebGPU standard.
mattneary/attention
visualizing attention for LLM users
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! β€οΈ
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
meta-llama/codellama
Inference code for CodeLlama models
abacusai/Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a modelβs information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
auth0/node-jsonwebtoken
JsonWebToken implementation for node.js http://self-issued.info/docs/draft-ietf-oauth-json-web-token.html
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
dqbd/tiktoken
JS port and JS/WASM bindings for openai/tiktoken
microsoft/lida
Automatic Generation of Visualizations and Infographics using Large Language Models
mshumer/gpt-prompt-engineer
mshumer/gpt-oracle-trainer
mshumer/gpt-llm-trainer
geekan/MetaGPT
π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
openai/openai-node
The official Node.js / Typescript library for the OpenAI API