dragon18456's Stars
suno-ai/bark
š Text-Prompted Generative Audio Model
jgm/pandoc
Universal markup converter
moymix/TaskMatrix
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
stanfordnlp/dspy
DSPy: The framework for programmingānot promptingālanguage models
Anarios/return-youtube-dislike
Chrome extension to return youtube dislikes
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
cg123/mergekit
Tools for merging pretrained large language models.
john-kurkowski/tldextract
Accurately separates a URLās subdomain, domain, and public suffix, using the Public Suffix List (PSL).
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
MeetKai/functionary
Chat language model that can use tools and interpret the results
web-arena-x/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
RUC-NLPIR/LLM4IR-Survey
This is the repo for the survey of LLM4IR.
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
kssteven418/I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
ndom91/react-timezone-select
š An extremely usable and dynamic React timezone selector
toshikwa/gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
shawwn/tpunicorn
Babysit your preemptible TPUs
dawntcherian/Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server which converts input audio stream from microphone to text using Google speech to text
cheoljun95/sdhubert
google/t5patches
T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.
dragon18456/flask-sockets-google-speech-api
A simple demo in order to run google speech api on a flask backend with a websocket streaming audio input