0xSage's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
activepieces/activepieces
Your friendliest open source AI automation tool ✨ Workflow automation tool 200+ integration / Enterprise automation tool / Zapier Alternative
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
EricLBuehler/mistral.rs
Blazingly fast LLM inference.
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
pytorch/glow
Compiler for Neural Network hardware accelerators
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
microsoft/DirectML
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
huggingface/optimum-nvidia
MrYxJ/calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
facebookresearch/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Bip-Rep/sherpa
A mobile Implementation of llama.cpp
AudioLLMs/AudioBench
AudioBench: A Universal Benchmark for Audio Large Language Models
homebrewltd/llama3-s
Llama3.1 learns to Listen
premAI-io/benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
triton-inference-server/triton_cli
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
janhq/cortex.llamacpp
cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
ikigai-hq/ikigai
Ikigai is an AI-powered Open Assignment System
janhq/cortex.onnx