oweisad's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
hiyouga/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
stas00/ml-engineering
Machine Learning Engineering Open Book
vosen/ZLUDA
CUDA on AMD GPUs
Stability-AI/StableCascade
Official Code for Stable Cascade
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
apple/ml-mgie
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
sgl-project/sglang
SGLang is yet another fast serving framework for large language models and vision language models.
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
adamcohenhillel/ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
mut-ex/gligen-gui
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
epfLLM/meditron
Meditron is a suite of open-source medical Large Language Models (LLMs).
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Nutlope/notesGPT
Record voice notes & transcribe, summarize, and get tasks
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
nomic-ai/nomic
Interact, analyze and structure massive text, image, embedding, audio and video datasets
cyang-kth/fmm
Fast map matching, an open source framework in C++
ZHO-ZHO-ZHO/ComfyUI-YoloWorld-EfficientSAM
Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI
isaac879/Pan-Tilt-Mount
A stepper motor driven, 3D printed and Arduino controlled pan/tilt mount.
tsujuifu/pytorch_mgie
A Gradio demo of MGIE
google-research/syn-rep-learn
Learning from synthetic data - code and models
sheng00125/LIV_handhold
andrewgcodes/repo2prompt
Turn a GitHub repo's contents into a big prompt for long-context models like Claude 3 Opus.
UMich-CURLY/drift
Dead Reckoning In Field Time: Symmetry-Preserving State Estimation Library for Various Robotic Platforms
Peter-obi/Video_summarization_mlx
Transcribe and summarize videos using Whisper and LLMs on the Apple MLX framework
bachittle/open-voice-pilot
Open-source AI for voice control, rivaling Alexa and Siri
etown/LifeNarration