bchamand's Stars
immich-app/immich
High performance self-hosted photo and video management solution.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
daytonaio/daytona
The Open Source Dev Environment Manager.
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Vaibhavs10/insanely-fast-whisper
Linzaer/Ultra-Light-Fast-Generic-Face-Detector-1MB
💎1MB lightweight face detection model (1MB轻量级人脸检测模型)
yangchris11/samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
facebookresearch/sapiens
High-resolution models for human tasks.
apple/ml-mgie
airtai/faststream
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
AgentOps-AI/tokencost
Easy token price estimates for 400+ LLMs. TokenOps.
OpenGenerativeAI/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
hkchengrex/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
timmeinhardt/trackformer
Implementation of "TrackFormer: Multi-Object Tracking with Transformers”. [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
OpenLemur/Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
revdotcom/reverb
Open source inference code for Rev's model
cvdfoundation/ava-dataset
The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.
G-U-N/Be-Your-Outpainter
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
mogwai/nanodrz
Speaker Diarization with Transformers
Martlgap/face-alignment-mtcnn
A lightweight python implementation of face alignment with MTCNN landmarks using tensorflow-lite runtime