woody0105's Stars
siyuanliii/masa
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
lujiazho/SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation and video segmentation with interactive Segment Anything Model 2 (SAM2).
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
MCG-NKU/E2FGVI
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
vietanhdev/anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
pion/webrtc
Pure Go implementation of the WebRTC API
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
NVlabs/DiscoBox
The Official PyTorch Implementation of DiscoBox.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
cuttingedge1109/matrix-alertmanager-chart
Helm Chart of Matrix Alertmanager
simon-ging/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
JamesWanglf/face_recognition_v2
xwying/torchshow
Visualize PyTorch tensors with a single line of code.
youtubevos/MaskTrackRCNN
MaskTrackRCNN for video instance segmentation based on mmdetection
open-mmlab/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
matterport/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
dbolya/tide
A General Toolbox for Identifying Object Detection Errors
iovisor/bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
cdnbye/hlsjs-p2p-engine
Let your viewers become your unlimitedly scalable CDN.
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
leandromoreira/ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷