Pinned Repositories
ASR-corpus-collection
audiofpdemo
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
chatbot_simbert
检索类型的微信聊天机器人/问答系统,通过API异步通信,实现在微信上交互,本项目包括模型和工程化部署一体化。包含查天气,知识图谱聊天查询,生成式问答聊天查询,图片识别,多次重复回答等;涉及到命名实体识别,相似匹配(bm25,bool检索,simbert等),bert+seq2seq生成,neo4j知识图谱查询等技术。
coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
datasetforaudiofp
Top 1000 spotify music download data
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
dejavu
Audio fingerprinting and recognition in Python
woody0105's Repositories
woody0105/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
woody0105/coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
woody0105/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
woody0105/E2FGVI
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
woody0105/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
woody0105/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
woody0105/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
woody0105/guided-inpainting
Towards Unified Keyframe Propagation Models
woody0105/i-Code
woody0105/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
woody0105/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
woody0105/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
woody0105/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
woody0105/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
woody0105/PixelLib
Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/
woody0105/RL4LMs
A modular RL library to fine-tune language models to human preferences
woody0105/SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation with Segment Anything Model (SAM) and video segmentation with XMem.
woody0105/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
woody0105/segmentation
woody0105/smarttranscoding
woody0105/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library
woody0105/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
woody0105/Tracking-Solov2-Deepsort
The MOT implement by Solov2+DeepSORT with C++ (Libtorch, TensorRT).
woody0105/trlx
woody0105/Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
woody0105/websocket-mse-demo
Stream H264 to browsers with websocket and w3 media source extensions
woody0105/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
woody0105/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
woody0105/whisper.cpp
Port of OpenAI's Whisper model in C/C++
woody0105/yolov7-segmentation
YOLOv7 Instance Segmentation using OpenCV and PyTorch