woody0105

Keep It Simple Stupid.

Pinned Repositories

ASR-corpus-collection
00
audiofpdemo
Language:Python00
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Language:Python00
chatbot_simbert
检索类型的微信聊天机器人/问答系统，通过API异步通信，实现在微信上交互，本项目包括模型和工程化部署一体化。包含查天气，知识图谱聊天查询，生成式问答聊天查询，图片识别，多次重复回答等；涉及到命名实体识别，相似匹配（bm25，bool检索，simbert等），bert+seq2seq生成，neo4j知识图谱查询等技术。
Language:Python00
coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
Language:Vue00
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Language:Python00
coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Language:Python00
datasetforaudiofp
Top 1000 spotify music download data
00
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
Language:C++00
dejavu
Audio fingerprinting and recognition in Python
Language:Python00

woody0105's Repositories

woody0105/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Language:Python00
woody0105/coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
Language:Vue00
woody0105/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Language:Python00
woody0105/E2FGVI
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
woody0105/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
woody0105/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
woody0105/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
woody0105/guided-inpainting
Towards Unified Keyframe Propagation Models
woody0105/i-Code
woody0105/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
woody0105/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
woody0105/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
woody0105/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
woody0105/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
woody0105/PixelLib
Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/
woody0105/RL4LMs
A modular RL library to fine-tune language models to human preferences
woody0105/SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation with Segment Anything Model (SAM) and video segmentation with XMem.
Language:Python
woody0105/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
woody0105/segmentation
Language:C
woody0105/smarttranscoding
Language:C
woody0105/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library
woody0105/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
woody0105/Tracking-Solov2-Deepsort
The MOT implement by Solov2+DeepSORT with C++ (Libtorch, TensorRT).
woody0105/trlx
woody0105/Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
woody0105/websocket-mse-demo
Stream H264 to browsers with websocket and w3 media source extensions
woody0105/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
woody0105/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
woody0105/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C0 0
woody0105/yolov7-segmentation
YOLOv7 Instance Segmentation using OpenCV and PyTorch