Pinned Repositories
ASR-corpus-collection
audiofpdemo
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
chatbot_simbert
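A retrieval-based WeChat chatbot/question-answering system that communicates asynchronously through an API to interact on WeChat; the project covers both the models and production deployment. Features include weather lookup, knowledge-graph chat queries, generative question-answering chat queries, image recognition, and repeated answers; techniques involved include named entity recognition, similarity matching (bm25, boolean retrieval, simbert, etc.), bert+seq2seq generation, and neo4j knowledge-graph queries.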
coco-annotator
Web-based image segmentation tool for object detection, localization, and keypoints
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
datasetforaudiofp
Top 1000 Spotify music download data
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
dejavu
Audio fingerprinting and recognition in Python
woody0105's Repositories
woody0105/hlsjs-p2p-engine
Let your viewers become your unlimitedly scalable CDN.
woody0105/lpmspoc
woody0105/p2p-media-loader
An open-source engine for P2P streaming of live and on demand video directly in a web browser HTML page
woody0105/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
woody0105/pitree
Practical Implementation of ABR Algorithms Using Decision Trees (ACM MM 2019)
woody0105/fpdemo
woody0105/audiofpdemo
woody0105/go-webrtcvad
cgo interface to WebRTC Voice Activity Detection
woody0105/chatbot_simbert
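A retrieval-based WeChat chatbot/question-answering system that communicates asynchronously through an API to interact on WeChat; the project covers both the models and production deployment. Features include weather lookup, knowledge-graph chat queries, generative question-answering chat queries, image recognition, and repeated answers; techniques involved include named entity recognition, similarity matching (bm25, boolean retrieval, simbert, etc.), bert+seq2seq generation, and neo4j knowledge-graph queries.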
woody0105/datasetforaudiofp
Top 1000 Spotify music download data
woody0105/kissfft
A Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid
woody0105/LRAT
woody0105/lpmsdemo
Livepeer media server for scene classification & object detection
woody0105/dejavu
Audio fingerprinting and recognition in Python
woody0105/ASR-corpus-collection
woody0105/ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more
woody0105/kaldi
This is the official location of the Kaldi project.
woody0105/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
woody0105/lpmsclient
woody0105/hls.js
JavaScript HLS client using Media Source Extension
woody0105/exercises
woody0105/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
woody0105/MaskTrackRCNN
MaskTrackRCNN for video instance segmentation based on mmdetection
woody0105/enigmadecrypt
Python script that decrypts enigma
woody0105/Denari
Denari
woody0105/video-speech-recognition
Auto-generated video subtitles for the web using machine learning
woody0105/Mask-RCNN-bottle-training
woody0105/Paper-Implementations
Use PyTorch to implement some classic frameworks
woody0105/ffplaymfc