woody0105

Keep It Simple Stupid.

Pinned Repositories

ASR-corpus-collection
audiofpdemo
Language:Python
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Language:Python
chatbot_simbert
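A retrieval-based WeChat chatbot and question-answering system that communicates asynchronously through an API to interact on WeChat. The project combines the models with production deployment. Features include weather lookup, knowledge-graph chat queries, generative question-answering chat queries, image recognition, and multiple repeated replies. Techniques involved include named entity recognition, similarity matching (bm25, boolean retrieval, simbert, etc.), bert+seq2seq generation, and neo4j knowledge-graph queries.
Language:Python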
coco-annotator
Web-based image segmentation tool for object detection, localization, and keypoints
Language:Vue
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Language:Python
coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Language:Python
datasetforaudiofp
Top 1000 Spotify music download data
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
Language:C++
dejavu
Audio fingerprinting and recognition in Python
Language:Python

woody0105's Repositories

woody0105/hlsjs-p2p-engine
Let your viewers become your unlimitedly scalable CDN.
woody0105/lpmspoc
Language:Go
woody0105/p2p-media-loader
An open-source engine for P2P streaming of live and on demand video directly in a web browser HTML page
woody0105/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
woody0105/pitree
Practical Implementation of ABR Algorithms Using Decision Trees (ACM MM 2019)
woody0105/fpdemo
Language:Go
woody0105/audiofpdemo
Language:Python
woody0105/go-webrtcvad
cgo interface to WebRTC Voice Activity Detection
woody0105/chatbot_simbert
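A retrieval-based WeChat chatbot and question-answering system that communicates asynchronously through an API to interact on WeChat. The project combines the models with production deployment. Features include weather lookup, knowledge-graph chat queries, generative question-answering chat queries, image recognition, and multiple repeated replies. Techniques involved include named entity recognition, similarity matching (bm25, boolean retrieval, simbert, etc.), bert+seq2seq generation, and neo4j knowledge-graph queries.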
woody0105/datasetforaudiofp
Top 1000 Spotify music download data
woody0105/kissfft
a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid
woody0105/LRAT
Language:C
woody0105/lpmsdemo
Livepeer media server for scene classification & object detection
Language:Go
woody0105/dejavu
Audio fingerprinting and recognition in Python
woody0105/ASR-corpus-collection
woody0105/ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more
woody0105/kaldi
This is the official location of the Kaldi project.
woody0105/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
woody0105/lpmsclient
Language:JavaScript
woody0105/hls.js
JavaScript HLS client using Media Source Extension
Language:JavaScript
woody0105/exercises
woody0105/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
woody0105/MaskTrackRCNN
MaskTrackRCNN for video instance segmentation based on mmdetection
woody0105/enigmadecrypt
Python script that decrypts enigma
Language:Python
woody0105/Denari
Denari
woody0105/video-speech-recognition
Auto-generated video subtitles for the web using machine learning
woody0105/Mask-RCNN-bottle-training
woody0105/Paper-Implementations
Use PyTorch to implement some classic frameworks
woody0105/ffplaymfc