Pinned Repositories
ai-chatbot-framework
A Python chatbot framework with Natural Language Understanding and Artificial Intelligence.
AllPythonProjects
android_hls_slice
Ports FFmpeg to the Android platform, slices local video files and live camera video streams into TS segments, and generates m3u8 files
api__google-calendar_nodejs
ASR-wav2vec2.0
This repo is for zh-TW ASR with wav2vec2.0.
ASR_benchmark
Program to benchmark various speech recognition APIs
Assistant
auto-meeting
Auto Meeting opens Teams, Meet, and Zoom links in your Google calendar events just in time, so you'll never again forget to join.
Automate-Zoom-Meetings
Python application to automatically join meetings scheduled on Google Calendar
avcapture
A Chrome-to-FFmpeg pipeline for capturing audio/video in a webpage
coderboy24x7's Repositories
coderboy24x7/ai-chatbot-framework
A Python chatbot framework with Natural Language Understanding and Artificial Intelligence.
coderboy24x7/cliam
Agnostic transactional email sending in Node.js environment
coderboy24x7/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
coderboy24x7/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
coderboy24x7/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
coderboy24x7/dust
A more intuitive version of du, written in Rust
coderboy24x7/fastDeploy
Deploy DL/ML inference pipelines with minimal extra code.
coderboy24x7/glow-speak
Neural text to speech system that uses eSpeak as a text/phoneme front-end
coderboy24x7/insomnia
The Open Source API Client and Design Platform for GraphQL, REST and gRPC
coderboy24x7/kiwituri
coderboy24x7/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
coderboy24x7/nativefier
Make any web page a desktop application
coderboy24x7/NemoSTT
coderboy24x7/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
coderboy24x7/openaudiosearch
Open Audio Search
coderboy24x7/puppeteer-stream
A Library for puppeteer to retrieve audio and/or video streams
coderboy24x7/ScribeSalad
A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferriss, Jocko Podcast, ...) and lectures (YaleCourses, MIT lectures, Jordan B. Peterson talks, ...). A big transcript salad spanning history, geography, science, politics, filmmaking, and more.
coderboy24x7/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
coderboy24x7/simplified_vakyansh
Simplified code based on the Vakyansh project for converting Indic speech to text
coderboy24x7/SpeechTextDatasetConstruct
Construct speech dataset
coderboy24x7/thunder-client-support
Thunder Client is a lightweight REST API client extension for VS Code.
coderboy24x7/typeplate
REST API boilerplate with TypeScript, Express.js, TypeORM and Mocha.
coderboy24x7/typescript-rest
This is a lightweight annotation-based Express.js extension for TypeScript.
coderboy24x7/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
coderboy24x7/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
coderboy24x7/Video-Sharing-App
Development of a mobile application that allows you to record and publish videos.
coderboy24x7/voice100-runtime
Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
coderboy24x7/Voice100AndroidApp
Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and Voice100 neural TTS/ASR models on Xamarin. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
coderboy24x7/wav2vec2-asr
wav2vec2 ASR with Transformers
coderboy24x7/wav2vec2-kenlm
Incorporating a KenLM language model with the Hugging Face implementation of the Wav2Vec2CTC model using beam search decoding