mayank-git-hub
I am currently studying Electrical Engineering at IIT Bombay. I am interested in Machine Learning, specifically in combining audio and video.
Sony Research and Development Japan
Mumbai, India
mayank-git-hub's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
pytube/pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.
abraunegg/onedrive
OneDrive Client for Linux
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
mosaicml/llm-foundry
LLM training code for Databricks foundation models
facebookresearch/LASER
Language-Agnostic SEntence Representations
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
graykode/gpt-2-Pytorch
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
Kyubyong/g2p
g2p: English Grapheme To Phoneme Conversion
samc621/SneakerBot
All-in-one bot, with auto captcha-solving and proxy management, using Node.js and Puppeteer.
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
swesterfeld/audiowmark
Audio Watermarking
sony/ctm
AI4Bharat/IndicTrans2
Translation models for 22 scheduled languages of India
wavmark/wavmark
AI-based Audio Watermarking Tool
aliutkus/torchinterp1d
1D interpolation for pytorch
saiteja-talluri/Speech2Face
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
TemugeB/python_stereo_camera_calibrate
Stereo camera calibration with python and openCV
CHerSun/NoSleep
Lightweight Windows utility to prevent screen locking
HSU-ANT/gstpeaq
GstPEAQ - A GStreamer plugin for Perceptual Evaluation of Audio Quality (PEAQ)
AI4Bharat/IndicNLP-Transliteration
Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/IndicXlit
KimythAnly/qqdm
A lightweight, fast and pretty progress bar for Python
mayank-git-hub/Text-Recognition
Text Recognition and Detection based on Pixel-Link paper implemented in pytorch
dbigham/ARC
Abstraction and Reasoning Corpus
onedrivejs/onedrive
Cross-platform OneDrive client written in JavaScript for node.js
abdelmaged/anime-dl
CLI to download anime episodes from anime websites like GoGoAnime
IshwaryaAnant/codec-perceptual-loss
Code accompanying our submission to ACM MM on a codec-inspired perceptual loss function