vivekgoquest

CEO at Goquest Media

Goquest Media Ventures, Mumbai

vivekgoquest's Stars

descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Language:Python981214
Curated-Awesome-Lists/awesome-ai-music-generation
A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.
20716
adefossez/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language: Python, 1k stars, 98 forks
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance
Language: Python, 1.8k stars, 295 forks
aris-ai/Audio-and-text-based-emotion-recognition
A multimodal approach on emotion recognition using audio and text.
Language:Jupyter Notebook16231
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python, 7.1k stars, 761 forks
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python, 3.6k stars, 302 forks
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python, 12.8k stars, 1.1k forks
Shahabks/my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
Language:Python30091
Majdoddin/nlp
Language:Jupyter Notebook46156
ancs21/awesome-openai-whisper
A curated list of awesome OpenAI's Whisper
934
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python, 12.7k stars, 1.3k forks
mshumer/ai-researcher
Language:Jupyter Notebook902102
open-mmlab/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Language: Python, 3.6k stars, 598 forks
yuzhms/Streaming-Video-Model
[CVPR2023] Code for "Streaming Video Model"
Language:Python774
mu4farooqi/whisperX
WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.
Language:Python11
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language: Jupyter Notebook, 3.8k stars, 336 forks
jarredou/MVSEP-MDX23-Colab_v2
Colab adaptation of MVSep Model for MDX23 music separation contest
Language:Python27543
nrl-ai/pautobot
🔥 Your private task assistant with GPT 🔥 - Ask questions about your documents.
Language:Python15546
justinjohn0306/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Language:Python6247
justinjohn0306/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python104
Human-Lambdas/human-lambdas
Open Source Human in the Loop platform for anyone to run their own private Mechanical Turk.
Language:TypeScript3410
crowd-sh/crowd-sh
Mechanical Turk for Airtable
Language:Go243
