SutirthaChakraborty

Senior Research Engineer (Xperi.Co), Ph.D (Human-Robot Synchronization for Musical Ensemble, Maynooth University), Music Composer

PhDMaynooth

SutirthaChakraborty's Stars

xtekky/gpt4free
The official gpt4free repository | various collection of powerful language models
Language:Python61k 475 1.4k13.3k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python55.8k 457 1325.7k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python35.1k 292 1.1k4.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.1k 186 4902.2k
spmallick/learnopencv
Learn OpenCV : C++ and Python Examples
Language:Jupyter Notebook21.3k 884 31911.6k
PaddlePaddle/PaddleHub
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固，暂停交互，请耐心等待】
Language:Python12.7k 182 1.3k2.1k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python8.4k 121 350798
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.8k 64 1.2k717
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language:Jupyter Notebook5.9k 86 145593
meta-llama/llama-models
Utilities intended for use with Llama models.
Language:Python4.6k 62 113807
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language:Python4.6k 41 447447
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Language:Jupyter Notebook4.6k 45 667505
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Language:Python3.2k 29 665247
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Language:Jupyter Notebook1.5k 23 77231
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Language:Jupyter Notebook1.4k 18 6478
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language:Python1.1k 27 56111
niconielsen32/ComputerVision
Language:Jupyter Notebook983 24 24596
av-savchenko/face-emotion-recognition
Efficient face emotion recognition in photos and videos
Language:Jupyter Notebook676 9 52125
krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
665 18 269
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python619 15 4343
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
Language:Jupyter Notebook491 17 1136
jcvasquezc/DisVoice
feature extraction from speech signals
Language:Jupyter Notebook354 13 2979
yistLin/dvector
Speaker embedding (d-vector) trained with GE2E loss
Language:Python272 11 1046
lucidrains/lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
Language:Python246 22 512
FORARTfe/HyMPS
HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.
114 10 910
sunlicai/HiCMAE
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Language:Python89 1 97
BreezeWhite/interesting-colabs
Personal colab collections which I feel interesting.
Language:Jupyter Notebook50 3 03
yochaiye/LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
Language:Python46 2 47
ThisIs-Developer/Body-Language-Detection-with-MediaPipe-and-OpenCV
Explore the world of non-verbal communication like never before with our Body Language Detection solution. Utilizing the advanced capabilities of MediaPipe and OpenCV, we provide real-time insights into human gestures, postures, and facial expressions.
Language:Jupyter Notebook5 1 02
haoheliu/nider
Python package to add text to images, textures and different backgrounds
Language:Python1 1 0

SutirthaChakraborty

SutirthaChakraborty's Stars

xtekky/gpt4free

labmlai/annotated_deep_learning_paper_implementations

coqui-ai/TTS

hpcaitech/Open-Sora

spmallick/learnopencv

PaddlePaddle/PaddleHub

THUDM/CogVideo

modelscope/FunASR

HVision-NKU/StoryDiffusion

meta-llama/llama-models

AILab-CVC/YOLO-World

Deci-AI/super-gradients

towhee-io/towhee

timsainb/noisereduce

mhamilton723/FeatUp

haoheliu/versatile_audio_super_resolution

niconielsen32/ComputerVision

av-savchenko/face-emotion-recognition

krantiparida/awesome-audio-visual

ddlBoJack/emotion2vec

TIGER-AI-Lab/AnyV2V

jcvasquezc/DisVoice

yistLin/dvector

lucidrains/lumiere-pytorch

FORARTfe/HyMPS

sunlicai/HiCMAE

BreezeWhite/interesting-colabs

yochaiye/LipVoicer

ThisIs-Developer/Body-Language-Detection-with-MediaPipe-and-OpenCV

haoheliu/nider