alexdiment's Stars
Mu-Y/mpl-mdd
[Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
MStypulkowski/diffused-heads
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
152334H/DL-Art-School
TorToiSe fine-tuning with DLAS
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
replicate/cog
Containers for machine learning
mamba-org/mamba
The Fast Cross-Platform Package Manager
haoheliu/voicefixer
General Speech Restoration
jonatasgrosman/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
ahrm/sioyek
Sioyek is a PDF viewer with a focus on textbooks and research papers
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
CornellNLP/ConvoKit
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
EvgenyKashin/stylegan2-distillation
zademn/mnist-mlops-learning
In this project I played with mlflow, streamlit and fastapi to create a training and prediction app on digits
LSYS/LexicalRichness
😸 💬 A module to compute textual lexical richness (aka lexical diversity).
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
tsproisl/textcomplexity
Linguistic and stylistic complexity measures for (literary) texts
ideonate/streamlit-launchpad
Browse a folder containing multiple streamlit apps and launch them immediately
pariajm/english-fisher-annotations
A recipe for constituency parsing, disfluency tagging, and obtaining fluent transcripts of the English Fisher dataset
pariajm/joint-disfluency-detector-and-parser
Improving Disfluency Detection by Self-Training a Self-Attentive Model
google-research-datasets/Disfl-QA
A Benchmark Dataset for Understanding Disfluencies in Question Answering
4uiiurz1/keras-cosine-annealing
Keras implementation of Cosine Annealing Scheduler
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
PrithivirajDamodaran/Styleformer
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
PrithivirajDamodaran/Gramformer
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Maxmudjon/Get_MiHome_devices_token
Windows/macOS app for getting Mi Home device tokens.
smallwood69/homebridge-cgllc-airmonitor-s1
phurwicz/hover
🚤 Label data at scale. Fun and precision included.