iskaj

iskaj's Stars

mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language: Python, 4.1k stars, 462 forks
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Language: Python, 10.2k stars, 1k forks
eduardzamfir/seemoredetails
Repository for "See More Details: Efficient Image Super-Resolution by Experts Mining", ICML 2024
Language:Python1192
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python, 73.9k stars, 8.8k forks
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language: Jupyter Notebook, 9.2k stars, 864 forks
AIGCDesignGroup/ReplaceAnything
2.4k stars, 96 forks
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python, 3.7k stars, 305 forks
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python, 2.3k stars, 490 forks
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language: Python, 2.8k stars, 433 forks
C3Imaging/child_tts_fastpitch
FastPitch text-to-speech (TTS) model for generating high-quality synthetic child speech. The study uses a transfer learning pipeline: a multi-speaker TTS model is fine-tuned to work with child speech, using the publicly available MyST dataset (55 hours) for the fine-tuning experiments.
4
Rikorose/DeepFilterNet
Noise suppression using deep filtering
Language: Python, 2.6k stars, 244 forks
GXYM/TextBPN-Plus-Plus
Arbitrary Shape Text Detection via Boundary Transformer. Paper: https://arxiv.org/abs/2205.05320, accepted by IEEE Transactions on Multimedia (T-MM 2023).
Language:Python17938
gabriben/awesome-generative-information-retrieval
63349
jianfch/stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
Language: Python, 1.7k stars, 182 forks
SubtitleEdit/subtitleedit-cli
Subtitle Edit CLI (without System.Drawing)
Language:C#255
kennethleungty/Failed-ML
Compilation of high-profile real-world examples of failed machine learning projects
72048
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML, 11.3k stars, 952 forks
laurensw75/Words2Num_nl
Convert spelled out numbers in Dutch to numeric form
Language:Python3
jonatasgrosman/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Language:Python43944
rmcelreath/stat_rethinking_2022
Statistical Rethinking course winter 2022
Language: R, 4.1k stars, 444 forks
jonatasgrosman/asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Language:Python516
RameenAbdal/StyleFlow
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)
Language: Python, 2.4k stars, 344 forks
iskaj/Newsgram
AirConsole game for education on news literacy
Language:JavaScript1
advimman/HiDT
Official repository for the paper "High-Resolution Daytime Translation Without Domain Labels" (CVPR2020, Oral)
Language:Jupyter Notebook65085
YolandaDuan/AVI_LeapMotion_Exergaming
Language:C#11