Yip-Jia-Qi's Stars
SpeechColab/GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
feder-cr/linkedIn_auto_jobs_applier_with_AI
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
robatwilliams/openai-excel-functions
Create OpenAI chat completions from Excel formulas
inoueakimitsu/ExcelAgentTemplate
Sample Excel add-in and Python script code to run an agent using LLM from an Excel function
hrishioa/mandark
Simple AI coder that can do most of my work for me, including working on himself.
sp-uhh/ears_benchmark
Generation scripts for EARS-WHAM and EARS-Reverb
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
vgel/repeng
A library for making RepE control vectors
impresso/llm-transcript-postcorrection
A repository for preliminary work on HTR/OCR/ASR post-correction based on GPT models.
TomohikoNakamura/asteroid_jaCappella
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
samwit/llm-tutorials
A set of LLM Tutorials from my youtube channel
sfcompute/tinynarrations
A synthetic story narration dataset to study small audio LMs.
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
PrathameshDhande22/PdfTxtBot
A Telegram bot which extract Text from PDF, also extract the Images of PDF Pages. Made with Python
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Swall0w/torchstat
Model analyzer in PyTorch
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
mpariente/pywsj0-mix
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
mooey5775/DePerceiver
Improving Small Object Detection in DETR. Submitted as a 16824 final project
xbresson/CS6208_2023
Advanced Topics in Artificial Intelligence, NUS CS6208, 2023
lucidrains/block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
CheyneyComputerScience/CREMA-D
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
alibabasglab/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
lindermanlab/S5
JusperLee/TDANet
An efficient speech separation method