Yip-Jia-Qi

Yip-Jia-Qi's Stars

SpeechColab/GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
Language:Python1034
feder-cr/linkedIn_auto_jobs_applier_with_AI
LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
Language:Python11.7k1.8k
robatwilliams/openai-excel-functions
Create OpenAI chat completions from Excel formulas
Language:JavaScript293
inoueakimitsu/ExcelAgentTemplate
Sample Excel add-in and Python script code to run an agent using LLM from an Excel function
Language:C#41
hrishioa/mandark
Simple AI coder that can do most of my work for me, including working on himself.
Language:TypeScript21316
sp-uhh/ears_benchmark
Generation scripts for EARS-WHAM and EARS-Reverb
Language:Python172
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python57442
vgel/repeng
A library for making RepE control vectors
Language:Jupyter Notebook44937
impresso/llm-transcript-postcorrection
A repository for preliminary work on HTR/OCR/ASR post-correction based on GPT models.
Language:Jupyter Notebook6
TomohikoNakamura/asteroid_jaCappella
Language:Python122
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
Language:Python14890
samwit/llm-tutorials
A set of LLM Tutorials from my youtube channel
Language:Jupyter Notebook601176
sfcompute/tinynarrations
A synthetic story narration dataset to study small audio LMs.
Language:Python283
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python1.1k408
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1.1k101
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
Language:Python1.4k270
PrathameshDhande22/PdfTxtBot
A Telegram bot which extract Text from PDF, also extract the Images of PDF Pages. Made with Python
Language:Python42
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k94
Swall0w/torchstat
Model analyzer in PyTorch
Language:Python1.5k144
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
Language:Python4.8k529
mpariente/pywsj0-mix
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
Language:Python465
mooey5775/DePerceiver
Improving Small Object Detection in DETR. Submitted as a 16824 final project
Language:Python8
xbresson/CS6208_2023
Advanced Topics in Artificial Intelligence, NUS CS6208, 2023
Language:Jupyter Notebook30747
lucidrains/block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
Language:Python21119
Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
Language:Jupyter Notebook40844
CheyneyComputerScience/CREMA-D
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
Language:R339119
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
31422
alibabasglab/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
797
lindermanlab/S5
Language:Python24643
JusperLee/TDANet
An efficient speech separation method
Language:Python21927