swasun's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
KindXiaoming/pykan
Kolmogorov-Arnold Networks
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
lxneng/xpinyin
Translate Chinese characters (hanzi, 汉字) to pinyin (拼音) with Python
zjc062/mind-vis
Code base for MinD-Vis
LaPreprint/LaPreprint
📝 A nicely formatted LaTeX preprint template
johnmarktaylor91/torchlens
Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.
as-ideas/DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
soundata/soundata
Python library for downloading, loading & working with sound datasets
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of out-of-domain (OOD) custom voices.
pyt-team/TopoModelX
Topological Deep Learning
lrnzgiusti/awesome-topological-deep-learning
A curated list of topological deep learning (TDL) resources and links.
ASR-project/Multilingual-PR
Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. This project compares three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pretrained on a corpus of English speech, and uses them in various ways to perform phoneme recognition for different languages with a network trained with the Connectionist Temporal Classification (CTC) algorithm.
Rongjiehuang/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
ViCCo-Group/thingsvision
Python package for extracting representations from state-of-the-art computer vision models
artemyk/ibsgd
Zhangyanbo/MLP-KAN
Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)
ftyers/commonvoice-utils
Linguistic processing for Common Voice
arthur-pe/slicetca
Library to perform Slice Tensor Component Analysis (sliceTCA)
danchern97/RTD_AE
Official repository for "Learning topology-preserving data representations", presented at the ICLR 2023 conference.
ViCCo-Group/frrsa
Python package to conduct feature-reweighted representational similarity analysis.
uzaymacar/simple-speech-features
Simple, straightforward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.
zjc062/MindVideo
neuralcodinglab/brain2gan
caycogajiclab/sliceTCA_paper
Code accompanying Pellegrino*, Stein*, & Cayco-Gajic (2023).
timsainb/neuroethology_paper_2021
For the 2021 Current Opinions paper