josebeo2016's Stars
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
ErosRos/conformer-based-classifier-for-anti-spoofing
Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.
Qiskit/qiskit
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
asvspoof-challenge/asvspoof5
nii-yamagishilab/SpeechSPC-mini
Speech Security and Privacy Compendium - Mini
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
sony/diffiner
junkunyuan/Awesome-Domain-Generalization
Awesome things about domain generalization, including papers, code, etc.
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
xieyuankun/Codecfake
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
serverok/squid-proxy-installer
Install Sqid Proxy on Ubuntu/Debian
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
hungdinhxuan/MMS_TTS
chiphuyen/dmls-book
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
sasv-challenge/SASV2_Baseline
SASV2 baseline, a track on ASVspoof5 phase2 challenge
ControlNet/AV-Deepfake1M
[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
TakHemlata/T-EER
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
pytorch/android-demo-app
PyTorch android examples of usage in applications
OverLordGoldDragon/ssqueezepy
Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
wangyongjie-ntu/Awesome-explainable-AI
A collection of research materials on explainable AI/ML
MWiechmann/enron_spam_data
The Enron-Spam dataset preprocessed in a single, clean csv file.
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)