josebeo2016

josebeo2016's Stars

mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language: Python · 6.5k stars · 363 forks
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python669112
ErosRos/conformer-based-classifier-for-anti-spoofing
Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.
Language:Python161
Qiskit/qiskit
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
Language: Python · 5k stars · 2.3k forks
asvspoof-challenge/asvspoof5
Language:Python272
nii-yamagishilab/SpeechSPC-mini
Speech Security and Privacy Compendium - Mini
Language:Python6
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language: Python · 19.6k stars · 3k forks
sony/diffiner
Language:Python574
junkunyuan/Awesome-Domain-Generalization
Awesome things about domain generalization, including papers, code, etc.
35339
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domain adaptation
5k stars · 866 forks
xieyuankun/Codecfake
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
Language:Python292
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
67448
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
57726
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language: Python · 1.8k stars · 187 forks
serverok/squid-proxy-installer
Install Squid Proxy on Ubuntu/Debian
Language:Shell189126
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Language:Jupyter Notebook45947
hungdinhxuan/MMS_TTS
Language:Jupyter Notebook2
chiphuyen/dmls-book
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
2.2k stars · 313 forks
sasv-challenge/SASV2_Baseline
SASV2 baseline, a track on ASVspoof5 phase2 challenge
Language:Python227
ControlNet/AV-Deepfake1M
[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
642
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Language: Python · 56.1k stars · 6.9k forks
TakHemlata/T-EER
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
Language:Python12
pytorch/android-demo-app
PyTorch android examples of usage in applications
Language: Java · 1.5k stars · 606 forks
OverLordGoldDragon/ssqueezepy
Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python
Language:Python62396
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
62942
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language: Python · 10.2k stars · 1.5k forks
wangyongjie-ntu/Awesome-explainable-AI
A collection of research materials on explainable AI/ML
1.4k stars · 187 forks
MWiechmann/enron_spam_data
The Enron-Spam dataset preprocessed in a single, clean csv file.
Language:Python324
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.5k stars · 489 forks
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python · 25.3k stars · 2.9k forks