lsh-research
Student at the College of Information Science and Engineering, Xinjiang University, China
@Xinjiang University
Pinned Repositories
attention-target-detection
[CVPR2020] "Detecting Attended Visual Targets in Video"
EPSANet
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
3D-Speaker
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
ACA-Net
Pytorch Implementation of ACA-Net for Speaker Verification
AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Agent-Attention
Official repository of Agent Attention
Agglomerative-Hierarchical-Clustering-from-scratch
Build an agglomerative hierarchical clustering algorithm from scratch, i.e. WITHOUT any advanced libraries such as Numpy, Pandas, Scikit-learn, etc.
AK-DE-biGRU
Improving Response Selection in Multi-turn Dialogue Systems by Incorporating Domain Knowledge
lsh-research's Repositories
lsh-research/MFA-EBranchformer
lsh-research/ReDimNet
The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
lsh-research/DSCNet
PyTorch implementation of Dynamic Snake Convolution (ICCV2023)
lsh-research/UniRepLKNet
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
lsh-research/Agent-Attention
Official repository of Agent Attention
lsh-research/objectdetection_script
Code for some improvement ideas for object detection scripts; see readme.md for details
lsh-research/EMA-attention-module
Implementation code for the ICASSP 2023 paper "Efficient Multi-Scale Attention Module with Cross-Spatial Learning", available at: https://arxiv.org/abs/2305.13563v2
lsh-research/wandb
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
lsh-research/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
lsh-research/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
lsh-research/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
lsh-research/lightning
Deep learning framework to train, deploy, and ship AI products Lightning fast.
lsh-research/3D-Speaker
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
lsh-research/ACA-Net
Pytorch Implementation of ACA-Net for Speaker Verification
lsh-research/ODConv
The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).
lsh-research/DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
lsh-research/DS-TDNN
Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
lsh-research/NeMo
NeMo: a toolkit for conversational AI
lsh-research/speechbrain
A PyTorch-based Speech Toolkit
lsh-research/PEL4VAD
Official code for "Learning Prompt-Enhanced Context Features for Weakly-Supervised Video Anomaly Detection"
lsh-research/cluster-analysis
K-Means++ (HCM), Fuzzy C-Means (FCM), Hierarchical Clustering, DBSCAN
lsh-research/FunASR
A Fundamental End-to-End Speech Recognition Toolkit
lsh-research/CBAM.PyTorch
Unofficial implementation of the paper "CBAM: Convolutional Block Attention Module"
lsh-research/wespeaker
Research and Production Oriented Speaker Recognition Toolkit
lsh-research/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
lsh-research/SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
lsh-research/Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
lsh-research/AV-Sepformer
lsh-research/EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
lsh-research/EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization