Pinned Repositories
1D-Speech-Emotion-Recognition
Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM
An-End-to-End-Multitask-Model-to-Improve-Speech-Emotion-Recognition
Code for the ICASSP conference paper.
bird_audio_detection_challenge
DenseNets for the detection of singing birds in audio files
EmotiW2018_Group-level_Emotion_Recognition
Code for Group-Level Emotion Recognition Using Hybrid Deep Models Based on Faces, Scenes, Skeletons and Visual Attentions
GM-TCNet
Accepted by Journal of Speech Communication (CCF-B).
Human-Activity-Recognition
Multimodal human activity recognition using wrist-worn wearable sensors.
LOUPE_Keras
A Keras rewrite of the LOUPE library (https://github.com/antoine77340/LOUPE). Covers many learnable pooling and differentiable aggregation methods (NetVLAD, NetRVLAD, SoftDBoW, NetFV, CG).
Speech-Depression-Detection
Speech-Emotion-Recognition-2
Speech emotion recognition using LSTM, SVM and MLP
tf-openpose
OpenPose from CMU implemented in TensorFlow with a custom architecture for fast inference.
aascode's Repositories
aascode/Attentions_for_speech_classification
Code for the paper 'Attentions for short duration speech classification'.
aascode/awesome-multimodal-deception-detection
A reading list for research topics in multimodal deception detection.
aascode/BERT-Text-Analysis
Text Analysis done on a business text dataset using KeyBERT and BERTopic
aascode/COVID-19-Cough-Detection
An SVM-based model which can recognize COVID coughs using short audio cough recordings. An attempt at the INTERSPEECH 2021 Computational Paralinguistics Sub-Challenge.
aascode/DAIC-WoZ-Summarization
aascode/dana
DANA: Dimension-Adaptive Neural Architecture (UbiComp'21) (ACM IMWUT)
aascode/DCASE2021-Task1
Code related to DCASE2021 Task 1 - Acoustic Scene Classification
aascode/dcase2021_umaps
Repository for our paper: "USING UMAP TO INSPECT AUDIO DATA FOR UNSUPERVISED ANOMALY DETECTION UNDER DOMAIN-SHIFT CONDITIONS" (Fernandez, Plumbley 2021)
aascode/Emotion_Recognition_with_Wav2Vec
aascode/FT-w2v2-ser
aascode/FunMatch-Distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
aascode/ICASSP2022
aascode/Identifying_Actions_for_Sound_Event_Classification
aascode/KD-SVD
[INTERSPEECH 2021] Official Keras Implementation of "Knowledge Distillation for Singing Voice Detection"
aascode/Meta-Learning-for-EEG-Classification-in-Schizophrenia
Notebooks and pre-processing code for a meta learning paper/project involving the classification of EEG spectrograms.
aascode/motion-sense
MotionSense Dataset for Human Activity and Attribute Recognition (time-series data generated by smartphone sensors: accelerometer and gyroscope) (PMC Journal) (IoTDI'19)
aascode/multilingual_speech_valence_classification_datasets
Multilingual datasets with raw audio for speech emotion recognition
aascode/Natural-Language-Processing-with-Reddit-Post
A text classification project using Random Forest, bidirectional LSTM and TensorFlow transfer learning. Compares model differences between tokenization and word embedding.
aascode/reddit-sentiment-and-stock-volatility
Reddit Sentiment Analysis of r/Wallstreetbets/$AMC and Bollinger Bands Technical Indicator Correlation
aascode/reddit_corpora
Collection of NLP corpora based on Reddit comments
aascode/SER-wav2vec
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
aascode/ser-with-w2v2
aascode/SIIM-FISABIO-RSNA-COVID-19-Detection
7th place solution
aascode/soxan
Wav2Vec for speech recognition, classification, and audio classification
aascode/Spider-Monkey-Whinny-Detection
Code to replicate the experiments of the Interspeech 2021 paper "Multi-Attentive Detection of the Spider Monkey Whinny in the (Actual) Wild".
aascode/stgcn_parkinsonism_prediction
Code for predicting clinical scores of parkinsonism (UPDRS-gait/SAS-gait) from skeleton trajectory data.
aascode/StrengthNet
StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis
aascode/Topic-Modelling-YouTube-Comments
This notebook attempts to do topic modelling on Cyberpunk 2077 YouTube videos using the LDA method.
aascode/wav2vec2-large-xlsr-53-th
Fine-tune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0
aascode/wav2vec_finetune