Pinned Repositories
3dgan-release
3D Generative Adversarial Network
acnn_speaker_recog
acnn for text-independent speaker recognition
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AnimeGAN
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to transform real-world photos into anime images.
AnimeGANv2
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
asv-subtools
An Open Source Tools for Speaker Recognition
AWESOME-FER
Top conferences & Journals focused on Facial expression recognition (FER)/ Facial action unit (FAU)
DCASE2022-TASK3
2022 dcase task3 rank 2
deep-head-pose
:fire::fire: Deep Learning Head Pose Estimation using PyTorch.
dgl-ke
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
dawenxi-only's Repositories
dawenxi-only/acnn_speaker_recog
acnn for text-independent speaker recognition
dawenxi-only/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
dawenxi-only/AnimeGAN
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to transform real-world photos into anime images.
dawenxi-only/AnimeGANv2
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
dawenxi-only/asv-subtools
An Open Source Tools for Speaker Recognition
dawenxi-only/AWESOME-FER
Top conferences & Journals focused on Facial expression recognition (FER)/ Facial action unit (FAU)
dawenxi-only/DCASE2022-TASK3
2022 dcase task3 rank 2
dawenxi-only/deep-head-pose
:fire::fire: Deep Learning Head Pose Estimation using PyTorch.
dawenxi-only/dgl-ke
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
dawenxi-only/DGL_Chinese_Manual
This is the Chinese manual of the graph neural network library DGL, currently contains the User Guide.
dawenxi-only/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
dawenxi-only/ED-MTT
A novel Engagement Detection with Multi-Task Training (ED-MTT) system which minimizes MSE and triplet loss together to determine the engagement level of students in an e-learning environment.
dawenxi-only/Engagement_Detection_OpenFace_Bi-LSTM
dawenxi-only/GazeTracking
👀 Eye Tracking library easily implementable to your projects
dawenxi-only/head-pose-estimation
Head pose estimation by TensorFlow and OpenCV
dawenxi-only/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
dawenxi-only/Keras-GAN
Keras implementations of Generative Adversarial Networks.
dawenxi-only/Make-A-Protagonist
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
dawenxi-only/nft-mix
dawenxi-only/OpenKE
An Open-Source Package for Knowledge Embedding (KE)
dawenxi-only/OpenNE
An Open-Source Package for Network Embedding (NE)
dawenxi-only/pb_sed
Paderborn Sound Event Detection
dawenxi-only/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
dawenxi-only/RawGAT-ST-antispoofing
This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.org/abs/2107.12710) published in the ASVspoof 2021 workshop.
dawenxi-only/speechbrain
A PyTorch-based Speech Toolkit
dawenxi-only/svox2
Plenoxels: Radiance Fields without Neural Networks, Code release WIP
dawenxi-only/TCAE
Self-supervised Representation Learning from Videos for Facial Action Unit Detection
dawenxi-only/UniRepLKNet
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
dawenxi-only/VON
[NeurIPS 2018] Visual Object Networks: Image Generation with Disentangled 3D Representation.
dawenxi-only/voxceleb_trainer
In defence of metric learning for speaker recognition