Pinned Repositories
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
avatarface_implement
Face Swapping
character-mining
Mining individual characters in multiparty dialogue
controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
CycleTransGAN-EVC
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
CZ-HP
dataset_medical
医学影像数据集列表 『An Index for Medical Imaging Datasets』
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
DemoPage-C-CycleTransGAN-VoiceConversion
FaceSwapping
Face swapping function with Paper: Motion Representations for Articulated Animation
CZ26's Repositories
CZ26/CycleTransGAN-EVC
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
CZ26/FaceSwapping
Face swapping function with Paper: Motion Representations for Articulated Animation
CZ26/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
CZ26/avatarface_implement
Face Swapping
CZ26/character-mining
Mining individual characters in multiparty dialogue
CZ26/CZ-HP
CZ26/dataset_medical
医学影像数据集列表 『An Index for Medical Imaging Datasets』
CZ26/DemoPage-C-CycleTransGAN-VoiceConversion
CZ26/DemoPage-CycleTransGAN-EmotionalSpeechConversion
CZ26/Depression_FAU-guided
Depression_FAU-guided
CZ26/EmoLLM
心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral
CZ26/facial-expression-analysis
Dimensional estimation of emotions (Arousal, Valence, Intensity) from facial landmarks extracted by DLIB.
CZ26/facial-landmark-frontalization
Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library
CZ26/HierarchicalFusionMER
CZ26/himallgg
himallgg
CZ26/icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
CZ26/ICE-Talk
Interface for Controllable Expressive Talking Machine
CZ26/Learning-Graph-Representation-of-Person-specific-Cognitive-Processes-from-Audio-visual-Behaviours-fo
CZ26/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
CZ26/phonemizer
Simple text to phones converter for multiple languages
CZ26/PythonPark
Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。
CZ26/remote-opencv-streaming-live-video
A remote live video streaming connection with Flask
CZ26/seq2seq-EVC
CZ26/SKAIG-ERC
The code for "Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Commonsense Knowledge" plus the code of models in "A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversations"
CZ26/statisticbooks
CZ26/Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
CZ26/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
CZ26/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
CZ26/w2v2-vad
A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition
CZ26/XrayGLM
🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.