Cb1ock

student of Hefei University of Technology @HFUT

Hefei University of Technology

Cb1ock's Stars

YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language: Jupyter Notebook; stars: 1.2k; forks: 220
YuanGongND/cav-mae
Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".
Language:Python24223
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language: Python; stars: 1.3k; forks: 90
MSA-LMC/S2D
[TAFFC 2024] The official implementation of paper: From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos
Language:Python531
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python; stars: 23.7k; forks: 5.4k
facebookresearch/MovieGenBench
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
34419
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python; stars: 1.5k; forks: 92
HFUTTUG/HFUT-Beamer
Collection of Beamer themes for Hefei University of Technology
Language:TeX173
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language: Python; stars: 4.7k; forks: 586
GenjiB/AVSiam
Siamese Vision Transformers are Scalable Audio-visual Learners
Language:Python81
Xuemantou/R3nzSkin-For-China-Server
Skin changer for League of Legends (LOL)
Language:C++18913
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language:Python14215
nttcslab/msm-mae
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Language:Jupyter Notebook898
facebookresearch/mae_st
Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"
Language:Python32334
stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language:Python252
52CV/CVPR-2024-Papers
87848
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language: Python; stars: 1.5k; forks: 111
zeroQiaoba/MERTools
Toolkits for Multimodal Emotion Recognition
Language:Python16815
MuiseDestiny/zotero-reference
PDF references add-on for Zotero.
Language: JavaScript; stars: 2.2k; forks: 59
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Stars: 7.5k; forks: 920
ad-m/github-push-action
GitHub actions to push back to repository eg. updated code
Language: JavaScript; stars: 1.2k; forks: 231
ZhuoYulang/CIF-MMIN
Language:Python223
facebookresearch/MAViL
The repo host the code and model of MAViL.
421
xai-org/grok-1
Grok open release
Language: Python; stars: 49.7k; forks: 8.3k
sunlicai/MAE-DFER
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023)
Language:Python9515
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python; stars: 30.7k; forks: 6.4k
facebookresearch/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language:Python55346
TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Language: MATLAB; stars: 7k; forks: 1.9k
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language: Jupyter Notebook; stars: 65.7k; forks: 33.7k
microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
Language: Jupyter Notebook; stars: 35.1k; forks: 6k
