catherine-qian's Stars
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
google/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
ShusenTang/Dive-into-DL-PyTorch
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
cmusatyalab/openface
Face recognition with deep neural networks.
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Vay-keen/Machine-learning-learning-notes
周志华《机器学习》又称西瓜书是一本较为全面的书籍,书中详细介绍了机器学习领域不同类型的算法(例如:监督学习、无监督学习、半监督学习、强化学习、集成降维、特征选择等),记录了本人在学习过程中的理解思路与扩展知识点,希望对新人阅读西瓜书有所帮助!
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
foolwood/benchmark_results
Visual Tracking Paper List
hunkim/PyTorchZeroToAll
Simple PyTorch Tutorials Zero to ALL!
citation-style-language/styles
Official repository for Citation Style Language (CSL) citation styles.
xingyizhou/CenterTrack
Simultaneous object detection and tracking using center points.
natanielruiz/deep-head-pose
:fire::fire: Deep Learning Head Pose Estimation using PyTorch.
nsoojin/coursera-ml-py
Python programming assignments for Machine Learning by Prof. Andrew Ng in Coursera
yinguobing/head-pose-estimation
Realtime human head pose estimation with ONNXRuntime and OpenCV.
introlab/odas
ODAS: Open embeddeD Audition System
CMU-Perceptual-Computing-Lab/MonocularTotalCapture
Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"
qiexing/face-landmark-localization
cnn network predict face landmarks (68 points) and head pose (3d pose, yaw,roll,pitch).
sharathadavanne/seld-net
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
tobran/DF-GAN
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
Ha0Tang/XingGAN
[ECCV 2020] XingGAN for Person Image Generation
YapengTian/AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
zhangqianhui/GazeAnimation
Give a portrait face, move the gaze up (ACM MM 2020)
Ha0Tang/BiGraphGAN
[BMVC 2020 Oral] Bipartite Graph Reasoning GANs for Person Image Generation
georgesterpu/avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
fandulu/MPLT
Multi-person 3D panoramic localization tracking
orcc/orcc
Open RVC-CAL Compiler
hunterhawk/Attention-on-Audio
Attention Mechanism with DNN on Audio Classification