Pinned Repositories
2D-and-3D-face-alignment
This repository implements a demo of the networks described in the paper "How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)".
3D-Caffe
3DDFA
An improved PyTorch re-implementation of the TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
3DGS_and_Beyond_Docs
This is a collective repository for all 3DGS-related progress in research and industry.
hackaway
books, projects and memos
pinglmlcv's Repositories
pinglmlcv/ALIA
Augmenting with Language-guided Image Augmentation
pinglmlcv/Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
pinglmlcv/Awesome-Colorful-LLM
Learn the colorful world (Vision/Speech/Robotic) from LLM
pinglmlcv/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
pinglmlcv/baadd
Code for Backdoor Attacks Against Dataset Distillation
pinglmlcv/BlackVIP
Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"
pinglmlcv/CLCAE
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
pinglmlcv/Clip2Protect
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
pinglmlcv/CUDA_LTR
Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight)
pinglmlcv/custom-diffusion
custom^2 diffusion
pinglmlcv/DeltaEdit
pinglmlcv/FedDG-GA
pinglmlcv/GatedPromptTuning
pinglmlcv/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
pinglmlcv/google-research
Google Research
pinglmlcv/IDM
pinglmlcv/ILM-VP
[CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zhang, and Sijia Liu
pinglmlcv/LADS
Official Implementation of LADS (Latent Augmentation using Domain descriptionS)
pinglmlcv/LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
pinglmlcv/Megatron-LM
Ongoing research training transformer models at scale
pinglmlcv/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
pinglmlcv/PLOT
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
pinglmlcv/Point-In-Context
Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding
pinglmlcv/Prompt-Diffusion
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
pinglmlcv/ScalableGANFingerprints
The official TensorFlow implementation for ICLR'22 Spotlight paper 'Responsible Disclosure of Generative Models Using Scalable Fingerprinting'
pinglmlcv/SRe2L
Large-scale Dataset Distillation/Condensation: 50 IPC (Images Per Class) achieves the highest accuracy, 60.8%, on the original ImageNet-1K val set.
pinglmlcv/stable-diffusion
A latent text-to-image diffusion model
pinglmlcv/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
pinglmlcv/TalkLip
pinglmlcv/Voice2Series-Reprogramming
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification