Pinned Repositories
ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Agent-Attention
Official repository of Agent Attention (ECCV2024)
ai_yanxishe
AI研习社
albert-chinese-large-webqa
基于百度webqa与dureader数据集训练的Albert Large QA模型
ALIKE
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction
AnimateAnyone_unofficial
Unofficial implementation of Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
RepVGG-Tensorflow-2
subchar-transformers
This repository holds the source codes for training and fine-tuning Chinese pre-trained transformers which are aware of Chinese sub-character features and are optimized with tokenization. In addition, sememe features are added to enhancing knowledge representations.
guome's Repositories
guome/Agent-Attention
Official repository of Agent Attention (ECCV2024)
guome/AnimateAnyone_unofficial
Unofficial implementation of Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
guome/AnomalyGPT
The first LVLM based IAD method!
guome/BGAD
Pytorch Implementation for CVPR2023 paper: Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection
guome/DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
guome/DiffusionAD
guome/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
guome/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
guome/EfficientAD
Unofficial implementation of EfficientAD https://arxiv.org/abs/2303.14535
guome/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
guome/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
guome/haystack
:mag: Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more.
guome/LightCDNet
LightCDNet: Lightweight Change Detection Network Based on VHR Images
guome/Mamba-YOLO
the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”
guome/MemSeg
Unofficial re-implementation of MemSeg for Anomaly Detection
guome/MI-GAN
[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices
guome/MiM-ISTD
Official pytorch code of our paper "MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection"
guome/ml-destseg
guome/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
guome/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
guome/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
guome/SCTNet
Official implementation of SCTNet (AAAI2024)
guome/SimpleNet
guome/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
guome/U-Mamba
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation
guome/UniAD
[NeurIPS 2022 Spotlight] A Unified Model for Multi-class Anomaly Detection
guome/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
guome/Vim
guome/workflow-core
Lightweight workflow engine for .NET Standard
guome/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information