Pinned Repositories
algorithms
Awesome-PoseEstimation
This a collecttion of papers for pose estimation.
BAM-CBAM-pytorch
Pytorch implementation of BAM("BAM: Bottleneck Attention Module", BMVC18) and CBAM(“CBAM: Convolutional Block Attention Module”, ECCV18)
GAN-pytorch
Pytorch implementation of GAN("Generative Adversarial Nets", NIPS2014)
ImageClassification-pytorch
This code purpose to evaluate of popular model architectures, such as ResNet, VGG on the ImageNet dataset.
MaskDINO
Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
SENet-pytorch
Pytorch implementation of SENet("Squeeze-and-Excitation Networks", cvpr18)
HOI-Learning-List
A list of Human-Object Interaction Learning.
yolov7_d2
🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
asdf2kr's Repositories
asdf2kr/AnimateDiff
Forked version of AnimateDiff, attempts to add init images. If you are look into original repo, please go to https://github.com/guoyww/animatediff/
asdf2kr/AnimateDiff-MotionDirector
MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI.
asdf2kr/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
asdf2kr/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
asdf2kr/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
asdf2kr/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
asdf2kr/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
asdf2kr/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
asdf2kr/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
asdf2kr/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
asdf2kr/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
asdf2kr/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
asdf2kr/generative-models
Generative Models by Stability AI
asdf2kr/H-Deformable-DETR
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
asdf2kr/HOI-Learning-List
A list of Human-Object Interaction Learning.
asdf2kr/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
asdf2kr/LOCATE
LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)
asdf2kr/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
asdf2kr/open-llms
📋 A list of open LLMs available for commercial use.
asdf2kr/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
asdf2kr/roop
one-click face swap
asdf2kr/sd-scripts
asdf2kr/tdw
ThreeDWorld simulation environment
asdf2kr/Tensorrt-Deformable-Detr
Tensorrt-Deformable-Detr
asdf2kr/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
asdf2kr/TypiClust
Active Learning on a Budget - Opposite Strategies Suit High and Low Budgets
asdf2kr/video-generation-survey
A reading list of video generation
asdf2kr/VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
asdf2kr/visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
asdf2kr/VLM_survey
Vision-Language Models for Vision Tasks: A Survey