Pinned Repositories
EndoGaussian
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction
Endora
Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)
EndoSparse
[MICCAI 2024] EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
U-KAN
[ArXiv' 24] U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
GCN-DE
Pytorch implementation of our paper accepted by CBM2021 -- Few-shot medical image segmentation using a global correlation network with discriminative embedding
StegaNeRF
Official Pytorch implementation of "StegaNeRF: Embedding Invisible Information within Neueral Radiance Fields", ICCV2023
Vessel-Seg
Pytorch implementation of our paper accepted by NCA2021 -- Hierarchical Deep Network with Uncertainty aware Semi-supervised Learning for Vesse Segmentation
XGGNet's Repositories
XGGNet/StegaNeRF
Official Pytorch implementation of "StegaNeRF: Embedding Invisible Information within Neueral Radiance Fields", ICCV2023
XGGNet/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
XGGNet/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
XGGNet/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs
XGGNet/llama-dl
High-speed download of LLaMA, Facebook's 65B parameter GPT model
XGGNet/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
XGGNet/XGGNet.github.io
XGGNet/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
XGGNet/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
XGGNet/Awesome-Dataset-Distillation
Awesome Dataset Distillation Papers
XGGNet/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
XGGNet/CF-ViT
Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"
XGGNet/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
XGGNet/Endo-FM
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
XGGNet/Endo-FM-1
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
XGGNet/fastcomposer
XGGNet/generative-ai-roadmap
生成式AI的应用路线图 The roadmap of generative AI: use cases and applications
XGGNet/generative-models
Generative Models by Stability AI
XGGNet/Latte
Latte: Latent Diffusion Transformer for Video Generation.
XGGNet/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
XGGNet/LightGaussian
"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
XGGNet/Multi-Modality-Arena
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
XGGNet/PhysGaussian
[CVPR 2024] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
XGGNet/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
XGGNet/SOMA
[ICCV' 23 ORAL] Novel Scenes & Classes: Towards Adaptive Open-set Object Detection
XGGNet/Source-Free-Domain-Generalization
An open-world scenario domain generalization code base
XGGNet/Test
XGGNet/ULIP
XGGNet/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
XGGNet/XGGNet