lkhl's Stars
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
SaFoLab-WISC/AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
mikeqzy/3dgs-avatar-release
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting
Sherrylone/PQDiff
[ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.org/abs/2401.15652
clownrat6/OpenVIS
Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
HowardLi1984/ECDFormer
The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"
cwchenwang/awesome-3d-diffusion
A collection of papers on diffusion models for 3D generation.
CuriseJia/FreeStyleRet
Precision Search through Multi-Style Inputs
VIStA-H/GPT-4V_Social_Media
GPT-4V(ision) as A Social Media Analysis Engine
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
AILab-CVC/HiFi-123
[ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation
jpthu17/jpthu17.github.io
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
nerfies/nerfies.github.io
cvlab-columbia/zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
eladrich/latent-nerf
Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
rain305f/OSP
[CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning
lkhl/MIS
[ICCV 2023] Implementation of the paper “Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation”
nebuly-ai/optimate
A collection of libraries to optimise AI model performances
Algy/fast-slic
20x Real-time superpixel SLIC Implementation with CPU
ma-xu/Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
yanmin-wu/EDA
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
clownrat6/VectorNet
The implementation of VectorNet. Done and Lose
jpthu17/HBI
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
microsoft/X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language