lkhl

lkhl's Stars

Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language:Python56517
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python71948
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python · 1.3k stars · 124 forks
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Language: Python · 11.2k stars · 1k forks
SaFoLab-WISC/AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
Language:Python34
mikeqzy/3dgs-avatar-release
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting
Language:Python31430
Sherrylone/PQDiff
[ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.org/abs/2401.15652
Language:Python651
clownrat6/OpenVIS
Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
Language:Python15
HowardLi1984/ECDFormer
The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"
Language:Python28
cwchenwang/awesome-3d-diffusion
A collection of papers on diffusion models for 3D generation.
75930
CuriseJia/FreeStyleRet
Precision Search through Multi-Style Inputs
Language:Python456
VIStA-H/GPT-4V_Social_Media
GPT-4V(ision) as A Social Media Analysis Engine
302
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Language:Python75441
AILab-CVC/HiFi-123
[ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Language:Python57
jpthu17/jpthu17.github.io
Language:CSS4
binary-husky/gpt_academic
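Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.
Language: Python · 64k stars · 7.9k forks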
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language: Python · 32.2k stars · 2.4k forks
nerfies/nerfies.github.io
Language: JavaScript · 2.3k stars · 813 forks
cvlab-columbia/zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Language: Python · 2.6k stars · 191 forks
eladrich/latent-nerf
Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"
Language:Python69249
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Language: Python · 8.2k stars · 721 forks
rain305f/OSP
[CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning
Language:Python212
lkhl/MIS
[ICCV 2023] Implementation of the paper “Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation”
Language:C++71
nebuly-ai/optimate
A collection of libraries to optimise AI model performance
Language: Python · 8.4k stars · 643 forks
Algy/fast-slic
20x Real-time superpixel SLIC Implementation with CPU
Language:C++26134
ma-xu/Context-Cluster
[ICLR 2023 Oral] Image as Set of Points
Language:Python53440
yanmin-wu/EDA
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Language:Python1014
clownrat6/VectorNet
The implementation of VectorNet. Done and Lose
Language:Python417
jpthu17/HBI
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
Language:Python1034
microsoft/X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
Language: Python · 1.3k stars · 132 forks