Pine-sha

Pine-sha's Stars

rinongal/textual_inversion
Language:Jupyter Notebook2.9k281
zju-vipa/awesome-neural-trees
Introduction, selected papers and possible corresponding codes in our review paper "A Survey of Neural Trees"
798
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.4k2.2k
avisingh599/imitation-dagger
[Reimplementation Ross et al 2011] An implementation of DAGGER using ConvNets for driving from pixels.
Language:Python7320
montrealrobotics/active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
Language:Python9618
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Language:Python1k74
FrozenBurning/SceneDreamer
[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
Language:Python61640
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python3.8k672
lucidsim/lucidsim
Official Repo for the paper "Learning Visual Parkour from Generated Images" (CoRL 2024).
Language:Python888
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language:Python4.7k460
UX-Decoder/DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
Language:Python41119
Oswald522/ams-thesis
院使用的Latex论文模板
Language:TeX41
IDEA-Research/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Language:Python2.3k149
autonomousvision/gaussian-opacity-fields
[SIGGRAPH Asia'24 & TOG] Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes
Language:Python76845
hugobl1/ray_gauss
RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis
Language:C++623
THU-luvision/OmniSeg3D
Segment Everything All at Once
Language:Python1114
VAST-AI-Research/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
Language:Python78252
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.2k478
hrz2000/CustomNeRF
[CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Language:Python351
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Language:Python8.3k735
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language:Python1.9k139
google/dreambooth
87279
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
31714
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python26.4k5.4k
wkentaro/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Language:Python13.6k3.4k
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Language:Python9.5k2.2k
OpenRobotLab/PointLLM
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
Language:Python66132
lzhnb/Analytic-Splatting
[ECCV 2024 - Oral] Analytic-Splatting Anti-Aliased 3D Gaussian Splatting via Analytic Integration
Language:Python1272
nv-tlabs/lift-splat-shoot
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
Language:Python1.1k221
tobiasfshr/map4d
Photo-realistic mapping of dynamic urban areas
Language:Python2338