markkua
Doctoral student @ PRS, ETH Zurich. Machine learning, 3D Vision, Remote Sensing
ETH ZurichZurich
markkua's Stars
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
SysCV/shift-dev
SHIFT Dataset DevKit - CVPR2022
DepthAnything/Video-Depth-Anything
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Tencent/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
genmoai/mochi
The best OSS video generation models
lucidrains/local-attention
An implementation of local windowed attention for language modeling
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
prs-eth/Marigold-DC
Zero-Shot Monocular Depth Completion with Guided Diffusion
princeton-vl/DROID-SLAM
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
prs-eth/RollingDepth
Video Depth without Video Models
prs-eth/ukraine-damage-mapping-tool
An Open-Source Tool for Mapping War Destruction at Scale in Ukraine using Sentinel-1 Time Series
PyAV-Org/PyAV
Pythonic bindings for FFmpeg's libraries.
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
SunYangtian/Splatter_A_Video
[NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"
y-zheng18/point_odyssey
Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)
apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
naver/mast3r
Grounding Image Matching in 3D with MASt3R
VisualComputingInstitute/diffusion-e2e-ft
[WACV 2025] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
facebookresearch/dynamic_stereo
[CVPR 2023] DynamicStereo: Consistent Dynamic Depth from Stereo Videos.
jimmycv07/DiffIR2VR-Zero
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
HKBU-HPML/IRS
IRS: A Large Synthetic Indoor Robotics Stereo Dataset for Disparity and Surface Normal Estimation
osmr/propainter
Streaming ProPainter
Chen-Rao/VD-Diff
Official implementation of Paper "Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model" (ECCV 2024)
Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
Vinayak-VG/GAURA
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
basilevh/gcd
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation