Pinned Repositories
360monodepth
Code release for 360monodepth. With our framework we achieve monocular depth estimation for high resolution 360° images based on aligning and blending perspective depth maps.
3D-LLM
Preliminary Code for 3D-LLM: Injecting the 3D World into Large Language Models
AutoRAG
AutoML tool for RAG
ControlNet_AnimalPose
Adding a quadruped pose control model to ControlNet!
corenet
CoreNet: A library for training deep neural networks
insanely-fast-whisper-v3
Incredibly fast Whisper-large-v3
ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Visual-Tracking-Development
Visual Object Tracking
Paperwave's Repositories
paperwave/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
paperwave/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
paperwave/chameleon
Repository for Meta Chameleon a mixed-modal early-fusion foundation model from FAIR.
paperwave/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
paperwave/clothedreamer
paperwave/ComfyUI-MimicMotionWrapper
paperwave/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
paperwave/DiffIR2VR-Zero
paperwave/DiffSynth-Studio
Enjoy the magic of Diffusion models!
paperwave/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
paperwave/hallo-x
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
paperwave/LLaVA-NeXT
paperwave/llm-comparator
LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.
paperwave/maestro
A framework for Claude Opus to intelligently orchestrate subagents.
paperwave/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
paperwave/MotionBooth
The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
paperwave/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
paperwave/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
paperwave/pipecat
Open Source framework for voice and multimodal conversational AI
paperwave/PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型,可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成,只需要一个模型
paperwave/prompt-api
A proposal for a web API for prompting browser-provided language models
paperwave/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
paperwave/sd-scripts
paperwave/search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
paperwave/subcloning
implementation of https://arxiv.org/pdf/2312.09299
paperwave/unet.cu
UNet diffusion model in pure CUDA
paperwave/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
paperwave/Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
paperwave/VividPose
Official code for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation.
paperwave/yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.