Pinned Repositories
360monodepth
Code release for 360monodepth. With our framework we achieve monocular depth estimation for high resolution 360° images based on aligning and blending perspective depth maps.
3D-LLM
Preliminary Code for 3D-LLM: Injecting the 3D World into Large Language Models
AutoRAG
AutoML tool for RAG
ControlNet_AnimalPose
Adding a quadruped pose control model to ControlNet!
corenet
CoreNet: A library for training deep neural networks
insanely-fast-whisper-v3
Incredibly fast Whisper-large-v3
ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Visual-Tracking-Development
Visual Object Tracking
Paperwave's Repositories
paperwave/PARQ
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)
paperwave/MVDream
Multi-view Diffusion for 3D Generation
paperwave/RoleLLM-public
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models (WIP)
paperwave/streaming-llm
Efficient Streaming Language Models with Attention Sinks
paperwave/HyP-NeRF
Code Implementation for HyP-NeRF (WIP)
paperwave/gpt4all
gpt4all: open-source LLM chatbots that you can run anywhere
paperwave/q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
paperwave/SyncDiffusion
Official implementation of SyncDiffusion.
paperwave/ltu
Github Repo for Paper "Listen, Think, and Understand".
paperwave/AnimeInbet
Code and data for ICCV23 work "Deep Geometrized Cartoon Line Inbetweening"
paperwave/concept-graphs
Official code release for ConceptGraphs
paperwave/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
paperwave/fill
Generative fill in 3D.
paperwave/LivelySpeaker
[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation". (WIP)
paperwave/Tube-Link
Universal Video Segmentaion For VSS, VPS and VIS (ICCV-2023)
paperwave/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
paperwave/transformer_vq
paperwave/open_lm
A repository for research on medium sized language models.
paperwave/NeuralSurfaceField
NSF: Neural Surface Fields for Human Modeling from Monocular Depth
paperwave/PromptIR
PromptIR: Prompting for All-in-One Blind Image Restoration
paperwave/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
paperwave/CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
paperwave/shape-aware-text-driven-layered-video-editing-release
paperwave/samat
SAM Annotaton Tool
paperwave/app.enfugue.ai
ENFUGUE is a feature-rich Stable Diffusion web app for desktop or server
paperwave/fabric
https://arxiv.org/abs/2307.10159
paperwave/PGDiff
[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance (WIP)
paperwave/CelebBasis
Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'
paperwave/oven_eval
ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities
paperwave/GPT4RoI
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest