aroslanov

CG/VFX generalist, AI enthusiast

aroslanov.com
Los Angeles, CA, USA

aroslanov's Stars

LuminanceHDR/LuminanceHDR
A complete workflow for HDR imaging
Language:C++615104
fallenshock/FlowEdit
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
Language:Python27611
TencentARC/StereoCrafter
A framework to convert any 2D videos to immersive stereoscopic 3D
Language:Python505
LucipherDev/ComfyUI-AniDoc
ComfyUI Custom Nodes for "AniDoc: Animation Creation Made Easier". This approach automates line art video colorization using a novel model that aligns color information from references, ensures temporal consistency, and reduces manual effort in animation production.
Language:Python252
akatz-ai/ComfyUI-Environment-Manager
A Pinokio application to manage ComfyUI environments.
Language:Python331
nerlfield/iss-urine-tank-monitor-bot
Language:Python3
browser-use/browser-use
Make websites accessible for AI agents
Language: Python 7.5k 547
iamxym/Deep-Fourier-based-Arbitrary-scale-Super-resolution-for-Real-time-Rendering
SIGGRAPH 2024 Conference Paper: Deep Fourier-based Arbitrary-scale Super-resolution for Real-time Rendering
Language:Python1169
colmap/colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
Language: C++ 8k 1.6k
DS4SD/docling
Get your documents ready for gen AI
Language: Python 16.8k 870
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Language: Python 20.3k 1.5k
Helicone/helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Language: TypeScript 2.9k 287
janreges/siteone-crawler
SiteOne Crawler is a cross-platform website crawler and analyzer for SEO, security, accessibility, and performance optimization—ideal for developers, DevOps, QA engineers, and consultants. Supports Windows, macOS, and Linux (x64 and arm64).
Language:PHP37320
janreges/siteone-crawler-gui
SiteOne Crawler GUI is a cross-platform website crawler and analyzer for SEO, security, accessibility, and performance optimization—ideal for developers, DevOps, QA engineers, and consultants. Supports Windows, macOS, and Linux (x64 and arm64).
Language:Svelte1319
google-gemini/cookbook
Examples and guides for using the Gemini API
Language: Jupyter Notebook 9.9k 1.1k
snap-research/InstantRestore
Official Implementation for "InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention"
842
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language: Python 5.1k 741
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
1.4k 71
kijai/ComfyUI-MMAudio
Language:Python1809
arifyaman/Face-Depth-Frame-Mancer
Face Depth Frame Mancer Documentation
101
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
Language: Python 10k 1k
hkchengrex/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Language:Python75171
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
Language:Python99244
souzatharsis/podcastfy
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Language: Python 2.2k 260
Automattic/harper
The Grammar Checker for Developers
Language: Rust 2.5k 53
DroneSplat/anonymous_code
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
Language:Python497
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Language: Python 5.6k 345
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Language: TypeScript 16.7k 1.4k
jiah-cloud/Align3R
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Language:Python28812
yformer/EfficientTAM
Efficient Track Anything
Language:Python40711
