aroslanov's Stars
arifyaman/Face-Depth-Frame-Mancer
Face Depth Frame Mancer Documentation
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
hkchengrex/MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
souzatharsis/podcastfy
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Automattic/harper
The Grammar Checker for Developers
DroneSplat/anonymous_code
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
jiah-cloud/Align3R
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
yformer/EfficientTAM
Efficient Track Anything
2noise/ChatTTS
A generative speech model for daily dialogue.
huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
takemetosiberia/ComfyUI-SAMURAI--SAM2-
This is my version of nodes based on the SAMURAI project. The project is made for entertainment purposes; I will not be engaged in further development or improvement. It is based on the official implementation of SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory
steel-dev/steel-browser
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
microsoft/TinyTroupe
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
ArthurBrussee/brush
3D Reconstruction for all
cgwire/kitsu
Collaboration Platform for Animation and VFX Productions
simonw/files-to-prompt
Concatenate a directory full of files into a single prompt for use with LLMs
logtd/ComfyUI-LTXTricks
A set of ComfyUI nodes providing additional control for the LTX Video model
akatz-ai/ComfyUI-DepthCrafter-Nodes
A port of tencent/DepthCrafter into ComfyUI
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
3DTopia/MaterialAnything
Material Anything: Generating Materials for Any 3D Object via Diffusion
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
modelcontextprotocol/servers
Model Context Protocol Servers
nftblackmagic/catvton-flux
kaibioinfo/ComfyUI_AdvancedRefluxControl
robertvoy/ComfyUI-Flux-Continuum
A modular workflow for FLUX inside of ComfyUI that brings order to the chaos of image generation pipelines.
yangchris11/samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"