fritzj6's Stars
directus/directus
The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, auth & more.
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
laudspeaker/laudspeaker
📢 Laudspeaker is an Open Source Customer Engagement and Product Onboarding Platform. Open Source alternative to Braze / OneSignal / Customer.io / Appcues / Pendo. Use Laudspeaker to design product onboarding flows and send product- and event-triggered emails, SMS, push, and more.
Ai00-X/ai00_server
The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
sdbds/ToonCrafter-for-windows
2noise/ChatTTS
A generative speech model for daily dialogue.
ButzYung/SystemAnimatorOnline
XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online
darkroomengineering/lenis
How smooth scroll should be
quarylabs/quary
Open-source BI for engineers
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
version-fox/vfox
A cross-platform and extendable version manager with support for Java, Node.js, Flutter, .NET & more
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
ExponentialML/ComfyUI_ELLA
ComfyUI Implementation of ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
BladeTransformerLLC/gauzilla
Gauzilla: a 3D Gaussian Splatting renderer written in Rust for WebAssembly with lock-free multithreading
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation"
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
SPRIGHT-T2I/SPRIGHT
[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"
LGUG2Z/komorebi
A tiling window manager for Windows 🍉
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
wandb/openui
OpenUI lets you describe UI using your imagination, then see it rendered live.
dashpresshq/dashpress
Generate powerful admin apps without writing a single line of code - Run `npx dashpress` to see some magic!
banodoco/Steerable-Motion
A ComfyUI node for driving videos using batches of images.
banodoco/Dough
Dough is an open-source tool for steering AI animations with precision.
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
heyform/heyform
Open-Source Form Builder
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
fofr/cog-face-to-many
Turn any face into a video game character, pixel art, claymation, 3D or toy