gokhanbaydar's Stars
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Vision/TTS) and a plugin system. One-click FREE deployment of your private ChatGPT/Claude application.
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
joaomdmoura/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
ml-explore/mlx
MLX: An array framework for Apple silicon
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
rapidsai/cudf
cuDF - GPU DataFrame Library
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
jagenjo/litegraph.js
A graph node engine and editor written in JavaScript, similar to PD or UDK Blueprints. It comes with its own editor in HTML5 Canvas2D. The engine can run client-side or server-side using Node. Graphs can be exported as JSON to be included in applications independently.
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
ToonCrafter/ToonCrafter
a research paper for generative cartoon interpolation
VAST-AI-Research/TripoSR
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
IceClear/StableSR
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
xinntao/facexlib
FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.
tianweiy/DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
hamadichihaoui/BIRD
This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"
THtianhao/ComfyUI-Portrait-Maker
Srameo/LE3D
HDR 3D Scene Editing!
pwillia7/Basic_ComfyUI_Workflows
Basic Stable Diffusion Workflows for ComfyUI using minimal custom nodes
kijai/ComfyUI-DDColor
ComfyUI node for DDColor
MackinationsAi/Upgraded-Depth-Anything-V2
Upgraded repo with more capabilities: the command-line .py scripts are reworked to function more intuitively, 147 different depth output colour map methods are added, and batch image and video processing are supported. Everything is saved automatically to an outputs folder (with file-naming conventions), and the .pth models are converted to .safetensors.