keremk
Engineering leader, ex @StuartApp, @xing, @soundcloud, @microsoft, co-founder NCompass Labs with exit to MSFT
Coding Ventures, Barcelona
keremk's Stars
browser-use/browser-use
Make websites accessible for AI agents
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
microsoft/markitdown
Python tool for converting files and office documents to Markdown.
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
GraphiteEditor/Graphite
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
getomni-ai/zerox
OCR & Document Extraction using vision models
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
microsoft/TinyTroupe
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
awslabs/multi-agent-orchestrator
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
souzatharsis/podcastfy
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
corbt/agent.exe
circlemind-ai/fast-graphrag
RAG that intelligently adapts to your use case, data, and queries
MrForExample/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
pingcap/autoflow
pingcap/autoflow is a Graph RAG-based conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai
malmeloo/FindMy.py
🍏 + 🎯 + 🐍 = Everything you need to query Apple's FindMy network!
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
google-gemini/multimodal-live-api-web-console
A React-based starter app for using the Multimodal Live API over websockets with Gemini
midday-ai/languine
Translate your application with Languine CLI powered by AI.
huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
aidenybai/bippy
⚠️ hack into react internals
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
transitive-bullshit/openai-realtime-api
TypeScript client for OpenAI's realtime voice API.
bnurbekov/company-email-validator
Checks whether an email is a company email (useful for B2B forms)