BrunoGeorgevich
I am a Ph.D. candidate at the University of Porto, currently conducting research on a distributed modularized semantic mapping architecture for robotics.
EDGE
Porto - Portugal
BrunoGeorgevich's Stars
coqui-ai/TTS
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
tw93/Pake
Turn any webpage into a desktop app with Rust. Easily build lightweight multi-platform desktop apps with Rust.
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
openai/openai-python
The official Python library for the OpenAI API
ArchiveBox/ArchiveBox
Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
unclecode/crawl4ai
Crawl4AI: Open-source LLM-friendly Web Crawler & Scraper
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
th-ch/youtube-music
YouTube Music Desktop App bundled with custom plugins (and built-in ad blocker / downloader)
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
getomni-ai/zerox
PDF to Markdown with vision models
iam-veeramalla/Jenkins-Zero-To-Hero
Install Jenkins, configure Docker as a slave, set up CI/CD, and deploy applications to k8s using Argo CD the GitOps way.
microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
livekit/agents
Build real-time multimodal AI applications
langchain-ai/open-canvas
A better UX for chat, writing content, and coding with LLMs.
usefulsensors/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
gabrielchua/open-notebooklm
Convert any PDF into a podcast episode!
neuml/paperai
Semantic search and workflows for medical/scientific papers
halajun/VDO_SLAM
VDO-SLAM: A Visual Dynamic Object-aware SLAM System
theJayTea/WritingTools
The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works with the free Gemini API, local LLMs, and other cloud providers.
reriiasu/speech-to-text
Real-time transcription using faster-whisper
vietanhdev/llama-assistant
AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephrasing sentences, answering questions, writing emails, and more.
pyloid/pyloid
Pyloid is an open-source, Python-backend counterpart to Electron and Tauri that lets you easily use various Python integration features. With Pyloid, you can build desktop applications by integrating Python's capabilities.
UniBwTAS/ccma
Curvature Corrected Moving Average: An accurate and model-free path smoothing algorithm.
MIT-SPARK/Clio
iKrishneel/octomap_server2
ROS2 stack for mapping with OctoMap, contains octomap_server package
keras-team/keras-rs
Multi-backend recommender systems with Keras 3