coder543
Software engineer with a focus on high-performance backend systems written in either Rust or Go.
Tampa, FL
coder543's Stars
immich-app/immich
High performance self-hosted photo and video management solution.
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
plausible/analytics
Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.
continuedev/continue
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
Infisical/infisical
♾ Infisical is the open-source secret management platform: Sync secrets across your team/infrastructure, prevent secret leaks, and manage internal PKI
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
rhasspy/piper
A fast, local neural text to speech system
THUDM/CogVLM
A state-of-the-art-level open visual language model | Multimodal pre-trained model
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
WhisperSpeech/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
danielgross/localpilot
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
aras-p/UnityGaussianSplatting
Toy Gaussian Splatting visualization in Unity
smallcloudai/refact
WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding
apple/ml-aim
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
saagarjha/Ensemble
Cast Mac windows to visionOS
LAION-AI/natural_voice_assistant
Fuzzy-Search/realtime-bakllava
llama.cpp with the BakLLaVA model describes what it sees
kingyiusuen/clip-image-search
Search images with a text or image query, using OpenAI's pretrained CLIP model.
spacelift-io/spacectl
Spacelift client and CLI