parano
Founder/CEO of BentoML, previously @databricks, ML/AI Platforms & Systems, Product Design
San Francisco, CA
parano's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
backstage/backstage
Backstage is an open framework for building developer portals
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Codium-ai/pr-agent
🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
zsviczian/obsidian-excalidraw-plugin
A plugin to edit and view Excalidraw drawings in Obsidian
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
MeetKai/functionary
Chat language model that can use tools and interpret the results
stanford-oval/WikiChat
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
test-time-training/ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
readwiseio/obsidian-readwise
Official Readwise plugin for Obsidian
bentoml/BentoVLLM
Self-host LLMs with vLLM and BentoML
bentoml/llm-bench
bentoml/rag-tutorials
a series of tutorials implementing rag service with BentoML and LlamaIndex
bentoml/BentoChatTTS
bentoml/BentoLMDeploy
Self-host LLMs with LMDeploy and BentoML
bentoml/openllm-models
bentoml/BentoTRTLLM
bentoml/BentoYolo
BentoML service of YOLO v8
bentoml/llm-router
Multi-LLM Routing API Endpoint with BentoML
bentoml/BentoMLCLLM
mlops-club/vscode-bentoml
bentoml/BentoTGI