samos123
Creator of KubeAI, a K8s operator to serve LLMs in production.
@GoogleCloudPlatform San Francisco Bay Area
Pinned Repositories
chatgpt-blog
Prototype using ChatGPT to answer Golang Stack Overflow questions
cloudvision-core
docker-drupal
Drupal image based on the official PHP image. Database information should be passed via environment variables or a linked container. Drush is included.
docker-veth
Shows which veth interface is associated with each container
gke-gcs-fuse-unprivileged
gke-node-ca-importer
multi-cloud-messaging
stable-diffusion-webui-docker
kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports LLMs, embeddings, and speech-to-text.
websu
Website speed and performance optimization and monitoring
samos123's Repositories
samos123/gke-gcs-fuse-unprivileged
samos123/chatgpt-blog
Prototype using ChatGPT to answer Golang Stack Overflow questions
samos123/gke-node-ca-importer
samos123/stable-diffusion-webui-docker
samos123/ai-on-gke
samos123/weaviate
vector search engine
samos123/weaviate-helm
Helm charts to deploy Weaviate to k8s
samos123/weaviate-java-client
Official Weaviate Java Client
samos123/weaviate-spark-connector
Weaviate connector for Apache Spark
samos123/ai-infra-cluster-provisioning
samos123/airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
samos123/axlearn
samos123/cluster-health-scanner
samos123/dotsam
Some of the dotfiles
samos123/faster-whisper-server
samos123/gemma-7b-sql
samos123/infinity
Infinity is a high-throughput, low-latency REST API for serving text-embedding, reranking, and CLIP models
samos123/kind
Kubernetes IN Docker - local clusters for testing Kubernetes
samos123/lingo
LLM proxy and autoscaler for K8s
samos123/logto-js
🤓 Logto JS SDKs.
samos123/mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
samos123/nccl-tests
NVIDIA NCCL Tests for Distributed Training
samos123/pspmigrator-fork
pspmigrator is a tool to migrate from PSP to PSA
samos123/python-storage
samos123/sambalsports.com
samos123/samos123.github.com
Personal blog and site
samos123/Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
samos123/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
samos123/weaviate-io
Website for the Weaviate vector search engine
samos123/weaviate-python-client
A native Python client for easy interaction with a Weaviate instance.