Pinned Repositories
kubeflow
Machine Learning Toolkit for Kubernetes
dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
citicern
A platform for crowd-sourcing data for scientific research.
Elixir_SymWorld
A simulation project for an Elixir Actor World
profile
Profile website source repository
seldon-core
Machine Learning Deployment for Kubernetes
MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
python_backend
Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.
SachinVarghese's Repositories
SachinVarghese/seldon-core
Machine Learning Deployment for Kubernetes
SachinVarghese/profile
Profile website source repository
SachinVarghese/SachinVarghese
SachinVarghese/telma
Toolkit Evaluator for Language Model Agents
SachinVarghese/alibi-detect
Algorithms for outlier, adversarial and drift detection
SachinVarghese/pgamber
Data observability for PostgreSQL using alibi-detect
SachinVarghese/aws-saa-code
SachinVarghese/beatoven-public-api
SachinVarghese/dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
SachinVarghese/edge-cloud-inference
SachinVarghese/etc3
etc3 is the controller that powers Iter8, the AI-driven Kubernetes experimentation platform. etc3 stands for Extensible Thin Controller with Composable CRDs.
SachinVarghese/foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on any AWS platform and benchmark performance across instance types and serving stack options.
SachinVarghese/genai-landscape
SachinVarghese/genar8t
SachinVarghese/kserve
Serverless Inferencing on Kubernetes
SachinVarghese/langchain
⚡ Building applications with LLMs through composability ⚡
SachinVarghese/manifests
A repository for Kustomize manifests
SachinVarghese/ml-prediction-schema
SachinVarghese/MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
SachinVarghese/nuclear
SachinVarghese/pinot-docs
Apache Pinot Documentation
SachinVarghese/python_backend
Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.
SachinVarghese/seldon-deploy-sdk
SachinVarghese/seldon-gitops
SachinVarghese/sns-infinity
SachinVarghese/story-score
Automatic background scores from text inputs
SachinVarghese/tempo
MLOps Python Library
SachinVarghese/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
SachinVarghese/text-generation-inference
Large Language Model Text Generation Inference
SachinVarghese/virtual-web-museum