xsqian's Stars
milvus-io/bootcamp
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
dilverse/rag-with-minio
Automate your RAG pipeline with MinIO Bucket Notification
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
GailGithub/GAIL_Dev
GAIL is a suite of algorithms for integration problems in one, many, and infinite dimensions, and whose answers are guaranteed to be correct. GAIL is created, developed, and maintained by Fred Hickernell (Illinois Institute of Technology), Sou-Cheng Choi (University of Chicago and Argonne National Laboratory), and their collaborators including Yuhan Ding (IIT), Lan Jiang (IIT), and Yizhi Zhang (IIT).
opendatahub-io/opendatahub-documentation
Repository for official documentation of ODH Core components
vmware-labs/distribution-tooling-for-helm
Helm Distribution plugin is is a set of utilities and Helm Plugin for making offline work with Helm Charts easier. It is meant to be used for creating reproducible and relocatable packages for Helm Charts that can be moved around registries without hassles. This is particularly useful for distributing Helm Charts into airgapped environments.
hystax/optscale
FinOps, MLOps and cloud cost optimization tool. Supports AWS, Azure, GCP, Alibaba Cloud and Kubernetes.
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
truera/trulens
Evaluation and Tracking for LLM Experiments
GoogleCloudPlatform/asl-ml-immersion
This repos contains notebooks for the Advanced Solutions Lab: ML Immersion
RamiKrispin/awesome-ds-setting
A tutorial for setting a new machine with core data science tools
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
GoogleCloudPlatform/python-docs-samples
Code samples used on cloud.google.com
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
mlrun/mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
influxdata/influxdb
Scalable datastore for metrics, events, and real-time analytics
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
ruoshan/autoportforward
Bidirectional port-forwarding for docker, podman and kubernetes
awootton/knotfreeiot
Knotfree.net is a tool for creating IOT products. This is the code for a distributed pub/sub system. Supports a minimal subset of MQTT (3.1 and 5) and other convenient formats. Go and kubernetes. It's live at knotfree.net It runs on tokens so there's no signup to use it.
datamechanics/delight
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
igz-us-sales/igz-platform-deployment
mlrun/demo-github-actions
demo CI/CD pipeline using MLRun, Kubeflow and GitHub Actions
kubeflow/mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
dotnet/iot
This repo includes .NET Core implementations for various IoT boards, chips, displays and PCBs.
dotnet/try
Try .NET provides developers and content authors with tools to create interactive experiences.
databricks/spark-deep-learning
Deep Learning Pipelines for Apache Spark