Pinned Repositories
996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
adventures-in-ml-code
This repository holds all the code for the site http://www.adventuresinmachinelearning.com
be-a-professional-programmer
成为专业程序员路上用到的各种优秀资料、神器及框架
data_science_by_example
Examples of Data Science Tools & Libraries
realdeal
A data pipeline to scrape and process data from real estate websites.
skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
spark
Mirror of Apache Spark
tutorials
The "REST With Spring" Classes:
zillow
example of zillow web crawl
wangmiao1981's Repositories
wangmiao1981/spark
Mirror of Apache Spark
wangmiao1981/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
wangmiao1981/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
wangmiao1981/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
wangmiao1981/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
wangmiao1981/CVE-2021-44228-Apache-Log4j-Rce
Apache Log4j 远程代码执行
wangmiao1981/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
wangmiao1981/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
wangmiao1981/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
wangmiao1981/egeria
Open Metadata and Governance
wangmiao1981/graphql-engine
Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.
wangmiao1981/incubator-iceberg
Apache Iceberg (Incubating)
wangmiao1981/istio
Connect, secure, control, and observe services.
wangmiao1981/JNDI-Injection-Exploit
JNDI注入测试工具(A tool which generates JNDI links can start several servers to exploit JNDI Injection vulnerability,like Jackson,Fastjson,etc)
wangmiao1981/KubeArmor
Container-aware Runtime Security Enforcement System
wangmiao1981/Logout4Shell
Use Log4Shell vulnerability to vaccinate a victim server against Log4Shell
wangmiao1981/marvin
A batteries-included library for building AI-powered software
wangmiao1981/nessie
Nessie provides Git-like capabilities for your Data Lake
wangmiao1981/reaction
Mailchimp Open Commerce is an API-first, headless commerce platform built using Node.js, React, GraphQL. Deployed via Docker and Kubernetes.
wangmiao1981/redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
wangmiao1981/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
wangmiao1981/saleor
A modular, high performance, headless e-commerce platform built with Python, GraphQL, Django, and React.
wangmiao1981/scylla
NoSQL data store using the seastar framework, compatible with Apache Cassandra
wangmiao1981/spree
Open Source modular headless multi-language/multi-currency/multi-store e-commerce platform
wangmiao1981/temporal
Temporal service
wangmiao1981/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
wangmiao1981/trino-the-definitive-guide
Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)
wangmiao1981/unitycatalog
Open, Multi-modal Catalog for Data & AI
wangmiao1981/xskipper
An Extensible Data Skipping Framework
wangmiao1981/yugabyte-db
The high-performance distributed SQL database for global, internet-scale apps.