Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. Cloud: https://databend.com

Databend: The Next-Gen Cloud [Data+AI] Analytics

🐋 Introduction

Databend is an open-source, elastic, and workload-aware cloud data warehouse built in Rust, offering a cost-effective alternative to Snowflake. It's designed for complex analysis of the world's largest datasets.

🚀 Why Databend

  • Cloud-Friendly: Integrates seamlessly with cloud storage services such as AWS S3, Azure Blob, Google Cloud Storage, and more.

  • High Performance: Built in Rust, utilizing SIMD and vectorized processing for rapid analytics. See ClickBench.

  • Cost-Efficient Elasticity: Innovative design for separate scaling of storage and computation, optimizing both costs and performance.

  • Easy Data Management: Integrated data preprocessing during ingestion eliminates the need for external ETL tools.

  • Data Version Control: Offers Git-like multi-version storage, enabling easy data querying, cloning, and reverting from any point in time.

  • Rich Data Support: Handles diverse file formats, including JSON, CSV, and Parquet, and rich data types, including ARRAY, TUPLE, MAP, and JSON.

  • AI-Enhanced Analytics: Offers advanced analytics capabilities with integrated AI Functions.

  • Community-Driven: Benefit from a friendly, growing community that offers an easy-to-use platform for all your cloud analytics.

📐 Architecture

[Figure: Databend architecture diagram]

🚀 Try Databend

1. Databend Serverless Cloud

The fastest way to try Databend is Databend Cloud.

2. Install Databend from Docker

Pull the image from Docker Hub (a one-time download of about 170 MB):

docker pull datafuselabs/databend

To run Databend quickly:

docker run --net=host datafuselabs/databend
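
Once the container is up, one way to check that it answers queries is to post a SQL statement to the HTTP query handler. The sketch below is an illustration under assumptions, not part of the official quick start: it assumes the handler listens on port 8000 at `/v1/query` and that a passwordless `root` user is accepted for local connections. `DATABEND_URL`, `SQL`, and `PAYLOAD` are illustrative names; adjust them to match your deployment.

```shell
# Assumption: the HTTP query handler is reachable on localhost:8000.
# Override DATABEND_URL if your deployment uses a different host or port.
DATABEND_URL="${DATABEND_URL:-http://localhost:8000/v1/query}"

# A small aggregate over the built-in numbers() table function.
SQL='SELECT avg(number) FROM numbers(1000)'

# Wrap the statement in a JSON body. This simple printf does no escaping,
# so statements containing double quotes would need extra handling.
PAYLOAD=$(printf '{"sql": "%s"}' "$SQL")

# Assumption: user `root` with an empty password (the `-u root:` part).
# --fail turns HTTP errors into a non-zero exit; --max-time bounds the wait.
curl --silent --show-error --fail --max-time 5 \
  -u root: \
  -H 'Content-Type: application/json' \
  -d "$PAYLOAD" \
  "$DATABEND_URL" \
  || echo "Databend is not reachable at $DATABEND_URL"
```

If the server is running and the assumptions hold, the response should be a JSON document containing the query result; otherwise the fallback message is printed so the script does not fail silently.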

🚀 Getting Started

Deploying Databend
Connecting to Databend
Loading Data into Databend
Loading Data Tools with Databend
Visualize Tools with Databend
Managing Users
Managing Databases
Managing Tables
Managing Data
Managing Views
AI Functions
Data Governance
Securing Databend
Performance

🤝 Contributing

Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the system.contributors table.

Here are some resources to help you get started:

👥 Community

For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:

🛣️ Roadmap

Stay updated with Databend's development journey. Here are our roadmap milestones:

📜 License

Databend is released under a combination of two licenses: the Apache License 2.0 and the Elastic License 2.0.

When contributing to Databend, you can find the relevant license header in each file.

For more information, see the LICENSE file and Licensing FAQs.

🙏 Acknowledgement

  • Inspiration: Databend's design draws inspiration from industry leaders ClickHouse and Snowflake.

  • Computing Model: Our computing foundation is built upon Arrow2, a faster and more secure implementation of the Apache Arrow Columnar Format.

  • Documentation Hosting: The Databend documentation website proudly runs on Vercel.