Databend is an open-source, Elastic, Workload-Aware cloud data warehouse engineered for blazing-speed data analytics at a massive scale. Crafted with Rust, it's your most efficient alternative to Snowflake. In Databend, everything is distributed execution.
-
🌐 Cloud-Agnostic: Deploy on any cloud platform, including S3, Azure Blob, Google Cloud Storage, Alibaba Cloud OSS, Tencent Cloud COS, Cloudflare R2, Wasabi or MinIO.
-
⚡ Performance-Driven: Built with Rust and leveraging advanced technologies like SIMD and vectorized query processing, Databend ensures exceptional analytic speeds, outperforming traditional cloud data warehouses.
-
📈 Cost-Efficient Elasticity: With its unique architecture, Databend decouples storage and computation. You can dynamically scale based on your needs, optimizing costs and performance.
-
🔄 Simplified Data Management: Say goodbye to traditional ETL complications. Databend preprocesses data during ingestion, simplifying your data flow.
-
🏡 Lakehouse Design: Combining data lake scalability with data warehouse efficiency, Databend smoothly integrates with Hive and Iceberg, granting the best of both domains.
-
🕰 Snapshot and MVCC: Tap into Git-like MVCC storage to conveniently query, clone, or revert data from any historical moment.
-
📑 Rich Data Support: Whether it's semi-structured data like JSON, CSV, or Parquet, or data types like ARRAY, TUPLE, or MAP, Databend handles them with ease.
-
🤖 AI-Integrated Analytics: Elevate your data analytics with integrated AI capabilities, opening new avenues for insights.
-
👥 Open Source & Community-Driven: With a growing community, Databend ensures transparency, constant updates, and an open platform for all your cloud analytics needs.
The fastest way to try Databend, Databend Cloud
Prepare the image (once) from Docker Hub (this will download about 170 MB data):
docker pull datafuselabs/databend
To run Databend quickly:
docker run --net=host datafuselabs/databend
Deploying Databend
Connecting to Databend
Loading Data into Databend
Loading Data Tools with Databend
Managing Users
Managing Databases
Managing Tables
Managing Views
AI Functions
Data Governance
Securing Databend
Performance
Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the system.contributors table.
Here are some resources to help you get started:
For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:
- Slack (For live discussion with the Community)
- GitHub (Feature/Bug reports, Contributions)
- Twitter (Get the news fast)
- I'm feeling lucky (Pick up a good first issue now!)
Stay updated with Databend's development journey. Here are our roadmap milestones:
Databend is released under a combination of two licenses: the Apache License 2.0 and the Elastic License 2.0.
When contributing to Databend, you can find the relevant license header in each file.
For more information, see the LICENSE file and Licensing FAQs.
-
Inspiration: Databend's design draws inspiration from industry leaders ClickHouse and Snowflake.
-
Computing Model: Our computing foundation is built upon Arrow2, a faster and more secure rendition of the Apache Arrow Columnar Format.
-
Documentation Hosting: The Databend documentation website proudly runs on Vercel.