hive

There are 1828 repositories under hive topic.

  • cube

    cube-js/cube

    📊 Cube — The Semantic Layer for Building Data Applications

    Language:Rust17k1542.1k1.7k
  • Tencent/APIJSON

    🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.

    Language:Java16.5k3825512.1k
  • presto

    prestodb/presto

    The official home of the Presto distributed SQL query engine for big data

    Language:Java15.5k8656.2k5.2k
  • heibaiying/BigData-Notes

    大数据入门指南 :star:

    Language:Java15.2k443434.1k
  • apache/doris

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    Language:Java11.2k2786.7k3k
  • trinodb/trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

    Language:Java9.4k1686.1k2.7k
  • wangzhiwubigdata/God-Of-BigData

    专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

  • apache/hive

    Apache Hive

    Language:Java5.3k33604.6k
  • tobymao/sqlglot

    Python SQL Parser and Transpiler

    Language:Python5.3k381.3k509
  • hive

    isar/hive

    Lightweight and blazing fast key-value database written in pure Dart.

    Language:Dart3.9k601.1k382
  • liyupi/sql-generator

    🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~

    Language:Vue3.4k2021695
  • apache/linkis

    Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

    Language:Java3.2k2612.5k1.1k
  • WeBankFinTech/DataSphereStudio

    DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

    Language:Java2.9k180740981
  • LuckyZXL2016/Movie_Recommend

    基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统

    Language:Java2.7k108181k
  • MoRan1607/BigDataGuide

    大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料

    Language:Java2.5k465834
  • SZT-bigdata

    geekyouth/SZT-bigdata

    深圳地铁大数据客流分析系统🚇🚄🌟

    Language:Scala2.1k6221598
  • Qihoo360/Quicksql

    A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

    Language:Java2k122137580
  • apache/kyuubi

    Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

    Language:Scala1.9k632.1k848
  • apache/drill

    Apache Drill is a distributed MPP query layer for self describing data

    Language:Java1.9k158116976
  • pinterest/querybook

    Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.

    Language:TypeScript1.7k34210205
  • dropbox/PyHive

    Python interface to Hive and Presto. 🐝

    Language:Python1.7k62288552
  • docs4dev/docs4dev

    后端开发常用框架文档及中文翻译,包含 Spring 系列文档(Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session),大数据(Apache Hive, HBase, Apache Flume),日志(Log4j2, Logback),Http Server(NGINX,Apache),Python,数据库(OpenTSDB,MySQL,PostgreSQL)等最新官方文档以及对应的中文翻译。

  • DTStack/Taier

    Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display

    Language:Java1.3k33475309
  • collabH/bigdata-growth

    大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

    Language:Shell1.2k304315
  • Addax

    wgzhao/Addax

    Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.

    Language:Java1.1k31274278
  • WeBankFinTech/Scriptis

    Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

    Language:Vue7997440265
  • devlive-community/datacap

    DataCap is integrated software for data transformation, integration, and visualization. Support a variety of data sources, file types, big data related database, relational database, NoSQL database, etc. Through the software can realize the management of multiple data sources, the data under the source of various operations conversion ...

    Language:Java7871115272
  • OBenner/data-engineering-interview-questions

    More than 2000+ Data engineer interview questions.

  • nielsbasjes/yauaa

    Yet Another UserAgent Analyzer

    Language:Java71936356124
  • macbre/sql-metadata

    Uses tokenized query returned by python-sqlparse and generates query metadata

    Language:Python71815165117
  • sunnyandgood/BigData

    💎🔥大数据学习笔记

    Language:Java663302225
  • WeBankFinTech/WeDataSphere

    WeDataSphere is a financial grade, one-stop big data platform suite.

  • yanagishima/yanagishima

    Web UI for Trino, Hive and SparkSQL

    Language:Java62129126197
  • gangly/datafaker

    Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具

    Language:Python60620107167
  • ploomber/jupysql

    Better SQL in Jupyter. 📊

    Language:Python579647370
  • TurboWay/spiderman

    基于 scrapy-redis 的通用分布式爬虫框架

    Language:Python5551822119