kunal449's Stars
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
SimpleDataLabsInc/prophecy-build-tool
Prophecy-built-tool (PBT) allows you to quickly build projects generated by Prophecy (your standard Spark Scala and PySpark pipelines) to integrate them with your own CI / CD (e.g. Github Actions), build system (e.g. Jenkins), and orchestration (e.g. Databricks Workflows).
aimacode/aima-scala
foundweekends/giter8
a command line tool to apply templates defined on GitHub
tomwhite/hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
mysql/mysql-server
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.
memcached/memcached
memcached development tree
jtleek/datasharing
The Leek group guide to data sharing
words-sdsc/coursera
Data sets and scripts for Coursera Big Data Specialization.
facebook/zstd
Zstandard - Fast real-time compression algorithm
apache/storm
Apache Storm
swiftlang/swift
The Swift Programming Language
milinda/calcite-tutorial
Apache Calcite Tutorial
julianhyde/optiq-csv
Obsolete - now the CSV adapter in Apache Calcite
julianhyde/optiq
Obsolete - superseded by Apache Calcite
apache/calcite
Apache Calcite
apache/hadoop
Apache Hadoop
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
akka/akka
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
scala/scala
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3