Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Anserini
Anserini retrieval platform
api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore-compatible metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-the-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions.
bayes_nets
Variable elimination, bayes_nets
bespin
Reference implementations of "big data" algorithms in MapReduce and Spark
bigdata-2018w
CS 451/651 431/631 Data-Intensive Distributed Computing (Winter 2018) at the University of Waterloo
tsp_a_star_search
Solving Traveling Salesman Problem with A Star Search
tsp_simulated_annealing
Solving Traveling Salesman Problem with Simulated Annealing
youngbink's Repositories
youngbink doesn’t have any repository yet.