youngbink

@Databricks

Pinned Repositories

spark
Apache Spark - A unified analytics engine for large-scale data processing
Language:Scala39.3k 2k 028.2k
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Language:Scala7.4k 216 1.5k1.7k
Anserini
Anserini retrieval platform
Language:Java00
api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
Language:Java00
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
Language:Java00
bayes_nets
variable Elimination , bayes_nets
Language:Python01
bespin
Reference implementations of "big data" algorithms in MapReduce and Spark
Language:Java00
bigdata-2018w
CS 451/651 431/631 Data-Intensive Distribute Computing (Winter 2018) at the University of Waterloo
Language:HTML00
tsp_a_star_search
Solving Traveling Salesman Problem with A Star Search
Language:Java10
tsp_simulated_annealing
Solving Traveling Salesman Problem with Simulated Annealing
Language:Java31

youngbink's Repositories

youngbink doesn’t have any repository yet.