hdinsight
There are 43 repositories under the hdinsight topic.
dotnet/spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
microsoft/data-accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with creation, editing, and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
microsoft/AzureSMR
AzureSMR is no longer being actively developed. For ongoing support of Azure in R, see: https://github.com/Azure/AzureR
hdinsight/hdinsight-kafka-tools
HDInsight Kafka Tools
caiomsouza/microsoft-big-data-scientist-and-ai
Microsoft Big Data, Data Scientist, and AI
kcheeeung/hive-benchmark
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
SyncfusionSuccinctlyE-Books/HDInsight-Succinctly
This is the companion repo for HDInsight Succinctly by James Beresford. Published by Syncfusion.
UnoSD/SparkSharp
C# Livy client to submit Spark jobs to HDInsight and other Spark clusters
kojish/hdinsight-spark-livy-client
Java client for submitting a remote job to an HDInsight Spark cluster via Livy.
angadsingh/airflow-ditto
An Airflow DAG transformation framework
angadsingh/airflow-hdinsight
HDInsight provider for Airflow
syedhassaanahmed/azure-kafka-spark-adls
Azure ARM template to deploy Kafka and Spark clusters in same VNet with ADLS
hau-mal/articles
BigData Blog articles
AdamPaternostro/Azure-Databricks-HDInsight-Hive-Metastore
How to share an HDInsight Hive Metastore with Azure Databricks
AdamPaternostro/Azure-HDI-DistCP
Creates an HDInsight cluster, then runs distcp remotely to copy data between blob and/or data lake (ADLS)
AdamPaternostro/Azure-HDInsight-ARM-Template
Creates an HDInsight cluster that has an external Hive metastore and access to Azure Data Lake Store
AdamPaternostro/Azure-Spark-Livy-Application-Insights-External-Dependency
Use Spark with Livy along with Application Insights. Learn to host your external dependencies in data lake.
AfonsoFeliciano/Azure-Data-Factory-for-Data-Engineers
Repository for the Azure Data Factory for Data Engineers course - Project Covid 19
anirudhgupta22/Microsoft-Azure-HDInsight
Short documentation on Microsoft's Azure HDInsight
iBalajiShanmugam/covid19-adf
COVID19-ADF is a project that leverages Azure services to collect, analyze, and visualize COVID-19 data. With seamless data integration and advanced analytics, it provides valuable insights into the pandemic's impact, enabling informed decision-making in the fight against COVID-19.
kcheeeung/hive-testbench
TPC-DS benchmark for experimenting with Apache Hive at any data scale
kcm117/jupyter_local_hdispark_config
Configure local Jupyter with an HDInsight Spark cluster
kojish/hbase-client
HBase client application
murggu/azure-hdinsight-aks-terraform
An example repo for provisioning a complete HDInsight on AKS environment.
windson/HDInsight-Top-N-OverPriced-Products-MapReduce
Top N OverPriced Products Using HDInsight streaming MapReduce Job
windson/HDInsight-TopN-Reviews-MapReduce
TopN Products by category using HDInsight Streaming MapReduce
windson/ReviewsByDate
Get the number of reviews per date, in descending order, using HDInsight
zjplab/Processing-Big-data-with-Hadoop-in-Azure-HdInsight
Microsoft edX course DAT202.1x
calhaley/covid19_report_azure_data_factory
Integration of Covid-19 data utilising Azure Data Factory to perform data ingestion, transformation and storage activities. The goal of this guided project was to become familiar with Microsoft Azure technologies, including Azure Data Factory (ADF), Azure Data Lake Storage Gen2, Azure SQL Database, Azure Blob Storage, Dataflow, Databricks, etc.
cs-uche/adf-pandemic-analytics
Pandemic Analytics with Data Factory
Datatamer/terraform-azure-hdinsight-hbase
Terraform module for terraform-azure-hdinsight-hbase
khoinguyen19k8/adf-covid19
Data pipeline that processes Covid19 data in Azure Data Factory. CI/CD with Azure DevOps.
proazr/hdinsight-script-actions
Custom HDInsight Script Actions
anjijava16/Azure-Cloud-utils
Azure Analytics (azure_cloud_utils)
mpfishe2/store-data-generator
This is a grocery store data generator for emulating batch and real-time POS transactions and sending them to either Azure Event Hubs or Apache Kafka (test with Azure HDInsight Kafka)