cloudera
There are 178 repositories under cloudera topic.
HariSekhon/DevOps-Bash-tools
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
OryxProject/oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
HariSekhon/Nagios-Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
hortonworks/cloudbreak
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
timveil/hive-jdbc-uber-jar
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
HariSekhon/HAProxy-configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Pushkr/Apache-Spark-Hands-On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
AlionSSS/CDH-Install-Manual
CDH安装手册
teamclairvoyant/hadoop-deployment-bash
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
ryandawsonuk/data-platforms-tools
Guide to data platforms and tools
san089/Cloudera_Material
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
oracle-quickstart/oci-cloudera
Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
teamclairvoyant/apache-airflow-cloudera-csd
CSD for Apache Airflow
HariSekhon/lib
Perl Utility Library for my other repos
teamclairvoyant/apache-airflow-cloudera-parcel
Parcel for Apache Airflow
ummmme/setup_cdh
CDH5.16.2 离线安装脚本
cloudera/tutorial-assets
Assets used in Cloudera Tutorials
srowen/cdsw-simple-serving
Modeling Lifecycle with ACME Occupancy Detection and Cloudera
tspannhw/FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
cloudera/cdp-sdk-java
Cloudera CDP SDK for Java
dmilan77/cloudera-phoenix
CDH compliant Apache Phoenix
HuemulSolutions/huemul-bigdatagovernance
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
chezou/homebrew-cloudera
Homebrew Formulas for cloudera tools
cloudera/cdpcli
CDP command line interface (CLI)
kongyew/greenplum-dockers
Create Greenplum docker files
Sathiyarajan/big-data-pipeline
Big Data
thammuio/doc-genius-ai
DocGenius AI - Generative AI Chatbot for your Documents - Powered by Cloudera Machine Learning (CML)
tspannhw/ClouderaFlowManagementWorkshop
Cloudera Flow Management Workshop with Apache NiFi
cloudera/cdpcurl
Curl like tool with CDP request signing.
NFPA/LocationTools
Geocoding and Reverse Geocoding at Scale
Powerspace/kudu-from-avro
A small Command Line tool to create an Kudu table from an Avro schema or from SQL script
ptobarra/Business-Intelligence-on-Big-Data-_-U-TAD-2017-Big-Data-Master-Final-Project
This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.
Ranjandas/Dirty-CDH-Docker
A quick and dirty CDH cluster skeleton using Docker for Testing
tspannhw/minifi-jetson-nano
MiNiFi Agent Configuration and Scripts for NVidia Jetson Nano device
tspannhw/MmFLaNK
Mm FLaNK Stack (MXNet, MiNiFi, Flink, NiFi, Kafka, Kudu) for AI-IoT