cloudera
There are 185 repositories under cloudera topic.
HariSekhon/DevOps-Bash-tools
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
OryxProject/oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
HariSekhon/Nagios-Plugins
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
hortonworks/cloudbreak
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
timveil/hive-jdbc-uber-jar
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
HariSekhon/HAProxy-configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Pushkr/Apache-Spark-Hands-On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
AlionSSS/CDH-Install-Manual
CDH安装手册
san089/Cloudera_Material
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
teamclairvoyant/hadoop-deployment-bash
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
ryandawsonuk/data-platforms-tools
Guide to data platforms and tools
frischHWC/one-script-deploy
One Click Script to Deploy CDP (CDP PvC & HDP & CDH)
tspannhw/FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
oracle-quickstart/oci-cloudera
Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
teamclairvoyant/apache-airflow-cloudera-csd
CSD for Apache Airflow
cloudera/tutorial-assets
Assets used in Cloudera Tutorials
HariSekhon/lib
Perl Utility Library for my other repos
teamclairvoyant/apache-airflow-cloudera-parcel
Parcel for Apache Airflow
ummmme/setup_cdh
CDH5.16.2 离线安装脚本
frischHWC/datagen
Datagenerator for Data Services
srowen/cdsw-simple-serving
Modeling Lifecycle with ACME Occupancy Detection and Cloudera
cloudera/cdp-sdk-java
Cloudera CDP SDK for Java
thammuio/doc-genius-ai
DocGenius AI - Generative AI Chatbot for your Documents
dmilan77/cloudera-phoenix
CDH compliant Apache Phoenix
cloudera/cdpcli
CDP command line interface (CLI)
HuemulSolutions/huemul-bigdatagovernance
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
chezou/homebrew-cloudera
Homebrew Formulas for cloudera tools
kongyew/greenplum-dockers
Create Greenplum docker files
tspannhw/ClouderaFlowManagementWorkshop
Cloudera Flow Management Workshop with Apache NiFi
Sathiyarajan/big-data-pipeline
Big Data
cloudera/cdpcurl
Curl like tool with CDP request signing.
NFPA/LocationTools
Geocoding and Reverse Geocoding at Scale
Powerspace/kudu-from-avro
A small Command Line tool to create an Kudu table from an Avro schema or from SQL script
tspannhw/minifi-jetson-nano
MiNiFi Agent Configuration and Scripts for NVidia Jetson Nano device
tspannhw/MmFLaNK
Mm FLaNK Stack (MXNet, MiNiFi, Flink, NiFi, Kafka, Kudu) for AI-IoT