Pinned Repositories
cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
cloudera-playbook
Cloudera deployment automation with Ansible
cm_api
Cloudera Manager API Client
cm_ext
Cloudera Manager Extensibility Tools and Documentation.
cod-examples
cod-examples
flink-tutorials
flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
hue
Open source SQL Query Assistant service for Databases/Warehouses
impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
Cloudera's Repositories
cloudera/cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
cloudera/kitten
The fast and fun way to write YARN applications.
cloudera/kudu-examples
Example code for Kudu
cloudera/hs2client
C++ native client for Impala and Hive, with Python / pandas bindings
cloudera/impala-udf-samples
Sample UDF and UDAs for Impala.
cloudera/director-scripts
Cloudera Director sample code
cloudera/ades
An analysis of adverse drug event data using Hadoop, R, and Gephi
cloudera/kafka-examples
Kafka Examples repository.
cloudera/mapreduce-tutorial
cloudera/madlibport
Madlib port for Cloudera Impala
cloudera/parquet-examples
Example programs and scripts for accessing parquet files
cloudera/cdsw-training
Example Python and R code for Cloudera Data Science Workbench training
cloudera/crepo
cloudera repo management tool
cloudera/navigator-sdk
Navigator SDK
cloudera/earthquake
cloudera/strata-tutorial-2016-nyc
cloudera/whirr
Mirror of Apache Whirr
cloudera/datafu
cloudera/hcatalog-examples
Sample code for reading and writing tables with hcatalog
cloudera/blog-eclipse
cloudera/director-vsphere-plugin
Cloudera Director - VMware vSphere integration (developed by VMware)
cloudera/jetty-hadoop-fix
A patched Jetty 6.1.26 for use in Hadoop
cloudera/accumulo
CDH specific changes and backports on top of Apache Accumulo
cloudera/accumulo-upgrade-test
Testing for Apache Accumulo upgrades
cloudera/dist-tf
dist-tf
cloudera/jackson-databind
General data-binding package for Jackson (2.x): works on streaming API (core) implementation(s)
cloudera/jcarder
git clone of jcarder with some improvements - see lockclasses-new branch
cloudera/oscar
OSCAR is a diagnostic tool that assesses whether an object store is suitable for use with Apache Hadoop.
cloudera/pelican-elegant
A responsive, minimal, and stylish theme for Pelican
cloudera/tomcat60
Cloudera Fork of Apache Tomcat 6.0.48 - For Security Vulnerabilities Post-EOL