Pinned Repositories
cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
cloudera-playbook
Cloudera deployment automation with Ansible
cm_api
Cloudera Manager API Client
cm_ext
Cloudera Manager Extensibility Tools and Documentation.
cod-examples
cod-examples
flink-tutorials
flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
hue
Open source SQL Query Assistant service for Databases/Warehouses
impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
Cloudera's Repositories
cloudera/cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
cloudera/python-ngrams
cloudera/bigtop
Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a community around the packaging and interoperability testing of Hadoop-related projects. This includes testing at various levels (packaging, platform, runtime, upgrade, etc...) developed by a community with a focus on the system as a whole, rather than individual projects.
cloudera/ades
An analysis of adverse drug event data using Hadoop, R, and Gephi
cloudera/mapreduce-tutorial
cloudera/madlibport
Madlib port for Cloudera Impala
cloudera/seismichadoop
System for performing seismic data processing on a Hadoop cluster.
cloudera/whirr-cm
cloudera/crepo
cloudera repo management tool
cloudera/emailarchive
Hadoop for archiving email
cloudera/earthquake
cloudera/piglatin-mode
PigLatin mode for Emacs.
cloudera/strata-tutorial-2016-nyc
cloudera/poisson_sampling
cloudera/art-widgets
cloudera/haivvreo
Hive + Avro. Serde for working with Avro in Hive
cloudera/hcatalog-examples
Sample code for reading and writing tables with hcatalog
cloudera/sshj
ssh, scp and sftp for java
cloudera/blog-eclipse
cloudera/director-vsphere-plugin
Cloudera Director - VMware vSphere integration (developed by VMware)
cloudera/github-jira-gateway
A Grails app to serve as a gateway between an internal GitHub Enterprise server and an external JIRA server
cloudera/puppet-apt
Puppet module to help manage Apt
cloudera/jetty-hadoop-fix
A patched Jetty 6.1.26 for use in Hadoop
cloudera/accumulo-upgrade-test
Testing for Apache Accumulo upgrades
cloudera/hcatalog
cloudera/jcarder
git clone of jcarder with some improvements - see lockclasses-new branch
cloudera/ops-testing
cloudera/pelican-elegant
A responsive, minimal, and stylish theme for Pelican
cloudera/sentry-solr-integration
Mirror of Apache Sentry
cloudera/tomcat60
Cloudera Fork of Apache Tomcat 6.0.48 - For Security Vulnerabilities Post-EOL