/cascading-jdbc-on-hdp

Sample project for running Cascading-jdbc on Hortonworks

Primary LanguageJavaGNU General Public License v2.0GPL-2.0

cascading-jdbc-on-hdp

Sample project for running Cascading-jdbc on Hortonworks

Installation and Deployment

Use gradle as a build system. This project was developed and tested using

  • Gradle 3.4
  • Hortonworks Sandbox 2.4
  • JDK 8, compiling for JDK 7 (due to Java-Version on the sandbox)

Use

gradle uploadToHortonworksSandbox

to upload the packaged project to the sandbox. Adjust the target path before!

Getting started

  • Make sure that small.txt is available in HFS, the current implementation looks for the file in the current users directory, which in my case is /user/root

  • start a postgres database somewhere, I was using docker as follows: docker run -P -e POSTGRES_PASSWORD=password --rm postgres which resulted in a container running on 172.16.102.85 listening at port 32768 (see TestJdbc1.java)

  • If not done, deploy the packaged project to the Hortonworks Sandbox

    • Either run gradle assemble copyAllDependencies and copy the contents of build/libs to the server, or
    • Update ext.targetDir (approx. line 53 in build.gradle), create the directory on the server once, and run gradle uploadToHortonworksSandbox
  • Put the job to yarn, using

    yarn jar cascading-jdbc-on-hdp-1.0.0.jar net.brunel.TestJdbc1

from the command line shell.