Pinned Repositories
Cassandra
Cassandra development using Java and Scala
DataDog-Work
Datadog is a monitoring service for cloud-scale applications, bringing together data from servers, databases, tools, and services to present a unified view of an entire stack.
ElasticSearch-5.x-Java-Client
This project demonstrates basic CRUD operations on Elasticsearch 5.x using the Java transport client.
Gobblin-Work
This Gobblin project contains some sample code to publish data from Kafka/FileSystem to HDFS in different output formats.
Hadoop
Hadoop Programming
Hive-Pig-Hbase
Hive, Pig, HBase, and Sqoop examples
HiveMetaStoreClient
This project explains how to use HiveMetaStoreClient, HiveJdbcDriver, and HiveServer2.
oozie-work
Demonstrates how to develop an Oozie workflow application and aims to showcase Oozie's features.
Spark
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that allow data workers to efficiently execute streaming, machine learning, or SQL workloads that require fast iterative access to datasets. This project contains sample programs for Spark in Scala.
Spark-Java
This project contains sample programs for Spark in Java.
spider-123-eng's Repositories
spider-123-eng/Spark
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that allow data workers to efficiently execute streaming, machine learning, or SQL workloads that require fast iterative access to datasets. This project contains sample programs for Spark in Scala.
spider-123-eng/Hive-Pig-Hbase
Hive, Pig, HBase, and Sqoop examples
spider-123-eng/Hadoop
Hadoop Programming
spider-123-eng/HiveMetaStoreClient
This project explains how to use HiveMetaStoreClient, HiveJdbcDriver, and HiveServer2.
spider-123-eng/ElasticSearch-5.x-Java-Client
This project demonstrates basic CRUD operations on Elasticsearch 5.x using the Java transport client.
spider-123-eng/Gobblin-Work
This Gobblin project contains some sample code to publish data from Kafka/FileSystem to HDFS in different output formats.
spider-123-eng/Spark-Java
This project contains sample programs for Spark in Java.
spider-123-eng/DataDog-Work
Datadog is a monitoring service for cloud-scale applications, bringing together data from servers, databases, tools, and services to present a unified view of an entire stack.
spider-123-eng/Spring3-Hibernate3
This project shows how to integrate Spring and Hibernate.
spider-123-eng/Cassandra
Cassandra development using Java and Scala
spider-123-eng/oozie-work
Demonstrates how to develop an Oozie workflow application and aims to showcase Oozie's features.
spider-123-eng/Ansible
Ansible is an open-source automation engine that automates software provisioning, configuration management, and application deployment.
spider-123-eng/docker
spider-123-eng/docker-1
Docker Playground
spider-123-eng/ElasticSearch-2.x-Java-Client
This project demonstrates basic CRUD operations on Elasticsearch using the Java transport client.
spider-123-eng/examples
spider-123-eng/kafka-producer-examples
Examples of using the DataStax Apache Kafka Connector.
spider-123-eng/Kafka-Streams-Real-time-Stream-Processing
This is the central repository for all materials related to the book Kafka Streams: Real-time Stream Processing! by Prashant Pandey.
spider-123-eng/kubelabs
Kubernetes - Beginners | Intermediate | Advanced
spider-123-eng/kubernetes
Kubernetes playground
spider-123-eng/OozieSamples
Oozie Samples
spider-123-eng/samza
Apache Samza is a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.