/sparkavro

Load Avro data into Spark with sparklyr

Primary LanguageRApache License 2.0Apache-2.0

Travis-CI Build Status

sparkavro

Load Avro data into Spark with sparklyr. It is a wrapper of spark-avro

Installation

Install using {devtools} as follows:

devtools::install_github("chezou/sparkavro")

Usage

library(sparklyr)
library(sparkavro)
sc <- spark_connect(master = "spark://HOST:PORT")
df <- spark_read_avro(sc, "test_table", "/user/foo/test.avro")

spark_write_avro(df, "/tmp/output")

Example data are from https://github.com/miguno/avro-cli-examples