Load Avro data into Spark with sparklyr. It is a wrapper of spark-avro
Install using {devtools}
as follows:
devtools::install_github("chezou/sparkavro")
library(sparklyr)
library(sparkavro)
sc <- spark_connect(master = "spark://HOST:PORT")
df <- spark_read_avro(sc, "test_table", "/user/foo/test.avro")
spark_write_avro(df, "/tmp/output")
Example data are from https://github.com/miguno/avro-cli-examples