spark-datasource
There are 4 repositories under the spark-datasource topic.
StabRise/spark-pdf
PDF DataSource for Apache Spark; reads PDF files directly into a DataFrame and runs OCR on them
miraisolutions/spark-bigquery
Google BigQuery data source for Apache Spark
spark-root/laurelin
Allows reading ROOT TTrees into Apache Spark as DataFrames
SA01/spark-custom-datasource-tutorial
Contains the code and examples for my article on Medium, which explains how to create a custom JDBC read-only data source in Apache Spark 3