lucidworks/spark-solr

PySpark: java.lang.ClassNotFoundException: Failed to find data source: solr.

svanschalkwyk opened this issue · 1 comment

PySpark cannot find any "solr" or "lucidworks...." class.

df = (
    sparkSession.read.format("solr")
    .option("zkhost", zkhost)
    .option("collection", "Partgenix2")
    .load()
)

Fusion 4.3 / Fusion 5....

Try passing the spark-solr shaded jar on the command line so Spark can load the "solr" data source:
pyspark --jars /path/to/spark-solr-3.6.0-shaded.jar
or
spark-submit --jars /path/to/spark-solr-3.6.0-shaded.jar
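
If you start the session from a plain Python script rather than through pyspark or spark-submit, something like the sketch below might also work. It sets Spark's `spark.jars` property on the builder, which is the configuration counterpart of `--jars`. The helper names `solr_jar_conf` and `build_session` are made up for illustration, and the jar path is whatever you downloaded. This is untested against Fusion, and it only has an effect if no Spark JVM is running yet when the session is created.

```python
import os


def solr_jar_conf(jar_path):
    """Return Spark config entries that add the spark-solr shaded jar.

    Hypothetical helper: it fails early with a clear message when the jar is
    missing, instead of a later ClassNotFoundException from Spark.
    """
    if not os.path.isfile(jar_path):
        raise FileNotFoundError("spark-solr jar not found: " + jar_path)
    # spark.jars takes a comma-separated list of jars; one entry here.
    return {"spark.jars": os.path.abspath(jar_path)}


def build_session(jar_path, app_name="solr-read"):
    """Create a SparkSession with the jar attached (assumes pyspark is installed)."""
    # Imported lazily so solr_jar_conf can be used without pyspark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in solr_jar_conf(jar_path).items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

With that in place, `sparkSession = build_session("/path/to/spark-solr-3.6.0-shaded.jar")` followed by the `read.format("solr")` call from the question should at least get past the data source lookup, assuming the jar version matches your Spark version.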