- Download Java SE 8 (8u211 or later) from: https://www.oracle.com/java/technologies/javase/javase8u211-later-archive-downloads.html
- To check the installed Java version, run this command in cmd or Anaconda Prompt: java -version
- Download Spark from: https://spark.apache.org/downloads.html
- Extract the downloaded archive with 7-Zip.
- Create a new folder named Spark and move the extracted Spark folder into it.
- Download Hadoop winutils from this link and place winutils.exe in the bin folder under your HADOOP_HOME folder (for example C:\Hadoop\bin): https://github.com/cdarlint/winutils/tree/master/hadoop-3.2.1/bin
- Environment Variables Update
- Variable name = SPARK_HOME | Variable value = C:\spark\spark-2.2.1-bin-hadoop2.7
- Variable name = HADOOP_HOME | Variable value = C:\Hadoop
- For Java: Variable name = JAVA_HOME | Variable value = C:\Program Files\Java\jdk1.8.0_311
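
A quick way to confirm the three variables above took effect is a small Python check, run from a freshly opened prompt or notebook. This is only a sketch: the function name check_spark_env is made up for illustration, and it assumes winutils.exe was placed in the bin folder under HADOOP_HOME.

```python
import os
from pathlib import Path

# Variable names taken from the environment variable steps above.
REQUIRED_VARS = ("SPARK_HOME", "HADOOP_HOME", "JAVA_HOME")


def check_spark_env(env=None):
    """Return a list of problems found; an empty list means every check passed."""
    env = os.environ if env is None else env
    problems = []

    # Each variable must be set and must point to an existing folder.
    for name in REQUIRED_VARS:
        value = env.get(name)
        if not value:
            problems.append(f"{name} is not set")
        elif not Path(value).is_dir():
            problems.append(f"{name} points to a missing folder: {value}")

    # Assumption: winutils.exe lives in <HADOOP_HOME>\bin.
    hadoop_home = env.get("HADOOP_HOME")
    if hadoop_home and Path(hadoop_home).is_dir():
        winutils = Path(hadoop_home) / "bin" / "winutils.exe"
        if not winutils.is_file():
            problems.append(f"winutils.exe not found at {winutils}")

    return problems


if __name__ == "__main__":
    found = check_spark_env()
    for problem in found:
        print("Problem:", problem)
    if not found:
        print("All environment checks passed")
```

If a variable shows as not set right after editing it, close and reopen cmd, Anaconda Prompt, or Jupyter, since already running programs keep the old environment.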
- ! pip install pyspark (run this in a Jupyter Notebook cell; in Anaconda Prompt, drop the leading ! and run pip install pyspark)
- ! pip install findspark
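
Once both packages are installed, a short smoke test can confirm that Python finds Spark. The sketch below is one possible check, not the only way to do it: the function name spark_smoke_test and the app name install-check are made up for illustration, and it assumes SPARK_HOME is set as described above.

```python
def spark_smoke_test(spark_home=None):
    """Start a local Spark session, build a tiny DataFrame, and return its row count."""
    # Imported inside the function so the definition loads even before
    # pyspark and findspark are installed.
    import findspark

    # With spark_home=None, findspark looks up the Spark location itself,
    # starting with the SPARK_HOME environment variable.
    findspark.init(spark_home)

    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[*]")        # run locally on all available cores
        .appName("install-check")  # hypothetical app name
        .getOrCreate()
    )
    try:
        df = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "label"])
        return df.count()
    finally:
        # Always release the session, even if the check fails.
        spark.stop()
```

Calling spark_smoke_test() in a notebook cell should return 2 when the installation works. An error that mentions Java or winutils usually points back to JAVA_HOME or HADOOP_HOME.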