big-data-europe/docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo contains deployment instructions for running HDFS and Spark inside Docker containers. It also includes spark-notebook and the HDFS FileBrowser.
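As a rough sketch of how such a stack is typically brought up (the compose file contents, service names, and ports below are assumptions for illustration, not taken from this repo's Makefile):

```shell
# Start the HDFS + Spark services in the background.
# docker-compose.yml is assumed to define namenode, datanode,
# spark-master, and spark-worker services, as in comparable setups.
docker-compose up -d

# Check that the containers came up.
docker-compose ps

# The namenode web UI is commonly published on port 50070 and the
# Spark master UI on 8080; adjust to whatever the compose file maps.
curl -s http://localhost:50070 > /dev/null && echo "namenode UI reachable"
```

Scaling workers (the subject of several issues below) is then a matter of `docker-compose up -d --scale spark-worker=3`, provided the worker service does not pin a fixed host port.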
Primary language: Makefile
Issues
TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources
#72 opened by nguacon90 - 0
Unable to query the hive table in container from spark-shell running in Windows
#68 opened by ZosBHAI - 0
beeline
#67 opened by omarelnahas23 - 4
swarm makefile traefik:1.7 syntax
#66 opened by alxdembo - 1
How to read data into Spark from HDFS?
#64 opened by aleksandarskrbic - 0
Directory /hadoop/dfs/name is in an inconsistent state: storage directory does not exist or is not accessible.
#65 opened by wdorninger - 2
Great repo! But how can we use Pyspark?
#53 opened by haydenliu - 2
error in spark notebook
#57 opened by gsumar - 1
Why does HDFS run without YARN?
#56 opened by dolenam317 - 4
incompatible clusterID Hadoop
#55 opened by Atahualkpa - 1
How to add a file from local file system?
#54 opened by alexeytochin - 1
Failed to connect to namenode:8020
#52 opened by kkalugerov - 2
Global Values not found
#50 opened by smrazaabbas - 2
Spark Worker not connected to Spark Master
#48 opened by Vzzarr - 1
GlusterFS?
#47 opened by antonkulaga - 2
latest version of hue and hue.ini in volume
#29 opened by antonkulaga - 1
java.net.UnknownHostException: namenode
#45 opened by C0mmander198 - 4
Binding 8081 port error when I execute scale up
#24 opened by VijayQin - 1
sparkContext not found in spark notebook
#36 opened by Henrilin28 - 2
docker-compose scale spark-worker=3 & spark-submit
#38 opened by Vzzarr - 3
command: ["./wait-for-it.sh"] for swarm mode
#39 opened by antonkulaga - 6
Switch spark-notebook to apache zeppelin
#40 opened by earthquakesan - 1
Question: how can i run hdfs command line commands such as "hdfs dfs fs -ls"
#41 opened by tomer-ben-david - 5
Q: How can you scale out on multiple hosts?
#42 opened by cmantas - 1
How do I get my Hadoop home directory?
#35 opened by akinmail - 12
How to enable Pyspark in Jupyter Notebook
#18 opened by agmistry - 2
Steps compatible to Windows?
#19 opened by prashantladha - 2
Any way to run this in swarm mode?
#31 opened by nirobayo - 4
MapReduce jobs cause Namenode to stop
#27 opened by georgiosgeorgiadis - 4
Copying files to HDFS
#28 opened by ashemery - 2
Failed on Connection Exception
#26 opened by spaghettifunk - 1
Hue is not working
#21 opened - 3
Broken hadoop package - missing GLIBC_2.14
#20 opened by marcino239 - 3
Connect spark-notebook to spark cluster
#17 opened by Miguel-Alonso - 3
Unable to view worker details or Jobs
#16 opened by brett--anderson
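Several of the issues above (#28, #41, #54) ask how to get files into HDFS and run `hdfs dfs` commands against the containerized cluster. A minimal sketch, assuming the namenode container is named `namenode` (the container name, file names, and HDFS paths here are assumptions; adjust to your compose setup):

```shell
# Copy a local file into the namenode container, then into HDFS.
docker cp ./data.csv namenode:/tmp/data.csv

# Run hdfs commands inside the container that has the Hadoop client.
docker exec -it namenode hdfs dfs -mkdir -p /user/root
docker exec -it namenode hdfs dfs -put /tmp/data.csv /user/root/
docker exec -it namenode hdfs dfs -ls /user/root
```

Note that the correct listing command is `hdfs dfs -ls <path>`; the form `hdfs dfs fs -ls` quoted in #41 mixes the `hdfs dfs` and `hadoop fs` entry points and will fail.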