carlomazzaferro/kryptoflow

Docker port

giranntu opened this issue · 3 comments

When I am using Dockerfile created image,

Did I need to open the port or something?
Due to I encounter the situation that it can't connect the kafka server like below.(When I run reddit.py to parse data)

{'sentences': 'You are 100% right, prepared to get downvoted though, somehow this sub enjoys blatant manipulation.', 'polarity': 0.4588, 'sentence_count': 1, 'ts': '2018-04-28 09:44:44'}
sentences You are 100% right, prepared to get downvoted though, somehow this sub enjoys blatant manipulation. <class 'str'> <class 'str'>
polarity 0.4588 <class 'str'> <class 'float'>
sentence_count 1 <class 'str'> <class 'int'>
ts 2018-04-28 09:44:44 <class 'str'> <class 'str'>
ERROR HTTPConnectionPool(host='localhost', port=8081): Max retries exceeded with url: /subjects/reddit-value/versions (Caused by NewConnectionError('<requests.packages.urllib3.connection.HTTPConnection object at 0x7f42a7484550>: Failed to establish a new connection: [Errno 111] Connection refused',))

Sorry for misleading,
The current setting seems need to run data gathering without docker,
what if I need to run data gathering also in docker ?(since I only have server...)

I was actually working on getting the scrapers working within docker but got stuck with some (probably the same) networking issues you are facing. I will try resolving them shortly but tbh just installing the packages and running the python scripts was much easier. If you have command line access to the server I can guide you on how to install all the dependencies, which will be ultimately easier.

Thanks for working this amazing project,
I am starting working without docker for short term solution and it work well.
Since it's the company server I need properly do it in docker. :p

Thanks for the kindly support, will keep try on it.