download input from sftp
Closed this issue · 6 comments
rkravinderkumar05 commented
Is there any way i can give input dataframe path as sftp server and metorikku downloads the file and use it?
lyogev commented
Yes.
If you're running metorikku with spark-submit use the following command:
spark-submit --packages com.springml:spark-sftp_2.11:1.1.5 --class com.yotpo.metorikku.Metorikku metorikku.jar -c config.yaml
Then in your config file in the input define:
input_sftp:
file:
path: /sample.csv
format: com.springml.spark.sftp
options:
host: SFTP_HOST
username: SFTP_USER
password: SFTP_PASSWORD
Check out the documentation of all available options here:
https://github.com/springml/spark-sftp
rkravinderkumar05 commented
lyogev commented
I think maybe the yaml isn't formatted correctly, can you send the full YAML?
rkravinderkumar05 commented
Hi, Please have a look.
sample.zip
lyogev commented
inputs:
movies:
file:
path: /home/movies.csv
format: com.springml.spark.sftp
options:
host: HOST
username: USER
password: PASSWD
lyogev commented
Please reopen if still not working