/s3mysqldump

dump mysql tables to s3

Primary LanguagePython

s3mysqldump

s3mysqldump is a tool to dump mysql tables to S3, so they can be consumed by Elastic MapReduce, etc.

Installation

From source:

python setup.py install

To export to CSV on s3

python s3mysqldump.py --convert-to-csv --use-mysql <db name> <table name> <s3 location> --db-host=<db host location> -m ~/.my.cnf --compress --header-output --header-plain

Where .my.cnf can contain [client] user=<username> password=<password>

The AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY should be set as environmental variables. 'boto.cfg' is the configure file for s3 connection which specifies things like aws credentials etc. One can pass -b boto.cfg to above command too

A Simple Example

The following command dumps 'user' table in 'db' database to s3 bucket s3://emr-storage/. 'my.cnf' specifies mysql parameters. 'boto.cfg' is the configure file for s3 connection which specifies things like aws credentials etc.

s3mysqldump -v --force -m my.cnf -s -b boto.cfg db user s3://emr-storage/user.sql