Please note that this is an open source project which is not officially supported by Exasol. We will try to help you as much as possible, but can't guarantee anything since this is not an official Exasol product.
This project contains SQL scripts for automatically importing data from various data management systems into Exasol.
You'll find SQL scripts which you can execute on Exasol to load data from certain databases or database management systems. The scripts try to extract the meta data from the source system and create the appropriate IMPORT statements automatically so that you don't have to care about table names, column names and types.
If you want to optimize existing scripts or create new scripts for additional systems, we would be very glad if you share your work with the Exasol user community.
Azure SQL is essentially Microsoft SQL Server. You need to specify a DB you are working on in your connection-string. See script azure_sql_to_exasol.sql
The method of importing a CSV file depends on the location of the file.
- Import a file stored on your local machine via EXAplus:
IMPORT INTO <table> FROM LOCAL CSV FILE '<filename>' <options>;
Example:
IMPORT INTO MY_SCHEMA.MY_TABLE
FROM LOCAL CSV FILE 'C:\Users\my_user\Downloads\data.csv'
COLUMN SEPARATOR = ','
COLUMN DELIMITER = '"'
ROW SEPARATOR = 'CRLF' -- CR when file was generated on a unix systems, CRLF when created on windows
SKIP = 1 -- skip the header
;
-
Import from HDFS: See SOL-322
-
Import from S3: See Exasol-1774 for single file import, for importing multiple files scroll down to S3
For more details on IMPORT
see paragraph 2.2.2 in the User Manual. For further help on typical CSV-formatting issues, see
- How to load bad CSV files
- Proper csv export from MySQL
- Proper csv export from IBM DB2
- Proper csv export from Oracle
- Proper csv export from PostgreSQL
- Proper csv export from Microsoft SQL Server
See script db2_to_exasol.sql
Step by Step guide:
- create connection to the Exasol database you want to import from
- create the exasol_to_exasol.sql script
- adapt the variables for the execute script accordingly to your scenario and run the statement
- depending on your system and the amount of tables this might take a few seconds
- copy the result set to another session and execute the statement in the output order
This script will generate the following information:
- create schema
- create table with primary keys
- alter table add foreign keys
- alter table set partion by
- alter table set distribution keys
- import data
See script exasol_to_exasol.sql for more information!
In order to connect Exasol to Google BigQuery you need to carry out the steps outlined in Connecting Google BigQuery to Exasol.
Now, test the connectivity with a simple query:
SELECT *
FROM (
IMPORT FROM JDBC AT <name_of_connection>
STATEMENT 'SELECT 1'
);
For the actual data-migration, see script bigquery_to_exasol.sql
Note: Due to the lack of an alternative datatype, the following Google BigQuery datatypes; DATE
,DATETIME
,TIMESTAMP
and ARRAY
are stored as VARCHAR.
Create a connection:
CREATE CONNECTION <name_of_connection>
TO 'jdbc:mysql://192.168.137.5:3306'
USER '<user>'
IDENTIFIED BY '<password>';
Test your connection:
SELECT * FROM
(
IMPORT FROM JDBC AT <name_of_connection>
STATEMENT 'SELECT 42 FROM DUAL'
);
Then you're ready to use the migration script: mysql_to_exasol.sql
The first thing you need to do is add the IBM Netezza JDBC driver to Exasol. Since Netezza has run out of support in June 2019, the JDBC-driver (nzjdbc3.jar
) can no longer be found on the official JDBC Download-page of IBM. Anyhow, the driver can be found within your Netezza distribution under following path:
nz/kit.version_number/sbin/nzjdbc3.jar (eg. nz/kit.7.2.1.0/sbin/nzjdbc3.jar)
In order to add the driver to Exasol log into your EXAOperations, select the 'Software'-, then 'JDBC Drivers'-Tab.
Click Add, then specify the following details:
- Driver Name:
Netezza
- Main Class:
org.netezza.Driver
- Prefix:
jdbc:netezza:
- Disable Security Manager:
Check this box
After clicking Apply, you will see the newly added driver's details on the top section of the driver list. Select the Netezza driver by locating the nzjdbc3.jar and upload it. When done the .jar file should be listed in the files column for the IBM Netezza driver.
The standard port for Netezza is 5480
.
The Connection-String should look like the following:
"jdbc:netezza://'host_ip':'port'/Database-Name" (User-ID, Password)
(e.g. jdbc:netezza://127.0.0.1:5480/SYSTEM, User-ID: ADMIN, Password: Password)
To test the connectivity of Exasol to your Netezza Instance create the following connection in your SQL-client:
CREATE OR REPLACE CONNECTION <name_of_connection>
TO 'jdbc:netezza://<host_name>:<port>'
USER '<netezza_username>'
IDENTIFIED BY '<netezza_password>';
You need to have CREATE CONNECTION privilege granted to the user in order to do this.
Test the connectivity with a simple query like:
SELECT *
FROM (
IMPORT FROM JDBC AT netezza_connection
STATEMENT 'SELECT 1 as "sucessfully_connected" from _v_dual '
);
For the actual data-migration, see script netezza_to_exasol.sql
When importing from Oracle, you have two options. You could import via JDBC or the native Oracle interface (OCI).
-
OCI: Log in to EXAoperation. Go to Configuration -> Software. Download the instant client from Oracle and select it at
Software Update File
. ClickSubmit
to upload.Create a connection:
CREATE CONNECTION <name_of_connection> TO '192.168.99.100:1521/xe' USER '<user>' IDENTIFIED BY '<password>';
-
JDBC: If you are using the community edition, you need to upload a JDBC driver in EXAoperation before being able to establish a connection, see SOL-179.
Create a connection:
CREATE CONNECTION <name_of_connection> TO 'jdbc:oracle:thin:@//192.168.99.100:1521/xe' USER '<user>' IDENTIFIED BY '<password>';
Test your connection:
SELECT * FROM
(
IMPORT FROM <conn_type> AT <name_of_connection>
STATEMENT 'SELECT 42 FROM DUAL'
);
<con_type>
is eitherJDBC
or ORA
, depending on your connection
Then you're ready to use the migration script: oracle_to_exasol.sql
See script postgres_to_exasol.sql
See script redshift_to_exasol.sql
The script s3_to_exasol.sql looks different than the other import scripts. It's made to load data from S3 in parallel and needs some preparation before you can use it. See our documentation for detailed instructions. If you just want to import a single file, see 'Import from CSV' above.
The first thing you need to do is add the SAP Hana JDBC driver to Exasol. The JDBC driver is located in your local SAP Hana Installation folder:
- eg: (C:\Program Files\sap\ngdbc.jar) on Microsoft Windows platforms
- eg: (/usr/sap/ngdbc.jar) on Linux and UNIX platforms
In order to add the driver to Exasol log into your EXAOperations, select the 'Software', then 'JDBC Drivers'-Tab.
Click Add then specify the following details:
- Driver Name:
SAP
- Main Class:
com.sap.db.jdbc.Driver
- Prefix:
jdbc:sap:
- Disable Security Manager:
Check this box
After clicking Apply, you will see the newly added driver's details on the top section of the driver list. Select the SAP Hana driver by locating the ngdbc.jar and upload it. When done the .jar file should be listed in the files column for the SAP Hana driver.
You can find a detailed information about configuring the SAP Hana driver at the following link: https://help.sap.com/viewer/52715f71adba4aaeb480d946c742d1f6/2.0.00/en-US/ff15928cf5594d78b841fbbe649f04b4.html
The standard port number format for different instances 3NN15, NN- represents the instance number of HANA system to be used in client tools. (eg. 31015 Instance No 10, 30015 Instance No 00) In order to find out the instance number of your Hana-System type the following into your console: /usr/sap/HXE and press Autocomplete by tab. The Instance-Number should be displayed by the number of the HDB(XX)-File (eg. HDB90) Insert the number into your port number (-> eg. 39015)
The Connection-String should look like the following: "jdbc:sap://'host_ip':'port'/" (User-ID: SYSTEM, Password: Your password)
To test the connectivity of Exasol to your SAP Instance create the following connection in your SQL-client:
CREATE OR REPLACE CONNECTION <name_of_connection>
TO 'jdbc:sap://<host_name>:<port>'
USER '<sap_username>'
IDENTIFIED BY '<sap_password>';
You need to have CREATE CONNECTION privilege granted to the user used to do this.
Now, test the connectivity with a simple query like:
SELECT *
FROM (
IMPORT FROM JDBC AT <name_of_connection>
STATEMENT 'SELECT 1 from dummy'
);
For the actual data-migration, see script sap_hana_to_exasol.sql
See script sqlserver_to_exasol.sql
The first thing you need to do is add the Teradata JDBC driver to Exasol. The driver can be downloaded from Teradata's Download site. You need to register first, it's free. Make sure that you download the right version of the JDBC driver, matching the version of the Teradata database.
The downloaded package contains the file:
terajdbc4.jar
contains the actual Java classes of the driver
This file needs to be uploaded when you add the Teradata JDBC driver for Exasol. To do this, log into EXAoperations, then select Software, then the JDBC Drivers tab.
Click Add
then specify the following details:
- Driver Name:
Teradata
(or something similar) - Main Class:
com.teradata.jdbc.TeraDriver
- Prefix:
jdbc:teradata:
- Comment:
Version 15.10
(or something similar)
After clicking Apply
, you will see the newly added driver's details on the top section of the driver list.
Select the Teradata driver (the radio button in the first column) and then locate the terajdbc4.jar
and upload it.
Next step is to test the connectivity. First, create a connection to the remote Teradata database:
CREATE OR REPLACE CONNECTION <name_of_connection>
TO 'jdbc:teradata://<host_name_or_ip_address>'
USER '<td_username>'
IDENTIFIED BY '<td_password>';
You need to have CREATE CONNECTION
privilege granted to the user used to do this.
Additional JDBC connection parameters (such as CHARSET
might need to be specified in the connection string/URL); see information
on these here.
Now, test the connectivity with a simple query:
SELECT *
FROM (
IMPORT FROM JDBC AT <name_of_connection>
STATEMENT 'SELECT 1'
);
For the actual data-migration, see script teradata_to_exasol.sql
See script vectorwise_to_exasol.sql
See script vertica_to_exasol.sql
This folder contains scripts that can be used after having imported data from another database via the scripts above. What they do:
- Optimize the column's datatypes to minimize storage space on disk
- Import primary keys from other databases
This folder contains a script that can be used if you want to import data on a regular basis. What it does:
- Import only data that hasn't been imported yet by performing a delta import based on a given column (further explaination inside the folder)