The Office 365 connector allows Dremio to connect to and query data in Office 365. This can then allow you to build custom reports, dashboards, or even just ad-hoc SQL via your client tool of choice. Note that it does require a third-party JDBC driver that is not free, but does allow a free trial.
Edit the contents of the file pom.xml:
- Find {VERSION} in pom.xml and replace it with your Dremio version, e.g. 25.1.1-202409260159070462-716c0676
The Advanced Relational Pushdown (ARP) Framework allows for the creation of Dremio plugins for any data source which has a JDBC driver and accepts SQL as a query language. It allows for a mostly code-free creation of a plugin, allowing for modification of queries issued by Dremio using a configuration file.
There are two files that are necessary for creation of an ARP-based plugin: the storage plugin configuration, which is code, and the plugin ARP file, which is a YAML (https://yaml.org/) file.
The storage plugin configuration file tells Dremio what the name of the plugin should be, what connection options should be displayed in the source UI, what the name of the ARP file is, which JDBC driver to use and how to make a connection to the JDBC driver.
The ARP YAML file is what is used to modify the SQL queries that are sent to the JDBC driver, allowing you to specify support for different data types and functions, as well as rewrite them if tweaks need to be made for your specific data source.
The ARP file is broken down into several sections:
metadata
- This section outlines some high level metadata about the plugin.
syntax
- This section allows for specifying some general syntax items like the identifier quote character.
data_types
- This section outlines which data types are supported by the plugin, their names as they appear in the JDBC driver, and how they map to Dremio types.
relational_algebra - This section is divided up into a number of other subsections:
- aggregation
- Specify what aggregate functions, such as SUM, MAX, etc, are supported and what signatures they have. You can also specify a rewrite to alter the SQL for how this is issued.
- except/project/join/sort/union/union_all/values
- These sections indicate if the specific operation is supported or not.
- expressions
- This section outlines general operations that are supported. The main sections are:
- operators
- Outlines which scalar functions, such as SIN, SUBSTR, LOWER, etc, are supported, along with the signatures of these functions which are supported. Finally, you can also specify a rewrite to alter the SQL for how this is issued.
- variable_length_operators
- The same as operators, but allows specification of functions which may have a variable number of arguments, such as AND and OR.
If an operation or function is not specified in the ARP file, then Dremio will handle the operation itself. Any operations which are indicated as supported but need to be stacked on operations which are not will not be pushed down to the SQL query.
- Run the following command in the root directory for the Connector (the connector that contains the pom.xml file):
mvn clean install
- Place the built JAR file (from the target folder) in the /jars/ directory of your Dremio installation. For example:
docker cp PATH\TO\dremio-office365-plugin-20.0.0.jar dremio_image_name:/opt/dremio/jars/
- Download and install the Office 365 JDBC Driver from CData* and copy the JAR file to the /jars/3rdparty/ directory of your Dremio installation. For example:
docker cp PATH\TO\cdata.jdbc.office365.jar dremio_image_name:/opt/dremio/jars/3rdparty/
- Restart Dremio
*Note: you will need a trial or paid license of the CData JDBC Driver to use the Driver in Dremio.