The DataSHIELD Interface (DSI) defines a set of S4 classes and generic methods that can be implemented for accessing a data repository supporting the DataSHIELD infrastructure: controlled R commands executed on the server side guarantee that only non-disclosive information is returned to the client side.
Learn more about DataSHIELD.
The DSI classes are:
- DSObject: a common base class for all DSI classes,
- DSDriver: drives the creation of a connection object,
- DSConnection: allows the interaction with the remote server; DataSHIELD operations such as aggregation and assignment return a result object; DataSHIELD setup status checks can be performed (dataset access, configuration comparison),
- DSResult: wraps access to the result, which can be fetched either synchronously or asynchronously depending on the capabilities of the data repository server.
All classes are virtual: they cannot be instantiated directly and instead must be subclassed. See DSOpal for a reference implementation of DSI based on the Opal data repository.
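The virtual-class pattern described above can be sketched with the base `methods` package alone. This is a minimal illustration, not the DSI source: the class names `DemoObject`, `DemoDriver`, `ToyDriver`, `ToyConnection` and the generic `demoConnect` are hypothetical stand-ins chosen for this example.

```r
library(methods)

# Hypothetical stand-ins for a base class and a driver class; NOT the DSI definitions.
setClass("DemoObject", representation("VIRTUAL"))
setClass("DemoDriver", contains = c("DemoObject", "VIRTUAL"))

# A generic that each concrete driver is expected to implement (hypothetical name).
setGeneric("demoConnect", function(drv, name, ...) standardGeneric("demoConnect"))

# Concrete subclasses for an imaginary data repository.
setClass("ToyDriver", contains = "DemoDriver")
setClass("ToyConnection", contains = "DemoObject", slots = c(name = "character"))

setMethod("demoConnect", "ToyDriver", function(drv, name, ...) {
  new("ToyConnection", name = name)
})

# Instantiating a virtual class raises an error...
virtual_error <- tryCatch({
  new("DemoDriver")
  FALSE
}, error = function(e) TRUE)

# ...while the concrete subclass can be created and dispatches to its own method.
conn <- demoConnect(new("ToyDriver"), name = "server1")
```

A real implementation would subclass the DSI classes themselves and implement the DSI generics, as DSOpal does.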
These S4 classes and generic methods are meant to be used for implementing connections to a DataSHIELD-aware data repository.
In addition to these S4 classes, DSI provides functions to handle a list of remote data repository servers:
- datashield.login and datashield.logout make use of the DSDriver paradigm to create DSConnection objects to the data repositories,
- datashield.aggregate and datashield.assign perform typical DataSHIELD operations on DSConnection objects, the results of which are fetched through DSResult objects,
- datashield.connections, datashield.connections_default and datashield.connections_find manage the list of DSConnection objects that will be discovered and used by the client-side analytic functions.
- Other data management functions are provided by the DSConnection objects:
  - datashield.workspaces, datashield.workspace_save and datashield.workspace_rm manage R images of the remote DataSHIELD sessions (to speed up data analysis sessions),
  - datashield.symbols and datashield.symbol_rm offer minimal management of the R symbols living in the remote DataSHIELD sessions,
  - datashield.tables and datashield.table_status list the tables and their accessibility across a set of data repositories,
  - datashield.resources and datashield.resource_status list the resources and their accessibility across a set of data repositories,
  - datashield.pkg_status, datashield.method_status and datashield.methods are utility functions to explore the DataSHIELD setup across a set of data repositories.
These datashield.* functions are meant to be used by DataSHIELD package developers and users.
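A typical client session built from these functions might look like the sketch below. It assumes the DSI and DSOpal packages are installed and that an Opal server is reachable. The wrapper name `run_datashield_session` and its arguments are hypothetical, and `DSI::newDSLoginBuilder` is a DSI helper that is not described in the text above. The server URL, credentials, table name and aggregation expression are left as arguments because they depend entirely on the target repository and its DataSHIELD configuration.

```r
# Hypothetical wrapper around a login / aggregate / logout cycle.
# Nothing is executed until the function is called with real server details.
run_datashield_session <- function(url, user, password, table, expr) {
  # DSOpal is attached so that the "OpalDriver" class named below can be found.
  library(DSOpal)

  # Describe the server(s) to connect to; "server1" is an arbitrary label.
  builder <- DSI::newDSLoginBuilder()
  builder$append(server = "server1", url = url,
                 user = user, password = password,
                 table = table, driver = "OpalDriver")
  logins <- builder$build()

  # Open the connections and assign the table to symbol D on the server side.
  conns <- DSI::datashield.login(logins = logins, assign = TRUE, symbol = "D")
  # Always close the remote sessions, even if a later call fails.
  on.exit(DSI::datashield.logout(conns), add = TRUE)

  # List the symbols living in the remote sessions.
  symbols <- DSI::datashield.symbols(conns)

  # expr must be a call to an aggregation method allowed by the server
  # configuration (see datashield.methods); it is supplied by the caller.
  result <- DSI::datashield.aggregate(conns, expr = expr)

  list(symbols = symbols, result = result)
}
```

Passing the aggregation call in as `expr` (for example a `quote()`d call) keeps the sketch independent of which server-side DataSHIELD packages are installed.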
Some options can be set to modify the behavior of the DSI:
- datashield.env is the R environment in which the DSConnection object list is looked for. Default value is the Global Environment: globalenv().
- datashield.progress is a logical to enable the visibility of the progress bars. Default value is TRUE.
- datashield.progress.clear is a logical to make the progress bar disappear after it has been completed. Default value is FALSE.
- datashield.errors.stop is a logical to alter error handling behavior: if TRUE an error is raised when at least one server has failed, otherwise a warning message is issued. Default value is TRUE.
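These options are set with base R's `options()` and read with `getOption()`. The sketch below is only an illustration: the variable names (`ds_env`, `old`, and so on) are arbitrary, and the code merely sets and reads the option values without contacting any server.

```r
# A dedicated environment in which connection objects would be looked for.
ds_env <- new.env()

# options() returns the previous values, so they can be restored later.
old <- options(
  datashield.env = ds_env,           # search this environment instead of globalenv()
  datashield.progress = FALSE,       # hide progress bars
  datashield.progress.clear = TRUE   # only relevant when progress bars are shown
)

# Read the values back, as client code would do.
progress_during <- getOption("datashield.progress")
env_during <- getOption("datashield.env")

# Restore the previous settings (options that were unset become unset again).
options(old)
```

Saving and restoring the previous values keeps the change local to one analysis script.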