Mu-Sigma/analysis-pipelines
Enables data scientists to compose pipelines of analysis which consist of data manipulation, exploratory analysis & reporting, as well as modeling steps. Data scientists can use tools of their choice through an R interface, and compose interoperable pipelines between R, Spark, and Python.
R
NOASSERTION
Issues
Resolve WARN issues in CRAN checks occurring due to the updated version of tibble (3.0.0 +)
#58 opened by naren1991 - 1
analysisPipelines for R 3.3.3
#57 opened by ravin2309 - 3
Outputs saved from pipeline saved as RDS are significantly larger than they should be
#56 opened by husamrahman - 2
Feature Request: Be able to pass multiple outputs from previous steps into a list to be used in another step
#52 opened by husamrahman - 0
Have the ability to name each step in the pipeline and use that in complex pipelines
#51 opened by husamrahman - 2
User-Defined Functions Not being saved
#49 opened by husamrahman - 1
Export a pipeline to a BPMN flow
#14 opened by musigma-admin - 19
Name for the package
#13 opened by musigma-admin - 5
Add functions to get different details from the object - recipes, registry, outputs etc.
#6 opened by musigma-admin - 3
Resolve vignette issues
#10 opened by musigma-admin - 3
Review package design
#1 opened by musigma-admin - 6
Document all the eda utils functions
#2 opened by musigma-admin - 9
Base functionality to work with Spark and Spark data frames including structured streaming
#5 opened by musigma-admin - 6
Add a vignette with a meta-pipeline
#36 opened by musigma-admin - 3
In core functions, add a check for presence of SparkR, before calling SparkR functions in the batch execution code
#39 opened by musigma-admin - 3
Change the design for `isDataFunction` handling to be consistent with `outAsIn` and enable branching of pipelines
#40 opened by musigma-admin - 5
Function registration - defaults not stored
#33 opened by musigma-admin - 2
When a function definition is updated, on 'registerFunction', the metadata in the registry is not updated
#35 opened by musigma-admin - 2
Add logging capabilities, such that errors, warnings and so on can be printed on console or written to file
#29 opened by musigma-admin - 2
Have a common registry across object instances of the Analysis Pipeline classes
#31 opened by musigma-admin - 1
Add exception handling mechanisms
#19 opened by musigma-admin - 2
Problems with loading "analysisRecipes" package before the sparkR read stream
#16 opened by musigma-admin - 9
Add support for Analysis Pipelines containing both R and SparkR batch functions
#17 opened by musigma-admin - 2
Integration with 'recipes' package
#4 opened by musigma-admin