OpenKBC/multiple_sclerosis_proj

Data/engineering preparation including github repository

swiri021 opened this issue · 3 comments

  • Setting Github repo
  • Data preparation in the local PC or AWS (S3 bucket)
  • Setup feather instead of CSV
  • Make test-set for gene expression data with random samples
  • Establishing engineering strategy (IDE, Jupyter notebook)
  • Please ask IAM for bucket access for raw data
  • Added reviewNB for pull request of jupyter notebook

IDE: VS code
Main analysis: Jupyter Notebook, R studio(?)