MSKCC-Epi-Bio/gnomeR

Add annotation of point mutations in create gene binary pipeline

Opened this issue · 3 comments

We can require this as a preprocessing step, or add an arg in create_gene_binary(). We will think on it more and maybe ask Caroline and Alli for preference.

Maybe we want to do fuzzy matching to throw warnings when their selected mutation doesn't exist.

@hfuchs5 to get test data for this and maybe assign to someone else.

Explore adding an argument to create_gene_binary() to filter and add an event column for a specific point mutation (if found in their file). If not found, throw message.

Notes:

  • Make sure periods in hugo symbol/ point mutation name are handled correctly (eg. does it conflict with .fus, .Del etc)
  • Make sure the hugo symbol is identifiable within point mutation column name
  • perhaps make them name them "PM etc."