
This repo houses all material related to the CSC 525 course project


CSC 525 Project

Our idea is to characterize RNA-binding proteins related to Alzheimer's disease (AD). The idea is still in its infancy and needs to be fleshed out.

Proposals (from Derek's CSC-578B course)

Note from Derek: This is here as a guideline to help us build our proposal. I think a really good proposal will save us a lot of time later on figuring out what we have to do.

It is tempting to treat the proposal as a gimme and not put effort into it. And yet, in MY view, it is the most critical part of the project. Spend some serious time on it (I would say 5-10 hours).

Your proposal should answer the following questions:

  • Which existing study or studies are you replicating or building on? Even for a new study, you should have a clear sense of other approaches to this problem. Your life will be easier if you have a clear guide. Don't worry if it feels like "cheating" because the original paper was so clear and the data so accessible. We can easily add complexity; it is very hard to take it away.
  • The dataset you are using, and the experiments you have done with it. Don't just trust that the paper URL is still there, or that the data is accessible or useful. Download it, load it into R/Jupyter, and do some simple experiments.
  • Any other tools you will need, how well you know them, and what they cost. For example, you might require a .NET component that the original study used, but you do not have a Windows machine. Or you will use a DL approach that requires Google TPUs to train.
  • The research questions you are trying to answer. What is the contribution your paper will make? I suggest going back to your answer to the first question above and checking what those papers say was hard, what they suggest as interesting new directions, and which questions they didn't have time to answer. You might also find gaps in the original analysis.
  • A rough sense of the methodology you will follow and who will be doing each task. At the very least there will be writing the paper, creating the video, running the analysis, processing/preparing the data, writing analysis code, and reading relevant background papers. Start outlining that.
  • Relative to the methodology, specify exactly what the workflow is: data sources, filtering criteria, data science algorithms to use, analysis validation etc. Don't underestimate how long it takes to acquire data and write analysis code.
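
As a rough illustration of the "download it, load it, and do some simple experiments" step, here is a minimal Python sketch that could be run in a Jupyter notebook. Everything specific in it is an assumption made for illustration: the inline `TOY_CSV` table, its column names (`gene`, `sample`, `expression`), and the filtering criterion are hypothetical stand-ins, not our actual dataset or workflow. It only uses the standard library so it runs anywhere.

```python
import csv
import io

# Hypothetical stand-in for a downloaded file. In practice, replace the
# StringIO handle below with open("<path to the real dataset>").
TOY_CSV = """gene,sample,expression
GENE_A,s1,5.2
GENE_A,s2,
GENE_B,s1,0.0
GENE_B,s2,3.1
"""


def load_rows(handle):
    """Read a CSV file handle into a list of dicts, one per row."""
    return list(csv.DictReader(handle))


def sanity_check(rows):
    """Report row count, column names, and empty cells per column."""
    columns = list(rows[0].keys()) if rows else []
    missing = {
        col: sum(1 for row in rows if not (row[col] or "").strip())
        for col in columns
    }
    return {"n_rows": len(rows), "columns": columns, "missing_per_column": missing}


def drop_missing(rows, column):
    """Example filtering criterion (an assumption): keep rows with a value in `column`."""
    return [row for row in rows if (row[column] or "").strip()]


rows = load_rows(io.StringIO(TOY_CSV))
report = sanity_check(rows)
filtered = drop_missing(rows, "expression")

print(report)
print(len(filtered), "rows remain after filtering")
```

Even a check this small confirms that the file actually loads, shows how much of it is missing, and forces us to write down one concrete filtering rule, which is the kind of detail the workflow bullet asks for.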