|BioSCI 1120 |Independent Research Project
The goal of this assignment is confirm that you have obtained a copy of your raw data and to provide basic descriptions of what will need to be done to this data (if anything) in order for it to be moved in to R. This assignment should also help you identify if you need to reformat the data so that you can do your analysis in R. This assignment therefore focuses on the rawest of the raw data you have to work with, warts and all.
Share a copy of the raw data or at least a screen grab of the spreadsheet(s). Then answer the following questions about your data set. Some of these questions might have very short answers (“No” or “does not apply”). Feel free to say “I don’t know” but follow up on this with your best guess on how you will deal with it. For insights into this process ask your PI, consult people in your lab, talk with other students, read the references below, and/or come to office horus.
If you are unsure about your answers to any of these we can discuss them. You should also consult these papers:
Broman, KW and K Woo. 2018. Data organization in spreadsheet. The American Statistician. https://doi.org/10.1080/00031305.2017.1375989
Goodman et al. 2014. Ten Simple Rules for the Care and Feeding of Scientific Data. PLoS Computational Biology. https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003542
Ellis, SE and JT Leek. 2018. How to share data for collaboration. The American Statistician. https://doi.org/10.1080/00031305.2017.1375987