Last updated: 2019-04-29
Checks: 2 0
Knit directory: MSTPsummerstatistics/
This reproducible R Markdown analysis was created with workflowr (version 1.3.0). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.
Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.
Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility. The version displayed above was the version of the Git repository at the time these results were generated.
Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish
or wflow_git_commit
). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
Ignored files:
Ignored: .Rproj.user/
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
These are the previous versions of the R Markdown and HTML files. If you’ve configured a remote Git repository (see ?wflow_git_remote
), click on the hyperlinks in the table below to view them.
File | Version | Author | Date | Message |
---|---|---|---|---|
Rmd | 22ae3cd | Anthony Hung | 2019-04-29 | Add HMM file |
html | e746cf5 | Anthony Hung | 2019-04-28 | Build site. |
Rmd | 133df4a | Anthony Hung | 2019-04-28 | introR |
html | 133df4a | Anthony Hung | 2019-04-28 | introR |
html | 22b3720 | Anthony Hung | 2019-04-26 | Build site. |
html | ddb3114 | Anthony Hung | 2019-04-26 | Build site. |
html | 413d065 | Anthony Hung | 2019-04-26 | Build site. |
html | 6b98d6c | Anthony Hung | 2019-04-26 | Build site. |
html | 602e0f9 | Anthony Hung | 2019-04-25 | Build site. |
Rmd | ecc06a5 | Anthony Hung | 2019-04-24 | Add syllabus: |
html | ecc06a5 | Anthony Hung | 2019-04-24 | Add syllabus: |
Rmd | 2459910 | Anthony Hung | 2018-09-28 | Update all webpages |
html | 2459910 | Anthony Hung | 2018-09-28 | Update all webpages |
A thorough understanding of statistics is essential for both experimental design and data analysis. Too often, time and resources are wasted due to a poor understanding of sample size and power calculations, and the reliability of scientific reports has repeatedly been scrutinized in recent years due to questionable, if not fraudulent, application of statistical tests.
In an era of increasingly accessible computational tools, big data, and an emphasis on open-access databases, the need for rigorous training in statistical methods has become more important than ever. As a requirement for entry into Pritzker, all MSTP students must have taken a statistics or biomathematics course in college. At present, the biostatics course required for all MSTP trainees is very similar in content to what is covered in most introductory college statistics courses, making it unappealing and ineffective for most students. However, many of us agree that further training in statistics would be a valuable resource and would help us feel more confident about the work we are doing.
Foundations of theoretical and applied understanding of probability and statistics
Working knowledge of how to use R
Deeper understanding of statistical tests specific to research area
Connect students with resources to seek out statistical advising in the future
Data analysis project using real data
Week 1: Why statistics matters; Intro to R; Probability and distributions
Week 2: Review of basic statistical tests
Week 3: PCA, t-SNE, bootstrapping and machine learning
Week 4: Markov Chains; Hidden Markov Models; Linear Regression; Mutliple Testing
Weeks 5-9: Breakouts within specific programs
Work with PIs or senior grad students in same research area
Handle real data and work hands-on with analysis tools used by people in the field
Prepare a presentation with results of statistical analyses
Week 10 : Data presentations
Thank you very much to Frank Wen and Katie Lee, who started the course and gave helpful advice.