workflowr
Last updated: 2025-01-07
Checks: 7
0
Knit directory: analysis-user-group/
This reproducible R Markdown
analysis was created with workflowr (version
1.7.1). The Checks tab describes the reproducibility checks
that were applied when the results were created. The Past
versions tab lists the development history.
Great! Since the R Markdown file has been committed to the Git
repository, you know the exact version of the code that produced these
results.
Great job! The global environment was empty. Objects defined in the
global environment can affect the analysis in your R Markdown file in
unknown ways. For reproduciblity it’s best to always run the code in an
empty environment.
The command set.seed(1337)
was run prior to running the
code in the R Markdown file. Setting a seed ensures that any results
that rely on randomness, e.g. subsampling or permutations, are
reproducible.
Nice! There were no cached chunks for this analysis, so you can be
confident that you successfully produced the results during this
run.
Great job! Using relative paths to the files within your workflowr
project makes it easier to run your code on other machines.
Great! You are using Git for version control. Tracking code development
and connecting the code version to the results is critical for
reproducibility.
The results in this page were generated with repository version
0ecb65d .
See the Past versions tab to see a history of the changes made
to the R Markdown and HTML files.
Note that you need to be careful to ensure that all relevant files for
the analysis have been committed to Git prior to generating the results
(you can use wflow_publish
or
wflow_git_commit
). workflowr only checks the R Markdown
file, but you know if there are other scripts or data files that it
depends on. Below is the status of the Git repository when the results
were generated:
Ignored files:
Ignored: .DS_Store
Ignored: analysis/.DS_Store
Unstaged changes:
Modified: workflow.R
Note that any generated files, e.g. HTML, png, CSS, etc., are not
included in this status report because it is ok for generated content to
have uncommitted changes.
These are the previous versions of the repository in which changes were
made to the R Markdown (analysis/0_start.Rmd
) and HTML
(docs/0_start.html
) files. If you’ve configured a remote
Git repository (see ?wflow_git_remote
), click on the
hyperlinks in the table below to view the files as they were in that
past version.
File
Version
Author
Date
Message
Rmd
0ecb65d
DrThomasOneil
2025-01-07
Updated Getting Started in R. Chapter 1 completed
html
c893d70
DrThomasOneil
2025-01-06
Build site.
Rmd
8eec2ce
DrThomasOneil
2025-01-06
Initial Deployment
html
660b0f8
DrThomasOneil
2025-01-06
Build site.
html
2e79a1d
DrThomasOneil
2025-01-06
Build site.
Rmd
451a21f
DrThomasOneil
2025-01-06
Initial Deployment
Rmd
2eeb8dc
DrThomasOneil
2024-12-26
first draft
This 10-session workshop series is designed to introduce users to R
programming with a focus on bioinformatics workflows and
reproducibility. It emphasizes the mindset shift required to transition
from manual tools like Excel to programmatic data analysis.
Start
Tutorial →
Session 0: What Is Programming? Shifting
Mindsets
Goals:
Introduce programming concepts and the mindset behind programmatic
workflows.
• Transition from manual tools (e.g. Excel & Prism) to
programming-based workflows.
• Understand the concept of handling data using code.
• Learn to navigate directories and file paths programmatically.
• Emphasize reproducibility and structured workflows.
Go
to Chapter →
Chapter 1: Getting Started with R and RStudio
Goal: Set up R and RStudio, and get familiar with
the interface.
Installing R and RStudio.
Exploring the interface – Console, Environment, Scripts.
Setting up a project.
Installing and loading packages.
Practice: - Create and save an R script. - Install
and load a package (e.g., tidyverse
).
Go
to Chapter →
Chapter 2: Data Types and Structures
Goal: Learn about data types and structures in
R.
Basic data types: numeric, character, logical.
Data structures: vectors, matrices, data frames, lists.
Importing data (Excel and CSV files).
Viewing and summarizing data.
Practice: - Load flow cytometry data and explore
structure. - Create and index vectors and data frames.
Go
to Chapter →
Chapter 3: Basic Programming and Control
Structures
Goal: Understand programming logic for automating
tasks.
Variables and assignments.
Conditional statements (if
, else
).
Loops (for
, while
).
Writing functions.
Practice: - Write functions to calculate percentages
dynamically. - Automate filtering tasks.
Go
to Chapter →
Chapter 4: Data Manipulation with
dplyr
Goal: Introduce dplyr
for filtering,
mutating, and summarizing data.
Filtering rows (filter()
), selecting columns
(select()
).
Adding new columns (mutate()
).
Grouping and summarizing (group_by()
+
summarize()
).
Practice: - Calculate proportions for CD4/CD8
populations. - Add calculated columns for analysis.
Go
to Chapter →
Chapter 5: Visualization with ggplot2
– Part
1
Goal: Create and customize visualizations.
Basics of ggplot2
– scatter, bar, and boxplots.
Customizing labels, themes, and colors.
Practice: - Visualize CD4 vs CD8 proportions. - Add
colors and themes.
Go
to Chapter →
Chapter 6: Advanced Visualization with ggplot2
– Part 2
Goal: Learn advanced visualization techniques.
Faceting and small multiples.
Combining plots (grid layouts).
Saving high-resolution plots.
Practice: - Create faceted boxplots for subsets. -
Export plots for reports.
Go
to Chapter →
Chapter 7: Statistical Analysis
Goal: Understand descriptive and inferential
statistics.
Descriptive statistics: mean, median, mode.
Hypothesis testing (t-tests, ANOVA).
Correlation and regression.
Practice: - Test differences in CD4 proportions. -
Perform correlation analysis.
Go
to Chapter →
Chapter 8: Reproducible Reports with RMarkdown
Goal: Build dynamic and shareable reports.
Introduction to RMarkdown.
Combining text, code, and visuals.
Exporting to PDF and HTML.
Practice: - Create a report summarizing flow
cytometry data. - Embed visualizations and tables.
Go
to Chapter →
Chapter 9: Mini Project – Putting It All
Together
Goal: Apply skills to a complete workflow.
Practice: - Load and clean data. - Summarize and
visualize trends. - Run tests and compile everything into a report.
Go
to Chapter →
Chapter 10: Troubleshooting and Workflow
Design
Goal: Teach debugging strategies and workflow
optimization.
Debugging errors and warnings.
Writing modular scripts (functions).
Organizing larger projects.
Brief intro to Git/GitHub for version control.
Practice: - Debug a script with errors. -
Restructure code for reusability.
Go
to Chapter →
Session information
R version 4.4.0 (2024-04-24)
Platform: aarch64-apple-darwin20
Running under: macOS Sonoma 14.3
Matrix products: default
BLAS: /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/lib/libRblas.0.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/lib/libRlapack.dylib; LAPACK version 3.12.0
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
time zone: Australia/Sydney
tzcode source: internal
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] workflowr_1.7.1
loaded via a namespace (and not attached):
[1] vctrs_0.6.5 httr_1.4.7 cli_3.6.3 knitr_1.49
[5] rlang_1.1.4 xfun_0.49 stringi_1.8.4 processx_3.8.4
[9] promises_1.3.2 jsonlite_1.8.9 glue_1.8.0 rprojroot_2.0.4
[13] git2r_0.35.0 htmltools_0.5.8.1 httpuv_1.6.15 ps_1.8.1
[17] sass_0.4.9 fansi_1.0.6 rmarkdown_2.29 tibble_3.2.1
[21] jquerylib_0.1.4 evaluate_1.0.1 fastmap_1.2.0 yaml_2.3.10
[25] lifecycle_1.0.4 whisker_0.4.1 stringr_1.5.1 compiler_4.4.0
[29] fs_1.6.5 pkgconfig_2.0.3 Rcpp_1.0.13-1 rstudioapi_0.17.1
[33] later_1.4.1 digest_0.6.37 R6_2.5.1 utf8_1.2.4
[37] pillar_1.9.0 callr_3.7.6 magrittr_2.0.3 bslib_0.8.0
[41] tools_4.4.0 cachem_1.1.0 getPass_0.2-4
---
title: "Getting Started in R (2025)"
author: "Thomas R. O'Neil"
output:
  workflowr::wflow_html:
    toc: no
---

```{html, include=F}
<script>
document.addEventListener('DOMContentLoaded', function () {
  const checkboxes = document.querySelectorAll('input[type="checkbox"]');
  checkboxes.forEach((checkbox) => {
    checkbox.checked = localStorage.getItem(checkbox.id) === 'true';
    checkbox.addEventListener('change', () => {
      localStorage.setItem(checkbox.id, checkbox.checked);
    });
  });
});
</script>
```

This 10-session workshop series is designed to introduce users to R programming with a focus on bioinformatics workflows and reproducibility. It emphasizes the mindset shift required to transition from manual tools like Excel to programmatic data analysis.

[Start Tutorial →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/index.html)

<wimr>

## **Session 0: What Is Programming? Shifting Mindsets**

<div class="hint-goals">    
**Goals:**    
Introduce programming concepts and the mindset behind programmatic workflows.   
&nbsp;&nbsp;&#8226; Transition from manual tools (e.g. Excel & Prism) to programming-based workflows.  
&nbsp;&nbsp;&#8226; Understand the concept of handling data using code.  
&nbsp;&nbsp;&#8226; Learn to navigate directories and file paths programmatically.  
&nbsp;&nbsp;&#8226; Emphasize reproducibility and structured workflows.    
</div>

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/introduction.html)

<wimr>

## **Chapter 1: Getting Started with R and RStudio**
**Goal:** Set up R and RStudio, and get familiar with the interface.

- Installing R and RStudio.
- Exploring the interface – Console, Environment, Scripts.
- Setting up a project.
- Installing and loading packages.

**Practice:**
- Create and save an R script.
- Install and load a package (e.g., `tidyverse`).

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-1-getting-started-in-R-and-RStudio.html)

<wimr>

## **Chapter 2: Data Types and Structures**
**Goal:** Learn about data types and structures in R.

- Basic data types: numeric, character, logical.
- Data structures: vectors, matrices, data frames, lists.
- Importing data (Excel and CSV files).
- Viewing and summarizing data.

**Practice:**
- Load flow cytometry data and explore structure.
- Create and index vectors and data frames.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-2-data-types-and-structures.html)

<wimr>

## **Chapter 3: Basic Programming and Control Structures**
**Goal:** Understand programming logic for automating tasks.

- Variables and assignments.
- Conditional statements (`if`, `else`).
- Loops (`for`, `while`).
- Writing functions.

**Practice:**
- Write functions to calculate percentages dynamically.
- Automate filtering tasks.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-3-basic-programming.html)

<wimr>

## **Chapter 4: Data Manipulation with `dplyr`**
**Goal:** Introduce `dplyr` for filtering, mutating, and summarizing data.

- Filtering rows (`filter()`), selecting columns (`select()`).
- Adding new columns (`mutate()`).
- Grouping and summarizing (`group_by()` + `summarize()`).

**Practice:**
- Calculate proportions for CD4/CD8 populations.
- Add calculated columns for analysis.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-4-data-manipulation-with-dplyr.html)

<wimr>

## **Chapter 5: Visualization with `ggplot2` – Part 1**
**Goal:** Create and customize visualizations.

- Basics of `ggplot2` – scatter, bar, and boxplots.
- Customizing labels, themes, and colors.

**Practice:**
- Visualize CD4 vs CD8 proportions.
- Add colors and themes.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-5-data-visualisation-with-ggplot2.html)

<wimr>

## **Chapter 6: Advanced Visualization with `ggplot2` – Part 2**
**Goal:** Learn advanced visualization techniques.

- Faceting and small multiples.
- Combining plots (grid layouts).
- Saving high-resolution plots.

**Practice:**
- Create faceted boxplots for subsets.
- Export plots for reports.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-6-advanced-data-visualisation-with-ggplot2.html)

<wimr>

## **Chapter 7: Statistical Analysis**
**Goal:** Understand descriptive and inferential statistics.

- Descriptive statistics: mean, median, mode.
- Hypothesis testing (t-tests, ANOVA).
- Correlation and regression.

**Practice:**
- Test differences in CD4 proportions.
- Perform correlation analysis.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-7-statistical-analysis.html)

<wimr>

## **Chapter 8: Reproducible Reports with RMarkdown**
**Goal:** Build dynamic and shareable reports.

- Introduction to RMarkdown.
- Combining text, code, and visuals.
- Exporting to PDF and HTML.

**Practice:**
- Create a report summarizing flow cytometry data.
- Embed visualizations and tables.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-8-reproducible-reports-with-rmarkdown.html)

<wimr>

## **Chapter 9: Mini Project – Putting It All Together**
**Goal:** Apply skills to a complete workflow.

**Practice:**
- Load and clean data.
- Summarize and visualize trends.
- Run tests and compile everything into a report.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-9-mini-project-putting-it-all-together.html)

<wimr>

## **Chapter 10: Troubleshooting and Workflow Design**
**Goal:** Teach debugging strategies and workflow optimization.

- Debugging errors and warnings.
- Writing modular scripts (functions).
- Organizing larger projects.
- Brief intro to Git/GitHub for version control.

**Practice:**
- Debug a script with errors.
- Restructure code for reusability.

[Go to Chapter →](https://drthomasoneil.github.io/analysis-user-group/r-tutorial/_book/chapter-10-troubleshooting-and-workflow-design.html)

<wimr>


