We used this site to collaborate and share our results. Please feel free to explore. The results that made it into the final paper are in the section Finalizing below. Here are some useful links:


Finalizing

Analysis

Supervised method learning: finalizing on data holiding out validation samples

Supervised method learning: building the analysis approach

Approaches to fitting cyclical trend in gene expression data

Cell cycle signal in gene expression data

  1. We investigated cell cycle signals in the sequencing data alone.
  2. We then assign categorical labels of cell cycle and explored the expresson profiles of these categories.
  3. We inferred an angle for each cell on a unit circle using FUCCI intensities alone.
  4. I used nonparametric methods to identify genes that may be cyclical along cell cycle phases.
    • Fit smash and kernel regression on circular variables on a subset of genes with detection rate > .8.
    • Fit trendfilter on a subset of genes (5) that are observed (visually) to have cyclical pattern. trendfilter is robust to small proportion of undetected cells, approx 2 or 3%. In cases of simulation when increasing proportion of undetected cells to 20%, we observed a flat line in gene expression for genes previously identified to tend to a cyclical pattern.
    • Next, we fit trendfilter on all genes after transforming the data to follow standard normal distribution, permutation-based p-values for PVE are used to select 101 significant cyclical genes.
  5. Additional analysis done to identify top cyclical genes in each individual. The top 5 are not shared across the six individuals. Results

RNA-seq data preprcessing

  1. The first step in preprocessing RNA-seq data consists of QC and filtering.
  2. We then analyzed and corrected for batch effect due to C1 plate in the sequencing data

Microscopy image analysis

We evaluated and pre-processed the results of image analysis as follows:

  1. We visually inspect images deteced to have none or more than one nucleus. For cases that are inconsistent with visual inspection, we correct the number of nuclei detected.
  2. We applied background correction to the intensity measurements of GFP, RFP and DAPI based on the following analyses.
  3. We analyzed intensity variation across individuals and batches and considers approaches for removing batch effects in the data.
  4. We investigated the cell time estimates based on FUCCI intensities.

This R Markdown site was created with workflowr