Last updated: 2018-10-04
workflowr checks: (Click a bullet for more information)wflow_publish
to commit the R Markdown file and build the HTML.
wflow_publish
or wflow_git_commit
). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
Ignored files:
Ignored: .DS_Store
Ignored: .Rhistory
Ignored: .Rproj.user/
Ignored: analysis/.DS_Store
Ignored: code/.DS_Store
Ignored: data/.DS_Store
Ignored: data/derived/.DS_Store
Ignored: data/derived/output/.DS_Store
Ignored: data/input/.DS_Store
Ignored: figures/.DS_Store
Untracked files:
Untracked: .pversion
Untracked: ExDeconDemo_c.log
Untracked: ExDeconDemo_loglike.log
Untracked: TEST_Venn.tiff
Untracked: analysis/checking_mashr_results.Rmd
Untracked: code/Drosophila_GWAS.Rmd
Untracked: data/derived/.!25440!dgrp2_focal_lines.bed
Untracked: data/derived/.pversion
Untracked: data/derived/GRM.txt
Untracked: data/derived/TWAS_result_females.csv
Untracked: data/derived/TWAS_result_males.csv
Untracked: data/derived/all_four_traits_SNP_clumps.txt
Untracked: data/derived/all_univariate_GEMMA_results.csv
Untracked: data/derived/annotations.sqlite3
Untracked: data/derived/dgrp2_QC_all_lines.bed
Untracked: data/derived/dgrp2_QC_all_lines.bim
Untracked: data/derived/dgrp2_QC_all_lines.fam
Untracked: data/derived/dgrp2_QC_all_lines.log
Untracked: data/derived/dgrp2_QC_focal_lines.bed
Untracked: data/derived/dgrp2_QC_focal_lines.bim
Untracked: data/derived/dgrp2_QC_focal_lines.bk
Untracked: data/derived/dgrp2_QC_focal_lines.fam
Untracked: data/derived/dgrp2_QC_focal_lines.log
Untracked: data/derived/dgrp2_QC_focal_lines.rds
Untracked: data/derived/dgrp2_transcriptome_subset.bed
Untracked: data/derived/dgrp2_transcriptome_subset.bim
Untracked: data/derived/dgrp2_transcriptome_subset.fam
Untracked: data/derived/dgrp2_transcriptome_subset.log
Untracked: data/derived/female_early_SNP_clumps.txt
Untracked: data/derived/female_late_SNP_clumps.txt
Untracked: data/derived/lines_to_keep.txt
Untracked: data/derived/lmm_ED_mashr_results.rds
Untracked: data/derived/lmm_canonical_mashr_results.rds
Untracked: data/derived/male_early_SNP_clumps.txt
Untracked: data/derived/male_late_SNP_clumps.txt
Untracked: data/derived/mashr_results_ED.rds
Untracked: data/derived/mashr_results_canonical.rds
Untracked: data/derived/output/.pversion
Untracked: data/derived/output/GRM.cXX.txt
Untracked: data/derived/output/GRM.log.txt
Untracked: data/derived/output/GRM_transcriptome.cXX.txt
Untracked: data/derived/output/GRM_transcriptome.log.txt
Untracked: data/derived/output/all_four_traits.assoc.txt
Untracked: data/derived/output/all_four_traits.log.txt
Untracked: data/derived/output/allele_freq_count.assoc.txt
Untracked: data/derived/output/allele_freq_count.log.txt
Untracked: data/derived/output/eigen_decomp.eigenD.txt
Untracked: data/derived/output/eigen_decomp.eigenU.txt
Untracked: data/derived/output/eigen_decomp.log.txt
Untracked: data/derived/output/eigen_decomp_transcriptome.eigenD.txt
Untracked: data/derived/output/eigen_decomp_transcriptome.eigenU.txt
Untracked: data/derived/output/eigen_decomp_transcriptome.log.txt
Untracked: data/derived/plink.frq
Untracked: data/input/.pversion
Untracked: data/input/dgrp2.fam copy
Untracked: data/input/dgrp2.fam.bak
Untracked: data/input/huang_2015_tableS11_eQTL_females.csv
Untracked: data/input/huang_2015_tableS12_eQTL_males.csv
Untracked: data/input/huang_2015_tableS2_gene_expression.csv
Untracked: data/input/huang_2015_tableS5_transcript_heritability.csv
Untracked: dgrp2_clean.log
Untracked: docs/figure/checking_mashr_results.Rmd/
Untracked: gwas_adaptive_shrinkage.R
Unstaged changes:
Modified: analysis/GWAS_tables.Rmd
Modified: analysis/TWAS.Rmd
Modified: analysis/gwas_adaptive_shrinkage.Rmd
Modified: analysis/index.Rmd
Modified: analysis/make_annotation_database.Rmd
Modified: analysis/perform_gwas.Rmd
Modified: analysis/plotting_results.Rmd
Deleted: data/derived/DGRP_SNP_genos.sqlite3
Deleted: data/derived/gwas_db.sqlite3
Deleted: data/derived/lm_results_ashr.csv
Modified: data/derived/lmm_results_ashr.csv
Deleted: data/derived/output/DGRP_GRM.cXX.txt
Deleted: data/derived/output/DGRP_GRM.log.txt
Deleted: data/derived/output/female_early_bslmm.bv.txt
Deleted: data/derived/output/female_early_bslmm.gamma.txt
Deleted: data/derived/output/female_early_bslmm.hyp.txt
Deleted: data/derived/output/female_early_bslmm.log.txt
Deleted: data/derived/output/female_early_bslmm.param.txt
Deleted: data/derived/output/female_early_bslmm_preds.log.txt
Deleted: data/derived/output/female_early_bslmm_preds.prdt.txt
Deleted: data/derived/output/female_early_female_late.assoc.txt
Deleted: data/derived/output/female_early_female_late.log.txt
Deleted: data/derived/output/female_early_lm.assoc.txt
Deleted: data/derived/output/female_early_lm.log.txt
Modified: data/derived/output/female_early_lmm.assoc.txt
Modified: data/derived/output/female_early_lmm.log.txt
Deleted: data/derived/output/female_early_male_early.assoc.txt
Deleted: data/derived/output/female_early_male_early.log.txt
Deleted: data/derived/output/female_late_lm.assoc.txt
Deleted: data/derived/output/female_late_lm.log.txt
Modified: data/derived/output/female_late_lmm.assoc.txt
Modified: data/derived/output/female_late_lmm.log.txt
Deleted: data/derived/output/female_late_male_late.assoc.txt
Deleted: data/derived/output/female_late_male_late.log.txt
Deleted: data/derived/output/male_early_lm.assoc.txt
Deleted: data/derived/output/male_early_lm.log.txt
Modified: data/derived/output/male_early_lmm.assoc.txt
Modified: data/derived/output/male_early_lmm.log.txt
Deleted: data/derived/output/male_early_male_late.assoc.txt
Deleted: data/derived/output/male_early_male_late.log.txt
Deleted: data/derived/output/male_late_lm.assoc.txt
Deleted: data/derived/output/male_late_lm.log.txt
Modified: data/derived/output/male_late_lmm.assoc.txt
Modified: data/derived/output/male_late_lmm.log.txt
Deleted: data/derived/output/result.log.txt
Deleted: data/derived/output/result.prdt.txt
Deleted: data/derived/summed_interSNP_correlations.csv
Deleted: data/derived/trimmed_DGRP.bed
Deleted: data/derived/trimmed_DGRP.bim
Deleted: data/derived/trimmed_DGRP.fam
Deleted: data/derived/trimmed_DGRP.rds
Modified: data/input/dgrp2.fam
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes. File | Version | Author | Date | Message |
---|---|---|---|---|
Rmd | 3db7587 | Luke Holman | 2018-10-01 | 1st October - lots more work |
Rmd | b6c43ed | Luke Holman | 2018-09-14 | Start workflowr project. |
This script makes SQLite3 database holding two tables: one with annotations for each variant (provided by the Mackay lab), and one with annotations for each gene (from Bioconductor). Later on, we add the GWAS results to the database as well, allowing for memory-efficient processing of the results.
Using Bayesian mixed models implemented in the package brms
to estimate the line means for our four fitness traits.
The script first performs quality control and imputation on the dataset of SNPs and indels for the DGRP. Second, it runs univariate and multivariate association tests on our four fitness traits using the software GEMMA.
Here, we use the package mashr
to perform multivariate adaptive shrinkage on the results of the univariate association tests. We ran mashr
using two modes: canonical, and data-driven.
This script generates some plots to check that mashr
is performing as expected. The plots also led us to select the data-driven mashr
results as the most conservative.
Heritability - Correlated with selection experienced by the gene? Angatonism? - Correlated with sex-specificity?
eQTLs - Are there eQTLs with matching, independent, and opposite-sex effects? NEED TO RE-DO eQTL ANALYSIS A BIT - Are there more overlaps between the eQTLs and fitness QTLs than expected by chance? Correlation in p-vals?
Network - Modules are not correlated with fitness, so no evidence of selection on them? - Find the PCAs of the transcriptome, male vs female axis? Young vs old axis? - Are the modules heritable, and are there SNPs for eigengenes?
Predictions - eQTLs that affect transciption the same way in both sexes will have SA fitness effects for transcripts under SA selection (or concordant for concordant) We need: - Effect size for the eQTL on the transcript - Effect size for the eQTL on fitness - Effect size for the transcript on fitness - “Allele 1 made the transcript increase in both sexes, good for males and bad for females” - eQTLs that only affect transciption in one sex will have a more sex-specific fitness effect as well (not concordant, not antag) - eQTLs where the + and - alleles are swapped between sexes will be antagonistic or concordant, depending on sex-specific selection on that transcript - Transcripts will show the twin peaks relationship between sex bias in expression and SA selection - Genes targetted by dsx will be extra antagonistic
This reproducible R Markdown analysis was created with workflowr 1.1.1