Main Hypothesis and findings
- most constrained regulatory regions both in cis or distal coordinate with genes with essential functions
- plot medians of pLI scores against CDTS for variants in each CDTS quantile bin
- Each genomic bin within 15kb of a gene (cis) was assigned the pLI score of the closest gene.
- Each enhancer was assigned a pLI score of the paired gene based on in situ Hi-C or pcHi-C (distal). The distances between enhancer and gene are up to 2Mb.
Noncoding pathogenic variants associated with Mendelian traits are enriched at the lowest CDTS percentile.
- CDTS ranking is a good proxy to score functionality and consequences of mutations for non-coding sequences.
- benchmark different metrics for noncoding variants: perfomance on detecting Mendelian noncoding variants
- CDTS captures the highest proportion of variants uniquely detected by a single metric
- CDTS requires no prior knowledge (no overfitting issue)
This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.
When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:
summary(cars)
speed dist
Min. : 4.0 Min. : 2.00
1st Qu.:12.0 1st Qu.: 26.00
Median :15.0 Median : 36.00
Mean :15.4 Mean : 42.98
3rd Qu.:19.0 3rd Qu.: 56.00
Max. :25.0 Max. :120.00
sessionInfo()
R version 4.0.4 (2021-02-15)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Scientific Linux 7.4 (Nitrogen)
Matrix products: default
BLAS/LAPACK: /software/openblas-0.3.13-el7-x86_64/lib/libopenblas_haswellp-r0.3.13.so
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] workflowr_1.6.2
loaded via a namespace (and not attached):
[1] Rcpp_1.0.6 whisker_0.4 knitr_1.31 magrittr_2.0.1
[5] R6_2.5.0 rlang_0.4.10 fansi_0.4.2 stringr_1.4.0
[9] tools_4.0.4 xfun_0.21 utf8_1.1.4 git2r_0.28.0
[13] jquerylib_0.1.3 htmltools_0.5.1.1 ellipsis_0.3.1 rprojroot_2.0.2
[17] yaml_2.2.1 digest_0.6.27 tibble_3.0.6 lifecycle_1.0.0
[21] crayon_1.4.1 later_1.1.0.1 sass_0.3.1 vctrs_0.3.6
[25] promises_1.2.0.1 fs_1.5.0 glue_1.4.2 evaluate_0.14
[29] rmarkdown_2.7 stringi_1.5.3 bslib_0.2.4 compiler_4.0.4
[33] pillar_1.5.0 jsonlite_1.7.2 httpuv_1.5.5 pkgconfig_2.0.3