Identifying Active and Alternative Promoters from RNA-Seq data with proActiv (original) (raw)

Summary

Most human genes have multiple promoters that control the expression of distinct isoforms. The use of these alternative promoters enables the regulation of isoform expression pre-transcriptionally. Alternative promoters have been found to be important in a wide number of cell types and diseases.

proActiv is a method that enables the analysis of promoters from RNA-Seq data. proActiv uses aligned reads as input, and then generates counts and normalized promoter activity estimates for each annotated promoter. These estimates can then be used to identify which promoter is active, which promoter is inactive, and which promoters change their activity across conditions.

Here we present a quick start guide to using proActiv, and a detailed workflow for identifying active promoters and alternative promoters across 2 conditions.

If you use proActiv in your research, please cite:

Demircioğlu, Deniz, et al. “A Pan-cancer Transcriptome Analysis Reveals Pervasive Regulation through Alternative Promoters.” Cell 178.6 (2019): 1465-1477.

Contents

Session information

#> R version 4.0.3 (2020-10-10)
#> Platform: x86_64-w64-mingw32/x64 (64-bit)
#> Running under: Windows 10 x64 (build 19041)
#> 
#> Matrix products: default
#> 
#> locale:
#> [1] LC_COLLATE=English_Singapore.1252  LC_CTYPE=English_Singapore.1252   
#> [3] LC_MONETARY=English_Singapore.1252 LC_NUMERIC=C                      
#> [5] LC_TIME=English_Singapore.1252    
#> 
#> attached base packages:
#> [1] stats     graphics  grDevices utils     datasets  methods   base     
#> 
#> other attached packages:
#> [1] Rtsne_0.15     ggplot2_3.3.3  gridExtra_2.3  proActiv_1.3.3
#> 
#> loaded via a namespace (and not attached):
#>   [1] bitops_1.0-6                matrixStats_0.57.0         
#>   [3] fs_1.5.0                    bit64_4.0.5                
#>   [5] RColorBrewer_1.1-2          progress_1.2.2             
#>   [7] httr_1.4.2                  rprojroot_2.0.2            
#>   [9] GenomeInfoDb_1.26.2         tools_4.0.3                
#>  [11] R6_2.5.0                    KernSmooth_2.23-17         
#>  [13] DBI_1.1.0                   BiocGenerics_0.36.0        
#>  [15] colorspace_2.0-0            withr_2.3.0                
#>  [17] tidyselect_1.1.0            prettyunits_1.1.1          
#>  [19] DESeq2_1.30.0               curl_4.3                   
#>  [21] bit_4.0.4                   compiler_4.0.3             
#>  [23] textshaping_0.2.1           Biobase_2.50.0             
#>  [25] xml2_1.3.2                  desc_1.2.0                 
#>  [27] DelayedArray_0.16.0         labeling_0.4.2             
#>  [29] rtracklayer_1.49.5          caTools_1.18.0             
#>  [31] scales_1.1.1                genefilter_1.72.0          
#>  [33] askpass_1.1                 rappdirs_0.3.1             
#>  [35] pkgdown_1.6.1               systemfonts_0.3.2          
#>  [37] stringr_1.4.0               digest_0.6.27              
#>  [39] Rsamtools_2.6.0             rmarkdown_2.6              
#>  [41] XVector_0.30.0              pkgconfig_2.0.3            
#>  [43] htmltools_0.5.1.1           MatrixGenerics_1.2.0       
#>  [45] highr_0.8                   dbplyr_2.0.0               
#>  [47] rlang_0.4.9                 rstudioapi_0.13            
#>  [49] RSQLite_2.2.1               farver_2.0.3               
#>  [51] generics_0.1.0              gtools_3.8.2               
#>  [53] BiocParallel_1.24.1         dplyr_1.0.2                
#>  [55] RCurl_1.98-1.2              magrittr_2.0.1             
#>  [57] GenomeInfoDbData_1.2.4      Matrix_1.2-18              
#>  [59] Rcpp_1.0.5                  munsell_0.5.0              
#>  [61] S4Vectors_0.28.1            lifecycle_0.2.0            
#>  [63] stringi_1.5.3               yaml_2.2.1                 
#>  [65] SummarizedExperiment_1.20.0 zlibbioc_1.36.0            
#>  [67] gplots_3.1.1                BiocFileCache_1.14.0       
#>  [69] grid_4.0.3                  blob_1.2.1                 
#>  [71] parallel_4.0.3              crayon_1.3.4               
#>  [73] lattice_0.20-41             Biostrings_2.58.0          
#>  [75] splines_4.0.3               GenomicFeatures_1.42.1     
#>  [77] annotate_1.68.0             hms_0.5.3                  
#>  [79] locfit_1.5-9.4              knitr_1.30                 
#>  [81] pillar_1.4.7                GenomicRanges_1.42.0       
#>  [83] geneplotter_1.68.0          biomaRt_2.46.0             
#>  [85] stats4_4.0.3                XML_3.99-0.5               
#>  [87] glue_1.4.2                  evaluate_0.14              
#>  [89] data.table_1.13.6           vctrs_0.3.6                
#>  [91] gtable_0.3.0                openssl_1.4.3              
#>  [93] purrr_0.3.4                 assertthat_0.2.1           
#>  [95] xfun_0.19                   xtable_1.8-4               
#>  [97] ragg_0.4.0                  survival_3.2-7             
#>  [99] tibble_3.0.4                GenomicAlignments_1.26.0   
#> [101] AnnotationDbi_1.52.0        memoise_1.1.0              
#> [103] IRanges_2.24.1              ellipsis_0.3.1