RNAmodR.Data 1.2.0
RNAmodR.Data contains example data for the RNAmodR and related packages.
The data is provided as gff3, fasta and bam files.
Four sets of data with multiple files are included
## snapshotDate(): 2020-04-27
library(RNAmodR.Data)
eh <- ExperimentHub()
## snapshotDate(): 2020-04-27
ExperimentHub::listResources(eh, "RNAmodR.Data")
## [1] "RNAmodR.Data.example.fasta" "RNAmodR.Data.example.gff3"
## [3] "RNAmodR.Data.example.bam.1" "RNAmodR.Data.example.bam.2"
## [5] "RNAmodR.Data.example.bam.3" "RNAmodR.Data.example.RMS.fasta"
## [7] "RNAmodR.Data.example.RMS.gff3" "RNAmodR.Data.example.RMS.1"
## [9] "RNAmodR.Data.example.RMS.2" "RNAmodR.Data.example.AAS.fasta"
## [11] "RNAmodR.Data.example.AAS.gff3" "RNAmodR.Data.example.bud23.1"
## [13] "RNAmodR.Data.example.bud23.2" "RNAmodR.Data.example.trm8.1"
## [15] "RNAmodR.Data.example.trm8.2" "RNAmodR.Data.example.wt.1"
## [17] "RNAmodR.Data.example.wt.2" "RNAmodR.Data.example.wt.3"
## [19] "RNAmodR.Data.example.man.fasta" "RNAmodR.Data.example.man.gff3"
## [21] "RNAmodR.Data.snoRNAdb"
These resources are grouped based on topic. Please have a look at the following man pages:
?RNAmodR.Data.example for general example data used for different purposes?RNAmodR.Data.RMS for example data for RiboMethSeq?RNAmodR.Data.AAS for example data for AlkAnilineSeq?RNAmodR.Data.man for small data set for man page examples?RNAmodR.Data.snoRNAdb for snoRNAdb as csv fileRNAmodR.Data.snoRNAdb consists of a table containing the published data from
the snoRNAdb [Lestrade and Weber (2006)]. The can be loaded as a GRanges
object.
library(GenomicRanges)
table <- read.csv2(RNAmodR.Data.snoRNAdb(), stringsAsFactors = FALSE)
## snapshotDate(): 2020-04-27
## see ?RNAmodR.Data and browseVignettes('RNAmodR.Data') for documentation
## loading from cache
head(table, n = 2)
# keep only the current coordinates
table <- table[,1:7]
snoRNAdb <- GRanges(seqnames = table$hgnc_symbol,
ranges = IRanges(start = table$position, width = 1),strand = "+",
type = "RNAMOD",
mod = table$modification,
Parent = table$hgnc_symbol,
Activity = CharacterList(strsplit(table$guide,",")))
# convert to current gene name
snoRNAdb <- snoRNAdb[vapply(snoRNAdb$Activity != "unknown",all,logical(1)),]
snoRNAdb <- split(snoRNAdb,snoRNAdb$Parent)
head(snoRNAdb)
## GRangesList object of length 6:
## $RNA18SN5
## GRanges object with 69 ranges and 4 metadata columns:
## seqnames ranges strand | type mod Parent
## <Rle> <IRanges> <Rle> | <character> <character> <character>
## [1] RNA18SN5 27 + | RNAMOD Am RNA18SN5
## [2] RNA18SN5 34 + | RNAMOD Y RNA18SN5
## [3] RNA18SN5 36 + | RNAMOD Y RNA18SN5
## [4] RNA18SN5 93 + | RNAMOD Y RNA18SN5
## [5] RNA18SN5 99 + | RNAMOD Am RNA18SN5
## ... ... ... ... . ... ... ...
## [65] RNA18SN5 1643 + | RNAMOD Y RNA18SN5
## [66] RNA18SN5 1678 + | RNAMOD Am RNA18SN5
## [67] RNA18SN5 1692 + | RNAMOD Y RNA18SN5
## [68] RNA18SN5 1703 + | RNAMOD Cm RNA18SN5
## [69] RNA18SN5 1804 + | RNAMOD Um RNA18SN5
## Activity
## <CharacterList>
## [1] SNORD27
## [2] SNORA50A,SNORA76
## [3] SNORA69,SNORA55
## [4] SNORA75
## [5] SNORD57
## ... ...
## [65] SNORA41
## [66] SNORD82
## [67] SNORD70A,SNORD70B,SNORD70C,...
## [68] SNORD43
## [69] SNORD20
## -------
## seqinfo: 9 sequences from an unspecified genome; no seqlengths
##
## ...
## <5 more elements>
sessionInfo()
## R version 4.0.0 (2020-04-24)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 18.04.4 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.11-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.11-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] stats4 parallel stats graphics grDevices utils datasets
## [8] methods base
##
## other attached packages:
## [1] RNAmodR.Data_1.2.0 ExperimentHubData_1.14.0 AnnotationHubData_1.18.0
## [4] futile.logger_1.4.3 GenomicRanges_1.40.0 GenomeInfoDb_1.24.0
## [7] IRanges_2.22.1 S4Vectors_0.26.0 ExperimentHub_1.14.0
## [10] AnnotationHub_2.20.0 BiocFileCache_1.12.0 dbplyr_1.4.3
## [13] BiocGenerics_0.34.0 BiocStyle_2.16.0
##
## loaded via a namespace (and not attached):
## [1] bitops_1.0-6 matrixStats_0.56.0
## [3] bit64_0.9-7 progress_1.2.2
## [5] httr_1.4.1 tools_4.0.0
## [7] R6_2.4.1 DBI_1.1.0
## [9] tidyselect_1.0.0 prettyunits_1.1.1
## [11] bit_1.1-15.2 curl_4.3
## [13] compiler_4.0.0 graph_1.66.0
## [15] Biobase_2.48.0 BiocCheck_1.24.0
## [17] formatR_1.7 DelayedArray_0.14.0
## [19] rtracklayer_1.48.0 bookdown_0.18
## [21] RBGL_1.64.0 askpass_1.1
## [23] rappdirs_0.3.1 stringr_1.4.0
## [25] digest_0.6.25 Rsamtools_2.4.0
## [27] rmarkdown_2.1 stringdist_0.9.5.5
## [29] AnnotationForge_1.30.1 XVector_0.28.0
## [31] rBiopaxParser_2.28.0 pkgconfig_2.0.3
## [33] htmltools_0.4.0 fastmap_1.0.1
## [35] rlang_0.4.6 RSQLite_2.2.0
## [37] shiny_1.4.0.2 jsonlite_1.6.1
## [39] BiocParallel_1.22.0 dplyr_0.8.5
## [41] RCurl_1.98-1.2 magrittr_1.5
## [43] GenomeInfoDbData_1.2.3 Matrix_1.2-18
## [45] Rcpp_1.0.4.6 lifecycle_0.2.0
## [47] stringi_1.4.6 yaml_2.2.1
## [49] SummarizedExperiment_1.18.1 zlibbioc_1.34.0
## [51] biocViews_1.56.0 grid_4.0.0
## [53] blob_1.2.1 promises_1.1.0
## [55] crayon_1.3.4 lattice_0.20-41
## [57] Biostrings_2.56.0 GenomicFeatures_1.40.0
## [59] hms_0.5.3 knitr_1.28
## [61] pillar_1.4.4 optparse_1.6.6
## [63] RUnit_0.4.32 codetools_0.2-16
## [65] biomaRt_2.44.0 futile.options_1.0.1
## [67] XML_3.99-0.3 glue_1.4.0
## [69] BiocVersion_3.11.1 evaluate_0.14
## [71] lambda.r_1.2.4 data.table_1.12.8
## [73] BiocManager_1.30.10 vctrs_0.2.4
## [75] httpuv_1.5.2 getopt_1.20.3
## [77] openssl_1.4.1 purrr_0.3.4
## [79] assertthat_0.2.1 xfun_0.13
## [81] mime_0.9 xtable_1.8-4
## [83] later_1.0.0 tibble_3.0.1
## [85] OrganismDbi_1.30.0 GenomicAlignments_1.24.0
## [87] AnnotationDbi_1.50.0 memoise_1.1.0
## [89] ellipsis_0.3.0 interactiveDisplayBase_1.26.0
Lestrade, Laurent, and Michel J. Weber. 2006. “snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs.” Nucleic Acids Research 34 (January):D158–D162. https://doi.org/10.1093/nar/gkj002.