
Add MS2 spectra data into mass_dataset class object
Xiaotao Shen
Created on 2021-12-04 and updated on 2026-03-04
Source:vignettes/mutate_ms2.Rmd
mutate_ms2.Rmdmass_data class object can also contain MS2 data.
Data preparation
mass_data class object
We need to create a mass_data class object first; see this
document. Here we use the demo datasets bundled with the
package.
library(massdataset)
data("expression_data")
data("sample_info")
data("variable_info")
object_pos <-
create_mass_dataset(
expression_data = expression_data,
sample_info = sample_info,
variable_info = variable_info
)MS2 data
The MS2 raw data should be converted to mgf format data. Please refer to this document.
Here we use the MGF example bundled with
massdataset.
Add MS2 to mass_dataset class
object
Positive mode.
ms2_dir <- system.file("ms2_data", package = "massdataset")
object_pos2 =
mutate_ms2(
object = object_pos,
column = "rp",
polarity = "positive",
ms1.ms2.match.mz.tol = 10,
ms1.ms2.match.rt.tol = 15,
path = ms2_dir
)Session information
sessionInfo()
#> R version 4.5.2 (2025-10-31)
#> Platform: aarch64-apple-darwin20
#> Running under: macOS Tahoe 26.3
#>
#> Matrix products: default
#> BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
#> LAPACK: /Library/Frameworks/R.framework/Versions/4.5-arm64/Resources/lib/libRlapack.dylib; LAPACK version 3.12.1
#>
#> locale:
#> [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
#>
#> time zone: Asia/Singapore
#> tzcode source: internal
#>
#> attached base packages:
#> [1] stats graphics grDevices utils datasets methods base
#>
#> other attached packages:
#> [1] magrittr_2.0.3 dplyr_1.1.4 ggplot2_4.0.2 massdataset_0.99.3
#>
#> loaded via a namespace (and not attached):
#> [1] tidyselect_1.2.1 farver_2.1.2
#> [3] S7_0.2.0 fastmap_1.2.0
#> [5] digest_0.6.37 lifecycle_1.0.4
#> [7] cluster_2.1.8.1 compiler_4.5.2
#> [9] rlang_1.1.6 sass_0.4.10
#> [11] tools_4.5.2 yaml_2.3.10
#> [13] knitr_1.50 S4Arrays_1.8.1
#> [15] htmlwidgets_1.6.4 DelayedArray_0.34.1
#> [17] RColorBrewer_1.1-3 abind_1.4-8
#> [19] withr_3.0.2 purrr_1.1.0
#> [21] BiocGenerics_0.54.0 desc_1.4.3
#> [23] grid_4.5.2 stats4_4.5.2
#> [25] colorspace_2.1-1 scales_1.4.0
#> [27] iterators_1.0.14 dichromat_2.0-0.1
#> [29] SummarizedExperiment_1.38.1 cli_3.6.5
#> [31] rmarkdown_2.29 crayon_1.5.3
#> [33] ragg_1.4.0 generics_0.1.4
#> [35] rstudioapi_0.17.1 httr_1.4.7
#> [37] tzdb_0.5.0 rjson_0.2.23
#> [39] cachem_1.1.0 stringr_1.5.1
#> [41] parallel_4.5.2 XVector_0.48.0
#> [43] matrixStats_1.5.0 vctrs_0.6.5
#> [45] Matrix_1.7-4 jsonlite_2.0.0
#> [47] IRanges_2.42.0 hms_1.1.3
#> [49] GetoptLong_1.0.5 S4Vectors_0.48.0
#> [51] clue_0.3-66 systemfonts_1.2.3
#> [53] foreach_1.5.2 tidyr_1.3.1
#> [55] jquerylib_0.1.4 glue_1.8.0
#> [57] pkgdown_2.1.3 codetools_0.2-20
#> [59] stringi_1.8.7 shape_1.4.6.1
#> [61] gtable_0.3.6 GenomeInfoDb_1.44.2
#> [63] GenomicRanges_1.60.0 UCSC.utils_1.4.0
#> [65] ComplexHeatmap_2.24.1 tibble_3.3.0
#> [67] pillar_1.11.0 htmltools_0.5.8.1
#> [69] GenomeInfoDbData_1.2.14 circlize_0.4.16
#> [71] R6_2.6.1 textshaping_1.0.1
#> [73] doParallel_1.0.17 evaluate_1.0.4
#> [75] Biobase_2.68.0 lattice_0.22-7
#> [77] readr_2.1.5 png_0.1-8
#> [79] openxlsx_4.2.8 bslib_0.9.0
#> [81] Rcpp_1.1.0 zip_2.3.3
#> [83] SparseArray_1.8.1 xfun_0.53
#> [85] fs_1.6.6 MatrixGenerics_1.20.0
#> [87] pkgconfig_2.0.3 GlobalOptions_0.1.2