--- title: "Data analysis of metabolomics and other omics datasets using the structToolbox" author: - name: Gavin R Lloyd affiliation: Phenome Centre Birmingham, University of Birmingham, UK email: g.r.lloyd@bham.ac.uk - name: Andris Jankevics affiliation: Phenome Centre Birmingham, University of Birmingham, UK email: a.jankevics@bham.ac.uk - name: Ralf J Weber affiliation: Phenome Centre Birmingham, University of Birmingham, UK email: r.j.weber@bham.ac.uk output: BiocStyle::html_document: toc: true toc_depth: 2 number_sections: true toc_float: true package: structToolbox abstract: | Data (pre-)processing and data analysis of Metabolomics and other omics datasets using struct and structToolbox, including univariate/multivariate statistics and machine learning approaches. vignette: > %\VignetteIndexEntry{Data analysis of metabolomics and other omics datasets using the structToolbox} %\VignetteEncoding{UTF-8} %\VignetteEngine{knitr::rmarkdown} --- ```{r, include=FALSE} knitr::opts_chunk$set( dpi=96,fig.width=5,fig.height=5.5,fig.retina = 1,fig.small = TRUE ) set.seed(57475) ``` # Introduction The 'structToolbox' includes an extensive set of data (pre-)processing and analysis tools for metabolomics and other omics, with a strong emphasis on statistics and machine learning. The methods and tools have been implemented using class-based templates available via the `struct` (Statistics in R Using Class-based Templates) package. The aim of this vignette is to introduce the reader to basic and more advanced structToolbox-based operations and implementations, such as the use of `struct` objects, getting/setting methods/parameters, and building workflows for the analysis of mass spectrometry (MS) and nuclear magnetic resonance (NMR)-based Metabolomics and proteomics datasets. The workflows demonstrated here include a wide range of methods and tools including pre-processing such as filtering, normalisation and scaling, followed by univariate and/or multivariate statistics, and machine learning approaches. # Getting started The latest version of `structToolbox` compatible with your current R version can be installed using `BiocManager`. ```{r, eval=FALSE, include=TRUE} # install BiocManager if not present if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager") # install structToolbox and dependencies BiocManager::install("structToolbox") ``` A number of additional packages are needed for this vignette. ```{r, message=FALSE, warning=FALSE} ## install additional bioc packages for vignette if needed #BiocManager::install(c('pmp', 'ropls', 'BiocFileCache')) ## install additional CRAN packages if needed #install.packages(c('cowplot', 'openxlsx')) suppressPackageStartupMessages({ # Bioconductor packages library(structToolbox) library(pmp) library(ropls) library(BiocFileCache) library(BiocStyle) # CRAN libraries library(ggplot2) library(gridExtra) library(cowplot) library(openxlsx) }) # use the BiocFileCache bfc <- BiocFileCache(ask = FALSE) ``` ```{r child="articles/introduction_to_struct_objects.Rmd"} ``` # Case Studies ```{r child="articles/typical_MS_workflow.Rmd"} ``` ```{r child="articles/pls_clinical_lcms.Rmd"} ``` ```{r child="articles/univariate_clinical_nmr.Rmd"} ``` ```{r child="articles/stategra_proteomics_metabolomics.Rmd"} ``` # Tutorials ```{r child="articles/metabolomics_and_svm.Rmd"} ``` # Session Info ```{r} sessionInfo() ```