Uncategorized

Explainable machine learning identifies multi-omics signatures of muscle response to spaceflight in mice




doi: 10.1038/s41526-023-00337-5.

Affiliations

Free PMC article

Item in Clipboard

Kevin Li et al.


NPJ Microgravity.


.

Free PMC article

Abstract

The adverse effects of microgravity exposure on mammalian physiology during spaceflight necessitate a deep understanding of the underlying mechanisms to develop effective countermeasures. One such concern is muscle atrophy, which is partly attributed to the dysregulation of calcium levels due to abnormalities in SERCA pump functioning. To identify potential biomarkers for this condition, multi-omics data and physiological data available on the NASA Open Science Data Repository (osdr.nasa.gov) were used, and machine learning methods were employed. Specifically, we used multi-omics (transcriptomic, proteomic, and DNA methylation) data and calcium reuptake data collected from C57BL/6 J mouse soleus and tibialis anterior tissues during several 30+ day-long missions on the international space station. The QLattice symbolic regression algorithm was introduced to generate highly explainable models that predict either experimental conditions or calcium reuptake levels based on multi-omics features. The list of candidate models established by QLattice was used to identify key features contributing to the predictive capability of these models, with Acyp1 and Rps7 proteins found to be the most predictive biomarkers related to the resilience of the tibialis anterior muscle in space. These findings could serve as targets for future interventions aiming to reduce the extent of muscle atrophy during space travel.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures


Fig. 1



Fig. 1. QLattice regression analysis of TA multi-omics data and calcium uptake.

a Representative examples of the mathematical relationships between multi-omic features identified by QLattice to predict calcium reuptake in TA muscle during LOOCV. b Top 9 features ranked by how many times they were used in a model found by QLattice during LOOCV. c T1 and T10 cross-validated R2 scores, as well as the number of RNA-seq and proteomic features that were found among the top 50 features. d Gene set enrichment analysis results using the top nine genes from the QLattice analysis.


Fig. 2



Fig. 2. QLattice regression analysis of SOL multi-omics data and calcium uptake.

a Representative examples of the mathematical relationships between multi-omic features identified by QLattice to predict calcium reuptake in SOL muscle during LOOCV. b Top 13 features ranked by how many times they were used in a model found by QLattice during LOOCV. c T1 and T10 cross-validated R2 scores, as well as the number of RNA-seq and methylation features that were found among the top 50 features. d Gene set enrichment analysis results using the top 13 genes from the QLattice analysis.


Fig. 3



Fig. 3. Gene/protein relationship with calcium reuptake in SOL and TA muscles and putative mechanism.

The expression levels of the top key genes identified by QLattice analysis and their corresponding protein levels are shown against the calcium reuptake AUC in both TA and SOL muscles flown in space (FLT) or from ground controls (GC). a QLattice key protein levels in TA. b QLattice key gene expression levels in TA. c QLattice key gene expression levels in SOL. d Putative mechanism based on the Acyp1 response, showing up in the majority of the models for TA muscle and e SOL muscle. f Calcium reuptake AUC in TA muscle. g Calcium reuptake AUC in SOL muscle. All significance was calculated using Mann–Whitney–Wilcoxon test two-sided: *: 1.00e−02 < p ≤ 5.00e−02, **:1.00e−03 < p ≤ 1.00e−02).


Fig. 4



Fig. 4. QLattice classification analysis of TA multi-omics data and FLT/GC groups.

a Representative examples of the mathematical relationships between multi-omic features identified by QLattice to predict FLT versus GC in TA muscle during LOOCV. b Top 11 features ranked by how many times they were used in a model found by QLattice during LOOCV. c T1 and T10 cross-validated R2 scores, as well as the total number of RNA-seq, proteomics, and methylation features across all models. d Gene set enrichment analysis results using the top 11 features from the QLattice analysis.


Fig. 5



Fig. 5. Expression levels of top proteins and genes identified by QLattice TA and SOL classification analysis and muscle weights.

a QLattice key protein levels in TA. b QLattice key gene levels in TA. c QLattice key gene levels in SOL. All significance was calculated using Mann–Whitney–Wilcoxon test two-sided: *: 1.00e−02 < p ≤ 5.00e−02, **:1.00e−03 < p ≤ 1.00e−02).


Fig. 6



Fig. 6. QLattice classification analysis of SOL multi-omics data and FLT/GC groups.

a Representative examples of the mathematical relationships between multi-omic features identified by QLattice to predict FLT versus GC in SOL muscle during LOOCV. b Top 9 features ranked by how many times they were used in a model found by QLattice during LOOCV. c T1 and T10 cross-validated R2 scores, as well as the total number of RNA-seq and methylation features across all models. d Gene set enrichment analysis results using the top 9 features from the QLattice analysis.

References

    1. Braun, J. L., Geromella, M. S., Hamstra, S. I., Messner, H. N. & Fajardo, V. A. Characterizing SERCA function in murine skeletal muscles after 35-37 days of spaceflight. Int. J. Mol. Sci. 22 (2021).



      PMC



      PubMed

    1. Juhl OJ, 4th, et al. Update on the effects of microgravity on the musculoskeletal system. NPJ Microgravity. 2021;7:28. doi: 10.1038/s41526-021-00158-4.



      DOI



      PMC



      PubMed

    1. Ulanova A, et al. Isoform composition and gene expression of thick and thin filament proteins in striated muscles of mice after 30-day space flight. Biomed. Res. Int. 2015;2015:104735. doi: 10.1155/2015/104735.



      DOI



      PMC



      PubMed

    1. Reel PS, Reel S, Pearson E, Trucco E, Jefferson E. Using machine learning approaches for multi-omics data analysis: a review. Biotechnol. Adv. 2021;49:107739. doi: 10.1016/j.biotechadv.2021.107739.



      DOI



      PubMed

    1. Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat. Methods. 2018;15:233–234. doi: 10.1038/nmeth.4642.



      DOI



      PMC



      PubMed

LinkOut – more resources

  • Full Text Sources

  • Miscellaneous



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *