. 2023 Dec 17;13(12):1199.
doi: 10.3390/metabo13121199.
Affiliations
Free PMC article
Item in Clipboard
Metabolites.
.
Free PMC article
Abstract
A major challenge to integrating public metabolic resources is the use of different nomenclatures by individual databases. This paper presents md_harmonize, an open-source Python package for harmonizing compounds and metabolic reactions across various metabolic databases. The md_harmonize package utilizes a neighborhood-specific graph coloring method for generating a unique identifier for each compound via atom identifiers based on a compound’s chemical structure. The resulting harmonized compounds and reactions can be used for various downstream analyses, including the construction of atom-resolved metabolic networks and models for metabolic flux analysis. Parts of the md_harmonize package have been optimized using a variety of computational techniques to allow certain NP-complete problems handled by the software to be tractable for these specific use-cases. The software is available on GitHub and through the Python Package Index, with end-user documentation hosted on GitHub Pages.
Keywords:
Python package; database harmonization; maximum common substructure; metabolite.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
References
LinkOut – more resources
-
Full Text Sources
-
Miscellaneous