Repository logo
 

Comparative analysis of chemical similarity methods for modular natural products with a hypothetical structure enumeration algorithm

dc.contributor.authorSkinnider, Michael A.
dc.contributor.authorDejong, Chris A.
dc.contributor.authorFranczak, Brian C.
dc.contributor.authorMcNicholas, Paul D.
dc.contributor.authorMagarvey, Nathan A.
dc.date.accessioned2020-10-26
dc.date.accessioned2022-05-31T01:16:03Z
dc.date.available2022-05-31T01:16:03Z
dc.date.issued2017
dc.description.abstractNatural products represent a prominent source of pharmaceutically and industrially important agents. Calculating the chemical similarity of two molecules is a central task in cheminformatics, with applications at multiple stages of the drug discovery pipeline. Quantifying the similarity of natural products is a particularly important problem, as the biological activities of these molecules have been extensively optimized by natural selection. The large and structurally complex scaffolds of natural products distinguish their physical and chemical properties from those of synthetic compounds. However, no analysis of the performance of existing methods for molecular similarity calculation specific to natural products has been reported to date. Here, we present LEMONS, an algorithm for the enumeration of hypothetical modular natural product structures. We leverage this algorithm to conduct a comparative analysis of molecular similarity methods within the unique chemical space occupied by modular natural products using controlled synthetic data, and comprehensively investigate the impact of diverse biosynthetic parameters on similarity search. We additionally investigate a recently described algorithm for natural product retrobiosynthesis and alignment, and find that when rule-based retrobiosynthesis can be applied, this approach outperforms conventional two-dimensional fingerprints, suggesting it may represent a valuable approach for the targeted exploration of natural product chemical space and microbial genome mining. Our open-source algorithm is an extensible method of enumerating hypothetical natural product structures with diverse potential applications in bioinformatics.
dc.format.extent1.02MB
dc.format.mimetypePDF
dc.identifier.citationSkinnider, M. A., Dejong, C. A., Franczak, B. C., McNicholas, P. D., and Magarvey, N. A. (2017). Comparative analysis of chemical similarity methods for modular natural products with a hypothetical structure enumeration algorithm. Journal of Cheminformatics, 9(46). https://doi.org/10.1186/s13321-017-0234-y
dc.identifier.doihttps://doi.org/10.1186/s13321-017-0234-y
dc.identifier.urihttps://hdl.handle.net/20.500.14078/1971
dc.languageEnglish
dc.language.isoen
dc.rightsAttribution (CC BY)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectchemical similarity
dc.subjectnatural products
dc.subjectchemical fingerprints
dc.subjectchemical structure enumeration
dc.titleComparative analysis of chemical similarity methods for modular natural products with a hypothetical structure enumeration algorithmen
dc.typeArticle

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Comparative_analysis_of_chemical_similarity-_2017_roam.pdf
Size:
1.02 MB
Format:
Adobe Portable Document Format