100+ datasets found
  1. Australian Natural Products dataset

    • data.csiro.au
    • researchdata.edu.au
    Updated Jun 30, 2025
    Cite
    Simon Saubern; Alex Shmaylov; Katherine Locock; Don McGilvery; David Collins (2025). Australian Natural Products dataset [Dataset]. http://doi.org/10.25919/v2qx-vp27
    Dataset updated
    Jun 30, 2025
    Dataset provided by
    CSIRO (https://www.csiro.au/)
    Authors
    Simon Saubern; Alex Shmaylov; Katherine Locock; Don McGilvery; David Collins
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Australia
    Dataset funded by
    CSIRO (https://www.csiro.au/)
    Description

    A continuation of the "Phytochemistry of Australian Plants" database compiled by David Collins and Don McGilvery. Contains chemical structures, references, and species names, with persistent identifiers to the literature and to the Atlas of Living Australia (ALA) for geographical distributions. The current curation effort adds DOIs/ISBNs/ISSNs for ~80% of references, persistent IDs for all species or genera to the ALA or other datasets, and validated structures (SMILES) for ~70% of structures. No new entries have been added since the last update to the original database in 2022. The change log is in the README file.

    Data provided here was obtained by the listed authors on linked publications, and these authors may have no association with CSIRO. CSIRO acknowledges that the publications linked here may contain Indigenous Cultural and Intellectual Property (ICIP), including traditional knowledge. CSIRO recognizes that First Nations peoples have the right to control, own and maintain their ICIP in accordance with Article 31 of the United Nations Declaration on the Rights of Indigenous Peoples. Users of this dataset may need to obtain permission from First Nations peoples for use of the information in linked publications. Users intending to collect and use biological specimens containing the compounds described in the dataset may also require permission of First Nations peoples, and may require permits and access permission from landholders. Recognizing that any ICIP in the linked publications is already publicly available but that the publications are not readily accessible by First Nations peoples, CSIRO is committed to finding ways to make the ICIP in these publications more findable and accessible to the First Nations communities from which the knowledge was originally obtained. Users should be aware that because of the historical context of some of the linked publications, they may contain words, descriptions, images or terms which may be culturally sensitive and/or offensive and that reflect authors’ views, or those of the period in which the content was created but may not be considered appropriate today. If First Nations people identify content within this dataset that they consider breaches cultural protocols they are encouraged to contact CSIRO on csiroenquiries@csiro.au or +61 3 9545 2176 to request its removal from the dataset. Please note that while CSIRO is able to administer the data housed within this dataset, this control does not extend to the associated publications. Requests to remove publications should be directed to the associated publishing company. 
Lineage: Original data extracted in 2022 from https://fms05.filemakerstudio.com.au/fmi/webd?homeurl=http://www.monash.edu/#PhytoChem by kind permission of David Collins and Don McGilvery.

  2. Data from: Fragment Library of Natural Products and Compound Databases for...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    txt
    Updated Oct 8, 2020
    Cite
    Ana Luisa Chávez-Hernández; JOSÉ LUIS MEDINA-FRANCO; Norberto Sánchez-Cruz (2020). Fragment Library of Natural Products and Compound Databases for Drug Discovery [Dataset]. http://doi.org/10.6084/m9.figshare.13064231.v1
    Available download formats: txt
    Dataset updated
    Oct 8, 2020
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Ana Luisa Chávez-Hernández; JOSÉ LUIS MEDINA-FRANCO; Norberto Sánchez-Cruz
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Natural products and semi-synthetic compounds continue to be a significant source of drug candidates for a broad range of diseases, including the current pandemic caused by COVID-19. Besides being attractive sources of bioactive compounds for further development or optimization, natural products are excellent sources of unique substructures for fragment-based drug discovery inspired by natural products. To this end, fragment libraries are required that can be incorporated into automated drug design pipelines. However, public fragment libraries based on extensive collections of natural products are still scarce. Herein we report the generation and analysis of a fragment library of natural products derived from a database with more than 400,000 compounds. We also report fragment libraries of food chemical databases and other compound data sets of interest in drug discovery, including compound libraries relevant for COVID-19 drug discovery. The fragment libraries were characterized in terms of contents and diversity.

    Supporting information contains: COCONUT_COMPOUNDS.csv, FooDB_COMPOUNDS.csv, DCM_COMPOUNDS.csv, CAS_COMPOUNDS.csv, 3CLP_COMPOUNDS.csv. All datasets contain the curated structures and the following information: identification number (ID), simplified molecular input line entry system (Smiles), Average Molecular Weight (AMW), number of carbons, oxygens, nitrogens, heavy atoms, aliphatic rings, aromatic rings, heterocycles, bridgehead atoms, fraction of sp3 carbon atoms and chiral carbons, and a list of fragments generated from each compound.

    FRAGMENTS_COCONUT.csv, FRAGMENTS_FooDB.csv, FRAGMENTS_DCM.csv, FRAGMENTS_CAS.csv, FRAGMENTS_3CLP.csv. All libraries contain structures generated (Fragments) from each compound library (Dataset) and the following information: number of compounds that contain that fragment in a dataset (Count) and fraction of them (Proportion), average Molecular Weight (AMW), number of carbons, oxygens, nitrogens, heavy atoms, aliphatic rings, aromatic rings, heterocycles, bridgehead atoms, fraction of sp3 carbon atoms and chiral carbons.
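
    The Count and Proportion columns described above can be illustrated with a minimal sketch. It assumes that Proportion is Count divided by the number of compounds in the dataset, and that a fragment is counted once per compound; the compound IDs and fragment SMILES below are hypothetical placeholders, not values from the dataset.

```python
from collections import Counter


def fragment_table(compound_fragments):
    """Return {fragment: (count, proportion)} for a compound -> fragments mapping."""
    n_compounds = len(compound_fragments)
    counts = Counter()
    for fragments in compound_fragments.values():
        # Count a fragment once per compound, even if it is listed several times.
        counts.update(set(fragments))
    return {frag: (n, n / n_compounds) for frag, n in counts.items()}


# Hypothetical toy input (placeholder IDs and fragment SMILES).
toy = {
    "cpd_1": ["c1ccccc1", "CO"],
    "cpd_2": ["c1ccccc1"],
    "cpd_3": ["CO", "CO", "C1CCCCC1"],
    "cpd_4": ["c1ccccc1", "C1CCCCC1"],
}

table = fragment_table(toy)
for frag, (count, proportion) in sorted(table.items()):
    print(frag, count, proportion)
```

    Counting per compound rather than per occurrence matches the wording "number of compounds that contain that fragment"; whether the authors computed it exactly this way is not stated in the description.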

  3. Natural products structure database LOTUS supplemented with predicted 13C...

    • zenodo.org
    • data.niaid.nih.gov
    • +1 more
    bin, zip
    Updated Jul 25, 2023
    Cite
    Jean-Marc Nuzillard (2023). Natural products structure database LOTUS supplemented with predicted 13C NMR chemical shifts. [Dataset]. http://doi.org/10.5281/zenodo.8175939
    Available download formats: zip, bin
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Jean-Marc Nuzillard
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A structure database of natural products in SDF format was created from the LOTUS database version 9.

    This database is intended to facilitate the dereplication of natural products.

    The LOTUS database was described in this publication (free download).

    File 220916_frozen_metadata.csv was downloaded from the LOTUS database version 9 and the SMILES chains of the compounds were collected.

    The SMILES chains were translated to 2D chemical structures using Python scripts relying on the RDKit library.

    Each compound was associated with predicted 13C NMR chemical shifts by means of an already reported procedure (free download).

    Each compound was also supplemented with metadata from file 220916_frozen_metadata.csv.

    Archive file acd_lotusv9.sdf.zip contains acd_lotusv9.sdf with 218,478 compound descriptions inside.

    Archive file acd_lotusv9.NMRUDB.zip is a compressed version of acd_lotusv9.NMRUDB, itself created by importing file acd_lotusv9.sdf into an ACD/Labs database file (new with version 0.0.4).

    The description of the first compound was copied in file firstmolv9.sdf and is provided for a quick inspection of the database content.

    The title line in firstmolv9.sdf is Q43656_2, meaning that more data about this compound may be found by searching in Wikidata for Q43656 and that the initial data was given by line 2 in file 220916_frozen_metadata.csv.
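
    The title-line convention just described can be unpacked with a short sketch. The function name parse_title is hypothetical, and the only format assumed is the one stated above: a Wikidata identifier, an underscore, then a line number in the metadata file.

```python
def parse_title(title):
    """Split an SDF title such as 'Q43656_2' into (Wikidata ID, metadata line number)."""
    wikidata_id, _, line_no = title.rpartition("_")
    if not wikidata_id.startswith("Q") or not line_no.isdigit():
        raise ValueError(f"unexpected title format: {title!r}")
    return wikidata_id, int(line_no)


wikidata_id, line_no = parse_title("Q43656_2")
# Wikidata item pages follow the pattern https://www.wikidata.org/wiki/<ID>.
print(f"https://www.wikidata.org/wiki/{wikidata_id}", "metadata line", line_no)
```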

    Files acd_lotusv9.sdf and acd_lotusv9.NMRUDB contain biological taxonomy data from file 220916_frozen_metadata.csv that were not exploited in acd_lotusv7. Sub-files dealing with a particular taxon can now be produced easily.

    Chemical shift calculations for 13C nuclei using the HOSE code approach are available here for the compounds in acd_lotusv7.

  4. Curated LOTUS database

    • figshare.com
    txt
    Updated May 3, 2024
    Cite
    Dillon Tay; Qisong Xu (2024). Curated LOTUS database [Dataset]. http://doi.org/10.6084/m9.figshare.25745325.v1
    Available download formats: txt
    Dataset updated
    May 3, 2024
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Dillon Tay; Qisong Xu
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Curated database of natural products and their corresponding kingdom classification (e.g. GBIF Backbone Taxonomy, Catalogue of Life) from the LOTUS database (https://lotus.naturalproducts.net/). df_curated.csv data for Github Repository at https://github.com/SIBERanalytics/NPTaxonomy/

  5. benzaldehyde

    • coconut.naturalproducts.net
    Updated Apr 25, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). benzaldehyde [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0105179.0
    Dataset updated
    Apr 25, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  6. MG(0:0/22:4(7Z,10Z,13Z,16Z)/0:0)

    • coconut.naturalproducts.net
    Updated May 17, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). MG(0:0/22:4(7Z,10Z,13Z,16Z)/0:0) [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0555828.0
    Dataset updated
    May 17, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  7. Natural Product Compound Database

    • kaggle.com
    zip
    Updated Nov 11, 2021
    Cite
    Maria Congenie (2021). Natural Product Compound Database [Dataset]. https://kaggle.com/mariacongenie/nat-prod-chemicals-and-descriptors
    Available download formats: zip (10449884 bytes)
    Dataset updated
    Nov 11, 2021
    Authors
    Maria Congenie
    Description

    A chunk of the open-source COCONUT Natural Product Database I found at: https://coconut.naturalproducts.net/

    Exploring natural product chemical space loosely based around drug-like properties (the more lipophilic and larger end of the spectrum). Had a bit of trouble finding CSV files (not SDF) for chemical compound collections that include basic physicochemical descriptors, so thought I'd add one! I'm working on mapping the spread of this space based on common descriptors.

  8. Natural Product Activity and Species Source Database

    • bioregistry.io
    Updated Apr 26, 2021
    Cite
    (2021). Natural Product Activity and Species Source Database [Dataset]. https://bioregistry.io/npass
    Dataset updated
    Apr 26, 2021
    License

    https://bioregistry.io/spdx:CC-BY-NC

    Description

    Database that integrates the species sources of natural products and connects natural products to biological targets via experimentally derived quantitative activity data.

  9. Hernangerine

    • coconut.naturalproducts.net
    Updated May 16, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). Hernangerine [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0217669.1
    Dataset updated
    May 16, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  10. NPASS Natural Compounds Bioactivity Dataset

    • kaggle.com
    zip
    Updated Dec 20, 2025
    Cite
    Carol (2025). NPASS Natural Compounds Bioactivity Dataset [Dataset]. https://www.kaggle.com/datasets/ikrambeg/npass-natural-compounds-bioactivity-dataset
    Available download formats: zip (129085 bytes)
    Dataset updated
    Dec 20, 2025
    Authors
    Carol
    Description

    NPASS Natural Compounds Bioactivity Dataset

    This dataset is extracted from the NPASS database and provides a clean collection of natural compounds with their chemical representations and bioactivity values. It includes:

    • np_id: Compound identifier
    • SMILES: Chemical structure representation (for model training)
    • InChI and InChIKey: Optional chemical identifiers for verification
    • activity_value: Bioactivity measurement (can be used for QSAR or other predictive modeling)

    This dataset is ready for training deep learning models, such as Transformers, to learn chemical compound representations and explore bioactivity relationships.

    Source: NPASS database – Natural Product Activity & Species Source Database

    Suitable for researchers in computational chemistry, drug discovery, or any project focusing on chemical compound modeling.
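
    As a rough illustration of how the columns listed above might be read for modeling, here is a minimal sketch using only the standard library. The exact header spellings (np_id, SMILES, InChI, InChIKey, activity_value) are assumed from the description, and the sample rows are made-up placeholders, not records from NPASS.

```python
import csv
import io

# Placeholder rows for illustration only; IDs and activity values are invented.
SAMPLE = """np_id,SMILES,InChI,InChIKey,activity_value
EXAMPLE_1,CCO,,,1.5
EXAMPLE_2,c1ccccc1,,,
EXAMPLE_3,CC(=O)O,,,0.2
"""


def load_records(handle):
    """Return (np_id, SMILES, activity) tuples, skipping rows without an activity value."""
    records = []
    for row in csv.DictReader(handle):
        raw = row["activity_value"].strip()
        if not raw:
            continue  # no measurement to train on
        records.append((row["np_id"], row["SMILES"], float(raw)))
    return records


records = load_records(io.StringIO(SAMPLE))
print(len(records), "usable records")
```

    For the real file, the io.StringIO object would be replaced by an open file handle; the InChI columns are left unread here because the description calls them optional.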

  11. COCONUT 2.0 - Complete database

    • zenodo.org
    zip
    Updated Aug 28, 2024
    Cite
    Venkata Chandrasekhar Nainala; Sri Ram Sagar Kanakam; Nisha Sharma; Viktor Weißenborn; Jonas Schaub; Christoph Steinbeck; Kohulan Rajan (2024). COCONUT 2.0 - Complete database [Dataset]. http://doi.org/10.5281/zenodo.13382751
    Available download formats: zip
    Dataset updated
    Aug 28, 2024
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Venkata Chandrasekhar Nainala; Sri Ram Sagar Kanakam; Nisha Sharma; Viktor Weißenborn; Jonas Schaub; Christoph Steinbeck; Kohulan Rajan
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    COCONUT (Collection of Open Natural Products) Online


    COlleCtion of Open NatUral producTs (COCONUT) is an aggregated dataset comprising elucidated and predicted natural products (NPs) from open repositories. It offers a user-friendly web interface for browsing, searching, and efficiently downloading NPs. The latest database integrates more than 63 open NP resources, providing unrestricted access to data free of charge. Each entry in the database represents a "flat" NP structure, accompanied by information on its known stereochemical forms, relevant literature, producing organisms, natural geographical distribution, and precomputed molecular properties.

    Natural products are small bioactive molecules produced by living organisms with potential applications in pharmacology and various industries. The significance of these compounds has driven global interest in NP research across diverse fields. However, despite the growing number of general and specialized NP databases, no comprehensive online resource has consolidated all known NPs in one place—until COCONUT. This became a resource facilitating NP research, enabling computational screening and other in-silico applications.

    Summary of the COCONUT database's statistics:

    • Total Molecules: 621,631
    • Total Collections: 63
    • Unique Organisms: 55,252
    • Citations Mapped: 24,272

    COCONUT was meticulously assembled from a range of public databases and primary sources, including:

    S.No | Database name | Entries integrated in COCONUT | Latest resource URL
    1 | AfroCancer | 390 | Fidele Ntie-Kang, Justina Ngozi Nwodo, Akachukwu Ibezim, Conrad Veranso Simoben, Berin Karaman, Valery Fuh Ngwa, Wolfgang Sippl, Michael Umale Adikwu, and Luc Meva’a Mbaze, Journal of Chemical Information and Modeling 2014 54 (9), 2433-2450 https://doi.org/10.1021/ci5003697
    2 | AfroDB | 953 | Fidele Ntie-Kang, Denis Zofou, Smith B. Babiaka, Rolande Meudom, Michael Scharfe, Lydia L. Lifongo, James A. Mbah, Luc Meva’a Mbaze, Wolfgang Sippl, Simon M. N. Efange https://doi.org/10.1371/journal.pone.0078085
    3 | AfroMalariaDB | 265 | Onguéné, P.A., Ntie-Kang, F., Mbah, J.A. et al. The potential of anti-malarial compounds derived from African medicinal plants, part III: an in silico evaluation of drug metabolism and pharmacokinetics profiling. Org Med Chem Lett 4, 6 (2014). https://doi.org/10.1186/s13588-014-0006-x
    4 | AnalytiCon Discovery NPs | 5,147 | Natural products are a subset of AnalytiCon Discovery NPs https://ac-discovery.com/screening-libraries/
    5 | BIOFACQUIM | 605 | Pilón-Jiménez, B.A.; Saldívar-González, F.I.; Díaz-Eufracio, B.I.; Medina-Franco, J.L. BIOFACQUIM: A Mexican Compound Database of Natural Products. Biomolecules 2019, 9, 31. https://doi.org/10.3390/biom9010031
    6 | BitterDB | 685 | Ayana Dagan-Wiener, Antonella Di Pizio, Ido Nissim, Malkeet S Bahia, Nitzan Dubovski, Eitan Margulis, Masha Y Niv, BitterDB: taste ligands and receptors database in 2019, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D1179–D1185, https://doi.org/10.1093/nar/gky974
    7 | Carotenoids Database | 1,195 | Junko Yabuzaki, Carotenoids Database: structures, chemical fingerprints and distribution among organisms, Database, Volume 2017, 2017, bax004, https://doi.org/10.1093/database/bax004
    8 | ChEBI NPs | 16,215 | Janna Hastings, Paula de Matos, Adriano Dekker, Marcus Ennis, Bhavana Harsha, Namrata Kale, Venkatesh Muthukrishnan, Gareth Owen, Steve Turner, Mark Williams, Christoph Steinbeck, The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013, Nucleic Acids Research, Volume 41, Issue D1, 1 January 2013, Pages D456–D463, https://doi.org/10.1093/nar/gks1146
    9 | ChEMBL NPs | 1,910 | Anna Gaulton, Anne Hersey, Michał Nowotka, A. Patrícia Bento, Jon Chambers, David Mendez, Prudence Mutowo, Francis Atkinson, Louisa J. Bellis, Elena Cibrián-Uhalte, Mark Davies, Nathan Dedman, Anneli Karlsson, María Paula Magariños, John P. Overington, George Papadatos, Ines Smit, Andrew R. Leach, The ChEMBL database in 2017, Nucleic Acids Research, Volume 45, Issue D1, January 2017, Pages D945–D954, https://doi.org/10.1093/nar/gkw1074
    10 | ChemSpider NPs | 9,740 | Harry E. Pence and Antony Williams, Journal of Chemical Education 2010 87 (11), 1123-1124 https://doi.org/10.1021/ed100697w
    11 | CMAUP (Collective molecular activities of useful plants) | 47,593 | Xian Zeng, Peng Zhang, Yali Wang, Chu Qin, Shangying Chen, Weidong He, Lin Tao, Ying Tan, Dan Gao, Bohua Wang, Zhe Chen, Weiping Chen, Yu Yang Jiang, Yu Zong Chen, CMAUP: a database of collective molecular activities of useful plants, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D1118–D1127, https://doi.org/10.1093/nar/gky965
    12 | ConMedNP | 3,111 | DOI https://doi.org/10.1039/C3RA43754J
    13 | ETM (Ethiopian Traditional Medicine) DB | 1,798 | Bultum, L.E., Woyessa, A.M. & Lee, D. ETM-DB: integrated Ethiopian traditional herbal medicine and phytochemicals database. BMC Complement Altern Med 19, 212 (2019). https://doi.org/10.1186/s12906-019-2634-1
    14 | Exposome-explorer | 434 | Vanessa Neveu, Alice Moussy, Héloïse Rouaix, Roland Wedekind, Allison Pon, Craig Knox, David S. Wishart, Augustin Scalbert, Exposome-Explorer: a manually-curated database on biomarkers of exposure to dietary and environmental factors, Nucleic Acids Research, Volume 45, Issue D1, January 2017, Pages D979–D984, https://doi.org/10.1093/nar/gkw980
    15 | FoodDB | 70,385 | Natural products are a subset of FoodDB https://foodb.ca/
    16 | GNPS (Global Natural Products Social Molecular Networking) | 11,103 | Wang, M., Carver, J., Phelan, V. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 34, 828–837 (2016). https://doi.org/10.1038/nbt.3597
    17 | HIM (Herbal Ingredients in-vivo Metabolism database) | 1,259 | Kang, H., Tang, K., Liu, Q. et al. HIM-herbal ingredients in-vivo metabolism database. J Cheminform 5, 28 (2013). https://doi.org/10.1186/1758-2946-5-28
    18 | HIT (Herbal Ingredients Targets) | 530 | Hao Ye, Li Ye, Hong Kang, Duanfeng Zhang, Lin Tao, Kailin Tang, Xueping Liu, Ruixin Zhu, Qi Liu, Y. Z. Chen, Yixue Li, Zhiwei Cao, HIT: linking herbal active ingredients to targets, Nucleic Acids Research, Volume 39, Issue suppl_1, 1 January 2011, Pages D1055–D1059, https://doi.org/10.1093/nar/gkq1165
    19 | Indofine Chemical Company | 46 | Natural products are a subset of Indofine Chemical Company https://indofinechemical.com/
    20 | InflamNat | 664 | Ruihan Zhang, Jing Lin, Yan Zou, Xing-Jie Zhang, and Wei-Lie Xiao, Journal of Chemical Information and Modeling 2019 59 (1), 66-73 https://doi.org/10.1021/acs.jcim.8b00560

  12. Data from: Database for Rapid Dereplication of Known Natural Products Using...

    • datasetcatalog.nlm.nih.gov
    Updated Jun 14, 2017
    Cite
    Carroll, Anthony R.; Zani, Carlos L. (2017). Database for Rapid Dereplication of Known Natural Products Using Data from MS and Fast NMR Experiments [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001828893
    Dataset updated
    Jun 14, 2017
    Authors
    Carroll, Anthony R.; Zani, Carlos L.
    Description

    The discovery of novel and/or new bioactive natural products from biota sources is often confounded by the reisolation of known natural products. Dereplication strategies that involve the analysis of NMR and MS spectroscopic data to infer structural features present in purified natural products in combination with database searches of these substructures provide an efficient method to rapidly identify known natural products. Unfortunately this strategy has been hampered by the lack of publically available and comprehensive natural product databases and open source cheminformatics tools. A new platform, DEREP-NP, has been developed to help solve this problem. DEREP-NP uses the open source cheminformatics program DataWarrior to generate a database containing counts of 65 structural fragments present in 229 358 natural product structures derived from plants, animals, and microorganisms, published before 2013 and freely available in the nonproprietary Universal Natural Products Database (UNPD). By counting the number of times one or more of these structural features occurs in an unknown compound, as deduced from the analysis of its NMR (1H, HSQC, and/or HMBC) and/or MS data, matching structures carrying the same numeric combination of searched structural features can be retrieved from the database. Confirmation that the matching structure is the same compound can then be verified through literature comparison of spectroscopic data. This methodology can be applied to both purified natural products and fractions containing a small number of individual compounds that are often generated as screening libraries. The utility of DEREP-NP has been verified through the analysis of spectra derived from compounds (and fractions containing two or three compounds) isolated from plant, marine invertebrate, and fungal sources. DEREP-NP is freely available at https://github.com/clzani/DEREP-NP and will help to streamline the natural product discovery process.
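
    The count-matching idea in this description can be sketched in a few lines. This is not DEREP-NP or DataWarrior code: the feature names, compound names, and counts below are hypothetical, and the sketch only assumes that a structure matches when its stored count equals the observed count for every searched feature.

```python
def match_by_counts(database, query):
    """Return names of structures whose feature counts equal the query for every searched feature."""
    return [
        name
        for name, counts in database.items()
        # Features absent from a structure's record are treated as a count of 0.
        if all(counts.get(feature, 0) == n for feature, n in query.items())
    ]


# Hypothetical feature-count records (the real database stores 65 fragment counts per structure).
database = {
    "compound_A": {"methyl": 3, "methoxy": 1, "aromatic_CH": 4},
    "compound_B": {"methyl": 3, "methoxy": 2, "aromatic_CH": 4},
    "compound_C": {"methyl": 3, "methoxy": 1, "aromatic_CH": 2},
}

# Counts deduced from spectra for only the features the analyst is sure about.
query = {"methyl": 3, "methoxy": 1}
hits = match_by_counts(database, query)
print(hits)
```

    Searching on a subset of features returns several candidates, which is why the description adds a literature comparison of spectroscopic data as the confirming step.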

  13. NSC29854

    • coconut.naturalproducts.net
    Updated May 16, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). NSC29854 [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0136376.0
    Dataset updated
    May 16, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  14. Data from: Enhanced Bioactivity of Natural Products by Halogenation: A...

    • datasetcatalog.nlm.nih.gov
    • acs.figshare.com
    Updated May 8, 2025
    Cite
    Liao, Qingyi; Li, Jintian; Zhu, Weiliang; Cai, Tingting; Zhang, Qian; Xu, Zhijian; Cao, Ruini; Zhang, Yong; Shao, Mei (2025). Enhanced Bioactivity of Natural Products by Halogenation: A Database Survey and Quantum Chemistry Calculation Study [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002099866
    Dataset updated
    May 8, 2025
    Authors
    Liao, Qingyi; Li, Jintian; Zhu, Weiliang; Cai, Tingting; Zhang, Qian; Xu, Zhijian; Cao, Ruini; Zhang, Yong; Shao, Mei
    Description

    Natural products (NPs) have long been the cornerstone of drug discovery. Halogenated organic NPs are limited, while around one-fourth of approved chemical drugs are organohalogens. This suggests that the introduction of halogens into NPs may enhance their potential for transformation into drugs. In this study, we utilized a matched molecular pair (MMP) approach alongside a database survey to investigate the impact of halogenation on this transformation. The study revealed that halogenation increased the bioactivity of 70.3% of NPs, with 50.3% exhibiting at least a 2-fold enhancement. Halogen bonds (XBs) are prevalent between organohalogens and their targets. To explore whether halogenated NPs could form XBs with their targets, computational studies were performed and demonstrated that halogenated NPs or NP-derived drugs formed strong XBs with their targets, resulting in improved binding affinities. This study highlights the considerable potential of introducing halogens into NPs as a strategic approach for enhancing bioactivity and facilitating the development of drugs.

  15. Additional file 2 of InflamNat: web-based database and predictor of...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Jun 6, 2022
    Cite
    Li, Jin; Shen, Tianze; Dai, Qi; Li, Xiaoli; Ren, Shoupeng; Zhang, Ruihan; Xiao, Weilie (2022). Additional file 2 of InflamNat: web-based database and predictor of anti-inflammatory natural products [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000431535
    Dataset updated
    Jun 6, 2022
    Authors
    Li, Jin; Shen, Tianze; Dai, Qi; Li, Xiaoli; Ren, Shoupeng; Zhang, Ruihan; Xiao, Weilie
    Description

    Additional file 2. Physicochemical properties and cell-based anti-inflammatory activity of InflamNat Compounds.

  16. 16104-96-4

    • coconut.naturalproducts.net
    Updated Sep 23, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). 16104-96-4 [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0129133.0
    Dataset updated
    Sep 23, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  17. Data from: cyclocreatine

    • coconut.naturalproducts.net
    Updated Aug 20, 2024
    Cite
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). cyclocreatine [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0573898.0
    Dataset updated
    Aug 20, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  18. Data from: A collection of molecular formula databases for HERMES

    • data.niaid.nih.gov
    Updated Jul 16, 2021
    Cite
    Roger Giné Bertomeu; Maria Vinaixa Crevillent; Òscar Yanes Torrado (2021). A collection of molecular formula databases for HERMES [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5025559
    Dataset updated
    Jul 16, 2021
    Dataset provided by
    Universitat Rovira i Virgili, IISPV and CIBER
    Universitat Rovira i Virgili and IISPV
    Authors
    Roger Giné Bertomeu; Maria Vinaixa Crevillent; Òscar Yanes Torrado
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A compilation of different molecule databases ready to be used in HERMES. We have compiled different open-access DBs and adapted their format to the HERMES requisite columns. Since all databases share the "Name" and "MolecularFormula" columns, merges between databases can be easily generated.
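
    Because every file shares the "Name" and "MolecularFormula" columns, a merge can be as simple as a de-duplicated union keyed on that pair. The sketch below is one plausible reading of "merge", not the authors' procedure; the function name and the two tiny in-memory tables are illustrative assumptions.

```python
def merge_databases(*tables):
    """Union rows from several tables, keeping one row per (Name, MolecularFormula) pair."""
    merged = {}
    for table in tables:
        for row in table:
            key = (row["Name"], row["MolecularFormula"])
            # Keep the first occurrence; later duplicates are ignored.
            merged.setdefault(key, {"Name": key[0], "MolecularFormula": key[1]})
    return list(merged.values())


# Illustrative stand-ins for two database files loaded as lists of dicts.
db_a = [
    {"Name": "Glucose", "MolecularFormula": "C6H12O6"},
    {"Name": "Caffeine", "MolecularFormula": "C8H10N4O2"},
]
db_b = [
    {"Name": "Caffeine", "MolecularFormula": "C8H10N4O2"},
    {"Name": "Ethanol", "MolecularFormula": "C2H6O"},
]

merged = merge_databases(db_a, db_b)
print(len(merged), "unique compounds")
```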

    More databases and merges will be added in the future. If you have any suggestions or want to contribute, feel free to contact us!

    All rights reserved to the original authors of the databases.

    Description of the files:

    ECMDB.csv: Entries from E. coli Metabolome Database. 3760 compounds.

    Merge_KEGG_ECMDB.csv: a merge of all metabolites from KEGG pathways associated with E. coli K12 and the ECMDB.csv entries above. 6107 compounds.

    Merge_LipidMaps_LipidBlast.csv: a merge between lipid entities from LipidMaps LMSD and the metadata (just Name and Molecular Formula) of LipidBlast entries. 163453 compounds.

    norman.xls: Entries from NORMAN SusDat, containing common and emerging drugs, pollutants, etc. 52019 compounds.

    PubChemLite_31Oct2020.csv: column names adapted from https://zenodo.org/record/4183801. 371,663 compounds related to exposomics.

    MS1_2ID.csv: a merge of HMDB, ChEBI and NORMAN compounds. 183911 compounds related to human metabolism, drugs, etc.

    COCONUT_NP.csv: parsed collection of entries from the COlleCtion of Open Natural ProdUcTs (COCONUT). 406752 compounds.

    DiTriPeptides.csv: a list of all theoretically possible dipeptides (400) and tripeptides (8000) and their associated molecular formulas. 8400 compounds.
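The peptide counts quoted for DiTriPeptides.csv follow from simple enumeration: 20 × 20 = 400 dipeptides and 20 × 20 × 20 = 8000 tripeptides, 8400 in total. The sketch below reproduces that arithmetic, assuming the file is built from the 20 standard amino acids (an assumption consistent with the stated counts, not something the description spells out).

```python
from itertools import product

# One-letter codes of the 20 standard amino acids (assumed alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Every ordered sequence of length 2 and length 3 over the alphabet.
dipeptides = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
tripeptides = ["".join(p) for p in product(AMINO_ACIDS, repeat=3)]

total = len(dipeptides) + len(tripeptides)
print(len(dipeptides), len(tripeptides), total)
```

This counts sequences only; it does not compute molecular formulas, which several distinct sequences can share.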

  19. 3051-84-1

    • coconut.naturalproducts.net
    Updated Sep 23, 2024
    + more versions
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). 3051-84-1 [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0268109.0
    Explore at:
    Dataset updated
    Sep 23, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

  20. DB-083027

    • coconut.naturalproducts.net
    Updated May 17, 2024
    + more versions
    COCONUT - COlleCtion of Open Natural prodUcTs (2024). DB-083027 [Dataset]. https://coconut.naturalproducts.net/compounds/CNP0230767.0
    Explore at:
    Dataset updated
    May 17, 2024
    Dataset authored and provided by
    COCONUT - COlleCtion of Open Natural prodUcTs
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Natural product in the COCONUT database with details of source organisms, geolocations and citations.

Simon Saubern; Alex Shmaylov; Katherine Locock; Don McGilvery; David Collins (2025). Australian Natural Products dataset [Dataset]. http://doi.org/10.25919/v2qx-vp27

Australian Natural Products dataset

Explore at:
19 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 30, 2025
Dataset provided by
CSIRO https://www.csiro.au/
Authors
Simon Saubern; Alex Shmaylov; Katherine Locock; Don McGilvery; David Collins
License

Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
Australia
Dataset funded by
CSIRO https://www.csiro.au/
Description

A continuation of the "Phytochemistry of Australian Plants" database compiled by David Collins and Don McGilvery. It contains chemical structures, references and species names, with persistent identifiers to the literature and to the Atlas of Living Australia (ALA) for geographical distributions. The current curation effort adds DOIs/ISBNs/ISSNs for ~80% of references, persistent IDs linking every species or genus to the ALA or other datasets, and validated structures (SMILES) for ~70% of structures. No new entries have been added since the last update to the original database in 2022. The change log is in the README file.

Data provided here was obtained by the listed authors on linked publications, and these authors may have no association with CSIRO. CSIRO acknowledges that the publications linked here may contain Indigenous Cultural and Intellectual Property (ICIP), including traditional knowledge. CSIRO recognizes that First Nations peoples have the right to control, own and maintain their ICIP in accordance with Article 31 of the United Nations Declaration on the Rights of Indigenous Peoples. Users of this dataset may need to obtain permission from First Nations peoples for use of the information in linked publications. Users intending to collect and use biological specimens containing the compounds described in the dataset may also require permission of First Nations peoples, and may require permits and access permission from landholders. Recognizing that any ICIP in the linked publications is already publicly available but that the publications are not readily accessible by First Nations peoples, CSIRO is committed to finding ways to make the ICIP in these publications more findable and accessible to the First Nations communities from which the knowledge was originally obtained. Users should be aware that because of the historical context of some of the linked publications, they may contain words, descriptions, images or terms which may be culturally sensitive and/or offensive and that reflect authors’ views, or those of the period in which the content was created but may not be considered appropriate today. If First Nations people identify content within this dataset that they consider breaches cultural protocols they are encouraged to contact CSIRO on csiroenquiries@csiro.au or +61 3 9545 2176 to request its removal from the dataset. Please note that while CSIRO is able to administer the data housed within this dataset, this control does not extend to the associated publications. Requests to remove publications should be directed to the associated publishing company. 
Lineage: Original data extracted in 2022 from https://fms05.filemakerstudio.com.au/fmi/webd?homeurl=http://www.monash.edu/#PhytoChem by kind permission of David Collins and Don McGilvery.
