100+ datasets found
  1. NIST Chemical Kinetics Database

    • catalog.data.gov
    • gimi9.com
    • +2more
    Updated Jul 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). NIST Chemical Kinetics Database [Dataset]. https://catalog.data.gov/dataset/nist-chemical-kinetics-database-bee86
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    The NIST Chemical Kinetics Database includes essentially all reported kinetics results for thermal gas-phase chemical reactions. The database is designed to be searched for kinetics data based on the specific reactants involved, for reactions resulting in specified products, for all the reactions of a particular species, or for various combinations of these. In addition, the bibliography can be searched by author name or combination of names. The database contains in excess of 38,000 separate reaction records for over 11,700 distinct reactant pairs. These data have been abstracted from over 12,000 papers with literature coverage through early 2000. Rate constant records for a specified reaction are found by searching the Reaction Database. All rate constant records for that reaction are returned, with a link to 'Details' on that record. Each rate constant record contains the following information (as available): a) Reactants and, if defined, reaction products; b) Rate parameters: A, n, Ea/R, where k = A (T/298)*n exp[-(Ea/R)/T], where T is the temperature in Kelvins; c) Uncertainty in A, n, and Ea/R, if reported; d) Temperature range of experiment or temperature range of validity of a review or theoretical paper; e) Pressure range and bulk gas of the experiment; f) Data type of the record (i.e., experimental, relative rate measurement, theoretical calculation, modeling result, etc.). If the result is a relative rate measurement, then the reaction to which the rate is relative is also given; g) Experimental procedure, including separate fields for the description of the apparatus, the time resolution of the experiment, and the excitation technique. A majority of contemporary chemical kinetics methods are represented. The Kinetics Database is being expanded to include other resources for the convenience of the users. Presently this includes direct links to the corresponding NIST WebBook page for all substances for which such a link is possible. This is indicated by underling and highlighting the species. The WebBook provides thermodynamic, spectral, and other data on the species. Note that the link to the WebBook is opened as a new frame in your browser.

  2. RGD1-CNHO Database

    • figshare.com
    • data.niaid.nih.gov
    hdf
    Updated Nov 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qiyuan Zhao; Brett Savoie; Michael Woulfe; Sai Mahit Vaddadi; Lawal A. Ogunfowora; Sanjay Garimella (2023). RGD1-CNHO Database [Dataset]. http://doi.org/10.6084/m9.figshare.21066901.v9
    Explore at:
    hdfAvailable download formats
    Dataset updated
    Nov 26, 2023
    Dataset provided by
    figshare
    Authors
    Qiyuan Zhao; Brett Savoie; Michael Woulfe; Sai Mahit Vaddadi; Lawal A. Ogunfowora; Sanjay Garimella
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This reaction database is generated along with the manuscript "Comprehensive exploration of graphically defined reaction spaces".RGD1CHNO_AMsmiles.csv contains atom-mapped SMILES, activation energies, and enthalpies of formation for each reaction. RGD!_CHNO.h5 contains the geometry information and can be iterated by a python script from Github (https://github.com/zhaoqy1996/RGD1/parse_data.py). DFT_reaction_info.csv is supplied to reproduce figures in the article.RandP_smiles.txt is a dictionary to map the reactant and product smiles appear in RGD!_CHNO.h5 to a molecule index (molX).RGD1_RPs.h5 provides xtb and DFT optimized geometries of each individual reactant/product molecules. 3D ML models can be trained by combining RGD1_RPs.h5, RGD!_CHNO.h5, and RandP_smiles.txt (see https://github.com/zhaoqy1996/RGD1 for more details)IMPORTANT: We provided an UPDATED VERSION of RGD1 dataset in Ari 24, 2023. The initially posted version of the dataset reported swapped activation energies for ~24% of the forward/reverse reactions which were all corrected in this updated version.

  3. n

    Database of Chemical Compounds and Reactions in Biological Pathways

    • neuinfo.org
    • scicrunch.org
    • +1more
    Updated Aug 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Database of Chemical Compounds and Reactions in Biological Pathways [Dataset]. http://identifiers.org/RRID:SCR_006851
    Explore at:
    Dataset updated
    Aug 11, 2024
    Description

    KEGG LIGAND contains knowledge of chemical substances and reactions that are relevant to life. It is a composite database consisting of COMPOUND, GLYCAN, REACTION, RPAIR, and ENZYME databases, whose entries are identified by C, G, R, RP, and EC numbers, respectively. ENZYME is derived from the IUBMB/IUPAC Enzyme Nomenclature, but the others are internally developed and maintained. The primary database of KEGG LIGAND is a relational database with the KegDraw interface, which is used to generated the secondary (flat file) database for DBGET.

  4. NDRL/NIST Solution Kinetics Database - SRD 40

    • catalog.data.gov
    • data.wu.ac.at
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). NDRL/NIST Solution Kinetics Database - SRD 40 [Dataset]. https://catalog.data.gov/dataset/ndrl-nist-solution-kinetics-database-srd-40-f87a2
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    The NDRL/NIST Solution Kinetics Database contains data on rate constants for solution-phase chemical reactions. The database is designed to be searched by reactants, products, solvents, or any combination of these. In addition, the bibliography may be searched by author name, title words, journal, page(s), and/or year. This is not the same database as the one at Notre Dame, although both databases share a common data source.

  5. n

    Biochemical Pathways Reaction Kinetics Database

    • neuinfo.org
    • scicrunch.org
    • +1more
    Updated Jun 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Biochemical Pathways Reaction Kinetics Database [Dataset]. http://identifiers.org/RRID:SCR_002122
    Explore at:
    Dataset updated
    Jun 14, 2025
    Description

    A database based on the SABIO relational database that contains information about biochemical reactions, their kinetic equations with their parameters, and the experimental conditions under which these parameters were measured. It aims to support modelers in the setting-up of models of biochemical networks, but it is also useful for experimentalists or researchers with interest in biochemical reactions and their kinetics. SABIO-RK contains and merges information about reactions such as reactants and modifiers, organism, tissue and cellular location, as well as the kinetic properties of the reactions. The type of the kinetic mechanism, modes of inhibition or activation, and corresponding rate equations are presented together with their parameters and measured values, specifying the experimental conditions under which these were determined. Links to other databases are provided for users to gather further information and to refer to the original publication. Information about reactions and their kinetic data can be exported to an SBML file. The reaction kinetics data are obtained by manual extraction from literature sources and curated.

  6. Data from: Data-Driven Multi-Objective Optimization Tactics for Catalytic...

    • acs.figshare.com
    zip
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jordan J. Dotson; Lucy van Dijk; Jacob C. Timmerman; Samantha Grosslight; Richard C. Walroth; Francis Gosselin; Kurt Püntener; Kyle A. Mack; Matthew S. Sigman (2023). Data-Driven Multi-Objective Optimization Tactics for Catalytic Asymmetric Reactions Using Bisphosphine Ligands [Dataset]. http://doi.org/10.1021/jacs.2c08513.s001
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    ACS Publications
    Authors
    Jordan J. Dotson; Lucy van Dijk; Jacob C. Timmerman; Samantha Grosslight; Richard C. Walroth; Francis Gosselin; Kurt Püntener; Kyle A. Mack; Matthew S. Sigman
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Optimization of the catalyst structure to simultaneously improve multiple reaction objectives (e.g., yield, enantioselectivity, and regioselectivity) remains a formidable challenge. Herein, we describe a machine learning workflow for the multi-objective optimization of catalytic reactions that employ chiral bisphosphine ligands. This was demonstrated through the optimization of two sequential reactions required in the asymmetric synthesis of an active pharmaceutical ingredient. To accomplish this, a density functional theory-derived database of

    550 bisphosphine ligands was constructed, and a designer chemical space mapping technique was established. The protocol used classification methods to identify active catalysts, followed by linear regression to model reaction selectivity. This led to the prediction and validation of significantly improved ligands for all reaction outputs, suggesting a general strategy that can be readily implemented for reaction optimizations where performance is controlled by bisphosphine ligands.

  7. Z

    Benchmark Data for Chemprop

    • data.niaid.nih.gov
    Updated Nov 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chung, Yunsie (2023). Benchmark Data for Chemprop [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8174267
    Explore at:
    Dataset updated
    Nov 9, 2023
    Dataset provided by
    Heid, Esther
    Wu, Haoyang
    Chung, Yunsie
    Greenman, Kevin P.
    McGill, Charles J.
    Green, William H.
    Li, Shih-Cheng
    Vermeire, Florence H.
    Graff, David E.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets and splits of the manuscript "Chemprop: Machine Learning Package for Chemical Property Prediction." Train, validation and test splits are located within each folder, as well as additional data necessary for some of the benchmarks. To train Chemprop models, refer to our code repository to obtain ready-to-use scripts to train machine learning models for each of the systems. Available benchmarking systems:

    hiv HIV replication inhibition from MoleculeNet and OGB with scaffold splits pcba_random Biological activities from MoleculeNet with random splits (with missing targets filled in with zeros as provided by MoleculeNet) pcba_random_nans Biological activities from MoleculeNet with random splits and data format to match OGB (with missing targets not filled in with zeros) pcba_scaffold Biological activities from OGB with scaffold splits qm9_multitask DFT calculated properties from MoleculeNet and OGB, trained as a multi-task model qm9_u0 DFT calculated properties from MoleculeNet and OGB, trained as a single-task model on the target U0 only qm9_gap DFT calculated properties from MoleculeNet and OGB, trained as a single-task model on the target gap only sampl Water-octanol partition coefficients, used to predict molecules from the SAMPL6, 7 and 9 challenges atom_bond_137k Quantum-mechanical atom and bond descriptors bde Bond dissociation enthalpies trained as single-task model bde_charges Bond dissociation enthalpies trained as multi-task model together with atomic partial charges charges_eps_4 Partial charges at a dielectric constant of 4 (in protein) charges_eps_78 Partial charges at a dielectric constant of 78 (in water) barriers_e2 Reaction barrier heights of E2 reactions barriers_sn2 Reaction barrier heights of SN2 reactions barriers_cycloadd Reaction barrier heights of cycloaddition reactions barriers_rdb7 Reaction barrier heights in the RDB7 dataset barriers_rgd1 Reaction barrier heights in the RGD1-CNHO dataset multi_molecule UV/Vis peak absorption wavelengths in different solvents ir IR Spectra pcqm4mv2 HOMO-LUMO gaps of the PCQM4Mv2 dataset uncertainty_ensemble Uncertainty estimation using an ensemble using the QM9 gap dataset uncertainty_evidential Uncertainty estimation using evidential learning using the QM9 gap dataset uncertainty_mve Uncertainty estimation using mean-variance estimation using the QM9 gap dataset timing Timing benchmark using subsets of QM9 gap Version: This version of the dataset (Version 2) is compatible with all versions of Chemprop (supporting the respective functionality). Version 1 of this dataset is compatible with all versions except Chemprop v.1.6.1, which cannot process the charges_eps_4 and charges_eps_78 datasets (all other benchmarks work as expected). We therefore recommend to always use Version 2 of the dataset (with reformatted charges_eps_4 and charges_eps_78 datasets), since it is compatible with all versions of Chemprop. For use with any other ML software, you can use any version.

  8. Data from: Algorithm for Reaction Classification

    • figshare.com
    • acs.figshare.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hans Kraut; Josef Eiblmaier; Guenter Grethe; Peter Löw; Heinz Matuszczyk; Heinz Saller (2023). Algorithm for Reaction Classification [Dataset]. http://doi.org/10.1021/ci400442f.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    ACS Publications
    Authors
    Hans Kraut; Josef Eiblmaier; Guenter Grethe; Peter Löw; Heinz Matuszczyk; Heinz Saller
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Reaction classification has important applications, and many approaches to classification have been applied. Our own algorithm tests all maximum common substructures (MCS) between all reactant and product molecules in order to find an atom mapping containing the minimum chemical distance (MCD). Recent publications have concluded that new MCS algorithms need to be compared with existing methods in a reproducible environment, preferably on a generalized test set, yet the number of test sets available is small, and they are not truly representative of the range of reactions that occur in real reaction databases. We have designed a challenging test set of reactions and are making it publicly available and usable with InfoChem’s software or other classification algorithms. We supply a representative set of example reactions, grouped into different levels of difficulty, from a large number of reaction databases that chemists actually encounter in practice, in order to demonstrate the basic requirements for a mapping algorithm to detect the reaction centers in a consistent way. We invite the scientific community to contribute to the future extension and improvement of this data set, to achieve the goal of a common standard.

  9. NIST Thermodynamics of Enzyme-Catalyzed Reactions Database - SRD 74

    • catalog.data.gov
    • data.nist.gov
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). NIST Thermodynamics of Enzyme-Catalyzed Reactions Database - SRD 74 [Dataset]. https://catalog.data.gov/dataset/nist-thermodynamics-of-enzyme-catalyzed-reactions-database-srd-74-80158
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    The following information is given for each entry in this database: the reference for the data; the reaction studied; the name of the enzyme used and its Enzyme Commission number; the method of measurement; the conditions of measurement (temperature, pH, ionic strength, and the buffer(s) and cofactor(s) used); the data and an evaluation of it; and, sometimes, commentary on the data and on any corrections which have been applied to it. The absence of a piece of information indicates that it was not found in the paper cited.

  10. Reaction SMILES dataset

    • figshare.com
    txt
    Updated Apr 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rik van der Lingen (2023). Reaction SMILES dataset [Dataset]. http://doi.org/10.6084/m9.figshare.22491730.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Apr 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Rik van der Lingen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reaction SMILES dataset update (now 733K), each line in the file represents a valid reaction SMILES. Source material US patents (2005 - 2016) collection by Daniel Lowe with data enhancement. Source material also includes reaction SMILES drawn from the general literature. Also includes USPTO data from 2022 and 2023. All SMILES are valid by RDKit. Also see https://kmt.vander-lingen.nl

  11. Z

    USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical...

    • data.niaid.nih.gov
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xu, Hongteng (2024). USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11464251
    Explore at:
    Dataset updated
    Dec 12, 2024
    Dataset provided by
    Xu, Hongteng
    Yuan, Shen
    Gong, Shukai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    USPTO-LLM is an information-enriched chemical reaction dataset that provides more side information (reaction conditions and reaction steps division) for developing new reaction prediction and retrosynthesis methods and inspires new problems, such as reaction condition prediction. It comprises over 247K chemical reactions extracted from the patent documents of USPTO (United States Patent and Trademark Office), encompassing abundant information on reaction conditions.

    We employ large language models to expedite the data collection procedures automatically with a reliable quality control process. The extracted chemical reactions are organized as heterogeneous directed graphs, allowing us to formulate a series of prediction tasks, such as reaction prediction, retrosynthesis, and reaction condition prediction, in a unified graph-filling framework.

  12. Canada Vigilance Adverse Reaction Online Database

    • open.canada.ca
    • ouvert.canada.ca
    • +1more
    html, json, xml, zip
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Health Canada (2025). Canada Vigilance Adverse Reaction Online Database [Dataset]. https://open.canada.ca/data/en/dataset/9cbaef00-b52c-4a70-9fed-d9aa8263ab74
    Explore at:
    json, xml, html, zipAvailable download formats
    Dataset updated
    May 28, 2025
    Dataset provided by
    Health Canadahttp://www.hc-sc.gc.ca/
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Area covered
    Canada
    Description

    The data extract is a series of compressed ASCII text files of the full data set contained in the Canada Vigilance Adverse Reaction Online Database. It is intended for users who are familiar with database structures and setting up their own queries. Find details on the data structure required for the data file in the Canada Vigilance Adverse Reaction Online Database - Data Structure. In order to use the data, the file must be loaded into an existing database or information system provided by the user. The Canada Vigilance Adverse Reaction Online Database contains information about suspected adverse reactions (also known as side effects) to health products, captured from adverse reaction reports submitted to Health Canada by consumers and health professionals, who submit reports voluntarily, as well as by market authorization holders (manufacturers and distributors), who are required to submit reports according to the Food and Drugs Regulations. Information concerning vaccines used for immunization have only been included in the database since January 1, 2011. Indication data has recently been added to the data extract files and the Detailed Adverse Reaction Report. Indication refers to the particular condition for which a health product was taken. For example, diabetes is an indication for insulin. Health products are often authorised for use in treating more than one indication. Note: The database cannot be used on its own to evaluate a health product's safety profile. It does not provide conclusive information on the safety of health products, and is not a substitute for medical advice. Should you have an issue of medical concern, consult a qualified health professional.

  13. Z

    S73 | METXBIODB | Metabolite Reaction Database from BioTransformer

    • data.niaid.nih.gov
    • zenodo.org
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Djoumbou-Feunang, Yannick (2024). S73 | METXBIODB | Metabolite Reaction Database from BioTransformer [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4056560
    Explore at:
    Dataset updated
    Aug 6, 2024
    Dataset provided by
    Zhang, Jeff
    Djoumbou-Feunang, Yannick
    Wishart, David S.
    Schymanski, Emma
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the collection associated with list S73 MetXBioDB Metabolite Reaction Database from BioTransformer on the NORMAN Suspect List Exchange.

    https://www.norman-network.com/nds/SLE/

    This dataset is extracted from the database behind BioTransformer (http://biotransformer.ca/) by Yannick Djoumbou-Feunang, David S. Wishart and colleagues, for addition to the PubChem Transformations section. Change logs and version tracking at the ECI GitLab site.

    Please cite the BioTransformer article when using this set: https://jcheminf.biomedcentral.com/articles/10.1186/s13321-018-0324-5

    NOTE: This deposition is work in progress ...

    Change log: 13 Oct: added InChIKey file. 16 Oct: updated substances with missing CIDs and transformations. 5/11 many bug fixes finally committed, added DTXSIDs. 22/6/2023 adjusted one CID that changed upon PubChem standardization. 15 Nov 2023: fixed typo in reaction description. 26 Feb 2024: corrected name for CID 65564. 6 Aug 2024: fixed many triazine synonyms.

  14. n

    SCRIPDB

    • neuinfo.org
    • scicrunch.org
    • +1more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). SCRIPDB [Dataset]. http://identifiers.org/RRID:SCR_008922
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A database of chemicals and reactions inside of US patents (2001 - 2011). SCRIPDB provides the full original patent text, reactions and relationships described within any individual patent, in addition to the molecular files common to structural databases. The patent literature is a rich catalog of biologically relevant chemicals; many public and commercial molecular databases contain the structures disclosed in patent claims. However, patents are an equally rich source of metadata about bioactive molecules, including mechanism of action, disease class, homologous experimental series, structural alternatives, or the synthetic pathways used to produce molecules of interest. Unfortunately, this metadata is discarded when chemical structures are deposited separately in databases. SCRIPDB is a chemical structure database designed to make this metadata accessible. The SCRIPDB information is valuable in medical text mining, chemical image analysis, reaction extraction and in silico pharmaceutical lead optimization. SCRIPDB may be searched by exact chemical structure, substructure or molecular similarity and the results may be restricted to patents describing synthetic routes.

  15. d

    Evaluated Nuclear Data File

    • datadiscoverystudio.org
    resource url
    Updated 1963
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (1963). Evaluated Nuclear Data File [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/28b9cdc95e9747a6b632ea1b2dfa1889/html
    Explore at:
    resource urlAvailable download formats
    Dataset updated
    1963
    Area covered
    Description

    Link Function: information

  16. d

    RHEA

    • dknet.org
    • scicrunch.org
    Updated Aug 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). RHEA [Dataset]. http://identifiers.org/RRID:SCR_004713/resolver/mentions?q=&i=rrid
    Explore at:
    Dataset updated
    Aug 15, 2024
    Description

    Manually annotated reaction database where all reaction participants (reactants and products) are linked to the ChEBI database (Chemical Entities of Biological Interest) which provides detailed information about structure, formula and charge. Rhea provides built-in validations that ensure both elemental and charge balance of the reactions. The database has been populated with the reactions found in the Enzyme Commission (EC) list (and in the IntEnz and ENZYME databases), extending it with additional known reactions of biological interest. While the main focus of Rhea is enzyme-catalyzed reactions, other biochemical reactions are also included. Rhea is a manually annotated resource and it provides: stable reaction identifiers for each of its reactions; directionality information if the physiological direction of the reaction is known; the possibility to link several reactions together to form overall reactions; extensive cross-references to other resources including enzyme-catalyzed and other metabolic reactions, such as the EC list (in IntEnz), KEGG, MetaCyc and UniPathway; and chemical substructure and similarity searches on compounds in Rhea.

  17. f

    Reaction SMILES CRD 1.37M dataset

    • figshare.com
    zip
    Updated Jan 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rik van der Lingen (2025). Reaction SMILES CRD 1.37M dataset [Dataset]. http://doi.org/10.6084/m9.figshare.28230053.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 17, 2025
    Dataset provided by
    figshare
    Authors
    Rik van der Lingen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Collection of reaction SMILES (reactants, reagents, solvents, products) 1.37M lines total from patent literature (USPTO 1976 - 2024) and from academic literature (2.5% total). Data converted from existing USPTO dataset 1] and data generated by parsing by custom design. Data extraction by OSCAR (semantic) or ChatGPT (LLM), molecule identification by OPSIN and custom synonym list. All SMILES are RDKit-safe with duplicate reactions removed. Please note that the data have been collected in an semi-automated process, the dataset is certainly not without errors.More information on https://kmt.vander-lingen.nl.1] Chemical reactions from US patents (1976-Sep2016), Daniel Lowe. Link.

  18. P

    USPTO-50k Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Mar 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nadine Schneider; Nikolaus Stiefl; Gregory A. Landrum (2022). USPTO-50k Dataset [Dataset]. https://paperswithcode.com/dataset/uspto-50k
    Explore at:
    Dataset updated
    Mar 21, 2022
    Authors
    Nadine Schneider; Nikolaus Stiefl; Gregory A. Landrum
    Description

    Subset and preprocessed version of Chemical reactions from US patents (1976-Sep2016) by Daniel Lowe. It includes 50K randomly selected reactions that was later classified into 10 reaction classes by Nadine Schneider et al.

  19. o

    Data from: Reaction Mechanism Generator: Automatic construction of chemical...

    • explore.openaire.eu
    • data.mendeley.com
    Updated Nov 29, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Connie W. Gao (2019). Reaction Mechanism Generator: Automatic construction of chemical kinetic mechanisms [Dataset]. http://doi.org/10.17632/5n9vprvyhz.1
    Explore at:
    Dataset updated
    Nov 29, 2019
    Authors
    Connie W. Gao
    Description

    Abstract Reaction Mechanism Generator (RMG) constructs kinetic models composed of elementary chemical reaction steps using a general understanding of how molecules react. Species thermochemistry is estimated through Benson group additivity and reaction rate coefficients are estimated using a database of known rate rules and reaction templates. At its core, RMG relies on two fundamental data structures: graphs and trees. Graphs are used to represent chemical structures, and trees are used to represent ... Title of program: RMG Catalogue Id: AEZW_v1_0 Nature of problem Automatic generation of chemical kinetic mechanisms for molecules containing C, H, O, S, and N. Versions of this program held in the CPC repository in Mendeley Data AEZW_v1_0; RMG; 10.1016/j.cpc.2016.02.013 This program has been imported from the CPC Program Library held at Queen's University Belfast (1969-2018)

  20. f

    Data from: General reactive machine learning potentials for CHON elements

    • figshare.com
    application/x-gzip
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bowen li (2025). General reactive machine learning potentials for CHON elements [Dataset]. http://doi.org/10.6084/m9.figshare.29311196.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Jun 13, 2025
    Dataset provided by
    figshare
    Authors
    bowen li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data of the article "General reactive machine learning potentials for CHON elements"

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Institute of Standards and Technology (2022). NIST Chemical Kinetics Database [Dataset]. https://catalog.data.gov/dataset/nist-chemical-kinetics-database-bee86
Organization logo

NIST Chemical Kinetics Database

Explore at:
Dataset updated
Jul 29, 2022
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description

The NIST Chemical Kinetics Database includes essentially all reported kinetics results for thermal gas-phase chemical reactions. The database is designed to be searched for kinetics data based on the specific reactants involved, for reactions resulting in specified products, for all the reactions of a particular species, or for various combinations of these. In addition, the bibliography can be searched by author name or combination of names. The database contains in excess of 38,000 separate reaction records for over 11,700 distinct reactant pairs. These data have been abstracted from over 12,000 papers with literature coverage through early 2000. Rate constant records for a specified reaction are found by searching the Reaction Database. All rate constant records for that reaction are returned, with a link to 'Details' on that record. Each rate constant record contains the following information (as available): a) Reactants and, if defined, reaction products; b) Rate parameters: A, n, Ea/R, where k = A (T/298)*n exp[-(Ea/R)/T], where T is the temperature in Kelvins; c) Uncertainty in A, n, and Ea/R, if reported; d) Temperature range of experiment or temperature range of validity of a review or theoretical paper; e) Pressure range and bulk gas of the experiment; f) Data type of the record (i.e., experimental, relative rate measurement, theoretical calculation, modeling result, etc.). If the result is a relative rate measurement, then the reaction to which the rate is relative is also given; g) Experimental procedure, including separate fields for the description of the apparatus, the time resolution of the experiment, and the excitation technique. A majority of contemporary chemical kinetics methods are represented. The Kinetics Database is being expanded to include other resources for the convenience of the users. Presently this includes direct links to the corresponding NIST WebBook page for all substances for which such a link is possible. This is indicated by underling and highlighting the species. The WebBook provides thermodynamic, spectral, and other data on the species. Note that the link to the WebBook is opened as a new frame in your browser.

Search
Clear search
Close search
Google apps
Main menu