100+ datasets found
  1. m

    Massive Atomic Diversity: a compact universal dataset for atomistic machine...

    • archive.materialscloud.org
    application/gzip, bin +2
    Updated Jun 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti; Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti (2025). Massive Atomic Diversity: a compact universal dataset for atomistic machine learning [Dataset]. http://doi.org/10.24435/materialscloud:vd-e8
    Explore at:
    bin, xyz, application/gzip, text/markdownAvailable download formats
    Dataset updated
    Jun 26, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti; Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The development of machine-learning models for atomic-scale simulations has benefitted tremendously from the large databases of materials and molecular properties computed in the past two decades using electronic-structure calculations. More recently, these databases have made it possible to train “universal” models that aim at making accurate predictions for arbitrary atomic geometries and compositions. The construction of many of these databases was however in itself aimed at materials discovery, and therefore targeted primarily to sample stable, or at least plausible, structures and to make the most accurate predictions for each compound – e.g. adjusting the calculation details to the material at hand. Here we introduce a dataset designed specifically to train models that can provide reasonable predictions for arbitrary structures, and that therefore follows a different philosophy. Starting from relatively small sets of stable structures, the dataset is built to contain “massive atomic diversity” (MAD) by aggressively distorting these configurations, with near-complete disregard for the stability of the resulting configurations. The electronic structure details, on the other hand, are chosen to maximize consistency rather than to obtain the most accurate prediction for
    a given structure, or to minimize computational effort. The MAD dataset we present here, despite containing fewer than 100k structures, has already been shown to enable training universal interatomic potentials that are competitive with models trained on traditional datasets with two to three orders of magnitude more structures. We describe in detail the philosophy and details of the construction of the MAD dataset. We also introduce a low-dimensional structural latent space that allows us to compare it with other popular datasets, and that can also be used as a general-purpose materials cartography tool.

  2. m

    The Materials Cloud 2D database (MC2D)

    • archive.materialscloud.org
    • materialscloud-archive-failover.cineca.it
    bin, json, pdf +3
    Updated Jun 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Davide Campi; Nicolas Mounet; Marco Gibertini; Giovanni Pizzi; Nicola Marzari; Davide Campi; Nicolas Mounet; Marco Gibertini; Giovanni Pizzi; Nicola Marzari (2022). The Materials Cloud 2D database (MC2D) [Dataset]. http://doi.org/10.24435/materialscloud:36-nd
    Explore at:
    zip, txt, pdf, bin, json, text/markdownAvailable download formats
    Dataset updated
    Jun 24, 2022
    Dataset provided by
    Materials Cloud
    Authors
    Davide Campi; Nicolas Mounet; Marco Gibertini; Giovanni Pizzi; Nicola Marzari; Davide Campi; Nicolas Mounet; Marco Gibertini; Giovanni Pizzi; Nicola Marzari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Two-dimensional (2D) materials are among the most promising candidates for beyond silicon electronic and optoelectronic applications. Recently, their recognized importance, sparked a race to discover and characterize new 2D materials. Within few years the number of experimentally exfoliated or synthesized 2D materials went from a couple of dozens to few hundreds while the number theoretically predicted compounds reached a few thousands. In 2018 we first contributed to this effort with the identification of 1825 compounds that are either easily (1036) or potentially (789) exfoliable from experimentally known 3D compounds. In the present work we report on the new materials recently added to the 2D-portfolio thanks to the extension of the screening to an additional experimental database (MPDS) as well as the most up-to-date versions of the two databases (ICSD and COD) used in our previous work. This expansion led to the discovery of an additional 1252 unique monolayers bringing the total to 3077 compounds and, notably, almost doubling the number of easily exfoliable materials (2004). Moreover, we optimized the structural properties of all the materials (regardless of their binding energy or number of atoms in the unit cell) as isolated mono-layer and explored their electronic band structure. This archive entry contains the database of 2D materials in particular it contains the structural parameters for all the 3077 structures of the global Material Cloud 2D database as extracted from their bulk 3D parent, 2710 optimized 2D structures and 2345 electronic band structure together with the provenance of all data and calculations as stored by AiiDA.

  3. m

    Complexity of many-body interactions in transition metals via...

    • archive.materialscloud.org
    text/markdown, zip
    Updated Mar 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cameron Owen; Steven Torrisi; Yu Xie; Simon Batzner; Kyle Bystrom; Jennifer Coulter; Albert Musaelian; Lixin Sun; Boris Kozinsky; Cameron Owen; Steven Torrisi; Yu Xie; Simon Batzner; Kyle Bystrom; Jennifer Coulter; Albert Musaelian; Lixin Sun; Boris Kozinsky (2024). Complexity of many-body interactions in transition metals via machine-learned force fields from the TM23 data set [Dataset]. http://doi.org/10.24435/materialscloud:6c-b3
    Explore at:
    zip, text/markdownAvailable download formats
    Dataset updated
    Mar 22, 2024
    Dataset provided by
    Materials Cloud
    Authors
    Cameron Owen; Steven Torrisi; Yu Xie; Simon Batzner; Kyle Bystrom; Jennifer Coulter; Albert Musaelian; Lixin Sun; Boris Kozinsky; Cameron Owen; Steven Torrisi; Yu Xie; Simon Batzner; Kyle Bystrom; Jennifer Coulter; Albert Musaelian; Lixin Sun; Boris Kozinsky
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This work examines challenges associated with the accuracy of machine-learned force fields (MLFFs) for bulk solid and liquid phases of d-block elements. In exhaustive detail, we contrast the performance of force, energy, and stress predictions across the transition metals for two leading MLFF models: a kernel-based atomic cluster expansion method implemented using sparse Gaussian processes (FLARE), and an equivariant message-passing neural network (NequIP). Early transition metals present higher relative errors and are more difficult to learn relative to late platinum- and coinage-group elements, and this trend persists across model architectures. Trends in complexity of interatomic interactions for different metals are revealed via comparison of the performance of representations with different many-body order and angular resolution. Using arguments based on perturbation theory on the occupied and unoccupied d states near the Fermi level, we determine that the large, sharp d density of states both above and below the Fermi level in early transition metals leads to a more complex, harder-to-learn potential energy surface for these metals. Increasing the fictitious electronic temperature (smearing) modifies the angular sensitivity of forces and makes the early transition metal forces easier to learn. This work illustrates challenges in capturing intricate properties of metallic bonding with current leading MLFFs and provides a reference data set for transition metals, aimed at benchmarking the accuracy and improving the development of emerging machine-learned approximations.

  4. c

    Materials Cloud three-dimensional crystals database (MC3D)

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    bin, text/markdown +1
    Updated Mar 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastiaan Huber; Marnik Bercx; Nicolas Hörmann; Martin Uhrin; Giovanni Pizzi; Nicola Marzari; Sebastiaan Huber; Marnik Bercx; Nicolas Hörmann; Martin Uhrin; Giovanni Pizzi; Nicola Marzari (2022). Materials Cloud three-dimensional crystals database (MC3D) [Dataset]. http://doi.org/10.24435/materialscloud:rw-t0
    Explore at:
    zip, text/markdown, binAvailable download formats
    Dataset updated
    Mar 12, 2022
    Dataset provided by
    Materials Cloud
    Authors
    Sebastiaan Huber; Marnik Bercx; Nicolas Hörmann; Martin Uhrin; Giovanni Pizzi; Nicola Marzari; Sebastiaan Huber; Marnik Bercx; Nicolas Hörmann; Martin Uhrin; Giovanni Pizzi; Nicola Marzari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Materials Cloud three-dimensional database is a curated set of relaxed three-dimensional crystal structures based on raw CIF data taken from the external experimental databases MPDS, COD and ICSD. The raw CIF data have been imported, cleaned and parsed into a crystal structure; their ground-state has been computed using the SIRIUS-enabled pw.x code of the Quantum ESPRESSO distribution, and tight tolerance criteria for the calculations using the SSSP protocols.

    This entire procedure is encoded into an AiiDA workflow which automates the process while keeping full data provenance. Here, since the original source data of the ICSD and MPDS databases are copyrighted, only the provenance of the final SCF calculation on the relaxed structures can be made publicly available.

    The MC3D ID numbers come from a list of unique "parent" stoichiometric structures that has been created and curated from a collection of these experimental databases. Once a parent structure has been optimized using density-functional theory, it is made public and added to the online Discover section of the Materials Cloud (as mentioned, copyright might prevent publishing the original parent). Note that since not all structures have been calculated, some ID numbers are missing from the public version of the database. The full ID of each structure also contains as an appended modifier the functional that was used in the calculations. Since the ID number points to the same unique parent, mc3d-1234/pbe and mc3d-1234/pbesol have the same starting point, but have been then relaxed according to their respective functionals.

  5. m

    Polymer descriptor data set for machine learning prediction of specific heat...

    • archive.materialscloud.org
    • materialscloud-archive-failover.cineca.it
    csv, text/markdown +2
    Updated Jun 19, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rahul Bhowmik; Rahul Bhowmik (2020). Polymer descriptor data set for machine learning prediction of specific heat [Dataset]. http://doi.org/10.24435/materialscloud:18-nr
    Explore at:
    text/markdown, csv, txt, zipAvailable download formats
    Dataset updated
    Jun 19, 2020
    Dataset provided by
    Materials Cloud
    Authors
    Rahul Bhowmik; Rahul Bhowmik
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We have developed a polymer descriptor data set from the existing data. The data set has 188 descriptors which describe polymer atomic and molecular behavior. The descriptors are mapped to the specific heat of polymers using supervised and unsupervised machine learning approaches. The mapping helps predict the specific heat of polymers at room temperature. The descriptor data set is useful in synthesizing novel polymers with desired heat capacities.

  6. TCSP2.0_database

    • figshare.com
    application/gzip
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lai Wei (2025). TCSP2.0_database [Dataset]. http://doi.org/10.6084/m9.figshare.28379060.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Feb 10, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Lai Wei
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    TCSP 2.0 templte database, it includes the Materials Project (MP) database, Materials Cloud database (both 2D and 3D), The Computational 2D Materials Database (C2DB), and Graph Networks for Materials Science database(GNoME).

  7. c

    Data from: HTA - An open-source software for assigning heads and tails to...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    csv, text/markdown
    Updated Jul 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brenda de Souza Ferrari; Ronaldo Giro; Mathias B. Steiner; Brenda de Souza Ferrari; Ronaldo Giro; Mathias B. Steiner (2025). HTA - An open-source software for assigning heads and tails to SMILES in polymerization reactions [Dataset]. http://doi.org/10.24435/materialscloud:2x-9j
    Explore at:
    text/markdown, csvAvailable download formats
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Brenda de Souza Ferrari; Ronaldo Giro; Mathias B. Steiner; Brenda de Souza Ferrari; Ronaldo Giro; Mathias B. Steiner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Polymers are versatile materials with a wide range of applications. The improvement of polymer properties rises the importance on the way that the repeating units are connected (head-to-tail,head-to-head,tail-to-tail) to build the polymer structure since it directly influences the morphology, chain topology and consequently its properties. Artificial intelligence (AI) based approaches are beginning to impact several domains of human life, science and technology. Polymer informatics is one such domain where AI and machine learning (ML) tools are being used in the efficient development, design and discovery of polymer. One key enabling factor for the essential foundations for Polymer Informatics is the machine-readable polymer representation. Polymer have been represented in a string format with special characters used to tag the head and tail positions indicating where the linking bond happens between repeat units. Available tools to assign the head and tail position limits its applicability in a broad sense. In this work we show a new tool to assign the head and tail atoms for a given monomer. From a database of 206 polymer precursors curated from the literature, our algorithm correctly predicted the class of 201 data points, which represents 97.6% of accuracy and regarding the the head and tail assignment, correctly assigned the positions for 188 data points, which translates to 91.3% of accuracy.

  8. c

    Dataset of 80,000 solvated joint-DFT free energies for ORR on spinel-oxide...

    • materialscloud-archive-failover.cineca.it
    text/markdown, zip
    Updated Apr 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Colin Bundschu; Colin Bundschu (2025). Dataset of 80,000 solvated joint-DFT free energies for ORR on spinel-oxide (100) and (111) facets [Dataset]. http://doi.org/10.24435/materialscloud:ka-8a
    Explore at:
    zip, text/markdownAvailable download formats
    Dataset updated
    Apr 30, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Colin Bundschu; Colin Bundschu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Earth-abundant spinel oxides are promising alkaline oxygen-reduction catalysts, yet mechanistic models still invoke a vacuum-DFT associative *OOH/*OO route. Here we combine >80,000 fully solvated joint-DFT calculations to map oxygen-reduction energetics across 442 Al-, Co-, Cr-, Fe-, Ga-, Mn-, Ni- and Zn-containing spinels on the (100) and (111) facets. Associative intermediates are >0.5 eV less stable than *OH/H states, revealing a four-step dissociative cycle in which surface-hydrogen passivation shuttles protons. These results establish solvated high-throughput DFT as a predictive lens on the kinetic oxygen-reduction limit of spinel oxides.

  9. c

    One dimensional edge localized YSR states in CrCl₃ on NbSe₂

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    bin, png +2
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan P. Cuperus; Arnold H. Kole; Andrés R. Botello-Méndez; Zeila Zanolli; Daniel Vanmaekelbergh; Ingmar Swart; Jan P. Cuperus; Arnold H. Kole; Andrés R. Botello-Méndez; Zeila Zanolli; Daniel Vanmaekelbergh; Ingmar Swart (2025). One dimensional edge localized YSR states in CrCl₃ on NbSe₂ [Dataset]. http://doi.org/10.24435/materialscloud:1b-aa
    Explore at:
    bin, text/markdown, png, txtAvailable download formats
    Dataset updated
    Jan 22, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Jan P. Cuperus; Arnold H. Kole; Andrés R. Botello-Méndez; Zeila Zanolli; Daniel Vanmaekelbergh; Ingmar Swart; Jan P. Cuperus; Arnold H. Kole; Andrés R. Botello-Méndez; Zeila Zanolli; Daniel Vanmaekelbergh; Ingmar Swart
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Magnet/superconductor hybrid systems have been put forward as a platform for realizing topological superconductivity. We investigated the heterostructure of ferromagnetic monolayer CrCl₃ and superconducting NbSe₂. Using low-temperature scanning tunneling microscopy, we observe topologically trivial Yu-Shiba-Rusinov (YSR) states localized at the edge of CrCl₃ islands. DFT simulations reveal that the Cr atoms at the edge have an enhanced d-orbital DOS close to the Fermi energy. This leads to an exchange coupling between these atoms and the substrate that rationalizes the edge-localization of the YSR states.

    This dataset contains the first-principles calculations performed on a nanoribbon of CrCl₃ on NbSe₂ and the associated notebooks used to generate figures from this data.

  10. m

    Tunable topological Dirac surface states and van Hove singularities in...

    • archive.materialscloud.org
    bin, text/markdown
    Updated Sep 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yong Hu; Xianxin Wu; Yongqi Yang; Shunye Gao; Nicholas C. Plumb; Andreas P. Schnyder; Weiwei Xie; Junzhang Ma; Ming Shi; Yong Hu; Xianxin Wu; Yongqi Yang; Shunye Gao; Nicholas C. Plumb; Andreas P. Schnyder; Weiwei Xie; Junzhang Ma; Ming Shi (2022). Tunable topological Dirac surface states and van Hove singularities in kagome metal GdV6Sn6 [Dataset]. http://doi.org/10.24435/materialscloud:64-3c
    Explore at:
    text/markdown, binAvailable download formats
    Dataset updated
    Sep 26, 2022
    Dataset provided by
    Materials Cloud
    Authors
    Yong Hu; Xianxin Wu; Yongqi Yang; Shunye Gao; Nicholas C. Plumb; Andreas P. Schnyder; Weiwei Xie; Junzhang Ma; Ming Shi; Yong Hu; Xianxin Wu; Yongqi Yang; Shunye Gao; Nicholas C. Plumb; Andreas P. Schnyder; Weiwei Xie; Junzhang Ma; Ming Shi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Transition-metal-based kagome materials at van Hove filling are a rich frontier for the investigation of novel topological electronic states and correlated phenomena. To date, in the idealized two-dimensional kagome lattice, topologically Dirac surface states (TDSSs) have not been unambiguously observed, and the manipulation of TDSSs and van Hove singularities (VHSs) remains largely unexplored. Here, we reveal TDSSs originating from a Z2 bulk topology and identify multiple VHSs near the Fermi level (EF) in magnetic kagome material GdV6Sn6. Using in situ surface potassium deposition, we successfully realize manipulation of the TDSSs and VHSs. The Dirac point of the TDSSs can be tuned from above to below EF, which reverses the chirality of the spin texture at the Fermi surface. These results establish GdV6Sn6 as a fascinating platform for studying the nontrivial topology, magnetism, and correlation effects native to kagome lattices. They also suggest potential application of spintronic devices based on kagome materials.

  11. c

    TopoMat: a database of high-throughput first-principles calculations of...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    application/gzip +2
    Updated May 15, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    QuanSheng Wu; Gabriel Autès; Nicolas Mounet; Oleg V. Yazyev; QuanSheng Wu; Gabriel Autès; Nicolas Mounet; Oleg V. Yazyev (2019). TopoMat: a database of high-throughput first-principles calculations of topological materials [Dataset]. http://doi.org/10.24435/materialscloud:2019.0019/v1
    Explore at:
    text/markdown, application/gzip, txtAvailable download formats
    Dataset updated
    May 15, 2019
    Dataset provided by
    Materials Cloud
    Authors
    QuanSheng Wu; Gabriel Autès; Nicolas Mounet; Oleg V. Yazyev; QuanSheng Wu; Gabriel Autès; Nicolas Mounet; Oleg V. Yazyev
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    We present a database of topological materials predicted from high-throughput first-principles calculations. The database contains electronic band structures and topological indices of 13628 materials calculated on experimental crystal structures taken from the Inorganic Crystal Structure Database (ICSD) and the Crystallography Open Database (COD). The calculations have been performed on non-magnetic phases taking into account the spin-orbit interactions using the Quantum ESPRESSO package. The Fu-Kane method and the Wannier charge center method implemented in the Z2pack code have been utilized to calculate the Z2 topological numbers of centrosymmetric and non-centrosymmetric materials, respectively. Over 4000 topologically non-trivial materials have been identified.

  12. c

    Data from: Exploring the magnetic landscape of easily-exfoliable...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    text/markdown, txt +1
    Updated May 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fatemeh Haddadi; Davide Campi; Flaviano Dos Santos; Nicolas Mounet; Louis Ponet; Nicola Marzari; Marco Gibertini; Fatemeh Haddadi; Davide Campi; Flaviano Dos Santos; Nicolas Mounet; Louis Ponet; Nicola Marzari; Marco Gibertini (2025). Exploring the magnetic landscape of easily-exfoliable two-dimensional materials [Dataset]. http://doi.org/10.24435/materialscloud:m7-m9
    Explore at:
    zip, text/markdown, txtAvailable download formats
    Dataset updated
    May 26, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Fatemeh Haddadi; Davide Campi; Flaviano Dos Santos; Nicolas Mounet; Louis Ponet; Nicola Marzari; Marco Gibertini; Fatemeh Haddadi; Davide Campi; Flaviano Dos Santos; Nicolas Mounet; Louis Ponet; Nicola Marzari; Marco Gibertini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Magnetic materials often exhibit complex energy landscapes with multiple local minima, each corresponding to a self-consistent electronic structure solution. Finding the global minimum is challenging, and heuristic methods are not always guaranteed to succeed. We apply an automated workflow to systematically explore the energy landscape of 194 magnetic monolayers from the Materials Cloud 2D crystals database and determine their ground-state magnetic order. Our approach enables effective control and sampling of orbital occupation matrices, allowing rapid identification of local minima. We reveal a diverse set of self-consistent collinear metastable states, further enriched by Hubbard-corrected energy functionals with U parameters computed from first principles using linear response theory. We categorize the monolayers by their magnetic ordering and highlight promising candidates for applications.

  13. Materials Cloud, An Open Science Portal for FAIR Data Sharing

    • figshare.com
    mp4
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scientific Data; Aliaksandr V. Yakutovich (2023). Materials Cloud, An Open Science Portal for FAIR Data Sharing [Dataset]. http://doi.org/10.6084/m9.figshare.7611347.v1
    Explore at:
    mp4Available download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Scientific Data; Aliaksandr V. Yakutovich
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    20 minute lightning talk presentation given by Aliaksandr Yakutovich, from École Polyechnique Fédérale de Lausanne, at the Better Science through Better Data 2018 event. The video recording and scribe are included.

  14. c

    Machine learning on multiple topological materials datasets

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    application/gzip, bin +1
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuqing He; Pierre-Paul De Breuck; Hongming Weng; Matteo Giantomassi; Gian-Marco Rignanese; Yuqing He; Pierre-Paul De Breuck; Hongming Weng; Matteo Giantomassi; Gian-Marco Rignanese (2025). Machine learning on multiple topological materials datasets [Dataset]. http://doi.org/10.24435/materialscloud:zk-gc
    Explore at:
    bin, application/gzip, text/markdownAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Yuqing He; Pierre-Paul De Breuck; Hongming Weng; Matteo Giantomassi; Gian-Marco Rignanese; Yuqing He; Pierre-Paul De Breuck; Hongming Weng; Matteo Giantomassi; Gian-Marco Rignanese
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A dataset of 35,608 materials with their topological properties is constructed by combining the density functional theory (DFT) results of Materiae and the Topological Materials Database. Thanks to this, machine-learning approaches are developed to categorize materials into five distinct topological types, with the XGBoost model achieving an impressive 85.2% classification accuracy. By conducting generalization tests on different sub-datasets, differences are identified between the original datasets in terms of topological types, chemical elements, unknown magnetic compounds, and feature space coverage. Their impact on model performance is analyzed. Turning to the simpler binary classification between trivial insulators and nontrivial topological materials, three different approaches are also tested. Key characteristics influencing material topology are identified, with the maximum packing efficiency and the fraction of p valence electrons being highlighted as critical features.

  15. c

    Data from: Crystal-graph attention networks for the prediction of stable...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    application/gzip +3
    Updated Dec 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Schmidt; Love Pettersson; Claudio Verdozzi; Silvana Botti; Miguel Marques; Jonathan Schmidt; Love Pettersson; Claudio Verdozzi; Silvana Botti; Miguel Marques (2021). Crystal-graph attention networks for the prediction of stable materials [Dataset]. http://doi.org/10.24435/materialscloud:j9-bf
    Explore at:
    text/markdown, txt, application/gzip, xzAvailable download formats
    Dataset updated
    Dec 16, 2021
    Dataset provided by
    Materials Cloud
    Authors
    Jonathan Schmidt; Love Pettersson; Claudio Verdozzi; Silvana Botti; Miguel Marques; Jonathan Schmidt; Love Pettersson; Claudio Verdozzi; Silvana Botti; Miguel Marques
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Graph neural networks have enjoyed great success in the prediction of material properties for both molecules and crystals. These networks typically use the atomic positions (usually expanded in a Gaussian basis) and the atomic species as input. Unfortunately, this information is in general not available when predicting new materials, for which the precise geometrical information is unknown. In this work, we circumvent this problem by predicting the thermodynamic stability of crystal structures without using the knowledge of the precise bond distances. We replace this information with embeddings of graph distances, allowing our networks to be used directly in high-throughput studies based on both composition and crystal structure prototype. Using these embeddings, we combine the newest developments in graph neural networks and apply them to the prediction of the distances to the convex hull. To train these networks, we curate a dataset of over 2 million density-functional calculations of crystals with consistent calculation parameters from various sources. The new dataset allows for the creation of a high quality convex hull and a large scale transfer learning approach. We apply the resulting model to the high-throughput search of 15 million tetragonal perovskites of composition ABCD2. As a result, we identify several thousand potentially stable compounds and demonstrate that transfer learning from the newly curated dataset reduces the required training data by 50%.

  16. c

    Data from: General invariance and equilibrium conditions for lattice...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    bin, text/markdown +2
    Updated Sep 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Changpeng Lin; Samuel Poncé; Nicola Marzari; Changpeng Lin; Samuel Poncé; Nicola Marzari (2022). General invariance and equilibrium conditions for lattice dynamics in 1D, 2D, and 3D materials [Dataset]. http://doi.org/10.24435/materialscloud:gf-3n
    Explore at:
    bin, zip, txt, text/markdownAvailable download formats
    Dataset updated
    Sep 12, 2022
    Dataset provided by
    Materials Cloud
    Authors
    Changpeng Lin; Samuel Poncé; Nicola Marzari; Changpeng Lin; Samuel Poncé; Nicola Marzari
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The long-wavelength behavior of vibrational modes plays a central role in carrier transport, phonon-assisted optical properties, superconductivity, and thermomechanical and thermoelectric properties of materials. Here, we present general invariance and equilibrium conditions of the lattice potential; these allow to recover the quadratic dispersions of flexural phonons in low-dimensional materials, in agreement with the phenomenological model for long-wavelength bending modes. We prove that for any low-dimensional material, the bending modes can have a purely out-of-plane polarization in the vacuum direction and a quadratic dispersion in the long-wavelength limit. In addition, we propose an effective approach to treat the invariance conditions in crystals with non-vanishing Born effective charges where the long-range dipole-dipole interactions induce a contribution to the stress tensor. Our approach has been successfully applied to the phonon dispersions of 158 two-dimensional materials, opening new avenues for the studies of phonon-mediated properties of low-dimensional materials. The dataset uploaded here contains an AiiDA database for new phonon dispersions of all 245 two-dimensional materials produced in this work and essential data for reproducing the main results of this work. These data include the modified q2r and matdyn code of Quantum ESPRESSO distribution, pseudopotentials used in this work, optimized crystal structures, interatomic force constants and phonon dispersions.

  17. m

    Data from: Large-scale machine-learning-assisted exploration of the whole...

    • archive.materialscloud.org
    bz2, text/markdown +2
    Updated Oct 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Schmidt; Noah Hoffmann; Hai-Chen Wang; Pedro Borlido; Pedro J. M.A. Carriço; Tiago F. T. Cerqueira; Silvana Botti; Miguel A. L. Marques; Jonathan Schmidt; Noah Hoffmann; Hai-Chen Wang; Pedro Borlido; Pedro J. M.A. Carriço; Tiago F. T. Cerqueira; Silvana Botti; Miguel A. L. Marques (2022). Large-scale machine-learning-assisted exploration of the whole materials space [Dataset]. http://doi.org/10.24435/materialscloud:m7-50
    Explore at:
    bz2, txt, text/markdown, text/x-pythonAvailable download formats
    Dataset updated
    Oct 4, 2022
    Dataset provided by
    Materials Cloud
    Authors
    Jonathan Schmidt; Noah Hoffmann; Hai-Chen Wang; Pedro Borlido; Pedro J. M.A. Carriço; Tiago F. T. Cerqueira; Silvana Botti; Miguel A. L. Marques; Jonathan Schmidt; Noah Hoffmann; Hai-Chen Wang; Pedro Borlido; Pedro J. M.A. Carriço; Tiago F. T. Cerqueira; Silvana Botti; Miguel A. L. Marques
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Crystal-graph attention networks have emerged recently as remarkable tools for the prediction of thermodynamic stability and materials properties from unrelaxed crystal structures. Previous networks trained on two million materials exhibited, however, strong biases originating from underrepresented chemical elements and structural prototypes in the available data. We tackled this issue computing additional data to provide better balance across both chemical and crystal-symmetry space. Crystal-graph networks trained with this new data show unprecedented generalization accuracy, and allow for reliable, accelerated exploration of the whole space of inorganic compounds. We applied this universal network to performed machine-learning assisted high-throughput materials searches including 2500 binary and ternary prototypes and spanning about 1 billion compounds. After validation using density-functional theory, we uncover in total 19512 additional materials on the convex hull of thermodynamic stability and around 150000 compounds with a distance of less than 50 meV/atom from the hull. Here we include the DCGAT-1, DCGAT-2, and DCGAT-3 datasets used in this work.

  18. c

    Data from: Data-driven studies of magnetic two-dimensional materials

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    csv, rtf +1
    Updated May 20, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Trevor David Rhone; Wei Chen; Shaan Desai; Amir Yacoby; Efthimios Kaxiras; Trevor David Rhone; Wei Chen; Shaan Desai; Amir Yacoby; Efthimios Kaxiras (2019). Data-driven studies of magnetic two-dimensional materials [Dataset]. http://doi.org/10.24435/materialscloud:2019.0020/v1
    Explore at:
    rtf, csv, text/markdownAvailable download formats
    Dataset updated
    May 20, 2019
    Dataset provided by
    Materials Cloud
    Authors
    Trevor David Rhone; Wei Chen; Shaan Desai; Amir Yacoby; Efthimios Kaxiras; Trevor David Rhone; Wei Chen; Shaan Desai; Amir Yacoby; Efthimios Kaxiras
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We use a data-driven approach to study the magnetic and thermodynamic properties of van der Waals (vdW) layered materials. We investigate monolayers of the form A2B2X6, based on the known material Cr2Ge2Te6, using density functional theory (DFT) calculations and determine their magnetic properties, such as magnetic order and magnetic moment. We also examine formation energies and use them as a proxy for chemical stability.

  19. c

    Data from: Accurate and efficient protocols for high-throughput...

    • materialscloud-archive-failover.cineca.it
    • archive.materialscloud.org
    bin, text/markdown +1
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel de Miranda Nascimento; Flaviano José dos Santos; Marnik Bercx; Giovanni Pizzi; Nicola Marzari; Gabriel de Miranda Nascimento; Flaviano José dos Santos; Marnik Bercx; Giovanni Pizzi; Nicola Marzari (2025). Accurate and efficient protocols for high-throughput first-principles materials simulations [Dataset]. http://doi.org/10.24435/materialscloud:nr-hq
    Explore at:
    bin, zip, text/markdownAvailable download formats
    Dataset updated
    Apr 17, 2025
    Dataset provided by
    Materials Cloud
    Authors
    Gabriel de Miranda Nascimento; Flaviano José dos Santos; Marnik Bercx; Giovanni Pizzi; Nicola Marzari; Gabriel de Miranda Nascimento; Flaviano José dos Santos; Marnik Bercx; Giovanni Pizzi; Nicola Marzari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A major challenge in first-principles high-throughput materials simulations is automating the selection of parameters used by simulation codes in a way that robustly ensures numerical precision and computational efficiency. Here, we propose a rigorous methodology to assess the quality of self-consistent DFT calculations with respect to smearing and k-point sampling across a wide range of crystalline materials. To achieve this, we develop criteria to reliably control average errors in total energies, forces, and other properties as a function of the desired computational efficiency, while consistently suppressing uncontrollable k-point sampling errors. Our results provide automated protocols for selecting optimized parameters based on different precision and efficiency tradeoffs. This archive contains all data related to the material structures and calculation workflows developed in this work.

  20. M

    Materials Testing Software Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Dec 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Materials Testing Software Report [Dataset]. https://www.archivemarketresearch.com/reports/materials-testing-software-566360
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Dec 17, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2026 - 2034
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Discover the booming Materials Testing Software market! Explore a projected $2.5B market in 2025 with an 8% CAGR through 2033, driven by cloud adoption and AI integration. Learn about key players, regional trends, and the future of materials testing in automotive, aerospace, and construction.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti; Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti (2025). Massive Atomic Diversity: a compact universal dataset for atomistic machine learning [Dataset]. http://doi.org/10.24435/materialscloud:vd-e8

Massive Atomic Diversity: a compact universal dataset for atomistic machine learning

Explore at:
bin, xyz, application/gzip, text/markdownAvailable download formats
Dataset updated
Jun 26, 2025
Dataset provided by
Materials Cloud
Authors
Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti; Arslan Mazitov; Sofiia Chorna; Guillaume Fraux; Marnik Bercx; Giovanni Pizzi; Sandip De; Michele Ceriotti
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The development of machine-learning models for atomic-scale simulations has benefitted tremendously from the large databases of materials and molecular properties computed in the past two decades using electronic-structure calculations. More recently, these databases have made it possible to train “universal” models that aim at making accurate predictions for arbitrary atomic geometries and compositions. The construction of many of these databases was however in itself aimed at materials discovery, and therefore targeted primarily to sample stable, or at least plausible, structures and to make the most accurate predictions for each compound – e.g. adjusting the calculation details to the material at hand. Here we introduce a dataset designed specifically to train models that can provide reasonable predictions for arbitrary structures, and that therefore follows a different philosophy. Starting from relatively small sets of stable structures, the dataset is built to contain “massive atomic diversity” (MAD) by aggressively distorting these configurations, with near-complete disregard for the stability of the resulting configurations. The electronic structure details, on the other hand, are chosen to maximize consistency rather than to obtain the most accurate prediction for
a given structure, or to minimize computational effort. The MAD dataset we present here, despite containing fewer than 100k structures, has already been shown to enable training universal interatomic potentials that are competitive with models trained on traditional datasets with two to three orders of magnitude more structures. We describe in detail the philosophy and details of the construction of the MAD dataset. We also introduce a low-dimensional structural latent space that allows us to compare it with other popular datasets, and that can also be used as a general-purpose materials cartography tool.

Search
Clear search
Close search
Google apps
Main menu