Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The development of machine-learning models for atomic-scale simulations has benefitted tremendously from the large databases of materials and molecular properties computed in the past two decades using electronic-structure calculations. More recently, these databases have made it possible to train “universal” models that aim at making accurate predictions for arbitrary atomic geometries and compositions. The construction of many of these databases was however in itself aimed at materials discovery, and therefore targeted primarily to sample stable, or at least plausible, structures and to make the most accurate predictions for each compound – e.g. adjusting the calculation details to the material at hand. Here we introduce a dataset designed specifically to train models that can provide reasonable predictions for arbitrary structures, and that therefore follows a different philosophy. Starting from relatively small sets of stable structures, the dataset is built to contain “massive atomic diversity” (MAD) by aggressively distorting these configurations, with near-complete disregard for the stability of the resulting configurations. The electronic structure details, on the other hand, are chosen to maximize consistency rather than to obtain the most accurate prediction for
a given structure, or to minimize computational effort. The MAD dataset we present here, despite containing fewer than 100k structures, has already been shown to enable training universal interatomic potentials that are competitive with models trained on traditional datasets with two to three orders of magnitude more structures. We describe in detail the philosophy and details of the construction of the MAD dataset. We also introduce a low-dimensional structural latent space that allows us to compare it with other popular datasets, and that can also be used as a general-purpose materials cartography tool.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Two-dimensional (2D) materials are among the most promising candidates for beyond silicon electronic and optoelectronic applications. Recently, their recognized importance, sparked a race to discover and characterize new 2D materials. Within few years the number of experimentally exfoliated or synthesized 2D materials went from a couple of dozens to few hundreds while the number theoretically predicted compounds reached a few thousands. In 2018 we first contributed to this effort with the identification of 1825 compounds that are either easily (1036) or potentially (789) exfoliable from experimentally known 3D compounds. In the present work we report on the new materials recently added to the 2D-portfolio thanks to the extension of the screening to an additional experimental database (MPDS) as well as the most up-to-date versions of the two databases (ICSD and COD) used in our previous work. This expansion led to the discovery of an additional 1252 unique monolayers bringing the total to 3077 compounds and, notably, almost doubling the number of easily exfoliable materials (2004). Moreover, we optimized the structural properties of all the materials (regardless of their binding energy or number of atoms in the unit cell) as isolated mono-layer and explored their electronic band structure. This archive entry contains the database of 2D materials in particular it contains the structural parameters for all the 3077 structures of the global Material Cloud 2D database as extracted from their bulk 3D parent, 2710 optimized 2D structures and 2345 electronic band structure together with the provenance of all data and calculations as stored by AiiDA.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Materials Cloud three-dimensional database is a curated set of relaxed three-dimensional crystal structures based on raw CIF data taken from the external experimental databases MPDS, COD and ICSD. The raw CIF data have been imported, cleaned and parsed into a crystal structure; their ground-state has been computed using the SIRIUS-enabled pw.x code of the Quantum ESPRESSO distribution, and tight tolerance criteria for the calculations using the SSSP protocols.
This entire procedure is encoded into an AiiDA workflow which automates the process while keeping full data provenance. Here, since the original source data of the ICSD and MPDS databases are copyrighted, only the provenance of the final SCF calculation on the relaxed structures can be made publicly available.
The MC3D ID numbers come from a list of unique "parent" stoichiometric structures that has been created and curated from a collection of these experimental databases. Once a parent structure has been optimized using density-functional theory, it is made public and added to the online Discover section of the Materials Cloud (as mentioned, copyright might prevent publishing the original parent). Note that since not all structures have been calculated, some ID numbers are missing from the public version of the database. The full ID of each structure also contains as an appended modifier the functional that was used in the calculations. Since the ID number points to the same unique parent, mc3d-1234/pbe and mc3d-1234/pbesol have the same starting point, but have been then relaxed according to their respective functionals.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Functionalizing Silicon Nanowires (SiNWs) through covalent attachment of organic molecules offers diverse advantages, including surface passivation, introduction of new functionalities, and enhanced material performance in applications like electronic devices and biosensors. Given the wide range of available functional molecules, systematic large-scale screening is crucial. Therefore, we developed an automated computational workflow using Python scripts in conjunction with the AiiDa framework to explore structural configurations of functional molecules adsorbed onto silicon surfaces. This workflow generates multiple adhesion configurations corresponding to different binding orientations using surface and functional molecule structures as inputs.
This dataset contains data related to the structural optimization of molecules with single, double, and triple carbon-carbon bonds attached to the nanowire surface in various adhesion configurations. We describe the chemisorption on SiNWs using the slab models for the Si facets since our reference are samples with diameters of SiNWs around 50 nm, while the quantum confinement effects are important for diameters below 10 nm. For each configuration, structural characterization was conducted by calculating quantities including the bond distance between the two carbons closest to the surface and their respective bond angle relative to the z-axis, the carbon-silicon bond distance and its respective bond angle relative to the z-axis, along with the molecule's rotation angle in the xy plane. The values obtained are summarized in the main folder. The version v1 of dataset contains data related to the Si(111) surface and alkanes, alkenes, and alkynes with lengths from C2 to C10. The version v2 extends the dataset to moieties from C12 to C18. This version (v3) extends the dataset with new configurations for moieties from C2 to C18. The dataset will be extended to characterize the Si(110) surface of the nanowire. For each system the most stable configuration will be identified, and the analysis of the electronic properties will be conducted.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This work examines challenges associated with the accuracy of machine-learned force fields (MLFFs) for bulk solid and liquid phases of d-block elements. In exhaustive detail, we contrast the performance of force, energy, and stress predictions across the transition metals for two leading MLFF models: a kernel-based atomic cluster expansion method implemented using sparse Gaussian processes (FLARE), and an equivariant message-passing neural network (NequIP). Early transition metals present higher relative errors and are more difficult to learn relative to late platinum- and coinage-group elements, and this trend persists across model architectures. Trends in complexity of interatomic interactions for different metals are revealed via comparison of the performance of representations with different many-body order and angular resolution. Using arguments based on perturbation theory on the occupied and unoccupied d states near the Fermi level, we determine that the large, sharp d density of states both above and below the Fermi level in early transition metals leads to a more complex, harder-to-learn potential energy surface for these metals. Increasing the fictitious electronic temperature (smearing) modifies the angular sensitivity of forces and makes the early transition metal forces easier to learn. This work illustrates challenges in capturing intricate properties of metallic bonding with current leading MLFFs and provides a reference data set for transition metals, aimed at benchmarking the accuracy and improving the development of emerging machine-learned approximations.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
TCSP 2.0 templte database, it includes the Materials Project (MP) database, Materials Cloud database (both 2D and 3D), The Computational 2D Materials Database (C2DB), and Graph Networks for Materials Science database(GNoME).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Crystal-graph attention networks have emerged recently as remarkable tools for the prediction of thermodynamic stability and materials properties from unrelaxed crystal structures. Previous networks trained on two million materials exhibited, however, strong biases originating from underrepresented chemical elements and structural prototypes in the available data. We tackled this issue computing additional data to provide better balance across both chemical and crystal-symmetry space. Crystal-graph networks trained with this new data show unprecedented generalization accuracy, and allow for reliable, accelerated exploration of the whole space of inorganic compounds. We applied this universal network to performed machine-learning assisted high-throughput materials searches including 2500 binary and ternary prototypes and spanning about 1 billion compounds. After validation using density-functional theory, we uncover in total 19512 additional materials on the convex hull of thermodynamic stability and around 150000 compounds with a distance of less than 50 meV/atom from the hull. Here we include the DCGAT-1, DCGAT-2, and DCGAT-3 datasets used in this work.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We have developed a polymer descriptor data set from the existing data. The data set has 188 descriptors which describe polymer atomic and molecular behavior. The descriptors are mapped to the specific heat of polymers using supervised and unsupervised machine learning approaches. The mapping helps predict the specific heat of polymers at room temperature. The descriptor data set is useful in synthesizing novel polymers with desired heat capacities.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Magnet/superconductor hybrid systems have been put forward as a platform for realizing topological superconductivity. We investigated the heterostructure of ferromagnetic monolayer CrCl₃ and superconducting NbSe₂. Using low-temperature scanning tunneling microscopy, we observe topologically trivial Yu-Shiba-Rusinov (YSR) states localized at the edge of CrCl₃ islands. DFT simulations reveal that the Cr atoms at the edge have an enhanced d-orbital DOS close to the Fermi energy. This leads to an exchange coupling between these atoms and the substrate that rationalizes the edge-localization of the YSR states.
This dataset contains the first-principles calculations performed on a nanoribbon of CrCl₃ on NbSe₂ and the associated notebooks used to generate figures from this data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Magnetic materials often exhibit complex energy landscapes with multiple local minima, each corresponding to a self-consistent electronic structure solution. Finding the global minimum is challenging, and heuristic methods are not always guaranteed to succeed. We apply an automated workflow to systematically explore the energy landscape of 194 magnetic monolayers from the Materials Cloud 2D crystals database and determine their ground-state magnetic order. Our approach enables effective control and sampling of orbital occupation matrices, allowing rapid identification of local minima. We reveal a diverse set of self-consistent collinear metastable states, further enriched by Hubbard-corrected energy functionals with U parameters computed from first principles using linear response theory. We categorize the monolayers by their magnetic ordering and highlight promising candidates for applications.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Earth-abundant spinel oxides are promising alkaline oxygen-reduction catalysts, yet mechanistic models still invoke a vacuum-DFT associative *OOH/*OO route. Here we combine >80,000 fully solvated joint-DFT calculations to map oxygen-reduction energetics across 442 Al-, Co-, Cr-, Fe-, Ga-, Mn-, Ni- and Zn-containing spinels on the (100) and (111) facets. Associative intermediates are >0.5 eV less stable than *OH/H states, revealing a four-step dissociative cycle in which surface-hydrogen passivation shuttles protons. These results establish solvated high-throughput DFT as a predictive lens on the kinetic oxygen-reduction limit of spinel oxides.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
We present a database of topological materials predicted from high-throughput first-principles calculations. The database contains electronic band structures and topological indices of 13628 materials calculated on experimental crystal structures taken from the Inorganic Crystal Structure Database (ICSD) and the Crystallography Open Database (COD). The calculations have been performed on non-magnetic phases taking into account the spin-orbit interactions using the Quantum ESPRESSO package. The Fu-Kane method and the Wannier charge center method implemented in the Z2pack code have been utilized to calculate the Z2 topological numbers of centrosymmetric and non-centrosymmetric materials, respectively. Over 4000 topologically non-trivial materials have been identified.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Polymers are versatile materials with a wide range of applications. The improvement of polymer properties rises the importance on the way that the repeating units are connected (head-to-tail,head-to-head,tail-to-tail) to build the polymer structure since it directly influences the morphology, chain topology and consequently its properties. Artificial intelligence (AI) based approaches are beginning to impact several domains of human life, science and technology. Polymer informatics is one such domain where AI and machine learning (ML) tools are being used in the efficient development, design and discovery of polymer. One key enabling factor for the essential foundations for Polymer Informatics is the machine-readable polymer representation. Polymer have been represented in a string format with special characters used to tag the head and tail positions indicating where the linking bond happens between repeat units. Available tools to assign the head and tail position limits its applicability in a broad sense. In this work we show a new tool to assign the head and tail atoms for a given monomer. From a database of 206 polymer precursors curated from the literature, our algorithm correctly predicted the class of 201 data points, which represents 97.6% of accuracy and regarding the the head and tail assignment, correctly assigned the positions for 188 data points, which translates to 91.3% of accuracy.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
20 minute lightning talk presentation given by Aliaksandr Yakutovich, from École Polyechnique Fédérale de Lausanne, at the Better Science through Better Data 2018 event. The video recording and scribe are included.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A major challenge in first-principles high-throughput materials simulations is automating the selection of parameters used by simulation codes in a way that robustly ensures numerical precision and computational efficiency. Here, we propose a rigorous methodology to assess the quality of self-consistent DFT calculations with respect to smearing and k-point sampling across a wide range of crystalline materials. To achieve this, we develop criteria to reliably control average errors in total energies, forces, and other properties as a function of the desired computational efficiency, while consistently suppressing uncontrollable k-point sampling errors. Our results provide automated protocols for selecting optimized parameters based on different precision and efficiency tradeoffs. This archive contains all data related to the material structures and calculation workflows developed in this work.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We use a data-driven approach to study the magnetic and thermodynamic properties of van der Waals (vdW) layered materials. We investigate monolayers of the form A2B2X6, based on the known material Cr2Ge2Te6, using density functional theory (DFT) calculations and determine their magnetic properties, such as magnetic order and magnetic moment. We also examine formation energies and use them as a proxy for chemical stability.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Understanding how the vibrational and thermal properties of solids are influenced by atomistic structural disorder is of fundamental scientific interest, and paramount to designing materials for next-generation energy technologies. While several studies indicate that structural disorder strongly influences the thermal conductivity, the fundamental physics governing the disorder-conductivity relation remains elusive. Here we show that order-of-magnitude, disorder-induced variations of conductivity in network solids can be predicted from a bond-network entropy, an atomistic structural descriptor that quantifies heterogeneity in the topology of the atomic-bond network. We employ the Wigner formulation of thermal transport to demonstrate the existence of a relation between the bond-network entropy, and observables such as smoothness of the vibrational density of states (VDOS) and macroscopic conductivity. We also show that the smoothing of the VDOS encodes information about the thermal resistance induced by disorder, and can be directly related to phenomenological models for phonon-disorder scattering based on the semiclassical Peierls-Boltzmann equation. Our findings rationalize the conductivity variations of disordered carbon polymorphs ranging from nanoporous electrodes to defective graphite used as a moderator in nuclear reactors. This database contains the models of structures reported in the paper referenced below.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
The long-wavelength behavior of vibrational modes plays a central role in carrier transport, phonon-assisted optical properties, superconductivity, and thermomechanical and thermoelectric properties of materials. Here, we present general invariance and equilibrium conditions of the lattice potential; these allow to recover the quadratic dispersions of flexural phonons in low-dimensional materials, in agreement with the phenomenological model for long-wavelength bending modes. We prove that for any low-dimensional material, the bending modes can have a purely out-of-plane polarization in the vacuum direction and a quadratic dispersion in the long-wavelength limit. In addition, we propose an effective approach to treat the invariance conditions in crystals with non-vanishing Born effective charges where the long-range dipole-dipole interactions induce a contribution to the stress tensor. Our approach has been successfully applied to the phonon dispersions of 158 two-dimensional materials, opening new avenues for the studies of phonon-mediated properties of low-dimensional materials. The dataset uploaded here contains an AiiDA database for new phonon dispersions of all 245 two-dimensional materials produced in this work and essential data for reproducing the main results of this work. These data include the modified q2r and matdyn code of Quantum ESPRESSO distribution, pseudopotentials used in this work, optimized crystal structures, interatomic force constants and phonon dispersions.
Facebook
Twitterhttps://www.mordorintelligence.com/privacy-policyhttps://www.mordorintelligence.com/privacy-policy
The Material Informatics Market Report is Segmented by Component (Software and Services), Deployment Mode (Cloud-Based and On-Premises), Application (Materials Discovery and Design, Formulation Optimization, Process Optimization and Scale-Up, and More), End-User Industry (Chemicals and Advanced Materials, Pharmaceuticals and Life Sciences, and More), and Geography. The Market Forecasts are Provided in Terms of Value (USD).
Facebook
Twitterhttps://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The cloud-based molecular modeling software market is booming, projected to reach $6.12 billion by 2033 with a 15% CAGR. Discover key trends, drivers, restraints, and leading companies shaping this dynamic sector. Explore applications in drug discovery, materials science, and more.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The development of machine-learning models for atomic-scale simulations has benefitted tremendously from the large databases of materials and molecular properties computed in the past two decades using electronic-structure calculations. More recently, these databases have made it possible to train “universal” models that aim at making accurate predictions for arbitrary atomic geometries and compositions. The construction of many of these databases was however in itself aimed at materials discovery, and therefore targeted primarily to sample stable, or at least plausible, structures and to make the most accurate predictions for each compound – e.g. adjusting the calculation details to the material at hand. Here we introduce a dataset designed specifically to train models that can provide reasonable predictions for arbitrary structures, and that therefore follows a different philosophy. Starting from relatively small sets of stable structures, the dataset is built to contain “massive atomic diversity” (MAD) by aggressively distorting these configurations, with near-complete disregard for the stability of the resulting configurations. The electronic structure details, on the other hand, are chosen to maximize consistency rather than to obtain the most accurate prediction for
a given structure, or to minimize computational effort. The MAD dataset we present here, despite containing fewer than 100k structures, has already been shown to enable training universal interatomic potentials that are competitive with models trained on traditional datasets with two to three orders of magnitude more structures. We describe in detail the philosophy and details of the construction of the MAD dataset. We also introduce a low-dimensional structural latent space that allows us to compare it with other popular datasets, and that can also be used as a general-purpose materials cartography tool.