2 datasets found

f
DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional...
frontiersin.figshare.com
pdf
Updated Oct 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dhruv Gopalakrishnan; Luca Dellantonio; Antonio Di Pilato; Wahid Redjeb; Felice Pantaleo; Michele Mosca (2024). DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional datasets.pdf [Dataset]. http://doi.org/10.3389/frqst.2024.1462004.s001
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/frqst.2024.1462004.s001
Dataset updated
Oct 11, 2024
Dataset provided by
Frontiers
Authors
Dhruv Gopalakrishnan; Luca Dellantonio; Antonio Di Pilato; Wahid Redjeb; Felice Pantaleo; Michele Mosca
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Clustering algorithms are at the basis of several technological applications, and are fueling the development of rapidly evolving fields such as machine learning. In the recent past, however, it has become apparent that they face challenges stemming from datasets that span more spatial dimensions. In fact, the best-performing clustering algorithms scale linearly in the number of points, but quadratically with respect to the local density of points. In this work, we introduce qCLUE, a quantum clustering algorithm that scales linearly in both the number of points and their density. qCLUE is inspired by CLUE, an algorithm developed to address the challenging time and memory budgets of Event Reconstruction (ER) in future High-Energy Physics experiments. As such, qCLUE marries decades of development with the quadratic speedup provided by quantum computers. We numerically test qCLUE in several scenarios, demonstrating its effectiveness and proving it to be a promising route to handle complex data analysis tasks – especially in high-dimensional datasets with high densities of points.
S
The properties of chlorinated polycyclic aromatic hydrocarbons calculated at...
scidb.cn
Updated Dec 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dmitry Frolov; Igor Sedov (2024). The properties of chlorinated polycyclic aromatic hydrocarbons calculated at different levels of quantum chemical theory [Dataset]. http://doi.org/10.57760/sciencedb.18703
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.18703
Dataset updated
Dec 27, 2024
Dataset provided by
Science Data Bank
Authors
Dmitry Frolov; Igor Sedov
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The PACHQA dataset contains the results of quantum chemical calculations for 3551 molecules comprising 3417 chlorinated polycyclic aromatic hydrocarbons (Cl-PAHs) with up to 6 rings and a different number of chlorine atoms in their structure together with 134 parent polycyclic aromatic hydrocarbons (PAHs). Cl-PAHs, the products of incomplete combustion of organic substances and materials, are hazardous pollutants with carcinogenic and mutagenic activity. Quantum chemistry methods are important to understand their formation mechanisms and properties. The large scale calculations at different levels of quantum chemical theory are useful for training the machine learning algorithms that aim to correct the values of properties obtained with computationally inexpensive methods to the accuracy of higher levels of theory.The computational procedure includes subsequent optimization in the MMFF94 force field, optimization and calculation of the vibrational frequencies and thermochemical properties with the semiempirical tight-binding GFN2-xTB method (this level is denoted as xtb2), optimization and calculation of the vibrational frequencies and thermochemical properties with the composite DFT method r2SCAN-3c (denoted as r2scan), and single-point energy calculations with the range-separated hybrid ωB97X-D4 functional and the def2-TZVP basis set (denoted as d4tzvp). The list of molecules and a number of their properties obtained at different theory levels are compiled in the props.csv file (3.8 MB). The complete list of data fields in props.csv is given in the annotation.pdf file (65 kB). The optimized geometries and more calculated properties which may be useful for machine learning tasks are available in PACHQA1-main.7z (geometries, xtb output, ORCA property reports, 183 MB) and PACHQA2-full_outfiles.7z (full ORCA output files, 343 MB) archives. The file PACHQA3-wfns.7z (57 GB) contains wavefunctions, electron densities, and xtb electrostatic potentials. All other files produced during calculations including the outputs of calculations that resulted in imaginary frequencies are collected in the PACHQA4-other.7z file (7 GB).
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Dhruv Gopalakrishnan; Luca Dellantonio; Antonio Di Pilato; Wahid Redjeb; Felice Pantaleo; Michele Mosca (2024). DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional datasets.pdf [Dataset]. http://doi.org/10.3389/frqst.2024.1462004.s001

DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional datasets.pdf

Explore at:

pdfAvailable download formats

Unique identifier

https://doi.org/10.3389/frqst.2024.1462004.s001

Dataset updated

Oct 11, 2024

Dataset provided by

Frontiers

Authors

Dhruv Gopalakrishnan; Luca Dellantonio; Antonio Di Pilato; Wahid Redjeb; Felice Pantaleo; Michele Mosca

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Clustering algorithms are at the basis of several technological applications, and are fueling the development of rapidly evolving fields such as machine learning. In the recent past, however, it has become apparent that they face challenges stemming from datasets that span more spatial dimensions. In fact, the best-performing clustering algorithms scale linearly in the number of points, but quadratically with respect to the local density of points. In this work, we introduce qCLUE, a quantum clustering algorithm that scales linearly in both the number of points and their density. qCLUE is inspired by CLUE, an algorithm developed to address the challenging time and memory budgets of Event Reconstruction (ER) in future High-Energy Physics experiments. As such, qCLUE marries decades of development with the quadratic speedup provided by quantum computers. We numerically test qCLUE in several scenarios, demonstrating its effectiveness and proving it to be a promising route to handle complex data analysis tasks – especially in high-dimensional datasets with high densities of points.

Clear search

Close search

Google apps

Main menu

DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional...

The properties of chlorinated polycyclic aromatic hydrocarbons calculated at...

DataSheet1_qCLUE: a quantum clustering algorithm for multi-dimensional datasets.pdf