7 datasets found

R
Aaai Dataset
universe.roboflow.com
zip
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AAAI (2023). Aaai Dataset [Dataset]. https://universe.roboflow.com/aaai-norvb/aaai-3aozj/dataset/3
Explore at:
zipAvailable download formats
Dataset updated
May 31, 2023
Dataset authored and provided by
AAAI
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
OPG Abnormality Bounding Boxes
Description
Here are a few use cases for this project:

Dental Health Monitoring: The model can be employed by dentists or dental health technologies to monitor the dental health of patients over time, assist in detecting abnormalities, and track the progress of dental treatments or interventions.

Dental Education: It can be used as an educational tool in dental schools, allowing students to practice identifying normal and abnormal OPG images and enhancing their diagnostic skills.

Dental Insurance Claims: Insurance companies can use the model to verify dental insurance claims, ensuring that billed procedures match the identified abnormalities in the OPG images.

Prevention in Oral Health Diseases: The model can be integrated into telemedicine applications, enabling routine check-ups and early detection of any oral health problem for patients, especially during pandemic-related lockdowns or for remote communities.

Research Studies: Researchers studying oral health trends and patterns can leverage this AI model to analyze large amounts of dental images, helping to establish correlations between various dental abnormalities and other health factors. They can also use it to evaluate the effectiveness of different dental procedures and treatments.
h
3DTime
huggingface.co
Updated Aug 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
3DTime Dataset (2024). 3DTime [Dataset]. https://huggingface.co/datasets/3DTimeDataset/3DTime
Explore at:
Dataset updated
Aug 9, 2024
Authors
3DTime Dataset
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
3DTime: A Large Dataset of Multi-Annotated Multivariate Time-Series for 3D-printing Duration

3DTime is a paper dataset currently under review at the AAAI-26 conference, for the main technical track. We could not find a way to anonymously publish the full 1.2 TB dataset, hence this smaller version.

Dataset content

The original dataset contains:

9,930 3D models, each sliced 4 times 39,720 annotated G-code files (compressed) 39,720 binary files A total of 12,442,224,222… See the full description on the dataset page: https://huggingface.co/datasets/3DTimeDataset/3DTime.
A web tracking data set of online browsing behavior of 2,148 users
zenodo.org
data.niaid.nih.gov
application/gzip, txt +1
Updated May 14, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Juhi Kulshrestha; Juhi Kulshrestha; Marcos Oliveira; Marcos Oliveira; Orkut Karacalik; Denis Bonnay; Claudia Wagner; Orkut Karacalik; Denis Bonnay; Claudia Wagner (2021). A web tracking data set of online browsing behavior of 2,148 users [Dataset]. http://doi.org/10.5281/zenodo.4757574
Explore at:
zip, txt, application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4757574
Dataset updated
May 14, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Juhi Kulshrestha; Juhi Kulshrestha; Marcos Oliveira; Marcos Oliveira; Orkut Karacalik; Denis Bonnay; Claudia Wagner; Orkut Karacalik; Denis Bonnay; Claudia Wagner
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
This anonymized data set consists of one month's (October 2018) web tracking data of 2,148 German users. For each user, the data contains the anonymized URL of the webpage the user visited, the domain of the webpage, category of the domain, which provides 41 distinct categories. In total, these 2,148 users made 9,151,243 URL visits, spanning 49,918 unique domains. For each user in our data set, we have self-reported information (collected via a survey) about their gender and age.

We acknowledge the support of Respondi AG, which provided the web tracking and survey data free of charge for research purposes, with special thanks to François Erner and Luc Kalaora at Respondi for their insights and help with data extraction.

The data set is analyzed in the following paper:

Kulshrestha, J., Oliveira, M., Karacalik, O., Bonnay, D., Wagner, C. "Web Routineness and Limits of Predictability: Investigating Demographic and Behavioral Differences Using Web Tracking Data." Proceedings of the International AAAI Conference on Web and Social Media. 2021. https://arxiv.org/abs/2012.15112.

The code used to analyze the data is also available at https://github.com/gesiscss/web_tracking.

If you use data or code from this repository, please cite the paper above and the Zenodo link.
f
Swallow and martin roosts detected on WSR in the Great Lakes region from...
figshare.com
csv
Updated Dec 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maria Belotti; Wenlong Zhao; Yuting Deng; Zezhou Cheng; Gustavo Perez; Victoria Simons; elske tielens; Subhransu Maji; Daniel Sheldon; Jeff Kelly; Kyle Horton (2024). Swallow and martin roosts detected on WSR in the Great Lakes region from 2000 to 2022 [Dataset]. http://doi.org/10.6084/m9.figshare.20137961.v2
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.20137961.v2
Dataset updated
Dec 9, 2024
Dataset provided by
figshare
Authors
Maria Belotti; Wenlong Zhao; Yuting Deng; Zezhou Cheng; Gustavo Perez; Victoria Simons; elske tielens; Subhransu Maji; Daniel Sheldon; Jeff Kelly; Kyle Horton
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
The Great Lakes
Description
This dataset contains the results the following data processing steps:1) Rendering of Weather Surveillance Radar data and using it as input to the machine learning model described in Cheng et al. (2020) that is capable of detecting and tracking swallow and martin roost signatures.2) Manual screening of the model's results, labeling each true positive track according to the type and degree of contamination of the roost by other sources of non-biological scattering.3) Grouping of overlapping tracks that can be considered as the same roost dispersal event.5) Calculation of number of birds per detection, assuming a Purple Martin (Progne subis) radar cross-section (following the procedure proposed by Chilson et al 2012), and summarizing counts across all radar sweeps.These procedures have been described in detail in the following publication: Belotti, M.C.T.D., Deng, Y., Zhao, W., Simons, V.F., Cheng, Z., Perez, G., Tielens, E., Maji, S., Sheldon, D., Kelly, J.F. and Horton, K.G. (2023), Long-term analysis of persistence and size of swallow and martin roosts in the US Great Lakes. Remote Sens Ecol Conserv, 9: 469-482. https://doi.org/10.1002/rse2.323Code for the analysis of these data can be found here: https://gitlab.com/mariabelotti/prjct_number_of_birdsReferences:Cheng, Z., Gabriel, S., Bhambhani, P., Sheldon, D., Maji, S., Laughlin, A., & Winkler, D. (2020). Detecting and Tracking Communal Bird Roosts in Weather Radar Data. Proceedings of the AAAI Conference on Artificial Intelligence, 34(01), 378–385. https://doi.org/10.1609/aaai.v34i01.5373Chilson, P. B., W. F. Frick, P. M. Stepanian, J. R. Shipley, T. H. Kunz, and J. F. Kelly. (2012). Estimating animal densities in the aerosphere using weather radar: To Z or not to Z? Ecosphere, 3(8):72. http://dx.doi.org/10.1890/ES12-00027.1
Data from: VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19...
zenodo.org
csv
Updated Feb 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yida Mu; Mali Jin; Charlie Grimshaw; Carolina Scarton; Kalina Bontcheva; Xingyi Song; Yida Mu; Mali Jin; Charlie Grimshaw; Carolina Scarton; Kalina Bontcheva; Xingyi Song (2024). VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter [Dataset]. http://doi.org/10.5281/zenodo.7601328
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7601328
Dataset updated
Feb 23, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Yida Mu; Mali Jin; Charlie Grimshaw; Carolina Scarton; Kalina Bontcheva; Xingyi Song; Yida Mu; Mali Jin; Charlie Grimshaw; Carolina Scarton; Kalina Bontcheva; Xingyi Song
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or irrelevant.

***

Please use the V2 version.

***

We split our dataset into two separate files:

(1) VaccineHesitancy_train_v2.csv (Single + Double annotated)

(2) VaccineHesitancy_test.csv (Double annotated)

We present the details of this dataset here:

VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter (ICWSM 2023)

Our Pre-trained model (GateNLP/covid-vaccine-twitter-bert) : https://huggingface.co/GateNLP/covid-vaccine-twitter-bert

Paper: https://ojs.aaai.org/index.php/ICWSM/article/view/22213/21992

@inproceedings{mu2023vaxxhesitancy, title={VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter}, author={Mu, Yida and Jin, Mali and Grimshaw, Charlie and Scarton, Carolina and Bontcheva, Kalina and Song, Xingyi}, booktitle={Proceedings of the International AAAI Conference on Web and Social Media}, volume={17}, pages={1052--1062}, year={2023} }
h
HarmEval
huggingface.co
Updated Jan 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SoftMiner Group (2025). HarmEval [Dataset]. https://huggingface.co/datasets/SoftMINER-Group/HarmEval
Explore at:
Dataset updated
Jan 8, 2025
Dataset authored and provided by
SoftMiner Group
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
🚀 SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

🎉 Accepted at AAAI-2025 (Long Paper) — Alignment Track

👉 Code
We developed HarmEval, a dataset based on prohibited scenarios listed in OpenAI and Meta’s usage policies. HarmEval categorizes risks into 11 main categories, resulting in approximately ∼550 crafted harmful queries. We employed a two-step verification process for these queries. First, we used GPT-4 to classify the… See the full description on the dataset page: https://huggingface.co/datasets/SoftMINER-Group/HarmEval.
O
Lakh Pianoroll Dataset
opendatalab.com
data.niaid.nih.gov
+1more
zip
Updated Mar 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Tsing Hua University (2023). Lakh Pianoroll Dataset [Dataset]. https://opendatalab.com/OpenDataLab/Lakh_Pianoroll_Dataset
Explore at:
zipAvailable download formats
Dataset updated
Mar 24, 2023
Dataset provided by
Academia Sinica, Taiwan
National Tsing Hua University
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Lakh Pianoroll Dataset (LPD) is a collection of 174,154 multitrack pianorolls derived from the Lakh MIDI Dataset (LMD). Getting the dataset We provide multiple subsets and versions of the dataset (see here). The dataset is available here. Using LPD The multitrack pianorolls in LPD are stored in a special format for efficient I/O and to save space. We recommend to load the data with Pypianoroll (The dataset is created using Pypianoroll v0.3.0.). See here to learn how the data is stored and how to load the data properly. License Lakh Pianoroll Dataset is a derivative of Lakh MIDI Dataset by Colin Raffel, used under CC BY 4.0. Lakh Pianoroll Dataset is licensed under CC BY 4.0 by Hao-Wen Dong and Wen-Yi Hsiao. Please cite the following papers if you use Lakh Pianoroll Dataset in a published work. Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, and Yi-Hsuan Yang, "MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment," in Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI), 2018. Colin Raffel, "Learning-Based Methods for Comparing Sequences, with Applications to Audio-to-MIDI Alignment and Matching," PhD Thesis, 2016. Related projects MuseGAN LeadSheetGAN
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

AAAI (2023). Aaai Dataset [Dataset]. https://universe.roboflow.com/aaai-norvb/aaai-3aozj/dataset/3

Aaai Dataset

aaai-3aozj

aaai-dataset

Explore at:

zipAvailable download formats

Dataset updated

May 31, 2023

Dataset authored and provided by

AAAI

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Variables measured

OPG Abnormality Bounding Boxes

Description

Here are a few use cases for this project:

Dental Health Monitoring: The model can be employed by dentists or dental health technologies to monitor the dental health of patients over time, assist in detecting abnormalities, and track the progress of dental treatments or interventions.
Dental Education: It can be used as an educational tool in dental schools, allowing students to practice identifying normal and abnormal OPG images and enhancing their diagnostic skills.
Dental Insurance Claims: Insurance companies can use the model to verify dental insurance claims, ensuring that billed procedures match the identified abnormalities in the OPG images.
Prevention in Oral Health Diseases: The model can be integrated into telemedicine applications, enabling routine check-ups and early detection of any oral health problem for patients, especially during pandemic-related lockdowns or for remote communities.
Research Studies: Researchers studying oral health trends and patterns can leverage this AI model to analyze large amounts of dental images, helping to establish correlations between various dental abnormalities and other health factors. They can also use it to evaluate the effectiveness of different dental procedures and treatments.

Clear search

Close search

Google apps

Main menu

Aaai Dataset

3DTime

A web tracking data set of online browsing behavior of 2,148 users

Swallow and martin roosts detected on WSR in the Great Lakes region from...

Data from: VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19...

HarmEval

Lakh Pianoroll Dataset

Aaai Dataset

aaai-3aozj

aaai-dataset