100+ datasets found

h
full-modality-data
huggingface.co
Updated Aug 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LMMs-Lab (2025). full-modality-data [Dataset]. https://huggingface.co/datasets/lmms-lab/full-modality-data
Explore at:
Dataset updated
Aug 1, 2025
Dataset authored and provided by
LMMs-Lab
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Full Modality Dataset Statistics

Video Statistics

Total Videos: 28,472 Total Duration: 1422.33 hours Average Duration: 179.84 seconds Median Duration: 160.08 seconds Duration Range: 10.04s - 1780.03s

QA Statistics

Total Questions: 1,444,526 Average Questions per Video: 50.7 Questions per Video Range: 14 - 450

Question Type Distribution

OE: 1,444,526 (100.0%)

Question Category Distribution

temporal: 96,873 (6.7%) causal: 96,873… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/full-modality-data.
V
School Learning Modalities, 2021-2022
data.virginia.gov
datahub.hhs.gov
+5more
csv, json, rdf, xsl
Updated Jun 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2024). School Learning Modalities, 2021-2022 [Dataset]. https://data.virginia.gov/dataset/school-learning-modalities-2021-2022
Explore at:
json, csv, xsl, rdfAvailable download formats
Dataset updated
Jun 28, 2024
Dataset provided by
Centers for Disease Control and Prevention
Description
The 2021-2022 School Learning Modalities dataset provides weekly estimates of school learning modality (including in-person, remote, or hybrid learning) for U.S. K-12 public and independent charter school districts for the 2021-2022 school year and the Fall 2022 semester, from August 2021 – December 2022.
These data were modeled using multiple sources of input data (see below) to infer the most likely learning modality of a school district for a given week. These data should be considered district-level estimates and may not always reflect true learning modality, particularly for districts in which data are unavailable. If a district reports multiple modality types within the same week, the modality offered for the majority of those days is reflected in the weekly estimate. All school district metadata are sourced from the https://nces.ed.gov/ccd/files.asp#Fiscal:2,LevelId:5,SchoolYearId:35,Page:1">National Center for Educational Statistics (NCES) for 2020-2021.
School learning modality types are defined as follows:

In-Person: All schools within the district offer face-to-face instruction 5 days per week to all students at all available grade levels.

Remote: Schools within the district do not offer face-to-face instruction; all learning is conducted online/remotely to all students at all available grade levels.

Hybrid: Schools within the district offer a combination of in-person and remote learning; face-to-face instruction is offered less than 5 days per week, or only to a subset of students.

Data Information

School learning modality data provided here are model estimates using combined input data and are not guaranteed to be 100% accurate. This learning modality dataset was generated by combining data from four different sources: Burbio [1], MCH Strategic Data [2], the AEI/Return to Learn Tracker [3], and state dashboards [4-20]. These data were combined using a Hidden Markov model which infers the sequence of learning modalities (In-Person, Hybrid, or Remote) for each district that is most likely to produce the modalities reported by these sources. This model was trained using data from the 2020-2021 school year. Metadata describing the location, number of schools and number of students in each district comes from NCES [21].

You can read more about the model in the CDC MMWR: https://www.cdc.gov/mmwr/volumes/70/wr/mm7039e2.htm" target="_blank">COVID-19–Related School Closures and Learning Modality Changes — United States, August 1–September 17, 2021.

The metrics listed for each school learning modality reflect totals by district and the number of enrolled students per district for which data are available. School districts represented here exclude private schools and include the following NCES subtypes:

Public school district that is NOT a component of a supervisory union

Public school district that is a component of a supervisory union

Independent charter district

“BI” in the state column refers to school districts funded by the Bureau of Indian Education.

Technical Notes

Data from August 1, 2021 to June 24, 2022 correspond to the 2021-2022 school year. During this time frame, data from the AEI/Return to Learn Tracker and most state dashboards were not available. Inferred modalities with a probability below 0.6 were deemed inconclusive and were omitted. During the Fall 2022 semester, modalities for districts with a school closure reported by Burbio were updated to either “Remote”, if the closure spanned the entire week, or “Hybrid”, if the closure spanned 1-4 days of the week.

Data from August
f
VGG16 Metrics, Average AUC = 0.9987707721217087.
plos.figshare.com
xls
Updated Dec 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Craig Macfadyen; Ajay Duraiswamy; David Harris-Birtill (2023). VGG16 Metrics, Average AUC = 0.9987707721217087. [Dataset]. http://doi.org/10.1371/journal.pdig.0000191.t005
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pdig.0000191.t005
Dataset updated
Dec 13, 2023
Dataset provided by
PLOS Digital Health
Authors
Craig Macfadyen; Ajay Duraiswamy; David Harris-Birtill
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Algorithms that classify hyper-scale multi-modal datasets, comprising of millions of images, into constituent modality types can help researchers quickly retrieve and classify diagnostic imaging data, accelerating clinical outcomes. This research aims to demonstrate that a deep neural network that is trained on a hyper-scale dataset (4.5 million images) composed of heterogeneous multi-modal data can be used to obtain significant modality classification accuracy (96%). By combining 102 medical imaging datasets, a dataset of 4.5 million images was created. A ResNet-50, ResNet-18, and VGG16 were trained to classify these images by the imaging modality used to capture them (Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Positron Emission Tomography (PET), and X-ray) across many body locations. The classification accuracy of the models was then tested on unseen data. The best performing model achieved classification accuracy of 96% on unseen data, which is on-par, or exceeds the accuracy of more complex implementations using EfficientNets or Vision Transformers (ViTs). The model achieved a balanced accuracy of 86%. This research shows it is possible to train Deep Learning (DL) Convolutional Neural Networks (CNNs) with hyper-scale multimodal datasets, composed of millions of images. Such models can find use in real-world applications with volumes of image data in the hyper-scale range, such as medical imaging repositories, or national healthcare institutions. Further research can expand this classification capability to include 3D-scans.
Modality-based Multitasking, practice data
openneuro.org
Updated Mar 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marie Mueckstein; Kai Görgen; Stephan Heinzel; Urs Granacher; A. Michael Rapp; Christine Stelzel (2024). Modality-based Multitasking, practice data [Dataset]. http://doi.org/10.18112/openneuro.ds005038.v1.0.0
Explore at:
Unique identifier
https://doi.org/10.18112/openneuro.ds005038.v1.0.0
Dataset updated
Mar 19, 2024
Dataset provided by
OpenNeurohttps://openneuro.org/
Authors
Marie Mueckstein; Kai Görgen; Stephan Heinzel; Urs Granacher; A. Michael Rapp; Christine Stelzel
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset contsin the raw fMRI data of a preregistered study. Dataset includes:

session pre 1. anat/ anatomical scans (T1-weighted images) for each subject 2. func/ whole-brain EPI data from all task runs (8x single task, 2x dual task, 1x resting state and 2x localizer task) 3. fmap/ fieldmaps with magnitude1, magnitude2 and phasediff

session post 2. func/ whole-brain EPI data from all task runs (8x single task, 2x dual task) 3. fmap/ fieldmaps with magnitude1, magnitude2 and phasediff

Please note, some participants did not complete the post session. We updated our consent form to get explicit permission to publish the individual data, although not all participants resigned the new version. Those participants are excluded here but part of the t-maps on neurovault (compare participants.tsv).

Tasks were always included either visual or/and auditory input and required either manual or/and vocal responses (visual+manual and auditory+vocal are modality compatible and visual+vocal and auditory+manual are modality incompatible). Tasks were presented as either single task, or dual task. Participants completed a practice intervention prior to session post in which one group worked for 80 minutes outside the scanner on modality incompatible dual-tasks, one on modality compatible dual-task and the third one paused for 80 min.

For exact tasks description and material and scripts, please see the preregistration: https://osf.io/whpz8
S
Data from: Typical Concept-Driven Modality-missing Deep Cross-Modal...
scidb.cn
Updated Apr 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xia Xinyu; Zhu Lei; Nie Xiushan; Dong Guohua; Zhang Huaxiang (2025). Typical Concept-Driven Modality-missing Deep Cross-Modal Retrieval [Dataset]. http://doi.org/10.57760/sciencedb.24176
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.24176
Dataset updated
Apr 24, 2025
Dataset provided by
Science Data Bank
Authors
Xia Xinyu; Zhu Lei; Nie Xiushan; Dong Guohua; Zhang Huaxiang
License
https://api.github.com/licenses/mithttps://api.github.com/licenses/mit
Description
Cross-modal retrieval takes one modality data as a query and retrieves semantically relevant data in another modality. Most existing cross-modal retrieval methods are designed for scenarios with complete modality data. However, in real-world applications, incomplete modality data often exists, which these methods struggle to handle effectively. In this paper, we propose a typical concept-driven modality-missing deep cross-modal retrieval model. Specifically, we first propose a multi-modal Transformer integrated with multi-modal pretraining networks, which can fully capture the multi-modal fine-grained semantic interaction in the incomplete modality data, extract multi-modal fusion semantics and construct cross-modal subspace, and at the same time supervise the learning process to generate typical concepts. In addition, the typical concepts are used as the cross-attention key and value to drive the training of the modal mapping network, so that it can adaptively preserve the implicit multi-modal semantic concepts of the query modality data, generate cross-modal retrieval features, and fully preserve the pre-extracted multi-modal fusion semantics. More information about the source code: https://gitee.com/MrSummer123/CPCMR
f
Learning modalities reported by source.
plos.figshare.com
xls
Updated Oct 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mark J. Panaggio; Mike Fang; Hyunseung Bang; Paige A. Armstrong; Alison M. Binder; Julian E. Grass; Jake Magid; Marc Papazian; Carrie K. Shapiro-Mendoza; Sharyn E. Parks (2023). Learning modalities reported by source. [Dataset]. http://doi.org/10.1371/journal.pone.0292354.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0292354.t001
Dataset updated
Oct 4, 2023
Dataset provided by
PLOS ONE
Authors
Mark J. Panaggio; Mike Fang; Hyunseung Bang; Paige A. Armstrong; Alison M. Binder; Julian E. Grass; Jake Magid; Marc Papazian; Carrie K. Shapiro-Mendoza; Sharyn E. Parks
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
During the COVID-19 pandemic, many public schools across the United States shifted from fully in-person learning to alternative learning modalities such as hybrid and fully remote learning. In this study, data from 14,688 unique school districts from August 2020 to June 2021 were collected to track changes in the proportion of schools offering fully in-person, hybrid and fully remote learning over time. These data were provided by Burbio, MCH Strategic Data, the American Enterprise Institute’s Return to Learn Tracker and individual state dashboards. Because the modalities reported by these sources were incomplete and occasionally misaligned, a model was needed to combine and deconflict these data to provide a more comprehensive description of modalities nationwide. A hidden Markov model (HMM) was used to infer the most likely learning modality for each district on a weekly basis. This method yielded higher spatiotemporal coverage than any individual data source and higher agreement with three of the four data sources than any other single source. The model output revealed that the percentage of districts offering fully in-person learning rose from 40.3% in September 2020 to 54.7% in June of 2021 with increases across 45 states and in both urban and rural districts. This type of probabilistic model can serve as a tool for fusion of incomplete and contradictory data sources in order to obtain more reliable data in support of public health surveillance and research efforts.
d
School Learning Modalities, 2020-2021
catalog.data.gov
data.virginia.gov
+4more
Updated Mar 26, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). School Learning Modalities, 2020-2021 [Dataset]. https://catalog.data.gov/dataset/school-learning-modalities-2020-2021
Explore at:
Dataset updated
Mar 26, 2025
Dataset provided by
Centers for Disease Control and Prevention
Description
The 2020-2021 School Learning Modalities dataset provides weekly estimates of school learning modality (including in-person, remote, or hybrid learning) for U.S. K-12 public and independent charter school districts for the 2020-2021 school year, from August 2020 – June 2021. These data were modeled using multiple sources of input data (see below) to infer the most likely learning modality of a school district for a given week. These data should be considered district-level estimates and may not always reflect true learning modality, particularly for districts in which data are unavailable. If a district reports multiple modality types within the same week, the modality offered for the majority of those days is reflected in the weekly estimate. All school district metadata are sourced from the National Center for Educational Statistics (NCES) for 2020-2021. School learning modality types are defined as follows: In-Person: All schools within the district offer face-to-face instruction 5 days per week to all students at all available grade levels. Remote: Schools within the district do not offer face-to-face instruction; all learning is conducted online/remotely to all students at all available grade levels. Hybrid: Schools within the district offer a combination of in-person and remote learning; face-to-face instruction is offered less than 5 days per week, or only to a subset of students. Data Information School learning modality data provided here are model estimates using combined input data and are not guaranteed to be 100% accurate. This learning modality dataset was generated by combining data from four different sources: Burbio [1], MCH Strategic Data [2], the AEI/Return to Learn Tracker [3], and state dashboards [4-20]. These data were combined using a Hidden Markov model which infers the sequence of learning modalities (In-Person, Hybrid, or Remote) for each district that is most likely to produce the modalities reported by these sources. This model was trained using data from the 2020-2021 school year. Metadata describing the location, number of schools and number of students in each district comes from NCES [21]. You can read more about the model in the CDC MMWR: COVID-19–Related School Closures and Learning Modality Changes — United States, August 1–September 17, 2021. The metrics listed for each school learning modality reflect totals by district and the number of enrolled students per district for which data are available. School districts represented here exclude private schools and include the following NCES subtypes: Public school district that is NOT a component of a supervisory union Public school district that is a component of a supervisory union Independent charter district “BI” in the state column refers to school districts funded by the Bureau of Indian Education. Technical Notes Data from September 1, 2020 to June 25, 2021 correspond to the 2020-2021 school year. During this timeframe, all four sources of data were available. Inferred modalities with a probability below 0.75 were deemed inconclusive and were omitted. Data for the month of July may show “In Person” status although most school districts are effectively closed during this time for summer break. Users may wish to exclude July data from use for this reason where applicable. Sources K-12 School Opening Tracker. Burbio 2021; https
V
Student Enrollment and Attendance Data by Teaching Modality - 2020 - 2021
odgavaprod.ogopendata.com
opendata.winchesterva.gov
xlsx
Updated Jul 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Education (2024). Student Enrollment and Attendance Data by Teaching Modality - 2020 - 2021 [Dataset]. https://odgavaprod.ogopendata.com/dataset/student-enrollment-and-attendance-data-by-teaching-modality-2020
Explore at:
xlsx(393380)Available download formats
Dataset updated
Jul 23, 2024
Dataset authored and provided by
Department of Education
Description
Student enrollment data disaggregated by students from low-income families, students from each racial and ethnic group, gender, English learners, children with disabilities, children experiencing homelessness, children in foster care, and migratory students for each mode of instruction.
Multi-modality medical image dataset for medical image processing in Python...
zenodo.org
zip
Updated Aug 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Candace Moore; Candace Moore; Giulia Crocioni; Giulia Crocioni (2024). Multi-modality medical image dataset for medical image processing in Python lesson [Dataset]. http://doi.org/10.5281/zenodo.13305760
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.13305760
Dataset updated
Aug 12, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Candace Moore; Candace Moore; Giulia Crocioni; Giulia Crocioni
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains a collection of medical imaging files for use in the "Medical Image Processing with Python" lesson, developed by the Netherlands eScience Center.

The dataset includes:

SimpleITK compatible files: MRI T1 and CT scans (training_001_mr_T1.mha, training_001_ct.mha), digital X-ray (digital_xray.dcm in DICOM format), neuroimaging data (A1_grayT1.nrrd, A1_grayT2.nrrd). Data have been downloaded from here.

MRI data: a T2-weighted image (OBJECT_phantom_T2W_TSE_Cor_14_1.nii in NIfTI-1 format). Data have been downloaded from here.

Example images for the machine learning lesson: chest X-rays (rotatechest.png, other_op.png), cardiomegaly example (cardiomegaly_cc0.png).

Additional anonymized data: TBA

These files represent various medical imaging modalities and formats commonly used in clinical research and practice. They are intended for educational purposes, allowing students to practice image processing techniques, machine learning applications, and statistical analysis of medical images using Python libraries such as scikit-image, pydicom, and SimpleITK.
D
Data from: Towards Cross-Modality Modeling for Time Series Analytics: A...
researchdata.ntu.edu.sg
Updated May 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DR-NTU (Data) (2025). Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era [Dataset]. http://doi.org/10.21979/N9/I0HOYZ
Explore at:
Unique identifier
https://doi.org/10.21979/N9/I0HOYZ
Dataset updated
May 13, 2025
Dataset provided by
DR-NTU (Data)
License
https://researchdata.ntu.edu.sg/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.21979/N9/I0HOYZhttps://researchdata.ntu.edu.sg/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.21979/N9/I0HOYZ
Description
The proliferation of edge devices has generated an unprecedented volume of time series data across different domains, motivating various well-customized methods. Recently, Large Language Models (LLMs) have emerged as a new paradigm for time series analytics by leveraging the shared sequential nature of textual data and time series. However, a fundamental cross-modality gap between time series and LLMs exists, as LLMs are pre-trained on textual corpora and are not inherently optimized for time series. Many recent proposals are designed to address this issue. In this survey, we provide an up-to-date overview of LLMs-based cross-modality modeling for time series analytics. We first introduce a taxonomy that classifies existing approaches into four groups based on the type of textual data employed for time series modeling. We then summarize key cross-modality strategies, e.g., alignment and fusion, and discuss their applications across a range of downstream tasks. Furthermore, we conduct experiments on multimodal datasets from different application domains to investigate effective combinations of textual data and cross-modality strategies for enhancing time series analytics. Finally, we suggest several promising directions for future research. This survey is designed for a range of professionals, researchers, and practitioners interested in LLM-based time series modeling.
Multi-Modal Imaging Data-Integration Market Research Report 2033
growthmarketreports.com
csv, pdf, pptx
Updated Jul 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Growth Market Reports (2025). Multi-Modal Imaging Data-Integration Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/multi-modal-imaging-data-integration-market
Explore at:
pptx, csv, pdfAvailable download formats
Dataset updated
Jul 5, 2025
Dataset authored and provided by
Growth Market Reports
Time period covered
2024 - 2032
Area covered
Global
Description
Multi-Modal Imaging Data-Integration Market Outlook

According to our latest research, the global Multi-Modal Imaging Data-Integration market size reached USD 1.67 billion in 2024. The market is expected to expand at a robust CAGR of 9.5% during the forecast period, reaching a projected value of USD 3.78 billion by 2033. This impressive growth is driven by the increasing demand for integrated imaging solutions in clinical diagnostics and research, as well as technological advancements in imaging modalities and data analytics platforms. As per our detailed analysis, the integration of multiple imaging modalities is revolutionizing the way healthcare professionals diagnose and treat complex diseases, offering comprehensive insights that single-modality imaging cannot provide.

One of the primary growth factors propelling the Multi-Modal Imaging Data-Integration market is the rising prevalence of chronic diseases such as cancer, cardiovascular disorders, and neurological conditions. These diseases often require precise and multifaceted diagnostic approaches, which multi-modal imaging excels at delivering. By combining data from modalities like MRI, CT, PET, and ultrasound, clinicians can achieve a more holistic view of patient pathology, leading to improved treatment planning and patient outcomes. Moreover, the increasing adoption of personalized medicine is further driving the need for integrated imaging data, as tailored therapeutic strategies rely heavily on accurate, multi-dimensional diagnostic information.

Another significant driver is the rapid technological evolution in both imaging hardware and software. Innovations such as artificial intelligence (AI) and machine learning are enabling more effective integration and interpretation of complex imaging datasets. Advanced integration techniques, including software-based and hybrid solutions, are making it feasible to seamlessly combine anatomical, functional, and molecular information from various imaging platforms. This technological leap is not only enhancing diagnostic precision but also reducing the time and cost associated with traditional, single-modality imaging workflows. The ongoing investment in research and development by both public and private sectors is ensuring a steady pipeline of improvements in multi-modal imaging data-integration.

The growing adoption of digital health solutions, including cloud-based imaging data repositories and telemedicine platforms, is also contributing to market expansion. Healthcare institutions are increasingly recognizing the value of integrated imaging data in facilitating remote consultations, multidisciplinary team discussions, and collaborative research. The shift toward value-based care models emphasizes outcomes and efficiency, making multi-modal data-integration an attractive proposition for hospitals, diagnostic centers, and research institutes. Additionally, regulatory support for interoperability and data standardization is gradually lowering barriers to adoption, fostering a more conducive environment for market growth.

From a regional perspective, North America continues to dominate the Multi-Modal Imaging Data-Integration market, accounting for the largest revenue share in 2024. This leadership is attributed to the region’s advanced healthcare infrastructure, high adoption rates of cutting-edge imaging technologies, and significant investments in healthcare IT. Europe follows closely, benefiting from robust government initiatives and a strong focus on research collaborations. The Asia Pacific region is emerging as the fastest-growing market, driven by expanding healthcare access, rising investments in medical technology, and an increasing burden of chronic diseases. Latin America and the Middle East & Africa, while currently holding smaller shares, are expected to witness steady growth due to improving healthcare systems and rising awareness of integrated imaging benefits.

Imaging Modality Analysis

The Imaging Modality segment forms the b
S
Data from: CMShipReID: A Cross-Modality Ship Dataset for the...
scidb.cn
Updated Apr 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xu Congan; Gao Long; Liu Yu; Zhang Qi; Su Nan; Zhang Shaoxuan; Li Tianyu; Zheng Xiaomei (2025). CMShipReID: A Cross-Modality Ship Dataset for the Re-IDentification Task [Dataset]. http://doi.org/10.57760/sciencedb.radars.00051
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.radars.00051
Dataset updated
Apr 29, 2025
Dataset provided by
Science Data Bank
Authors
Xu Congan; Gao Long; Liu Yu; Zhang Qi; Su Nan; Zhang Shaoxuan; Li Tianyu; Zheng Xiaomei
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Image-based ship target analysis is an important task in the field of ship monitoring. Previous studies have achieved remarkable results in ship detection and recognition tasks. However, these related studies mainly rely on unimodal datasets, and there is still no publicly available ship individual re-identification dataset released, which restricts the research in the field of cross-modal individual re-identification of ship targets. To address this issue, we have constructed the first cross-modal ship re-identification dataset, CMShipReID. This dataset contains data from three modalities, namely visible light, near-infrared, and thermal infrared, which are collected by drones. It covers 10 categories, approximately 138 individual ships, and 8,337 images, thus providing data support for the research on cross-modal individual re-identification of ships. We have tested the mainstream re-identification algorithms as the performance benchmark for this dataset, which can serve as a fundamental reference for relevant scholars.
v
Global Multimodal AI Market Size By Offering (Solutions, Services), By Data...
verifiedmarketresearch.com
Updated Feb 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
VERIFIED MARKET RESEARCH (2025). Global Multimodal AI Market Size By Offering (Solutions, Services), By Data Modality (Image, Audio), By Technology (ML, NLP, Computer Vision, Context Awareness), By Geographic Scope and Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/multimodal-ai-market/
Explore at:
Dataset updated
Feb 26, 2025
Dataset authored and provided by
VERIFIED MARKET RESEARCH
License
https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/
Area covered
Global
Description
Multimodal AI Market size was valued at USD 1.74 Billion in 2024 and is projected to reach USD 15.89 Billion by 2032, growing at a CAGR of 4.8% from 2026 to 2032.

Multimodal AI Market Drivers

Enhanced Understanding of Complex Data: Multimodal AI allows systems to process and integrate information from various sources (text, images, audio, video), leading to a richer understanding of complex data and real-world scenarios. Improved Human-Computer Interaction: By understanding multiple modalities, AI systems can interact with humans in a more natural and intuitive way, mimicking human communication. Increased Accuracy and Robustness: Combining information from different modalities can improve the accuracy and robustness of AI models, especially in noisy or ambiguous environments. Expansion of AI Applications: Multimodal AI is enabling new applications in various fields, including: Healthcare (e.g., diagnosis from medical images and patient records) Education (e.g., personalized learning with interactive multimedia) Entertainment (e.g., immersive virtual reality experiences) Retail (e.g., enhanced product search and recommendations) Autonomous vehicles (combining visual and lidar data) Advancements in Deep Learning: Deep learning techniques, particularly transformer models, have proven highly effective in processing and integrating multimodal data. Availability of Large Multimodal Datasets: The increasing availability of large, diverse datasets containing information from multiple modalities is fueling the development of multimodal AI models. Growing Demand for Contextual AI: Multimodal AI allows AI to gain a much better understanding of context, which is vital for many applications.
g
School Learning Modalities, 2020-2021 | gimi9.com
gimi9.com
Updated Mar 7, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). School Learning Modalities, 2020-2021 | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_school-learning-modalities-2020-2021/
Explore at:
Dataset updated
Mar 7, 2023
Description
🇺🇸 미국 English The 2020-2021 School Learning Modalities dataset provides weekly estimates of school learning modality (including in-person, remote, or hybrid learning) for U.S. K-12 public and independent charter school districts for the 2020-2021 school year, from August 2020 – June 2021. These data were modeled using multiple sources of input data (see below) to infer the most likely learning modality of a school district for a given week. These data should be considered district-level estimates and may not always reflect true learning modality, particularly for districts in which data are unavailable. If a district reports multiple modality types within the same week, the modality offered for the majority of those days is reflected in the weekly estimate. All school district metadata are sourced from the National Center for Educational Statistics (NCES) for 2020-2021. School learning modality types are defined as follows: In-Person: All schools within the district offer face-to-face instruction 5 days per week to all students at all available grade levels.
Z
Data from: MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark...
data.niaid.nih.gov
Updated Apr 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiancheng Yang (2023). MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4269851
Explore at:
Dataset updated
Apr 19, 2023
Dataset provided by
Bingbing Ni
Jiancheng Yang
Rui Shi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This data repository for MedMNIST v1 is out of date! Please check the latest version of MedMNIST v2.

Abstract

We present MedMNIST, a collection of 10 pre-processed medical open datasets. MedMNIST is standardized to perform classification tasks on lightweight 28x28 images, which requires no background knowledge. Covering the primary data modalities in medical image analysis, it is diverse on data scale (from 100 to 100,000) and tasks (binary/multi-class, ordinal regression and multi-label). MedMNIST could be used for educational purpose, rapid prototyping, multi-modal machine learning or AutoML in medical image analysis. Moreover, MedMNIST Classification Decathlon is designed to benchmark AutoML algorithms on all 10 datasets; We have compared several baseline methods, including open-source or commercial AutoML tools. The datasets, evaluation code and baseline methods for MedMNIST are publicly available at https://medmnist.github.io/.

Please note that this dataset is NOT intended for clinical use.

We recommend our official code to download, parse and use the MedMNIST dataset:

pip install medmnist

Citation and Licenses

If you find this project useful, please cite our ISBI'21 paper as: Jiancheng Yang, Rui Shi, Bingbing Ni. "MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis," arXiv preprint arXiv:2010.14925, 2020.

or using bibtex: @article{medmnist, title={MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis}, author={Yang, Jiancheng and Shi, Rui and Ni, Bingbing}, journal={arXiv preprint arXiv:2010.14925}, year={2020} }

Besides, please cite the corresponding paper if you use any subset of MedMNIST. Each subset uses the same license as that of the source dataset.

PathMNIST

Jakob Nikolas Kather, Johannes Krisam, et al., "Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study," PLOS Medicine, vol. 16, no. 1, pp. 1–22, 01 2019.

License: CC BY 4.0

ChestMNIST

Xiaosong Wang, Yifan Peng, et al., "Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases," in CVPR, 2017, pp. 3462–3471.

License: CC0 1.0

DermaMNIST

Philipp Tschandl, Cliff Rosendahl, and Harald Kittler, "The ham10000 dataset, a large collection of multisource dermatoscopic images of common pigmented skin lesions," Scientific data, vol. 5, pp. 180161, 2018.

Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, and Allan Halpern: “Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)”, 2018; arXiv:1902.03368.

License: CC BY-NC 4.0

OCTMNIST/PneumoniaMNIST

Daniel S. Kermany, Michael Goldbaum, et al., "Identifying medical diagnoses and treatable diseases by image-based deep learning," Cell, vol. 172, no. 5, pp. 1122 – 1131.e9, 2018.

License: CC BY 4.0

RetinaMNIST

DeepDR Diabetic Retinopathy Image Dataset (DeepDRiD), "The 2nd diabetic retinopathy – grading and image quality estimation challenge," https://isbi.deepdr.org/data.html, 2020.

License: CC BY 4.0

BreastMNIST

Walid Al-Dhabyani, Mohammed Gomaa, Hussien Khaled, and Aly Fahmy, "Dataset of breast ultrasound images," Data in Brief, vol. 28, pp. 104863, 2020.

License: CC BY 4.0

OrganMNIST_{Axial,Coronal,Sagittal}

Patrick Bilic, Patrick Ferdinand Christ, et al., "The liver tumor segmentation benchmark (lits)," arXiv preprint arXiv:1901.04056, 2019.

Xuanang Xu, Fugen Zhou, et al., "Efficient multiple organ localization in ct image using 3d region proposal network," IEEE Transactions on Medical Imaging, vol. 38, no. 8, pp. 1885–1898, 2019.

License: CC BY 4.0
Data from: Leverage Points in Modality shifts: Comparing Language-only and...
zenodo.org
zip
Updated Oct 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anonymous; Anonymous (2022). Leverage Points in Modality shifts: Comparing Language-only and Multimodal Word Representations [Dataset]. http://doi.org/10.5281/zenodo.7198705
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7198705
Dataset updated
Oct 14, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Anonymous; Anonymous
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset for the blind submission of Leverage Points in Modality shifts: Comparing Language-only and Multimodal Word Representations paper
d
Replication Data for: When modality and tense meet. The future marker budet...
dataone.org
dataverse.no
+1more
Updated Sep 25, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zhamaletdinova, Elmira (2024). Replication Data for: When modality and tense meet. The future marker budet ‘will’ in impersonal constructions with the modal adverb možno ‘be possible’ [Dataset]. http://doi.org/10.18710/MOJBDK
Explore at:
Unique identifier
https://doi.org/10.18710/MOJBDK
Dataset updated
Sep 25, 2024
Dataset provided by
DataverseNO
Authors
Zhamaletdinova, Elmira
Time period covered
Jan 1, 1826 - Jan 1, 2015
Description
Dataset description: This is a study of examples of Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. The data was collected in 2020-2021 from the old version of the Russian National Corpus (ruscorpora.ru). In the spreadsheet 01DataMoznoBudet, the data merges the results of four searches conducted to extract examples of sentences with the following construction types: možno + budet + INF.PFV, možno + budet + INF.IPFV, možno + INF.PFV and možno + INF.IPFV. The results for each search were downloaded, pseudorandomized, and the first 200 examples were manually annotated, based on the syntactic analyses given in the corpus. The syntactic and morphological categories used in the corpus are explained here: https://ruscorpora.ru/corpus/main. In the spreadsheet 01DataZavtraMoznoBudet, the data merges the results of four searches conducted to extract examples of sentences with the following structure: zavtra + možno + budet + INF.PFV, zavtra + možno + budet + INF.IPFV, zavtra + možno + INF.PFV and zavtra + možno + INF.IPFV. All of the examples (103 sentences) were imported to a spreadsheet and annotated manually, based on the syntactic analyses given in the corpus. The syntactic and morphological categories used in the corpus are explained here: https://ruscorpora.ru/corpus/main. Article abstract: This paper examines Russian impersonal constructions with the modal word možno ‘can, be possible’ with and without the future copula budet ‘will be,’ i.e., možno + budet + INF and možno + INF. My contribution can be summarized as follows. First, corpus-based evidence reveals that možno + INF constructions are vastly more frequent than constructions with copula. Second, the meaning of constructions without the future copula is more flexible: while the possibility is typically located in the present, the situation denoted by the infinitive may be located in the present or the future. Third, I show that the možno + INF construction is more ambiguous and can denote present, gnomic or future situations. Fourth, I identify a number of contextual factors that unambiguously locate the situation in the future. I demonstrate that such factors are more frequently used with the future copula, and thus motivate the choice between the two constructions. Finally, I illustrate the interpretations in a straightforward manner by means of schemas of the type used in cognitive linguistics.
h
Modality-Interference-in-MLLMs-DATA
huggingface.co
Updated May 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Luis Rui (2025). Modality-Interference-in-MLLMs-DATA [Dataset]. https://huggingface.co/datasets/luisrui/Modality-Interference-in-MLLMs-DATA
Explore at:
Dataset updated
May 4, 2025
Authors
Luis Rui
Description
luisrui/Modality-Interference-in-MLLMs-DATA dataset hosted on Hugging Face and contributed by the HF Datasets community
N
Data from: Task-Dependent Recruitment of Modality-Specific and Multimodal...
neurovault.org
zip
Updated Feb 25, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Task-Dependent Recruitment of Modality-Specific and Multimodal Regions during Conceptual Processing [Dataset]. http://identifiers.org/neurovault.collection:9490
Explore at:
zipAvailable download formats
Unique identifier
https://identifiers.org/neurovault.collection:9490
Dataset updated
Feb 25, 2021
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
A collection of 7 brain maps. Each brain map is a 3D array of values representing properties of the brain at different locations.

Collection description

This collection contains group-level activation maps for the fMRI study "Task-Dependent Recruitment of Modality-Specific and Multimodal Regions during Conceptual Processing" by Kuhnke et al. (2020). The study investigated task dependency of conceptual knowledge retrieval, focusing on sound and action features of word meaning.
f
Data from: Mandarin Chinese modality exclusivity norms
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Feb 20, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Huang, Chu-Ren; Chen, I-Hsuan; Lu, Qin; Zhao, Qingqing; Long, Yunfei (2019). Mandarin Chinese modality exclusivity norms [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000113319
Explore at:
Dataset updated
Feb 20, 2019
Authors
Huang, Chu-Ren; Chen, I-Hsuan; Lu, Qin; Zhao, Qingqing; Long, Yunfei
Description
Modality exclusivity norms have been developed in different languages for research on the relationship between perceptual and conceptual systems. This paper sets up the first modality exclusivity norms for Chinese, a Sino-Tibetan language with semantics as its orthographically relevant level. The norms are collected through two studies based on Chinese sensory words. The experimental designs take into consideration the morpho-lexical and orthographic structures of Chinese. Study 1 provides a set of norms for Mandarin Chinese single-morpheme words in mean ratings of the extent to which a word is experienced through the five sense modalities. The degrees of modality exclusivity are also provided. The collected norms are further analyzed to examine how sub-lexical orthographic representations of sense modalities in Chinese characters affect speakers’ interpretation of the sensory words. In particular, we found higher modality exclusivity rating for the sense modality explicitly represented by a semantic radical component, as well as higher auditory dominant modality rating for characters with transparent phonetic symbol components. Study 2 presents the mean ratings and modality exclusivity of coordinate disyllabic compounds involving multiple sense modalities. These studies open new perspectives in the study of modality exclusivity. First, links between modality exclusivity and writing systems have been established which has strengthened previous accounts of the influence of orthography in the processing of visual information in reading. Second, a new set of modality exclusivity norms of compounds is proposed to show the competition of influence on modality exclusivity from different linguistic factors and potentially allow such norms to be linked to studies on synesthesia and semantic transparency.

Facebook

Twitter

Click to copy link

Link copied

Cite

LMMs-Lab (2025). full-modality-data [Dataset]. https://huggingface.co/datasets/lmms-lab/full-modality-data

full-modality-data

lmms-lab/full-modality-data

Explore at:

51 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Aug 1, 2025

Dataset authored and provided by

LMMs-Lab

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Full Modality Dataset Statistics

  Video Statistics

Total Videos: 28,472 Total Duration: 1422.33 hours Average Duration: 179.84 seconds Median Duration: 160.08 seconds Duration Range: 10.04s - 1780.03s

  QA Statistics

Total Questions: 1,444,526 Average Questions per Video: 50.7 Questions per Video Range: 14 - 450

  Question Type Distribution

OE: 1,444,526 (100.0%)

  Question Category Distribution

temporal: 96,873 (6.7%) causal: 96,873… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/full-modality-data.

Clear search

Close search

Google apps

Main menu

full-modality-data

School Learning Modalities, 2021-2022

VGG16 Metrics, Average AUC = 0.9987707721217087.

Modality-based Multitasking, practice data

Data from: Typical Concept-Driven Modality-missing Deep Cross-Modal...

Learning modalities reported by source.

School Learning Modalities, 2020-2021

Student Enrollment and Attendance Data by Teaching Modality - 2020 - 2021

Multi-modality medical image dataset for medical image processing in Python...

Data from: Towards Cross-Modality Modeling for Time Series Analytics: A...

Multi-Modal Imaging Data-Integration Market Research Report 2033

Multi-Modal Imaging Data-Integration Market Outlook

Imaging Modality Analysis

Data from: CMShipReID: A Cross-Modality Ship Dataset for the...

Global Multimodal AI Market Size By Offering (Solutions, Services), By Data...

School Learning Modalities, 2020-2021 | gimi9.com

Data from: MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark...

Data from: Leverage Points in Modality shifts: Comparing Language-only and...

Replication Data for: When modality and tense meet. The future marker budet...

Modality-Interference-in-MLLMs-DATA

Data from: Task-Dependent Recruitment of Modality-Specific and Multimodal...

Collection description

Data from: Mandarin Chinese modality exclusivity norms

full-modality-data

lmms-lab/full-modality-data