100+ datasets found

Dataset for "Large Language Models for Structuring and Integration of...
zenodo.org
pdf
Updated Jan 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Henrik Bongertmann; Benjamin Nast; Benjamin Nast; Leon Griesch; Leon Griesch; Henry Rotzoll; Kurt Sandkuhl; Kurt Sandkuhl; Henrik Bongertmann; Henry Rotzoll (2025). Dataset for "Large Language Models for Structuring and Integration of Heterogeneous Data" [Dataset]. http://doi.org/10.5281/zenodo.14779110
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14779110
Dataset updated
Jan 31, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Henrik Bongertmann; Benjamin Nast; Benjamin Nast; Leon Griesch; Leon Griesch; Henry Rotzoll; Kurt Sandkuhl; Kurt Sandkuhl; Henrik Bongertmann; Henry Rotzoll
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is the dataset for the paper "Large Language Models for Structuring and Integration of Heterogeneous Data" (add DOI).

It contains:

Example documents (anonymized)

Comparison results of open-source LLMs

Additional material employed in the case study (e.g., prompt or JSON template)

Results of the case study
i
A multi-source heterogeneous data monitoring method based on latent subspace...
ieee-dataport.org
Updated Oct 7, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yunpeng Fan (2020). A multi-source heterogeneous data monitoring method based on latent subspace [Dataset]. https://ieee-dataport.org/documents/multi-source-heterogeneous-data-monitoring-method-based-latent-subspace
Explore at:
Dataset updated
Oct 7, 2020
Authors
Yunpeng Fan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
the main contributions of this paper are threefold.
Enhanced Stock Price Prediction with Optimized Ensemble Modeling Using...
figshare.com
xlsx
Updated Nov 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hongjiu Liu (2024). Enhanced Stock Price Prediction with Optimized Ensemble Modeling Using Multi-source Heterogeneous Data [Dataset]. http://doi.org/10.6084/m9.figshare.27328590.v2
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.27328590.v2
Dataset updated
Nov 5, 2024
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Hongjiu Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
the dataset can used for the test of models of deep learning which include structured data: stock price and unstructured data: stock bar posts. so, the dataset is Multi-source Heterogeneous Data.
t
Data from: Dcase2024 Task 4: Sound event detection with heterogeneous data...
service.tib.eu
Updated Dec 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Dcase2024 Task 4: Sound event detection with heterogeneous data and missing labels [Dataset]. https://service.tib.eu/ldmservice/dataset/dcase2024-task-4--sound-event-detection-with-heterogeneous-data-and-missing-labels
Explore at:
Dataset updated
Dec 16, 2024
Description
Sound event detection (SED) task with heterogeneous datasets, including Domestic Environ-ment Sound Event Detection (DESED) and Multi-Annotator Estimated STROng labels (MAESTRO)
Data from: Heterogeneous Multi-Source Data Fusion Through Input Mapping And...
zenodo.org
bin, csv
Updated Jan 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yigitcan Comlek; Yigitcan Comlek; Sandipp Krishnan Ravi; Sandipp Krishnan Ravi; Piyush Pandita; Sayan Ghosh; Liping Wang; Wei Chen; Piyush Pandita; Sayan Ghosh; Liping Wang; Wei Chen (2025). Heterogeneous Multi-Source Data Fusion Through Input Mapping And Latent Variable Gaussian Process [Dataset]. http://doi.org/10.5281/zenodo.14681801
Explore at:
csv, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14681801
Dataset updated
Jan 23, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Yigitcan Comlek; Yigitcan Comlek; Sandipp Krishnan Ravi; Sandipp Krishnan Ravi; Piyush Pandita; Sayan Ghosh; Liping Wang; Wei Chen; Piyush Pandita; Sayan Ghosh; Liping Wang; Wei Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains the data used for “Heterogeneous Multi-Source Data Fusion Through Input Mapping And Latent Variable Gaussian Process” paper by Yigitcan Comlek, Sandipp Krishnan Ravi, Piyush Pandita, Sayan Ghosh, Liping Wang, and Wei Chen. For all correspondence, please contact Dr. Wei Chen (weichen@northwestern.edu) or Dr. Sandipp Krishnan Ravi (sandippk@umich.edu).

Please use the below BibTex format to cite this work:

@article{comlek2024heterogenous,

title={Heterogenous Multi-Source Data Fusion Through Input Mapping and Latent Variable Gaussian Process},

author={Comlek, Yigitcan and Ravi, Sandipp Krishnan and Pandita, Piyush and Ghosh, Sayan and Wang, Liping and Chen, Wei},

journal={arXiv preprint arXiv:2407.11268},

year={2024}

}

The repository consists of data used in three case studies. All the data available is in .csv format. Each csv file contains the data for the specific source used in the case study. Below is a summary of the files for each of the three case studies.

Case Study 1 (Cantilever Beam)

· Source1_RectangularBeam.csv

· Source2_RectangularHollowBeam.csv

· Source3_CircularHollowBeam.csv

Case Study 2 (Ellipsoidal Void)

· Source1_2DEllipse.csv

· Source2_3DEllipse.csv

· Source3_3DEllipseRot.csv

Case Study 3 (Ti6AlV Alloys)

· Source1_LBPF.csv [1,2]

· Source2_EBM.csv [3]

· Source3_FSW.csv [4]

For this case study the data is collected from the below papers:

[1] Q. Luo, L. Yin, T. W. Simpson, and A. M. Beese, “Effect of processing parameters on pore structures, grain features, and mechanical properties in ti-6al-4v by laser powder bed fusion,” Additive Manufacturing, vol. 56, p. 102 915, 2022.

[2] Q. Luo, L. Yin, T. W. Simpson, and A. M. Beese, “Dataset of process-structure-property feature relationship for laser powder bed fusion additive manufactured ti-6al-4v material.,” Data in Brief, vol. 46, p. 108 911, 2023.

[3] J. Ran, F. Jiang, X. Sun, Z. Chen, C. Tian, and H. Zhao, “Microstructure and mechanical properties of ti-6al-4v fabricated by electron beam melting,” Crystals, vol. 10, no. 11, p. 972, 2020.

[4] A. Fall, M. Jahazi, A. Khdabandeh, and M. Fesharaki, “Effect of process parameters on microstructure and mechanical properties of friction stir-welded ti–6al–4v joints,” The International Journal of Advanced Manufacturing Technology, vol. 91, pp. 2919–2931, 2017
From RAGs to Riches: Utilizing Richly Attributed Graphs to Reason from...
figshare.com
pptx
Updated Jan 20, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NeuroData (2016). From RAGs to Riches: Utilizing Richly Attributed Graphs to Reason from Heterogeneous Data: Part 1 [Dataset]. http://doi.org/10.6084/m9.figshare.1593271.v1
Explore at:
pptxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.1593271.v1
Dataset updated
Jan 20, 2016
Dataset provided by
Figsharehttp://figshare.com/
Authors
NeuroData
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Utilizing Richly Attributed Graphs to Reason from Heterogeneous Data - Part 1
Data from: Learn2Link: Linking the Social and Academic Profiles of...
zenodo.org
data.niaid.nih.gov
bin, zip
Updated Apr 1, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Asmelash Teka Hadgu; Asmelash Teka Hadgu; Jayanth Kumar Reddy Gundam; Jayanth Kumar Reddy Gundam (2020). Learn2Link: Linking the Social and Academic Profiles of Researchers [Dataset]. http://doi.org/10.5281/zenodo.3735448
Explore at:
zip, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3735448
Dataset updated
Apr 1, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Asmelash Teka Hadgu; Asmelash Teka Hadgu; Jayanth Kumar Reddy Gundam; Jayanth Kumar Reddy Gundam
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data and code to reproduce work in Learn2Link: Linking the Social and Academic Profiles of Researchers.
g
Towards Digital Twinning on the Web: Heterogeneous 3D Data Fusion Based on...
eleonasrepo.getmap.gr
Updated Jan 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Towards Digital Twinning on the Web: Heterogeneous 3D Data Fusion Based on Open-Source Structure - Datasets - eLeonas Data Hub [Dataset]. https://eleonasrepo.getmap.gr/dataset/towards-digital-twinning-on-the-web-heterogeneous-3d-data-fusion-based-on-open-source-structure
Explore at:
Dataset updated
Jan 26, 2023
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Recent advances in Computer Science and the spread of internet connection have allowed specialists to virtualize complex environments on the web and offer further information with realistic exploration experiences. At the same time, the fruition of complex geospatial datasets (point clouds, Building Information Modelling (BIM) models, 2D and 3D models) on the web is still a challenge, because usually it involves the usage of different proprietary software solutions, and the input data need further simplification for computational effort reduction. Moreover, integrating geospatial datasets acquired in different ways with various sensors remains a challenge. An interesting question, in that respect, is how to integrate 3D information in a 3D GIS (Geographic Information System) environment and manage different scales of information in the same application. Integrating a multiscale level of information is currently the first step when it comes to digital twinning. It is needed to properly manage complex urban datasets in digital twins related to the management of the buildings (cadastral management, prevention of natural and anthropogenic hazards, structure monitoring, etc.). Therefore, the current research shows the development of a freely accessible 3D Web navigation model based on open-source technology that allows the visualization of heterogeneous complex geospatial datasets in the same virtual environment. This solution employs JavaScript libraries based on WebGL technology. The model is accessible through web browsers and does not need software installation from the user side. The case study is the new building of the University of Twente-Faculty of Geo-Information (ITC), located in Enschede (the Netherlands). The developed solution allows switching between heterogeneous datasets (point clouds, BIM, 2D and 3D models) at different scales and visualization (indoor first-person navigation, outdoor navigation, urban navigation). This solution could be employed by governmental stakeholders or the private sector to remotely visualize complex datasets on the web in a unique visualization, and take decisions only based on open-source solutions. Furthermore, this system can incorporate underground data or real-time sensor data from the IoT (Internet of Things) for digital twinning tasks.
m
Documents Data for "Composition of Heterogeneous Web Services: A Systematic...
data.mendeley.com
Updated Jan 4, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexis Huf (2019). Documents Data for "Composition of Heterogeneous Web Services: A Systematic Review" [Dataset]. http://doi.org/10.17632/hcbcg23836.2
Explore at:
Unique identifier
https://doi.org/10.17632/hcbcg23836.2
Dataset updated
Jan 4, 2019
Authors
Alexis Huf
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains document data for a Systematic Literature Review (SLR) titled "Composition of Heterogeneous Web Services: A Systematic Review". Inclusion/exclusion decision and extracted data from the documents are included.

Three main types can be identified on the Web and on corporate networks: SOAP services, which use the homonym protocol and well established technologies, such as WSDL; RESTful services which employ HTTP directly and conform to the constraints of the REST architectural style; and event-oriented services that take the initiative in notifying their clients about relevant facts. The co-existence of these service types has brought considerable research interest on service type heterogeneity in Web Service composition. The research question of SLR is "How are services of heterogeneous types (SOAP, RESTful and \event-oriented services) composed?".

Documents that may answer this question were searched in Scopus and IEEE Xplore, from conferences and journal sources, without a time limit. Search results were last updated in July 22, 2018 and yielded 63 relevant documents published from 2005 to 2018.

Most works (48) target SOAP and RESTful services heterogeneity, followed by a smaller group of 18 targeting SOAP and event-oriented heterogeneity. The other two combinations, RESTful/event-oriented and SOAP/RESTful/event-oriented sum 5 documents. Among these documents, RESTful support was found to be incipient, with most documents violating constraints of the REST architectural style. The method used for heterogeneity support were classified in 7 archetypes: 1. Common description 2. Proxy 3. Middleware 4. Workflow language 5. Event processor 6. Automatic composition 7. Direct Implementation
i
Heterogeneous and Similarity Network Data
ieee-dataport.org
Updated Nov 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
wen wang (2024). Heterogeneous and Similarity Network Data [Dataset]. https://ieee-dataport.org/documents/heterogeneous-and-similarity-network-data
Explore at:
Dataset updated
Nov 13, 2024
Authors
wen wang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
current methods often rely on single-modal data and fail to effectively integrate multimodal information when representing node attributes.
d
Data from: Subtyping of common complex diseases and disorders by integrating...
datadryad.org
data.niaid.nih.gov
zip
Updated Jul 6, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Victor Andreev; Victor P. Andreev; Margaret E. Helmuth; Abigail R. Smith; Robert M. Merion; Claire C. Yang; Anne P. Cameron; J. Eric Jelovsek; Cindy L. Amundsen; Brian T. Helfand; Catherine S. Bradley; John O. L. DeLancey; James W. Griffith; Alexander P. Glaser; Brenda W. Gillespie; J. Quentin Clemens; H. Henry Lai; Margaret Helmuth; Gang Liu (2022). Subtyping of common complex diseases and disorders by integrating heterogeneous data. Identifying clusters among women with lower urinary tract symptoms in the LURN study [Dataset]. http://doi.org/10.5061/dryad.f4qrfj6zd
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.f4qrfj6zd
Dataset updated
Jul 6, 2022
Dataset provided by
Dryad
Authors
Victor Andreev; Victor P. Andreev; Margaret E. Helmuth; Abigail R. Smith; Robert M. Merion; Claire C. Yang; Anne P. Cameron; J. Eric Jelovsek; Cindy L. Amundsen; Brian T. Helfand; Catherine S. Bradley; John O. L. DeLancey; James W. Griffith; Alexander P. Glaser; Brenda W. Gillespie; J. Quentin Clemens; H. Henry Lai; Margaret Helmuth; Gang Liu
Time period covered
2022
Description
We present a methodology for subtyping of persons with a common clinical symptom complex by integrating heterogeneous continuous and categorical data. We illustrate it by clustering women with lower urinary tract symptoms (LUTS), who represent a heterogeneous cohort with overlapping symptoms and multifactorial etiology. Data collected in the Symptoms of Lower Urinary Tract Dysfunction Research Network (LURN), a multi-center observational study, included self-reported urinary and non-urinary symptoms, bladder diaries, and physical examination data for 545 women. Heterogeneity in these multidimensional data required thorough and non-trivial preprocessing, including scaling by controls and weighting to mitigate data redundancy, while the various data types (continuous and categorical) required novel methodology using a weighted Tanimoto indices approach. Data domains only available on a subset of the cohort were integrated using a semi-supervised clustering approach. Novel contrast criteri...
S
Big Data Oriented Smart Tool Condition Monitoring System: Tool life...
scidb.cn
Updated Nov 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kunpeng Zhu; Guochao Li; Yu Zhang (2022). Big Data Oriented Smart Tool Condition Monitoring System: Tool life prediction result data [Dataset]. http://doi.org/10.57760/sciencedb.05993
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.05993
Dataset updated
Nov 2, 2022
Dataset provided by
Science Data Bank
Authors
Kunpeng Zhu; Guochao Li; Yu Zhang
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
c1_ wear.csv, c4_ Wear.csv and c6_ wear.csv are the open data set of tool wear experiment. The micro groove milling experiment was carried out on HSM600U high speed machining center. The tool used in the experiment is a vertical flat end milling cutter with a diameter of 800 microns. The cutting material is steel T4. Each cutting segment lasts for two seconds, and 100 cutting segments are conducted for each group of experiments. The cutting force signal is measured with a Kistler dynamometer, and the sampling frequency is 50KHz.Six different cutting conditions were used in the experiment. Experiments 1, 3 and 5 are used as training samples, and Experiments 2, 4 and 6 are used as test samples. After each milling, the tool wear data of each tooth is obtained by offline measurement. t_ tl_ 022619.fig and t_ RUL_ 022619.fig is the prediction result of the change of tool life and effective residual tool life with milling time respectively.
Data from: Who let the CAT out of the bag? accurately dealing with...
zenodo.org
data.niaid.nih.gov
+2more
pdf, zip
Updated Jul 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nathan V. Whelan; Kenneth M. Halanych; Nathan V. Whelan; Kenneth M. Halanych (2024). Data from: Who let the CAT out of the bag? accurately dealing with substitutional heterogeneity in phylogenomic analyses [Dataset]. http://doi.org/10.5061/dryad.85b2m
Explore at:
zip, pdfAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.85b2m
Dataset updated
Jul 19, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Nathan V. Whelan; Kenneth M. Halanych; Nathan V. Whelan; Kenneth M. Halanych
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
As phylogenetic datasets have increased in size, site-heterogeneous substitution models such as CAT-F81 and CAT-GTR have been advocated in favor of other models because they purportedly suppress long-branch attraction (LBA). These models are two of the most commonly used models in phylogenomics, and they have been applied to a variety of taxa ranging from Drosophila to land plants. However, many arguments in favor of CAT models have been based on tenuous assumptions about the true phylogeny rather than rigorous testing with known trees via simulation. Moreover, CAT models have not been compared to other approaches for handling substitutional heterogeneity such as data partitioning with site-homogeneous substitution models. We simulated amino acid sequence datasets with substitutional heterogeneity on a variety of tree shapes including those susceptible to LBA. Data were analyzed with both CAT models and partitioning to explore model performance; in total over 670,000 CPU hours were used, of which over 97% was spent running analyses with CAT models. In many cases, all models recovered branching patterns that were identical to the known tree. However, CAT-F81 consistently performed worse than other models in inferring the correct branching patterns, and both CAT models often overestimated substitutional heterogeneity. Additionally, reanalysis of two empirical metazoan datasets supports the notion that CAT-F81 tends to recover less accurate trees than data partitioning and CAT-GTR. Given these results, we conclude that partitioning and CAT-GTR perform similarly in recovering accurate branching patterns. However, computation time can be orders of magnitude less for data partitioning, with commonly used implementations of CAT-GTR often failing to reach completion in a reasonable time frame (i.e., for Bayesian analyses to converge). Practices such as removing constant sites and parsimony uninformative characters, or using CAT-F81 when CAT-GTR is deemed too computationally expensive, cannot be logically justified. Given clear problems with CAT-F81, phylogenies previously inferred with this model should be reassessed.
d
Robust Map-Matching for Heterogeneous Data via Dominance Decomposition
dataone.org
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Seybold, Martin P. (2023). Robust Map-Matching for Heterogeneous Data via Dominance Decomposition [Dataset]. http://doi.org/10.7910/DVN/QOBPC5
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/QOBPC5
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Seybold, Martin P.
Description
Java application implementing dominance decomposition map matching. Traces and graph data used in experiments in the paper submitted to the ECML-PKDD 2016 conference. Datasets are under the OpenStreetMap licence ODbL. © OpenStreetMap contributors. The Guava library is under the Apache licence.
f
Data from: Extreme Value Estimation for Heterogeneous Data
tandf.figshare.com
txt
Updated Feb 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John H.J. Einmahl; Yi He (2024). Extreme Value Estimation for Heterogeneous Data [Dataset]. http://doi.org/10.6084/m9.figshare.17124050.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.17124050.v1
Dataset updated
Feb 19, 2024
Dataset provided by
Taylor & Francis
Authors
John H.J. Einmahl; Yi He
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We develop a universal econometric formulation of empirical power laws possibly driven by parameter heterogeneity. Our approach extends classical extreme value theory to specifying the tail behavior of the empirical distribution of a general data set with possibly heterogeneous marginal distributions. We discuss several model examples that satisfy our conditions and demonstrate in simulations how heterogeneity may generate empirical power laws. We observe a cross-sectional power law for US stock losses and show that this tail behavior is largely driven by the heterogeneous volatilities of the individual assets.
d
Data from: Carrying capacity in a heterogeneous environment with habitat...
catalog.data.gov
data.usgs.gov
+1more
Updated Jul 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Carrying capacity in a heterogeneous environment with habitat connectivity [Dataset]. https://catalog.data.gov/dataset/carrying-capacity-in-a-heterogeneous-environment-with-habitat-connectivity
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
The data are population sizes of yeast Saccharaomyces cerevisiae growth in laboratory cultures over a period of several days with different levels of growth inhibitor cycloheximide. Our results provide rigorous experimental tests of new and old theory, demonstrating how the traditional notion of carrying capacity is ambiguous for populations diffusing in spatially heterogeneous environments.
d
Data from: Diversification across a heterogeneous landscape
datadryad.org
data.niaid.nih.gov
+1more
zip
Updated Jul 11, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gregory M. Walter; Melanie J. Wilkinson; Maddie E. James; Thomas J. Richards; J. David Aguirre; Daniel Ortiz-Barrientos; Greg M. Walter (2016). Diversification across a heterogeneous landscape [Dataset]. http://doi.org/10.5061/dryad.d9f36
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.d9f36
Dataset updated
Jul 11, 2016
Dataset provided by
Dryad
Authors
Gregory M. Walter; Melanie J. Wilkinson; Maddie E. James; Thomas J. Richards; J. David Aguirre; Daniel Ortiz-Barrientos; Greg M. Walter
Time period covered
Jul 9, 2016
Area covered
Australia, North East New South Wales, South East Queensland
Description
Adaptation to contrasting environments across a heterogeneous landscape favors the formation of ecotypes by promoting ecological divergence. Patterns of fitness variation in the field can show whether natural selection drives local adaptation and ecotype formation. However, to demonstrate a link between ecological divergence and speciation, local adaptation must have consequences for reproductive isolation. Using contrasting ecotypes of an Australian wildflower, Senecio lautus in common garden experiments, hybridization experiments, and reciprocal transplants, we assessed how the environment shapes patterns of adaptation and the consequences of adaptive divergence for reproductive isolation. Local adaptation was strong between ecotypes, but weaker between populations of the same ecotype. F1 hybrids exhibited heterosis, but crosses involving one native parent performed better than those with two foreign parents. In a common garden experiment, F2 hybrids exhibited reduced fitness compared...
e
Areas of Heterogeneous Integrity
data.europa.eu
esri shape, wms
Updated Feb 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Areas of Heterogeneous Integrity [Dataset]. https://data.europa.eu/data/datasets/r_veneto-c11030140241_3ameterintegr
Explore at:
wms, esri shapeAvailable download formats
Dataset updated
Feb 4, 2025
Description
Areas of heterogeneous integrity, present in Table 3 of the PTRC approved in 1992
H
Replication Data for Estimating Heterogeneous Treatment Effects and the...
dataverse.harvard.edu
Updated Mar 15, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Justin Grimmer; Sean Westwood; Solomon Messing (2017). Replication Data for Estimating Heterogeneous Treatment Effects and the Effects of Heterogeneous Treatments with Ensemble Methods [Dataset]. http://doi.org/10.7910/DVN/BQMLQW
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/BQMLQW
Dataset updated
Mar 15, 2017
Dataset provided by
Harvard Dataverse
Authors
Justin Grimmer; Sean Westwood; Solomon Messing
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This provides the replication code and data for the paper "Estimating Heterogeneous Treatment Effects and the Effects of Heterogeneous Treatments with Ensemble Methods".
TEAMER: Heterogeneous Wave Energy Converter Test Data
data.openei.org
catalog.data.gov
archive, data
Updated Jun 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Olivia Vitale; Alaa Ahmed; Maha Haji; Olivia Vitale; Alaa Ahmed; Maha Haji (2025). TEAMER: Heterogeneous Wave Energy Converter Test Data [Dataset]. https://data.openei.org/submissions/8456
Explore at:
archive, dataAvailable download formats
Dataset updated
Jun 27, 2025
Dataset provided by
United States Department of Energyhttp://energy.gov/
Open Energy Data Initiative (OEDI)
Cornell University
Authors
Olivia Vitale; Alaa Ahmed; Maha Haji; Olivia Vitale; Alaa Ahmed; Maha Haji
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset was generated during the Heterogeneous Wave Energy Converter (HetWECs) experimental campaign conducted at the O.H. Hinsdale Direction Wave Basin at Oregon State University. Experiments include system identification, hydrodynamics, and power take-off (PTO) tests. The experiments feature 4- and 5-body heterogenous WEC arrays consisting of both oscillating surge WECs and heaving point absorbers. Data was collected using Qualysis motion capture of the device motion, resistive wave gauges to capture wave height data at 20 locations throughout the basin, S-shaped load cells to measure wave excitation force and radiation force, and a Vesc 6 75 to measure motor current, motor RPMs, and FOC current. The submission includes post-processing MATLAB code is to support data handling and figure generation as well as test matrices detailing the sea state conditions for each experimental run.

Facebook

Twitter

Click to copy link

Link copied

Cite

Henrik Bongertmann; Benjamin Nast; Benjamin Nast; Leon Griesch; Leon Griesch; Henry Rotzoll; Kurt Sandkuhl; Kurt Sandkuhl; Henrik Bongertmann; Henry Rotzoll (2025). Dataset for "Large Language Models for Structuring and Integration of Heterogeneous Data" [Dataset]. http://doi.org/10.5281/zenodo.14779110

Dataset for "Large Language Models for Structuring and Integration of Heterogeneous Data"

Explore at:

pdfAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.14779110

Dataset updated

Jan 31, 2025

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Henrik Bongertmann; Benjamin Nast; Benjamin Nast; Leon Griesch; Leon Griesch; Henry Rotzoll; Kurt Sandkuhl; Kurt Sandkuhl; Henrik Bongertmann; Henry Rotzoll

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This is the dataset for the paper "Large Language Models for Structuring and Integration of Heterogeneous Data" (add DOI).

It contains:

Example documents (anonymized)
Comparison results of open-source LLMs
Additional material employed in the case study (e.g., prompt or JSON template)
Results of the case study

Clear search

Close search

Google apps

Main menu

Dataset for "Large Language Models for Structuring and Integration of...

A multi-source heterogeneous data monitoring method based on latent subspace...

Enhanced Stock Price Prediction with Optimized Ensemble Modeling Using...

Data from: Dcase2024 Task 4: Sound event detection with heterogeneous data...

Data from: Heterogeneous Multi-Source Data Fusion Through Input Mapping And...

From RAGs to Riches: Utilizing Richly Attributed Graphs to Reason from...

Data from: Learn2Link: Linking the Social and Academic Profiles of...

Towards Digital Twinning on the Web: Heterogeneous 3D Data Fusion Based on...

Documents Data for "Composition of Heterogeneous Web Services: A Systematic...

Heterogeneous and Similarity Network Data

Data from: Subtyping of common complex diseases and disorders by integrating...

Big Data Oriented Smart Tool Condition Monitoring System: Tool life...

Data from: Who let the CAT out of the bag? accurately dealing with...

Robust Map-Matching for Heterogeneous Data via Dominance Decomposition

Data from: Extreme Value Estimation for Heterogeneous Data

Data from: Carrying capacity in a heterogeneous environment with habitat...

Data from: Diversification across a heterogeneous landscape

Areas of Heterogeneous Integrity

Replication Data for Estimating Heterogeneous Treatment Effects and the...

TEAMER: Heterogeneous Wave Energy Converter Test Data

Dataset for "Large Language Models for Structuring and Integration of Heterogeneous Data"