Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This paper studies the estimation of a panel data model with latent structures where individuals can be classified into different groups with the slope parameters being homogeneous within the same group but heterogeneous across groups. To identify the unknown group structure of vector parameters, we design an algorithm called Panel-CARDS. We show that it can identify the true group structure asymptotically and estimate the model parameters consistently at the same time. Simulations evaluate the performance and corroborate the asymptotic theory in several practical design settings. The empirical application reveals the heterogeneous grouping effect of income on democracy.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
To evaluate variance homogeneity among groups for clustered data, Iachine et al. (Robust tests for the equality of variances for clustered data. J Stat Comput Simul 2010;80(4):365–377) introduced an extension of the well-known Levene test. However, this method does not account for informative cluster size (ICS) or informative within-cluster group size (IWCGS), which can occur in clustered data when cluster and group sizes are random variables. This article introduces two tests of variance homogeneity that are appropriate for data with ICS and IWCGS, one extending the Levene-style transformation method and one based on a direct comparison of estimates of variance. We demonstrate the properties of our tests in a detailed simulation study and show that they are resistant to the potentially biasing effects of ICS and IWCGS. We illustrate the use of these tests by applying them to a data set of x-ray diffraction measurements collected from a specimen of duplex steel.
Matlab codes to reproduce the results of the model section.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The populations of interest in modern studies are very often heterogeneous. The population heterogeneity, the qualitative nature of the outcome variable and the high dimensionality of the predictors pose significant challenge in statistical analysis. In this article, we introduce a category-adaptive screening procedure with high-dimensional heterogeneous data, which is to detect category-specific important covariates. The proposal is a model-free approach without any specification of a regression model and an adaptive procedure in the sense that the set of active variables is allowed to vary across different categories, thus making it more flexible to accommodate heterogeneity. For response-selective sampling data, another main discovery of this article is that the proposed method works directly without any modification. Under mild regularity conditions, the newly procedure is shown to possess the sure screening and ranking consistency properties. Simulation studies contain supportive evidence that the proposed method performs well under various settings and it is effective to extract category-specific information. Applications are illustrated with two real datasets. Supplementary materials for this article are available online.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Additional file 2. R code for type I errors and power.
Ethnoracial Homogeneity and Public Outcomes: The (Non)effects of Diversity
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Plural definite descriptions give rise to homogeneity effects: the positive The trucks are blue and the negative The trucks aren't blue are both neither true nor false when some of the trucks are blue and some are not, that is, when the group of trucks is not homogeneous with respect to the property of being blue (Löbner, 1987, 2000; Schwarzschild, 1994; Križ, 2015b). The only existing acquisition studies related to the phenomenon have examined children's comprehension only of the affirmative versions of such sentences, and moreover have yielded conflicting data; while one study reports that preschoolers interpret definite plurals maximally (Munn et al., 2006, see also Royle et al., 2018), two other studies report that preschoolers allow non-maximal interpretations of definite plurals where adults do not (Karmiloff-Smith, 1979; Caponigro et al., 2012). Moreover, there is no agreed upon developmental trajectory to adult homogeneity. In this paper, we turn to acquisition data to investigate the predictions of a recent analysis of homogeneity that treats homogeneous meanings as the result of a scalar implicature (Magri, 2014). We conducted two experiments targeting 4- and 5-year-old French-speaking children's interpretations of plural definite descriptions in positive and negative sentences, and tested the same children on standard cases of scalar implicature. The experiments revealed three distinct subgroups of children: those who interpreted the plural definite descriptions existentially and failed to compute implicatures; those who both accessed homogeneous interpretations and computed implicatures; and finally, a smaller subgroup of children who appeared to access homogeneous interpretations without computing implicatures. We discuss the implications of our findings, which appear to speak against the implicature theory as the adult-like means of generating homogeneous meanings.
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
In 2019, Genome Biology and Evolution (11:3341-3352) published three statistical tests for assessing whether alignments of genome sequences violate the phylogenetic assumption of evolution under homogeneous conditions. The new tests extend the matched-pairs tests of symmetry, marginal symmetry, and internal symmetry for alignments of n = 2 homologous sequences of nucleotides or amino acids to cases where alignments of n > 2 sequences are considered. Here we discuss the limitations of these new tests and then outline alternative approaches, which permit formal testing of multiple hypotheses (i.e., by controlling either the family-wise error rate or the false discovery rate). We show that the other approaches provide much greater insight into variation of the evolutionary process across lineages, via informal graphical methods and formal statistical procedures. Using one of the procedures (i.e., the Bonferroni test), we show that evolution under heterogeneous conditions is more prevalent than reported in the paper cited above and that the power of the matched-pairs tests of homogeneity is linked to the number of variant sites in an alignment. We release a new version of Homo, a program that allows for formal testing of multiple hypotheses and calculation of adjusted P values. Using Homo, we analysed an alignment of amino acids encoded by 116 flavivirus genomes, and reveal that these viral genomes are unlikely to have evolved under homogeneous conditions. To our knowledge, this is the first time that this has been reported for medically important Flavivirus genomes. Methods This submission contains batch scripts, sequence data, and protocols describing what was done in five computer-based experiments outlined in the manuscript with the above-mentioned title. The format of the submission was chosen such that it is as FAIR compliant as possible (i.e., that the data are Findable, Accessible, Interoperable, and Resuasable). In other words, the research done in the five experiment is reproducible.
The content in folder Experiment_1 relates to the analysis of the performance of the Maximum Symmetry test. In particular, the experiment was designed to ascertain whether the edge lengths of a tree have an impact on the Type I error rate. The file named 00_README, describe the method used. The content in folder Experiment_2 relates to the analysis of the performance of the Maximum Symmetry test. In particular, the experiment was designed to ascertain whether the edge lengths of a tree have an impact on the Type II error rate. The file named 00_README, describe the method used. The content in folder Experiment_3 relates to the analysis of the performance of the Maximum Symmetry test. In particular, the experiment was designed to ascertain whether alignment length has an impact on the type II error rate. The file named 00_README, describe the method used. The content in folder Experiment_4 relates to the analysis of the performance of four tests concerning the global null hypothesis (H_G) of evolution under SRH conditions. In this case, the tests considered are Bonferroni's (1936) test, Hommel's (1983) test, Simes' (1986) test, and Naser-Khdour et al's (2019) test. The file named 00_README, describe the method used. The content in folder Experiment_5 relates to the analysis of the performance of four tests concerning the global null hypothesis (H_G) of evolution under SRH conditions. In this case, the tests considered are Bonferroni's (1936) test, Hommel's (1983) test, Simes' (1986) test, and Naser-Khdour et al's (2019) test.
As for the genome data used in Experiment_2, we note:
The policistronically-encoded amino-acid sequences of 116 flavivirus genomes were retrieved from GenBank (https://www.ncbi.nlm.nih.gov/genbank/) and aligned using MAFFT v7.453 (using ginsi mode) (Mol. Biol. Evol., 30:772-780).
The completeness of the alignment was surveyed using AliStat v1.13 (NAR Genom. Bioinf., 2:lqaa024) and sites containing ambiguous characters were deleted if the proportion of such characters exceeded 0.2. The resulting alignment of 3,367 sites had a completeness score (Ca) of 0.9880 (for details, see NAR Genom. Bioinf., 2:lqaa024).
Model selection was done using ModelFinder (Nat. Methods, 14:587-589), which is implemented in IQ-TREE2 v2.1.3 (Mol. Biol. Evol., 37:1530-1534). We only considered substitution models for viral polypeptides. For each model of sequence evolution considered, tree space was searched under the AIC and BIC optimality criteria. The rtREV+FO+I+R9 model was optimal under the AIC and the rtREV+FO+I+R7 model was optimal under the BIC. Using IQ-TREE2, the same tree was identified under the two models.
The UFBoot2 procedure (Mol. Biol. Evol., 35:518-522) was used to assess the consistency of the phylogenetic signal.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Tests for incongruence as an indicator of among data partition conflict have played an important role in conditional data combination. When such tests reveal significant incongruence, this has been interpreted as rationale for not combining data in a single phylogenetic analysis. In this study of lorisiform phylogeny, we employ the incongruence length difference (ILD) test to assess conflict among three independent data sets. A large morphological data set and two unlinked molecular data sets, the mitochondrial cytochrome b gene and the nuclear interphotoreceptor retinoid binding protein (exon 1), are analyzed with various optimality criteria and weighting mechanisms in order to determine the phylogenetic relationships among slow lorises (Primates, Loridae). When analyzed separately, the morphological data show impressive statistical support for a monophyletic Loridae. Both molecular data sets resolve the Loridae as paraphyletic, though with different branching order depending on optimality criterion and/or character weighting employed. When the three data partitions are analyzed in various combinations, an inverse relationship between congruence and phylogenetic accuracy is observed. Nearly all combined analyses that recover monophyly indicate strong data partition incongruence (p = 0.00005, in the most extreme case) whereas all analyses that recover paraphyly indicate lack of significant incongruence. Numerous lines of evidence verify that monophyly is the accurate phylogenetic result. Therefore, this study contributes to a growing body of information that affirms that measures of incongruence should not be employed as indicators of data set combinability.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The data and R-script are part of the published article ‘Drug-Homogeneity Index in Mass-Spectrometry Imaging’. Date Submitted: 2020-02-15
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Replication data for The long tale of the long tail: The internet as counterforce to cultural homogeneity. While raw data is not available, this is a prepared stata file for statistical analysis.
https://www.nist.gov/open/licensehttps://www.nist.gov/open/license
An all-glass, dynamic recirculating still equipped with an ultrasonic homogenizer has been used in the determination of the vapour liquid equilibrium (VLE) and vapour liquid liquid equilibrium (VLLE). Consistent data for the ternary water + ethanol + toluene system are reported at 101.3 kPa at temperatures in the range of 347 357 K. These data have been compared with previously published VLLE data. The experimental data indicate that a ternary azeotrope is present at 347.60 K. It has been determined experimentally that the ternary azeotrope is homogeneous and not heterogeneous as is claimed in some of the studied references. The experimental results have been used to check the accuracy of the UNIFAC, UNIQUAC and NRTL models, proving that these models predict a bigger heterogeneous region than the experimental one.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Additional file 2. SA3 dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
"Please if you use this datasets we appreciated that you reference this repository and cite the works related that made possible the generation of this dataset." This change detection datastet has different events, satellites, resolutions and includes both homogeneous/heterogeneous cases. The main idea of the dataset is to bring a benchmark on semantic change detection in remote sensing field.This dataset is the outcome of the following publications:
@article{ JimenezSierra2022graph,author={Jimenez-Sierra, David Alejandro and Quintero-Olaya, David Alfredo and Alvear-Mu{~n}oz, Juan Carlos and Ben{\'i}tez-Restrepo, Hern{\'a}n Dar{\'i}o and Florez-Ospina, Juan Felipe and Chanussot, Jocelyn},journal={IEEE Transactions on Geoscience and Remote Sensing},title={Graph Learning Based on Signal Smoothness Representation for Homogeneous and Heterogeneous Change Detection},year={2022},volume={60},number={},pages={1-16},doi={10.1109/TGRS.2022.3168126}} @article{ JimenezSierra2020graph,title={Graph-Based Data Fusion Applied to: Change Detection and Biomass Estimation in Rice Crops},author={Jimenez-Sierra, David Alejandro and Ben{\'i}tez-Restrepo, Hern{\'a}n Dar{\'i}o and Vargas-Cardona, Hern{\'a}n Dar{\'i}o and Chanussot, Jocelyn},journal={Remote Sensing},volume={12},number={17},pages={2683},year={2020},publisher={Multidisciplinary Digital Publishing Institute},doi={10.3390/rs12172683}} @inproceedings{jimenez2021blue,title={Blue noise sampling and Nystrom extension for graph based change detection},author={Jimenez-Sierra, David Alejandro and Ben{\'\i}tez-Restrepo, Hern{\'a}n Dar{\'\i}o and Arce, Gonzalo R and Florez-Ospina, Juan F},booktitle={2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS},ages={2895--2898},year={2021},organization={IEEE},doi={10.1109/IGARSS47720.2021.9555107}} @article{florez2023exploiting,title={Exploiting variational inequalities for generalized change detection on graphs},author={Florez-Ospina, Juan F and Jimenez Sierra, David A and Benitez-Restrepo, Hernan D and Arce, Gonzalo},journal={IEEE Transactions on Geoscience and Remote Sensing}, year={2023},volume={61},number={},pages={1-16},doi={10.1109/TGRS.2023.3322377}} @article{florez2023exploitingxiv,title={Exploiting variational inequalities for generalized change detection on graphs},author={Florez-Ospina, Juan F. and Jimenez-Sierra, David A. and Benitez-Restrepo, Hernan D. and Arce, Gonzalo R},year={2023},publisher={TechRxiv},doi={10.36227/techrxiv.23295866.v1}} In the table on the html file (dataset_table.html) are tabulated all the metadata and details related to each case within the dasetet. The cases with a link, were gathered from those sources and authors, therefore you should refer to their work as well. The rest of the cases or events (without a link), were obtained through the use of open sources such as:
Copernicus European Space Agency Alaska Satellite Facility (Vertex) Earth Data In addition, we carried out all the processing of the images by using the SNAP toolbox from the European Space Agency. This proccessing involves the following:
Data co-registration Cropping Apply Orbit (for SAR data) Calibration (for SAR data) Speckle Filter (for SAR data) Terrain Correction (for SAR data) Lastly, the ground truth was obtained from homogeneous images for pre/post events by drawing polygons to highlight the areas where a visible change was present. The images where layout and synchorized to be zoomed over the same are to have a better view of changes. This was an exhaustive work in order to be precise as possible.Feel free to improve and contribute to this dataset.
The concept of homogenous response units (HRU) was designed as a general concept for the delineation of basic spatial units. Only those characteristics of landscape, which are relatively stable over time (even under climate change) and largely unsusceptible to anthropogenic influence, were selected. The HRU can be seen as a basic spatial framework for the implementation of climate change and land management alternative scenarios into global modeling and therefore is a basic input for delineation of landscape units. HRUs are defined based on classifications of altitude (five classes: 1 (0 - 300m), 2 (300 - 600m), 3 (600 - 1100m), 4 (1100 - 2500m), 5 (> 2500m)), slope (seven classes(degrees): 1 (0 - 3), 2 (3 - 6), 3 (6 - 10), 4 (10 - 15), 5 (15 - 30), 6 (30 - 50), 7 (> 50)) and soil composition (five classes: 1 (sandy), 2 (loamy), 3 (clay), 4 (stony), 5 (peat)). e.g. HRU111 refers to Altitude class 1: 0-300m; Slope class 1: 0-3 degrees; and Soil class 1: sandy. Areas of non-soil are assigned 88. HRUs have a spatial resolution of approximately 10 km**2.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The dataset folder contains MSI data analyzed in the above paper. The main goal of the study was to use a methodological approach to link tumour heterogeneity with drug homogeneity. The MSI data files in the folder derived from two different cancer cell-lines (ovarian and colon) under two treatment conditions (with and without bevacizumab). Date Submitted: 2020-02-17
Recent phylogenetic analyses of cetacean relationships based on DNA sequence data have challenged the traditional view that baleen whales (Mysticeti) and toothed whales (Odontoceti) are each monophyletic, arguing instead that baleen whales are the sister group of the odontocete family Physeteridae (sperm whales). We reexamined this issue in light of a morphological data set composed of 207 characters and molecular data sets of published 12S, 16S, and cytochrome b mitochondrial DNA sequences. We reach four primary conclusions: (1) Our morphological data set strongly supports the traditional view of odontocete monophyly; (2) the unrooted molecular and morphological trees are very similar, and most of the conflict results from alternative rooting positions; (3) the rooting position of the molecular tree is sensitive to choice of artiodactyl outgroup taxa and the treatment of two small but ambiguously aligned regions of the 12S and 16S sequences, whereas the morphological root is strongly ...
In a widespread species, a matching of phenotypic traits to local environmental optima is generally attributed to site-specific adaptation. However, the same matching can occur via adaptive plasticity, without requiring genetic differences among populations. Adult sea kraits (Laticauda saintgironsi) are highly philopatric to small islands, but the entire population within the Neo-Caledonian lagoon is genetically homogenous because females migrate to the mainland to lay their eggs at communal sites; recruits disperse before settling, mixing up alleles. Consequently, any adaptive matching between local environments (e.g., prey sizes) and snake phenotypes (e.g., body sizes and relative jaw sizes) must be achieved via phenotypic plasticity rather than spatial heterogeneity in gene frequencies. We sampled 13 snake colonies spread along a ~200km northwest-southeast gradient (N>4,500 individuals) to measure two morphological features that affect maximum ingestible prey size in gape-limited ...
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A continuous finite-time control scheme is introduced for bilateral teleoperation systems with asymmetric timevarying delays. Specifically
The narrowing of diversity in crop species contributing to the world’s food supplies has been considered a potential threat to food security. However, changes in this diversity have not been quantified globally. We assess trends over the past 50 y in the richness, abundance, and composition of crop species in national food supplies worldwide. Over this period, national per capita food supplies expanded in total quantities of food calories, protein, fat, and weight, with increased proportions of those quantities sourcing from energy-dense foods. At the same time the number of measured crop commodities contributing to national food supplies increased, the relative contribution of these commodities within these supplies became more even, and the dominance of the most significant commodities decreased. As a consequence, national food supplies worldwide became more similar in composition, correlated particularly with an increased supply of a number of globally important cereal and oil crops, and a decline of other cereal, oil, and starchy root species. The increase in homogeneity worldwide portends the establishment of a global standard food supply, which is relatively species-rich in regard to measured crops at the national level, but species-poor globally. These changes in food supplies heighten interdependence among countries in regard to availability and access to these food sources and the genetic resources supporting their production, and give further urgency to nutrition development priorities aimed at bolstering food security. This dataset is based on FAO national per capita data. For information on the value addition of the data please see the Materials and Methods section of the publication.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This paper studies the estimation of a panel data model with latent structures where individuals can be classified into different groups with the slope parameters being homogeneous within the same group but heterogeneous across groups. To identify the unknown group structure of vector parameters, we design an algorithm called Panel-CARDS. We show that it can identify the true group structure asymptotically and estimate the model parameters consistently at the same time. Simulations evaluate the performance and corroborate the asymptotic theory in several practical design settings. The empirical application reveals the heterogeneous grouping effect of income on democracy.