Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundPrevious studies have emphasized the association between the intake of artificial sweeteners (AS) and type 2 diabetes mellitus (T2DM), but the causative relationship remains ambiguous.MethodsThis study employed univariate Mendelian randomization (MR) analysis to assess the causal link between AS intake from various sources and T2DM. Linkage disequilibrium score (LDSC) regression was used to evaluate the correlation between phenotypes. Multivariate and mediation MR were applied to investigate confounding factors and mediating effects. Data on AS intake from different sources (N = 64,949) were sourced from the UK Biobank, while T2DM data were derived from the DIAbetes Genetics Replication And Meta-analysis.The primary method adopted was inverse variance weighted (IVW), complemented by three validation techniques. Additionally, a series of sensitivity analyses were performed to evaluate pleiotropy and heterogeneity.ResultsLDSC analysis unveiled a significant genetic correlation between AS intake from different sources and T2DM (rg range: -0.006 to 0.15, all P < 0.05). After correction by the false discovery rate (FDR), the primary IVW method indicated that AS intake in coffee was a risk factor for T2DM (OR = 1.265, 95% CI: 1.035–1.545, P = 0.021, PFDR = 0.042). Further multivariable and mediation MR analyses pinpointed high density lipoprotein-cholesterol (HDL-C) as mediating a portion of this causal relationship. In reverse MR analysis, significant evidence suggested a positive correlation between T2DM and AS intake in coffee (β = 0.013, 95% CI: 0.004–0.022, P = 0.004, PFDR = 0.012), cereal (β = 0.007, 95% CI: 0.002–0.012, P = 0.004, PFDR = 0.012), and tea (β = 0.009, 95% CI: 0.001–0.017, P = 0.036, PFDR = 0.049). No other causal associations were identified (P > 0.05, PFDR > 0.05).ConclusionThe MR analysis has established a causal relationship between AS intake in coffee and T2DM. The mediation by HDL-C emphasizes potential metabolic pathways underpinning these relationships
Facebook
TwitterBackgroundPrior observational research identified dyslipidemia as a risk factor for endometriosis (EMS) but the causal relationship remains unestablished due to inherent study limitations.MethodsGenome-wide association study data for high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), triglycerides (TG), and total cholesterol (TC) from European (EUR) and East Asian (EAS) ancestries were sourced from the Global Lipids Genetics Consortium. Multi-ancestry EMS data came from various datasets. Univariable Mendelian randomization (MR) examined causal links between serum lipids and EMS. Multivariable and mediation MR explored the influence of seven confounding factors and mediators. Drug-target MR investigates the association between lipid-lowering target genes identified in positive results and EMS. The primary method was inverse-variance weighted (IVW), with replication datasets and meta-analyses reinforcing causal associations. Sensitivity analyses included false discovery rate (FDR) correction, causal analysis using summary effect estimates (CAUSE), and colocalization analysis.ResultsIVW analysis in EUR ancestry showed a significant causal association between TG and increased EMS risk (OR = 1.112, 95% CI 1.033–1.198, P = 5.03×10−3, PFDR = 0.03), supported by replication and meta-analyses. CAUSE analysis confirmed unbiased results (P < 0.05). Multivariable and mediation MR revealed that systolic blood pressure (Mediation effect: 7.52%, P = 0.02) and total testosterone (Mediation effect: 10.79%, P = 0.01) partly mediated this relationship. No causal links were found between other lipid traits and EMS (P > 0.05 & PFDR > 0.05). In EAS ancestry, no causal relationships with EMS were detected (P > 0.05 & PFDR > 0.05). Drug-target MR indicated suggestive evidence for the influence of ANGPTL3 on EMS mediated through TG (OR = 0.798, 95% CI 0.670–0.951, P = 0.01, PFDR = 0.04, PP.H4 = 0.85%).ConclusionsThis MR study in EUR ancestry indicated an increased EMS risk with higher serum TG levels.
Facebook
TwitterTitle: SECOM Data Set
Abstract: Data from a semi-conductor manufacturing process
Data Set Characteristics: Multivariate Number of Instances: 1567 Area: Computer Attribute Characteristics: Real Number of Attributes: 591 Date Donated: 2008-11-19 Associated Tasks: Classification, Causal-Discovery Missing Values? Yes
Source:
Authors: Michael McCann, Adrian Johnston
Data Set Information:
A complex modern semi-conductor manufacturing process is normally under consistent surveillance via the monitoring of signals/variables collected from sensors and or process measurement points. However, not all of these signals are equally valuable in a specific monitoring system. The measured signals contain a combination of useful information, irrelevant information as well as noise. It is often the case that useful information is buried in the latter two. Engineers typically have a much larger number of signals than are actually required. If we consider each type of signal as a feature, then feature selection may be applied to identify the most relevant signals. The Process Engineers may then use these signals to determine key factors contributing to yield excursions downstream in the process. This will enable an increase in process throughput, decreased time to learning and reduce the per unit production costs.
To enhance current business improvement techniques the application of feature selection as an intelligent systems technique is being investigated.
The dataset presented in this case represents a selection of such features where each example represents a single production entity with associated measured features and the labels represent a simple pass/fail yield for in house line testing, figure 2, and associated date time stamp. Where .1 corresponds to a pass and 1 corresponds to a fail and the data time stamp is for that specific test point.
Using feature selection techniques it is desired to rank features according to their impact on the overall yield for the product, causal relationships may also be considered with a view to identifying the key features.
Results may be submitted in terms of feature relevance for predictability using error rates as our evaluation metrics. It is suggested that cross validation be applied to generate these results. Some baseline results are shown below for basic feature selection techniques using a simple kernel ridge classifier and 10 fold cross validation.
Baseline Results: Pre-processing objects were applied to the dataset simply to standardize the data and remove the constant features and then a number of different feature selection objects selecting 40 highest ranked features were applied with a simple classifier to achieve some initial results. 10 fold cross validation was used and the balanced error rate (*BER) generated as our initial performance metric to help investigate this dataset.
SECOM Dataset: 1567 examples 591 features, 104 fails
FSmethod (40 features) BER % True + % True - % S2N (signal to noise) 34.5 +-2.6 57.8 +-5.3 73.1 +2.1 Ttest 33.7 +-2.1 59.6 +-4.7 73.0 +-1.8 Relief 40.1 +-2.8 48.3 +-5.9 71.6 +-3.2 Pearson 34.1 +-2.0 57.4 +-4.3 74.4 +-4.9 Ftest 33.5 +-2.2 59.1 +-4.8 73.8 +-1.8 Gram Schmidt 35.6 +-2.4 51.2 +-11.8 77.5 +-2.3
Attribute Information:
Key facts: Data Structure: The data consists of 2 files the dataset file SECOM consisting of 1567 examples each with 591 features a 1567 x 591 matrix and a labels file containing the classifications and date time stamp for each example.
As with any real life data situations this data contains null values varying in intensity depending on the individuals features. This needs to be taken into consideration when investigating the data either through pre-processing or within the technique applied.
The data is represented in a raw text file each line representing an individual example and the features seperated by spaces. The null values are represented by the 'NaN' value as per MatLab.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundPrevious studies have emphasized the association between the intake of artificial sweeteners (AS) and type 2 diabetes mellitus (T2DM), but the causative relationship remains ambiguous.MethodsThis study employed univariate Mendelian randomization (MR) analysis to assess the causal link between AS intake from various sources and T2DM. Linkage disequilibrium score (LDSC) regression was used to evaluate the correlation between phenotypes. Multivariate and mediation MR were applied to investigate confounding factors and mediating effects. Data on AS intake from different sources (N = 64,949) were sourced from the UK Biobank, while T2DM data were derived from the DIAbetes Genetics Replication And Meta-analysis.The primary method adopted was inverse variance weighted (IVW), complemented by three validation techniques. Additionally, a series of sensitivity analyses were performed to evaluate pleiotropy and heterogeneity.ResultsLDSC analysis unveiled a significant genetic correlation between AS intake from different sources and T2DM (rg range: -0.006 to 0.15, all P < 0.05). After correction by the false discovery rate (FDR), the primary IVW method indicated that AS intake in coffee was a risk factor for T2DM (OR = 1.265, 95% CI: 1.035–1.545, P = 0.021, PFDR = 0.042). Further multivariable and mediation MR analyses pinpointed high density lipoprotein-cholesterol (HDL-C) as mediating a portion of this causal relationship. In reverse MR analysis, significant evidence suggested a positive correlation between T2DM and AS intake in coffee (β = 0.013, 95% CI: 0.004–0.022, P = 0.004, PFDR = 0.012), cereal (β = 0.007, 95% CI: 0.002–0.012, P = 0.004, PFDR = 0.012), and tea (β = 0.009, 95% CI: 0.001–0.017, P = 0.036, PFDR = 0.049). No other causal associations were identified (P > 0.05, PFDR > 0.05).ConclusionThe MR analysis has established a causal relationship between AS intake in coffee and T2DM. The mediation by HDL-C emphasizes potential metabolic pathways underpinning these relationships