https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This is a cleaned version of a Netflix movies dataset originally used for exploratory data analysis (EDA). The dataset contains information such as:
Missing values have been handled using appropriate methods (mean, median, unknown), and new features like rating_level
and popular
have been added for deeper analysis.
The dataset is ready for: - EDA - Data visualization - Machine learning tasks - Dashboard building
Used in the accompanying notebook
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
This measure determines the perceived value added by the University Centers (UCs) to their clients. EDA funds UCs to provide technical assistance and specialized services (for example, feasibility studies, marketing research, economic analysis, environmental services, and technology transfer) to local officials and communities. This assistance improves the community’s capacity to plan and manage successful development projects. UCs develop client profiles and report findings to EDA, which evaluates the performance of each center once every 3 years and verifies the data. “Taking action as a result of the assistance facilitated” means to implement an aspect of the technical assistance provided by the UC in one of several areas: economic development initiatives and training session development; linkages to crucial resources; economic development planning; project management; community investment package development; geographic information system services; strategic partnering to public or private sector entities; increased organizational capacity; feasibility plans; marketing studies; technology transfer; new company, product, or patent development; and other services.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Unsupervised exploratory data analysis (EDA) is often the first step in understanding complex data sets. While summary statistics are among the most efficient and convenient tools for exploring and describing sets of data, they are often overlooked in EDA. In this paper, we show multiple case studies that compare the performance, including clustering, of a series of summary statistics in EDA. The summary statistics considered here are pattern recognition entropy (PRE), the mean, standard deviation (STD), 1-norm, range, sum of squares (SSQ), and X4, which are compared with principal component analysis (PCA), multivariate curve resolution (MCR), and/or cluster analysis. PRE and the other summary statistics are direct methods for analyzing datathey are not factor-based approaches. To quantify the performance of summary statistics, we use the concept of the “critical pair,” which is employed in chromatography. The data analyzed here come from different analytical methods. Hyperspectral images, including one of a biological material, are also analyzed. In general, PRE outperforms the other summary statistics, especially in image analysis, although a suite of summary statistics is useful in exploring complex data sets. While PRE results were generally comparable to those from PCA and MCR, PRE is easier to apply. For example, there is no need to determine the number of factors that describe a data set. Finally, we introduce the concept of divided spectrum-PRE (DS-PRE) as a new EDA method. DS-PRE increases the discrimination power of PRE. We also show that DS-PRE can be used to provide the inputs for the k-nearest neighbor (kNN) algorithm. We recommend PRE and DS-PRE as rapid new tools for unsupervised EDA.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The current dataset contributes to assess the accuracy of the Empatica 4 (E4) wristband for the detection of heart rate variability (HRV) and electrodermal activity (EDA) metrics in stress-inducing conditions and growing-risk driving scenarios. Heart Rate Variability (HRV) and ElectroDermal Activity (EDA) signals were recorded over six experimental conditions (i.e., Baseline, Video Clip, Scream, No Risk Driving, Low-Risk Driving, and High-Risk Driving) and by means of two measurement systems: the E4 device and a gold standard system. The raw quality of the physiological signals was enhanced by means of robust semi-automatic reconstruction algorithms. Heart Rate Variability time-domain parameters showed high accuracy in motion-free experimental conditions, while Heart Rate Variability frequency-domain parameters reported sufficient accuracy in almost every experimental condition.
https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy
The automotive EDA (Electronic Design Automation) market is experiencing robust growth, driven by the increasing complexity of electric vehicles (EVs), autonomous driving systems, and advanced driver-assistance systems (ADAS). The shift towards software-defined vehicles necessitates sophisticated simulation and design tools to ensure safety, performance, and reliability. This is fueling demand for cloud-based EDA solutions, offering scalability and collaborative design capabilities. While on-premise solutions remain relevant for specific needs, the cloud's flexibility and cost-effectiveness are driving a significant market shift. OEMs are major drivers of market growth, investing heavily in developing next-generation vehicles. However, the 4S shops and other service providers are also seeing increased adoption for maintenance and repair purposes and aftermarket modifications. The market is segmented by application (OEMs, 4S shops, others) and type (cloud-based, on-premise). Leading players like ANSYS, Altair Engineering, and Dassault Systèmes are investing in R&D and strategic acquisitions to expand their market share and capabilities. Geographic growth is widespread, with North America and Europe currently leading, followed by the rapidly expanding Asia-Pacific region due to increasing vehicle production and technological advancements in countries like China and India. Challenges include the high cost of EDA software and the need for skilled professionals to operate these complex tools. However, the long-term growth outlook remains very positive, fueled by ongoing technological advancements and increasing vehicle electrification and autonomy. The forecast period (2025-2033) suggests a sustained high CAGR, reflecting the continued integration of electronics and software in vehicles. Assuming a conservative CAGR of 15% and a 2025 market size of $5 billion, the market is projected to reach approximately $17 Billion by 2033. This growth will be driven by factors such as increased adoption of EV and autonomous driving technologies, stricter regulatory compliance requirements necessitating extensive simulation and validation, and growing demand for high-performance computing resources for complex simulations. The competitive landscape is characterized by both established players and emerging innovative companies. Successful players will be those that can adapt to the evolving technological landscape and offer flexible, scalable, and user-friendly solutions.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Protein-Protein, Genetic, and Chemical Interactions for EDA (Homo sapiens) curated by BioGRID (https://thebiogrid.org); DEFINITION: ectodysplasin A
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset information -machine-operator_usage.csv This file contains the data for the machine-operator usage for July, October, and November for three grain cart tractors. This is the data for Figure 3. The table includes the following variables for machine-powered days: day, machine ID, time spent driving at slow speed, time at medium speed, time at fast speed, time machine was powered before harvesting, time machine was powered after harvesting, time stationary during harvesting, total time machine was powered, total distance travelled Dataset information -stress_combine.csv -stress_grain_cart.csv These files contain the stress rate and other unloading data for each on-the-go unload event (the ones that qualified for stress analysis). This the data for Figures 4–9. These tables include the following variables for unload events: unload ID, state, crop, unload event start time, unload event end time, automation type, duration auger on, automation time percentage, number of stressful events, number of invalid wristband data points, mean tractor speed, mean combine speed, stress rate, combine operator experience, stress rate manual mean, subject-normalized stress rate, combine operator ID, grain cart operator experience, grain cart operator ID Dataset information -XTEt_XTEc.csv This file contains cross-track error (XTE) standard deviation (SD) data from each on-the-go unload event that was used to evaluate steering performance. This is the data for Figure 10. The table includes the following variables for unload events: unload ID, state, crop, unload event start time, unload event end time, automation type, duration auger on, automation time percentage, mean tractor speed, mean combine speed, XTE SD combine, XTE SD tractor
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global electronic design automation (EDA) for semiconductor chips market size was USD 15.95 Billion in 2023 and is projected to reach USD 37.69 Billion by 2032, expanding at a CAGR of 10% during 2024–2032. The market growth is attributed to the use of these chips in modern technology.
Increasingly, Electronic Design Automation (EDA) for semiconductor chips is becoming a cornerstone of modern technology. EDA, a category of software tools for designing electronic systems such as printed circuit boards and integrated circuits, plays a pivotal role in the creation of complex electronic systems. The sophistication of these tools allows for the design and production of semiconductor chips with unparalleled efficiency and precision, making them indispensable in an era defined by rapid technological advancement.
Rising regulatory scrutiny is leading to the introduction of new rules and regulations by governing bodies such as the International Electrotechnical Commission (IEC) and the Institute of Electrical and Electronics Engineers (IEEE). These regulations, aimed at ensuring the safety, reliability, and environmental sustainability of EDA products and services, are reshaping the market. The impact of these regulations is likely to be profound, driving innovation and fostering a greater emphasis on compliance within the industry.
Artificial Intelligence (AI) has a considerable impact on the electronic design automation (EDA) for semiconductor chips market. AI accelerates the development process, reducing time-to-market and enhancing competitive advantage by automating intricate design tasks. It enables the creation of complex and powerful chips, driving innovation in a myriad of industries.
AI's predictive analytics capabilities facilitate proactive error detection, minimizing costly design flaws and improving product reliability. However, this technological shift necessitates a profound transformation in skill sets and processes, requiring significant investment in training and infrastructure. <
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The Breast Cancer Dataset hosted on Kaggle is a powerful resource for researchers, data scientists, and machine learning enthusiasts looking to explore and develop predictive models for breast cancer diagnosis. This dataset, accessible via Kaggle, is designed for binary classification tasks to predict whether a breast tumor is benign or malignant. It provides a rich collection of features derived from digitized images of fine needle aspirates (FNA) of breast masses, making it an essential tool for advancing healthcare analytics and computational pathology. Below is a comprehensive, human-crafted description of the dataset, complete with examples and key highlights to make it engaging and informative.
The dataset originates from the Breast Cancer Wisconsin (Diagnostic) Data Set, a widely used benchmark in machine learning for medical diagnostics. It contains detailed measurements of cell nuclei from breast tissue samples, enabling the classification of tumors as either benign (non-cancerous) or malignant (cancerous). This dataset is particularly valuable for developing and testing machine learning models, such as logistic regression, support vector machines, or deep neural networks, to aid in early and accurate breast cancer detection.
Purpose: Binary classification to predict tumor type (benign or malignant). Source: University of Wisconsin, provided through Kaggle. Link: Breast Cancer Dataset on Kaggle. Application: Ideal for medical research, machine learning model development, and educational purposes.
##### Dataset Structure The dataset comprises 569 instances (rows) and 32 columns, including an ID column, a diagnosis label, and 30 numerical features describing cell nuclei characteristics. Each instance represents a single breast mass sample, with features computed from digitized FNA images. Key Columns:
ID: A unique identifier for each sample (e.g., 842302). Diagnosis: The target variable, labeled as: M (Malignant): Indicates a cancerous tumor. B (Benign): Indicates a non-cancerous tumor.
Features (30 columns): Numerical measurements of cell nuclei, such as radius, texture, perimeter, and area, derived from image analysis.
The 30 features are grouped into three main categories based on the characteristics of cell nuclei:
Mean: Average values of measurements (e.g., mean radius, mean texture). Standard Error (SE): Variability of measurements (e.g., standard error of radius, standard error of area). Worst: Largest (worst) values of measurements (e.g., worst radius, worst smoothness).
Each category includes 10 specific measurements:
Example Data Point: Here’s a simplified example of a single row in the dataset:
ID Diagnosis Radius_mean Texture_mean Perimeter_mean Area_mean Smoothness_mean ...
842302 M 17.99 10.38 122.80 1001.0 0.11840 ...
Interpretation: This sample (ID 842302) is malignant (M), with a mean radius of 17.99 units, a mean texture of 10.38, and so on. The remaining 27 columns provide additional measurements (e.g., standard error and worst values).
Key Highlights
Balanced Classes: The dataset includes 357 benign and 212 malignant cases, offering a relatively balanced distribution for training robust models. No Missing Values: The dataset is clean and preprocessed, with no missing or null values, making it ready for immediate analysis. High Dimensionality: With 30 numerical features, the dataset supports complex modeling techniques, including feature selection and dimensionality reduction. Real-World Impact: The dataset is widely used in research to improve diagnostic accuracy, contributing to early breast cancer detection and better patient outcomes. Open Access: Freely available on Kaggle, encouraging collaboration and innovation in the data science community.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset contains data collected from patients suffering from cancer-related pain. The features extracted from clinical data (including typical cancer phenomena such as breakthrough pain) and the biosignal acquisitions contributed to the definition of a multidimensional dataset. This unique database can be useful for the characterization of the patient’s pain experience from a qualitative and quantitative perspective. We implemented measurable biosignals-related indicators of the individual’s pain response and of the overall Autonomic Nervous System (ANS) functioning. The most peculiar features extracted from EDA and ECG signals can be adopted to investigate the status and complex functioning of the ANS through the study of sympatho-vagal activations. Specifically, while EDA is mainly related sympathetic activation, the Heart Rate Variability (HRV), which can be derived from ECG recordings, is strictly related to the interplay between sympathetic and parasympathetic functioning.
As far as the EDA signal, two types of analyzes have been performed: (i) the Trough-To-Peak analysis (TTP), or min-max analysis, aimed at measuring the difference between the Skin Conductance (SC) at the peak of a response and its previous minimum within pre-established time-windows; (ii) the Continuous Decomposition Analysis (CDA), aimed at performing a decomposition of SC data into continuous signals of tonic (basic level of conductance) and phasic (short-duration changes in the SC) activity. Before applying the TPP analysis or the CDA, the signal was filtered by means of a fifth-order Butterworth low-pass filter with a cutoff frequency of 1 Hz and downsampled up to 10 Hz to reducing the computational burden of the analysis. The application of TPP and CDA allowed the detection and measurement of SC Responses (SCR) and the following parameters have been calculated for both TPP and CDA methodologies:
Concerning the ECG, the RR series of interbeat intervals (i.e., the time between successive R waves of the QRS complex on the ECG waveform) has been computed to extract time-domain parameters of the HRV. The R peak detection was carried out by adopting the Pan–Tompkins algorithm for QRS detection and R peak identification. The corresponding RR series of interbeat intervals were derived as the difference between successive R peaks.
The ECG-derived RR time series was then filtered by means of a recursive procedure to remove the intervals differing most from the mean of the surrounding RR intervals. Then, both the Time-Domain Analysis (TDA) and Frequency-Domain Analysis (FDA) of the HRV have been carried out to extract the main features characterizing the variability of the heart rhythm. Time-domain parameters are obtained from statistical analysis of the intervals between heart beats and are used to describe how much variability in the heartbeats is present at various time scales.
The parameters computed through the TDA include the following:
Frequency-domain parameters reflect the distribution of spectral power across different frequencies bands and are used to assess specific components of HRV (e.g., thermoregulation control loop, baroreflex control loop, and respiration control loop, which are regulated by both sympathetic and vagal nerves of the ANS).
The parameters computed through the FDA have been computed by adopting the Welch's Fourier periodogram method based on the Discrete Fourier Transform (DFT), which allows the expression of the RR series in the discrete frequency domain. However, due to the non-stationarity of the RR series, Welch Fourier periodogram method is used for dealing with non-stationarity. Specifically, Welch's periodogram divides the signal into specific periods of constant length appliying the Fast Fourier Transform (FFT) trasforming individually these parts of the signal. The periodogram is basically a way of estimating power spectral density of a time series.
The FDA parameters include the following:
https://www.kenresearch.com/terms-and-conditionshttps://www.kenresearch.com/terms-and-conditions
The APAC EDA Electronic Design Automation Market size is USD 7.89 billion in 2023, explores compliance trends, sourcing strategies, and innovation pipeline to define go-to-market priorities.
This data set contains booking information for a city hotel and a resort hotel and includes information such as when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of available parking spaces, among other things.
All personally identifying information has been removed from the data.
Variable | Type | Description | Source/Engineering |
---|---|---|---|
ADR | Numeric | Average Daily Rate as defined by [5] | BO, BL and TR / Calculated by dividing the sum of all lodging transactions by the total number of staying nights |
Adults | Integer | Number of adults | BO and BL |
Agent | Categorical | ID of the travel agency that made the booking | BO and BL |
ArrivalDateDayOfMonth | Integer | Day of the month of the arrival date | BO and BL |
ArrivalDateMonth | Categorical | Month of arrival date with 12 categories: “January” to “December” | BO and BL |
ArrivalDateWeekNumber | Integer | Week number of the arrival date | BO and BL |
ArrivalDateYear | Integer | Year of arrival date | BO and BL |
AssignedRoomType | Categorical | Code for the type of room assigned to the booking. Sometimes the assigned room type differs from the reserved room type due to hotel operation reasons (e.g. overbooking) or by customer request. Code is presented instead of designation for anonymity reasons | BO and BL |
Babies | Integer | Number of babies | BO and BL |
BookingChanges | Integer | Number of changes/amendments made to the booking from the moment the booking was entered on the PMS until the moment of check-in or cancellation | BO and BL/Calculated by adding the number of unique iterations that change some of the booking attributes, namely: persons, arrival date, nights, reserved room type or meal |
Children | Integer | Number of children | BO and BL/Sum of both payable and non-payable children |
Company | Categorical | ID of the company/entity that made the booking or responsible for paying the booking. ID is presented instead of designation for anonymity reasons | BO and BL. |
Country | Categorical | Country of origin. Categories are represented in the ISO 3155–3:2013 format [6] | BO, BL and NT |
CustomerType | Categorical | Type of booking, assuming one of four categories: | BO and BL |
Contract - when the booking has an allotment or other type of contract associated to it; | |||
Group – when the booking is associated to a group; | |||
Transient – when the booking is not part of a group or contract, and is not associated to other transient booking; | |||
Transient-party – when the booking is transient, but is associated to at least other transient booking | |||
DaysInWaitingList | Integer | Number of days the booking was in the waiting list before it was confirmed to the customer | BO/Calculated by subtracting the date the booking was confirmed to the customer from the date the booking entered on the PMS |
DepositType | Categorical | Indication on if the customer made a deposit to guarantee the booking. This variable can assume three categories: | BO and TR/Value calculated based on the payments identified for the booking in the transaction (TR) table before the booking׳s arrival or cancellation date. |
No Deposit – no deposit was made; | |||
In case no payments were found the value is “No Deposit”. | |||
If the payment was equal or exceeded the total cost of stay, the value is set as “Non Refund”. | |||
Non Refund – a deposit was made in the value of the total stay cost; | |||
Otherwise the value is set as “Refundable” | |||
Refundable – a deposit was made wi... |
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Protein-Protein, Genetic, and Chemical Interactions for EDA (Escherichia coli (K12/W3110)) curated by BioGRID (https://thebiogrid.org); DEFINITION: multifunctional 2-keto-3-deoxygluconate 6-phosphate aldolase, 2-keto-4-hydroxyglutarate aldolase, and oxaloacetate decarboxylase
The process of radical decarboxylation is crucial in organic synthesis. Nevertheless, decarboxylation of dienoic acids presents a greater challenge compared to that of aliphatic carboxylic acids. Herein, catalyst- and additive-free visible-light-promoted [4 + 3] annulation of lactones and diamines was achieved via radical decarboxylation of dienoic acids. By means of this novel EDA-activated [4 + 3] annulation, the 1,5-benzodiazepines, which display a wide range of biological activities and are widely used in many fields, can be directly accessed in high yields under mild conditions. This visible-light-induced radical decarboxylation [4 + 3] annulation tolerates a broad array of functional groups and intricate molecules, including pharmaceutical-relevant compounds and natural products.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Deficiency of ectodysplasin A1 (EDA1) due to variants of the gene EDA causes X-linked hypohidrotic ectodermal dysplasia (XLHED), a rare genetic condition characterized by abnormal development of ectodermal structures. XLHED is defined by the triad of hypotrichosis, hypo- or anhidrosis, and hypo- or anodontia. Anhidrosis may lead to life-threatening hyperthermia. A definite genetic diagnosis is, thus, important for the patients’ management and amenability to a novel prenatal treatment option. Here, we describe five familial EDA variants segregating with the disease in three families, for which different prediction tools yielded discordant results with respect to their significance. Functional properties in vitro and levels of circulating serum EDA were compared with phenotypic data on skin, hair, eyes, teeth, and sweat glands. EDA1-Gly176Val, although associated with relevant hypohidrosis, still bound to the EDA receptor (EDAR). Subjects with EDA1-Pro389LeufsX27, -Ter392GlnfsX30, -Ser125Cys, and an EDA1 splice variant (c.924+7A > G) showed complete absence of pilocarpine-induced sweating. EDA1-Pro389LeufsX27 was incapable of binding to EDAR and undetectable in serum. EDA1-Ter392GlnfsX30, produced in much lower amounts than wild-type EDA1, could still bind to EDAR, and so did EDA1-Ser125Cys that was, however, undetectable in serum. The EDA splice variant c.924+7A > G resulted experimentally in a mix of wild-type EDA1 and EDA molecules truncated in the middle of the receptor-binding domain, with reduced EDA serum concentration. Thus, in vitro assays reflected the clinical phenotype in two of these difficult cases, but underestimated it in three others. Absence of circulating EDA seems to predict the full-blown phenotype of XLHED, while residual EDA levels may also be found in anhidrotic patients. This indicates that unborn subjects carrying variants of uncertain significance could benefit from an upcoming prenatal medical treatment even if circulating EDA levels or tests in vitro suggest residual EDA1 activity.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Welcome to the Retail Sales and Customer Demographics Dataset! This synthetic dataset has been meticulously crafted to simulate a dynamic retail environment, providing an ideal playground for those eager to sharpen their data analysis skills through exploratory data analysis (EDA). With a focus on retail sales and customer characteristics, this dataset invites you to unravel intricate patterns, draw insights, and gain a deeper understanding of customer behavior.
****Dataset Overview:**
This dataset is a snapshot of a fictional retail landscape, capturing essential attributes that drive retail operations and customer interactions. It includes key details such as Transaction ID, Date, Customer ID, Gender, Age, Product Category, Quantity, Price per Unit, and Total Amount. These attributes enable a multifaceted exploration of sales trends, demographic influences, and purchasing behaviors.
Why Explore This Dataset?
Questions to Explore:
Your EDA Journey:
Prepare to immerse yourself in a world of data-driven exploration. Through data visualization, statistical analysis, and correlation examination, you'll uncover the nuances that define retail operations and customer dynamics. EDA isn't just about numbers—it's about storytelling with data and extracting meaningful insights that can influence strategic decisions.
Embrace the Retail Sales and Customer Demographics Dataset as your canvas for discovery. As you traverse the landscape of this synthetic retail environment, you'll refine your analytical skills, pose intriguing questions, and contribute to the ever-evolving narrative of the retail industry. Happy exploring!
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
CONTEXT: Ectodysplasin A (EDA) was recently identified as a liver-secreted protein that is increased in the liver and plasma of obese mice and causes skeletal muscle insulin resistance.OBJECTIVE: To determine if liver and plasma EDA is associated with worsening non-alcoholic fatty liver disease (NAFLD) in obese patients and to evaluate plasma EDA as a biomarker for NAFLD.DESIGN AND SETTING: Cross-sectional study in a public hospital.PATIENTS, INTERVENTIONS AND MAIN OUTCOME MEASURES: Patients with a body mass index >30 kg/m2 (n=152) underwent liver biopsy for histopathology assessment and fasting liver EDA mRNA. Fasting plasma EDA levels were also assessed. Non-alcoholic fatty liver (NAFL) was defined as >5% hepatic steatosis and nonalcoholic steatohepatitis (NASH) as NAFLD activity score ≥3.RESULTS: Patients were divided into three groups: No NAFLD (n=45); NAFL (n=65); and NASH (n=42). Liver EDA mRNA was increased in patients with NASH compared with No NAFLD (P=0.05), but not NAFL. Plasma EDA levels were increased in NAFL and NASH compared with No NAFLD (P=0.03). Plasma EDA was related to worsening steatosis (P=0.02) and fibrosis (P=0.04), but not inflammation or hepatocellular ballooning. ROC analysis indicates that plasma EDA is not a reliable biomarker for NAFL or NASH. Plasma EDA was not increased in patients with type 2 diabetes and did not correlate with insulin resistance.CONCLUSIONS: Plasma EDA is increased in NAFL and NASH, is related to worsening steatosis and fibrosis but is not a reliable biomarker for NASH. Circulating EDA is not associated with insulin resistance in human obesity.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Failure to develop complete dentition, tooth agenesis, is a common developmental anomaly manifested most often as isolated but also as associated with many developmental syndromes. It typically affects third molars or one or few other permanent teeth but severe agenesis is also relatively prevalent. Here we report mutational analyses of seven candidate genes in a cohort of 127 probands with non-syndromic tooth agenesis. 82 lacked more than five permanent teeth excluding third molars, called as oligodontia. We identified 28 mutations, 17 of which were novel. Together with our previous reports, we have identified two mutations in MSX1, AXIN2 and EDARADD, five in PAX9, four in EDA and EDAR, and nine in WNT10A. They were observed in 58 probands (44%), with a mean number of missing teeth of 11.7 (range 4 to 34). Almost all of these probands had severe agenesis. Only few of the probands but several relatives with heterozygous genotypes of WNT10A or EDAR conformed to the common type of non-syndromic tooth agenesis, incisor-premolar hypodontia. Mutations in MSX1 and PAX9 affected predominantly posterior teeth, whereas both deciduous and permanent incisors were especially sensitive to mutations in EDA and EDAR. Many mutations in EDAR, EDARADD and WNT10A were present in several families. Biallelic or heterozygous genotypes of WNT10A were observed in 32 and hemizygous or heterozygous genotypes of EDA, EDAR or EDARADD in 22 probands. An EDARADD variant were in seven probands present together with variants in EDAR or WNT10A, suggesting combined phenotypic effects of alleles in distinct genes.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
IntroductionWhen multiple individuals are presented with narrative movie or audio clips, their electrodermal activity (EDA) and heart rate show significant similarities. Higher levels of such inter-subject physiological synchrony are related with higher levels of attention toward the narrative, as for instance expressed by more correctly answered questions about the narrative. We here investigate whether physiological synchrony in EDA and heart rate during watching of movie clips predicts performance on a subsequent vigilant attention task among participants exposed to a night of total sleep deprivation.MethodsWe recorded EDA and heart rate of 54 participants during a night of total sleep deprivation. Every hour from 22:00 to 07:00 participants watched a 10-min movie clip during which we computed inter-subject physiological synchrony. Afterwards, they answered questions about the movie and performed the psychomotor vigilance task (PVT) to capture attentional performance.ResultsWe replicated findings that inter-subject correlations in EDA and heart rate predicted the number of correct answers on questions about the movie clips. Furthermore, we found that inter-subject correlations in EDA, but not in heart rate, predicted PVT performance. Individuals' mean EDA and heart rate also predicted their PVT performance. For EDA, inter-subject correlations explained more variance of PVT performance than individuals' mean EDA.DiscussionTogether, these findings confirm the association between physiological synchrony and attention. Physiological synchrony in EDA does not only capture the attentional processing during the time that it is determined, but also proves valuable for capturing more general changes in the attentional state of monitored individuals.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This is a cleaned version of a Netflix movies dataset originally used for exploratory data analysis (EDA). The dataset contains information such as:
Missing values have been handled using appropriate methods (mean, median, unknown), and new features like rating_level
and popular
have been added for deeper analysis.
The dataset is ready for: - EDA - Data visualization - Machine learning tasks - Dashboard building
Used in the accompanying notebook