72 datasets found
  1. d

    Data from: Mining microarray expression data by literature profiling

    • catalog.data.gov
    • data.virginia.gov
    • +1more
    Updated Sep 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (2025). Mining microarray expression data by literature profiling [Dataset]. https://catalog.data.gov/dataset/mining-microarray-expression-data-by-literature-profiling
    Explore at:
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    National Institutes of Health
    Description

    The lack of efficient techniques for assessing the biological implications of microarray gene-expression data remains an important obstacle in exploiting this information. To address this need, a mining technique has been developed based on the analysis of literature profiles generated by extracting the frequencies of certain terms from thousands of abstracts stored in the Medline literature database.

  2. G

    Privacy‑Preserving Data Mining Tools Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Privacy‑Preserving Data Mining Tools Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/privacypreserving-data-mining-tools-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Privacy?Preserving Data Mining Tools Market Outlook



    According to our latest research, the global Privacy?Preserving Data Mining Tools market size reached USD 1.42 billion in 2024, reflecting robust adoption across diverse industries. The market is expected to exhibit a CAGR of 22.8% during the forecast period, propelling the market to USD 10.98 billion by 2033. This remarkable growth is driven by the increasing need for secure data analytics, stringent data protection regulations, and the rising frequency of data breaches, all of which are pushing organizations to adopt advanced privacy solutions.



    One of the primary growth factors for the Privacy?Preserving Data Mining Tools market is the exponential rise in data generation and the parallel escalation of privacy concerns. As organizations collect vast amounts of sensitive information, especially in sectors like healthcare and BFSI, the risk of data exposure and misuse grows. Governments worldwide are enacting stricter data protection laws, such as the GDPR in Europe and CCPA in California, compelling enterprises to integrate privacy?preserving technologies into their analytics workflows. These regulations not only mandate compliance but also foster consumer trust, making privacy?preserving data mining tools a strategic investment for businesses aiming to maintain a competitive edge while safeguarding user data.



    Another significant driver is the rapid digital transformation across industries, which necessitates the extraction of actionable insights from large, distributed data sets without compromising privacy. Privacy?preserving techniques, such as federated learning, homomorphic encryption, and differential privacy, are gaining traction as they allow organizations to collaborate and analyze data securely. The advent of cloud computing and the proliferation of connected devices further amplify the demand for scalable and secure data mining solutions. As enterprises embrace cloud-based analytics, the need for robust privacy-preserving mechanisms becomes paramount, fueling the adoption of advanced tools that can operate seamlessly in both on-premises and cloud environments.



    Moreover, the increasing sophistication of cyber threats and the growing awareness of the potential reputational and financial damage caused by data breaches are prompting organizations to prioritize data privacy. High-profile security incidents have underscored the vulnerabilities inherent in traditional data mining approaches, accelerating the shift towards privacy-preserving alternatives. The integration of artificial intelligence and machine learning with privacy-preserving technologies is also opening new avenues for innovation, enabling more granular and context-aware data analytics. This technological convergence is expected to further catalyze market growth, as organizations seek to harness the full potential of their data assets while maintaining stringent privacy standards.



    Privacy-Preserving Analytics is becoming a cornerstone in the modern data-driven landscape, offering organizations a way to extract valuable insights while maintaining stringent data privacy standards. This approach ensures that sensitive information remains protected even as it is analyzed, allowing businesses to comply with increasing regulatory demands without sacrificing the depth and breadth of their data analysis. By leveraging Privacy-Preserving Analytics, companies can foster greater trust among their customers and stakeholders, knowing that their data is being handled with the utmost care and security. This paradigm shift is not just about compliance; it’s about redefining how organizations approach data analytics in a world where privacy concerns are paramount.



    From a regional perspective, North America currently commands the largest share of the Privacy?Preserving Data Mining Tools market, driven by the presence of leading technology vendors, high awareness levels, and a robust regulatory framework. Europe follows closely, propelled by stringent data privacy laws and increasing investments in secure analytics infrastructure. The Asia Pacific region is witnessing the fastest growth, fueled by rapid digitalization, expanding IT ecosystems, and rising cybersecurity concerns in emerging economies such as China and India. Latin America and the Middle East & Africa are also experiencing steady growth, albeit from

  3. E

    Exploratory Data Analysis (EDA) Tools Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Exploratory Data Analysis (EDA) Tools Report [Dataset]. https://www.marketreportanalytics.com/reports/exploratory-data-analysis-eda-tools-54369
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Apr 2, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Discover the booming Exploratory Data Analysis (EDA) tools market! Our in-depth analysis reveals key trends, growth drivers, and top players shaping this $3 billion industry, projected for 15% CAGR through 2033. Learn about market segmentation, regional insights, and future opportunities.

  4. E

    Exploratory Data Analysis (EDA) Tools Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Exploratory Data Analysis (EDA) Tools Report [Dataset]. https://www.marketreportanalytics.com/reports/exploratory-data-analysis-eda-tools-54164
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Apr 2, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Exploratory Data Analysis (EDA) tools market is experiencing robust growth, driven by the increasing volume and complexity of data across various industries. The market, estimated at $1.5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching approximately $5 billion by 2033. This expansion is fueled by several key factors. Firstly, the rising adoption of big data analytics and business intelligence initiatives across large enterprises and SMEs is creating a significant demand for efficient EDA tools. Secondly, the growing need for faster, more insightful data analysis to support better decision-making is driving the preference for user-friendly graphical EDA tools over traditional non-graphical methods. Furthermore, advancements in artificial intelligence and machine learning are seamlessly integrating into EDA tools, enhancing their capabilities and broadening their appeal. The market segmentation reveals a significant portion held by large enterprises, reflecting their greater resources and data handling needs. However, the SME segment is rapidly gaining traction, driven by the increasing affordability and accessibility of cloud-based EDA solutions. Geographically, North America currently dominates the market, but regions like Asia-Pacific are exhibiting high growth potential due to increasing digitalization and technological advancements. Despite this positive outlook, certain restraints remain. The high initial investment cost associated with implementing advanced EDA solutions can be a barrier for some SMEs. Additionally, the need for skilled professionals to effectively utilize these tools can create a challenge for organizations. However, the ongoing development of user-friendly interfaces and the availability of training resources are actively mitigating these limitations. The competitive landscape is characterized by a mix of established players like IBM and emerging innovative companies offering specialized solutions. Continuous innovation in areas like automated data preparation and advanced visualization techniques will further shape the future of the EDA tools market, ensuring its sustained growth trajectory.

  5. M

    Data from: Characterizing and classifying neuroendocrine neoplasms through...

    • datacatalog.mskcc.org
    • search.dataone.org
    • +1more
    Updated Sep 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nanayakkara, Jina; Yang, Xiaojing; Tyryshkin, Kathrin; Wong, Justin J.M.; Vanderbeck, Kaitlin; Ginter, Paula S.; Scognamiglio, Theresa; Chen, Yao-Tseng; Panarelli, Nicole; Cheung, Nai-Kong; Dijk, Frederike; Ben-Dov, Iddo Z.; Kim, Michelle Kang; Singh, Simron; Morozov, Pavel; Max, Klaas E. A.; Tuschl, Thomas; Renwick, Neil (2023). Characterizing and classifying neuroendocrine neoplasms through microRNA sequencing and data mining [Dataset]. http://doi.org/10.5061/dryad.fn2z34tqj
    Explore at:
    Dataset updated
    Sep 19, 2023
    Dataset provided by
    MSK Library
    Authors
    Nanayakkara, Jina; Yang, Xiaojing; Tyryshkin, Kathrin; Wong, Justin J.M.; Vanderbeck, Kaitlin; Ginter, Paula S.; Scognamiglio, Theresa; Chen, Yao-Tseng; Panarelli, Nicole; Cheung, Nai-Kong; Dijk, Frederike; Ben-Dov, Iddo Z.; Kim, Michelle Kang; Singh, Simron; Morozov, Pavel; Max, Klaas E. A.; Tuschl, Thomas; Renwick, Neil
    Description

    From Dryad entry:

    "Abstract
    Neuroendocrine neoplasms (NENs) are clinically diverse and incompletely characterized cancers that are challenging to classify. MicroRNAs (miRNAs) are small regulatory RNAs that can be used to classify cancers. Recently, a morphology-based classification framework for evaluating NENs from different anatomic sites was proposed by experts, with the requirement of improved molecular data integration. Here, we compiled 378 miRNA expression profiles to examine NEN classification through comprehensive miRNA profiling and data mining. Following data preprocessing, our final study cohort included 221 NEN and 114 non-NEN samples, representing 15 NEN pathological types and five site-matched non-NEN control groups. Unsupervised hierarchical clustering of miRNA expression profiles clearly separated NENs from non-NENs. Comparative analyses showed that miR-375 and miR-7 expression is substantially higher in NEN cases than non-NEN controls. Correlation analyses showed that NENs from diverse anatomic sites have convergent miRNA expression programs, likely reflecting morphologic and functional similarities. Using machine learning approaches, we identified 17 miRNAs to discriminate 15 NEN pathological types and subsequently constructed a multi-layer classifier, correctly identifying 217 (98%) of 221 samples and overturning one histologic diagnosis. Through our research, we have identified common and type-specific miRNA tissue markers and constructed an accurate miRNA-based classifier, advancing our understanding of NEN diversity.

    Methods
    Sequencing-based miRNA expression profiles from 378 clinical samples, comprising 239 neuroendocrine neoplasm (NEN) cases and 139 site-matched non-NEN controls, were used in this study. Expression profiles were either compiled from published studies (n=149) or generated through small RNA sequencing (n=229). Prior to sequencing, total RNA was isolated from formalin-fixed paraffin-embedded (FFPE) tissue blocks or fresh-frozen (FF) tissue samples. Small RNA cDNA libraries were sequenced on HiSeq 2500 Illumina platforms using an established small RNA sequencing (Hafner et al., 2012 Methods) and sequence annotation pipeline (Brown et al., 2013 Front Genet) to generate miRNA expression profiles. Scaling our existing approach to miRNA-based NEN classification (Panarelli et al., 2019 Endocr Relat Cancer; Ren et al., 2017 Oncotarget), we constructed and cross-validated a multi-layer classifier for discriminating NEN pathological types based on selected miRNAs.

    Usage notes
    Diagnostic histopathology and small RNA cDNA library preparation information for all samples are presented in Table S1 of the associated manuscript."

  6. Mining microarray expression data by literature profiling - 9qg9-prsr -...

    • healthdata.gov
    csv, xlsx, xml
    Updated Sep 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Mining microarray expression data by literature profiling - 9qg9-prsr - Archive Repository [Dataset]. https://healthdata.gov/d/nu38-zrn7
    Explore at:
    xlsx, csv, xmlAvailable download formats
    Dataset updated
    Sep 10, 2025
    Description

    This dataset tracks the updates made on the dataset "Mining microarray expression data by literature profiling" as a repository for previous versions of the data and metadata.

  7. d

    The Globalization of Personal Data (GPD) Project International Survey on...

    • dataone.org
    • borealisdata.ca
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Surveillance Studies Centre (2023). The Globalization of Personal Data (GPD) Project International Survey on Privacy and Surveillance [Dataset]. https://dataone.org/datasets/sha256%3A40f901a6a34637e53687870f937343ca6be1cd5b8243a3503d3f31ea29fedc5b
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Surveillance Studies Centre
    Time period covered
    Jan 1, 2006 - Jan 1, 2007
    Description

    The Globalization of Personal Data (GPD) was an international, multi-disciplinary and collaborative research initiative drawing mainly on the social sciences but also including information, computing, technology studies, and law, that explored the implications of processing personal and population data in electronic format from 2004 to 2008. Such data included everything from census statistics to surveillance camera images, from biometric passports to supermarket loyalty cards. The project ma intained a strong concern for ethics, politics and policy development around personal data. The project, funded by the Social Sciences and Humanities Research Council of Canada (SSHRCC) under its Initiative on the New Economy program, conducted research on why surveillance occurs, how it operates, and what this means for people's everyday lives (See http://www.sscqueens.org/projects/gpd). The unique aspect of the GPD included a major international survey on citizens' attitudes to issues of surveillance and privacy. The GPD project was conducted in nine countries: Canada, U.S.A., France, Spain, Hungary, Mexico, Brazil, China, and Japan. Three data files were produced: a Seven-Country file (Canada, U.S.A., France, Spain, Hungary, Mexico, and Brazil), a China file, and a Japan file. Country Report are available for download from QSpace (Queen's University Research and Learning Repository).

  8. n

    Public Expression Profiling Resource

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Sep 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Public Expression Profiling Resource [Dataset]. http://identifiers.org/RRID:SCR_007274
    Explore at:
    Dataset updated
    Sep 13, 2024
    Description

    An experiment in web-database access to large multi-dimensional data sets using a standardized experimental platform to determine if the larger scientific community can be given simple, intuitive, and user-friendly web-based access to large microarray data sets. All data in PEPR is also available via NCBI GEO. The structure and goals of PEPR differ from other mRNA expression profiling databases in a number of important ways. * The experimental platform in PEPR is standardized, and is an Affymetrix - only database. All microarrays available in the PEPR web database should ascribe to quality control and standard operating procedures. A recent publication has described the QC/SOP criteria utilized in PEPR profiles ( The Tumor Analysis Best Practices Working Group 2004 ). * PEPR permits gene-based queries of large Affymetrix array data sets without any specialized software. For example, a number of large time series projects are available within PEPR, containing 40-60 microarrays, yet these can be simply queried via a dynamic web interface with no prior knowledge of microarray data analysis. * Projects in PEPR originate from scientists world-wide, but all data has been generated by the Research Center for Genetic Medicine, Children''''s National Medical Center, Washington DC. Future developments of PEPR will allow remote entry of Affymetrix data ascribing to the same QC/SOP protocols. They have previously described an initial implementation of PEPR, and a dynamic web-queried time series graphical interface ( Chen et al. 2004 ). A publication showing the utility of PEPR for pharmacodynamic data has recently been published ( Almon et al. 2003 ).

  9. Data from: Analysis of spatiotemporal specificity of small RNAs regulating...

    • figshare.com
    xlsx
    Updated Sep 29, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lu Li (2019). Analysis of spatiotemporal specificity of small RNAs regulating hPSC differentiation and beyond [Dataset]. http://doi.org/10.6084/m9.figshare.9911918.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Sep 29, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Lu Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We present a quantitative analysis of small RNA dynamics during the transition from hPSCs to the three germ layer lineages to identify spatiotemporal-specific small RNAs that may be involved in hPSC differentiation. To determine the degree of spatiotemporal specificity, we utilized two algorithms, namely normalized maximum timepoint specificity index (NMTSI) and across-tissue specificity index (ASI). NMTSI could identify spatiotemporal-specific small RNAs that go up or down at just one timepoint in a specific lineage. ASI could identify spatiotemporal-specific small RNAs that maintain high expression from intermediate timepoints to the terminal timepoint in a specific lineage. Beyond analyzing single small RNAs, we also quantified the spatiotemporal-specificity of microRNA families and observed their differential expression patterns in certain lineages. To clarify the regulatory effects of group miRNAs on cellular events during lineage differentiation, we performed a gene ontology (GO) analysis on the downstream targets of synergistically up- and downregulated microRNAs. To provide an integrated interface for researchers to access and browse our analysis results, we designed a web-based tool at https://keyminer.pythonanywhere.com/km/.

  10. Enterprise Data Warehouse (EDW) Market Analysis, Size, and Forecast...

    • technavio.com
    pdf
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Enterprise Data Warehouse (EDW) Market Analysis, Size, and Forecast 2025-2029: North America (US and Canada), Europe (France, Germany, Italy, and UK), APAC (China, India, Japan, and South Korea), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/enterprise-data-warehouse-market-industry-analysis
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    License

    https://www.technavio.com/content/privacy-noticehttps://www.technavio.com/content/privacy-notice

    Time period covered
    2025 - 2029
    Area covered
    United States
    Description

    Snapshot img

    Enterprise Data Warehouse (EDW) Market Size 2025-2029

    The enterprise data warehouse (edw) market size is valued to increase USD 43.12 billion, at a CAGR of 28% from 2024 to 2029. Data explosion across industries will drive the enterprise data warehouse (edw) market.

    Major Market Trends & Insights

    APAC dominated the market and accounted for a 32% growth during the forecast period.
    By Product Type - Information and analytical processing segment was valued at USD 4.38 billion in 2023
    By Deployment - Cloud based segment accounted for the largest market revenue share in 2023
    

    Market Size & Forecast

    Market Opportunities: USD 857.82 million
    Market Future Opportunities: USD 43116.60 million
    CAGR : 28%
    APAC: Largest market in 2023
    

    Market Summary

    The market is a dynamic and ever-evolving landscape, characterized by continuous innovation and adaptation to industry demands. Core technologies, such as cloud computing and big data analytics, are driving the market's growth, enabling organizations to manage and analyze vast amounts of data more effectively. In terms of applications, business intelligence and data mining are leading the way, providing valuable insights for strategic decision-making. Service types, including consulting, implementation, and support, are essential components of the EDW market. According to recent reports, the consulting segment is expected to dominate the market due to the increasing demand for expert advice in implementing and optimizing EDW solutions. However, data security concerns remain a significant challenge, with regulations like GDPR and HIPAA driving the need for robust security measures. Despite these challenges, the market continues to expand, with data explosion across industries fueling the demand for EDW solutions. For instance, the healthcare sector is projected to witness a compound annual growth rate (CAGR) of 15.3% between 2021 and 2028. Furthermore, the market is witnessing a significant focus on new solution launches, with major players like Microsoft, IBM, and Oracle introducing advanced EDW offerings to meet the evolving needs of businesses.

    What will be the Size of the Enterprise Data Warehouse (EDW) Market during the forecast period?

    Get Key Insights on Market Forecast (PDF) Request Free Sample

    How is the Enterprise Data Warehouse (EDW) Market Segmented and what are the key trends of market segmentation?

    The enterprise data warehouse (edw) industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments. Product TypeInformation and analytical processingData miningDeploymentCloud basedOn-premisesSectorLarge enterprisesSMEsEnd-userBFSIHealthcare and pharmaceuticalsRetail and E-commerceTelecom and ITOthersGeographyNorth AmericaUSCanadaEuropeFranceGermanyItalyUKAPACChinaIndiaJapanSouth KoreaRest of World (ROW)

    By Product Type Insights

    The information and analytical processing segment is estimated to witness significant growth during the forecast period.

    The market is experiencing significant growth, with data replication strategies becoming increasingly sophisticated to ensure capacity planning models accommodate expanding data volumes. ETL tool selection and business intelligence platforms are crucial components, enabling query optimization strategies and disaster recovery planning. Data warehouse migration, data profiling methods, and real-time data ingestion are essential for maintaining a competitive edge. Data warehouse automation, data quality metrics, and data warehouse modernization are ongoing priorities, with data cleansing techniques and dimensional modeling techniques essential for ensuring data accuracy. Data warehousing architecture, performance monitoring tools, and high availability solutions are integral to ensuring scalability and availability. Audit trail management, data lineage tracking, and data warehouse maintenance are critical for maintaining data security and compliance. Data security protocols and data encryption methods are essential for protecting sensitive information, while data virtualization techniques and access control mechanisms facilitate self-service business intelligence tools. ETL process optimization and data governance policies are key to streamlining operations and ensuring data consistency. The IT, BFSI, education, healthcare, and retail sectors are driving market growth, with information processing and analytical processing becoming increasingly important. The construction of web-based accessing tools integrated with web browsers is a current trend, enabling users to access data warehouses easily. According to recent studies, the market for data warehousing solutions is projected to grow by 18.5%, while the adoption of cloud data warehou

  11. u

    Mining camp vertical seismic profile studies - Catalogue - Canadian Urban...

    • data.urbandatacentre.ca
    Updated Oct 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Mining camp vertical seismic profile studies - Catalogue - Canadian Urban Data Catalogue (CUDC) [Dataset]. https://data.urbandatacentre.ca/dataset/gov-canada-16f17df6-9ce7-c2e8-ba59-2c6bb0b6d071
    Explore at:
    Dataset updated
    Oct 19, 2025
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Area covered
    Canada
    Description

    Vertical seismic profiling (VSP) surveys done by the Geological Survey of Canada for research into downhole seismic imaging techniques for mineral exploration.

  12. S

    Predictive data analysis techniques for higher education students dropout

    • scidb.cn
    Updated Apr 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cindy (2023). Predictive data analysis techniques for higher education students dropout [Dataset]. http://doi.org/10.57760/sciencedb.07894
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 10, 2023
    Dataset provided by
    Science Data Bank
    Authors
    Cindy
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In this research, we have generated student retention alerts. The alerts are classified into two types: preventive and corrective. This classification varies according to the level of maturity of the data systematization process. Therefore, to systematize the data, data mining techniques have been applied. The experimental analytical method has been used, with a population of 13,715 students with 62 sociological, academic, family, personal, economic, psychological, and institutional variables, and factors such as academic follow-up and performance, financial situation, and personal information. In particular, information is collected on each of the problems or a combination of problems that could affect dropout rates. Following the methodology, the information has been generated through an abstract data model to reflect the profile of the dropout student. As advancement from previous research, this proposal will create preventive and corrective alternatives to avoid dropout higher education. Also, in contrast to previous work, we generated corrective warnings with the application of data mining techniques such as neural networks until reaching a precision of 97% and losses of 0.1052. In conclusion, this study pretends to analyze the behavior of students who drop out the university through the evaluation of predictive patterns. The overall objective is to predict the profile of student dropout, considering reasons such as admission to higher education and career changes. Consequently, using a data systematization process promotes the permanence of students in higher education. Once the profile of the dropout has been identified, student retention strategies have been approached, according to the time of its appearance and the point of view of the institution.

  13. Data from:...

    • osdr.nasa.gov
    Updated Aug 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Wang; Zhi-Hao Chen; Chun Yin; Jian-Hua Ma; Di-Jie Li; Fan Zhao; Yu-Long Sun; Li-Fang Hu; Peng Shang; Ai-Rong Qian (2024). Whole-gene-expression-data-from-osteocyte-like-cell-line-MLO-Y4-under-large-gradient-high-magnetic-field-LG-HMF [Dataset]. https://osdr.nasa.gov/bio/repo/data/studies/OSD-547
    Explore at:
    Dataset updated
    Aug 21, 2024
    Dataset provided by
    NASAhttp://nasa.gov/
    Authors
    Yang Wang; Zhi-Hao Chen; Chun Yin; Jian-Hua Ma; Di-Jie Li; Fan Zhao; Yu-Long Sun; Li-Fang Hu; Peng Shang; Ai-Rong Qian
    License

    Attribution 1.0 (CC BY 1.0)https://creativecommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    The diamagnetic levitation as a novel ground-based model for simulating a reduced gravity environment has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF), which can provide three apparent gravity levels (μ-g, 1-g, and 2-g), was used to simulate a space-like gravity environment. Osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating the mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs) and data mining were further analyzed by using bioinfomatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84) were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notablly. Our results demostrated that LG-HMF affected the expression of energy metabolism related genes in osteocyte. The identification of sensitive genes to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis.

  14. Data from: Proteomics profiling of research models for studying pancreatic...

    • data.niaid.nih.gov
    • ebi.ac.uk
    xml
    Updated Dec 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Animesh Sharma; Lars Hagen (2024). Proteomics profiling of research models for studying pancreatic ductal adenocarcinoma [Dataset]. https://data.niaid.nih.gov/resources?id=pxd057607
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Dec 18, 2024
    Dataset provided by
    Engineer at NTNU, Norway
    General Manager of PROMEC Department of Clinical and Molecular Medicine lars.hagen@ntnu.no +4773598508 +4772573342 Laboratoriesenteret, 231.04.008, Øya, Erling Skjalgssons gate 1 Norway
    Authors
    Animesh Sharma; Lars Hagen
    Variables measured
    Proteomics
    Description

    Pancreatic ductal adenocarcinoma (PDAC) remains one of the most lethal malignancies, with a five-year survival rate of 10-15% due to late-stage diagnosis and limited efficacy of existing treatments. This study utilized proteomics-based system modelling to generate multimodal datasets from various research models, including PDAC cells, spheroids, organoids, and tissues derived from murine and human samples. Identical mass spectrometry-based proteomics was applied across the different models. Preparation and validation of the research models and the proteomics were described in detail. The assembly datasets we present here contribute to the data collection on PDAC, which will be useful for systems modeling, data mining, knowledge discovery in databases, and bioinformatics of individual models. Further data analysis may lead to generation of research hypotheses, predictions of targets for diagnosis and treatment and relationships between data variables.

  15. s

    Mining mineral commodity trade llc USA Import & Buyer Data

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Mining mineral commodity trade llc USA Import & Buyer Data [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Info Solutions PVT LTD
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  16. c

    LifeScience Data Mining And Visualization Market size was USD 5815.2 million...

    • cognitivemarketresearch.com
    pdf,excel,csv,ppt
    Updated Dec 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cognitive Market Research (2024). LifeScience Data Mining And Visualization Market size was USD 5815.2 million in 2023! [Dataset]. https://www.cognitivemarketresearch.com/lifescience-data-mining-and-visualization-market-report
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Dec 15, 2024
    Dataset authored and provided by
    Cognitive Market Research
    License

    https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy

    Time period covered
    2021 - 2033
    Area covered
    Global
    Description

    According to Cognitive Market Research, the global Lifescience Data Mining And Visualization market size is USD 5815.2 million in 2023 and will expand at a compound annual growth rate (CAGR) of 9.60% from 2023 to 2030.

    North America held the major market of more than 40% of the global revenue with a market size of USD 2326.08 million in 2023 and will grow at a compound annual growth rate (CAGR) of 7.8% from 2023 to 2030
    Europe held the major market of more than 40% of the global revenue with a market size of USD 1744.56 million in 2023 and will grow at a compound annual growth rate (CAGR) of 8.1% from 2023 to 2030. 
    Asia Pacific held the fastest growing market of more than 23% of the global revenue with a market size of USD 1337.50 million in 2023 and will grow at a compound annual growth rate (CAGR) of 11.6% from 2023 to 2030
    Latin America market held of more than 5% of the global revenue with a market size of USD 290.76 million in 2023 and will grow at a compound annual growth rate (CAGR) of 9.0% from 2023 to 2030
    Middle East and Africa market held of more than 2.00% of the global revenue with a market size of USD 116.30 million in 2023 and will grow at a compound annual growth rate (CAGR) of 9.3% from 2023 to 2030
    The demand for Lifescience Data Mining And Visualizations is rising due to rapid growth in biological data and increasing emphasis on personalized medicine.
    Demand for On-Demand remains higher in the Lifescience Data Mining And Visualization market.
    The Pharmaceuticals category held the highest Lifescience Data Mining And Visualization market revenue share in 2023.
    

    Market Dynamics of Lifescience Data Mining And Visualization

    Key Drivers of Lifescience Data Mining And Visualization

    Advancements in Healthcare Informatics to Provide Viable Market Output
    

    The Lifescience Data Mining and Visualization market are driven by continuous advancements in healthcare informatics. As the life sciences industry generates vast volumes of complex data, sophisticated data mining and visualization tools are increasingly crucial. Advancements in healthcare informatics, including electronic health records (EHRs), genomics, and clinical trial data, provide a wealth of information. Data mining and visualization technologies empower researchers and healthcare professionals to extract meaningful insights, aiding in personalized medicine, drug discovery, and treatment optimization.

    August 2020: Johnson & Johnson and Regeneron Pharmaceuticals announced a strategic collaboration to develop and commercialize cancer immunotherapies.

    (Source:investor.regeneron.com/news-releases/news-release-details/regeneron-and-cytomx-announce-strategic-research-collaboration)

    Rising Focus on Precision Medicine Propel Market Growth
    

    A key driver in the Lifescience Data Mining and Visualization market is the growing focus on precision medicine. As healthcare shifts towards personalized treatment strategies, there is an increasing need to analyze diverse datasets, including genetic, clinical, and lifestyle information. Data mining and visualization tools facilitate the identification of patterns and correlations within this multidimensional data, enabling the development of tailored treatment approaches. The emphasis on precision medicine, driven by advancements in genomics and molecular profiling, positions data mining and visualization as essential components in deciphering the intricate relationships between biological factors and individual health, thereby fostering innovation in life science research and healthcare practices.

    In June 2022, SAS Institute Inc. (US) entered into an agreement with Gunvatta (US) to expedite clinical trials and FDA reporting through the SAS Life Science Analytics Framework on Azure.

    (Source:www.prnewswire.com/news-releases/clinical-research-and-drug-development-accelerated-via-analytics-301571580.html)

    Increasing adoption of artificial intelligence (AI) and machine learning (ML) algorithms is propelling the market growth of life science data mining and visualization
    

    These technologies have revolutionized the ability to analyze and interpret vast, complex datasets in fields such as drug discovery and personalized medicine. For instance, companies like Insitro are utilizing AI-driven models to analyze biological and chemical data, dramatically accelerating drug discovery timelines and optimizing the identification of new therape...

  17. n

    Data from: Carotenoid metabolic profiling and transcriptome-genome mining...

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    • +2more
    zip
    Updated May 2, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nazia Mojib; Maan Amad; Manjula Thimma; Naroa Aldanondo; Mande Kumaran; Xabier Irigoien (2014). Carotenoid metabolic profiling and transcriptome-genome mining reveal functional equivalence among blue-pigmented copepods and appendicularia [Dataset]. http://doi.org/10.5061/dryad.mm18b
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 2, 2014
    Dataset provided by
    King Abdullah University of Science and Technology
    Authors
    Nazia Mojib; Maan Amad; Manjula Thimma; Naroa Aldanondo; Mande Kumaran; Xabier Irigoien
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    Red Sea
    Description

    The tropical oligotrophic oceanic areas are characterized by high water transparency and annual solar radiation. Under these conditions, a large number of phylogenetically diverse mesozooplankton species living in the surface waters (neuston) are found to be blue pigmented. In the present study, we focused on understanding the metabolic and genetic basis of the observed blue phenotype functional equivalence between the blue pigmented organisms from the phylum Arthropoda, subclass copepoda (Acartia fossae) and the phylum Chordata, class appendicularia (Oikopleura dioica) in the Red Sea. Previous studies have shown that carotenoid protein complexes are responsible for blue coloration in crustaceans. Therefore, we performed carotenoid metabolic profiling using both targeted and non-targeted (high-resolution mass spectrometry) approaches in four different blue-pigmented genera of copepods and one blue-pigmented species of appendicularia. Astaxanthin was found to be the principal carotenoid in all species. The pathway analysis showed that all species can synthesize astaxanthin from β-carotene, ingested from dietary sources, via 3-hydroxyechinenone, canthaxanthin, zeaxanthin, adonirubin or adonixanthin. Further, using de novo assembled transcriptome of blue A. fossae (subclass copepoda) we identified highly expressed homologous β-carotene hydroxylase enzymes and carotenoid binding proteins responsible for astaxanthin formation and the blue phenotype. In blue O. dioica (class appendicularia), corresponding putative genes were identified from the reference genome. Collectively, our data provide molecular evidences for the bioconversion and accumulation of blue astaxanthin-protein complexes underpinning the observed ecological functional equivalence and adaptive convergence among neustonic mesozooplankton.

  18. s

    Seair Exim Solutions

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Seair Exim Solutions [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Info Solutions PVT LTD
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  19. Table_1_Transcriptomic Profiling Identifies Neutrophil-Specific Upregulation...

    • frontiersin.figshare.com
    docx
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew J. Sawyer; Mathieu Garand; Damien Chaussabel; Carl G. Feng (2023). Table_1_Transcriptomic Profiling Identifies Neutrophil-Specific Upregulation of Cystatin F as a Marker of Acute Inflammation in Humans.docx [Dataset]. http://doi.org/10.3389/fimmu.2021.634119.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Andrew J. Sawyer; Mathieu Garand; Damien Chaussabel; Carl G. Feng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cystatin F encoded by CST7 is a cysteine peptidase inhibitor known to be expressed in natural killer (NK) and CD8+ T cells during steady-state conditions. However, little is known about its expression during inflammatory disease states in humans. We have developed an analytic approach capable of not only identifying previously poorly characterized disease-associated genes but also defining regulatory mechanisms controlling their expression. By exploring multiple cohorts of public transcriptome data comprising 43 individual datasets, we showed that CST7 is upregulated in the blood during a diverse set of infectious and non-infectious inflammatory conditions. Interestingly, this upregulation of CST7 was neutrophil-specific, as its expression was unchanged in NK and CD8+ T cells during sepsis. Further analysis demonstrated that known microbial products or cytokines commonly associated with inflammation failed to increase CST7 expression, suggesting that its expression in neutrophils is induced by an endogenous serum factor commonly present in human inflammatory conditions. Overall, through the identification of CST7 upregulation as a marker of acute inflammation in humans, our study demonstrates the value of publicly available transcriptome data in knowledge generation and potential biomarker discovery.

  20. q

    Blackhawk Mining LLC Business Operations, SWOT, PESTLE, Porters Five Forces...

    • quaintel.com
    Updated Nov 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Quaintel Research Solutions (2025). Blackhawk Mining LLC Business Operations, SWOT, PESTLE, Porters Five Forces and Financial Analysis [Dataset]. https://quaintel.com/store/report/blackhawk-mining-llc-company-profile-swot-pestle-porters-five-forces-analysis
    Explore at:
    Dataset updated
    Nov 6, 2025
    Dataset authored and provided by
    Quaintel Research Solutions
    License

    https://quaintel.com/privacy-policyhttps://quaintel.com/privacy-policy

    Area covered
    Global
    Description

    Blackhawk Mining LLC Business Operations, Opportunities, Challenges and Risk (SWOT, PESTLE and Porters Five Forces Analysis); Corporate and ESG Strategies; Competitive Intelligence; Financial KPI’s; Operational KPI’s; Recent Trends: “ Read More

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Institutes of Health (2025). Mining microarray expression data by literature profiling [Dataset]. https://catalog.data.gov/dataset/mining-microarray-expression-data-by-literature-profiling

Data from: Mining microarray expression data by literature profiling

Related Article
Explore at:
Dataset updated
Sep 6, 2025
Dataset provided by
National Institutes of Health
Description

The lack of efficient techniques for assessing the biological implications of microarray gene-expression data remains an important obstacle in exploiting this information. To address this need, a mining technique has been developed based on the analysis of literature profiles generated by extracting the frequencies of certain terms from thousands of abstracts stored in the Medline literature database.

Search
Clear search
Close search
Google apps
Main menu