7 datasets found
  1. f

    Data from: Integración de los algoritmos de minería de datos 1R, PRISM e ID3...

    • figshare.com
    jpeg
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yadira Robles Aranda; Anthony R. Sotolongo (2023). Integración de los algoritmos de minería de datos 1R, PRISM e ID3 a PostgreSQL [Dataset]. http://doi.org/10.6084/m9.figshare.20011649.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    SciELO journals
    Authors
    Yadira Robles Aranda; Anthony R. Sotolongo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In this research, data mining and decision tree techniques were analyzed as well as the induction of rules to integrate their many algorithms into the database managing system (DBMS), PostgreSQL, due to the defficiencies of the free use tools avaialable. A mechanism to optimize the performance of the implemented algorithms was proposed with the purpose of taking advantage of the PostgreSQL. By means of an experiment, it was proven that the time response and results obtained are improved when the algorithms are integrated into the managing system.

  2. OCSCP

    • figshare.com
    7z
    Updated Sep 11, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vanha tran (2020). OCSCP [Dataset]. http://doi.org/10.6084/m9.figshare.12941714.v1
    Explore at:
    7zAvailable download formats
    Dataset updated
    Sep 11, 2020
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    vanha tran
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a set of spatial data sets which can use for discovering spatial co-location patterns.

  3. Data Analytics Market By Type (Descriptive Analytics, Predictive Analytics,...

    • verifiedmarketresearch.com
    Updated Oct 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Data Analytics Market By Type (Descriptive Analytics, Predictive Analytics, Augmented Analytics), Solution (Data Management, Data Mining, Data Monitoring), Application (Human Resource Management, Supply Chain Management, Database Management), By Geographic Scope And Forecast & Region for 2024-2031 [Dataset]. https://www.verifiedmarketresearch.com/product/data-analytics-market/
    Explore at:
    Dataset updated
    Oct 14, 2024
    Dataset provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    Authors
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    Data Analytics Market Valuation – 2024-2031

    Data Analytics Market was valued at USD 68.83 Billion in 2024 and is projected to reach USD 482.73 Billion by 2031, growing at a CAGR of 30.41% from 2024 to 2031.

    Data Analytics Market Drivers

    Data Explosion: The proliferation of digital devices and the internet has led to an exponential increase in data generation. Businesses are increasingly recognizing the value of harnessing this data to gain competitive insights.

    Advancements in Technology: Advancements in data storage, processing power, and analytics tools have made it easier and more cost-effective for organizations to analyze large datasets.

    Increased Business Demand: Businesses across various industries are seeking data-driven insights to improve decision-making, optimize operations, and enhance customer experiences.

    Data Analytics Market Restraints

    Data Quality and Integrity: Ensuring the accuracy, completeness, and consistency of data is crucial for effective analytics. Poor data quality can hinder insights and lead to erroneous conclusions.

    Data Privacy and Security Concerns: As organizations collect and analyze sensitive data, concerns about data privacy and security are becoming increasingly important. Breaches can have significant financial and reputational consequences.

  4. f

    Evidence that identifiers are a source of problems for data integrators.

    • figshare.com
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julie McMurry (2023). Evidence that identifiers are a source of problems for data integrators. [Dataset]. http://doi.org/10.6084/m9.figshare.3394843.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    figshare
    Authors
    Julie McMurry
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Advances in computing power and expansion of the Internet have led to increasing optimism that big data will lead to new insights. However, in the life sciences, relevant data is not only "big"; it is also highly decentralized across thousands of online databases. Wringing value from it depends on the discipline of data science and on the humble bricks and mortar that make it possible -- identifiers. However, our collective handling of identifiers has lagged behind these advances. Diverse identifier problems (for instance broken links and ‘content drift’) make it difficult to integrate data and derive new knowledge from it. This is a snapshot of a living document intended to show real-world examples of identifier problems representative of those encountered by data integrators. It is not meant to be exhaustive.

  5. a

    Coal Mine Entries 2016

    • indianamapold-inmap.hub.arcgis.com
    • indianamap.org
    • +1more
    Updated Nov 15, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IndianaMap (2017). Coal Mine Entries 2016 [Dataset]. https://indianamapold-inmap.hub.arcgis.com/datasets/b42257779032462d8ba2810cb4391e8f
    Explore at:
    Dataset updated
    Nov 15, 2017
    Dataset authored and provided by
    IndianaMap
    Area covered
    Description

    COAL_MINE_ENTRIES_DNR_IN.SHP contains the entrance locations of all documented underground coal mine entrances that have operated in the coal region of Indiana since the mid-1880s, up to December 31, 2016. COAL_MINE_ENTRIES_DNR_IN.SHP is attributed to allow the mine entrances to be differentiated based on entrance type (hoist shaft, other shafts, slope, unknown), depth, mine number, and source information (map number). The following is excerpted from the metadata provided by IDNR, Division of Reclamation, for the source point feature class named "COAL_ENTRY_GEOREF": "Coal_Mine_Entries_DNR_IN is a compilation of all documented underground coal mine entrances in Indiana. Coal_Mine_Entries_DNR_IN was compiled by the Indiana Geological Survey (IGS) as part of a contract deliverable for the Abandoned Mine Lands program of the Indiana Department of Natural Resources, Division of Reclamation. Coal_Mine_Entries_DNR_IN incorporates mine entrance locations compiled as part of the Indiana Coal Mine Information System (CMIS), an integrated geographic information system (GIS) and database management system (DBMS) created to store, analyze, and help distribute coal mine data in Indiana. The system contains data for surface and underground coal mines that operated in Indiana from the mid-1830s to 2007. Original source information for Coal_Mine_Entries_DNR_IN includes company mine maps, field maps and notes of IGS geologists, IGS publications, and Reports of the Indiana State Mine Inspector. All mine data included in Coal_Mine_Entries_DNR_IN are organized in a GIS using ESRI ArcGIS software of the Environmental Systems Research Institute (ESRI) on the Windows platform. Scale of source data ranges from 1:4,800 to 1:100,000."

  6. Data from: Dataset for Vector space model and the usage patterns of...

    • figshare.com
    bin
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gede Primahadi Wijaya Rajeg; Karlina Denistia; Simon Musgrave (2023). Dataset for Vector space model and the usage patterns of Indonesian denominal verbs [Dataset]. http://doi.org/10.6084/m9.figshare.8187155.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Gede Primahadi Wijaya Rajeg; Karlina Denistia; Simon Musgrave
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PrefaceThis is the data repository for the paper accepted for publication in NUSA's special issue on Linguistic studies using large annotated corpora (co-edited by Hiroki Nomoto and David Moeljadi).How to cite the datasetIf you use, adapt, and/or modify any of the dataset in this repository for your research or teaching purposes (except for the malindo_dbase, see below), please cite as:Rajeg, Gede Primahadi Wijaya; Denistia, Karlina; Musgrave, Simon (2019): Dataset for Vector space model and the usage patterns of Indonesian denominal verbs. figshare. Fileset. https://doi.org/10.6084/m9.figshare.8187155.Alternatively, click on the dark pink Cite button to browse different citation style (default is DataCite).The malindo_dbase data in this repository is from Nomoto et al. (2018) (cf the GitHub repository). So please also cite their work if you use it for your research:Nomoto, Hiroki, Hannah Choi, David Moeljadi and Francis Bond. 2018. MALINDO Morph: Morphological dictionary and analyser for Malay/Indonesian. Kiyoaki Shirai (ed.) Proceedings of the LREC 2018 Workshop "The 13th Workshop on Asian Language Resources", 36-43.Tutorial on how to use the data together with the R Markdown Notebook for the analyses is available on GitHub and figshare:Rajeg, Gede Primahadi Wijaya; Denistia, Karlina; Musgrave, Simon (2019): R Markdown Notebook for Vector space model and the usage patterns of Indonesian denominal verbs. figshare. Software. doi: https://doi.org/10.6084/m9.figshare.9970205Dataset description1. Leipzig_w2v_vector_full.bin is the vector space model used in the paper. We built it using wordVectors package (Schmidt & Li 2017) via the MonARCH High Performance Computing Cluster (We thank Philip Chan for his help with access to MonARCH).2. Files beginning with ngramexmpl_... are data for the n-grams (i.e. words sequence) of verbs discussed in the paper. The files are in tab-separated format.3. Files beginning with sentence_... are full sentences for the verbs discussed in the paper (in the plain text format and R dataset format [.rds]). Information of the corpus file and sentence number in which the verb is found are included.4. me_parsed_nountaggedbase (in three different file-formats) contains database of the me- words with noun-tagged root that MorphInd identified to occur in three morphological schemas we focus on (me-, me-/-kan, and me-/-i). The database has columns for the verbs' token frequency in the corpus, root forms, MorphInd parsing output, among others.5. wordcount_leipzig_allcorpus (in three different file-formats) contains information on the size of each corpus file used in the paper and from which the vector space model is built.6. wordlist_leipzig_ME_DI_TER_percorpus.tsv is a tab-separated frequency list of words prefixed with me-, di-, and ter- in all thirteen corpus files used. The wordlist is built by first tokenising each corpus file, lowercasing the tokens, and then extracting the words with the corresponding three prefixes using the following regular expressions: - For me-: ^(?i)(me)([a-z-]{3,})$- For di-: ^(?i)(di)([a-z-]{3,})$- For ter-: ^(?i)(ter)([a-z-]{3,})$7. malindo_dbase is the MALINDO Morphological Dictionary (see above).ReferencesSchmidt, Ben & Jian Li. 2017. wordVectors: Tools for creating and analyzing vector-space models of texts. R package. http://github.com/bmschmidt/wordVectors.

  7. Tweets analysis

    • figshare.com
    html
    Updated Oct 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    marilena daquino; Sara Amayeh (2021). Tweets analysis [Dataset]. http://doi.org/10.6084/m9.figshare.16823164.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Oct 17, 2021
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    marilena daquino; Sara Amayeh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    in our study, we performed different analysis such as sentiment analysis on the tweets posted by the visitors of the most popular European museums before and after Covid-19 pandemic in order to understand how emotional responses of the users who are interested in art has changed by this global crisis.

  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Yadira Robles Aranda; Anthony R. Sotolongo (2023). Integración de los algoritmos de minería de datos 1R, PRISM e ID3 a PostgreSQL [Dataset]. http://doi.org/10.6084/m9.figshare.20011649.v1

Data from: Integración de los algoritmos de minería de datos 1R, PRISM e ID3 a PostgreSQL

Related Article
Explore at:
jpegAvailable download formats
Dataset updated
Jun 3, 2023
Dataset provided by
SciELO journals
Authors
Yadira Robles Aranda; Anthony R. Sotolongo
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

In this research, data mining and decision tree techniques were analyzed as well as the induction of rules to integrate their many algorithms into the database managing system (DBMS), PostgreSQL, due to the defficiencies of the free use tools avaialable. A mechanism to optimize the performance of the implemented algorithms was proposed with the purpose of taking advantage of the PostgreSQL. By means of an experiment, it was proven that the time response and results obtained are improved when the algorithms are integrated into the managing system.

Search
Clear search
Close search
Google apps
Main menu