100+ datasets found
  1. Riga Data Science Club

    • kaggle.com
    zip
    Updated Mar 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dmitry Yemelyanov (2021). Riga Data Science Club [Dataset]. https://www.kaggle.com/datasets/dmitryyemelyanov/rigadsclub
    Explore at:
    zip(494849 bytes)Available download formats
    Dataset updated
    Mar 29, 2021
    Authors
    Dmitry Yemelyanov
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Riga
    Description

    Context

    Riga Data Science Club is a non-profit organisation to share ideas, experience and build machine learning projects together. Data Science community should known own data, so this is a dataset about ourselves: our website analytics, social media activity, slack statistics and even meetup transcriptions!

    Content

    Dataset is split up in several folders by the context: * linkedin - company page visitor, follower and post stats * slack - messaging and member activity * typeform - new member responses * website - website visitors by country, language, device, operating system, screen resolution * youtube - meetup transcriptions

    Inspiration

    Let's make Riga Data Science Club better! We expect this data to bring lots of insights on how to improve.

    "Know your c̶u̶s̶t̶o̶m̶e̶r̶ member" - Explore member interests by analysing sign-up survey (typeform) responses - Explore messaging patterns in Slack to understand how members are retained and when they are lost

    Social media intelligence * Define LinkedIn posting strategy based on historical engagement data * Define target user profile based on LinkedIn page attendance data

    Website * Define website localisation strategy based on data about visitor countries and languages * Define website responsive design strategy based on data about visitor devices, operating systems and screen resolutions

    Have some fun * NLP analysis of meetup transcriptions: word frequencies, question answering, something else?

  2. Data Science Stack Exchange Dataset

    • kaggle.com
    zip
    Updated Jul 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aneesh Tickoo (2022). Data Science Stack Exchange Dataset [Dataset]. https://www.kaggle.com/datasets/aneeshtickoo/data-science-stack-exchange
    Explore at:
    zip(91829637 bytes)Available download formats
    Dataset updated
    Jul 11, 2022
    Authors
    Aneesh Tickoo
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Stack Exchange is a network of question-and-answer websites on topics in diverse fields, each site covering a specific topic, where questions, answers, and users are subject to a reputation award process. The reputation system allows the sites to be self-moderating.

    The dataset here is specific to one such network site of Stack Exchange named Data Science Stack Exchange. The dataset is distributed over multiple files. It contains information on various Posts on data science that can be used for language processing, it has data on which posts are being liked by users more, etc. A lot of analysis can be done on this dataset.

  3. m

    Austin_Survey_for_MDCOR_Analyses

    • data.mendeley.com
    Updated Nov 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel Gonzalez Canche (2022). Austin_Survey_for_MDCOR_Analyses [Dataset]. http://doi.org/10.17632/nb7yvhjvzk.1
    Explore at:
    Dataset updated
    Nov 14, 2022
    Authors
    Manuel Gonzalez Canche
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Austin
    Description

    The city of Austin has administered a community survey for the 2015, 2016, 2017, 2018 and 2019 years (https://data.austintexas.gov/City-Government/Community-Survey/s2py-ceb7), to “assess satisfaction with the delivery of the major City Services and to help determine priorities for the community as part of the City’s ongoing planning process.” To directly access this dataset from the city of Austin’s website, you can follow this link https://cutt.ly/VNqq5Kd. Although we downloaded the dataset analyzed in this study from the former link, given that the city of Austin is interested in continuing administering this survey, there is a chance that the data we used for this analysis and the data hosted in the city of Austin’s website may differ in the following years. Accordingly, to ensure the replication of our findings, we recommend researchers to download and analyze the dataset we employed in our analyses, which can be accessed at the following link https://github.com/democratizing-data-science/MDCOR/blob/main/Community_Survey.csv. Replication Features or Variables The community survey data has 10,684 rows and 251 columns. Of these columns, our analyses will rely on the following three indicators that are taken verbatim from the survey: “ID”, “Q25 - If there was one thing you could share with the Mayor regarding the City of Austin (any comment, suggestion, etc.), what would it be?", and “Do you own or rent your home?”

  4. Online Data Science Training Programs Market Analysis, Size, and Forecast...

    • technavio.com
    pdf
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Online Data Science Training Programs Market Analysis, Size, and Forecast 2025-2029: North America (Mexico), Europe (France, Germany, Italy, and UK), Middle East and Africa (UAE), APAC (Australia, China, India, Japan, and South Korea), South America (Brazil), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/online-data-science-training-programs-market-industry-analysis
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Feb 12, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    License

    https://www.technavio.com/content/privacy-noticehttps://www.technavio.com/content/privacy-notice

    Time period covered
    2025 - 2029
    Description

    Snapshot img

    Online Data Science Training Programs Market Size 2025-2029

    The online data science training programs market size is forecast to increase by USD 8.67 billion, at a CAGR of 35.8% between 2024 and 2029.

    The market is experiencing significant growth due to the increasing demand for data science professionals in various industries. The job market offers lucrative opportunities for individuals with data science skills, making online training programs an attractive option for those seeking to upskill or reskill. Another key driver in the market is the adoption of microlearning and gamification techniques in data science training. These approaches make learning more engaging and accessible, allowing individuals to acquire new skills at their own pace. Furthermore, the availability of open-source learning materials has democratized access to data science education, enabling a larger pool of learners to enter the field. However, the market also faces challenges, including the need for continuous updates to keep up with the rapidly evolving data science landscape and the lack of standardization in online training programs, which can make it difficult for employers to assess the quality of graduates. Companies seeking to capitalize on market opportunities should focus on offering up-to-date, high-quality training programs that incorporate microlearning and gamification techniques, while also addressing the challenges of continuous updates and standardization. By doing so, they can differentiate themselves in a competitive market and meet the evolving needs of learners and employers alike.

    What will be the Size of the Online Data Science Training Programs Market during the forecast period?

    Request Free SampleThe online data science training market continues to evolve, driven by the increasing demand for data-driven insights and innovations across various sectors. Data science applications, from computer vision and deep learning to natural language processing and predictive analytics, are revolutionizing industries and transforming business operations. Industry case studies showcase the impact of data science in action, with big data and machine learning driving advancements in healthcare, finance, and retail. Virtual labs enable learners to gain hands-on experience, while data scientist salaries remain competitive and attractive. Cloud computing and data science platforms facilitate interactive learning and collaborative research, fostering a vibrant data science community. Data privacy and security concerns are addressed through advanced data governance and ethical frameworks. Data science libraries, such as TensorFlow and Scikit-Learn, streamline the development process, while data storytelling tools help communicate complex insights effectively. Data mining and predictive analytics enable organizations to uncover hidden trends and patterns, driving innovation and growth. The future of data science is bright, with ongoing research and development in areas like data ethics, data governance, and artificial intelligence. Data science conferences and education programs provide opportunities for professionals to expand their knowledge and expertise, ensuring they remain at the forefront of this dynamic field.

    How is this Online Data Science Training Programs Industry segmented?

    The online data science training programs industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments. TypeProfessional degree coursesCertification coursesApplicationStudentsWorking professionalsLanguageR programmingPythonBig MLSASOthersMethodLive streamingRecordedProgram TypeBootcampsCertificatesDegree ProgramsGeographyNorth AmericaUSMexicoEuropeFranceGermanyItalyUKMiddle East and AfricaUAEAPACAustraliaChinaIndiaJapanSouth KoreaSouth AmericaBrazilRest of World (ROW)

    By Type Insights

    The professional degree courses segment is estimated to witness significant growth during the forecast period.The market encompasses various segments catering to diverse learning needs. The professional degree course segment holds a significant position, offering comprehensive and in-depth training in data science. This segment's curriculum covers essential aspects such as statistical analysis, machine learning, data visualization, and data engineering. Delivered by industry professionals and academic experts, these courses ensure a high-quality education experience. Interactive learning environments, including live lectures, webinars, and group discussions, foster a collaborative and engaging experience. Data science applications, including deep learning, computer vision, and natural language processing, are integral to the market's growth. Data analysis, a crucial application, is gaining traction due to the increasing demand for data-driven decisio

  5. h

    Data-Science-Instruct-Dataset

    • huggingface.co
    Updated May 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Habib Ahmed (2025). Data-Science-Instruct-Dataset [Dataset]. https://huggingface.co/datasets/HabibAhmed/Data-Science-Instruct-Dataset
    Explore at:
    Dataset updated
    May 3, 2025
    Authors
    Mohammed Habib Ahmed
    Description

    HabibAhmed/Data-Science-Instruct-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Facebook Group Insights Dataset

    • kaggle.com
    zip
    Updated Oct 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Arif Hasan (2023). Facebook Group Insights Dataset [Dataset]. https://www.kaggle.com/datasets/arifhasan23/short-stories-community-facebook-group-insights
    Explore at:
    zip(67014 bytes)Available download formats
    Dataset updated
    Oct 17, 2023
    Authors
    Md Arif Hasan
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The "**Facebook Group Insights Dataset**" on Kaggle is a concise, data-rich resource for analysing the dynamics of a specific Facebook group.

    This dataset provides key information on admins, daily metrics, member demographics, geographic distribution, popular activity times, and top-performing posts from the past 28 days. It is an essential tool for researchers, social media analysts, and data enthusiasts looking to gain insights into online community behaviour and engagement strategies. Whether you're a social media manager or a data scientist, this dataset offers precise and valuable insights into the inner workings of Facebook groups.

  7. Reddit - Machine Learning and Data Science

    • kaggle.com
    zip
    Updated Jan 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Durgesh Samariya (2022). Reddit - Machine Learning and Data Science [Dataset]. https://www.kaggle.com/datasets/themlphdstudent/reddit-machine-learning-and-data-science
    Explore at:
    zip(8299407 bytes)Available download formats
    Dataset updated
    Jan 4, 2022
    Authors
    Durgesh Samariya
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Please, If you enjoyed this dataset, don't forget to upvote it.

    Content

    This dataset contains a couple of fields with the information based on Reddit post submission, such:

    • title
    • id
    • redditor
    • num_upvotes
    • subreddit
    • url
    • num_comments
    • created_on
    • body
    • upvote_ratio
    • over_18
    • link_flair_text
    • edited

    Method

    The data was extracted using the PRAW:The Python Reddit API Wrapper.

    Credits

    Cover Image: Photo by Marius Masalar on Unsplash

  8. G

    Community Analytics Platform Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Community Analytics Platform Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/community-analytics-platform-market
    Explore at:
    pdf, pptx, csvAvailable download formats
    Dataset updated
    Aug 22, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Community Analytics Platform Market Outlook



    According to our latest research, the global community analytics platform market size reached USD 2.8 billion in 2024, with a robust growth trajectory driven by the rising demand for actionable insights from online communities. The market is expected to expand at a CAGR of 17.2% from 2025 to 2033, reaching an estimated USD 12.9 billion by 2033. This growth is propelled by the increasing integration of artificial intelligence, machine learning, and advanced data analytics in community management tools, which enable organizations to better understand user behavior, enhance engagement, and optimize business strategies.



    One of the primary growth factors for the community analytics platform market is the exponential rise in digital communities and social media interactions across industries. As organizations increasingly rely on digital platforms to foster brand loyalty, provide customer support, and build engaged user bases, the need for robust analytics solutions becomes paramount. Community analytics platforms empower businesses to extract valuable insights from user-generated content, sentiment, and engagement patterns, enabling data-driven decision-making. The proliferation of online forums, brand communities, and social networking groups has created a goldmine of data, which, when properly analyzed, can significantly enhance customer engagement and drive business growth.



    Another significant driver is the rapid adoption of cloud-based analytics solutions. Cloud deployment offers scalability, flexibility, and cost-effectiveness, making it an attractive choice for organizations of all sizes. The shift towards cloud-based community analytics platforms is further accelerated by the need for real-time data processing and remote accessibility, especially in the post-pandemic era where remote work and virtual communities have become the norm. Cloud solutions also facilitate seamless integration with other business applications, enabling organizations to create a unified data ecosystem that enhances operational efficiency and strategic planning.



    Furthermore, advancements in artificial intelligence and machine learning are transforming the landscape of community analytics. AI-powered platforms can automate sentiment analysis, content moderation, and predictive analytics, providing deeper insights into community dynamics and user behavior. These technologies enable organizations to identify emerging trends, detect potential issues, and personalize interactions at scale. As a result, businesses are increasingly investing in AI-driven community analytics solutions to stay ahead of the competition, improve customer satisfaction, and foster long-term loyalty.



    From a regional perspective, North America continues to dominate the community analytics platform market, accounting for the largest revenue share in 2024. This dominance is attributed to the high adoption rate of advanced analytics technologies, the presence of major market players, and the strong digital infrastructure in the region. However, Asia Pacific is emerging as the fastest-growing market, fueled by rapid digitalization, increasing internet penetration, and the growing popularity of online communities in countries like China, India, and Japan. Europe also holds a significant market share, driven by the rising focus on customer experience and regulatory requirements for data-driven decision-making.





    Component Analysis



    The community analytics platform market by component is segmented into software and services, each playing a pivotal role in the ecosystem. The software segment encompasses a wide array of tools such as dashboards, reporting modules, sentiment analysis engines, and integration frameworks designed to extract, process, and visualize data from community interactions. These solutions are continuously evolving, with vendors integrating advanced features like natural language processing, real-time analytics, and automated reporting to provide comprehensive insights. As organizations increasingly seek to levera

  9. o

    Scientists' Data Sharing Behaviors

    • openicpsr.org
    Updated Aug 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Youngseek Kim (2016). Scientists' Data Sharing Behaviors [Dataset]. http://doi.org/10.3886/E100087V7
    Explore at:
    Dataset updated
    Aug 19, 2016
    Authors
    Youngseek Kim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    The objective of this research is to investigate the factors influencing scientists’ data sharing behaviors in different scientific communities by examining both discipline and individual level predictors together. The target population of this research included faculty members and post-doctoral researchers in U.S. academic institutions who belong to STEM disciplines. The sampling frame of this research was identified from the scholar list in the Community of Science’s (CoS) Scholar Database (http://pivot.cos.com), which provides a researcher profile directory in the world mainly from universities and colleges. The final field survey instrument was distributed to the 16,165 potential survey participants in 56 STEM disciplines. From November 19, 2012 to February 15, 2013, a total of 2,470 valid responses were received for the initial data analysis (15.28% of response rate).

  10. H

    [Data] Orientations to Mentoring in Academic and Community Data Science

    • dataverse.harvard.edu
    • dataone.org
    Updated Oct 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Alexander (2025). [Data] Orientations to Mentoring in Academic and Community Data Science [Dataset]. http://doi.org/10.7910/DVN/ZZIBYH
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 16, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Nathan Alexander
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Citation database for the analysis conducted in "Orientations to Mentoring in Academic and Community Data Science."

  11. Data from: Bringing ecology blogging into the scientific fold: measuring...

    • zenodo.org
    • datasetcatalog.nlm.nih.gov
    • +2more
    Updated May 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manu E. Saunders; Meghan A. Duffy; Stephen B. Heard; Margaret Kosmala; Simon R. Leather; Terrence P. McGlynn; Jeff Ollerton; Amy L. Parachnowitsch; Manu E. Saunders; Meghan A. Duffy; Stephen B. Heard; Margaret Kosmala; Simon R. Leather; Terrence P. McGlynn; Jeff Ollerton; Amy L. Parachnowitsch (2022). Data from: Bringing ecology blogging into the scientific fold: measuring reach and impact of science community blogs [Dataset]. http://doi.org/10.5061/dryad.kf8b0
    Explore at:
    Dataset updated
    May 30, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Manu E. Saunders; Meghan A. Duffy; Stephen B. Heard; Margaret Kosmala; Simon R. Leather; Terrence P. McGlynn; Jeff Ollerton; Amy L. Parachnowitsch; Manu E. Saunders; Meghan A. Duffy; Stephen B. Heard; Margaret Kosmala; Simon R. Leather; Terrence P. McGlynn; Jeff Ollerton; Amy L. Parachnowitsch
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The popularity of science blogging has increased in recent years, but the number of academic scientists who maintain regular blogs is limited. The role and impact of science communication blogs aimed at general audiences is often discussed, but the value of science community blogs aimed at the academic community has largely been overlooked. Here, we focus on our own experiences as bloggers to argue that science community blogs are valuable to the academic community. We use data from our own blogs (n = 7) to illustrate some of the factors influencing reach and impact of science community blogs. We then discuss the value of blogs as a standalone medium, where rapid communication of scholarly ideas, opinions, and short observational notes can enhance scientific discourse, and discussion of personal experiences can provide indirect mentorship for junior researchers and scientists from underrepresented groups. Finally, we argue that science community blogs can be treated as a primary source and provide some key points to consider when citing blogs in peer-reviewed literature.

  12. Data from: A Systematic Literature Review of Undergraduate Data Science...

    • tandf.figshare.com
    pdf
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mine Dogucu; Sinem Demirci; Harry Bendekgey; Federica Zoe Ricci; Catalina M. Medina (2025). A Systematic Literature Review of Undergraduate Data Science Education Research [Dataset]. http://doi.org/10.6084/m9.figshare.28715507.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Oct 7, 2025
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Mine Dogucu; Sinem Demirci; Harry Bendekgey; Federica Zoe Ricci; Catalina M. Medina
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The presence of data science has been profound in the scientific community in almost every discipline. An important part of the data science education expansion has been at the undergraduate level. We conducted a systematic literature review to (a) portray current evidence and knowledge gaps in self-proclaimed undergraduate data science education research and (b) inform policymakers and the data science education community about what educators may encounter when searching for literature using the general keyword “data science education.” While open-access publications that target a broader audience of data science educators and include multiple examples of data science programs and courses are a strength, substantial knowledge gaps remain. The undergraduate data science literature that we identified often lacks empirical data, research questions, and reproducibility. Certain disciplines are less visible. We recommend that we should (a) cherish data science as an interdisciplinary field; (b) adopt a consistent set of keywords/terminology to ensure data science education literature is easily identifiable; (c) prioritize investments in empirical studies.

  13. q

    Biobyte 4 - The role of data science principles and practices in...

    • qubeshub.org
    Updated Aug 15, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sam Donovan (2019). Biobyte 4 - The role of data science principles and practices in undergraduate biology [Dataset]. http://doi.org/10.25334/B3K4-7G59
    Explore at:
    Dataset updated
    Aug 15, 2019
    Dataset provided by
    QUBES
    Authors
    Sam Donovan
    Description

    This short activity was an effort to launch a community conversation around the interface of data science principles and practices and undergraduate biology education. A variety of resources, communities, and projects are shared.

  14. C

    Community-Driven Model Service Platform Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Community-Driven Model Service Platform Report [Dataset]. https://www.marketreportanalytics.com/reports/community-driven-model-service-platform-73131
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    Apr 9, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Discover the booming Community-Driven Model Service Platform market! This comprehensive analysis reveals a CAGR of 10.1%, driven by AI adoption and open-source innovation. Explore market size, trends, segmentation (cloud, on-premises, adult, children), key players (Kaggle, GitHub, Hugging Face), and regional insights. Learn more about this rapidly expanding sector.

  15. h

    data-science-job-salaries

    • huggingface.co
    Updated Jun 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Omar Espejel (2023). data-science-job-salaries [Dataset]. https://huggingface.co/datasets/espejelomar/data-science-job-salaries
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 25, 2023
    Authors
    Omar Espejel
    Description

    espejelomar/data-science-job-salaries dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. a

    Open Data Analytics

    • community-esrica-apps.hub.arcgis.com
    • hub.arcgis.com
    • +1more
    Updated Sep 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Halifax Regional Municipality (2020). Open Data Analytics [Dataset]. https://community-esrica-apps.hub.arcgis.com/datasets/HRM::open-data-analytics
    Explore at:
    Dataset updated
    Sep 18, 2020
    Dataset authored and provided by
    Halifax Regional Municipality
    Description

    Table of usage statistics (number of views) for datasets within the Halifax Open Data Catalogue.The data was collected to show the usage of data within the Open Data Catalogue. Metadata

  17. An analysis and metric of reusable data licensing practices for biomedical...

    • plos.figshare.com
    docx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seth Carbon; Robin Champieux; Julie A. McMurry; Lilly Winfree; Letisha R. Wyatt; Melissa A. Haendel (2023). An analysis and metric of reusable data licensing practices for biomedical resources [Dataset]. http://doi.org/10.1371/journal.pone.0213090
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Seth Carbon; Robin Champieux; Julie A. McMurry; Lilly Winfree; Letisha R. Wyatt; Melissa A. Haendel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data are the foundation of science, and there is an increasing focus on how data can be reused and enhanced to drive scientific discoveries. However, most seemingly “open data” do not provide legal permissions for reuse and redistribution. The inability to integrate and redistribute our collective data resources blocks innovation and stymies the creation of life-improving diagnostic and drug selection tools. To help the biomedical research and research support communities (e.g. libraries, funders, repositories, etc.) understand and navigate the data licensing landscape, the (Re)usable Data Project (RDP) (http://reusabledata.org) assesses the licensing characteristics of data resources and how licensing behaviors impact reuse. We have created a ruleset to determine the reusability of data resources and have applied it to 56 scientific data resources (e.g. databases) to date. The results show significant reuse and interoperability barriers. Inspired by game-changing projects like Creative Commons, the Wikipedia Foundation, and the Free Software movement, we hope to engage the scientific community in the discussion regarding the legal use and reuse of scientific data, including the balance of openness and how to create sustainable data resources in an increasingly competitive environment.

  18. C

    Community-Driven Model Service Platform Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Community-Driven Model Service Platform Report [Dataset]. https://www.marketreportanalytics.com/reports/community-driven-model-service-platform-73127
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Apr 9, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Community-Driven Model Service Platform market is experiencing robust growth, projected to reach $35.14 billion in 2025 and maintain a Compound Annual Growth Rate (CAGR) of 10.1% from 2025 to 2033. This expansion is fueled by several key factors. The increasing availability of open-source models and datasets, fostered by platforms like Kaggle, GitHub, and Hugging Face, is democratizing access to advanced machine learning capabilities. This, in turn, accelerates innovation and reduces the barrier to entry for both developers and businesses. Furthermore, the growing demand for specialized AI solutions across diverse sectors—from healthcare and finance to manufacturing and retail—is driving adoption. The cloud-based segment holds a significant market share due to its scalability, accessibility, and cost-effectiveness compared to on-premises solutions. The adult application segment is currently the largest, reflecting the high concentration of skilled professionals and research activities within this group; however, the children's application segment shows significant growth potential given increasing educational initiatives incorporating AI. Geographic distribution shows North America and Europe currently leading market adoption, while Asia-Pacific is expected to witness rapid expansion driven by increasing digitalization and technological advancements. The competitive landscape is characterized by a mix of established technology giants and emerging startups. Platforms like TensorFlow Hub and Model Zoo provide comprehensive model repositories, while companies like DrivenData and Cortex focus on data-centric approaches. This competitive environment encourages continuous improvement and innovation within the platform offerings. Challenges include ensuring data security and privacy, addressing biases in datasets, and maintaining a balance between open collaboration and intellectual property rights. However, the overall trajectory points toward sustained market growth, fueled by ongoing technological advancements, increasing adoption across diverse industries, and the continuous contribution of a vibrant community of developers and researchers. Future growth will hinge on platforms successfully addressing the challenges and further enhancing collaborative features, fostering community engagement, and expanding the available resources.

  19. f

    Data from: Citizen science participation in research in the environmental...

    • scielo.figshare.com
    jpeg
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DAVI G.F. CUNHA; JONATAS F. MARQUES; JULIANA C. DE RESENDE; PATRÍCIA B. DE FALCO; CHRISLAINE M. DE SOUZA; STEVEN A. LOISELLE (2023). Citizen science participation in research in the environmental sciences: key factors related to projects’ success and longevity [Dataset]. http://doi.org/10.6084/m9.figshare.5644456.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELO journals
    Authors
    DAVI G.F. CUNHA; JONATAS F. MARQUES; JULIANA C. DE RESENDE; PATRÍCIA B. DE FALCO; CHRISLAINE M. DE SOUZA; STEVEN A. LOISELLE
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT The potential impacts of citizen science initiatives are increasing across the globe, albeit in an imbalanced manner. In general, there is a strong element of trial and error in most projects, and the comparison of best practices and project structure between different initiatives remains difficult. In Brazil, the participation of volunteers in environmental research is limited. Identifying the factors related to citizen science projects’ success and longevity within a global perspective can contribute for consolidating such practices in the country. In this study, we explore past and present projects, including a case study in Brazil, to identify the spatial and temporal trends of citizen science programs as well as their best practices and challenges. We performed a bibliographic search using Google Scholar and considered results from 2005-2014. Although these results are subjective due to the Google Scholar’s algorithm and ranking criteria, we highlighted factors to compare projects across geographical and disciplinary areas and identified key matches between project proponents and participants, project goals and local priorities, participant profiles and engagement, scientific methods and funding. This approach is a useful starting point for future citizen science projects, allowing for a systematic analysis of potential inconsistencies and shortcomings in this emerging field.

  20. d

    The Convergence of High Performance Computing, Big Data, and Machine...

    • catalog.data.gov
    • s.cnmilf.com
    Updated May 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NCO NITRD (2025). The Convergence of High Performance Computing, Big Data, and Machine Learning: Summary of the Big Data and High End Computing Interagency Working Groups Joint Workshop [Dataset]. https://catalog.data.gov/dataset/the-convergence-of-high-performance-computing-big-data-and-machine-learning-summary-of-the
    Explore at:
    Dataset updated
    May 14, 2025
    Dataset provided by
    NCO NITRD
    Description

    The high performance computing (HPC) and big data (BD) communities traditionally have pursued independent trajectories in the world of computational science. HPC has been synonymous with modeling and simulation, and BD with ingesting and analyzing data from diverse sources, including from simulations. However, both communities are evolving in response to changing user needs and technological landscapes. Researchers are increasingly using machine learning (ML) not only for data analytics but also for modeling and simulation; science-based simulations are increasingly relying on embedded ML models not only to interpret results from massive data outputs but also to steer computations. Science-based models are being combined with data-driven models to represent complex systems and phenomena. There also is an increasing need for real-time data analytics, which requires large-scale computations to be performed closer to the data and data infrastructures, to adapt to HPC-like modes of operation. These new use cases create a vital need for HPC and BD systems to deal with simulations and data analytics in a more unified fashion. To explore this need, the NITRD Big Data and High-End Computing R&D Interagency Working Groups held a workshop, The Convergence of High-Performance Computing, Big Data, and Machine Learning, on October 29-30, 2018, in Bethesda, Maryland. The purposes of the workshop were to bring together representatives from the public, private, and academic sectors to share their knowledge and insights on integrating HPC, BD, and ML systems and approaches and to identify key research challenges and opportunities. The 58 workshop participants represented a balanced cross-section of stakeholders involved in or impacted by this area of research. Additional workshop information, including a webcast, is available at https://www.nitrd.gov/nitrdgroups/index.php?title=HPC-BD-Convergence.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Dmitry Yemelyanov (2021). Riga Data Science Club [Dataset]. https://www.kaggle.com/datasets/dmitryyemelyanov/rigadsclub
Organization logo

Riga Data Science Club

LinkedIn stats, meetup transcriptions, website analytics, typeform responses

Explore at:
zip(494849 bytes)Available download formats
Dataset updated
Mar 29, 2021
Authors
Dmitry Yemelyanov
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Area covered
Riga
Description

Context

Riga Data Science Club is a non-profit organisation to share ideas, experience and build machine learning projects together. Data Science community should known own data, so this is a dataset about ourselves: our website analytics, social media activity, slack statistics and even meetup transcriptions!

Content

Dataset is split up in several folders by the context: * linkedin - company page visitor, follower and post stats * slack - messaging and member activity * typeform - new member responses * website - website visitors by country, language, device, operating system, screen resolution * youtube - meetup transcriptions

Inspiration

Let's make Riga Data Science Club better! We expect this data to bring lots of insights on how to improve.

"Know your c̶u̶s̶t̶o̶m̶e̶r̶ member" - Explore member interests by analysing sign-up survey (typeform) responses - Explore messaging patterns in Slack to understand how members are retained and when they are lost

Social media intelligence * Define LinkedIn posting strategy based on historical engagement data * Define target user profile based on LinkedIn page attendance data

Website * Define website localisation strategy based on data about visitor countries and languages * Define website responsive design strategy based on data about visitor devices, operating systems and screen resolutions

Have some fun * NLP analysis of meetup transcriptions: word frequencies, question answering, something else?

Search
Clear search
Close search
Google apps
Main menu