14 datasets found
  1. Instagram Reach Analysis - Excel Project

    • kaggle.com
    Updated Jun 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raghad Al-marshadi (2025). Instagram Reach Analysis - Excel Project [Dataset]. https://www.kaggle.com/datasets/raghadalmarshadi/instagram-reach-analysis-excel-project/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 14, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Raghad Al-marshadi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    📊 Instagram Reach Analysis | تحليل الوصول في إنستغرام

    An exploratory data analysis project using Excel to understand what influences Instagram post reach and engagement.
    مشروع تحليل استكشافي لفهم العوامل المؤثرة في وصول منشورات إنستغرام وتفاعل المستخدمين، باستخدام Excel.

    📁 Project Description | وصف المشروع

    This project uses an Instagram dataset imported from Kaggle to explore how different factors like hashtags, saves, shares, and caption length influence impressions and engagement.
    يستخدم هذا المشروع بيانات من إنستغرام تم استيرادها من منصة Kaggle لتحليل كيف تؤثر عوامل مثل الهاشتاقات، الحفظ، المشاركة، وطول التسمية التوضيحية في عدد مرات الظهور والتفاعل.

    🛠️ Tools Used | الأدوات المستخدمة

    • Microsoft Excel
    • Pivot Tables
    • TRIM, WRAP, and other Excel formulas
    • مايكروسوفت إكسل
    • الجداول المحورية
    • دوال مثل TRIM و WRAP وغيرها في Excel

    🧹 Data Cleaning | تنظيف البيانات

    • Removed unnecessary spaces using TRIM
    • Removed 17 duplicate rows → 103 unique rows remained
    • Standardized formatting: freeze top row, wrap text, center align

    • إزالة المسافات غير الضرورية باستخدام TRIM

    • حذف 17 صفًا مكررًا → تبقى 103 صفوف فريدة

    • تنسيق موحد: تثبيت الصف الأول، لف النص، وتوسيط المحتوى

    🔍 Key Analysis Highlights | أبرز نتائج التحليل

    1. Impressions by Source | مرات الظهور حسب المصدر

    • Highest reach: Home > Hashtags > Explore > Other
    • Some totals exceed 100% due to overlapping

    2. Engagement Insights | رؤى حول التفاعل

    • Saves strongly correlate with higher impressions
    • Caption length is inversely related to likes
    • Shares have weak correlation with impressions

    3. Hashtag Patterns | تحليل الهاشتاقات

    • Most used: #Thecleverprogrammer, #Amankharwal, #Python
    • Repeating hashtags does not guarantee higher reach

    ✅ Conclusion | الخلاصة

    Shorter captions and higher save counts contribute more to reach than repeated hashtags. Profile visits are often linked to new followers.
    العناوين القصيرة وعدد الحفظات تلعب دورًا أكبر في الوصول من تكرار الهاشتاقات. كما أن زيارات الملف الشخصي ترتبط غالبًا بزيادة المتابعين.

    👩‍💻 Author | المؤلفة

    Raghad's LinkedIn

    🧠 Inspiration | الإلهام

    Inspired by content from TheCleverProgrammer, Aman Kharwal, and Kaggle datasets.
    استُلهم المشروع من محتوى TheCleverProgrammer وأمان خروال، وبيانات من Kaggle.

    💬 Feedback | الملاحظات

    Feel free to open an issue or share suggestions!
    يسعدنا تلقي ملاحظاتكم واقتراحاتكم عبر صفحة المشروع.

  2. S

    Spreadsheet Editor Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Spreadsheet Editor Report [Dataset]. https://www.datainsightsmarket.com/reports/spreadsheet-editor-1431362
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global spreadsheet editor market is experiencing robust growth, driven by the increasing digitization of businesses and the rising demand for efficient data management solutions across various industries. The market, estimated at $50 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 10% from 2025 to 2033, reaching approximately $130 billion by 2033. This growth is fueled by several factors, including the expanding adoption of cloud-based spreadsheet editors offering enhanced collaboration and accessibility features, the increasing need for data analysis and visualization tools within organizations of all sizes (Large Enterprises and SMBs), and the integration of spreadsheet software with other business applications through APIs offered by companies like Zapier. The free segment holds a significant market share, particularly among individual users and small businesses, while the paid segment, which offers advanced features and support, contributes substantially to overall market revenue. Key players such as Microsoft, Google, and LibreOffice dominate the market, but emerging players are continually introducing innovative features and pricing models to gain a competitive edge. Significant regional variations exist. North America currently holds the largest market share due to high technology adoption and a well-established digital infrastructure, followed by Europe and Asia-Pacific. However, the Asia-Pacific region is anticipated to experience the fastest growth in the forecast period due to rapid technological advancements and increasing internet penetration across countries like India and China. Growth restraints include security concerns related to cloud storage, the cost of implementation and training for complex software, and the increasing competition from specialized data analysis tools. Despite these challenges, the consistent demand for streamlined data management across diverse sectors ensures the continued expansion of the spreadsheet editor market in the coming years. The market’s evolution reflects a shift towards user-friendly, feature-rich, and collaborative solutions that are seamlessly integrated into broader business ecosystems.

  3. f

    Enhancing UNCDF Operations: Power BI Dashboard Development and Data Mapping

    • figshare.com
    Updated Jan 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maryam Binti Haji Abdul Halim (2025). Enhancing UNCDF Operations: Power BI Dashboard Development and Data Mapping [Dataset]. http://doi.org/10.6084/m9.figshare.28147451.v1
    Explore at:
    Dataset updated
    Jan 6, 2025
    Dataset provided by
    figshare
    Authors
    Maryam Binti Haji Abdul Halim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This project focuses on data mapping, integration, and analysis to support the development and enhancement of six UNCDF operational applications: OrgTraveler, Comms Central, Internal Support Hub, Partnership 360, SmartHR, and TimeTrack. These apps streamline workflows for travel claims, internal support, partnership management, and time tracking within UNCDF.Key Features and Tools:Data Mapping for Salesforce CRM Migration: Structured and mapped data flows to ensure compatibility and seamless migration to Salesforce CRM.Python for Data Cleaning and Transformation: Utilized pandas, numpy, and APIs to clean, preprocess, and transform raw datasets into standardized formats.Power BI Dashboards: Designed interactive dashboards to visualize workflows and monitor performance metrics for decision-making.Collaboration Across Platforms: Integrated Google Collab for code collaboration and Microsoft Excel for data validation and analysis.

  4. Graph Database Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Graph Database Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-graph-database-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Sep 22, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Graph Database Market Outlook



    The global graph database market size was valued at USD 1.5 billion in 2023 and is projected to reach USD 8.5 billion by 2032, growing at a CAGR of 21.2% from 2024 to 2032. The substantial growth of this market is driven primarily by increasing data complexity, advancements in data analytics technologies, and the rising need for more efficient database management systems.



    One of the primary growth factors for the graph database market is the exponential increase in data generation. As organizations generate vast amounts of data from various sources such as social media, e-commerce platforms, and IoT devices, the need for sophisticated data management and analysis tools becomes paramount. Traditional relational databases struggle to handle the complexity and interconnectivity of this data, leading to a shift towards graph databases which excel in managing such intricate relationships.



    Another significant driver is the growing adoption of artificial intelligence (AI) and machine learning (ML) technologies. These technologies rely heavily on connected data for predictive analytics and decision-making processes. Graph databases, with their inherent ability to model relationships between data points effectively, provide a robust foundation for AI and ML applications. This synergy between AI/ML and graph databases further accelerates market growth.



    Additionally, the increasing prevalence of personalized customer experiences across industries like retail, finance, and healthcare is fueling demand for graph databases. Businesses are leveraging graph databases to analyze customer behaviors, preferences, and interactions in real-time, enabling them to offer tailored recommendations and services. This enhanced customer experience translates to higher customer satisfaction and retention, driving further adoption of graph databases.



    From a regional perspective, North America currently holds the largest market share due to early adoption of advanced technologies and the presence of key market players. However, significant growth is also anticipated in the Asia-Pacific region, driven by rapid digital transformation, increasing investments in IT infrastructure, and growing awareness of the benefits of graph databases. Europe is also expected to witness steady growth, supported by stringent data management regulations and a strong focus on data privacy and security.



    Component Analysis



    The graph database market can be segmented into two primary components: software and services. The software segment holds the largest market share, driven by extensive adoption across various industries. Graph database software is designed to create, manage, and query graph databases, offering features such as scalability, high performance, and efficient handling of complex data relationships. The growth in this segment is propelled by continuous advancements and innovations in graph database technologies. Companies are increasingly investing in research and development to enhance the capabilities of their graph database software products, catering to the evolving needs of their customers.



    On the other hand, the services segment is also witnessing substantial growth. This segment includes consulting, implementation, and support services provided by vendors to help organizations effectively deploy and manage graph databases. As businesses recognize the benefits of graph databases, the demand for expert services to ensure successful implementation and integration into existing systems is rising. Additionally, ongoing support and maintenance services are crucial for the smooth operation of graph databases, driving further growth in this segment.



    The increasing complexity of data and the need for specialized expertise to manage and analyze it effectively are key factors contributing to the growth of the services segment. Organizations often lack the in-house skills required to harness the full potential of graph databases, prompting them to seek external assistance. This trend is particularly evident in large enterprises, where the scale and complexity of data necessitate robust support services.



    Moreover, the services segment is benefiting from the growing trend of outsourcing IT functions. Many organizations are opting to outsource their database management needs to specialized service providers, allowing them to focus on their core business activities. This shift towards outsourcing is further bolstering the demand for graph database services, driving market growth.


    &l

  5. Big Data Technology Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Big Data Technology Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-big-data-technology-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Big Data Technology Market Outlook




    The global big data technology market size was valued at approximately $162 billion in 2023 and is projected to reach around $471 billion by 2032, growing at a Compound Annual Growth Rate (CAGR) of 12.6% during the forecast period. The growth of this market is primarily driven by the increasing demand for data analytics and insights to enhance business operations, coupled with advancements in AI and machine learning technologies.




    One of the principal growth factors of the big data technology market is the rapid digital transformation across various industries. Businesses are increasingly recognizing the value of data-driven decision-making processes, leading to the widespread adoption of big data analytics. Additionally, the proliferation of smart devices and the Internet of Things (IoT) has led to an exponential increase in data generation, necessitating robust big data solutions to analyze and extract meaningful insights. Organizations are leveraging big data to streamline operations, improve customer engagement, and gain a competitive edge.




    Another significant growth driver is the advent of advanced technologies like artificial intelligence (AI) and machine learning (ML). These technologies are being integrated into big data platforms to enhance predictive analytics and real-time decision-making capabilities. AI and ML algorithms excel at identifying patterns within large datasets, which can be invaluable for predictive maintenance in manufacturing, fraud detection in banking, and personalized marketing in retail. The combination of big data with AI and ML is enabling organizations to unlock new revenue streams, optimize resource utilization, and improve operational efficiency.




    Moreover, regulatory requirements and data privacy concerns are pushing organizations to adopt big data technologies. Governments worldwide are implementing stringent data protection regulations, like the General Data Protection Regulation (GDPR) in Europe and the California Consumer Privacy Act (CCPA) in the United States. These regulations necessitate robust data management and analytics solutions to ensure compliance and avoid hefty fines. As a result, organizations are investing heavily in big data platforms that offer secure and compliant data handling capabilities.



    As organizations continue to navigate the complexities of data management, the role of Big Data Professional Services becomes increasingly critical. These services offer specialized expertise in implementing and managing big data solutions, ensuring that businesses can effectively harness the power of their data. Professional services encompass a range of offerings, including consulting, system integration, and managed services, tailored to meet the unique needs of each organization. By leveraging the knowledge and experience of big data professionals, companies can optimize their data strategies, streamline operations, and achieve their business objectives more efficiently. The demand for these services is driven by the growing complexity of big data ecosystems and the need for seamless integration with existing IT infrastructure.




    Regionally, North America holds a dominant position in the big data technology market, primarily due to the early adoption of advanced technologies and the presence of key market players. The Asia Pacific region is expected to witness the highest growth rate during the forecast period, driven by increasing digitalization, the rapid growth of industries such as e-commerce and telecommunications, and supportive government initiatives aimed at fostering technological innovation.



    Component Analysis




    The big data technology market is segmented into software, hardware, and services. The software segment encompasses data management software, analytics software, and data visualization tools, among others. This segment is expected to witness substantial growth due to the increasing demand for data analytics solutions that can handle vast amounts of data. Advanced analytics software, in particular, is gaining traction as organizations seek to gain deeper insights and make data-driven decisions. Companies are increasingly adopting sophisticated data visualization tools to present complex data in an easily understandable format, thereby enhancing decision-making processes.


    <br /&

  6. Global Spreadsheet Software Market Size By Type of Software, By Deployment...

    • verifiedmarketresearch.com
    Updated Oct 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Global Spreadsheet Software Market Size By Type of Software, By Deployment Mode, By Industry Vertical, By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/spreadsheet-software-market/
    Explore at:
    Dataset updated
    Oct 9, 2024
    Dataset provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    Authors
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    Spreadsheet Software Market Size And Forecast

    Spreadsheet Software Market size was valued at USD 10.05 Billion in 2023 and is expected to reach USD 14.55 Billion by 2031, with a CAGR of 7.8% from 2024-2031.

    Global Spreadsheet Software Market Drivers

    The market drivers for the Spreadsheet Software Market can be influenced by various factors. These may include:

    Increasing Data Volume: As organizations generate and collect more data, the need for efficient data analysis and management tools, such as spreadsheet software, grows. Rising Demand for Data Visualization: Users increasingly seek sophisticated tools to visualize data for better insights. Spreadsheet software can provide charts and graphs, making data interpretation easier.

    Global Spreadsheet Software Market Restraints

    Several factors can act as restraints or challenges for the Spreadsheet Software Market, These may include:

    Market Saturation: Many organizations already use established spreadsheet software such as Microsoft Excel or Google Sheets. The reliance on these platforms can make it difficult for new entrants or alternative solutions to capture market share. High Competition: The market is highly competitive, with numerous players offering similar features and functionalities. This can lead to price wars and reduced profit margins for software providers.

  7. s

    semantic knowledge graphing Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jul 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). semantic knowledge graphing Report [Dataset]. https://www.datainsightsmarket.com/reports/semantic-knowledge-graphing-472152
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    Jul 5, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    CA
    Variables measured
    Market Size
    Description

    The semantic knowledge graph market is experiencing robust growth, driven by the increasing need for organizations to derive actionable insights from complex, unstructured data. The market, estimated at $5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching approximately $25 billion by 2033. This expansion is fueled by several key factors. Firstly, the proliferation of big data necessitates efficient data management and knowledge extraction tools; semantic knowledge graphs excel in this arena by organizing information into easily understandable and interlinked structures. Secondly, advancements in artificial intelligence (AI) and machine learning (ML) are enhancing the capabilities of semantic knowledge graphs, improving their ability to process and analyze ever-increasing volumes of data. Thirdly, the growing adoption of cloud-based solutions is simplifying deployment and accessibility, further driving market growth. Key players like Microsoft, Google, and Yandex are heavily investing in this technology, creating a competitive yet innovative landscape. However, challenges remain, including the complexity of implementing these systems, high initial investment costs, and the need for skilled professionals to manage and interpret the resulting knowledge graphs. Despite these restraints, the long-term prospects for the semantic knowledge graph market are incredibly positive. The increasing demand for improved data governance, enhanced business intelligence, and personalized customer experiences will continue to fuel adoption across various sectors, including finance, healthcare, and manufacturing. The market segmentation is expected to evolve, with increasing specialization in specific industry verticals and the development of more sophisticated analytics tools built on top of semantic knowledge graph technologies. The focus will likely shift towards the integration of semantic knowledge graphs with other emerging technologies such as blockchain and the Internet of Things (IoT) to unlock even greater value from data. This convergence will lead to the emergence of smarter and more autonomous systems capable of decision-making based on comprehensive, contextualized knowledge. Regions like North America and Europe are anticipated to maintain significant market shares, though Asia-Pacific is projected to witness substantial growth driven by increasing digitalization and technological advancements.

  8. u

    Data from: Low-Disturbance Manure Incorporation

    • agdatacommons.nal.usda.gov
    xlsx
    Updated Feb 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jessica Sherman; William Jokela; Carol Barford (2024). Low-Disturbance Manure Incorporation [Dataset]. http://doi.org/10.15482/USDA.ADC/1401975
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 8, 2024
    Dataset provided by
    Ag Data Commons
    Authors
    Jessica Sherman; William Jokela; Carol Barford
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The LDMI experiment (Low-Disturbance Manure Incorporation) was designed to evaluate nutrient losses with conventional and improved liquid dairy manure management practices in a corn silage (Zea mays) / rye cover-crop (Secale cereale) system. The improved manure management treatments were designed to incorporate manure while maintaining crop residue for erosion control. Field observations included greenhouse gas (GHG) fluxes from soil, soil nutrient concentrations, crop growth and harvest biomass and nutrient content, as well as monitoring of soil physical and chemical properties. Observations from LDMI have been used for parameterization and validation of computer simulation models of GHG emissions from dairy farms (Gaillard et al., submitted). The LDMI experiment was performed as part of the Dairy CAP, described below. The experiment included ten different treatments: (1) broadcast manure with disk-harrow incorporation, (2) broadcast manure with no tillage incorporation, (3) manure application with “strip-tillage” which was sweep injection ridged with paired disks, (4) aerator band manure application, (5) low-disturbance sweep injection of manure, (6) Coulter injection of manure with sweep tillage, (7) no manure with urea to supply 60 lb N/acre (67 kg N/ha), (8) no manure with urea to supply 120 lb N/acre (135 kg N/ha), (9) no manure with urea to supply 180 lb N/acre (202 kg N/ha), (10) no manure / no fertilizer control. Manure was applied in the fall; fertilizer was applied in the spring. These ten treatments were replicated four times in a randomized complete block design. The LDMI experiment was conducted at the Marshfield Research Station of the University of Wisconsin and the USDA Agricultural Research Service (ARS) in Stratford, WI (Marathon County, Latitude 44.7627, Longitude -90.0938). Soils at the research station are from the Withee soil series, fine-loamy, mixed, superactive, frigid Aquic Glossudalf. Each experimental plot was approximately 70 square meters. A weather station was located at the south edge of field site. A secondary weather station (MARS South), for snow and snow water equivalence data and for backup of the main weather station, was located at Latitude 44.641445 and Longitude -90.133526 (16,093 meters southwest of the field site). The experiment was initiated on November 28, 2011 with fall tillage and manure application in each plot according to its treatment type. Each spring, corn silage was planted in rows at a rate of 87500 plants per hectare. The cultivar was Pioneer P8906HR. The LDMI experiment ended on November 30, 2015. The manure applied in this experiment was from the dairy herd at the Marshfield Research Station. Cows were fed a diet of 48% dry matter, 17.45% protein, and 72.8% total digestible nutrients. Liquid slurry manure, including feces, urine, and rain, was collected and stored in a lagoon on the site. Manure was withdrawn from the lagoon, spread on the plots and sampled for analysis all on the same day, once per year. Manure samples were analyzed at the University of Wisconsin Soil and Forage Lab in Marshfield (NH4-N, total P and total K) and at the Marshfield ARS (pH, dry matter, volatile solids, total N and total C). GHG fluxes from soil (CO2, CH4, N2O) were measured using static chambers as described in Parkin and Venterea (2010). Measurements were made with the chambers placed across the rows of corn. I Additional soil chemical and physical characteristics were measured as noted in the data dictionary and other metadata of the LDMI data set, included here. This experiment was part of “Climate Change Mitigation and Adaptation in Dairy Production Systems of the Great Lakes Region,” also known as the Dairy Coordinated Agricultural Project (Dairy CAP), funded by the United States Department of Agriculture - National Institute of Food and Agriculture (award number 2013-68002-20525). The main goal of the Dairy CAP was to improve understanding of the magnitudes and controlling factors over GHG emissions from dairy production in the Great Lakes region. Using this knowledge, the Dairy CAP has improved life cycle analysis (LCA) of GHG production by Great Lakes dairy farms, developing farm management tools, and conducting extension, education and outreach activities. Resources in this dataset:Resource Title: Data_dictionary_DairyCAP_LDMI. File Name: Data_dictionary_DairyCAP_LDMI.xlsxResource Description: This is the data dictionary for the Low-Disturbance Manure Incorporation (LDMI) experiment, conducted at the USDA-ARS research station in Marshfield, WI. (Separate spreadsheet tabs)Resource Software Recommended: Microsoft Excel 2016,url: https://products.office.com/en-us/excel Resource Title: DairyCAP_LDMI. File Name: DairyCAP_LDMI.xlsxResource Description: This is the data from the Low-Disturbance Manure Incorporation (LDMI) experiment, conducted at the USDA-ARS research station in Marshfield, WI.Resource Software Recommended: Microsoft Excel 2016,url: https://products.office.com/en-us/excel Resource Title: Data Dictionary DairyCAP LDMI. File Name: Data_dictionary_DairyCAP_LDMI.csvResource Description: This is the data dictionary for the Low-Disturbance Manure Incorporation (LDMI) experiment, conducted at the USDA-ARS research station in Marshfield, WI.

    Resource Title: Biomass Data. File Name: LDMI_Biomass.csvResource Title: Experimental Set-up Data. File Name: LDMI_Exp_setup.csvResource Title: Gas Flux Data. File Name: LDMI_Gas_Fluxes.csvResource Title: Management History Data. File Name: LDMI_Management_History.csvResource Title: Manure Analysis Data. File Name: LDMI_Manure_Analysis.csvResource Title: Soil Chemical Data. File Name: LDMI_Soil_Chem.csvResource Title: Soil Physical Data. File Name: LDMI_Soil_Phys.csvResource Title: Weather Data. File Name: LDMI_Weather.csv

  9. n

    Data from: Designing data science workshops for data-intensive environmental...

    • data.niaid.nih.gov
    • zenodo.org
    • +1more
    zip
    Updated Dec 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allison Theobold; Stacey Hancock; Sara Mannheimer (2020). Designing data science workshops for data-intensive environmental science research [Dataset]. http://doi.org/10.5061/dryad.7wm37pvp7
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 8, 2020
    Dataset provided by
    Montana State University
    California State Polytechnic University
    Authors
    Allison Theobold; Stacey Hancock; Sara Mannheimer
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Over the last 20 years, statistics preparation has become vital for a broad range of scientific fields, and statistics coursework has been readily incorporated into undergraduate and graduate programs. However, a gap remains between the computational skills taught in statistics service courses and those required for the use of statistics in scientific research. Ten years after the publication of "Computing in the Statistics Curriculum,'' the nature of statistics continues to change, and computing skills are more necessary than ever for modern scientific researchers. In this paper, we describe research on the design and implementation of a suite of data science workshops for environmental science graduate students, providing students with the skills necessary to retrieve, view, wrangle, visualize, and analyze their data using reproducible tools. These workshops help to bridge the gap between the computing skills necessary for scientific research and the computing skills with which students leave their statistics service courses. Moreover, though targeted to environmental science graduate students, these workshops are open to the larger academic community. As such, they promote the continued learning of the computational tools necessary for working with data, and provide resources for incorporating data science into the classroom.

    Methods Surveys from Carpentries style workshops the results of which are presented in the accompanying manuscript.

    Pre- and post-workshop surveys for each workshop (Introduction to R, Intermediate R, Data Wrangling in R, Data Visualization in R) were collected via Google Form.

    The surveys administered for the fall 2018, spring 2019 academic year are included as pre_workshop_survey and post_workshop_assessment PDF files. 
    The raw versions of these data are included in the Excel files ending in survey_raw or assessment_raw.
    
      The data files whose name includes survey contain raw data from pre-workshop surveys and the data files whose name includes assessment contain raw data from the post-workshop assessment survey.
    
    
    The annotated RMarkdown files used to clean the pre-workshop surveys and post-workshop assessments are included as workshop_survey_cleaning and workshop_assessment_cleaning, respectively. 
    The cleaned pre- and post-workshop survey data are included in the Excel files ending in clean. 
    The summaries and visualizations presented in the manuscript are included in the analysis annotated RMarkdown file.
    
  10. "9,565 Top-Rated Movies Dataset"

    • kaggle.com
    Updated Aug 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harshit@85 (2024). "9,565 Top-Rated Movies Dataset" [Dataset]. https://www.kaggle.com/datasets/harshit85/9565-top-rated-movies-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Harshit@85
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    About the Dataset

    Title: 9,565 Top-Rated Movies Dataset

    Description:
    This dataset offers a comprehensive collection of 9,565 of the highest-rated movies according to audience ratings on the Movie Database (TMDb). The dataset includes detailed information about each movie, such as its title, overview, release date, popularity score, average vote, and vote count. It is designed to be a valuable resource for anyone interested in exploring trends in popular cinema, analyzing factors that contribute to a movie’s success, or building recommendation engines.

    Key Features: - Title: The official title of each movie. - Overview: A brief synopsis or description of the movie's plot. - Release Date: The release date of the movie, formatted as YYYY-MM-DD. - Popularity: A score indicating the current popularity of the movie on TMDb, which can be used to gauge current interest. - Vote Average: The average rating of the movie, based on user votes. - Vote Count: The total number of votes the movie has received.

    Data Source: The data was sourced from the TMDb API, a well-regarded platform for movie information, using the /movie/top_rated endpoint. The dataset represents a snapshot of the highest-rated movies as of the time of data collection.

    Data Collection Process: - API Access: Data was retrieved programmatically using TMDb’s API. - Pagination Handling: Multiple API requests were made to cover all pages of top-rated movies, ensuring the dataset’s comprehensiveness. - Data Aggregation: Collected data was aggregated into a single, unified dataset using the pandas library. - Cleaning: Basic data cleaning was performed to remove duplicates and handle missing or malformed data entries.

    Potential Uses: - Trend Analysis: Analyze trends in movie ratings over time or compare ratings across different genres. - Recommendation Systems: Build and train models to recommend movies based on user preferences. - Sentiment Analysis: Perform text analysis on movie overviews to understand common themes and sentiments. - Statistical Analysis: Explore the relationship between popularity, vote count, and average ratings.

    Data Format: The dataset is provided in a structured tabular format (e.g., CSV), making it easy to load into data analysis tools like Python, R, or Excel.

    Usage License: The dataset is shared under [appropriate license], ensuring that it can be used for educational, research, or commercial purposes, with proper attribution to the data source (TMDb).

    This description provides a clear and detailed overview, helping potential users understand the dataset's content, origin, and potential applications.

  11. m

    Lean Manufacturing Tools Applied to Continuous Improvement and its Impact on...

    • data.mendeley.com
    Updated Mar 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jorge Luis García-Alcaraz (2023). Lean Manufacturing Tools Applied to Continuous Improvement and its Impact on Economic Sustainability [Dataset]. http://doi.org/10.17632/c6cjpb4yxs.1
    Explore at:
    Dataset updated
    Mar 20, 2023
    Authors
    Jorge Luis García-Alcaraz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In this Excel file the raw information is added, in addition, the model output, T-Ratios, Model Fit and Quality index, Reliabilities and Discriminant Validity are added. This information corresponds to the article entitled "Lean Manufacturing Tools Applied to Continuous Improvement and its Impact on Economic Sustainability"

  12. Students Test Data

    • kaggle.com
    Updated Sep 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ATHARV BHARASKAR (2023). Students Test Data [Dataset]. https://www.kaggle.com/datasets/atharvbharaskar/students-test-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ATHARV BHARASKAR
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    Dataset Overview: This dataset pertains to the examination results of students who participated in a series of academic assessments at a fictitious educational institution named "University of Exampleville." The assessments were administered across various courses and academic levels, with a focus on evaluating students' performance in general management and domain-specific topics.

    Columns: The dataset comprises 12 columns, each representing specific attributes and performance indicators of the students. These columns encompass information such as the students' names (which have been anonymized), their respective universities, academic program names (including BBA and MBA), specializations, the semester of the assessment, the type of examination domain (general management or domain-specific), general management scores (out of 50), domain-specific scores (out of 50), total scores (out of 100), student ranks, and percentiles.

    Data Collection: The examination data was collected during a standardized assessment process conducted by the University of Exampleville. The exams were designed to assess students' knowledge and skills in general management and their chosen domain-specific subjects. It involved students from both BBA and MBA programs who were in their final year of study.

    Data Format: The dataset is available in a structured format, typically as a CSV file. Each row represents a unique student's performance in the examination, while columns contain specific information about their results and academic details.

    Data Usage: This dataset is valuable for analyzing and gaining insights into the academic performance of students pursuing BBA and MBA degrees. It can be used for various purposes, including statistical analysis, performance trend identification, program assessment, and comparison of scores across domains and specializations. Furthermore, it can be employed in predictive modeling or decision-making related to curriculum development and student support.

    Data Quality: The dataset has undergone preprocessing and anonymization to protect the privacy of individual students. Nevertheless, it is essential to use the data responsibly and in compliance with relevant data protection regulations when conducting any analysis or research.

    Data Format: The exam data is typically provided in a structured format, commonly as a CSV (Comma-Separated Values) file. Each row in the dataset represents a unique student's examination performance, and each column contains specific attributes and scores related to the examination. The CSV format allows for easy import and analysis using various data analysis tools and programming languages like Python, R, or spreadsheet software like Microsoft Excel.

    Here's a column-wise description of the dataset:

    Name OF THE STUDENT: The full name of the student who took the exam. (Anonymized)

    UNIVERSITY: The university where the student is enrolled.

    PROGRAM NAME: The name of the academic program in which the student is enrolled (BBA or MBA).

    Specialization: If applicable, the specific area of specialization or major that the student has chosen within their program.

    Semester: The semester or academic term in which the student took the exam.

    Domain: Indicates whether the exam was divided into two parts: general management and domain-specific.

    GENERAL MANAGEMENT SCORE (OUT of 50): The score obtained by the student in the general management part of the exam, out of a maximum possible score of 50.

    Domain-Specific Score (Out of 50): The score obtained by the student in the domain-specific part of the exam, also out of a maximum possible score of 50.

    TOTAL SCORE (OUT of 100): The total score obtained by adding the scores from the general management and domain-specific parts, out of a maximum possible score of 100.

  13. d

    CEE6410 Semester project: Centralizing community water management in Costa...

    • search.dataone.org
    • hydroshare.org
    Updated Dec 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriela Tatiana Sancho Juarez (2024). CEE6410 Semester project: Centralizing community water management in Costa Rica [Dataset]. https://search.dataone.org/view/sha256%3A6266f2bcf5a99c9e3e10ff54091cd8fbae068f19c380df547d4b9723f21279ee
    Explore at:
    Dataset updated
    Dec 21, 2024
    Dataset provided by
    Hydroshare
    Authors
    Gabriela Tatiana Sancho Juarez
    Area covered
    Costa Rica
    Description

    Abstract Centralizing allocation of water can potentially benefit community water service. Water managers need robust data, modeling, and decision support tools to best allocate water. This report studies optimization modeling as a potential solution to reduce the amount of community water service systems (ASADAS) in Alajuela Costa Rica. Currently there are numerous ASADAS, which represents a challenge for management and adequate governmental support. Three systems are used as an example. The objectives of this project are: (a) Create an optimization model to improve water delivery in decentralized community water systems in Alajuela, Costa Rica. (b) Integrate environmental water constraints in optimization model. This model can be used by decision makers to join ASADAS and best allocate water among them, including ways of balancing the environmental and human water requirements. This model identified 5 links between water sources and communities; governmental accompaniment is needed to facilitate the systems joins within communities.

    Files: Semester Project Report: written report. GAMS code: script for the optimization model. Data set: excel spreadsheet, data used for the optimization model.

  14. c

    Data from: Usage patterns of the fraud risk management framework within...

    • esango.cput.ac.za
    Updated Mar 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Efile (2025). Usage patterns of the fraud risk management framework within small retail enterprises operating in Cape Town [Dataset]. http://doi.org/10.25381/cput.28622225.v1
    Explore at:
    Dataset updated
    Mar 28, 2025
    Dataset provided by
    Cape Peninsula University of Technology
    Authors
    Jonathan Efile
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    Cape Town
    Description

    Clearance Certificate No | 2021_FBMSREC_013This dataset includes all the information collected from participants, the dataset was coded using Excel to simplify it for analysis purpose. Furthermore, the analytical tools implemented to get the results from the coded data are SPSS Package and MS Excel Package.

  15. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Raghad Al-marshadi (2025). Instagram Reach Analysis - Excel Project [Dataset]. https://www.kaggle.com/datasets/raghadalmarshadi/instagram-reach-analysis-excel-project/code
Organization logo

Instagram Reach Analysis - Excel Project

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 14, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Raghad Al-marshadi
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

📊 Instagram Reach Analysis | تحليل الوصول في إنستغرام

An exploratory data analysis project using Excel to understand what influences Instagram post reach and engagement.
مشروع تحليل استكشافي لفهم العوامل المؤثرة في وصول منشورات إنستغرام وتفاعل المستخدمين، باستخدام Excel.

📁 Project Description | وصف المشروع

This project uses an Instagram dataset imported from Kaggle to explore how different factors like hashtags, saves, shares, and caption length influence impressions and engagement.
يستخدم هذا المشروع بيانات من إنستغرام تم استيرادها من منصة Kaggle لتحليل كيف تؤثر عوامل مثل الهاشتاقات، الحفظ، المشاركة، وطول التسمية التوضيحية في عدد مرات الظهور والتفاعل.

🛠️ Tools Used | الأدوات المستخدمة

  • Microsoft Excel
  • Pivot Tables
  • TRIM, WRAP, and other Excel formulas
  • مايكروسوفت إكسل
  • الجداول المحورية
  • دوال مثل TRIM و WRAP وغيرها في Excel

🧹 Data Cleaning | تنظيف البيانات

  • Removed unnecessary spaces using TRIM
  • Removed 17 duplicate rows → 103 unique rows remained
  • Standardized formatting: freeze top row, wrap text, center align

  • إزالة المسافات غير الضرورية باستخدام TRIM

  • حذف 17 صفًا مكررًا → تبقى 103 صفوف فريدة

  • تنسيق موحد: تثبيت الصف الأول، لف النص، وتوسيط المحتوى

🔍 Key Analysis Highlights | أبرز نتائج التحليل

1. Impressions by Source | مرات الظهور حسب المصدر

  • Highest reach: Home > Hashtags > Explore > Other
  • Some totals exceed 100% due to overlapping

2. Engagement Insights | رؤى حول التفاعل

  • Saves strongly correlate with higher impressions
  • Caption length is inversely related to likes
  • Shares have weak correlation with impressions

3. Hashtag Patterns | تحليل الهاشتاقات

  • Most used: #Thecleverprogrammer, #Amankharwal, #Python
  • Repeating hashtags does not guarantee higher reach

✅ Conclusion | الخلاصة

Shorter captions and higher save counts contribute more to reach than repeated hashtags. Profile visits are often linked to new followers.
العناوين القصيرة وعدد الحفظات تلعب دورًا أكبر في الوصول من تكرار الهاشتاقات. كما أن زيارات الملف الشخصي ترتبط غالبًا بزيادة المتابعين.

👩‍💻 Author | المؤلفة

Raghad's LinkedIn

🧠 Inspiration | الإلهام

Inspired by content from TheCleverProgrammer, Aman Kharwal, and Kaggle datasets.
استُلهم المشروع من محتوى TheCleverProgrammer وأمان خروال، وبيانات من Kaggle.

💬 Feedback | الملاحظات

Feel free to open an issue or share suggestions!
يسعدنا تلقي ملاحظاتكم واقتراحاتكم عبر صفحة المشروع.

Search
Clear search
Close search
Google apps
Main menu