100+ datasets found
  1. m

    Strategies for Using Websites to Support Programming and Their Impact on...

    • figshare.manchester.ac.uk
    mp4
    Updated Sep 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Strategies for Using Websites to Support Programming and Their Impact on Source Code: Video files. [Dataset]. https://figshare.manchester.ac.uk/articles/dataset/Strategies_for_Using_Websites_to_Support_Programming_and_Their_Impact_on_Source_Code_Video_files_/20379366
    Explore at:
    mp4Available download formats
    Dataset updated
    Sep 15, 2023
    Dataset provided by
    University of Manchester
    Authors
    Omar Alghamdi; Sarah Clinch; Mohammed Alhamadi; Caroline Jay
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An observation study was conducted with undergraduate students. The study includes a video analysis of coding activities, an analysis of the source code written during coding, and a structured interview with participants. This dataset contains video recordings of participants coding with the websites. It also contains the interview questions along with transcriptions. In addition, it contains the source code resulting from the participants.

  2. h

    code-search-net-javascript

    • huggingface.co
    Updated Nov 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fernando Tarin Morales (2023). code-search-net-javascript [Dataset]. https://huggingface.co/datasets/Nan-Do/code-search-net-javascript
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 13, 2023
    Authors
    Fernando Tarin Morales
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for "code-search-net-javascript"

      Dataset Summary
    

    This dataset is the JavaScript portion of the CodeSarchNet annotated with a summary column.The code-search-net dataset includes open source functions that include comments found at GitHub.The summary is a short description of what the function does.

      Languages
    

    The dataset's comments are in English and the functions are coded in JavaScript

      Data Splits
    

    Train, test, validation labels are… See the full description on the dataset page: https://huggingface.co/datasets/Nan-Do/code-search-net-javascript.

  3. Z

    Network Traffic Analysis: Data and Code

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Honig, Joshua (2024). Network Traffic Analysis: Data and Code [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11479410
    Explore at:
    Dataset updated
    Jun 12, 2024
    Dataset provided by
    Moran, Madeline
    Honig, Joshua
    Soni, Shreena
    Chan-Tin, Eric
    Homan, Sophia
    Ferrell, Nathan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Code:

    Packet_Features_Generator.py & Features.py

    To run this code:

    pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j

    -h, --help show this help message and exit -i TXTFILE input text file -x X Add first X number of total packets as features. -y Y Add first Y number of negative packets as features. -z Z Add first Z number of positive packets as features. -ml Output to text file all websites in the format of websiteNumber1,feature1,feature2,... -s S Generate samples using size s. -j

    Purpose:

    Turns a text file containing lists of incomeing and outgoing network packet sizes into separate website objects with associative features.

    Uses Features.py to calcualte the features.

    startMachineLearning.sh & machineLearning.py

    To run this code:

    bash startMachineLearning.sh

    This code then runs machineLearning.py in a tmux session with the nessisary file paths and flags

    Options (to be edited within this file):

    --evaluate-only to test 5 fold cross validation accuracy

    --test-scaling-normalization to test 6 different combinations of scalers and normalizers

    Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use

    --grid-search to test the best grid search hyperparameters - note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:' - once best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'

    Purpose:

    Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and provides results using cross validation. These results include the best scaling and normailzation options for each data set as well as the best grid search hyperparameters based on the provided ranges.

    Data

    Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queres (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality head set.

    Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:

    First number is a classification number to denote what website, query, or vr action is taking place.

    The remaining numbers in each line denote:

    The size of a packet,

    and the direction it is traveling.

    negative numbers denote incoming packets

    positive numbers denote outgoing packets

    Figure 4 Data

    This data uses specific lines from the Virtual Reality.txt file.

    The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.

    The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.

    The .xlsx and .csv file are identical

    Each file includes (from right to left):

    The origional packet data,

    each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,

    and the final Cumulative Distrubution Function (CDF) caluclation that generated the Figure 4 Graph.

  4. m

    No Code Website Builder Tools Market Size, Share & Trends Analysis 2033

    • marketresearchintellect.com
    Updated Jun 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2024). No Code Website Builder Tools Market Size, Share & Trends Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/no-code-website-builder-tools-market/
    Explore at:
    Dataset updated
    Jun 25, 2024
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Dive into Market Research Intellect's No Code Website Builder Tools Market Report, valued at USD 5.2 billion in 2024, and forecast to reach USD 12.3 billion by 2033, growing at a CAGR of 10.5% from 2026 to 2033.

  5. G

    Programs and Code for Geothermal Exploration Artificial Intelligence

    • gdr.openei.org
    • data.openei.org
    • +1more
    archive, code
    Updated Apr 27, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jim Moraga; Jim Moraga (2021). Programs and Code for Geothermal Exploration Artificial Intelligence [Dataset]. http://doi.org/10.15121/1787330
    Explore at:
    archive, codeAvailable download formats
    Dataset updated
    Apr 27, 2021
    Dataset provided by
    USDOE Office of Energy Efficiency and Renewable Energy (EERE), Renewable Power Office. Geothermal Technologies Program (EE-4G)
    Geothermal Data Repository
    Colorado School of Mines
    Authors
    Jim Moraga; Jim Moraga
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The scripts below are used to run the Geothermal Exploration Artificial Intelligence developed within the "Detection of Potential Geothermal Exploration Sites from Hyperspectral Images via Deep Learning" project. It includes all scripts for pre-processing and processing, including: - Land Surface Temperature K-Means classifier - Labeling AI using Self Organizing Maps (SOM) - Post-processing for Permanent Scatterer InSAR (PSInSAR) analysis with SOM - Mineral marker summarizing - Artificial Intelligence (AI) Data splitting: creates data set from a single raster file - Artificial Intelligence Model: creates AI from a single data set, after splitting in Train, Validation and Test subsets - AI Mapper: creates a classification map based on a raster file

  6. N

    No Code Website Builder Tools Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Jan 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). No Code Website Builder Tools Report [Dataset]. https://www.archivemarketresearch.com/reports/no-code-website-builder-tools-11250
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Jan 23, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global no-code website builder tools market is projected to reach $43.7 billion by 2033, exhibiting a CAGR of 12.4% during the forecast period. The increasing adoption of digital platforms by businesses and the growing demand for user-friendly website development tools are driving the market growth. The availability of a wide range of templates and drag-and-drop functionality makes no-code website builders accessible to individuals and businesses with limited technical expertise. Key trends shaping the market include the integration of artificial intelligence (AI) and machine learning (ML) to enhance the user experience, the rise of headless CMS platforms that enable greater flexibility and scalability, and the growing popularity of cloud-based no-code website builders that offer convenience and cost-effectiveness. The market is segmented into various types, applications, and regions, with North America holding a significant share due to the presence of leading technology companies and a large number of small and medium-sized businesses. Major players in the market include Wix, Bubble, Webflow, Squarespace, and WordPress, among others, who are focusing on expanding their offerings, forming strategic partnerships, and investing in research and development to gain a competitive edge. The no-code website builder tools market is experiencing exponential growth, with its value projected to reach over $17.6 billion by 2026. These tools empower non-technical individuals and businesses to create professional-looking websites without the need for programming knowledge.

  7. m

    Web Code Merge Personnel Process

    • mygeohub.org
    Updated Oct 29, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erich Huebner (2014). Web Code Merge Personnel Process [Dataset]. http://doi.org/10.13019/M2WC7P
    Explore at:
    Dataset updated
    Oct 29, 2014
    Dataset provided by
    MyGeohub
    Authors
    Erich Huebner
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This document is to provide you with the guidelines and processes required for committing web code in the hub environment.

  8. w

    sub-net-code-application.site - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, sub-net-code-application.site - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/sub-net-code-application.site/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - May 29, 2025
    Description

    Explore the historical Whois records related to sub-net-code-application.site (Domain). Get insights into ownership history and changes over time.

  9. d

    Code Service - Site List

    • data.gov.tw
    csv
    Updated Jan 23, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of the Interior Land Surveying and Mapping Center (2013). Code Service - Site List [Dataset]. https://data.gov.tw/en/datasets/102010
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 23, 2013
    Dataset authored and provided by
    Ministry of the Interior Land Surveying and Mapping Center
    License

    https://data.gov.tw/licensehttps://data.gov.tw/license

    Description

    API URL: https://api.nlsc.gov.tw/other/ListLandSectionInput parameters: /County code/Township codeReturned results: XML format; including land section code, land code, and land nameAPI syntax example: https://api.nlsc.gov.tw/other/ListLandSection/B/B01

  10. v

    Global No Code Website Builder Tools Market Size By Type, By Application, By...

    • verifiedmarketresearch.com
    Updated Nov 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Global No Code Website Builder Tools Market Size By Type, By Application, By End-User, By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/no-code-website-builder-tools-market/
    Explore at:
    Dataset updated
    Nov 11, 2024
    Dataset authored and provided by
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    No Code Website Builder Tools Market size was valued at USD 1.97 Billion in 2023 and is estimated to reach USD 3.58 Billion by 2031, growing at a CAGR of 7.73 % from 2024 to 2031.

    Global No Code Website Builder Tools Market Drivers

    Growing Interest in Do-It-Yourself Website Creation: The demand for user-friendly, no-code platforms has increased as more companies and individuals want to build websites themselves rather than hiring professionals. These tools remove the technical obstacles typically connected with web building and enable consumers to create expert websites using drag-and-drop features and templates.

    Economicalness: By removing the need to hire qualified developers, no-code website builders drastically lower the cost of web development, making it more accessible to startups, small enterprises, and individual entrepreneurs. A wide spectrum of consumers, including huge corporations and solopreneurs, find this cost appealing.

  11. Website Screenshots Dataset

    • universe.roboflow.com
    zip
    Updated Aug 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roboflow (2022). Website Screenshots Dataset [Dataset]. https://universe.roboflow.com/roboflow-gw7yv/website-screenshots
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 19, 2022
    Dataset authored and provided by
    Roboflowhttps://roboflow.com/
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Variables measured
    Elements Bounding Boxes
    Description

    About This Dataset

    The Roboflow Website Screenshots dataset is a synthetically generated dataset composed of screenshots from over 1000 of the world's top websites. They have been automatically annotated to label the following classes: :fa-spacer: * button - navigation links, tabs, etc. * heading - text that was enclosed in <h1> to <h6> tags. * link - inline, textual <a> tags. * label - text labeling form fields. * text - all other text. * image - <img>, <svg>, or <video> tags, and icons. * iframe - ads and 3rd party content.

    Example

    This is an example image and annotation from the dataset: https://i.imgur.com/mOG3u3Z.png" alt="WIkipedia Screenshot">

    Usage

    Annotated screenshots are very useful in Robotic Process Automation. But they can be expensive to label. This dataset would cost over $4000 for humans to label on popular labeling services. We hope this dataset provides a good starting point for your project. Try it with a model from our model library.

    Collecting Custom Data

    Roboflow is happy to provide a custom screenshots dataset to meet your particular needs. We can crawl public or internal web applications. Just reach out and we'll be happy to provide a quote!

    About Roboflow

    Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless. :fa-spacer: Developers reduce 50% of their boilerplate code when using Roboflow's workflow, save training time, and increase model reproducibility. :fa-spacer:

    Roboflow Wordmark

  12. h

    rosetta-code

    • huggingface.co
    Updated Apr 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Farouk (2024). rosetta-code [Dataset]. https://huggingface.co/datasets/pharaouk/rosetta-code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 10, 2024
    Authors
    Farouk
    License

    https://choosealicense.com/licenses/gfdl/https://choosealicense.com/licenses/gfdl/

    Description

    Dataset Card for the Rosetta Code Dataset

      Dataset Summary
    

    Rosetta Code is a programming chrestomathy site. The idea is to present solutions to the same task in as many different languages as possible, to demonstrate how languages are similar and different, and to aid a person with a grounding in one approach to a problem in learning another. Rosetta Code currently has 1,203 tasks, 389 draft tasks, and is aware of 883 languages, though we do not (and cannot) have… See the full description on the dataset page: https://huggingface.co/datasets/pharaouk/rosetta-code.

  13. C

    Code Editor Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Code Editor Report [Dataset]. https://www.archivemarketresearch.com/reports/code-editor-51988
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Mar 6, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global code editor market is experiencing robust growth, driven by the increasing demand for software development and the rising adoption of cloud-based solutions. The market size in 2025 is estimated at $15 billion, exhibiting a Compound Annual Growth Rate (CAGR) of 12% from 2025 to 2033. This significant growth is fueled by several key factors, including the expansion of the software development industry, the rising popularity of programming languages, and the increasing demand for efficient and user-friendly code editing tools. The shift towards cloud-based and web-based code editors is a prominent trend, offering developers enhanced collaboration capabilities and accessibility. While the on-premises deployment model still holds a significant market share, particularly among enterprise users prioritizing data security, the web-based segment is projected to witness faster growth due to its inherent flexibility and cost-effectiveness. The market is segmented by application (personal and enterprise) and deployment (on-premises and web-based). Enterprise applications dominate the market currently, but the personal use segment is showing significant growth potential, particularly amongst independent developers and hobbyists. Competitive pressures are high, with both established players like Microsoft and GitHub, and smaller, niche players offering specialized features and functionalities. The growth trajectory is expected to continue throughout the forecast period, driven by advancements in Artificial Intelligence (AI) integration within code editors and the increasing demand for low-code/no-code development platforms. However, factors such as the high initial investment costs for enterprise-grade solutions and the need for continuous skill development among developers could pose some restraints to market expansion. The geographical distribution reveals a significant concentration of market share in North America and Europe, reflecting the advanced technological landscape and substantial software development activities in these regions. However, developing economies in Asia Pacific and other regions are expected to contribute significantly to market growth in the coming years, fueled by increasing digitalization efforts and the growing adoption of technology across diverse sectors.

  14. d

    Site code key for kelp forest community data collected along the coast of...

    • dataone.org
    • bco-dmo.org
    • +2more
    Updated Dec 5, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mark Carr; Tim Tinker (2021). Site code key for kelp forest community data collected along the coast of Monterey and Carmel, CA from 1999-2015 (Kelp Forest Resilience project) [Dataset]. https://dataone.org/datasets/sha256%3Ac619ae83dae57b0dfdb61efb74b0697173e4f1e9efd08e9417706c98cca60119
    Explore at:
    Dataset updated
    Dec 5, 2021
    Dataset provided by
    Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    Authors
    Mark Carr; Tim Tinker
    Area covered
    Description

    Site code key for M. Carr and T. Tinker Kelp Forest Resilience project.

  15. Z

    Data from: Does Location Influence Code Quality? Mining Stack Overflow...

    • data.niaid.nih.gov
    • ourarchive.otago.ac.nz
    Updated Apr 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymised (2025). Does Location Influence Code Quality? Mining Stack Overflow Snippets Across the United States – Replication Package [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13622419
    Explore at:
    Dataset updated
    Apr 2, 2025
    Dataset authored and provided by
    Anonymised
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Developers routinely integrate Stack Overflow code snippets into their codebases. However, the quality of snippets embedded in users’ answers remain elusive, and existing evaluations of code quality tend to be language or context-specific. Moreover, literature have found that contribution patterns vary depending on geographical locales, creating an unexplained rift between code quality, user location, and latent contextual regional factors.

    The proposed study evaluates the quality of SQL, JavaScript, Python, Ruby, and Java snippets across reliability, readability, performance, and security dimensions, benchmarking findings across states in the USA and investigating how different diversity indicators correlate against code quality violations. The study culminates in a series of inductive content analyses that qualitatively supplement prior quality dimensions.

    This replication package is provided for those interested in further examining our research methodology.

  16. πŸ•΅οΈ Phishing Websites Data

    • kaggle.com
    Updated Feb 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sairaj Adhav (2025). πŸ•΅οΈ Phishing Websites Data [Dataset]. https://www.kaggle.com/datasets/sai10py/phishing-websites-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 24, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sairaj Adhav
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Phishing Websites Dataset

    Overview

    This dataset is designed to aid in the analysis and detection of phishing websites. It contains various features that help distinguish between legitimate and phishing websites based on their structural, security, and behavioral attributes.

    Dataset Information

    • Total Columns: 31 (30 Features + 1 Target)
    • Target Variable: Result (Indicates whether a website is phishing or legitimate)

    Features Description

    URL-Based Features

    • Prefix_Suffix – Checks if the URL contains a hyphen (-), which is commonly used in phishing domains.
    • double_slash_redirecting – Detects if the URL redirects using //, which may indicate a phishing attempt.
    • having_At_Symbol – Identifies the presence of @ in the URL, which can be used to deceive users.
    • Shortining_Service – Indicates whether the URL uses a shortening service (e.g., bit.ly, tinyurl).
    • URL_Length – Measures the length of the URL; phishing URLs tend to be longer.
    • having_IP_Address – Checks if an IP address is used in place of a domain name, which is suspicious.

    Domain-Based Features

    • having_Sub_Domain – Evaluates the number of subdomains; phishing sites often have excessive subdomains.
    • SSLfinal_State – Indicates whether the website has a valid SSL certificate (secure connection).
    • Domain_registeration_length – Measures the duration of domain registration; phishing sites often have short lifespans.
    • age_of_domain – The age of the domain in days; older domains are usually more trustworthy.
    • DNSRecord – Checks if the domain has valid DNS records; phishing domains may lack these.

    Webpage-Based Features

    • Favicon – Determines if the website uses an external favicon (which can be a sign of phishing).
    • port – Identifies if the site is using suspicious or non-standard ports.
    • HTTPS_token – Checks if "HTTPS" is included in the URL but is used deceptively.
    • Request_URL – Measures the percentage of external resources loaded from different domains.
    • URL_of_Anchor – Analyzes anchor tags (<a> links) and their trustworthiness.
    • Links_in_tags – Examines <meta>, <script>, and <link> tags for external links.
    • SFH (Server Form Handler) – Determines if form actions are handled suspiciously.
    • Submitting_to_email – Checks if forms submit data directly to an email instead of a web server.
    • Abnormal_URL – Identifies if the website’s URL structure is inconsistent with common patterns.
    • Redirect – Counts the number of redirects; phishing websites may have excessive redirects.

    Behavior-Based Features

    • on_mouseover – Checks if the website changes content when hovered over (used in deceptive techniques).
    • RightClick – Detects if right-click functionality is disabled (phishing sites may disable it).
    • popUpWindow – Identifies the presence of pop-ups, which can be used to trick users.
    • Iframe – Checks if the website uses <iframe> tags, often used in phishing attacks.

    Traffic & Search Engine Features

    • web_traffic – Measures the website’s Alexa ranking; phishing sites tend to have low traffic.
    • Page_Rank – Google PageRank score; phishing sites usually have a low PageRank.
    • Google_Index – Checks if the website is indexed by Google (phishing sites may not be indexed).
    • Links_pointing_to_page – Counts the number of backlinks pointing to the website.
    • Statistical_report – Uses external sources to verify if the website has been reported for phishing.

    Target Variable

    • Result – The classification label (1: Legitimate, -1: Phishing)

    Usage

    This dataset is valuable for:
    βœ… Machine Learning Models – Developing classifiers for phishing detection.
    βœ… Cybersecurity Research – Understanding patterns in phishing attacks.
    βœ… Browser Security Extensions – Enhancing anti-phishing tools.

  17. Vulnerability Fix Dataset

    • kaggle.com
    Updated Feb 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JIS College of Engineering (2025). Vulnerability Fix Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/10658267
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 4, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    JIS College of Engineering
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Overview The Vulnerability Fix Dataset is a collection of 35,000 code snippets containing both vulnerable and fixed versions of code. The dataset focuses on common software security vulnerabilities and their corresponding fixes, making it highly valuable for research in secure coding practices, automated vulnerability detection, and software security analysis. ** Dataset Structure** This dataset consists of three main columns:

    vulnerability_type: The type of security vulnerability (e.g., SQL Injection, Cross-Site Scripting). vulnerable_code: The original code snippet containing the vulnerability. fixed_code: The secure version of the code with the vulnerability fixed. The dataset includes vulnerabilities across multiple programming languages, making it useful for machine learning, static analysis, and cybersecurity training.

    Features of the Dataset The Vulnerability Fix Dataset contains the following key features:

    vulnerability_type (String)

    The category of the security vulnerability present in the code. Examples: SQL Injection Cross-Site Scripting (XSS) Buffer Overflow Command Injection Insecure Cryptographic Practices vulnerable_code (String)

    The original code snippet that contains a security vulnerability. Written in various programming languages, including Java, Python, C, and JavaScript. Used for analyzing insecure coding patterns. fixed_code (String)

    The corrected version of the vulnerable_code with security improvements. Demonstrates best practices in secure coding. Helps in training AI models for automatic vulnerability fixing. This dataset is structured to support research in automated vulnerability detection, static code analysis, and secure software development.

  18. a

    CoDE

    • code-deegsnccu.hub.arcgis.com
    Updated Nov 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    North Carolina Central University (2023). CoDE [Dataset]. https://code-deegsnccu.hub.arcgis.com/content/380b665094f84d0c9cf6d08a0e798ec2
    Explore at:
    Dataset updated
    Nov 29, 2023
    Dataset authored and provided by
    North Carolina Central University
    Area covered
    Description

    Create your own initiative by combining existing applications with a custom site. Use this initiative to form teams around a problem and invite your community to participate.

  19. N

    No Code Website Builder Tools Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2025). No Code Website Builder Tools Report [Dataset]. https://www.marketresearchforecast.com/reports/no-code-website-builder-tools-39493
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Mar 18, 2025
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The no-code website builder market is experiencing robust growth, driven by the increasing demand for user-friendly website creation tools and the expanding digital presence of businesses and individuals. The market, estimated at $15 billion in 2025, is projected to maintain a healthy Compound Annual Growth Rate (CAGR) of 15% through 2033, reaching an estimated $45 billion. This growth is fueled by several key factors. Firstly, the ease of use offered by these platforms empowers non-technical users to build professional-looking websites without coding skills, significantly lowering the barrier to entry for online presence. Secondly, the diverse range of functionalities offered, from simple personal blogs to complex e-commerce stores, caters to a broad spectrum of users. Thirdly, the emergence of advanced features like AI-powered design assistance and integrated marketing tools further enhances the appeal and functionality of these platforms. Finally, the increasing adoption of mobile-first approaches and the growing need for cross-platform compatibility are driving innovation and demand within the market. Despite the rapid growth, market penetration remains relatively low, suggesting significant untapped potential. Competition among established players like Wix, Squarespace, and WordPress, alongside a wave of innovative newcomers, is intensifying. The market is segmented by website type (personal, business, community) and platform (mobile, desktop, application), reflecting the diverse needs of users. Geographic distribution shows strong growth across North America and Europe, while Asia-Pacific is emerging as a rapidly expanding market. While the ease of use is a major driver, some restraints include limitations on customization for complex websites and concerns regarding platform dependency and data security. Ongoing technological advancements, including AI-driven design and enhanced integration capabilities, will likely shape the future landscape of this dynamic market. The focus on user experience and innovative features will be crucial for players aiming for market leadership in the years to come.

  20. d

    Building and Safety - Code Enforcement Case - Open (N)

    • catalog.data.gov
    • data.lacity.org
    Updated Jun 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.lacity.org (2025). Building and Safety - Code Enforcement Case - Open (N) [Dataset]. https://catalog.data.gov/dataset/building-and-safety-code-enforcement-case-open-n
    Explore at:
    Dataset updated
    Jun 21, 2025
    Dataset provided by
    data.lacity.org
    Description

    The Department of Building and Safety is tasked with the enforcement of the City's building, electrical, mechanical and zoning regulations, which are elements of the Los Angeles Municipal Code. A Code Enforcement Case may originate from a customer service request to investigate a possible Code violation, from a planned and authorized inspection of a neighborhood for the purpose of reducing blight, or from a proactive inspection of a site due to an observed hazardous condition. A case is closed (indicated by a "C" status) when the site satisfies the requirements for Code compliance, or when no Code violation is found. A case remains open (indicated by an "O" status) until the site satisfies the requirements for Code compliance.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Strategies for Using Websites to Support Programming and Their Impact on Source Code: Video files. [Dataset]. https://figshare.manchester.ac.uk/articles/dataset/Strategies_for_Using_Websites_to_Support_Programming_and_Their_Impact_on_Source_Code_Video_files_/20379366

Strategies for Using Websites to Support Programming and Their Impact on Source Code: Video files.

Related Article
Explore at:
mp4Available download formats
Dataset updated
Sep 15, 2023
Dataset provided by
University of Manchester
Authors
Omar Alghamdi; Sarah Clinch; Mohammed Alhamadi; Caroline Jay
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

An observation study was conducted with undergraduate students. The study includes a video analysis of coding activities, an analysis of the source code written during coding, and a structured interview with participants. This dataset contains video recordings of participants coding with the websites. It also contains the interview questions along with transcriptions. In addition, it contains the source code resulting from the participants.

Search
Clear search
Close search
Google apps
Main menu