95 datasets found

Z
Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based...
data.niaid.nih.gov
zenodo.org
Updated Nov 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Antal, Gábor; Tóth, Zoltán Gábor; Hegedűs, Péter; Ferenc, Rudolf (2020). Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics (Training Dataset) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4281475
Explore at:
Dataset updated
Nov 21, 2020
Dataset provided by
University of Szeged
Authors
Antal, Gábor; Tóth, Zoltán Gábor; Hegedűs, Péter; Ferenc, Rudolf
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset consists of multiple files which contain bug prediction training data.

The entries in the dataset are JavaScript functions either being buggy or non-buggy. Bug related information was obtained from the project EsLint contained in BugsJS (https://github.com/BugsJS/eslint). The buggy instances were collected throughout the lifetime of the project, however we added non-buggy entries from the latest version which is tagged as fix (entries which were previously included as buggy were not included as non-buggy later on).

The dataset is based on hybrid call graphs which are constructed by https://github.com/sed-szeged/hcg-js-framework. The result of this tool is a call graph where the edges are associated with a confidence level which shows how likely the given edge is a valid call edge.

We used different threshold values from which we considered the edges to be valid. The following threshold values were used:

0.00

0.05

0.20

0.30

The prefix in the dataset file names are coming from the used threshold. The the datasets include coupling metrics NII (Nubmer of Incoming Invocations) and NOI (Number of Outgoing Invocations) which were calculated by a static source code analyzer called SourceMeter. Hybrid counterparts of these metrics (HNII and HNOI) are based on the given threshold values.

There are four variants for all of these datasets:

Both static (NII, NOi) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics and information about the entries (file without any postfix). Column contained only in this dataset are:

ID

Name

Longname

Parent ID

Component ID

Path

Line

Column

EndLine

EndColumn

Both static (NII, NOi) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h+s' postfix)

Only static (NII, NOI) coupling metrics are included with additional static source code metrics (file with '_s' postfix)

Only hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h' postfix)

Static source code metrics which are contained in all dataset are the following:

McCC - McCabe Cyclomatic Complexity

NL - Nesting Level

NLE - Nesting Level Else If

CD - Comment Density

CLOC - Comment Lines of Code

DLOC - Documentation Lines of Code

TCD - Total Comment Density (Comment Lines in an emedded function will be also considered)

TCLOC - Total Comment Lines of Code (Comment Lines in an emedded function will be also considered)

LLOC - Logical Lines of Code (Comment and empty lines not counted)

LOC - Lines of Code (Comment and empty lines are counted)

NOS - Number of Statements

NUMPAR - Number of Parameters

TLLOC - Logical Lines of Code (Lines in embedded functions are also counted)

TLOC - Lines of Code (Lines in embedded functions are also counted)

TNOS - Total Number of Statements (Statements in embedded functions are also counted)
h
autotrain-data-javascript-traing-1
huggingface.co
Updated Aug 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abdul Rehman Shahid (2023). autotrain-data-javascript-traing-1 [Dataset]. https://huggingface.co/datasets/ars-1/autotrain-data-javascript-traing-1
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 14, 2023
Authors
Abdul Rehman Shahid
Description
AutoTrain Dataset for project: javascript-traing-1

Dataset Description

This dataset has been automatically processed by AutoTrain for project javascript-traing-1.

Languages

The BCP-47 code for the dataset's language is unk.

Dataset Structure Data Instances

A sample from this dataset looks as follows: [ { "target": "test/NavbarSpec.js", "feat_repo_name": "aabenoja/react-bootstrap", "text": "import React from 'react'; import… See the full description on the dataset page: https://huggingface.co/datasets/ars-1/autotrain-data-javascript-traing-1.
DATS 6401 - Final Project - Yon ho Cheong.zip
figshare.com
zip
Updated Dec 15, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yon ho Cheong (2018). DATS 6401 - Final Project - Yon ho Cheong.zip [Dataset]. http://doi.org/10.6084/m9.figshare.7471007.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7471007.v1
Dataset updated
Dec 15, 2018
Dataset provided by
Figsharehttp://figshare.com/
Authors
Yon ho Cheong
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
AbstractThe H1B is an employment-based visa category for temporary foreign workers in the United States. Every year, the US immigration department receives over 200,000 petitions and selects 85,000 applications through a random process and the U.S. employer must submit a petition for an H1B visa to the US immigration department. This is the most common visa status applied to international students once they complete college or higher education and begin working in a full-time position. The project provides essential information on job titles, preferred regions of settlement, foreign applicants and employers' trends for H1B visa application. According to locations, employers, job titles and salary range make up most of the H1B petitions, so different visualization utilizing tools will be used in order to analyze and interpreted in relation to the trends of the H1B visa to provide a recommendation to the applicant. This report is the base of the project for Visualization of Complex Data class at the George Washington University, some examples in this project has an analysis for the different relevant variables (Case Status, Employer Name, SOC name, Job Title, Prevailing Wage, Worksite, and Latitude and Longitude information) from Kaggle and Office of Foreign Labor Certification(OFLC) in order to see the H1B visa changes in the past several decades. Keywords: H1B visa, Data Analysis, Visualization of Complex Data, HTML, JavaScript, CSS, Tableau, D3.jsDatasetThe dataset contains 10 columns and covers a total of 3 million records spanning from 2011-2016. The relevant columns in the dataset include case status, employer name, SOC name, jobe title, full time position, prevailing wage, year, worksite, and latitude and longitude information.Link to dataset: https://www.kaggle.com/nsharan/h-1b-visaLink to dataset(FY2017): https://www.foreignlaborcert.doleta.gov/performancedata.cfmRunning the codeOpen Index.htmlData ProcessingDoing some data preprocessing to transform the raw data into an understandable format.Find and combine any other external datasets to enrich the analysis such as dataset of FY2017.To make appropriated Visualizations, variables should be Developed and compiled into visualization programs.Draw a geo map and scatter plot to compare the fastest growth in fixed value and in percentages.Extract some aspects and analyze the changes in employers’ preference as well as forecasts for the future trends.VisualizationsCombo chart: this chart shows the overall volume of receipts and approvals rate.Scatter plot: scatter plot shows the beneficiary country of birth.Geo map: this map shows All States of H1B petitions filed.Line chart: this chart shows top10 states of H1B petitions filed. Pie chart: this chart shows comparison of Education level and occupations for petitions FY2011 vs FY2017.Tree map: tree map shows overall top employers who submit the greatest number of applications.Side-by-side bar chart: this chart shows overall comparison of Data Scientist and Data Analyst.Highlight table: this table shows mean wage of a Data Scientist and Data Analyst with case status certified.Bubble chart: this chart shows top10 companies for Data Scientist and Data Analyst.Related ResearchThe H-1B Visa Debate, Explained - Harvard Business Reviewhttps://hbr.org/2017/05/the-h-1b-visa-debate-explainedForeign Labor Certification Data Centerhttps://www.foreignlaborcert.doleta.govKey facts about the U.S. H-1B visa programhttp://www.pewresearch.org/fact-tank/2017/04/27/key-facts-about-the-u-s-h-1b-visa-program/H1B visa News and Updates from The Economic Timeshttps://economictimes.indiatimes.com/topic/H1B-visa/newsH-1B visa - Wikipediahttps://en.wikipedia.org/wiki/H-1B_visaKey FindingsFrom the analysis, the government is cutting down the number of approvals for H1B on 2017.In the past decade, due to the nature of demand for high-skilled workers, visa holders have clustered in STEM fields and come mostly from countries in Asia such as China and India.Technical Jobs fill up the majority of Top 10 Jobs among foreign workers such as Computer Systems Analyst and Software Developers.The employers located in the metro areas thrive to find foreign workforce who can fill the technical position that they have in their organization.States like California, New York, Washington, New Jersey, Massachusetts, Illinois, and Texas are the prime location for foreign workers and provide many job opportunities. Top Companies such Infosys, Tata, IBM India that submit most H1B Visa Applications are companies based in India associated with software and IT services.Data Scientist position has experienced an exponential growth in terms of H1B visa applications and jobs are clustered in West region with the highest number.Visualization utilizing programsHTML, JavaScript, CSS, D3.js, Google API, Python, R, and Tableau
Website Statistics
data.wu.ac.at
lcc.portaljs.com
+2more
csv, pdf
Updated Jun 11, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lincolnshire County Council (2018). Website Statistics [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/M2ZkZDBjOTUtMzNhYi00YWRjLWI1OWMtZmUzMzA5NjM0ZTdk
Explore at:
csv, pdfAvailable download formats
Dataset updated
Jun 11, 2018
Dataset provided by
Lincolnshire County Councilhttp://www.lincolnshire.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
This Website Statistics dataset has four resources showing usage of the Lincolnshire Open Data website. Web analytics terms used in each resource are defined in their accompanying Metadata file.

Website Usage Statistics: This document shows a statistical summary of usage of the Lincolnshire Open Data site for the latest calendar year.

Website Statistics Summary: This dataset shows a website statistics summary for the Lincolnshire Open Data site for the latest calendar year.

Webpage Statistics: This dataset shows statistics for individual Webpages on the Lincolnshire Open Data site by calendar year.

Dataset Statistics: This dataset shows cumulative totals for Datasets on the Lincolnshire Open Data site that have also been published on the national Open Data site Data.Gov.UK - see the Source link.

Note: Website and Webpage statistics (the first three resources above) show only UK users, and exclude API calls (automated requests for datasets). The Dataset Statistics are confined to users with javascript enabled, which excludes web crawlers and API calls.

These Website Statistics resources are updated annually in January by the Lincolnshire County Council Business Intelligence team. For any enquiries about the information contact opendata@lincolnshire.gov.uk.
h
rlvr-code-data-JavaScript
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saurabh Shah, rlvr-code-data-JavaScript [Dataset]. https://huggingface.co/datasets/saurabh5/rlvr-code-data-JavaScript
Explore at:
Authors
Saurabh Shah
Description
saurabh5/rlvr-code-data-JavaScript dataset hosted on Hugging Face and contributed by the HF Datasets community
JS-data-v2
kaggle.com
zip
Updated Nov 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Margaritelli (2024). JS-data-v2 [Dataset]. https://www.kaggle.com/datasets/margaritelli/js-data-v2/code
Explore at:
zip(8445651294 bytes)Available download formats
Dataset updated
Nov 17, 2024
Authors
Margaritelli
Description
Dataset

This dataset was created by Margaritelli

Contents
Additional file 5 of Health figures: an open source JavaScript library for...
figshare.com
springernature.figshare.com
txt
Updated Dec 15, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andres Ledesma; Mohammed Al-Musawi; Hannu Nieminen (2016). Additional file 5 of Health figures: an open source JavaScript library for health data visualization [Dataset]. http://doi.org/10.6084/m9.figshare.c.3621374_D4.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.c.3621374_D4.v1
Dataset updated
Dec 15, 2016
Dataset provided by
Figsharehttp://figshare.com/
Authors
Andres Ledesma; Mohammed Al-Musawi; Hannu Nieminen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
After scenario questionnaire results. The data contains the results of the After Scenario Questionnaire answered by 14 participants. (CSV 149 kb)
Web Performance Metrics
kaggle.com
zip
Updated Oct 31, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lars E (2020). Web Performance Metrics [Dataset]. https://www.kaggle.com/indexhtml/web-performance-metrics
Explore at:
zip(85490717 bytes)Available download formats
Dataset updated
Oct 31, 2020
Authors
Lars E
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
JavaScript Errors and Performance in the wild

This data comes from an effort to render the top 1M domains on the web in a scripted browser, and recording performance metrics of each page. These metrics are published here in numpy format. See the starter notebook for an example showing how to use the data, and what the columns contain. The following posts for a more in depth write ups:

Analysis of the logged JavaScript errors

Analysis of performance metrics
Quakes
kaggle.com
zip
Updated Sep 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mathurin Aché (2020). Quakes [Dataset]. https://www.kaggle.com/mathurinache/quakes
Explore at:
zip(17473 bytes)Available download formats
Dataset updated
Sep 6, 2020
Authors
Mathurin Aché
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets)

Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
Data from: Mining Rule Violations in JavaScript Code Snippets
zenodo.org
csv
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Uriel Ferreira Campos; Guilherme Smethurst; João Pedro Moraes; Rodrigo Bonifácio; Gustavo Pinto; Uriel Ferreira Campos; Guilherme Smethurst; João Pedro Moraes; Rodrigo Bonifácio; Gustavo Pinto (2020). Mining Rule Violations in JavaScript Code Snippets [Dataset]. http://doi.org/10.5281/zenodo.2593818
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.2593818
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Uriel Ferreira Campos; Guilherme Smethurst; João Pedro Moraes; Rodrigo Bonifácio; Gustavo Pinto; Uriel Ferreira Campos; Guilherme Smethurst; João Pedro Moraes; Rodrigo Bonifácio; Gustavo Pinto
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Content of this repository
This is the repository that contains the scripts and dataset for the MSR 2019 mining challenge

Github Repository with the software used : here.
=======

DATASET
The dataset was retrived utilizing google bigquery and dumped to a csv
file for further processing, this original file with no treatment is called jsanswers.csv, here we can find the following information :
1. The Id of the question (PostId)
2. The Content (in this case the code block)
3. the lenght of the code block
4. the line count of the code block
5. The score of the post
6. The title

A quick look at this files, one can notice that a postID can have multiple rows related to it, that's how multiple codeblocks are saved in the database.

Filtered Dataset:

Extracting code from CSV
We used a python script called "ExtractCodeFromCSV.py" to extract the code from the original csv and merge all the codeblocks in their respective javascript file with the postID as name, this resulted in 336 thousand files.

Running ESlint
Due to the single threaded nature of ESlint, we needed to create a script to run ESlint because it took a huge toll on the machine to run it on 336 thousand files, this script is named "ESlintRunnerScript.py", it splits the files in 20 evenly distributed parts and runs 20 processes of esLinter to generate the reports, as such it generates 20 json files.

Number of Violations per Rule
This information was extracted using the script named "parser.py", it generated the file named "NumberofViolationsPerRule.csv" which contains the number of violations per rule used in the linter configuration in the dataset.

Number of violations per Category
As a way to make relevant statistics of the dataset, we generated the number of violations per rule category as defined in the eslinter website, this information was extracted using the same "parser.py" script.

Individual Reports
This information was extracted from the json reports, it's a csv file with PostID and violations per rule.

Rules
The file Rules with categories contains all the rules used and their categories.
JavaScript Mastery's YouTube Channel Statistics
vidiq.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
vidIQ, JavaScript Mastery's YouTube Channel Statistics [Dataset]. https://vidiq.com/youtube-stats/channel/UCmXmlB4-HJytD7wek0Uo97A/
Explore at:
Dataset authored and provided by
vidIQ
Time period covered
Nov 1, 2025 - Nov 27, 2025
Area covered
YouTube, HR
Variables measured
subscribers, video count, video views, engagement rate, upload frequency, estimated earnings
Description
Comprehensive YouTube channel statistics for JavaScript Mastery, featuring 1,200,000 subscribers and 104,656,847 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Lifestyle category and is based in HR. Track 187 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.
c
JavaScript Price Prediction Data
coinbase.com
Updated Nov 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). JavaScript Price Prediction Data [Dataset]. https://www.coinbase.com/en-ca/price-prediction/base-javascript-baaa
Explore at:
Dataset updated
Nov 25, 2025
Variables measured
Growth Rate, Predicted Price
Measurement technique
User-defined projections based on compound growth. This is not a formal financial forecast.
Description
This dataset contains the predicted prices of the asset JavaScript over the next 16 years. This data is calculated initially using a default 5 percent annual growth rate, and after page load, it features a sliding scale component where the user can then further adjust the growth rate to their own positive or negative projections. The maximum positive adjustable growth rate is 100 percent, and the minimum adjustable growth rate is -100 percent.
f
Descriptive statistics of the number of missed frames for SVG+JavaScript...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Oct 10, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vadillo, Miguel A.; López-de-Ipiña, Diego; Garaizar, Pablo (2014). Descriptive statistics of the number of missed frames for SVG+JavaScript animations. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001194973
Explore at:
Dataset updated
Oct 10, 2014
Authors
Vadillo, Miguel A.; López-de-Ipiña, Diego; Garaizar, Pablo
Description
Descriptive statistics of the number of missed frames for SVG+JavaScript animations.
JS_Employees_data
kaggle.com
zip
Updated Jun 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shreyash Shandilya☑️ (2023). JS_Employees_data [Dataset]. https://www.kaggle.com/datasets/shreyashshandilya16/js-employees-data
Explore at:
zip(526 bytes)Available download formats
Dataset updated
Jun 6, 2023
Authors
Shreyash Shandilya☑️
Description
Here's a description of the columns in the dataset:

Employee_ID: A unique identifier for each employee. Name: The name of the employee. Age: The age of the employee. Gender: The gender of the employee. Department: The department in which the employee works. Salary: The salary of the employee. Years_Experience: The number of years of work experience the employee has. Performance_Rating: The performance rating of the employee on a scale of 1 to 5, with 5 being the highest.
Salaries of developers in Ukraine
kaggle.com
zip
Updated Nov 17, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mysha Rysh (2022). Salaries of developers in Ukraine [Dataset]. https://www.kaggle.com/datasets/mysha1rysh/salaries-of-developers-in-ukraine
Explore at:
zip(24303 bytes)Available download formats
Dataset updated
Nov 17, 2022
Authors
Mysha Rysh
Area covered
Ukraine
Description
This data was collected by the team https://dou.ua/ . This resource is very popular in Ukraine. It provides salary statistics, shows current vacancies and publishes useful articles related to the life of an IT specialist. This dataset was taken from the public repository https://github.com/devua/csv/tree/master/salaries . This dataset will include the following data for each of the developer: salary, position (f.e. Junior, Middle), experience, city, tech (f.e C#/.NET, JavaScript, Python). I think this dataset will be useful to our community. Thank you.
h
preprocess-videomme-data
huggingface.co
Updated Feb 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hyun, Jeongseok (2025). preprocess-videomme-data [Dataset]. https://huggingface.co/datasets/js-hyun/preprocess-videomme-data
Explore at:
Dataset updated
Feb 2, 2025
Authors
Hyun, Jeongseok
Description
js-hyun/preprocess-videomme-data dataset hosted on Hugging Face and contributed by the HF Datasets community
Examining the Capacity of Text Mining and Software Metrics in Vulnerability...
data.europa.eu
unknown
Updated Sep 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zenodo (2023). Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction [dataset] [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-8369963?locale=fr
Explore at:
unknown(79359120)Available download formats
Dataset updated
Sep 21, 2023
Dataset authored and provided by
Zenodohttp://zenodo.org/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains the extension of a publicly available dataset that was published initially by Ferenc et al. in their paper: “Ferenc, R.; Hegedus, P.; Gyimesi, P.; Antal, G.; Bán, D.; Gyimóthy, T. Challenging machine learning algorithms in predicting vulnerable javascript functions. 2019 IEEE/ACM 7th InternationalWorkshop on Realizing Artificial Intelligence Synergies in Software Engineering (RAISE). IEEE, 2019, pp. 8–14.” The dataset contained software metrics for source code functions written in JavaScript (JS) programming language. Each function was labeled as vulnerable or clean. The authors gathered vulnerabilities from publicly available vulnerability databases. In our paper entitled: “Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction” and cited as: “Kalouptsoglou I, Siavvas M, Kehagias D, Chatzigeorgiou A, Ampatzoglou A. Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction. Entropy. 2022; 24(5):651. https://doi.org/10.3390/e24050651” , we presented an extended version of the dataset by extracting textual features for the labeled JS functions. In particular, we got the dataset provided by Ferenc et al. in CSV format and then we gathered all the GitHub URLs of the dataset's functions (i.e., methods). Using these URLs, we collected the source code of the corresponding JS files from GitHub. Subsequently, by utilizing the start and end line information for every function, we cut off the code of the functions. Each function was then tokenized to construct a list of tokens per function. To extract text features, we used a text mining technique called sequences of tokens. As a result, we created a repository with all methods' source code, the token sequences of each method, and their labels. To boost the generalizability of type-specific tokens, all comments were eliminated, as well as all integers and strings, which were replaced with two unique IDs. The dataset contains 12,106 JavaScript functions, from which 1,493 are considered vulnerable. This dataset was created and utilized during the Vulnerability Prediction Task of the Horizon2020 IoTAC Project as training and evaluation data for the construction of vulnerability prediction models. The dataset is provided in the csv format. Each row of the csv file has the following parts: Label: Flag with values ‘1’ for vulnerable and ‘0’ for non-vulnerable methods Name: The name of the JavaScript method Longname: The longname of the JavaScript method Path: The path of the file of the method in the repository Full_repo_path: The GitHub URL of the file of the method TokenX: Each next row corresponds to each token included in the method
I
A Crawl of the Mobile Web Measuring Sensor Accesses
databank.illinois.edu
aws-databank-alb.library.illinois.edu
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anupam Das; Gunes Acar; Nikita Borisov; Amogh Pradeep, A Crawl of the Mobile Web Measuring Sensor Accesses [Dataset]. http://doi.org/10.13012/B2IDB-9213932_V1
Explore at:
Unique identifier
https://doi.org/10.13012/B2IDB-9213932_V1
Authors
Anupam Das; Gunes Acar; Nikita Borisov; Amogh Pradeep
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is the result of three crawls of the web performed in May 2018. The data contains raw crawl data and instrumentation captured by OpenWPM-Mobile, as well as analysis that identifies which scripts access mobile sensors, which ones perform some of browser fingerprinting, as well as clustering of scripts based on their intended use. The dataset is described in the included README.md file; more details about the methodology can be found in our ACM CCS'18 paper: Anupam Das, Gunes Acar, Nikita Borisov, Amogh Pradeep. The Web's Sixth Sense: A Study of Scripts Accessing Smartphone Sensors. In Proceedings of the 25th ACM Conference on Computer and Communications Security (CCS), Toronto, Canada, October 15–19, 2018. (Forthcoming)
Additional file 2 of Health figures: an open source JavaScript library for...
figshare.com
springernature.figshare.com
txt
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andres Ledesma; Mohammed Al-Musawi; Hannu Nieminen (2023). Additional file 2 of Health figures: an open source JavaScript library for health data visualization [Dataset]. http://doi.org/10.6084/m9.figshare.c.3621374_D1.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.c.3621374_D1.v1
Dataset updated
May 30, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Andres Ledesma; Mohammed Al-Musawi; Hannu Nieminen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Results from laboratory testing. The data contains the task identifier, the average time to completion, number of times the task was successfully completed and the total number of errors. (CSV 209 kb)
Data from: Investigating the Reproducibility of NPM Packages
zenodo.org
bin, zip
Updated Jun 25, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anonymous Anonymous; Anonymous Anonymous (2020). Investigating the Reproducibility of NPM Packages [Dataset]. http://doi.org/10.5281/zenodo.3698357
Explore at:
zip, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3698357
Dataset updated
Jun 25, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Anonymous Anonymous; Anonymous Anonymous
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Open-source Dataset

This dataset contains the NPM packages that we built using our tool-chain. It consists of the diffoscope outputs, the versions built by our tool-chain, and the pre-built packages present on the npmjs registry.

Facebook

Twitter

Click to copy link

Link copied

Cite

Antal, Gábor; Tóth, Zoltán Gábor; Hegedűs, Péter; Ferenc, Rudolf (2020). Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics (Training Dataset) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4281475

Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics (Training Dataset)

Explore at:

Dataset updated

Nov 21, 2020

Dataset provided by

University of Szeged

Authors

Antal, Gábor; Tóth, Zoltán Gábor; Hegedűs, Péter; Ferenc, Rudolf

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset consists of multiple files which contain bug prediction training data.

The entries in the dataset are JavaScript functions either being buggy or non-buggy. Bug related information was obtained from the project EsLint contained in BugsJS (https://github.com/BugsJS/eslint). The buggy instances were collected throughout the lifetime of the project, however we added non-buggy entries from the latest version which is tagged as fix (entries which were previously included as buggy were not included as non-buggy later on).

The dataset is based on hybrid call graphs which are constructed by https://github.com/sed-szeged/hcg-js-framework. The result of this tool is a call graph where the edges are associated with a confidence level which shows how likely the given edge is a valid call edge.

We used different threshold values from which we considered the edges to be valid. The following threshold values were used:

0.00

0.05

0.20

0.30

The prefix in the dataset file names are coming from the used threshold. The the datasets include coupling metrics NII (Nubmer of Incoming Invocations) and NOI (Number of Outgoing Invocations) which were calculated by a static source code analyzer called SourceMeter. Hybrid counterparts of these metrics (HNII and HNOI) are based on the given threshold values.

There are four variants for all of these datasets:

Both static (NII, NOi) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics and information about the entries (file without any postfix). Column contained only in this dataset are:

Name

Longname

Parent ID

Component ID

Path

Line

Column

EndLine

EndColumn

Both static (NII, NOi) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h+s' postfix)

Only static (NII, NOI) coupling metrics are included with additional static source code metrics (file with '_s' postfix)

Only hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h' postfix)

Static source code metrics which are contained in all dataset are the following:

McCC - McCabe Cyclomatic Complexity

NL - Nesting Level

NLE - Nesting Level Else If

CD - Comment Density

CLOC - Comment Lines of Code

DLOC - Documentation Lines of Code

TCD - Total Comment Density (Comment Lines in an emedded function will be also considered)

TCLOC - Total Comment Lines of Code (Comment Lines in an emedded function will be also considered)

LLOC - Logical Lines of Code (Comment and empty lines not counted)

LOC - Lines of Code (Comment and empty lines are counted)

NOS - Number of Statements

NUMPAR - Number of Parameters

TLLOC - Logical Lines of Code (Lines in embedded functions are also counted)

TLOC - Lines of Code (Lines in embedded functions are also counted)

TNOS - Total Number of Statements (Statements in embedded functions are also counted)

Clear search

Close search

Google apps

Main menu

Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based...

autotrain-data-javascript-traing-1

DATS 6401 - Final Project - Yon ho Cheong.zip

Website Statistics

rlvr-code-data-JavaScript

JS-data-v2

Dataset

Contents

Additional file 5 of Health figures: an open source JavaScript library for...

Web Performance Metrics

JavaScript Errors and Performance in the wild

Quakes

Data from: Mining Rule Violations in JavaScript Code Snippets

JavaScript Mastery's YouTube Channel Statistics

JavaScript Price Prediction Data

Descriptive statistics of the number of missed frames for SVG+JavaScript...

JS_Employees_data

Salaries of developers in Ukraine

preprocess-videomme-data

Examining the Capacity of Text Mining and Software Metrics in Vulnerability...

A Crawl of the Mobile Web Measuring Sensor Accesses

Additional file 2 of Health figures: an open source JavaScript library for...

Data from: Investigating the Reproducibility of NPM Packages

Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics (Training Dataset)