100+ datasets found

f
Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source...
frontiersin.figshare.com
figshare.com
pdf
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mikhail G. Dozmorov (2023). Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software.PDF [Dataset]. http://doi.org/10.3389/fbioe.2018.00198.s001
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fbioe.2018.00198.s001
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Mikhail G. Dozmorov
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Modern research is increasingly data-driven and reliant on bioinformatics software. Publication is a common way of introducing new software, but not all bioinformatics tools get published. Giving there are competing tools, it is important not merely to find the appropriate software, but have a metric for judging its usefulness. Journal's impact factor has been shown to be a poor predictor of software popularity; consequently, focusing on publications in high-impact journals limits user's choices in finding useful bioinformatics tools. Free and open source software repositories on popular code sharing platforms such as GitHub provide another venue to follow the latest bioinformatics trends. The open source component of GitHub allows users to bookmark and copy repositories that are most useful to them. This Perspective aims to demonstrate the utility of GitHub “stars,” “watchers,” and “forks” (GitHub statistics) as a measure of software impact. We compiled lists of impactful bioinformatics software and analyzed commonly used impact metrics and GitHub statistics of 50 genomics-oriented bioinformatics tools. We present examples of community-selected best bioinformatics resources and show that GitHub statistics are distinct from the journal's impact factor (JIF), citation counts, and alternative metrics (Altmetrics, CiteScore) in capturing the level of community attention. We suggest the use of GitHub statistics as an unbiased measure of the usability of bioinformatics software complementing the traditional impact metrics.
m
2025 Green Card Report for Biostatistics, Bioinformatics, and Systems...
myvisajobs.com
Updated Jan 16, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MyVisaJobs (2025). 2025 Green Card Report for Biostatistics, Bioinformatics, and Systems Biology [Dataset]. https://www.myvisajobs.com/reports/green-card/major/biostatistics,-bioinformatics,-and-systems-biology
Explore at:
Dataset updated
Jan 16, 2025
Dataset authored and provided by
MyVisaJobs
License
https://www.myvisajobs.com/terms-of-service/https://www.myvisajobs.com/terms-of-service/
Variables measured
Major, Salary, Petitions Filed
Description
A dataset that explores Green Card sponsorship trends, salary data, and employer insights for biostatistics, bioinformatics, and systems biology in the U.S.
Bioinformatics market in Latin America 2022-2027
statista.com
Updated Jul 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Bioinformatics market in Latin America 2022-2027 [Dataset]. https://www.statista.com/statistics/789013/bioinformatics-market-value-latin-america/
Explore at:
Dataset updated
Jul 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2022
Area covered
Latin America, LAC
Description
In 2022, the value of the bioinformatics market in Latin America was estimated at **** billion U.S. dollars. The figure was forecast to increase to **** billion U.S. dollars by 2025 and could reach **** billion U.S. dollars by 2027.
Bioinformatics Training Resources
figshare.com
html
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stephen Turner (2023). Bioinformatics Training Resources [Dataset]. http://doi.org/10.6084/m9.figshare.773083.v3
Explore at:
htmlAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.773083.v3
Dataset updated
May 31, 2023
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Stephen Turner
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Markdown source, PDF, and HTML rendering of bioinformatics training resources from http://stephenturner.us/p/edu.
d
Two-step mixed model approach to analyzing differential alternative RNA...
datadryad.org
zip
Updated Sep 28, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Li Luo; Huining Kang; Xichen Li; Scott Ness; Christine Stidley (2020). Two-step mixed model approach to analyzing differential alternative RNA splicing: Datasets and R scripts for analysis of alternative splicing [Dataset]. http://doi.org/10.5061/dryad.66t1g1k0h
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.66t1g1k0h
Dataset updated
Sep 28, 2020
Dataset provided by
Dryad
Authors
Li Luo; Huining Kang; Xichen Li; Scott Ness; Christine Stidley
Time period covered
Sep 26, 2020
Description
The dataset was collected through whole-transcriptome RNA-Sequencing technologies. The processing method was described in the manuscript.
f
Bioinformatics Summary statistics together with NCBI accession numbers.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated May 1, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tapia, Sebastián M.; Saenz-Agudelo, Pablo; Nespolo, Roberto F.; Villarroel, Carlos A.; Thompson, Dawn; Mikhalev, Ekaterina; Liti, Gianni; De Chiara, Matteo; Cubillos, Francisco A.; Urbina, Kamila; Mozzachiodi, Simone; Larrondo, Luis F.; Vega-Macaya, Franco; Oporto, Christian I. (2020). Bioinformatics Summary statistics together with NCBI accession numbers. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000455946
Explore at:
Dataset updated
May 1, 2020
Authors
Tapia, Sebastián M.; Saenz-Agudelo, Pablo; Nespolo, Roberto F.; Villarroel, Carlos A.; Thompson, Dawn; Mikhalev, Ekaterina; Liti, Gianni; De Chiara, Matteo; Cubillos, Francisco A.; Urbina, Kamila; Mozzachiodi, Simone; Larrondo, Luis F.; Vega-Macaya, Franco; Oporto, Christian I.
Description
(A) Bioinformatics Summary statistics and (B) Sequence identity matrix between strains. (XLSX)
m
NeonatalPortugal2018
data.mendeley.com
Updated Dec 7, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Francisco Machado e Costa (2019). NeonatalPortugal2018 [Dataset]. http://doi.org/10.17632/br8tnh3h47.1
Explore at:
Unique identifier
https://doi.org/10.17632/br8tnh3h47.1
Dataset updated
Dec 7, 2019
Authors
Francisco Machado e Costa
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Portuguese National Registry on low weight newborns between 2013 and 2018, made available for research purposes. Dataset is composed of 3823 unique entries registering birthweight, biological sex of the infant (1-Male; 2-Female), CRIB score (0-21) and survival (0-Survival; 1-Death).
d
Multidimensional scaling informed by F-statistic: Visualizing microbiome for...
dataone.org
search.dataone.org
+1more
Updated Oct 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hyungseok Kim; Soobin Kim; Jeff Kimbrel; Megan Morris; Xavier Mayali; Cullen Buie (2025). Multidimensional scaling informed by F-statistic: Visualizing microbiome for inference [Dataset]. http://doi.org/10.5061/dryad.vmcvdnd3x
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.vmcvdnd3x
Dataset updated
Oct 14, 2025
Dataset provided by
Dryad Digital Repository
Authors
Hyungseok Kim; Soobin Kim; Jeff Kimbrel; Megan Morris; Xavier Mayali; Cullen Buie
Description
Multidimensional scaling (MDS) is a dimensionality reduction technique for microbial ecology data analysis that represents the multivariate structure while preserving pairwise distances between samples. While its improvements have enhanced the ability to reveal data patterns by sample groups, these MDS-based methods require prior assumptions for inference, limiting their application in general microbiome analysis. In this study, we introduce a new MDS-based ordination, â€œF-informed MDS,â€ which configures the data distribution based on the F-statistic, the ratio of dispersion between groups sharing common and different characteristics. Using simulated compositional datasets, we demonstrate that the proposed method is robust to hyperparameter selection while maintaining statistical significance throughout the ordination process. Various quality metrics for evaluating dimensionality reduction confirm that F-informed MDS is comparable to state-of-the-art methods in preserving both local and ..., , # Multidimensional scaling informed by F-statistic: Visualizing grouped microbiome data with inference

Dataset DOI: 10.5061/dryad.vmcvdnd3x

Software: https://github.com/soob-kim/FinfoMDS
(also in prep for Bioconductor submission)

File or folder names are italicized.Â Package or variable names are monospaced.Â

File: Data.zip

Description:Â Raw data used in this study. Includes 3 folders and 1 file (see below).

FolderÂ SimulatedÂ contains pairwise distances and ordination results from three simulated datasets. Includes 7 subfolders and 6 files.

Six files are the original dataset and its associated labels set. The names are formatted as "sim_<*x*>-<*type*>.*csv*" whereÂ <*x*> is the replicate number andÂ <*type*> indicates whether the file is the design matrix ("data") or response vector ("Y").

Seven subfolders are grouped by the ordination method. Likewise, the file ...,
c
Bioinformatics Market Size, Share, Growth, Trends | Revenue Forecast - 2031
consegicbusinessintelligence.com
pdf,excel,csv,ppt
Updated Oct 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Consegic Business Intelligence Pvt Ltd (2025). Bioinformatics Market Size, Share, Growth, Trends | Revenue Forecast - 2031 [Dataset]. https://www.consegicbusinessintelligence.com/bioinformatics-market
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset updated
Oct 1, 2025
Dataset authored and provided by
Consegic Business Intelligence Pvt Ltd
License
https://www.consegicbusinessintelligence.com/privacy-policyhttps://www.consegicbusinessintelligence.com/privacy-policy
Area covered
Global
Description
The bioinformatics market, valued at USD 15,135.48 million in 2023, is expected to grow at a steady CAGR of 10.2%, reaching USD 32,663.77 million by 2031. Asia-Pacific is forecasted to grow at the fastest CAGR of 10.9%.
m
SARS-CoV-2 GISAID UK-US isolates (2020-09-07) genotyping VCF
data.mendeley.com
Updated Nov 16, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Necla Koçhan (2020). SARS-CoV-2 GISAID UK-US isolates (2020-09-07) genotyping VCF [Dataset]. http://doi.org/10.17632/5dfj2hhnng.1
Explore at:
Unique identifier
https://doi.org/10.17632/5dfj2hhnng.1
Dataset updated
Nov 16, 2020
Authors
Necla Koçhan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United Kingdom, United States
Description
VCF files containing filtered mutated sites in SARS-CoV-2 genomes obtained from GISAID EpiCoV and submitted from the UK and the US, separated by individual mutations. The columns correspond to viral genome accession ID, nucleotide position in the genome, mutation ID (left blank in all rows), reference nucleotide, identified mutation, quality, filter, and information columns (all left blank), format (GT in all rows), column corresponding to reference genome (all 0, referring to reference nucleotide column), and columns corresponding to isolate genomes, with each row identifying the nucleotide in the POS column, and whether it is non-mutant (0), or the mutant indicated in the identified mutation column (1). The files is tab delimited, with the UK file having 12696 rows including the names, and 18135 columns, and the US file having 15588 rows including the names, and 16277 columns.

The file was generated to test the hypothesis whether the different SARS-CoV-2 genes or protein coding regions are positively or negatively selected differently between 14408C>T / 23403A>G double mutants and double wildtype isolates, using mutation rate models, and whether regional distributions affect the mutation rates. Our findings have shown that the RdRp coding region and the S gene show the highest amount of selection across viral generations, and that different countries can affect the synonymous and nonsynonymous mutation rates for individual genes.
F
Bioinformatics Services Market Size & Share - America, Europe, & APAC...
fundamentalbusinessinsights.com
Updated Sep 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fundamental Business Insights and Consulting (2024). Bioinformatics Services Market Size & Share - America, Europe, & APAC Evolution 2026-2035 [Dataset]. https://www.fundamentalbusinessinsights.com/industry-report/bioinformatics-services-market-8203
Explore at:
Dataset updated
Sep 27, 2024
Dataset authored and provided by
Fundamental Business Insights and Consulting
License
https://www.fundamentalbusinessinsights.com/terms-of-usehttps://www.fundamentalbusinessinsights.com/terms-of-use
Area covered
United States
Description
The global bioinformatics services market size is projected to grow from USD 4.21 billion in 2025 to USD 18.41 billion by 2035, recording a CAGR of 15.9%. Companies leading innovation in the industry are Illumina, Thermo Fisher, QIAGEN, BGI, Eurofins Scientific, contributing to the sector’s development and expansion.
i
Grant Giving Statistics for International Society of Big Data and...
instrumentl.com
Updated Feb 27, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Grant Giving Statistics for International Society of Big Data and Bioinformatics Inc. [Dataset]. https://www.instrumentl.com/990-report/international-society-of-big-data-and-bioinformatics-inc
Explore at:
Dataset updated
Feb 27, 2023
Variables measured
Total Assets, Total Giving
Description
Financial overview and grant giving statistics of International Society of Big Data and Bioinformatics Inc.
Z
Data associated with "Survival outcomes are associated with genomic...
data.niaid.nih.gov
Updated Dec 19, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
King, Lydia; Flaus, Andrew; Holian, Emma; Golden, Aaron (2021). Data associated with "Survival outcomes are associated with genomic instability in luminal breast cancers". [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5791191
Explore at:
Dataset updated
Dec 19, 2021
Dataset provided by
The SFI Centre for Research Training in Genomics Data Sciences, National University of Ireland Galway, Galway, Republic of Ireland, Bioinformatics and Biostatistics Research Cluster, School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Republic of Ireland.
Bioinformatics and Biostatistics Research Cluster, School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Republic of Ireland.
Centre for Chromosome Biology, Biochemistry, School of Natural Sciences, National University of Ireland Galway, Galway, Republic of Ireland.
Authors
King, Lydia; Flaus, Andrew; Holian, Emma; Golden, Aaron
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data utilised in Survival outcomes are associated with genomic instability in luminal breast cancers.
F
Bioinformatics Market Size & Share - America, Europe, & APAC Entry...
fundamentalbusinessinsights.com
Updated Jun 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fundamental Business Insights and Consulting (2024). Bioinformatics Market Size & Share - America, Europe, & APAC Entry Strategies 2026-2035 [Dataset]. https://www.fundamentalbusinessinsights.com/industry-report/bioinformatics-market-3978
Explore at:
Dataset updated
Jun 17, 2024
Dataset authored and provided by
Fundamental Business Insights and Consulting
License
https://www.fundamentalbusinessinsights.com/terms-of-usehttps://www.fundamentalbusinessinsights.com/terms-of-use
Area covered
United States
Description
The global bioinformatics market size is expected to expand from USD 14.4 billion in 2025 to USD 52 billion by 2035, with CAGR growth exceeding 13.7%. Top companies operating in the industry include Illumina, Thermo Fisher Scientific, QIAGEN, PerkinElmer, BGI Genomics, shaping competitive strategies across the sector.
i
Grant Giving Statistics for Phoenix Bioinformatics Corporation
instrumentl.com
Updated Jan 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Grant Giving Statistics for Phoenix Bioinformatics Corporation [Dataset]. https://www.instrumentl.com/990-report/phoenix-bioinformatics-corporation
Explore at:
Dataset updated
Jan 13, 2022
Variables measured
Total Assets, Total Giving
Description
Financial overview and grant giving statistics of Phoenix Bioinformatics Corporation
Knowledge and attitudes among life scientists towards reproducibility within...
figshare.com
datasetcatalog.nlm.nih.gov
xlsx
Updated Aug 11, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Evanthia Kaimaklioti Samota (2020). Knowledge and attitudes among life scientists towards reproducibility within journal articles_survey_datafile_raw_data [Dataset]. http://doi.org/10.6084/m9.figshare.7855592.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7855592.v1
Dataset updated
Aug 11, 2020
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Evanthia Kaimaklioti Samota
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Raw datafile of the survey data collected from the survey distributed to collect knowledge and attitudes among life scientists towards reproducibility within journal articles.
m
expam Benchmarking - Classifier Performance Statistics
bridges.monash.edu
researchdata.edu.au
xlsx
Updated May 18, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sean Solari; Remy Young; Vanessa Marcelino; Sam Forster (2022). expam Benchmarking - Classifier Performance Statistics [Dataset]. http://doi.org/10.26180/19771072.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.26180/19771072.v1
Dataset updated
May 18, 2022
Dataset provided by
Monash University
Authors
Sean Solari; Remy Young; Vanessa Marcelino; Sam Forster
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Excel document containing precision, recall and F1 scores for metagenomic classifiers used in the benchmarking of expam's performance. Classifiers were tested on 140 simulated metagenomic communities, at different taxonomic ranks.

Global Bioinformatics Software Service Market Research Report: By...

wiseguyreports.com

Updated Sep 15, 2025

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

(2025). Global Bioinformatics Software Service Market Research Report: By Application (Genomics, Proteomics, Metabolomics, Transcriptomics, Molecular Modeling), By Deployment Type (Cloud-Based, On-Premises, Hybrid), By End User (Pharmaceutical Companies, Academic Institutions, Research Organizations, Biotechnology Companies), By Software Type (Data Analysis Software, Sequence Analysis Software, Molecular Visualization Software, Biostatistics Software) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/reports/bioinformatic-software-service-market

Explore at:

Dataset updated

Sep 15, 2025

License

https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

Time period covered

Sep 25, 2025

Area covered

Global

Description

BASE YEAR	2024
HISTORICAL DATA	2019 - 2023
REGIONS COVERED	North America, Europe, APAC, South America, MEA
REPORT COVERAGE	Revenue Forecast, Competitive Landscape, Growth Factors, and Trends
MARKET SIZE 2024	7.05(USD Billion)
MARKET SIZE 2025	7.55(USD Billion)
MARKET SIZE 2035	15.0(USD Billion)
SEGMENTS COVERED	Application, Deployment Type, End User, Software Type, Regional
COUNTRIES COVERED	US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
KEY MARKET DYNAMICS	increasing genomic data volume, rising demand for personalized medicine, advancements in cloud computing, integration of AI technologies, growing number of research collaborations
MARKET FORECAST UNITS	USD Billion
KEY COMPANIES PROFILED	Merck KGaA, CLC Bio, Illumina, Thermo Fisher Scientific, Qiagen, Seven Bridges, PerkinElmer, DNAnexus, Genomatix, GenoLogics, BioRad Laboratories, BMC Software, Agilent Technologies, Wuxi NextCODE, Geneious, SAS Institute
MARKET FORECAST PERIOD	2025 - 2035
KEY MARKET OPPORTUNITIES	Increased genomic research funding, Rise of personalized medicine, Advancements in AI and machine learning, Growing demand for data integration, Expanding cloud-based bioinformatics solutions
COMPOUND ANNUAL GROWTH RATE (CAGR)	7.1% (2025 - 2035)

f
Prophage statistics
open.flinders.edu.au
researchdata.edu.au
application/gzip
Updated Nov 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Robert Edwards (2025). Prophage statistics [Dataset]. http://doi.org/10.25451/flinders.22268722.v4
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.25451/flinders.22268722.v4
Dataset updated
Nov 5, 2025
Dataset provided by
Flinders University
Authors
Robert Edwards
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The presence of prophages in bacterial genomes.

This file has these columns: 0. GENOMEID - Genbank genome assembly accession 1. Genome Name - Definition of the genome in the genbank file 2. Contigs > 5kb - Number of contigs longer than 5 kb (only these were used to predict prophages) 3. Genome Contigs - Total number of contigs in the genome 4. Number of Coding Sequences - Total number of coding sequences in the genome 5. Too short - Number of phage predictions that were too short (less than 5 genes in the prediction) 6. Not enough phage hits - Number of phage predictions that did not have a single HMM match to VOGdb version 99 7. Kept - Number of high quality prophage predictions 8. Note - Outcome of the computation. You should read this column, especially if the sum of prophage predictions is zero
Z
Virus Pop Database V1
data.niaid.nih.gov
Updated Apr 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kende, Julia; Bigot, Thomas (2023). Virus Pop Database V1 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7867258
Explore at:
Dataset updated
Apr 26, 2023
Dataset provided by
Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, F-75015 Paris, France
Authors
Kende, Julia; Bigot, Thomas
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This archive is a database generated using the novel Virus Pop pipeline, which simulates realistic protein sequences and adds new branches to a protein phylogenetic tree. An article describing the pipeline is currently under review.

The database contains simulations of 995 different proteins from 93 virus genera, providing a total of 24,138,277 sequences, both in amino acid and nucleotide.

Facebook

Twitter

Click to copy link

Link copied

Cite

Mikhail G. Dozmorov (2023). Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software.PDF [Dataset]. http://doi.org/10.3389/fbioe.2018.00198.s001

Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software.PDF

Explore at:

pdfAvailable download formats

Unique identifier

https://doi.org/10.3389/fbioe.2018.00198.s001

Dataset updated

May 31, 2023

Dataset provided by

Frontiers

Authors

Mikhail G. Dozmorov

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Modern research is increasingly data-driven and reliant on bioinformatics software. Publication is a common way of introducing new software, but not all bioinformatics tools get published. Giving there are competing tools, it is important not merely to find the appropriate software, but have a metric for judging its usefulness. Journal's impact factor has been shown to be a poor predictor of software popularity; consequently, focusing on publications in high-impact journals limits user's choices in finding useful bioinformatics tools. Free and open source software repositories on popular code sharing platforms such as GitHub provide another venue to follow the latest bioinformatics trends. The open source component of GitHub allows users to bookmark and copy repositories that are most useful to them. This Perspective aims to demonstrate the utility of GitHub “stars,” “watchers,” and “forks” (GitHub statistics) as a measure of software impact. We compiled lists of impactful bioinformatics software and analyzed commonly used impact metrics and GitHub statistics of 50 genomics-oriented bioinformatics tools. We present examples of community-selected best bioinformatics resources and show that GitHub statistics are distinct from the journal's impact factor (JIF), citation counts, and alternative metrics (Altmetrics, CiteScore) in capturing the level of community attention. We suggest the use of GitHub statistics as an unbiased measure of the usability of bioinformatics software complementing the traditional impact metrics.

Clear search

Close search

Google apps

Main menu

Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source...

2025 Green Card Report for Biostatistics, Bioinformatics, and Systems...

Bioinformatics market in Latin America 2022-2027

Bioinformatics Training Resources

Two-step mixed model approach to analyzing differential alternative RNA...

Bioinformatics Summary statistics together with NCBI accession numbers.

NeonatalPortugal2018

Multidimensional scaling informed by F-statistic: Visualizing microbiome for...

File: Data.zip

Description:Â Raw data used in this study. Includes 3 folders and 1 file (see below).

Bioinformatics Market Size, Share, Growth, Trends | Revenue Forecast - 2031

SARS-CoV-2 GISAID UK-US isolates (2020-09-07) genotyping VCF

Bioinformatics Services Market Size & Share - America, Europe, & APAC...

Grant Giving Statistics for International Society of Big Data and...

Data associated with "Survival outcomes are associated with genomic...

Bioinformatics Market Size & Share - America, Europe, & APAC Entry...

Grant Giving Statistics for Phoenix Bioinformatics Corporation

Knowledge and attitudes among life scientists towards reproducibility within...

expam Benchmarking - Classifier Performance Statistics

Global Bioinformatics Software Service Market Research Report: By...

Prophage statistics

Virus Pop Database V1

Data_Sheet_1_GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software.PDF