100+ datasets found
  1. n

    Data from: SkewDB: A comprehensive database of GC and 10 other skews for...

    • data.niaid.nih.gov
    • search.dataone.org
    • +2more
    zip
    Updated Oct 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bert Hubert (2021). SkewDB: A comprehensive database of GC and 10 other skews for over 28,000 chromosomes and plasmids [Dataset]. http://doi.org/10.5061/dryad.g4f4qrfr6
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 4, 2021
    Dataset provided by
    Independent researcher
    Authors
    Bert Hubert
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.

    Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.

    The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.

    Methods The SkewDB analysis relies exclusively on the tens of thousands of FASTA and GFF3 files available through the NCBI download service, which covers both GenBank and RefSeq. The database includes bacteria, archaea and their plasmids. Furthermore, to ease analysis, the NCBI Taxonomy database is sourced and merged so output data can quickly be related to (super)phyla or specific species. No other data is used, which greatly simplifies processing. Data is read directly in the compressed format provided by NCBI.

    All results are emitted as standard CSV files. In the first step of the analysis, for each organism the FASTA sequence and the GFF3 annotation file are parsed. Every chromosome in the FASTA file is traversed from beginning to end, while a running total is kept for cumulative GC and TA skew. In addition, within protein coding genes, such totals are also kept separately for these skews on the first, second and third codon position. Furthermore, separate totals are kept for regions which do not code for proteins. In addition, to enable strand bias measurements, a cumulative count is maintained of nucleotides that are part of a positive or negative sense gene. The counter is increased for positive sense nucleotides, decreased for negative sense nucleotides, and left alone for non-genic regions.

    A separate counter is kept for non-genic nucleotides. Finally, G and C nucleotides are counted, regardless of if they are part of a gene or not. These running totals are emitted at 4096 nucleotide intervals, a resolution suitable for determining skews and shifts. In addition, one line summaries are stored for each chromosome. These line includes the RefSeq identifier of the chromosome, the full name mentioned in the FASTA file, plus counts of A, C, G and T nucleotides. Finally five levels of taxonomic data are stored.

    Chromosomes and plasmids of fewer than 100 thousand nucleotides are ignored, as these are too noisy to model faithfully. Plasmids are clearly marked in the database, enabling researchers to focus on chromosomes if so desired. Fitting Once the genomes have been summarised at 4096-nucleotide resolution, the skews are fitted to a simple model. The fits are based on four parameters. Alpha1 and alpha2 denote the relative excess of G over C on the leading and lagging strands. If alpha1 is 0.046, this means that for every 1000 nucleotides on the leading strand, the cumulative count of G excess increases by 46. The third parameter is div and it describes how the chromosome is divided over leading and lagging strands. If this number is 0.557, the leading replication strand is modeled to make up 55.7% of the chromosome. The final parameter is shift (the dotted vertical line), and denotes the offset of the origin of replication compared to the DNA FASTA file. This parameter has no biological meaning of itself, and is an artifact of the DNA assembly process.

    The goodness-of-fit number consists of the root mean squared error of the fit, divided by the absolute mean skew. This latter correction is made to not penalize good fits for bacteria showing significant skew. GC skew tends to be defined very strongly, and it is therefore used to pick the div and shift parameters of the DNA sequence, which are then kept as a fixed constraint for all the other skews, which might not be present as clearly. The fitting process itself is a downhill simplex method optimization over the three dimensions, seeded with the average observed skew over the whole genome, and assuming there is no shift, and that the leading and lagging strands are evenly distributed. The simplex optimization is tuned so that it takes sufficiently large steps so it can reach the optimum even if some initial assumptions are off.

  2. Integrated Client Database

    • catalog.data.gov
    • s.cnmilf.com
    • +1more
    Updated Jul 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Social Security Administration (2025). Integrated Client Database [Dataset]. https://catalog.data.gov/dataset/integrated-client-database
    Explore at:
    Dataset updated
    Jul 4, 2025
    Dataset provided by
    Social Security Administrationhttp://ssa.gov/
    Description

    Database used to store client data both Identity and customer relationship management.

  3. d

    snoRNABase- a comprehensive database of human H/ACA and C/D box snoRNAs.

    • dknet.org
    • neuinfo.org
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). snoRNABase- a comprehensive database of human H/ACA and C/D box snoRNAs. [Dataset]. http://identifiers.org/RRID:SCR_007939
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    This is a database of human C/D box and H/ACA modification guide RNAs. Information on a particular snoRNA can be accessed by three ways: 1- On the Search page, just type the name of the snoRNA (for example ACA17) in the Id window. 2- The Find guide RNA contains the sequences of the human ribosomal rRNAs 28S, 18S and 5.8S, and of the snRNAs U1, U2, U4, U5 and U6, with the positions of modified (2''O-ribose methylated or pseudo-uridinylated) nucleotides, and the identity of the corresponding modification guide RNAs. You can click on the name of the relevant snoRNA. 3- By utilizing the link to the UCSC Human Genome Browser.

  4. d

    COMPLY Grantee Compliance Database

    • catalog.data.gov
    • res1catalogd-o-tdatad-o-tgov.vcapture.xyz
    Updated Aug 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Grants and Debarment (OMS/OGD) (2023). COMPLY Grantee Compliance Database [Dataset]. https://catalog.data.gov/dataset/comply-grantee-compliance-database-47c7c
    Explore at:
    Dataset updated
    Aug 24, 2023
    Dataset provided by
    Office of Grants and Debarment (OMS/OGD)
    Description

    The Grantee Compliance Database Comply App is a comprehensive database for summarizing a wide range of grant recipient related activities. In addition to providing an overview of award information related to each grantee recipient, this database also stores historical information related to the recipient's training activities, indirect cost rate negotiations, pre-award certifications, post award monitoring plans, as well as on-site review, off-site review, and technical assistance activities. All advanced monitoring activities must be recorded in the system with an attached report to count as part of the Grantee Compliance Assistance Initiative as outlined in EPA Order 5700.6. The database tracks information on planned and actual On-Site Evaluative, off-Site Evaluative and/or On-Site Technical Assistance Visits conducted by each Grants Management and Program Office in the Agency. The primary objective of this database is to provide accurate information to EPA staff in Headquarters, Regional Program, and Grants Management Offices regarding compliance activities that each Program and Grants Management Office performs or plans to perform during any given calendar year.

  5. f

    Table_1_MFPPDB: a comprehensive multi-functional plant peptide database.xlsx...

    • frontiersin.figshare.com
    xlsx
    Updated Oct 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yaozu Yang; Hongwei Wu; Yu Gao; Wei Tong; Ke Li (2023). Table_1_MFPPDB: a comprehensive multi-functional plant peptide database.xlsx [Dataset]. http://doi.org/10.3389/fpls.2023.1224394.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 16, 2023
    Dataset provided by
    Frontiers
    Authors
    Yaozu Yang; Hongwei Wu; Yu Gao; Wei Tong; Ke Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Plants produce a wide range of bioactive peptides as part of their innate defense mechanisms. With the explosive growth of plant-derived peptides, verifying the therapeutic function using traditional experimental methods are resources and time consuming. Therefore, it is necessary to predict the therapeutic function of plant-derived peptides more effectively and accurately with reduced waste of resources and thus expedite the development of plant peptides. We herein developed a repository of plant peptides predicted to have multiple therapeutic functions, named as MFPPDB (multi-functional plant peptide database). MFPPDB including 1,482,409 single or multiple functional plant origin therapeutic peptides derived from 121 fundamental plant species. The functional categories of these therapeutic peptides include 41 different features such as anti-bacterial, anti-fungal, anti-HIV, anti-viral, and anti-cancer. The detailed physicochemical information of these peptides was presented in functional search and physicochemical property search module, which can help users easily access the peptide information by the plant peptide species, ID, and functions, or by their peptide ID, isoelectric point, peptide sequence, and molecular weight through web-friendly interface. We further matched the predicted peptides to nine state-of-the-art curated functional peptide databases and found that at least 293,408 of the peptides possess functional potentials. Overall, MFPPDB integrated a massive number of plant peptides have single or multiple therapeutic functions, which will facilitate the comprehensive research in plant peptidomics. MFPPDB can be freely accessed through http://124.223.195.214:9188/mfppdb/index.

  6. D

    Database Security Solution Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Database Security Solution Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/database-security-solution-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Database Security Solution Market Outlook



    The global database security solution market was valued at USD 4.5 billion in 2023 and is projected to reach USD 11.7 billion by 2032, growing at a compound annual growth rate (CAGR) of 11.5% from 2024 to 2032. This remarkable growth can be attributed to the increasing volume of data generated and stored by organizations, rising cyber threats, regulatory compliance requirements, and the growing adoption of cloud-based services across various industries.



    One of the primary growth factors for the database security solution market is the exponential increase in data generation and storage. With the advent of big data, IoT, and advanced analytics, organizations are producing vast amounts of data that need to be securely stored and managed to prevent unauthorized access and data breaches. As a result, there is a growing demand for robust database security solutions that can protect sensitive information across diverse databases and platforms, ensuring data privacy and integrity.



    Another significant growth driver is the rising number of cyber threats and data breaches. Organizations face sophisticated cyber-attacks that target confidential and high-value data, leading to financial losses, reputational damage, and regulatory penalties. This has necessitated the implementation of advanced database security solutions that offer real-time threat detection, encryption, access control, and audit capabilities to safeguard critical data and maintain business continuity.



    Compliance with stringent regulatory frameworks is also propelling the growth of the database security solution market. Regulations such as GDPR, HIPAA, and CCPA mandate the protection of personal and sensitive information, compelling organizations to adopt comprehensive database security measures. Businesses are investing heavily in database security solutions to meet these regulatory requirements, avoid hefty fines, and build customer trust by ensuring data confidentiality and compliance.



    The advent of Big Data Security has become a pivotal aspect in the realm of database security solutions. As organizations increasingly rely on big data analytics to drive business insights, the security of this data becomes paramount. Big Data Security involves implementing comprehensive measures to protect large volumes of data from unauthorized access and breaches. It encompasses various strategies, including encryption, access controls, and real-time monitoring, to ensure that sensitive data remains protected throughout its lifecycle. As the volume and complexity of data continue to grow, the demand for advanced Big Data Security solutions is expected to rise, driving further innovation and investment in this area.



    Regionally, the database security solution market is witnessing significant growth, with North America leading the charge due to its advanced technological infrastructure, early adoption of innovative security solutions, and stringent data protection laws. Europe is also experiencing substantial growth driven by the enforcement of GDPR and increasing awareness of data privacy issues. The Asia Pacific region is projected to witness the highest CAGR during the forecast period, fueled by the rapid digital transformation, rising cyber threats, and growing government initiatives to enhance cybersecurity.



    Component Analysis



    The database security solution market can be segmented by component into software, hardware, and services. The software segment holds the largest market share, driven by the extensive use of database security software to protect data against unauthorized access, malware, and other cyber threats. These software solutions offer various functionalities such as encryption, access control, auditing, and monitoring, making them indispensable for organizations looking to secure their databases effectively.



    The hardware segment, although smaller compared to software, plays a crucial role in enhancing database security. Hardware-based security solutions, such as hardware security modules (HSMs), are used for cryptographic key management and secure storage of sensitive data. These solutions provide an additional layer of security by ensuring that cryptographic operations are performed in a tamper-resistant environment, thus preventing unauthorized access and key compromise.



    The services segment is also witnessing significant growth, driven by the increasing demand for m

  7. F

    Full Text Database Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Feb 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Full Text Database Report [Dataset]. https://www.datainsightsmarket.com/reports/full-text-database-1964932
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Feb 12, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global full-text database market is projected to grow from XXX million in 2025 to XXX million by 2033, at a CAGR of XX% during the forecast period. The growth is attributed to increasing demand for information retrieval, advancements in technology, and rising need for efficient research and development. Key drivers of the market include growing adoption of digital libraries, rising demand for personalized content, and increasing focus on research and development. Key trends in the full-text database market include the emergence of artificial intelligence (AI) and machine learning (ML) technologies, the growth of open access publishing, and the increasing adoption of cloud-based solutions. The market is segmented by application (academic research, corporate research, legal research, and others) and by type (bibliographic, full-text, and abstract). Major players in the market include John Wiely & Sons, ICPSR, IEEE, EBSCO, UMI, Blackwell, Springer Link, Elsevier Science, Apache Solr, Elastic N.V., CNKI, China Science and Technology Journal Database, Wanfang Data Knowledge Service Platform, China Science Citation Database, and Chinese, Western, Japanese and Russian Journals Joint Directory Database. The market is expected to witness significant growth in emerging economies, such as China and India, due to rising literacy rates and increasing demand for information access.

  8. USAID Comprehensive Data Inventory March 2022

    • catalog.data.gov
    Updated Jun 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.usaid.gov (2024). USAID Comprehensive Data Inventory March 2022 [Dataset]. https://catalog.data.gov/dataset/usaid-comprehensive-data-inventory-march-2022
    Explore at:
    Dataset updated
    Jun 25, 2024
    Dataset provided by
    United States Agency for International Developmenthttp://usaid.gov/
    Description

    This is an inventory of all data assets maintained by USAID.

  9. d

    Complete data from the Barro Colorado 50-ha plot: 423617 trees, 35 years

    • datadryad.org
    • dataone.org
    zip
    Updated Jun 7, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard Condit; Rolando Pérez; Salomón Aguilar; Suzanne Lao; Robin Foster; Stephen Hubbell (2019). Complete data from the Barro Colorado 50-ha plot: 423617 trees, 35 years [Dataset]. http://doi.org/10.15146/5xcp-0d46
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 7, 2019
    Dataset provided by
    Dryad
    Authors
    Richard Condit; Rolando Pérez; Salomón Aguilar; Suzanne Lao; Robin Foster; Stephen Hubbell
    Time period covered
    2019
    Area covered
    Description

    See Condit (1998).

  10. f

    Data from: hccTAAb Atlas: An Integrated Knowledge Database for...

    • acs.figshare.com
    zip
    Updated Dec 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tiandong Li; Peng Wang; Guiying Sun; Yuanlin Zou; Yifan Cheng; Han Wang; Yin Lu; Jianxiang Shi; Keyan Wang; Qiang Zhang; Hua Ye (2023). hccTAAb Atlas: An Integrated Knowledge Database for Tumor-Associated Autoantibodies in Hepatocellular Carcinoma [Dataset]. http://doi.org/10.1021/acs.jproteome.3c00579.s001
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 29, 2023
    Dataset provided by
    ACS Publications
    Authors
    Tiandong Li; Peng Wang; Guiying Sun; Yuanlin Zou; Yifan Cheng; Han Wang; Yin Lu; Jianxiang Shi; Keyan Wang; Qiang Zhang; Hua Ye
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Tumor-associated autoantibodies (TAAbs) have demonstrated potential as biomarkers for cancer detection. However, the understanding of their role in hepatocellular carcinoma (HCC) remains limited. In this study, we aimed to systematically collect and standardize information about these TAAbs and establish a comprehensive database as a platform for in-depth research. A total of 170 TAAbs were identified from published papers retrieved from PubMed, Web of Science, and Embase. Following normative reannotation, these TAAbs were referred to as 162 official symbols. The hccTAAb (tumor-associated autoantibodies in hepatocellular carcinoma) atlas was developed using the R Shiny framework and incorporating literature-based and multiomics data sets. This comprehensive online resource provides key information such as sensitivity, specificity, and additional details such as official symbols, official full names, UniProt, NCBI, HPA, neXtProt, and aliases through hyperlinks. Additionally, hccTAAb offers six analytical modules for visualizing expression profiles, survival analysis, immune infiltration, similarity analysis, DNA methylation, and DNA mutation analysis. Overall, the hccTAAb Atlas provides valuable insights into the mechanisms underlying TAAb and has the potential to enhance the diagnosis and treatment of HCC using autoantibodies. The hccTAAb Atlas is freely accessible at https://nscc.v.zzu.edu.cn/hccTAAb/.

  11. w

    Global Financial Inclusion (Global Findex) Database 2021 - Canada

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    Updated Dec 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Development Research Group, Finance and Private Sector Development Unit (2022). Global Financial Inclusion (Global Findex) Database 2021 - Canada [Dataset]. https://microdata.worldbank.org/index.php/catalog/4625
    Explore at:
    Dataset updated
    Dec 16, 2022
    Dataset authored and provided by
    Development Research Group, Finance and Private Sector Development Unit
    Time period covered
    2021
    Area covered
    Canada
    Description

    Abstract

    The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.

    The Global Findex is the world's most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of about 128,000 adults in more than 120 economies. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.

    The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.

    Geographic coverage

    Northwest Territories, Yukon, and Nunavut (representing approximately 0.3 percent of the Canadian population) were excluded.

    Analysis unit

    Individual

    Kind of data

    Observation data/ratings [obs]

    Sampling procedure

    In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19 related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Phone surveys were not a viable option in 17 economies that had been part of previous Global Findex surveys, however, because of low mobile phone ownership and surveying restrictions. Data for these economies will be collected in 2022 and released in 2023.

    In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.

    In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.

    The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).

    For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.

    Sample size for Canada is 1007.

    Mode of data collection

    Landline and mobile telephone

    Research instrument

    Questionnaires are available on the website.

    Sampling error estimates

    Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.

  12. n

    Viral Integrated Structural Evolution Dynamic Database

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Jul 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Viral Integrated Structural Evolution Dynamic Database [Dataset]. http://identifiers.org/RRID:SCR_018793
    Explore at:
    Dataset updated
    Jul 12, 2020
    Description

    Database of SARS-CoV-2 and other viruses. Integrates structural and dynamic insights with viral evolution for proteins coded by virus. Each virus within database has workflow performed on each protein. Workflow consists of protein modeling, molecular dynamic simulations, evolutionary analysis, and mapping of protein-protein interactions. On page for each protein is link to individual protein data folder system, video of protein rotating with conservation, details of protein function, widget to purchase 3D print of protein at cost of production, amino acid movement from molecular dynamic simulations, and table of data for each amino acid of protein.

  13. w

    Global Financial Inclusion (Global Findex) Database 2021 - Yemen, Rep.

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Development Research Group, Finance and Private Sector Development Unit (2023). Global Financial Inclusion (Global Findex) Database 2021 - Yemen, Rep. [Dataset]. https://microdata.worldbank.org/index.php/catalog/5862
    Explore at:
    Dataset updated
    Jun 8, 2023
    Dataset authored and provided by
    Development Research Group, Finance and Private Sector Development Unit
    Time period covered
    2022 - 2023
    Area covered
    Yemen
    Description

    Abstract

    The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.

    The Global Findex is the world’s most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of almost 145,000 people in 139 economies, representing 97 percent of the world’s population. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.

    The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.

    Geographic coverage

    Al Baydaa, Al Jawf, Mareb, Sadah, the Island of Socotra, and several districts in other governorates were excluded due to their small size, remoteness or security issues. The excluded areas represent approximately 23% of the population. In addition, due to the ongoing security situation, during field over one-fourth of the PSUs were replaced with a similar PSU in the same province.

    Kind of data

    Observation data/ratings [obs]

    Sampling procedure

    In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19–related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Additionally, phone surveys were not a viable option in 16 economies in 2021, which were then surveyed in 2022.

    In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.

    In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.

    The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).

    For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.

    Sample size for Yemen, Rep. is 1000.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Questionnaires are available on the website.

    Sampling error estimates

    Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.

  14. f

    DataSheet_1_MFPPDB: a comprehensive multi-functional plant peptide...

    • frontiersin.figshare.com
    pdf
    Updated Oct 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yaozu Yang; Hongwei Wu; Yu Gao; Wei Tong; Ke Li (2023). DataSheet_1_MFPPDB: a comprehensive multi-functional plant peptide database.pdf [Dataset]. http://doi.org/10.3389/fpls.2023.1224394.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Oct 16, 2023
    Dataset provided by
    Frontiers
    Authors
    Yaozu Yang; Hongwei Wu; Yu Gao; Wei Tong; Ke Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Plants produce a wide range of bioactive peptides as part of their innate defense mechanisms. With the explosive growth of plant-derived peptides, verifying the therapeutic function using traditional experimental methods are resources and time consuming. Therefore, it is necessary to predict the therapeutic function of plant-derived peptides more effectively and accurately with reduced waste of resources and thus expedite the development of plant peptides. We herein developed a repository of plant peptides predicted to have multiple therapeutic functions, named as MFPPDB (multi-functional plant peptide database). MFPPDB including 1,482,409 single or multiple functional plant origin therapeutic peptides derived from 121 fundamental plant species. The functional categories of these therapeutic peptides include 41 different features such as anti-bacterial, anti-fungal, anti-HIV, anti-viral, and anti-cancer. The detailed physicochemical information of these peptides was presented in functional search and physicochemical property search module, which can help users easily access the peptide information by the plant peptide species, ID, and functions, or by their peptide ID, isoelectric point, peptide sequence, and molecular weight through web-friendly interface. We further matched the predicted peptides to nine state-of-the-art curated functional peptide databases and found that at least 293,408 of the peptides possess functional potentials. Overall, MFPPDB integrated a massive number of plant peptides have single or multiple therapeutic functions, which will facilitate the comprehensive research in plant peptidomics. MFPPDB can be freely accessed through http://124.223.195.214:9188/mfppdb/index.

  15. D

    Database Security Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Database Security Report [Dataset]. https://www.datainsightsmarket.com/reports/database-security-1977256
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    May 12, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Database Security market is experiencing robust growth, projected to reach $2556.1 million in 2025 and exhibiting a Compound Annual Growth Rate (CAGR) of 11.4% from 2025 to 2033. This expansion is fueled by the increasing frequency and sophistication of cyberattacks targeting sensitive data stored in databases, coupled with stringent data privacy regulations like GDPR and CCPA. The rising adoption of cloud computing and the proliferation of big data also contribute significantly to market growth, as organizations require robust security solutions to protect their valuable data assets across diverse environments. The market is segmented by application (SMEs, Large Enterprises) and type (Marketing, Sales, Operations, Finance, HR & Legal), with large enterprises and applications involving sensitive financial data demonstrating particularly high demand for advanced database security solutions. North America currently holds a dominant market share due to early adoption of advanced technologies and a strong regulatory landscape, but the Asia-Pacific region is poised for significant growth, driven by increasing digitalization and a rapidly expanding economy. The competitive landscape is characterized by a mix of established players like Oracle and IBM, alongside specialized security vendors such as Trustwave and McAfee. These companies offer a wide range of solutions, including database activity monitoring, encryption, access control, and vulnerability management. The market is witnessing innovation in areas like AI-powered threat detection and automated security response, which are enhancing the effectiveness and efficiency of database security solutions. However, challenges remain, including the rising complexity of cyber threats, the skills gap in cybersecurity professionals, and the high cost of implementing and maintaining comprehensive database security systems. The continued evolution of cyberattacks and data privacy regulations will be key drivers shaping the future of this dynamic market.

  16. n

    Comparasite: full length cDNA database

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Oct 16, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Comparasite: full length cDNA database [Dataset]. http://identifiers.org/RRID:SCR_007608
    Explore at:
    Dataset updated
    Oct 16, 2019
    Description

    Comparasite is an integrated database of our original full-length cDNA sequence data. It consists of seven sub-databases of apicomplexa protozoa, Plasmodium falciparum, Plasmodium yoelii, Plasmodium vivax, Toxoplasma gondii, Cryptosporidium parvum, Echinococcus multilocularis. Homologous gene groups are clustered and comparative analysis of any combination of these seven species is implemented, such as interspecies comparisons as to cellular localization, motifs or transmembrane regions and so on. For submitted keywords and other search conditions, Comparasite retrieves orthologous gene groups containing a given protein motif/GO term etc in common or in a species-specific manner. By enabling multi-faceted comparative analyses of genes of apicomplexa protozoa, monophyletic organisms that have evolved to diversify to parasitize various hosts by adopting complex life cycles, Comparasite should help elucidate the mechanism behind parasitism.

  17. Data from: Comprehensive Housing Affordability Strategy (CHAS)

    • catalog.data.gov
    • res1catalogd-o-tdatad-o-tgov.vcapture.xyz
    Updated Mar 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Housing and Urban Development (2024). Comprehensive Housing Affordability Strategy (CHAS) [Dataset]. https://catalog.data.gov/dataset/comprehensive-housing-affordability-strategy-chas-2008-2010
    Explore at:
    Dataset updated
    Mar 1, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Description

    The U.S. Department of Housing and Urban Development (HUD) periodically receives custom tabulations of data from the U.S. Census Bureau that are largely not available through standard Census products. These data, known as the CHAS data (Comprehensive Housing Affordability Strategy), demonstrate the extent of housing problems and housing needs, particularly for low income households. The CHAS data are used by local governments to plan how to spend HUD funds, and may also be used by HUD to distribute grant funds

  18. w

    Newly Registered Domains Database of 2053-01-13

    • whoisdatacenter.com
    csv
    Updated Aug 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc (2024). Newly Registered Domains Database of 2053-01-13 [Dataset]. https://whoisdatacenter.com/newly-registered-domains-database/2053-01-13/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 13, 2024
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Jun 19, 2025
    Description

    Explore Newly Registered Domains from January 13, 2053, with our comprehensive database. Access real-time data on recently created domains.

  19. w

    Global Financial Inclusion (Global Findex) Database 2021 - Kazakhstan

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Dec 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Development Research Group, Finance and Private Sector Development Unit (2022). Global Financial Inclusion (Global Findex) Database 2021 - Kazakhstan [Dataset]. https://microdata.worldbank.org/index.php/catalog/4663
    Explore at:
    Dataset updated
    Dec 16, 2022
    Dataset authored and provided by
    Development Research Group, Finance and Private Sector Development Unit
    Time period covered
    2021
    Area covered
    Kazakhstan
    Description

    Abstract

    The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.

    The Global Findex is the world's most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of about 128,000 adults in more than 120 economies. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.

    The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.

    Geographic coverage

    National coverage

    Analysis unit

    Individual

    Kind of data

    Observation data/ratings [obs]

    Sampling procedure

    In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19 related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Phone surveys were not a viable option in 17 economies that had been part of previous Global Findex surveys, however, because of low mobile phone ownership and surveying restrictions. Data for these economies will be collected in 2022 and released in 2023.

    In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.

    In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.

    The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).

    For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.

    Sample size for Kazakhstan is 1000.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Questionnaires are available on the website.

    Sampling error estimates

    Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.

  20. d

    Data from: DBGC: a database of human gastric cancer

    • datadryad.org
    • data.niaid.nih.gov
    • +2more
    zip
    Updated Jan 25, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chao Wang; Jun Zhang; Mingdeng Cai; Zhenggang Zhu; Wenjie Gu; Yingyan Yu; Xiaoyan Zhang (2016). DBGC: a database of human gastric cancer [Dataset]. http://doi.org/10.5061/dryad.271dk
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 25, 2016
    Dataset provided by
    Dryad
    Authors
    Chao Wang; Jun Zhang; Mingdeng Cai; Zhenggang Zhu; Wenjie Gu; Yingyan Yu; Xiaoyan Zhang
    Time period covered
    Oct 30, 2015
    Description

    DBGC: A Database of Human Gastric CancerOur work collected the data from transcriptome, proteome, mutation, biomarker and drug-sensitive genes. Totally comes out 9 files:

    1,transcriptome.txt -- transcriptome related to gastric cancer, that contains the type of experiment, platform, sample, volume, result and the linkage of those. 2,up_genes.txt -- up-regulated gene list come from transcriptome which based on experiment 3,down_genes.txt -- up-regulated gene list come from transcriptome which based on experiment 4,proteome.txt -- proteome related to gastric cancer, that contains the type of experiment, sample, technology and result etc. 5,up_proteins.txt -- up-regulated genes list come from proteome 6,down_proteins.txt -- down-regulated genes list come from proteome 7,mutation.txt -- mutation related to gastric cancer, that mainly include mutation type, postion etc. 8,biomarker.txt -- biomarker related to gastric cancer including types, stages as well sensitivity, specifity ect. 9,pharma...

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bert Hubert (2021). SkewDB: A comprehensive database of GC and 10 other skews for over 28,000 chromosomes and plasmids [Dataset]. http://doi.org/10.5061/dryad.g4f4qrfr6

Data from: SkewDB: A comprehensive database of GC and 10 other skews for over 28,000 chromosomes and plasmids

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Oct 4, 2021
Dataset provided by
Independent researcher
Authors
Bert Hubert
License

https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

Description

GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.

Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.

The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.

Methods The SkewDB analysis relies exclusively on the tens of thousands of FASTA and GFF3 files available through the NCBI download service, which covers both GenBank and RefSeq. The database includes bacteria, archaea and their plasmids. Furthermore, to ease analysis, the NCBI Taxonomy database is sourced and merged so output data can quickly be related to (super)phyla or specific species. No other data is used, which greatly simplifies processing. Data is read directly in the compressed format provided by NCBI.

All results are emitted as standard CSV files. In the first step of the analysis, for each organism the FASTA sequence and the GFF3 annotation file are parsed. Every chromosome in the FASTA file is traversed from beginning to end, while a running total is kept for cumulative GC and TA skew. In addition, within protein coding genes, such totals are also kept separately for these skews on the first, second and third codon position. Furthermore, separate totals are kept for regions which do not code for proteins. In addition, to enable strand bias measurements, a cumulative count is maintained of nucleotides that are part of a positive or negative sense gene. The counter is increased for positive sense nucleotides, decreased for negative sense nucleotides, and left alone for non-genic regions.

A separate counter is kept for non-genic nucleotides. Finally, G and C nucleotides are counted, regardless of if they are part of a gene or not. These running totals are emitted at 4096 nucleotide intervals, a resolution suitable for determining skews and shifts. In addition, one line summaries are stored for each chromosome. These line includes the RefSeq identifier of the chromosome, the full name mentioned in the FASTA file, plus counts of A, C, G and T nucleotides. Finally five levels of taxonomic data are stored.

Chromosomes and plasmids of fewer than 100 thousand nucleotides are ignored, as these are too noisy to model faithfully. Plasmids are clearly marked in the database, enabling researchers to focus on chromosomes if so desired. Fitting Once the genomes have been summarised at 4096-nucleotide resolution, the skews are fitted to a simple model. The fits are based on four parameters. Alpha1 and alpha2 denote the relative excess of G over C on the leading and lagging strands. If alpha1 is 0.046, this means that for every 1000 nucleotides on the leading strand, the cumulative count of G excess increases by 46. The third parameter is div and it describes how the chromosome is divided over leading and lagging strands. If this number is 0.557, the leading replication strand is modeled to make up 55.7% of the chromosome. The final parameter is shift (the dotted vertical line), and denotes the offset of the origin of replication compared to the DNA FASTA file. This parameter has no biological meaning of itself, and is an artifact of the DNA assembly process.

The goodness-of-fit number consists of the root mean squared error of the fit, divided by the absolute mean skew. This latter correction is made to not penalize good fits for bacteria showing significant skew. GC skew tends to be defined very strongly, and it is therefore used to pick the div and shift parameters of the DNA sequence, which are then kept as a fixed constraint for all the other skews, which might not be present as clearly. The fitting process itself is a downhill simplex method optimization over the three dimensions, seeded with the average observed skew over the whole genome, and assuming there is no shift, and that the leading and lagging strands are evenly distributed. The simplex optimization is tuned so that it takes sufficiently large steps so it can reach the optimum even if some initial assumptions are off.

Search
Clear search
Close search
Google apps
Main menu