https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.
Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.
The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.
Methods The SkewDB analysis relies exclusively on the tens of thousands of FASTA and GFF3 files available through the NCBI download service, which covers both GenBank and RefSeq. The database includes bacteria, archaea and their plasmids. Furthermore, to ease analysis, the NCBI Taxonomy database is sourced and merged so output data can quickly be related to (super)phyla or specific species. No other data is used, which greatly simplifies processing. Data is read directly in the compressed format provided by NCBI.
All results are emitted as standard CSV files. In the first step of the analysis, for each organism the FASTA sequence and the GFF3 annotation file are parsed. Every chromosome in the FASTA file is traversed from beginning to end, while a running total is kept for cumulative GC and TA skew. In addition, within protein coding genes, such totals are also kept separately for these skews on the first, second and third codon position. Furthermore, separate totals are kept for regions which do not code for proteins. In addition, to enable strand bias measurements, a cumulative count is maintained of nucleotides that are part of a positive or negative sense gene. The counter is increased for positive sense nucleotides, decreased for negative sense nucleotides, and left alone for non-genic regions.
A separate counter is kept for non-genic nucleotides. Finally, G and C nucleotides are counted, regardless of if they are part of a gene or not. These running totals are emitted at 4096 nucleotide intervals, a resolution suitable for determining skews and shifts. In addition, one line summaries are stored for each chromosome. These line includes the RefSeq identifier of the chromosome, the full name mentioned in the FASTA file, plus counts of A, C, G and T nucleotides. Finally five levels of taxonomic data are stored.
Chromosomes and plasmids of fewer than 100 thousand nucleotides are ignored, as these are too noisy to model faithfully. Plasmids are clearly marked in the database, enabling researchers to focus on chromosomes if so desired. Fitting Once the genomes have been summarised at 4096-nucleotide resolution, the skews are fitted to a simple model. The fits are based on four parameters. Alpha1 and alpha2 denote the relative excess of G over C on the leading and lagging strands. If alpha1 is 0.046, this means that for every 1000 nucleotides on the leading strand, the cumulative count of G excess increases by 46. The third parameter is div and it describes how the chromosome is divided over leading and lagging strands. If this number is 0.557, the leading replication strand is modeled to make up 55.7% of the chromosome. The final parameter is shift (the dotted vertical line), and denotes the offset of the origin of replication compared to the DNA FASTA file. This parameter has no biological meaning of itself, and is an artifact of the DNA assembly process.
The goodness-of-fit number consists of the root mean squared error of the fit, divided by the absolute mean skew. This latter correction is made to not penalize good fits for bacteria showing significant skew. GC skew tends to be defined very strongly, and it is therefore used to pick the div and shift parameters of the DNA sequence, which are then kept as a fixed constraint for all the other skews, which might not be present as clearly. The fitting process itself is a downhill simplex method optimization over the three dimensions, seeded with the average observed skew over the whole genome, and assuming there is no shift, and that the leading and lagging strands are evenly distributed. The simplex optimization is tuned so that it takes sufficiently large steps so it can reach the optimum even if some initial assumptions are off.
Database used to store client data both Identity and customer relationship management.
This is a database of human C/D box and H/ACA modification guide RNAs. Information on a particular snoRNA can be accessed by three ways: 1- On the Search page, just type the name of the snoRNA (for example ACA17) in the Id window. 2- The Find guide RNA contains the sequences of the human ribosomal rRNAs 28S, 18S and 5.8S, and of the snRNAs U1, U2, U4, U5 and U6, with the positions of modified (2''O-ribose methylated or pseudo-uridinylated) nucleotides, and the identity of the corresponding modification guide RNAs. You can click on the name of the relevant snoRNA. 3- By utilizing the link to the UCSC Human Genome Browser.
The Grantee Compliance Database Comply App is a comprehensive database for summarizing a wide range of grant recipient related activities. In addition to providing an overview of award information related to each grantee recipient, this database also stores historical information related to the recipient's training activities, indirect cost rate negotiations, pre-award certifications, post award monitoring plans, as well as on-site review, off-site review, and technical assistance activities. All advanced monitoring activities must be recorded in the system with an attached report to count as part of the Grantee Compliance Assistance Initiative as outlined in EPA Order 5700.6. The database tracks information on planned and actual On-Site Evaluative, off-Site Evaluative and/or On-Site Technical Assistance Visits conducted by each Grants Management and Program Office in the Agency. The primary objective of this database is to provide accurate information to EPA staff in Headquarters, Regional Program, and Grants Management Offices regarding compliance activities that each Program and Grants Management Office performs or plans to perform during any given calendar year.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Plants produce a wide range of bioactive peptides as part of their innate defense mechanisms. With the explosive growth of plant-derived peptides, verifying the therapeutic function using traditional experimental methods are resources and time consuming. Therefore, it is necessary to predict the therapeutic function of plant-derived peptides more effectively and accurately with reduced waste of resources and thus expedite the development of plant peptides. We herein developed a repository of plant peptides predicted to have multiple therapeutic functions, named as MFPPDB (multi-functional plant peptide database). MFPPDB including 1,482,409 single or multiple functional plant origin therapeutic peptides derived from 121 fundamental plant species. The functional categories of these therapeutic peptides include 41 different features such as anti-bacterial, anti-fungal, anti-HIV, anti-viral, and anti-cancer. The detailed physicochemical information of these peptides was presented in functional search and physicochemical property search module, which can help users easily access the peptide information by the plant peptide species, ID, and functions, or by their peptide ID, isoelectric point, peptide sequence, and molecular weight through web-friendly interface. We further matched the predicted peptides to nine state-of-the-art curated functional peptide databases and found that at least 293,408 of the peptides possess functional potentials. Overall, MFPPDB integrated a massive number of plant peptides have single or multiple therapeutic functions, which will facilitate the comprehensive research in plant peptidomics. MFPPDB can be freely accessed through http://124.223.195.214:9188/mfppdb/index.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global database security solution market was valued at USD 4.5 billion in 2023 and is projected to reach USD 11.7 billion by 2032, growing at a compound annual growth rate (CAGR) of 11.5% from 2024 to 2032. This remarkable growth can be attributed to the increasing volume of data generated and stored by organizations, rising cyber threats, regulatory compliance requirements, and the growing adoption of cloud-based services across various industries.
One of the primary growth factors for the database security solution market is the exponential increase in data generation and storage. With the advent of big data, IoT, and advanced analytics, organizations are producing vast amounts of data that need to be securely stored and managed to prevent unauthorized access and data breaches. As a result, there is a growing demand for robust database security solutions that can protect sensitive information across diverse databases and platforms, ensuring data privacy and integrity.
Another significant growth driver is the rising number of cyber threats and data breaches. Organizations face sophisticated cyber-attacks that target confidential and high-value data, leading to financial losses, reputational damage, and regulatory penalties. This has necessitated the implementation of advanced database security solutions that offer real-time threat detection, encryption, access control, and audit capabilities to safeguard critical data and maintain business continuity.
Compliance with stringent regulatory frameworks is also propelling the growth of the database security solution market. Regulations such as GDPR, HIPAA, and CCPA mandate the protection of personal and sensitive information, compelling organizations to adopt comprehensive database security measures. Businesses are investing heavily in database security solutions to meet these regulatory requirements, avoid hefty fines, and build customer trust by ensuring data confidentiality and compliance.
The advent of Big Data Security has become a pivotal aspect in the realm of database security solutions. As organizations increasingly rely on big data analytics to drive business insights, the security of this data becomes paramount. Big Data Security involves implementing comprehensive measures to protect large volumes of data from unauthorized access and breaches. It encompasses various strategies, including encryption, access controls, and real-time monitoring, to ensure that sensitive data remains protected throughout its lifecycle. As the volume and complexity of data continue to grow, the demand for advanced Big Data Security solutions is expected to rise, driving further innovation and investment in this area.
Regionally, the database security solution market is witnessing significant growth, with North America leading the charge due to its advanced technological infrastructure, early adoption of innovative security solutions, and stringent data protection laws. Europe is also experiencing substantial growth driven by the enforcement of GDPR and increasing awareness of data privacy issues. The Asia Pacific region is projected to witness the highest CAGR during the forecast period, fueled by the rapid digital transformation, rising cyber threats, and growing government initiatives to enhance cybersecurity.
The database security solution market can be segmented by component into software, hardware, and services. The software segment holds the largest market share, driven by the extensive use of database security software to protect data against unauthorized access, malware, and other cyber threats. These software solutions offer various functionalities such as encryption, access control, auditing, and monitoring, making them indispensable for organizations looking to secure their databases effectively.
The hardware segment, although smaller compared to software, plays a crucial role in enhancing database security. Hardware-based security solutions, such as hardware security modules (HSMs), are used for cryptographic key management and secure storage of sensitive data. These solutions provide an additional layer of security by ensuring that cryptographic operations are performed in a tamper-resistant environment, thus preventing unauthorized access and key compromise.
The services segment is also witnessing significant growth, driven by the increasing demand for m
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The global full-text database market is projected to grow from XXX million in 2025 to XXX million by 2033, at a CAGR of XX% during the forecast period. The growth is attributed to increasing demand for information retrieval, advancements in technology, and rising need for efficient research and development. Key drivers of the market include growing adoption of digital libraries, rising demand for personalized content, and increasing focus on research and development. Key trends in the full-text database market include the emergence of artificial intelligence (AI) and machine learning (ML) technologies, the growth of open access publishing, and the increasing adoption of cloud-based solutions. The market is segmented by application (academic research, corporate research, legal research, and others) and by type (bibliographic, full-text, and abstract). Major players in the market include John Wiely & Sons, ICPSR, IEEE, EBSCO, UMI, Blackwell, Springer Link, Elsevier Science, Apache Solr, Elastic N.V., CNKI, China Science and Technology Journal Database, Wanfang Data Knowledge Service Platform, China Science Citation Database, and Chinese, Western, Japanese and Russian Journals Joint Directory Database. The market is expected to witness significant growth in emerging economies, such as China and India, due to rising literacy rates and increasing demand for information access.
This is an inventory of all data assets maintained by USAID.
See Condit (1998).
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Tumor-associated autoantibodies (TAAbs) have demonstrated potential as biomarkers for cancer detection. However, the understanding of their role in hepatocellular carcinoma (HCC) remains limited. In this study, we aimed to systematically collect and standardize information about these TAAbs and establish a comprehensive database as a platform for in-depth research. A total of 170 TAAbs were identified from published papers retrieved from PubMed, Web of Science, and Embase. Following normative reannotation, these TAAbs were referred to as 162 official symbols. The hccTAAb (tumor-associated autoantibodies in hepatocellular carcinoma) atlas was developed using the R Shiny framework and incorporating literature-based and multiomics data sets. This comprehensive online resource provides key information such as sensitivity, specificity, and additional details such as official symbols, official full names, UniProt, NCBI, HPA, neXtProt, and aliases through hyperlinks. Additionally, hccTAAb offers six analytical modules for visualizing expression profiles, survival analysis, immune infiltration, similarity analysis, DNA methylation, and DNA mutation analysis. Overall, the hccTAAb Atlas provides valuable insights into the mechanisms underlying TAAb and has the potential to enhance the diagnosis and treatment of HCC using autoantibodies. The hccTAAb Atlas is freely accessible at https://nscc.v.zzu.edu.cn/hccTAAb/.
The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.
The Global Findex is the world's most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of about 128,000 adults in more than 120 economies. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.
The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.
Northwest Territories, Yukon, and Nunavut (representing approximately 0.3 percent of the Canadian population) were excluded.
Individual
Observation data/ratings [obs]
In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19 related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Phone surveys were not a viable option in 17 economies that had been part of previous Global Findex surveys, however, because of low mobile phone ownership and surveying restrictions. Data for these economies will be collected in 2022 and released in 2023.
In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.
In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.
The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).
For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.
Sample size for Canada is 1007.
Landline and mobile telephone
Questionnaires are available on the website.
Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.
Database of SARS-CoV-2 and other viruses. Integrates structural and dynamic insights with viral evolution for proteins coded by virus. Each virus within database has workflow performed on each protein. Workflow consists of protein modeling, molecular dynamic simulations, evolutionary analysis, and mapping of protein-protein interactions. On page for each protein is link to individual protein data folder system, video of protein rotating with conservation, details of protein function, widget to purchase 3D print of protein at cost of production, amino acid movement from molecular dynamic simulations, and table of data for each amino acid of protein.
The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.
The Global Findex is the world’s most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of almost 145,000 people in 139 economies, representing 97 percent of the world’s population. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.
The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.
Al Baydaa, Al Jawf, Mareb, Sadah, the Island of Socotra, and several districts in other governorates were excluded due to their small size, remoteness or security issues. The excluded areas represent approximately 23% of the population. In addition, due to the ongoing security situation, during field over one-fourth of the PSUs were replaced with a similar PSU in the same province.
Observation data/ratings [obs]
In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19–related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Additionally, phone surveys were not a viable option in 16 economies in 2021, which were then surveyed in 2022.
In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.
In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.
The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).
For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.
Sample size for Yemen, Rep. is 1000.
Face-to-face [f2f]
Questionnaires are available on the website.
Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Plants produce a wide range of bioactive peptides as part of their innate defense mechanisms. With the explosive growth of plant-derived peptides, verifying the therapeutic function using traditional experimental methods are resources and time consuming. Therefore, it is necessary to predict the therapeutic function of plant-derived peptides more effectively and accurately with reduced waste of resources and thus expedite the development of plant peptides. We herein developed a repository of plant peptides predicted to have multiple therapeutic functions, named as MFPPDB (multi-functional plant peptide database). MFPPDB including 1,482,409 single or multiple functional plant origin therapeutic peptides derived from 121 fundamental plant species. The functional categories of these therapeutic peptides include 41 different features such as anti-bacterial, anti-fungal, anti-HIV, anti-viral, and anti-cancer. The detailed physicochemical information of these peptides was presented in functional search and physicochemical property search module, which can help users easily access the peptide information by the plant peptide species, ID, and functions, or by their peptide ID, isoelectric point, peptide sequence, and molecular weight through web-friendly interface. We further matched the predicted peptides to nine state-of-the-art curated functional peptide databases and found that at least 293,408 of the peptides possess functional potentials. Overall, MFPPDB integrated a massive number of plant peptides have single or multiple therapeutic functions, which will facilitate the comprehensive research in plant peptidomics. MFPPDB can be freely accessed through http://124.223.195.214:9188/mfppdb/index.
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The Database Security market is experiencing robust growth, projected to reach $2556.1 million in 2025 and exhibiting a Compound Annual Growth Rate (CAGR) of 11.4% from 2025 to 2033. This expansion is fueled by the increasing frequency and sophistication of cyberattacks targeting sensitive data stored in databases, coupled with stringent data privacy regulations like GDPR and CCPA. The rising adoption of cloud computing and the proliferation of big data also contribute significantly to market growth, as organizations require robust security solutions to protect their valuable data assets across diverse environments. The market is segmented by application (SMEs, Large Enterprises) and type (Marketing, Sales, Operations, Finance, HR & Legal), with large enterprises and applications involving sensitive financial data demonstrating particularly high demand for advanced database security solutions. North America currently holds a dominant market share due to early adoption of advanced technologies and a strong regulatory landscape, but the Asia-Pacific region is poised for significant growth, driven by increasing digitalization and a rapidly expanding economy. The competitive landscape is characterized by a mix of established players like Oracle and IBM, alongside specialized security vendors such as Trustwave and McAfee. These companies offer a wide range of solutions, including database activity monitoring, encryption, access control, and vulnerability management. The market is witnessing innovation in areas like AI-powered threat detection and automated security response, which are enhancing the effectiveness and efficiency of database security solutions. However, challenges remain, including the rising complexity of cyber threats, the skills gap in cybersecurity professionals, and the high cost of implementing and maintaining comprehensive database security systems. The continued evolution of cyberattacks and data privacy regulations will be key drivers shaping the future of this dynamic market.
Comparasite is an integrated database of our original full-length cDNA sequence data. It consists of seven sub-databases of apicomplexa protozoa, Plasmodium falciparum, Plasmodium yoelii, Plasmodium vivax, Toxoplasma gondii, Cryptosporidium parvum, Echinococcus multilocularis. Homologous gene groups are clustered and comparative analysis of any combination of these seven species is implemented, such as interspecies comparisons as to cellular localization, motifs or transmembrane regions and so on. For submitted keywords and other search conditions, Comparasite retrieves orthologous gene groups containing a given protein motif/GO term etc in common or in a species-specific manner. By enabling multi-faceted comparative analyses of genes of apicomplexa protozoa, monophyletic organisms that have evolved to diversify to parasitize various hosts by adopting complex life cycles, Comparasite should help elucidate the mechanism behind parasitism.
The U.S. Department of Housing and Urban Development (HUD) periodically receives custom tabulations of data from the U.S. Census Bureau that are largely not available through standard Census products. These data, known as the CHAS data (Comprehensive Housing Affordability Strategy), demonstrate the extent of housing problems and housing needs, particularly for low income households. The CHAS data are used by local governments to plan how to spend HUD funds, and may also be used by HUD to distribute grant funds
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Explore Newly Registered Domains from January 13, 2053, with our comprehensive database. Access real-time data on recently created domains.
The fourth edition of the Global Findex offers a lens into how people accessed and used financial services during the COVID-19 pandemic, when mobility restrictions and health policies drove increased demand for digital services of all kinds.
The Global Findex is the world's most comprehensive database on financial inclusion. It is also the only global demand-side data source allowing for global and regional cross-country analysis to provide a rigorous and multidimensional picture of how adults save, borrow, make payments, and manage financial risks. Global Findex 2021 data were collected from national representative surveys of about 128,000 adults in more than 120 economies. The latest edition follows the 2011, 2014, and 2017 editions, and it includes a number of new series measuring financial health and resilience and contains more granular data on digital payment adoption, including merchant and government payments.
The Global Findex is an indispensable resource for financial service practitioners, policy makers, researchers, and development professionals.
National coverage
Individual
Observation data/ratings [obs]
In most developing economies, Global Findex data have traditionally been collected through face-to-face interviews. Surveys are conducted face-to-face in economies where telephone coverage represents less than 80 percent of the population or where in-person surveying is the customary methodology. However, because of ongoing COVID-19 related mobility restrictions, face-to-face interviewing was not possible in some of these economies in 2021. Phone-based surveys were therefore conducted in 67 economies that had been surveyed face-to-face in 2017. These 67 economies were selected for inclusion based on population size, phone penetration rate, COVID-19 infection rates, and the feasibility of executing phone-based methods where Gallup would otherwise conduct face-to-face data collection, while complying with all government-issued guidance throughout the interviewing process. Gallup takes both mobile phone and landline ownership into consideration. According to Gallup World Poll 2019 data, when face-to-face surveys were last carried out in these economies, at least 80 percent of adults in almost all of them reported mobile phone ownership. All samples are probability-based and nationally representative of the resident adult population. Phone surveys were not a viable option in 17 economies that had been part of previous Global Findex surveys, however, because of low mobile phone ownership and surveying restrictions. Data for these economies will be collected in 2022 and released in 2023.
In economies where face-to-face surveys are conducted, the first stage of sampling is the identification of primary sampling units. These units are stratified by population size, geography, or both, and clustering is achieved through one or more stages of sampling. Where population information is available, sample selection is based on probabilities proportional to population size; otherwise, simple random sampling is used. Random route procedures are used to select sampled households. Unless an outright refusal occurs, interviewers make up to three attempts to survey the sampled household. To increase the probability of contact and completion, attempts are made at different times of the day and, where possible, on different days. If an interview cannot be obtained at the initial sampled household, a simple substitution method is used. Respondents are randomly selected within the selected households. Each eligible household member is listed, and the hand-held survey device randomly selects the household member to be interviewed. For paper surveys, the Kish grid method is used to select the respondent. In economies where cultural restrictions dictate gender matching, respondents are randomly selected from among all eligible adults of the interviewer's gender.
In traditionally phone-based economies, respondent selection follows the same procedure as in previous years, using random digit dialing or a nationally representative list of phone numbers. In most economies where mobile phone and landline penetration is high, a dual sampling frame is used.
The same respondent selection procedure is applied to the new phone-based economies. Dual frame (landline and mobile phone) random digital dialing is used where landline presence and use are 20 percent or higher based on historical Gallup estimates. Mobile phone random digital dialing is used in economies with limited to no landline presence (less than 20 percent).
For landline respondents in economies where mobile phone or landline penetration is 80 percent or higher, random selection of respondents is achieved by using either the latest birthday or household enumeration method. For mobile phone respondents in these economies or in economies where mobile phone or landline penetration is less than 80 percent, no further selection is performed. At least three attempts are made to reach a person in each household, spread over different days and times of day.
Sample size for Kazakhstan is 1000.
Face-to-face [f2f]
Questionnaires are available on the website.
Estimates of standard errors (which account for sampling error) vary by country and indicator. For country-specific margins of error, please refer to the Methodology section and corresponding table in Demirgüç-Kunt, Asli, Leora Klapper, Dorothe Singer, Saniya Ansar. 2022. The Global Findex Database 2021: Financial Inclusion, Digital Payments, and Resilience in the Age of COVID-19. Washington, DC: World Bank.
DBGC: A Database of Human Gastric CancerOur work collected the data from transcriptome, proteome, mutation, biomarker and drug-sensitive genes. Totally comes out 9 files:
1,transcriptome.txt -- transcriptome related to gastric cancer, that contains the type of experiment, platform, sample, volume, result and the linkage of those. 2,up_genes.txt -- up-regulated gene list come from transcriptome which based on experiment 3,down_genes.txt -- up-regulated gene list come from transcriptome which based on experiment 4,proteome.txt -- proteome related to gastric cancer, that contains the type of experiment, sample, technology and result etc. 5,up_proteins.txt -- up-regulated genes list come from proteome 6,down_proteins.txt -- down-regulated genes list come from proteome 7,mutation.txt -- mutation related to gastric cancer, that mainly include mutation type, postion etc. 8,biomarker.txt -- biomarker related to gastric cancer including types, stages as well sensitivity, specifity ect. 9,pharma...
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.
Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.
The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.
Methods The SkewDB analysis relies exclusively on the tens of thousands of FASTA and GFF3 files available through the NCBI download service, which covers both GenBank and RefSeq. The database includes bacteria, archaea and their plasmids. Furthermore, to ease analysis, the NCBI Taxonomy database is sourced and merged so output data can quickly be related to (super)phyla or specific species. No other data is used, which greatly simplifies processing. Data is read directly in the compressed format provided by NCBI.
All results are emitted as standard CSV files. In the first step of the analysis, for each organism the FASTA sequence and the GFF3 annotation file are parsed. Every chromosome in the FASTA file is traversed from beginning to end, while a running total is kept for cumulative GC and TA skew. In addition, within protein coding genes, such totals are also kept separately for these skews on the first, second and third codon position. Furthermore, separate totals are kept for regions which do not code for proteins. In addition, to enable strand bias measurements, a cumulative count is maintained of nucleotides that are part of a positive or negative sense gene. The counter is increased for positive sense nucleotides, decreased for negative sense nucleotides, and left alone for non-genic regions.
A separate counter is kept for non-genic nucleotides. Finally, G and C nucleotides are counted, regardless of if they are part of a gene or not. These running totals are emitted at 4096 nucleotide intervals, a resolution suitable for determining skews and shifts. In addition, one line summaries are stored for each chromosome. These line includes the RefSeq identifier of the chromosome, the full name mentioned in the FASTA file, plus counts of A, C, G and T nucleotides. Finally five levels of taxonomic data are stored.
Chromosomes and plasmids of fewer than 100 thousand nucleotides are ignored, as these are too noisy to model faithfully. Plasmids are clearly marked in the database, enabling researchers to focus on chromosomes if so desired. Fitting Once the genomes have been summarised at 4096-nucleotide resolution, the skews are fitted to a simple model. The fits are based on four parameters. Alpha1 and alpha2 denote the relative excess of G over C on the leading and lagging strands. If alpha1 is 0.046, this means that for every 1000 nucleotides on the leading strand, the cumulative count of G excess increases by 46. The third parameter is div and it describes how the chromosome is divided over leading and lagging strands. If this number is 0.557, the leading replication strand is modeled to make up 55.7% of the chromosome. The final parameter is shift (the dotted vertical line), and denotes the offset of the origin of replication compared to the DNA FASTA file. This parameter has no biological meaning of itself, and is an artifact of the DNA assembly process.
The goodness-of-fit number consists of the root mean squared error of the fit, divided by the absolute mean skew. This latter correction is made to not penalize good fits for bacteria showing significant skew. GC skew tends to be defined very strongly, and it is therefore used to pick the div and shift parameters of the DNA sequence, which are then kept as a fixed constraint for all the other skews, which might not be present as clearly. The fitting process itself is a downhill simplex method optimization over the three dimensions, seeded with the average observed skew over the whole genome, and assuming there is no shift, and that the leading and lagging strands are evenly distributed. The simplex optimization is tuned so that it takes sufficiently large steps so it can reach the optimum even if some initial assumptions are off.