58 datasets found
  1. h

    AIDS

    • huggingface.co
    Updated Apr 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Graph Datasets (2023). AIDS [Dataset]. https://huggingface.co/datasets/graphs-datasets/AIDS
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 20, 2023
    Dataset authored and provided by
    Graph Datasets
    Description

    Dataset Card for AIDS

      Dataset Summary
    

    The AIDS dataset is a dataset containing compounds checked for evidence of anti-HIV activity..

      Supported Tasks and Leaderboards
    

    AIDS should be used for molecular classification, a binary classification task. The score used is accuracy with cross validation.

      External Use
    
    
    
    
    
      PyGeometric
    

    To load in PyGeometric, do the following: from datasets import load_dataset

    from torch_geometric.data import Data from… See the full description on the dataset page: https://huggingface.co/datasets/graphs-datasets/AIDS.

  2. P

    AIDS Dataset

    • paperswithcode.com
    • opendatalab.com
    • +1more
    Updated Apr 16, 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaspar Riesen; Horst Bunke (2008). AIDS Dataset [Dataset]. https://paperswithcode.com/dataset/aids
    Explore at:
    Dataset updated
    Apr 16, 2008
    Authors
    Kaspar Riesen; Horst Bunke
    Description

    AIDS is a graph dataset. It consists of 2000 graphs representing molecular compounds which are constructed from the AIDS Antiviral Screen Database of Active Compounds. It contains 4395 chemical compounds, of which 423 belong to class CA, 1081 to CM, and the remaining compounds to CI.

  3. Total HIV/AIDS funding by the National Institutes for Health 2013-2025

    • ai-chatbox.pro
    • statista.com
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matej Mikulic (2025). Total HIV/AIDS funding by the National Institutes for Health 2013-2025 [Dataset]. https://www.ai-chatbox.pro/?_=%2Ftopics%2F3082%2Fhiv-aids-in-the-us%2F%23XgboD02vawLKoDs%2BT%2BQLIV8B6B4Q9itA
    Explore at:
    Dataset updated
    May 23, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Matej Mikulic
    Description

    HIV/AIDS funding by the NIH stood at around 3.3 billion U.S. dollars in fiscal year 2023. This graph shows the total HIV/AIDS funding by the National Institutes for Health (NIH) from FY 2013 to FY 2023 and estimates for FY 2024 and FY 2025.

  4. BRFSS: Graph of Current HIV-AIDS testing among adults

    • data.wu.ac.at
    csv, json, xml
    Updated Jun 10, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2015). BRFSS: Graph of Current HIV-AIDS testing among adults [Dataset]. https://data.wu.ac.at/schema/data_cdc_gov/Z2JkaC02eGNy
    Explore at:
    xml, json, csvAvailable download formats
    Dataset updated
    Jun 10, 2015
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    2011 to present. BRFSS combined land line and cell phone prevalence data. BRFSS is a continuous, state-based surveillance system that collects information about modifiable risk factors for chronic diseases and other leading causes of death. Data will be updated annually as it becomes available. Detailed information on sampling methodology and quality assurance can be found on the BRFSS website (http://www.cdc.gov/brfss). Methodology: http://www.cdc.gov/brfss/factsheets/pdf/DBS_BRFSS_survey.pdf Glossary: https://chronicdata.cdc.gov/Behavioral-Risk-Factors/Behavioral-Risk-Factor-Surveillance-System-BRFSS-H/iuq5-y9ct

  5. P

    AIDS Antiviral Screen Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Apr 16, 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaspar Riesen; Horst Bunke (2008). AIDS Antiviral Screen Dataset [Dataset]. https://paperswithcode.com/dataset/aids-antiviral-screen
    Explore at:
    Dataset updated
    Apr 16, 2008
    Authors
    Kaspar Riesen; Horst Bunke
    Description

    The AIDS Antiviral Screen dataset is a dataset of screens checking tens of thousands of compounds for evidence of anti-HIV activity. The available screen results are chemical graph-structured data of these various compounds.

  6. Number of people with HIV in select countries in Africa 2023

    • statista.com
    • ai-chatbox.pro
    Updated Aug 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Number of people with HIV in select countries in Africa 2023 [Dataset]. https://www.statista.com/statistics/1305217/number-people-with-hiv-african-countries/
    Explore at:
    Dataset updated
    Aug 21, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2023
    Area covered
    Africa
    Description

    As of 2023, South Africa was the country with the highest number of people living with HIV in Africa. At that time, around 7.7 million people in South Africa were HIV positive. In Mozambique, the country with the second-highest number of HIV-positive people in Africa, around 2.4 million people were living with HIV. Which country in Africa has the highest prevalence of HIV? Although South Africa has the highest total number of people living with HIV in Africa, it does not have the highest prevalence of HIV on the continent. Eswatini currently has the highest prevalence of HIV in Africa and worldwide, with almost 26 percent of the population living with HIV. South Africa has the third-highest prevalence, with around 18 percent of the population HIV positive. Eswatini also has the highest rate of new HIV infections per 1,000 population worldwide, followed by Lesotho and South Africa. However, South Africa had the highest total number of new HIV infections in 2023, with around 150,000 people newly infected with HIV that year. Deaths from HIV in Africa Thanks to advances in treatment and awareness, HIV/AIDS no longer contributes to a significant amount of death in many countries. However, the disease is still the fourth leading cause of death in Africa, accounting for around 5.6 percent of all deaths. In 2023, South Africa and Nigeria were the countries with the highest number of AIDS-related deaths worldwide with 50,000 and 45,000 such deaths, respectively. Although not every country in the leading 25 for AIDS-related deaths is found in Africa, African countries account for the majority of countries on the list. Fortunately, HIV treatment has become more accessible in Africa over the years and now up to 95 percent of people living with HIV in Eswatini are receiving antiretroviral therapy (ART). Access to ART does vary from country to country, however, with around 77 percent of people who are HIV positive in South Africa receiving ART, and only 31 percent in the Congo.

  7. IAM Graph Database

    • zenodo.org
    zip
    Updated Sep 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaspar Riesen; Kaspar Riesen (2024). IAM Graph Database [Dataset]. http://doi.org/10.1007/978-3-540-89689-0_33
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 14, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Kaspar Riesen; Kaspar Riesen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This database defined from the AIDS Antiviral Screen Database of Active Compounds is composed of 2000 chemical compounds some of them being disconnected. These chemical compounds have been screened as active or inactive against HIV and they are split into three different sets:

    • A train set composed of 250 compounds used to train SVM.
    • A validation set composed of 250 compounds used to find parameters giving the best accuracy result.
    • A test set composed of remaining 1500 compounds used to test the classification model.

    Results on AIDS dataset.

    MethodClassification accuracy (%)
    (1)Riesen and Bunke (2008)97.3
    (2)Suard et al. (2002)98.5
    (3)Vishwanathan et al. (2010)98.5
    (4)Neuhaus and Bunke (2007)99.7
    (5)Riesen et al. (2007)98.2
    (6)Graph Laplacian kernel99.3
    (7)Gauzere el al. (2012)99.1

    References

    • Gaüzère, B., et al. Two new graphs kernels in chemoinformatics. Pattern Recognition Lett. (2012), http://dx.doi.org/10.1016/j.patrec.2012.03.020.
    • Neuhaus, M., Bunke, H., 2007. Bridging the Gap between Graph Edit Distance and Kernel Machines. World Scientific Pub Co Inc..
    • Riesen, K., Neuhaus, M., Bunke, H., 2007. Graph embedding in vector spaces by means of prototype selection. In: Escolano, F., Vento, M. (Eds.), 6th IAPR-TC15 Internat. Workshop GbRPR 2007. IAPR TC15. Springer-Verlag, pp. 383–393.
    • Riesen, K., Bunke, H., 2008. Iam graph database repository for graph based pattern recognition and machine learning. In: Proc. 2008 Joint IAPR Internat. Workshop on Structural, Syntactic, and Statistical Pattern Recognition. SSPR & SPR ’08. Springer-Verlag, Berlin, Heidelberg, pp. 287–297.
    • Suard, F., Rakotomamonjy, A., Bensrhair, A., 2002. Kernel on bag of paths for measuring similarity of shapes. In: European Symposium on Artificial Neural Networks. pp. 355–360.
    • Vishwanathan, S., Borgwardt, K.M., Kondor, I.R., Schraudolph, N.N., 2010. Graph kernels. J. Machine Learn. Res. 11, 1201–1242.
  8. HIV: annual data

    • gov.uk
    Updated Oct 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK Health Security Agency (2024). HIV: annual data [Dataset]. https://www.gov.uk/government/statistics/hiv-annual-data-tables
    Explore at:
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    UK Health Security Agency
    Description

    The following slide sets are available to download for presentational use:

    New HIV diagnoses, AIDS and deaths are collected from HIV outpatient clinics, laboratories and other healthcare settings. Data relating to people living with HIV is collected from HIV outpatient clinics. Data relates to England, Wales, Northern Ireland and Scotland, unless stated.

    HIV testing, pre-exposure prophylaxis, and post-exposure prophylaxis data relates to activity at sexual health services in England only.

    View the pre-release access lists for these statistics.

    Previous reports, data tables and slide sets are also available for:

    Our statistical practice is regulated by the Office for Statistics Regulation (OSR). The OSR sets the standards of trustworthiness, quality and value in the https://code.statisticsauthority.gov.uk/" class="govuk-link">Code of Practice for Statistics that all producers of Official Statistics should adhere to.

    Additional information on HIV surveillance can be found in the HIV Action Plan for England monitoring and evaluation framework reports. Other HIV in the UK reports published by Public Health England (PHE) are available online.

  9. Countries with the highest prevalence of HIV in 2000 and 2023

    • statista.com
    • ai-chatbox.pro
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Countries with the highest prevalence of HIV in 2000 and 2023 [Dataset]. https://www.statista.com/statistics/270209/countries-with-the-highest-global-hiv-prevalence/
    Explore at:
    Dataset updated
    Jun 23, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    Among all countries worldwide those in sub-Saharan Africa have the highest rates of HIV. The countries with the highest rates of HIV include Eswatini, Lesotho, and South Africa. In 2023, Eswatini had the highest prevalence of HIV with a rate of around ** percent. Other countries, such as Zimbabwe, have significantly decreased their HIV prevalence. Community-based HIV services are considered crucial to the prevention and treatment of HIV. HIV Worldwide The human immunodeficiency virus (HIV) is a viral infection that is transmitted via exposure to infected semen, blood, vaginal and anal fluids and breast milk. HIV destroys the human immune system, rendering the host unable to fight off secondary infections. Globally, the number of people living with HIV has generally increased over the past two decades. However, the number of HIV-related deaths has decreased significantly in recent years. Despite being a serious illness that affects millions of people, medication exists that effectively manages the progression of the virus in the body. These medications are called antiretroviral drugs. HIV Treatment Generally, global access to antiretroviral treatment has increased in recent years. However, despite being available worldwide, not all adults have access to antiretroviral drugs. Europe and North America have the highest rates of antiretroviral use among people living with HIV. There are many different antiretroviral drugs available on the market. As of 2024, ********, an antiretroviral marketed by Gilead, was the leading HIV treatment based on revenue.

  10. Gross rate of care for HIV/AIDS in France 2019, by age group

    • statista.com
    Updated Sep 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Gross rate of care for HIV/AIDS in France 2019, by age group [Dataset]. https://www.statista.com/statistics/1117324/hiv-rate-gross-taken-in-charge-men-by-age-france/
    Explore at:
    Dataset updated
    Sep 23, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2019
    Area covered
    France
    Description

    This graph illustrates the gross rate of health insurance coverage for HIV in France in 2019, by age. It shows that the highest gross rate was found among people aged 55 to 64 with around 4.7 per thousand people.

  11. Discrimination against HIV positive people in China

    • statista.com
    Updated Jun 15, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2010). Discrimination against HIV positive people in China [Dataset]. https://www.statista.com/statistics/271647/discrimination-against-hiv-positive-people-in-china/
    Explore at:
    Dataset updated
    Jun 15, 2010
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2009
    Area covered
    China
    Description

    This graph depicts the percentage of the types of discrimination against HIV positive people in China in 2009. 2.9 percent of HIV-positive women reported having been physically assaulted.

  12. Total number of AIDS-related deaths worldwide 2000-2023

    • statista.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Total number of AIDS-related deaths worldwide 2000-2023 [Dataset]. https://www.statista.com/statistics/257209/number-of-aids-related-deaths-worldwide-since-2001/
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    UNAIDS estimated that there were some ******* people worldwide that died from acquired immune deficiency syndrome (AIDS) in 2023. This statistic depicts the total number of annual AIDS-related deaths worldwide from 2000 to 2023. HIV/AIDS burden A majority of countries with the highest burden due to HIV and AIDS are in Africa- in 2023, the highest number of AIDS-related deaths occurred in South Africa and Nigeria and the highest prevalence of HIV was found in Eswatini. Although access to life-saving antiretroviral therapy treatment (ART) has increased globally over recent years, many individuals living with HIV still lack access to ART. Barriers and interventions In part due to the development of ART, the number of people living with HIV worldwide is continuing to increase, reaching almost ** million in 2023. Important public health measures to combat the burden of the disease include a combination of biomedical and behavioral interventions such as pre- and post-exposure prophylaxis, and context-specific structural interventions to reduce barriers to supplies and education. One prominent barrier faced by those living with HIV is stigma, which can often cause disadvantages in many areas of life, including employment, use of health services, and social support.

  13. f

    A Graph is Worth a Thousand Words: How Overconfidence and Graphical...

    • figshare.com
    tiff
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ricardo Lopes Cardoso; Rodrigo Oliveira Leite; André Carlos Busanelli de Aquino (2023). A Graph is Worth a Thousand Words: How Overconfidence and Graphical Disclosure of Numerical Information Influence Financial Analysts Accuracy on Decision Making [Dataset]. http://doi.org/10.1371/journal.pone.0160443
    Explore at:
    tiffAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ricardo Lopes Cardoso; Rodrigo Oliveira Leite; André Carlos Busanelli de Aquino
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Previous researches support that graphs are relevant decision aids to tasks related to the interpretation of numerical information. Moreover, literature shows that different types of graphical information can help or harm the accuracy on decision making of accountants and financial analysts. We conducted a 4×2 mixed-design experiment to examine the effects of numerical information disclosure on financial analysts’ accuracy, and investigated the role of overconfidence in decision making. Results show that compared to text, column graph enhanced accuracy on decision making, followed by line graphs. No difference was found between table and textual disclosure. Overconfidence harmed accuracy, and both genders behaved overconfidently. Additionally, the type of disclosure (text, table, line graph and column graph) did not affect the overconfidence of individuals, providing evidence that overconfidence is a personal trait. This study makes three contributions. First, it provides evidence from a larger sample size (295) of financial analysts instead of a smaller sample size of students that graphs are relevant decision aids to tasks related to the interpretation of numerical information. Second, it uses the text as a baseline comparison to test how different ways of information disclosure (line and column graphs, and tables) can enhance understandability of information. Third, it brings an internal factor to this process: overconfidence, a personal trait that harms the decision-making process of individuals. At the end of this paper several research paths are highlighted to further study the effect of internal factors (personal traits) on financial analysts’ accuracy on decision making regarding numerical information presented in a graphical form. In addition, we offer suggestions concerning some practical implications for professional accountants, auditors, financial analysts and standard setters.

  14. Area under the curve (AUC) of a receiver-operator characteristics (ROC)...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sikhulile Moyo; Alain Vandormael; Eduan Wilkinson; Susan Engelbrecht; Simani Gaseitsiwe; Kenanao P. Kotokwe; Rosemary Musonda; Frank Tanser; Max Essex; Vladimir Novitsky; Tulio de Oliveira (2023). Area under the curve (AUC) of a receiver-operator characteristics (ROC) graph comparing the accuracy of the PwD, BED, and LAg assays in identifying HIV infection recency. [Dataset]. http://doi.org/10.1371/journal.pone.0160649.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Sikhulile Moyo; Alain Vandormael; Eduan Wilkinson; Susan Engelbrecht; Simani Gaseitsiwe; Kenanao P. Kotokwe; Rosemary Musonda; Frank Tanser; Max Essex; Vladimir Novitsky; Tulio de Oliveira
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Area under the curve (AUC) of a receiver-operator characteristics (ROC) graph comparing the accuracy of the PwD, BED, and LAg assays in identifying HIV infection recency.

  15. Equal Graph Partitioning on Estimated Infection Network as an Effective...

    • plos.figshare.com
    wmv
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeremy Hadidjojo; Siew Ann Cheong (2023). Equal Graph Partitioning on Estimated Infection Network as an Effective Epidemic Mitigation Measure [Dataset]. http://doi.org/10.1371/journal.pone.0022124
    Explore at:
    wmvAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jeremy Hadidjojo; Siew Ann Cheong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Controlling severe outbreaks remains the most important problem in infectious disease area. With time, this problem will only become more severe as population density in urban centers grows. Social interactions play a very important role in determining how infectious diseases spread, and organization of people along social lines gives rise to non-spatial networks in which the infections spread. Infection networks are different for diseases with different transmission modes, but are likely to be identical or highly similar for diseases that spread the same way. Hence, infection networks estimated from common infections can be useful to contain epidemics of a more severe disease with the same transmission mode. Here we present a proof-of-concept study demonstrating the effectiveness of epidemic mitigation based on such estimated infection networks. We first generate artificial social networks of different sizes and average degrees, but with roughly the same clustering characteristic. We then start SIR epidemics on these networks, censor the simulated incidences, and use them to reconstruct the infection network. We then efficiently fragment the estimated network by removing the smallest number of nodes identified by a graph partitioning algorithm. Finally, we demonstrate the effectiveness of this targeted strategy, by comparing it against traditional untargeted strategies, in slowing down and reducing the size of advancing epidemics.

  16. f

    Life table of cohort of HIV/AIDS patients attending follow-up at public...

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maycas Dembelu; Mesfin Kote; Girma Gilano; Temesgen Mohammed (2023). Life table of cohort of HIV/AIDS patients attending follow-up at public health facilities in Arba Minch Town, Ethiopia. [Dataset]. http://doi.org/10.1371/journal.pone.0261454.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Maycas Dembelu; Mesfin Kote; Girma Gilano; Temesgen Mohammed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Arba Minch, Ethiopia, Arba Minch Town
    Description

    Life table of cohort of HIV/AIDS patients attending follow-up at public health facilities in Arba Minch Town, Ethiopia.

  17. New cases of HIV diagnosed in the United Kingdom (UK) 2013-2023

    • statista.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). New cases of HIV diagnosed in the United Kingdom (UK) 2013-2023 [Dataset]. https://www.statista.com/statistics/648728/new-hiv-cases-diagnosed-in-the-united-kingdom-uk/
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United Kingdom
    Description

    The number of new cases of HIV diagnosed in the UK fluctuated over the observed period. In 2023, there were ***** new HIV cases recorded in the UK, highest in the given period. Cases of AIDS in the UK were significantly lower, with *** cases in 2023. STIs in the UK Other common STIs in the UK are herpes, gonorrhea, and chlamydia. Especially for gonorrhea and chlamydia, an increase in cases was observed between 2012 and 2019, while in 2020 and 2021 figures fell dramatically due to the COVID-19 pandemic and resulting lockdowns and social distancing. HIV in Europe New cases of HIV in Europe amounted to roughly **** thousand in 2023, of which **** thousand were among males. Among male individuals, the most common mode of HIV transmission in Europe in 2023 was among men having homosexual intercourse.

  18. P

    SciGraphQA Dataset

    • library.toponeai.link
    • paperswithcode.com
    Updated Aug 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shengzhi Li; Nima Tajbakhsh (2023). SciGraphQA Dataset [Dataset]. https://library.toponeai.link/dataset/scigraphqa
    Explore at:
    Dataset updated
    Aug 8, 2023
    Authors
    Shengzhi Li; Nima Tajbakhsh
    Description

    SciGraphQA is a large-scale, open-domain dataset focused on generating multi-turn conversational question-answering dialogues centered around understanding and describing scientific graphs and figures. It contains over 300,000 samples derived from academic research papers in computer science and machine learning domains.

    Each sample in ScFiGraphQA consists of a scientific graph image sourced from papers on ArXiv, accompanied by rich textual context including the paper's title, abstract, figure caption, and a paragraph from the paper referencing the figure. Using this comprehensive context, the dataset employs a to produce multi-turn question-answer dialogues aimed at explaining the given graph in an interactive, conversational format. On average, each sample contains 2-3 turns of question-answer exchange.

    The key motivation behind SciGraphQA is providing a large-scale resource to support research and development of multi-modal AI systems that can engage in informative, open-ended conversations about graphs and data visualizations. The multi-turn dialogue format presents a more natural and interactive setting compared to standard visual question answering datasets that use fixed sets of standalone questions.

    Potential use cases of SciGraphQA include pre-training and benchmarking multi-modal conversational models for scientific graph comprehension, building AI assistants that can discuss data insights, and developing aids to help individuals understand complex figures and diagrams interactively. The academic source material also provides a way to evaluate model capabilities on expert-level graphs spanning diverse topics and complex visual encodings.

  19. Z

    Dataset - Clustering Semantic Predicates in the Open Research Knowledge...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Aug 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arab Oghli, Omar (2022). Dataset - Clustering Semantic Predicates in the Open Research Knowledge Graph [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6513498
    Explore at:
    Dataset updated
    Aug 8, 2022
    Dataset authored and provided by
    Arab Oghli, Omar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset has been created for implementing a content-based recommender system in the context of the Open Research Knowledge Graph (ORKG). The recommender system accepts research paper's title and abstracts as input and recommends existing predicates in the ORKG semantically relevant to the given paper.

    The paper instances in the dataset are grouped by ORKG comparisons and therefore the data.json file is more comprehensive than training_set.json and test_set.json.

    data.json

    The main JSON object consists of a list of comparisons. Each comparisons object has an ID, label, list of papers and list of predicates, whereas each paper object has ID, label, DOI, research field, research problems and abstract. Each predicate object has an ID and a label. See an example instance below.

    { "comparisons": [ { "id": "R108331", "label": "Analysis of approaches based on required elements in way of modeling", "papers": [ { "id": "R108312", "label": "Rapid knowledge work visualization for organizations", "doi": "10.1108/13673270710762747", "research_field": { "id": "R134", "label": "Computer and Systems Architecture" }, "research_problems": [ { "id": "R108294", "label": "Enterprise engineering" } ], "abstract": "Purpose \u2013 The purpose of this contribution is to motivate a new, rapid approach to modeling knowledge work in organizational settings and to introduce a software tool that demonstrates the viability of the envisioned concept.Design/methodology/approach \u2013 Based on existing modeling structures, the KnowFlow toolset that aids knowledge analysts in rapidly conducting interviews and in conducting multi\u2010perspective analysis of organizational knowledge work is introduced.Findings \u2013 This article demonstrates how rapid knowledge work visualization can be conducted largely without human modelers by developing an interview structure that allows for self\u2010service interviews. Two application scenarios illustrate the pressing need for and the potentials of rapid knowledge work visualizations in organizational settings.Research limitations/implications \u2013 The efforts necessary for traditional modeling approaches in the area of knowledge management are often prohibitive. This contribution argues that future research needs ..." }, .... ], "predicates": [ { "id": "P37126", "label": "activities, behaviours, means [for knowledge development and/or for knowledge conveyance and transformation" }, { "id": "P36081", "label": "approach name" }, .... ] }, .... ] }

    training_set.json and test_set.json

    The main JSON object consists of a list of training/test instances. Each instance has an instance_id with the format (comparison_id X paper_id) and a text. The text is a concatenation of the paper's label (title) and abstract. See an example instance below.

    Note that test instances are not duplicated and do not occur in the training set. Training instances are also not duplicated, BUT training papers can be duplicated in a concatenation with different comparisons.

    { "instances": [ { "instance_id": "R108331xR108301", "comparison_id": "R108331", "paper_id": "R108301", "text": "A notation for Knowledge-Intensive Processes Business process modeling has become essential for managing organizational knowledge artifacts. However, this is not an easy task, especially when it comes to the so-called Knowledge-Intensive Processes (KIPs). A KIP comprises activities based on acquisition, sharing, storage, and (re)use of knowledge, as well as collaboration among participants, so that the amount of value added to the organization depends on process agents' knowledge. The previously developed Knowledge Intensive Process Ontology (KIPO) structures all the concepts (and relationships among them) to make a KIP explicit. Nevertheless, KIPO does not include a graphical notation, which is crucial for KIP stakeholders to reach a common understanding about it. This paper proposes the Knowledge Intensive Process Notation (KIPN), a notation for building knowledge-intensive processes graphical models." }, ... ] }

    Dataset Statistics:

        -
        Papers
        Predicates
        Research Fields
        Research Problems
    
    
    
    
        Min/Comparison
        2
        2
        1
        0
    
    
        Max/Comparison
        202
        112
        5
        23
    
    
        Avg./Comparison
        21,54
        12,79
        1,20
        1,09
    
    
        Total
        4060
        1816
        46
        178
    

    Dataset Splits:

        -
        Papers
        Comparisons
    
    
    
    
        Training Set
        2857
        214
    
    
        Test Set
        1203
        180
    
  20. f

    Types of OI reoccurred among cohort of HIV/AIDS patient attending ART at...

    • plos.figshare.com
    xls
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maycas Dembelu; Mesfin Kote; Girma Gilano; Temesgen Mohammed (2023). Types of OI reoccurred among cohort of HIV/AIDS patient attending ART at public health facility in Arba Minch Town, Ethiopia. [Dataset]. http://doi.org/10.1371/journal.pone.0261454.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Maycas Dembelu; Mesfin Kote; Girma Gilano; Temesgen Mohammed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Arba Minch, Ethiopia, Arba Minch Town
    Description

    Types of OI reoccurred among cohort of HIV/AIDS patient attending ART at public health facility in Arba Minch Town, Ethiopia.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Graph Datasets (2023). AIDS [Dataset]. https://huggingface.co/datasets/graphs-datasets/AIDS

AIDS

graphs-datasets/AIDS

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 20, 2023
Dataset authored and provided by
Graph Datasets
Description

Dataset Card for AIDS

  Dataset Summary

The AIDS dataset is a dataset containing compounds checked for evidence of anti-HIV activity..

  Supported Tasks and Leaderboards

AIDS should be used for molecular classification, a binary classification task. The score used is accuracy with cross validation.

  External Use





  PyGeometric

To load in PyGeometric, do the following: from datasets import load_dataset

from torch_geometric.data import Data from… See the full description on the dataset page: https://huggingface.co/datasets/graphs-datasets/AIDS.

Search
Clear search
Close search
Google apps
Main menu