5 datasets found
  1. h

    MegaScience

    • huggingface.co
    Updated Feb 10, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MegaScience (2010). MegaScience [Dataset]. https://huggingface.co/datasets/MegaScience/MegaScience
    Explore at:
    Dataset updated
    Feb 10, 2010
    Dataset authored and provided by
    MegaScience
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

    Code: https://github.com/GAIR-NLP/MegaScience Project Page: https://huggingface.co/MegaScience MegaScience is a large-scale mixture of high-quality open-source datasets consisting of 1.25 million instances. We first collect multiple public datasets, then conduct comprehensive ablation studies across different data selection methods to identify the optimal approach for each dataset, thereby… See the full description on the dataset page: https://huggingface.co/datasets/MegaScience/MegaScience.

  2. h

    TextbookReasoning

    • huggingface.co
    Updated Feb 10, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MegaScience (2010). TextbookReasoning [Dataset]. https://huggingface.co/datasets/MegaScience/TextbookReasoning
    Explore at:
    Dataset updated
    Feb 10, 2010
    Dataset authored and provided by
    MegaScience
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

      Dataset Description
    

    Scientific reasoning is critical for developing AI scientists and supporting human researchers in advancing the frontiers of natural science discovery. However, the open-source community has primarily focused on mathematics and coding while neglecting the scientific domain, largely due to the absence of open, large-scale, high-quality, verifiable scientific reasoning… See the full description on the dataset page: https://huggingface.co/datasets/MegaScience/TextbookReasoning.

  3. h

    MegaScience-Qwen3-Tokenized

    • huggingface.co
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seba (2025). MegaScience-Qwen3-Tokenized [Dataset]. https://huggingface.co/datasets/seba/MegaScience-Qwen3-Tokenized
    Explore at:
    Dataset updated
    Jul 30, 2025
    Authors
    Seba
    Description

    seba/MegaScience-Qwen3-Tokenized dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. Mega Science Co Ltd Company profile with phone,email, buyers, suppliers,...

    • volza.com
    csv
    Updated Jul 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza FZ LLC (2025). Mega Science Co Ltd Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/mega-science-co-ltd-19656267/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 16, 2025
    Dataset provided by
    Authors
    Volza FZ LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2014 - Sep 30, 2021
    Variables measured
    Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
    Description

    Credit report of Mega Science Co Ltd contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.

  5. p

    GBIF - Global Biodiversity Information Facility, free and open access to...

    • pigma.org
    Updated Nov 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). GBIF - Global Biodiversity Information Facility, free and open access to biodiversity data [Dataset]. https://www.pigma.org/geonetwork/srv/search?orgName=GBIF%20Secretariat
    Explore at:
    Dataset updated
    Nov 10, 2023
    Description

    GBIF, the Global Biodiversity Information Facility, is an international network and data infrastructure funded by the world's governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth. Coordinated through its Secretariat in Copenhagen, the GBIF network of participating countries and organizations, working through participant nodes, provides data-holding institutions around the world with common standards and open-source tools that enable them to share information about where and when species have been recorded. This knowledge derives from many sources, including everything from museum specimens collected in the 18th and 19th century to geotagged smartphone photos shared by amateur naturalists in recent days and weeks. The GBIF network draws all these sources together through the use of data standards, such as Darwin Core, which forms the basis for the bulk of GBIF.org's index of hundreds of millions of species occurrence records. Publishers provide open access to their datasets using machine-readable Creative Commons licence designations, allowing scientists, researchers and others to apply the data in hundreds of peer-reviewed publications and policy papers each year. Many of these analyses, which cover topics from the impacts of climate change and the spread of invasive and alien pests to priorities for conservation and protected areas, food security and human health, would not be possible without this. GBIF arose from a 1999 recommendation by the Biodiversity Informatics Subgroup of the Organization for Economic Cooperation and Development's Megascience Forum. This report concluded that "An international mechanism is needed to make biodiversity data and information accessible worldwide", arguing that this mechanism could produce many economic and social benefits and enable sustainable development by providing sound scientific evidence.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
MegaScience (2010). MegaScience [Dataset]. https://huggingface.co/datasets/MegaScience/MegaScience

MegaScience

MegaScience/MegaScience

Explore at:
Dataset updated
Feb 10, 2010
Dataset authored and provided by
MegaScience
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Code: https://github.com/GAIR-NLP/MegaScience Project Page: https://huggingface.co/MegaScience MegaScience is a large-scale mixture of high-quality open-source datasets consisting of 1.25 million instances. We first collect multiple public datasets, then conduct comprehensive ablation studies across different data selection methods to identify the optimal approach for each dataset, thereby… See the full description on the dataset page: https://huggingface.co/datasets/MegaScience/MegaScience.

Search
Clear search
Close search
Google apps
Main menu