5 datasets found

MegaScience
huggingface.co
Updated Feb 10, 2010
Cite
MegaScience (2010). MegaScience [Dataset]. https://huggingface.co/datasets/MegaScience/MegaScience
Dataset authored and provided by
MegaScience
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0), https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Code: https://github.com/GAIR-NLP/MegaScience
Project Page: https://huggingface.co/MegaScience

MegaScience is a large-scale mixture of high-quality open-source datasets consisting of 1.25 million instances. We first collect multiple public datasets, then conduct comprehensive ablation studies across different data selection methods to identify the optimal approach for each dataset, thereby… See the full description on the dataset page: https://huggingface.co/datasets/MegaScience/MegaScience.
TextbookReasoning
huggingface.co
Updated Feb 10, 2010
Cite
MegaScience (2010). TextbookReasoning [Dataset]. https://huggingface.co/datasets/MegaScience/TextbookReasoning
Dataset authored and provided by
MegaScience
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0), https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Dataset Description

Scientific reasoning is critical for developing AI scientists and supporting human researchers in advancing the frontiers of natural science discovery. However, the open-source community has primarily focused on mathematics and coding while neglecting the scientific domain, largely due to the absence of open, large-scale, high-quality, verifiable scientific reasoning… See the full description on the dataset page: https://huggingface.co/datasets/MegaScience/TextbookReasoning.
MegaScience-Qwen3-Tokenized
huggingface.co
Updated Jul 30, 2025
Cite
Seba (2025). MegaScience-Qwen3-Tokenized [Dataset]. https://huggingface.co/datasets/seba/MegaScience-Qwen3-Tokenized
Authors
Seba
Description
seba/MegaScience-Qwen3-Tokenized dataset hosted on Hugging Face and contributed by the HF Datasets community
Mega Science Co Ltd Company profile with phone,email, buyers, suppliers,...
volza.com
csv
Updated Jul 16, 2025
Cite
Volza FZ LLC (2025). Mega Science Co Ltd Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/mega-science-co-ltd-19656267/
Available download formats: csv
Dataset provided by
Authors
Volza FZ LLC
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2014 - Sep 30, 2021
Variables measured
Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
Description
The credit report for Mega Science Co Ltd contains detailed export and import market intelligence, including its phone, email, and LinkedIn contact details, along with the details of each import and export shipment: product, quantity, price, buyer and supplier names, country, and date of shipment.
GBIF - Global Biodiversity Information Facility, free and open access to...
pigma.org
Updated Nov 10, 2023
Cite
(2023). GBIF - Global Biodiversity Information Facility, free and open access to biodiversity data [Dataset]. https://www.pigma.org/geonetwork/srv/search?orgName=GBIF%20Secretariat
Description
GBIF, the Global Biodiversity Information Facility, is an international network and data infrastructure funded by the world's governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth. Coordinated through its Secretariat in Copenhagen, the GBIF network of participating countries and organizations, working through participant nodes, provides data-holding institutions around the world with common standards and open-source tools that enable them to share information about where and when species have been recorded.

This knowledge derives from many sources, including everything from museum specimens collected in the 18th and 19th centuries to geotagged smartphone photos shared by amateur naturalists in recent days and weeks. The GBIF network draws all these sources together through the use of data standards, such as Darwin Core, which forms the basis for the bulk of GBIF.org's index of hundreds of millions of species occurrence records. Publishers provide open access to their datasets using machine-readable Creative Commons licence designations, allowing scientists, researchers and others to apply the data in hundreds of peer-reviewed publications and policy papers each year. Many of these analyses, which cover topics from the impacts of climate change and the spread of invasive and alien pests to priorities for conservation and protected areas, food security and human health, would not be possible without this.

GBIF arose from a 1999 recommendation by the Biodiversity Informatics Subgroup of the Organization for Economic Cooperation and Development's Megascience Forum. This report concluded that "An international mechanism is needed to make biodiversity data and information accessible worldwide", arguing that this mechanism could produce many economic and social benefits and enable sustainable development by providing sound scientific evidence.