100+ datasets found
  1. h

    MAP-CC

    • huggingface.co
    Updated Apr 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Multimodal Art Projection (2024). MAP-CC [Dataset]. https://huggingface.co/datasets/m-a-p/MAP-CC
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 5, 2024
    Dataset authored and provided by
    Multimodal Art Projection
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    MAP-CC

    🌐 Homepage | 🤗 MAP-CC | 🤗 CHC-Bench | 🤗 CT-LLM | 📖 arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.

      Disclaimer
    

    This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad spectrum of… See the full description on the dataset page: https://huggingface.co/datasets/m-a-p/MAP-CC.

  2. e

    Map Viewing Service (WMS) of the data batch: Municipal maps (CC) of the...

    • data.europa.eu
    • gimi9.com
    wms
    Updated Dec 17, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Map Viewing Service (WMS) of the data batch: Municipal maps (CC) of the Corrèze [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-1bfd1693-4c0a-4644-9d1c-d39ee5439d04?locale=en
    Explore at:
    wmsAvailable download formats
    Dataset updated
    Dec 17, 2021
    Description

    This COVADIS data standard concerns communal map documents (CCs). This data standard provides a technical framework describing in detail how to dematerialise these town planning documents in a spatial database that can be used by a GIS tool and interoperable. This standard of data covers both the graphical plans of the sectors and the information overlaying them.This standard of COVADIS data has been developed on the basis of the specifications for the dematerialisation of planning documents created in 2012 by the CNIG, itself based on the consolidated version of the urban planning code dated 16 March 2012. The recommendations of these two documents are consistent even if their purpose is not the same. The COVADIS data standard provides definitions and a structure for organising and storing spatial data from communal maps in an infrastructure, while the CNIG specification serves to frame the digitisation of these data.Part C ‘Data Structure’ presented in this COVADIS standard provides additional recommendations for the storage of data files. These are specific choices for the common data infrastructure of the ministries responsible for agriculture and sustainable development, which do not apply outside their context.

  3. e

    Simple download service (Atom) of the data package: Municipal maps (CC) of...

    • data.europa.eu
    unknown
    Updated Mar 31, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Simple download service (Atom) of the data package: Municipal maps (CC) of the Corrèze [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-21b00b55-f8e8-4248-95d7-a5a2672ceb49?locale=en
    Explore at:
    unknownAvailable download formats
    Dataset updated
    Mar 31, 2021
    Description

    This COVADIS data standard concerns communal map documents (CCs). This data standard provides a technical framework describing in detail how to dematerialise these town planning documents in a spatial database that can be used by a GIS tool and interoperable. This standard of data covers both the graphical plans of the sectors and the information overlaying them.This standard of COVADIS data has been developed on the basis of the specifications for the dematerialisation of planning documents created in 2012 by the CNIG, itself based on the consolidated version of the urban planning code dated 16 March 2012. The recommendations of these two documents are consistent even if their purpose is not the same. The COVADIS data standard provides definitions and a structure for organising and storing spatial data from communal maps in an infrastructure, while the CNIG specification serves to frame the digitisation of these data.Part C ‘Data Structure’ presented in this COVADIS standard provides additional recommendations for the storage of data files. These are specific choices for the common data infrastructure of the ministries responsible for agriculture and sustainable development, which do not apply outside their context.

  4. R

    Maps Dataset

    • universe.roboflow.com
    zip
    Updated Jun 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maps (2024). Maps Dataset [Dataset]. https://universe.roboflow.com/maps-dqkt3/maps-e2jwg
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 28, 2024
    Dataset authored and provided by
    Maps
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Pools Bounding Boxes
    Description

    Maps

    ## Overview
    
    Maps is a dataset for object detection tasks - it contains Pools annotations for 4,613 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  5. e

    Communal (CC) map of blackPlace

    • data.europa.eu
    Updated Mar 27, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2014). Communal (CC) map of blackPlace [Dataset]. https://data.europa.eu/data/datasets/fr-000051404-cc20140327
    Explore at:
    Dataset updated
    Mar 27, 2014
    Description

    Communal map (CC) of blackPlace. This lot informs the right to build. It is digitised in accordance with the national requirements of the CNIG

  6. e

    Elements of linear type, relating to the Communal Maps (CC), of the...

    • data.europa.eu
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elements of linear type, relating to the Communal Maps (CC), of the department of Eure-et-Loir (28) [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-jdd-2d7edd8c-8031-4b58-9646-b29c20a806f4?locale=en
    Explore at:
    Description

    The cladding elements are entries in relation to a regulatory provision (way width, odds, names of neighbouring municipalities.) or geometrical surface, linear or point indicative elements, dressing the graphic documents of the PLU or the POS. They are necessary for the paper edition of the applicable graphic documents. This may be, for example, a hold of a detail plan, a frame, a cartridge, a reminder for a writing, a draw to draw a rating, an equipment identification label

  7. Protein Structures: Pairwise Distance Maps

    • kaggle.com
    zip
    Updated Apr 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Collin Arnett (2020). Protein Structures: Pairwise Distance Maps [Dataset]. https://www.kaggle.com/datasets/collinarnett/protein-maps
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Apr 20, 2020
    Authors
    Collin Arnett
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    https://upload.wikimedia.org/wikipedia/commons/7/79/VEGFR2_bound_to_axitinib.gif" alt="image">

    Introduction

    This dataset is a replication of the dataset described in the paper Generative Modeling for Protein Structures by Namrata Anand and Po-Ssu Huang. The data is used to train a Generative Adversarial Network with the capability of creating protein structures.

    Content

    The data is stored in a hdf5 file and is structured in the following manner:

    {
     "test_16": "16x16 numpy arrays",
     "train_16": "16x16 numpy arrays",
     "test_64": "64x64 numpy arrays",
     "train_64": "64x64 numpy arrays",
     "test_128": "128x128 numpy arrays"
     "train_128": "128x128 numpy arrays"
    }
    

    and contains the following number of numpy arrays:

    test_16: 69,713

    train_16: 1,820,586

    test_64: 11,835

    train_64: 331,006

    test_128: 3,276

    train_128: 98,748

    Quickstart

    Running the following will yeild ```python3 import h5py import matplotlib.pyplot as plt

    dataset = h5py.File('dataset.hdf5', 'r') test_64 = dataset['test_64']

    plt.imshow(test_64[1], cmap='viridis') plt.colorbar() plt.show() ``` https://i.imgur.com/lb2bOzo.png" alt="image">

    Acknowledgements

    @incollection{NIPS2018_7978,
    title = {Generative modeling for protein structures},
    author = {Anand, Namrata and Huang, Possu},
    booktitle = {Advances in Neural Information Processing Systems 31},
    editor = {S. Bengio and H. Wallach and H. Larochelle and K. Grauman and N. Cesa-Bianchi and R. Garnett},
    pages = {7494--7505},
    year = {2018},
    publisher = {Curran Associates, Inc.},
    url = {http://papers.nips.cc/paper/7978-generative-modeling-for-protein-structures.pdf}
    

    https://cdn.rcsb.org/rcsb-pdb/v2/common/images/rcsb_logo.png" alt="image"> H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne. (2000) The Protein Data Bank Nucleic Acids Research, 28: 235-242.

  8. Z

    Augmented emission maps: 1398 cc 64 kW Euro 6 petrol engine

    • data.niaid.nih.gov
    • zenodo.org
    • +1more
    Updated Feb 5, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ruiter, J.M. de (2021). Augmented emission maps: 1398 cc 64 kW Euro 6 petrol engine [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4270371
    Explore at:
    Dataset updated
    Feb 5, 2021
    Dataset provided by
    Indrajuana, A.P.
    Gijlswijk, R.N. van
    Ruiter, J.M. de
    Elstgeest, M.
    Ligterink, N.E.
    Tilanus, P.A.J.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In order to enable the sharing of data the emission data for vehicles is standardized. The data exchange format contains all data that is applicable for a specific engine taxonomy code.

    This specific data set refers to the 1398 cc 64 kW Euro 6 petrol engine that has been applied in the

    The standardized emission map has a “.map.txt” extension and is also human readable. The files starts with metadata which contains information about:

    the engine taxonomy code,

    total driven kilometers over which the data was gathered,

    total time in hours over which the data was gathered,

    the number of vehicles which were tested to create the emission map,

    the DOI (Digital Object Identifier) reference,

    Which emission maps are available in the file.

    The DOI http://doi.org/10.5281/zenodo.4268034 refers to a updated meta-data document that provides the full description of the standardized emission map.

  9. e

    Municipal map (CC) — Lanarvily

    • data.europa.eu
    Updated Aug 28, 2007
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2007). Municipal map (CC) — Lanarvily [Dataset]. https://data.europa.eu/data/datasets/fr-000029100-cc20070828
    Explore at:
    Dataset updated
    Aug 28, 2007
    Description

    Communal map (CC) — Lanarvily. This lot informs the right to build. It is digitised in accordance with the national requirements of the CNIG.

    Approved on: 28 August 2007 Updated by deliberation on: 11 January 2017

  10. e

    Geological overview map 1:300.000 - GÜK300: Geological overview map

    • data.europa.eu
    wms
    Updated Mar 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Landesamt für Geologie und Bergbau, Rheinland-Pfalz (2025). Geological overview map 1:300.000 - GÜK300: Geological overview map [Dataset]. https://data.europa.eu/data/datasets/39847970-3268-601a-d1d2-3fff6f7301b6/embed?locale=en
    Explore at:
    wmsAvailable download formats
    Dataset updated
    Mar 19, 2025
    Dataset authored and provided by
    Landesamt fĂźr Geologie und Bergbau, Rheinland-Pfalz
    Description

    The geological overview map of Rhineland-Palatinate at a scale of 1:300,000 (GUEK300) has been newly compiled on the basis of published geological maps of different scales. It replaces the previous official geological overview map of Rhineland-Palatinate on a scale of 1:500,000. The basis of the new map are the geological overview maps in the scale 1:200,000 (cc5502 Cologne, cc5510 Siegen, cc6302 Trier, cc6310 Frankfurt a.M. West, cc7102 SaarbrĂźcken and cc7110 Mannheim), the geological maps published by the State Office for Geology and Mining Rhineland-Palatinate in the scales 1:100,000, 1:50,000 and 1:25,000 as well as the geological (and volcanic) maps of external processors. Due to scale, stratigraphic formations were combined into larger units and boundary lines were generalized. The spatially very complex disturbance pattern was (strongly generalized and) reduced to the representation of significant disturbances.:Geological survey map of Rhineland-Palatinate 1:300,000 (GUEK 300) The GUEK 300 was recompiled on the basis of published geological maps of different scales. It replaces the previous official geological overview map of Rhineland-Palatinate on a scale of 1: 500 000. The new map is based on the 1st scale geological survey maps issued by the Federal Institute for Geosciences and Natural Resources in Hanover in cooperation with the State Geological Services of the Laender : 200 000 (CC 5502 Koeln, CC 5510 Siegen, CC 6302 Trier, CC 6310 Frankfurt a.M. West, CC 7102 Saarbruecken and CC 7110 Mannheim), the geological maps published by the Rhineland-Palatinate State Office for Geology and Mining in the scales 1 : 100 000, 1 : 50 000 and 1 : 25 000 as well as the geological (and volcanic) maps of external processors. Due to scale, stratigraphic formations were combined into larger units and boundary lines were generalized. The area-by-area very complex shock pattern was (strongly generalized and) reduced to the representation of significant shocks.

  11. s

    Christmas Copper Mine Section C-C'

    • cinergi.sdsc.edu
    pdf
    Updated May 7, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swanson, R. W.; Peterson, Nels Paul (2014). Christmas Copper Mine Section C-C' [Dataset]. http://cinergi.sdsc.edu/geoportal/rest/metadata/item/3473d182dd7a4704bb46d53a1fb71c7e/html
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 7, 2014
    Authors
    Swanson, R. W.; Peterson, Nels Paul
    Area covered
    Description

    ADMMR map collection: Christmas Copper Mine Section C-C'; 1 in. to 200 feet; 24 x 18 in.

  12. D

    OC Community College District

    • detroitdata.org
    • portal.datadrivendetroit.org
    • +5more
    Updated Oct 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oakland County, Michigan (2020). OC Community College District [Dataset]. https://detroitdata.org/dataset/oc-community-college-district1
    Explore at:
    zip, kml, html, geojson, arcgis geoservices rest api, csvAvailable download formats
    Dataset updated
    Oct 14, 2020
    Dataset provided by
    Oakland County, Michigan
    Description
    BY USING THIS WEBSITE OR THE CONTENT THEREIN, YOU AGREE TO THE TERMS OF USE.
    This polygon feature class was initially derived with data from School District, Tax, and Municipal feature classes and historical School Board correspondence. The key attribute is Name (the Community College name). Boundaries beyond the extent of Oakland County may not be exact representation of its true geographic and political location.
  13. e

    Simple download service (Atom) of the dataset: Map zoning plans (CC) of the...

    • data.europa.eu
    unknown
    Updated Mar 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Simple download service (Atom) of the dataset: Map zoning plans (CC) of the department of Eure-et-Loir (28) [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-9786a9ab-8893-47b8-9f3d-cee68e731f20
    Explore at:
    unknownAvailable download formats
    Dataset updated
    Mar 1, 2022
    Description

    The Urban Planning Code defines two types of areas for municipal maps: construction sectors and inconstructible sectors. There are, however, special cases: — Graphic documents may define areas reserved for industrial or craft activities, in particular those incompatible with the neighbourhood of inhabited areas. — They define, where appropriate, the areas in which the reconstruction of a building destroyed by a disaster is not permitted. — Installations necessary for public facilities, agricultural or forestry operations and the development of natural resources are not covered by the principle of inconstructibility resulting from classification. The areas of the communal map do not always cover the entire communal territory. The areas of the municipality not covered by a sector are represented by an object in order to cover the whole municipality.

  14. d

    Mystic Gold, Cross Section CC

    • datadiscoverystudio.org
    pdf
    Updated May 7, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coggin, Mason (2014). Mystic Gold, Cross Section CC [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/6fdc8bd70b26489580d1587ede34abc7/html
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 7, 2014
    Authors
    Coggin, Mason
    Area covered
    Description

    ADMMR map collection: Mystic Gold, Cross Section CC; 1 in. to 20 feet; 24 x 18 in.

  15. Augmented emission maps: 998 cc 50 kW Euro 5b petrol engine

    • data.europa.eu
    • data.niaid.nih.gov
    • +1more
    unknown
    Updated Jul 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenodo (2025). Augmented emission maps: 998 cc 50 kW Euro 5b petrol engine [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-4269986?locale=fi
    Explore at:
    unknown(206593)Available download formats
    Dataset updated
    Jul 3, 2025
    Dataset authored and provided by
    Zenodohttp://zenodo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In order to enable the sharing of data the emission data for vehicles is standardized. The data exchange format contains all data that is applicable for a specific engine taxonomy code. This specific data set refers to the 998 cc 50 kW Euro 5b petrol engine that has been applied in the The standardized emission map has a “.map.txt” extension and is also human readable. The files starts with metadata which contains information about: the engine taxonomy code, total driven kilometers over which the data was gathered, total time in hours over which the data was gathered, the number of vehicles which were tested to create the emission map, the DOI (Digital Object Identifier) reference, Which emission maps are available in the file. The DOI http://doi.org/10.5281/zenodo.4268034 refers to a updated meta-data document that provides the full description of the standardized emission map.

  16. d

    Kay Copper Corporation Kay Mine Section on C-C

    • datadiscoverystudio.org
    pdf
    Updated May 7, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arizona Department of Mines and Mineral Resources (2014). Kay Copper Corporation Kay Mine Section on C-C [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/dd58ba494512489797f1d47eea1e980f/html
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 7, 2014
    Authors
    Arizona Department of Mines and Mineral Resources
    Area covered
    Description

    ADMMR map collection: Kay Copper Corporation Kay Mine Section on C-C; 1 in. to 80 feet; 42 x 25 in.

  17. a

    CC School Locator

    • co-cumberlandgis.opendata.arcgis.com
    Updated Sep 12, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cumberland County, NC (2016). CC School Locator [Dataset]. https://co-cumberlandgis.opendata.arcgis.com/maps/CumberlandGIS::cc-school-locator/about
    Explore at:
    Dataset updated
    Sep 12, 2016
    Dataset authored and provided by
    Cumberland County, NC
    Area covered
    Description

    Web map with schools layer for viewing and locating schools within Cumberland County, NC.

  18. d

    State of Texas Mine Ideal Section Along C-C with Geology

    • datadiscoverystudio.org
    pdf
    Updated May 7, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arizona Department of Mines and Mineral Resources (2014). State of Texas Mine Ideal Section Along C-C with Geology [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/a063a39308dd40c6b57a6e2426d71249/html
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 7, 2014
    Authors
    Arizona Department of Mines and Mineral Resources
    Area covered
    Description

    ADMMR map collection: State of Texas Mine Ideal Section Along C-C with Geology; 1 in. to 40 feet; 4 x 4 in.

  19. e

    Dataset Direct Download Service (WFS): Information (surface objects),...

    • data.europa.eu
    Updated Jan 21, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Dataset Direct Download Service (WFS): Information (surface objects), relating to Communal Maps (CC), Department of Eure-et-Loir (28) [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-e40474cd-e88d-4fad-b5f8-adf7c2a070f2?locale=en
    Explore at:
    Dataset updated
    Jan 21, 2022
    Description

    The information contained in graphic documents of a PLU or POS urban planning document shall be added either for regulatory reasons or for information purposes: — the information which is to be annexed to the planning documents in accordance with Articles R123-13 and R123-14 of the Planning Code, — the information reported on the graphic documents for information purposes.

  20. site-maps.cc Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). site-maps.cc Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/site-maps.cc/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    site-maps.cc is ranked #2160 in GR with 602.02K Traffic. Categories: . Learn more about website traffic, market share, and more!

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Multimodal Art Projection (2024). MAP-CC [Dataset]. https://huggingface.co/datasets/m-a-p/MAP-CC

MAP-CC

m-a-p/MAP-CC

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 5, 2024
Dataset authored and provided by
Multimodal Art Projection
License

Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically

Description

MAP-CC

🌐 Homepage | 🤗 MAP-CC | 🤗 CHC-Bench | 🤗 CT-LLM | 📖 arXiv | GitHub An open-source Chinese pretraining dataset with a scale of 800 billion tokens, offering the NLP community high-quality Chinese pretraining data.

  Disclaimer

This model, developed for academic purposes, employs rigorously compliance-checked training data to uphold the highest standards of integrity and compliance. Despite our efforts, the inherent complexities of data and the broad spectrum of… See the full description on the dataset page: https://huggingface.co/datasets/m-a-p/MAP-CC.

Search
Clear search
Close search
Google apps
Main menu