100+ datasets found
  1. h

    regmix-data

    • huggingface.co
    Updated Jul 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sea AI Lab (2024). regmix-data [Dataset]. https://huggingface.co/datasets/sail/regmix-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 26, 2024
    Dataset authored and provided by
    Sea AI Lab
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    RegMix Data

      Dataset Description
    

    The RegMix Data is a curated dataset derived from the Pile-Uncopyrighted, specifically designed for the RegMix paper (https://huggingface.co/papers/2407.01492). This dataset aims to facilitate the automatic identification of high-performing data mixtures for language model pre-training by formulating it as a regression task.

      Key Features:
    

    Size: Approximately 1TB disk space, 250B tokens Distribution: Follows the natural token… See the full description on the dataset page: https://huggingface.co/datasets/sail/regmix-data.

  2. T

    SAIL FY2022 Hospital Performance - All Facilities

    • data.va.gov
    • datahub.va.gov
    application/rdfxml +5
    Updated Aug 22, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). SAIL FY2022 Hospital Performance - All Facilities [Dataset]. https://www.data.va.gov/dataset/SAIL-FY2022-Hospital-Performance-All-Facilities/rc4s-93qz
    Explore at:
    application/rssxml, csv, tsv, xml, application/rdfxml, jsonAvailable download formats
    Dataset updated
    Aug 22, 2022
    Description

    Strategic Analytics for Improvement and Learning Value Model or SAIL, is a system for summarizing hospital system performance within Veterans Health Administration (VHA). SAIL assesses key Quality measures in areas such as death rate, complications, and patient satisfaction, as well as overall efficiency at individual VA Medical Centers (VAMCs).

  3. H

    Outpatient Database for Wales (OPDW)

    • find.data.gov.scot
    • dtechtive.com
    Updated Oct 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAIL (2023). Outpatient Database for Wales (OPDW) [Dataset]. https://find.data.gov.scot/datasets/25718
    Explore at:
    Dataset updated
    Oct 11, 2023
    Dataset provided by
    SAIL
    Area covered
    Wales
    Description

    Attendance information for all hospital outpatient appointments.

  4. m

    Database on space sails

    • data.mendeley.com
    Updated Nov 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maximilien Berthet (2024). Database on space sails [Dataset]. http://doi.org/10.17632/pr6pk3xmsp.3
    Explore at:
    Dataset updated
    Nov 7, 2024
    Authors
    Maximilien Berthet
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Space sails are a continuum of lightweight, thin, large-area, deployable technologies which are pushing forward new frontiers in space mobility and exploration. They encompass solar sails, laser-driven sails, drag sails, magnetic sails, electric sails, deployable membrane reflectors, deployable membrane antennas, and solar power sails. This database contains values of important parameters from 220 different space sails, which have either flown in space or been proposed as mission concepts. The parameters are: the deployed sail area, the spacecraft's total mass, the total sail loading, the characteristic acceleration, the characteristic thrust, the sail's stowed volume, the sail packing efficiency, and the sail thickness. Assumptions and definitions used for each parameter are provided, along with links to the data sources.

  5. E

    National Community Child Health Database

    • healthinformationportal.eu
    html
    Updated Mar 6, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAIL Databank – https://saildatabank.com/application-process/ (2023). National Community Child Health Database [Dataset]. https://www.healthinformationportal.eu/health-information-sources/national-community-child-health-database
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Mar 6, 2023
    Dataset authored and provided by
    SAIL Databank – https://saildatabank.com/application-process/
    Variables measured
    sex, title, topics, acronym, country, language, data_owners, description, sample_size, age_range_to, and 14 more
    Measurement technique
    Administrative data
    Description

    The Child Health System in Wales; includes birth registration and monitoring of child health examinations and immunisations.

    The Child Health System in Wales; includes birth registration and monitoring of child health examinations and immunisations.

    The dataset brings together data from local Child Health System databases which are held by NHS Trusts and used by them to administer child immunisation and health surveillance programmes.

    The dataset contains all children born, resident or treated in Wales and born after 1987.

  6. I

    India Crude Steel: Production: Public Sector: Steel Authority of India...

    • ceicdata.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com, India Crude Steel: Production: Public Sector: Steel Authority of India Limited (SAIL) [Dataset]. https://www.ceicdata.com/en/india/crude-steel-production-annual/crude-steel-production-public-sector-steel-authority-of-india-limited-sail
    Explore at:
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2007 - Mar 1, 2018
    Area covered
    India
    Variables measured
    Industrial Production
    Description

    Crude Steel: Production: Public Sector: Steel Authority of India Limited (SAIL) data was reported at 15,022.000 Metric Ton th in 2018. This records an increase from the previous number of 14,494.000 Metric Ton th for 2017. Crude Steel: Production: Public Sector: Steel Authority of India Limited (SAIL) data is updated yearly, averaging 13,507.500 Metric Ton th from Mar 2003 (Median) to 2018, with 16 observations. The data reached an all-time high of 15,022.000 Metric Ton th in 2018 and a record low of 11,628.000 Metric Ton th in 2003. Crude Steel: Production: Public Sector: Steel Authority of India Limited (SAIL) data remains active status in CEIC and is reported by Joint Plant Committee. The data is categorized under India Premium Database’s Metal and Steel Sector – Table IN.WAA003: Crude Steel: Production (Annual).

  7. Sail Import Data India – Buyers & Importers List

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Sail Import Data India – Buyers & Importers List [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    India
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  8. d

    Marine in situ data collected from sail training ship Statsraad Lehmkuhl in...

    • catalog.data.gov
    • ncei.noaa.gov
    Updated Jun 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact) (2025). Marine in situ data collected from sail training ship Statsraad Lehmkuhl in the North and South Atlantic Ocean from 2021-11-25 to 2022-02-23 [Dataset]. https://catalog.data.gov/dataset/marine-in-situ-data-collected-from-sail-training-ship-statsraad-lehmkuhl-in-the-north-and-02-23
    Explore at:
    Dataset updated
    Jun 1, 2025
    Dataset provided by
    (Point of Contact)
    Area covered
    Atlantic Ocean
    Description

    In August 2021, the 107-year-old 98-meter-long tall ship Statsraad Lehmkuhl departed Norway to return in April 2023, having sailed 55,000 nautical miles and visited 36 ports worldwide. The main goal is to create attention and share knowledge about the crucial role of the ocean for a sustainable development in a global perspective. This dataset contains various marine observations collected in the Atlantic Ocean. This dataset is U.S. State Department MSR U2021-017 as part of the World Data Services for Oceanography. CTD is in TXT and RSK (RBR CTD) formats, navigation is in CSV, PCO2 data are in TXT format, wave radar data are in Python Pickle File (PKL) format, weather station and Ferrybox (through-flow system) data are in JSON format, echosounder data are in Simrad EK80 (.raw) format, hydrophone sound data are in uncompressed wave format (.wav). The latter two are compressed by gzip.

  9. i

    Pre-processed atmospheric data from the SAIL campaign onboard the Sagres...

    • rdm.inesctec.pt
    Updated Dec 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Pre-processed atmospheric data from the SAIL campaign onboard the Sagres ship - Dataset - CKAN [Dataset]. https://rdm.inesctec.pt/dataset/nis-2023-007
    Explore at:
    Dataset updated
    Dec 15, 2023
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Project SAIL aimed to improve the scientific understanding of the marine boundary layer by means of a unique monitoring campaign on board the iconic Portuguese tall ship NRP Sagres during its 2020 circumnavigation expedition. This dataset comprises the pre-processed atmospheric measurements from the SAIL campaign. It is derived from the raw measurements (https://doi.org/10.25747/b2ff-kg31) by applying preliminary quality-control and pre-processing procedures. The jupyter notebooks documenting the pre-processing of the data are publicly available on Zenodo's Project SAIL community (https://zenodo.org/communities/sail). Detailed information on the pre-processing procedures can be found in the project's data management plan (DOI: https://doi.org/10.5281/zenodo.4286209). This dataset currently includes the resources detailed below. Additional resources will be added as they become available. The ReadMe file provides detailed information about the resources structures.

  10. E

    Data from: Care Home Dataset

    • healthinformationportal.eu
    • www-acc.healthinformationportal.eu
    html
    Updated Mar 6, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAIL Databank – https://saildatabank.com/application-process/ (2023). Care Home Dataset [Dataset]. https://www.healthinformationportal.eu/health-information-sources/care-home-dataset
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Mar 6, 2023
    Dataset authored and provided by
    SAIL Databank – https://saildatabank.com/application-process/
    Variables measured
    sex, title, topics, acronym, country, language, data_owners, description, sample_size, age_range_to, and 15 more
    Measurement technique
    Data from other records
    Description

    This database contains residential and geographical information data about care homes in Wales.

  11. d

    Oceanographic, meteorological and physical data collected from Saildrone...

    • catalog.data.gov
    Updated Jul 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact) (2025). Oceanographic, meteorological and physical data collected from Saildrone 1043, 1046, and 1049, in the eastern Bering Sea and northern Pacific Ocean from 2020-06-23 to 2020-08-24 (NCEI Accession 0234333) [Dataset]. https://catalog.data.gov/dataset/oceanographic-meteorological-and-physical-data-collected-from-saildrone-1043-1046-and-1049-in-t1
    Explore at:
    Dataset updated
    Jul 1, 2025
    Dataset provided by
    (Point of Contact)
    Area covered
    Bering Sea, Pacific Ocean
    Description

    This dataset contains near-surface measurements of oceanographic, meteorological and physical data collected in situ during a survey of the eastern Bering Sea shelf conducted by three autonomous surface vehicles (USVs, Saildrones (SD) 1043, 1046, and 1049). The saildrones were used to conduct an acoustic survey of walleye pollock (Gadus chalcogrammus) in the US economic exclusive zone in summer 2020. This survey is traditionally conducted with crewed research vessels, but was conducted with USVs asin response to the cancellation of the ship-based surveys due to safety concerns associated with COVID-19 pandemic. The USV survey was conducted on 14 transects spaced 74 km apart spanning the ~80 m to ~1000m depth contour, with SD 1046 sampling in the south, SD 1046 in the center, and SD 1049 in the north portion of the survey area. All available data are included, which encompass the survey and a portion of the transit to the survey area. The saildrones were equipped with a variety of sensors and instruments consisting of thermosalinograph, echo sounder, oxygen optode, fluorometer, SST IR pyrometer, anemometer, meteorological probe, digital and barometer. The oceanographic measurements include skin temperature, salinity, water temperature, water skin temperature, chlorophyll-a, and dissolved oxygen. The atmospheric measurements consist of wind speed and direction, air temperature, relative humidity and air pressure. All the data are in netCDF-CF (underway) format. These data are experimental and have not been quality controlled. These data are made available at the user’s own risk. Users will need to do quality control when using these data. The data from the echosounder will be separately archived at NCEI’s water column sonar data archive.

  12. H

    National Community Child Health Database (NCCHD)

    • dtechtive.com
    • find.data.gov.scot
    • +1more
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAIL (2023). National Community Child Health Database (NCCHD) [Dataset]. https://dtechtive.com/datasets/25673
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    SAIL
    Area covered
    United Kingdom, Wales
    Description

    The Child Health System in Wales; includes birth registration and monitoring of child health examinations and immunisations.

  13. i

    Raw data collected onboard the Sagres ship during the SAIL project campaign...

    • rdm.inesctec.pt
    Updated Jul 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Raw data collected onboard the Sagres ship during the SAIL project campaign - Dataset - CKAN [Dataset]. https://rdm.inesctec.pt/dataset/nis-2021-003
    Explore at:
    Dataset updated
    Jul 6, 2021
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Project SAIL aimed to improve the scientific understanding of the marine boundary layer by means of a unique monitoring campaign on board the iconic Portuguese tall ship NRP Sagres during its 2020 circumnavigation expedition. The campaign focused on the measurement of the atmospheric electric field over the ocean, and on the study of space-driven interactions via the detailed monitoring of gamma, solar and cosmic radiation as well as GNSS signals and atmospheric ionization. The atmospheric measurements are complemented by the collection of fish samples and by underwater monitoring of the ocean state (temperature, conductivity, dissolved oxygen, pH, spectral radiance), providing unique data for the detailed study of ocean-atmosphere fluxes and surface-atmosphere interactions. This dataset comprises the raw atmospheric measurements from the SAIL campaign, including the ship data collected onboard (denoted by the infix SHIP), the sensor data, obtained after correction of logging errors (denoted by the infix SD) and the geosensor data corresponding to georeferenced datafiles (denoted by the infix GD). Further information can be found in the project's data management plan (DOI: https://doi.org/10.5281/zenodo.4286209). This dataset currently includes the resources detailed below. Additional resources will be added as they become available. ATMOSPHERIC ELECTRIC FIELD The resource SAIL_SHIP_E1.tar.gz contains the files SAIL_SHIP_E1_yyyymmdd.tgz, each including the hourly files E1_yyyymmdd_HH.txt (where yyyy is the year, mm the month, dd the day, and HH the hour). The files E1_yyyymmdd_HH.txt have the following structure: col 1: timestamp (seconds.microseconds) col 2: date (mm/dd/yyyy) col 3: time (HH:MM:SS) col 4: voltage (power) (V) col 5: voltage (internal) (V) col 6: Panel temperature (deg C) col 7: Electric field (V/m) col 8: Leakage current (nA) col 9: CS110 status (numeric code) col 10: Internal RH (%) col 11: shortwave incoming radiation (W/m2) col 12: shortwave outgoing radiation (W/m2) The resource SAIL_SHIP_E2.tar.gz contains the files SAIL_SHIP_E2_yyyymmdd.tgz, each including the hourly files E2_yyyymmdd_HH.txt (where yyyy is the year, mm the month, dd the day, and HH the hour). The files E2_yyyymmdd_HH.txt have the following structure: col 1: timestamp (seconds.microseconds) col 2: date (mm/dd/yyyy) col 3: time (HH:MM:SS) col 4: voltage (power) (V) col 5: voltage (internal) (V) col 6: Panel temperature (deg C) col 7: Electric field (V/m) col 8: Leakage current (nA) col 9: CS110 status (numeric code) col 10: Internal RH (%) The resource SAIL_SD_E1.tar.gz contains the files SAIL_SD_E1_yyyymmdd.tgz, each including the hourly files E1_yyyymmdd_HH.txt (where yyyy is the year, mm the month, dd the day, and HH the hour), with the following structure: col 1: timestamp (seconds.microseconds) col 2: Electric field (V/m) col 3: Leakage current (nA) col 4: CS110 status (numeric code) col 5: Internal RH (%) The resource SAIL_SD_E2.tar.gz contains the files SAIL_SD_E2_yyyymmdd.tgz, each including the hourly files E2_yyyymmdd_HH.txt (where yyyy is the year, mm the month, dd the day, and HH the hour), with the following structure: col 1: timestamp (seconds.microseconds) col 2: Electric field (V/m) col 3: Leakage current (nA) col 4: CS110 status (numeric code) col 5: Internal RH (%) The resource SAIL_GD_E1.tar.gz contains the files SAIL_GD_E1_yyyymmdd.tgz, each including the hourly files E1_yyyymmdd_HH.txt (where yyyy is the year, mm the month, dd the day, and HH the hour), with the following structure: col 1: timestamp (seconds.microseconds) col 2: Electric field (V/m) col 3: Leakage current (nA)

  14. o

    The Sail Cross Street Data in East Islip, NY

    • ownerly.com
    Updated Mar 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ownerly (2022). The Sail Cross Street Data in East Islip, NY [Dataset]. https://www.ownerly.com/ny/east-islip/the-sail-home-details
    Explore at:
    Dataset updated
    Mar 9, 2022
    Dataset authored and provided by
    Ownerly
    Area covered
    New York, Islip, The Sail, East Islip
    Description

    This dataset provides information about the number of properties, residents, and average property values for The Sail cross streets in East Islip, NY.

  15. I

    India Crude Steel: Installed Capacity: Public Sector: Steel Authority of...

    • ceicdata.com
    Updated Mar 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2025). India Crude Steel: Installed Capacity: Public Sector: Steel Authority of India Limited (SAIL) [Dataset]. https://www.ceicdata.com/en/india/crude-steel-production-capacity/crude-steel-installed-capacity-public-sector-steel-authority-of-india-limited-sail
    Explore at:
    Dataset updated
    Mar 26, 2025
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2007 - Mar 1, 2018
    Area covered
    India
    Variables measured
    Industrial Production
    Description

    Crude Steel: Installed Capacity: Public Sector: Steel Authority of India Limited (SAIL) data was reported at 17,519.000 Metric Ton th in 2018. This stayed constant from the previous number of 17,519.000 Metric Ton th for 2017. Crude Steel: Installed Capacity: Public Sector: Steel Authority of India Limited (SAIL) data is updated yearly, averaging 12,859.000 Metric Ton th from Mar 2003 (Median) to 2018, with 16 observations. The data reached an all-time high of 17,519.000 Metric Ton th in 2018 and a record low of 12,696.000 Metric Ton th in 2004. Crude Steel: Installed Capacity: Public Sector: Steel Authority of India Limited (SAIL) data remains active status in CEIC and is reported by Joint Plant Committee. The data is categorized under India Premium Database’s Metal and Steel Sector – Table IN.WAA005: Crude Steel: Production: Capacity.

  16. Global exporters importers-export import data of Sail bearing

    • volza.com
    csv
    Updated Sep 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza FZ LLC (2025). Global exporters importers-export import data of Sail bearing [Dataset]. https://www.volza.com/trade-data-global/global-exporters-importers-export-import-data-of-sail+bearing
    Explore at:
    csvAvailable download formats
    Dataset updated
    Sep 7, 2025
    Dataset provided by
    Volza
    Authors
    Volza FZ LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Count of exporters, Count of importers, Count of shipments, Sum of export import value
    Description

    1922 Global exporters importers export import shipment records of Sail bearing with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.

  17. Full sail marine group llc Import Company US

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Full sail marine group llc Import Company US [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  18. E

    Welsh Longitudinal General Practice Dataset

    • healthinformationportal.eu
    html
    Updated Apr 27, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAIL Databank – https://saildatabank.com/application-process/ (2023). Welsh Longitudinal General Practice Dataset [Dataset]. https://www.healthinformationportal.eu/health-information-sources/welsh-longitudinal-general-practice-dataset
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Apr 27, 2023
    Dataset authored and provided by
    SAIL Databank – https://saildatabank.com/application-process/
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Variables measured
    sex, title, topics, acronym, country, language, data_owners, description, sample_size, age_range_to, and 17 more
    Measurement technique
    Data from other records
    Description

    Attendance and clinical information for all general practice interactions: includes patients symptoms, investigations, diagnoses, prescribed medication and referrals to tertiary care.

    This dataset covers 83% of the population of Wales and 80% of GP practices in Wales. It is linkable with anonymised fields for individuals and GPs to other datasets, including bespoke project specific cohorts. Each GP practice uses a clinical information system to maintain an electronic health record for each of their patients; capturing the signs, symptoms, test results, diagnoses, prescribed treatment, referrals for specialist treatment and social aspects relating to the patients home environment.

    The majority of the data is entered by the clinician during the patient consultation. Test results are electronically transferred from secondary care systems.

    There are no standard rules for recording data within primary care clinical information systems. Therefore, each individual clinician can record information in their own way. The majority use Read Code Terminology, however, sometimes this is applied behind the scenes by the clinical system and sometimes local codes are used. Read codes are not as precise as ICD 10 or OPCS codes.

    Coding standards have been agreed on for conditions monitored by the QOF (Quality Outcomes Framework) returns. Since the implementation of QOF these conditions have been coded in a more consistent way.

    Time coverage varies between each practice.

  19. United States Imports: cif: Indian Mackrls, Marlins, Sail, Spearfish, etc,...

    • ceicdata.com
    Updated Feb 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2022). United States Imports: cif: Indian Mackrls, Marlins, Sail, Spearfish, etc, Fr, Ch [Dataset]. https://www.ceicdata.com/en/united-states/imports-by-commodity-6-digit-hs-code-hs-1-to-15/imports-cif-indian-mackrls-marlins-sail-spearfish-etc-fr-ch
    Explore at:
    Dataset updated
    Feb 6, 2022
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Feb 1, 2024 - Jan 1, 2025
    Area covered
    United States
    Description

    United States Imports: cif: Indian Mackrls, Marlins, Sail, Spearfish, etc, Fr, Ch data was reported at 0.037 USD mn in Jan 2025. This records a decrease from the previous number of 0.093 USD mn for Dec 2024. United States Imports: cif: Indian Mackrls, Marlins, Sail, Spearfish, etc, Fr, Ch data is updated monthly, averaging 0.039 USD mn from Apr 2017 (Median) to Jan 2025, with 94 observations. The data reached an all-time high of 0.541 USD mn in Dec 2018 and a record low of 0.003 USD mn in Sep 2023. United States Imports: cif: Indian Mackrls, Marlins, Sail, Spearfish, etc, Fr, Ch data remains active status in CEIC and is reported by U.S. Census Bureau. The data is categorized under Global Database’s United States – Table US.JA130: Imports: by Commodity: 6 Digit HS Code: HS 1 to 15.

  20. Global export data of Sail Fish

    • volza.com
    csv
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza FZ LLC (2025). Global export data of Sail Fish [Dataset]. https://www.volza.com/p/sail-fish/export/export-from-india/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jun 30, 2025
    Dataset provided by
    Volza
    Authors
    Volza FZ LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Count of exporters, Sum of export value, 2014-01-01/2021-09-30, Count of export shipments
    Description

    1882 Global export shipment records of Sail Fish with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sea AI Lab (2024). regmix-data [Dataset]. https://huggingface.co/datasets/sail/regmix-data

regmix-data

regmix-data

sail/regmix-data

Explore at:
60 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 26, 2024
Dataset authored and provided by
Sea AI Lab
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

RegMix Data

  Dataset Description

The RegMix Data is a curated dataset derived from the Pile-Uncopyrighted, specifically designed for the RegMix paper (https://huggingface.co/papers/2407.01492). This dataset aims to facilitate the automatic identification of high-performing data mixtures for language model pre-training by formulating it as a regression task.

  Key Features:

Size: Approximately 1TB disk space, 250B tokens Distribution: Follows the natural token… See the full description on the dataset page: https://huggingface.co/datasets/sail/regmix-data.

Search
Clear search
Close search
Google apps
Main menu