100+ datasets found
  1. S

    Galaxy, star, quasar dataset

    • scidb.cn
    Updated Feb 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li Xin (2023). Galaxy, star, quasar dataset [Dataset]. http://doi.org/10.57760/sciencedb.07177
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 3, 2023
    Dataset provided by
    Science Data Bank
    Authors
    Li Xin
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The data used in this paper is from the 16th issue of SDSS. SDSS-DR16 contains a total of 930,268 photometric images, with 1.2 billion observation sources and tens of millions of spectra. The data obtained in this paper is downloaded from the official website of SDSS. Specifically, the data is obtained through the SkyServerAPI structure by using SQL query statements in the subwebsite CasJobs. As the current SDSS photometric table PhotoObj can only classify all observed sources as point sources and surface sources, the target sources can be better classified as galaxies, stars and quasars through spectra. Therefore, we obtain calibrated sources in CasJobs by crossing SpecPhoto with the PhotoObj star list, and obtain target position information (right ascension and declination). Calibrated sources can tell them apart precisely and quickly. Each calibrated source is labeled with the parameter "Class" as "galaxy", "star", or "quasar". In this paper, observation day area 3462, 3478, 3530 and other 4 areas in SDSS-DR16 are selected as experimental data, because a large number of sources can be obtained in these areas to provide rich sample data for the experiment. For example, there are 9891 sources in the 3462-day area, including 2790 galactic sources, 2378 stellar sources and 4723 quasar sources. There are 3862 sources in the 3478 day area, including 1759 galactic sources, 577 stellar sources and 1526 quasar sources. FITS files are a commonly used data format in the astronomical community. By cross-matching the star list and FITS files in the local celestial region, we obtained images of 5 bands of u, g, r, i and z of 12499 galaxy sources, 16914 quasar sources and 16908 star sources as training and testing data.1.1 Image SynthesisSDSS photometric data includes photometric images of five bands u, g, r, i and z, and these photometric image data are respectively packaged in single-band format in FITS files. Images of different bands contain different information. Since the three bands g, r and i contain more feature information and less noise, Astronomical researchers typically use the g, r, and i bands corresponding to the R, G, and B channels of the image to synthesize photometric images. Generally, different bands cannot be directly synthesized. If three bands are directly synthesized, the image of different bands may not be aligned. Therefore, this paper adopts the RGB multi-band image synthesis software written by He Zhendong et al. to synthesize images in g, r and i bands. This method effectively avoids the problem that images in different bands cannot be aligned. The pixel of each photometry image in this paper is 2048×1489.1.2 Data tailoringThis paper first clipped the target image, image clipping can use image segmentation tools to solve this problem, this paper uses Python to achieve this process. In the process of clipping, we convert the right ascension and declination of the source in the star list into pixel coordinates on the photometric image through the coordinate conversion formula, and determine the specific position of the source through the pixel coordinates. The coordinates are regarded as the center point and clipping is carried out in the form of a rectangular box. We found that the input image size affects the experimental results. Therefore, according to the target size of the source, we selected three different cutting sizes, 40×40, 60×60 and 80×80 respectively. Through experiment and analysis, we find that convolutional neural network has better learning ability and higher accuracy for data with small image size. In the end, we chose to divide the surface source galaxies, point source quasars, and stars into 40×40 sizes.1.3 Division of training and test dataIn order to make the algorithm have more accurate recognition performance, we need enough image samples. The selection of training set, verification set and test set is an important factor affecting the final recognition accuracy. In this paper, the training set, verification set and test set are set according to the ratio of 8:1:1. The purpose of verification set is used to revise the algorithm, and the purpose of test set is used to evaluate the generalization ability of the final algorithm. Table 1 shows the specific data partitioning information. The total sample size is 34,000 source images, including 11543 galaxy sources, 11967 star sources, and 10490 quasar sources.1.4 Data preprocessingIn this experiment, the training set and test set can be used as the training and test input of the algorithm after data preprocessing. The data quantity and quality largely determine the recognition performance of the algorithm. The pre-processing of the training set and the test set are different. In the training set, we first perform vertical flip, horizontal flip and scale on the cropped image to enrich the data samples and enhance the generalization ability of the algorithm. Since the features in the celestial object source have the flip invariability, the labels of galaxies, stars and quasars will not change after rotation. In the test set, our preprocessing process is relatively simple compared with the training set. We carry out simple scaling processing on the input image and test input the obtained image.

  2. rStar-Coder

    • huggingface.co
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Microsoft (2025). rStar-Coder [Dataset]. https://huggingface.co/datasets/microsoft/rStar-Coder
    Explore at:
    Dataset updated
    Jul 15, 2025
    Dataset authored and provided by
    Microsofthttp://microsoft.com/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    rStar-Coder Dataset

    Project GitHub | Paper

      Dataset Description
    

    rStar-Coder is a large-scale competitive code problem dataset containing 418K programming problems, 580K long-reasoning solutions, and rich test cases of varying difficulty levels. This dataset aims to enhance code reasoning capabilities in large language models, particularly in handling competitive code problems. Experiments on Qwen models (1.5B-14B) across various code reasoning benchmarks demonstrate… See the full description on the dataset page: https://huggingface.co/datasets/microsoft/rStar-Coder.

  3. Star dataset to predict star types

    • kaggle.com
    Updated Oct 21, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepraj Baidya (2019). Star dataset to predict star types [Dataset]. https://www.kaggle.com/deepu1109/star-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 21, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Deepraj Baidya
    Description

    Dataset Info:

    This is a dataset consisting of several features of stars.

    Some of them are: - Absolute Temperature (in K) - Relative Luminosity (L/Lo) - Relative Radius (R/Ro) - Absolute Magnitude (Mv) - Star Color (white,Red,Blue,Yellow,yellow-orange etc) - Spectral Class (O,B,A,F,G,K,,M) - Star Type **(Red Dwarf, Brown Dwarf, White Dwarf, Main Sequence , SuperGiants, HyperGiants)**

    Lo = 3.828 x 10^26 Watts (Avg Luminosity of Sun) Ro = 6.9551 x 10^8 m (Avg Radius of Sun)

    Purpose:

    The purpose of making the dataset is to prove that the stars follows a certain graph in the celestial Space , specifically called Hertzsprung-Russell Diagram or simply HR-Diagram so that we can classify stars by plotting its features based on that graph.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2F14338bbebf77d18e1faef582bccdbdd6%2Fhr.jpg?generation=1597349509841965&alt=media" alt="hr-1"> https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2F9fc57334a9b9fafbc71aacdd6e5cd69c%2F310px-Hertzsprung-Russel_StarData.png?generation=1597349661801284&alt=media" alt="hr-2">

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2Ffe9436bf4e2d23b5b18fb3db1f1fcbcb%2FHRDiagram.png?generation=1597348809674507&alt=media" alt="hr-3">

    Data Collection and Preparation techniques:

    The dataset is created based on several equations in astrophysics. They are given below:

    1. Stefan-Boltzmann's law of Black body radiation (To find the luminosity of a star)
    2. Wienn's Displacement law (for finding surface temperature of a star using wavelength)
    3. Absolute magnitude relation
    4. Radius of a star using parallax .

    The dataset took 3 weeks to collect for 240 stars which are mostly collected from web. The missing data were manually calculated using those equations of astrophysics given above.

    Acknowledgements:

    • Wikipedia
    • Stars and Galaxies by SEEDS | Backman

    Repo Link:

    AstroML

  4. p

    Trends in Average Expenditure per Student (1995-2021): Union Star R-II...

    • publicschoolreview.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review, Trends in Average Expenditure per Student (1995-2021): Union Star R-II School District [Dataset]. https://www.publicschoolreview.com/missouri/union-star-r-ii-school-district/2930600-school-district
    Explore at:
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Union Star R-II School District
    Description

    This dataset tracks annual average expenditure per student from 1995 to 2021 for Union Star R-II School District

  5. A

    ‘Star dataset to predict star types’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Aug 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2020). ‘Star dataset to predict star types’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-star-dataset-to-predict-star-types-22fb/1b295ecf/?iid=004-260&v=presentation
    Explore at:
    Dataset updated
    Aug 14, 2020
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Star dataset to predict star types’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/deepu1109/star-dataset on 28 January 2022.

    --- Dataset description provided by original source is as follows ---

    Dataset Info:

    This is a dataset consisting of several features of stars.

    Some of them are: - Absolute Temperature (in K) - Relative Luminosity (L/Lo) - Relative Radius (R/Ro) - Absolute Magnitude (Mv) - Star Color (white,Red,Blue,Yellow,yellow-orange etc) - Spectral Class (O,B,A,F,G,K,,M) - Star Type **(Red Dwarf, Brown Dwarf, White Dwarf, Main Sequence , SuperGiants, HyperGiants)**

    Lo = 3.828 x 10^26 Watts (Avg Luminosity of Sun) Ro = 6.9551 x 10^8 m (Avg Radius of Sun)

    Purpose:

    The purpose of making the dataset is to prove that the stars follows a certain graph in the celestial Space , specifically called Hertzsprung-Russell Diagram or simply HR-Diagram so that we can classify stars by plotting its features based on that graph.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2F14338bbebf77d18e1faef582bccdbdd6%2Fhr.jpg?generation=1597349509841965&alt=media" alt="hr-1"> https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2F9fc57334a9b9fafbc71aacdd6e5cd69c%2F310px-Hertzsprung-Russel_StarData.png?generation=1597349661801284&alt=media" alt="hr-2">

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3791628%2Ffe9436bf4e2d23b5b18fb3db1f1fcbcb%2FHRDiagram.png?generation=1597348809674507&alt=media" alt="hr-3">

    Data Collection and Preparation techniques:

    The dataset is created based on several equations in astrophysics. They are given below:

    1. Stefan-Boltzmann's law of Black body radiation (To find the luminosity of a star)
    2. Wienn's Displacement law (for finding surface temperature of a star using wavelength)
    3. Absolute magnitude relation
    4. Radius of a star using parallax .

    The dataset took 3 weeks to collect for 240 stars which are mostly collected from web. The missing data were manually calculated using those equations of astrophysics given above.

    Acknowledgements:

    • Wikipedia
    • Stars and Galaxies by SEEDS | Backman

    Repo Link:

    AstroML

    --- Original source retains full ownership of the source dataset ---

  6. h

    usaco_2025

    • huggingface.co
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    rStar.msra (2025). usaco_2025 [Dataset]. https://huggingface.co/datasets/rStar-Reasoning/usaco_2025
    Explore at:
    Dataset updated
    May 29, 2025
    Dataset authored and provided by
    rStar.msra
    Description

    USACO 2025 Open Contest Dataset

      Dataset Description
    

    The USA Computing Olympiad (USACO) is a prestigious algorithmic programming competition for high school students in the United States, consisting of four difficulty levels: Bronze, Silver, Gold, and Platinum. Each level contains a set of challenging problems that test algorithmic thinking and implementation skills, making USACO a valuable benchmark for evaluating the reasoning and problem-solving capabilities of large… See the full description on the dataset page: https://huggingface.co/datasets/rStar-Reasoning/usaco_2025.

  7. STUDY OF THE POPULATION DISTRIBUTION OF THE 7

    • esdcdoi.esac.esa.int
    Updated Apr 8, 1999
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Space Agency (1999). STUDY OF THE POPULATION DISTRIBUTION OF THE 7 [Dataset]. http://doi.org/10.5270/esa-w6ixaa1
    Explore at:
    https://www.iana.org/assignments/media-types/application/fitsAvailable download formats
    Dataset updated
    Apr 8, 1999
    Dataset authored and provided by
    European Space Agencyhttp://www.esa.int/
    Time period covered
    Apr 9, 1996 - Mar 2, 1997
    Description

    we propose to investigate the newly discovered 7.15 micron oxygenrich (or) circumstellar emission feature by observing its profile and distribution among approximately 50 oxygenrich stars distributed throughout the sky. we expect this study to place constraints on the abundances, stellar evolution and chemistry of dust formation in these stars, and the importance of the emitting material as a possible inter stellar dust component. the iras autoclass ii study of cheeseman et al. 1989 provides a guide from which we select a set of stars representing the entire population of or stars. one star is selected from each of the 32 or subclasses expected to have the feature present. based upon the work of goebel et al.(1994 ap.j. july 15,1994), these objects have a high probability of displaying the 7.15 um feature. stars from each r^ant subclass need to be observed with the sws throughout their entire spectral range so that all spectral features may be accurately profiled, defined, and correlated with the 7.15 um feature. high reso lution scans are required to determine the influence of molecular bands on the features at 7.15, 10, 11.3, 13.1, 18, and 19.7 um. shape and relative strength of the features should enable identification of the responsible dust component. once high resolution scans of a rep resentative sample of or stars have been obtained, the low resolution spectra of survey investigators can be used to supplement our statis tics, but not to determine the confusing influences of molecular bands, nor meet the goals of this proposal. we anticipate reciprocal sharing of the proposed aot sws013 high resolution data with other groups that have aot sws011 low resolution data of or objects, notably price, tsuji, de jong, heske, waters, and barlow. the 7.15 um feature is in a spectral region blocked completely by the earths atmospheric water vapor. iso is the only platform with sufficient sensitivity, sky cover truncat [truncated!, Please see actual data for full text]

  8. p

    Trends in Graduation Rate (2012-2022): Union Star High School vs. Missouri...

    • publicschoolreview.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review, Trends in Graduation Rate (2012-2022): Union Star High School vs. Missouri vs. Union Star R-II School District [Dataset]. https://www.publicschoolreview.com/union-star-high-school-profile
    Explore at:
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Union Star R-II School District, Missouri
    Description

    This dataset tracks annual graduation rate from 2012 to 2022 for Union Star High School vs. Missouri and Union Star R-II School District

  9. z

    Dataset for 'Discovery of a pair of very metal-poor stars enriched in...

    • zenodo.org
    csv
    Updated Mar 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    André Rodrigo da Silva; André Rodrigo da Silva; Rodolfo Smiljanic; Rodolfo Smiljanic (2025). Dataset for 'Discovery of a pair of very metal-poor stars enriched in neutron-capture elements: The proto-disk r-II star BPS CS 29529-0089 and the Gaia-Sausage-Enceladus r-I star TYC 9219-2422-1' [Dataset]. http://doi.org/10.5281/zenodo.14982574
    Explore at:
    csvAvailable download formats
    Dataset updated
    Mar 6, 2025
    Dataset provided by
    Astronomy and Astrophysics
    Authors
    André Rodrigo da Silva; André Rodrigo da Silva; Rodolfo Smiljanic; Rodolfo Smiljanic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data containing the species, log gf, and abundances for BPS CS 29529-0089, TYC 9219-2422-1 and HD 122563 in aa53295-24.

  10. e

    Eximpedia Export Import Trade

    • eximpedia.app
    Updated Apr 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim (2025). Eximpedia Export Import Trade [Dataset]. https://www.eximpedia.app/
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset updated
    Apr 6, 2025
    Dataset provided by
    Eximpedia Export Import Trade Data
    Authors
    Seair Exim
    Area covered
    Gibraltar, Montserrat, Congo, Cook Islands, Denmark, Burundi, Seychelles, Virgin Islands (British), Cambodia, Spain
    Description

    R Star Company Limited Company Export Import Records. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

  11. h

    rStar-Coder-seed-test

    • huggingface.co
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Jul 30, 2025
    Authors
    chiruan
    Description

    chiruan/rStar-Coder-seed-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. Data Files and Code Associated with "Brown Dwarfs are Violet"

    • zenodo.org
    • data.niaid.nih.gov
    bin, png, txt
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steven R. Cranmer; Steven R. Cranmer (2024). Data Files and Code Associated with "Brown Dwarfs are Violet" [Dataset]. http://doi.org/10.5281/zenodo.5293307
    Explore at:
    bin, png, txtAvailable download formats
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Steven R. Cranmer; Steven R. Cranmer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The accompanying files provide some IDL code and "data behind the figures" for the paper titled "Brown Dwarfs are Violet" (by S. R. Cranmer), which has been submitted to Research Notes of the AAS.

    This paper presents a collection of objective (CIE x,y coordinate) and subjective (RGB triple) colors for main-sequence stars and brown dwarfs, as they may be perceived by human eyes without the reddening effects of the Earth's atmosphere. However, the algorithm described in the paper for computing RGB triples ought to be considered as only a preliminary first step; i.e., it needs to be tested by comparing the results to other more established ways of converting astronomical spectra to perceived colors.

  13. p

    Trends in Total Revenue (1990-2021): Union Star R-II School District

    • publicschoolreview.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review, Trends in Total Revenue (1990-2021): Union Star R-II School District [Dataset]. https://www.publicschoolreview.com/missouri/union-star-r-ii-school-district/2930600-school-district
    Explore at:
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Union Star R-II School District
    Description

    This dataset tracks annual total revenue from 1990 to 2021 for Union Star R-II School District

  14. p

    Distribution of Students Across Grade Levels in Union Star R-II School...

    • publicschoolreview.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public School Review, Distribution of Students Across Grade Levels in Union Star R-II School District and Average Distribution Per School District in Missouri [Dataset]. https://www.publicschoolreview.com/missouri/union-star-r-ii-school-district/2930600-school-district
    Explore at:
    Dataset authored and provided by
    Public School Review
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Union Star R-II School District, Missouri, Union Star
    Description

    This dataset tracks annual distribution of students across grade levels in Union Star R-II School District and average distribution per school district in Missouri

  15. Energy Star HVAC-R units shipped in the U.S. 2023, by type

    • statista.com
    Updated Apr 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Energy Star HVAC-R units shipped in the U.S. 2023, by type [Dataset]. https://www.statista.com/statistics/1549851/energy-star-hvac-r-units-shipped-us-by-type/
    Explore at:
    Dataset updated
    Apr 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2023
    Area covered
    United States
    Description

    The type of Energy Star HVAC-R equipment with most shipments in 2023 were for refrigeration. HVAC-R stands for heating, ventilation, air-conditioning, and refrigeration. These figures do not refer to all shipments of HVAC equipments or devices, but just to those certified by Energy Star. California was the U.S. state with plumbing and HVAC contractor establishment.

  16. Transfer learning for galaxy feature detection: Finding Giant Star-forming...

    • zenodo.org
    • data.niaid.nih.gov
    bin, csv
    Updated Apr 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jürgen Popp; Jürgen Popp; Hugh Dickinson; Hugh Dickinson; Stephen Serjeant; Stephen Serjeant; Mike Walmsley; Mike Walmsley; Dominic Adams; Dominic Adams (2024). Transfer learning for galaxy feature detection: Finding Giant Star-forming Clumps in low redshift galaxies using Faster R-CNN [Dataset]. http://doi.org/10.5281/zenodo.8228890
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    Apr 4, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jürgen Popp; Jürgen Popp; Hugh Dickinson; Hugh Dickinson; Stephen Serjeant; Stephen Serjeant; Mike Walmsley; Mike Walmsley; Dominic Adams; Dominic Adams
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains the data released in the paper 'Transfer learning for galaxy feature detection: Finding Giant Star-forming Clumps in low redshift galaxies using Faster R-CNN' (DOI: 10.1093/rasti/rzae013).

    We release a detailed catalogue of Giant Star-forming Clumps (GSFCs), detected for the full set of Galaxy Zoo: Clump Scout galaxies observed by SDSS using the Faster R-CNN architecture with the Zoobot classification-CNN as a feature extraction backbone.

    The final models and code are made publicly available via Github: https://github.com/ou-astrophysics/Faster-R-CNN-for-Galaxy-Zoo-Clump-Scout.

    We will release updates if needed via Zenodo versioning. We recommend using the latest version of this repository. You can check the version you are currently viewing on the right-hand sidebar.

    Please cite the paper (DOI: 10.1093/rasti/rzae013) when using the data in this repository.

    The csv-file FRCNN_Zoobot_SDSS_GZCS_detections.csv has the following columns. Alternatively, the file FRCNN_Zoobot_SDSS_GZCS_detections.gzip contains the same data but stored as a parquet-file.

    Column nameDescription
    specobjidSDSS spec object ID
    dr7objidSDSS DR7 object ID
    clump_idClump index
    clump_label_idClump label ID (1 or 2)
    clump_label_nameClump label name
    clump_scoreDetection score for the clump
    clump_centre_raClump centroid RA in degrees
    clump_centre_decClump centroid dec in degrees
    clump_flux_uClump u-band flux in Jy
    clump_flux_gClump g-band flux in Jy
    clump_flux_rClump r-band flux in Jy
    clump_flux_iClump i-band flux in Jy
    clump_flux_zClump z-band flux in Jy
    clump_flux_err_uClump u-band flux error in Jy
    clump_flux_err_gClump g-band flux error in Jy
    clump_flux_err_rClump r-band flux error in Jy
    clump_flux_err_iClump i-band flux error in Jy
    clump_flux_err_zClump z-band flux error in Jy
    clump_mag_uClump u-band magnitude (AB-mag)
    clump_mag_gClump g-band magnitude (AB-mag)
    clump_mag_rClump r-band magnitude (AB-mag)
    clump_mag_iClump i-band magnitude (AB-mag)
    clump_mag_zClump z-band magnitude (AB-mag)
    clump_ext_mag_uClump u-band extinction (E(B-V), AB-mag)
    clump_ext_mag_gClump g-band extinction (E(B-V), AB-mag)
    clump_ext_mag_rClump r-band extinction (E(B-V), AB-mag)
    clump_ext_mag_iClump i-band extinction (E(B-V), AB-mag)
    clump_ext_mag_zClump z-band extinction (E(B-V), AB-mag)
    clump_mag_corr_uClump corrected u-band magnitude (AB-mag)
    clump_mag_corr_gClump corrected g-band magnitude (AB-mag)
    clump_mag_corr_rClump corrected r-band magnitude (AB-mag)
    clump_mag_corr_iClump corrected i-band magnitude (AB-mag)
    clump_mag_corr_zClump corrected z-band magnitude (AB-mag)
    clump_mag_corr_u_gClump colour (u-g)
    clump_mag_corr_g_rClump colour (g-r)
    clump_mag_corr_r_iClump colour (r-i)
    clump_mag_corr_i_zClump colour (i-z)
    clump_flux_ratioEst. clump/galaxy near-UV flux ratio (u-band)
    is_clump_3pctFlag (True/False) if clump/galaxy flux ratio is >3%
    is_clump_8pctFlag (True/False) if clump/galaxy flux ratio is >8%
    galaxy_raHost galaxy RA in degrees
    galaxy_decHost galaxy dec in degrees
    galaxy_zHost galaxy redshift
    galaxy_mag_uHost galaxy u-band magnitude (AB-mag)
    galaxy_mag_gHost galaxy g-band magnitude (AB-mag)
    galaxy_mag_rHost galaxy r-band magnitude (AB-mag)
    galaxy_mag_iHost galaxy i-band magnitude (AB-mag)
    galaxy_mag_zHost galaxy z-band magnitude (AB-mag)
    galaxy_mag_err_uHost galaxy u-band magnitude error (AB-mag)
    galaxy_mag_err_gHost galaxy g-band magnitude error (AB-mag)
    galaxy_mag_err_rHost galaxy r-band magnitude error (AB-mag)
    galaxy_mag_err_iHost galaxy i-band magnitude error (AB-mag)
    galaxy_mag_err_zHost galaxy z-band magnitude error (AB-mag)
    galaxy_flux_uHost galaxy u-band flux in Jy
    galaxy_flux_gHost galaxy g-band flux in Jy
    galaxy_flux_rHost galaxy r-band flux in Jy
    galaxy_flux_iHost galaxy i-band flux in Jy
    galaxy_flux_zHost galaxy z-band flux in Jy
    galaxy_expAB_rHost galaxy axis ratio from SDSS
    galaxy_expRad_rHost galaxy exponential fit scale radius from SDSS
    galaxy_lmassHost galaxy log mass in MSun
    galaxy_lssfrHost galaxy log specific SFR
    galaxy_mag_corr_uHost galaxy corrected u-band magnitude (AB-mag)
    galaxy_mag_corr_gHost galaxy corrected g-band magnitude (AB-mag)
    galaxy_mag_corr_rHost galaxy corrected r-band magnitude (AB-mag)
    galaxy_mag_corr_iHost galaxy corrected i-band magnitude (AB-mag)
    galaxy_mag_corr_zHost galaxy corrected z-band magnitude (AB-mag)

  17. o

    Faster R-CNN object detection model for detecting star-forming clumps in...

    • ordo.open.ac.uk
    bin
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jürgen Joseph Popp (2025). Faster R-CNN object detection model for detecting star-forming clumps in galaxy images from CLAUDS and HSC SSP (U-GRIZY filterbands) using Zoobot for feature extraction. [Dataset]. http://doi.org/10.5281/zenodo.15316576
    Explore at:
    binAvailable download formats
    Dataset updated
    May 29, 2025
    Dataset provided by
    The Open University
    Authors
    Jürgen Joseph Popp
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This model is trained and used to detect star-forming clumps in nearby galaxies observed by CLAUDS and HSC SSP. The model is based on the Faster R-CNN architecture and uses the Zoobot galaxy morphology classification model (https://doi.org/10.21105/joss.05312) as its feature extraction backbone. The model differs also from terrestrial object detection models as it takes 6-channel imaging data as input, U-band from CLAUDS, GRIZY from HSC SSP.

    The compressed archive contains example code for training the model and running detections on sample galaxy images. Included is also the data used to train the model and the final model weights.

  18. Data from: Brown Dwarfs are Violet: Python Tools for the Estimation of...

    • zenodo.org
    • data.niaid.nih.gov
    bin, png, txt
    Updated Jul 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steven R. Cranmer; Steven R. Cranmer (2024). Brown Dwarfs are Violet: Python Tools for the Estimation of Human-eye Colors of Stars and Substellar Objects [Dataset]. http://doi.org/10.5281/zenodo.7504254
    Explore at:
    png, txt, binAvailable download formats
    Dataset updated
    Jul 15, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Steven R. Cranmer; Steven R. Cranmer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The accompanying files include a Python Jupyter notebook (and associated data files read in by the Python code) that carry out the calculations described by Cranmer (2023), talk 246.05 presented at the 241st Meeting of the American Astronomical Society (AAS) in Seattle, Washington. The abstract of the talk is provided here:

    There has always been interest in the perceived colors of the stars. They were key to the development of the H-R diagram, and they are also used widely in educational and public-outreach imagery. Thus, it is useful to develop software tools to compute these colors, as accurately as possible, from spectral energy distributions. This presentation follows up on an RNAAS paper (Cranmer 2021) that presented a collection of objective (CIE coordinate) and subjective (RGB triple) colors for main-sequence stars and brown dwarfs. A new empirical method of converting from CIE to RGB values is described, and results for various stellar spectra are presented. Although brown dwarfs over a wide range of effective temperatures (400 to 2000 K) emit most of their flux in the infrared, their visible spectra often exhibit a local maximum around a strong dip in the Na I cross section at 0.4-0.5 microns. Thus, they may appear purple to human eyes. Also, the hottest (O-type) main-sequence stars may appear even "bluer than the blue sky" because of Paschen continuum absorption. This presentation will update earlier stellar and brown-dwarf color estimates using more recently published synthetic spectra, and it will also investigate the effects of atmospheric absorption, over a range of air-mass values, on these perceived colors. Python Jupyter notebooks that carry out these calculations will be uploaded to the Zenodo repository for open-access distribution.

    NOTE 1: The algorithms described here, for computing RGB triples, ought to be considered as preliminary results in ongoing research; i.e., they need additional testing and validation by comparing to the results of other more established ways of converting astronomical spectra to perceived colors.

    NOTE 2: These files follow on from those provided in another Zenodo upload associated with the 2021 RNAAS paper: https://doi.org/10.5281/zenodo.5293307

  19. f

    Data from: Small Sagittarius Star Cloud

    • middlebury.figshare.com
    image/fits
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Kemp (2023). Small Sagittarius Star Cloud [Dataset]. http://doi.org/10.57968/Middlebury.21637475.v1
    Explore at:
    image/fitsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Middlebury
    Authors
    Jonathan Kemp
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    object name: Small Sagittarius Star Cloud object type: star cloud coordinates: 18 16 48.0, -18 33 00 (2000) constellation: Sagittarius alternate names: Messier 24, IC 4715 observation filters: B, G, R file formats: FITS, PNG observation dates: 2020 June-September telescope: Mittelman Observatories 0.5m at New Mexico Skies

  20. TOO: DUST FORMATION AROUND R CRB TYPE STARS AT THEIR MINIMUMS SEARCH FOR...

    • esdcdoi.esac.esa.int
    Updated Apr 8, 1999
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Space Agency (1999). TOO: DUST FORMATION AROUND R CRB TYPE STARS AT THEIR MINIMUMS SEARCH FOR FULLERENES [Dataset]. http://doi.org/10.5270/esa-bsq5wip
    Explore at:
    https://www.iana.org/assignments/media-types/application/fitsAvailable download formats
    Dataset updated
    Apr 8, 1999
    Dataset authored and provided by
    European Space Agencyhttp://www.esa.int/
    Time period covered
    Oct 12, 1996 - Oct 15, 1996
    Description

    we propose to observe thermal emission from circumstellar dust grains around r crb type stars (rcbs) at their minimums using the sws in order to study the nucleation and grain formation around stars. our proposal consists of detecting the continuum emission due to carbon grains and searching for carbon cluster fullerenes (c60, c70, ...) around rcbs. rcbs are known to be hydrogendeficient, carbonrich stars and to undergo irregular decrease in light due to carbon grain formation. they provide a good opportunity to study the nucleation and the grain formation phenomena around stars because the condensible atoms are limited compared with that around normal carbon stars. furthermore, there is high possibility of detecting fullerenes around rcbs since the condition for the formation of carbon particles is similar to that for fullerene formation. although ubiquity of fullerenes is widely recognized on earth, they have not yet been discovered in space. this may be because fullerenes are not likely to be formed in the hydrogenrich atmosphere which is the case in normal carbon stars. rcbs are the best candidate at which fullerenes should be searched for. if one of our candidate stars shows decrease in light and will be observed with iso at that time, we also propose to take spectrum of this star again after the star recovers its light, if time allows. this spectrum together with the one taken at minimum can be used to determine the infrared properties of dust grains. whether fullerenes will be discovered or not, the spectroscopic data can be used to identify dust grains and to understand the nucleation and the grain formation in carbonrich environments. truncated!, Please see actual data for full text [truncated!, Please see actual data for full text]

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Li Xin (2023). Galaxy, star, quasar dataset [Dataset]. http://doi.org/10.57760/sciencedb.07177

Galaxy, star, quasar dataset

Explore at:
282 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 3, 2023
Dataset provided by
Science Data Bank
Authors
Li Xin
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

The data used in this paper is from the 16th issue of SDSS. SDSS-DR16 contains a total of 930,268 photometric images, with 1.2 billion observation sources and tens of millions of spectra. The data obtained in this paper is downloaded from the official website of SDSS. Specifically, the data is obtained through the SkyServerAPI structure by using SQL query statements in the subwebsite CasJobs. As the current SDSS photometric table PhotoObj can only classify all observed sources as point sources and surface sources, the target sources can be better classified as galaxies, stars and quasars through spectra. Therefore, we obtain calibrated sources in CasJobs by crossing SpecPhoto with the PhotoObj star list, and obtain target position information (right ascension and declination). Calibrated sources can tell them apart precisely and quickly. Each calibrated source is labeled with the parameter "Class" as "galaxy", "star", or "quasar". In this paper, observation day area 3462, 3478, 3530 and other 4 areas in SDSS-DR16 are selected as experimental data, because a large number of sources can be obtained in these areas to provide rich sample data for the experiment. For example, there are 9891 sources in the 3462-day area, including 2790 galactic sources, 2378 stellar sources and 4723 quasar sources. There are 3862 sources in the 3478 day area, including 1759 galactic sources, 577 stellar sources and 1526 quasar sources. FITS files are a commonly used data format in the astronomical community. By cross-matching the star list and FITS files in the local celestial region, we obtained images of 5 bands of u, g, r, i and z of 12499 galaxy sources, 16914 quasar sources and 16908 star sources as training and testing data.1.1 Image SynthesisSDSS photometric data includes photometric images of five bands u, g, r, i and z, and these photometric image data are respectively packaged in single-band format in FITS files. Images of different bands contain different information. Since the three bands g, r and i contain more feature information and less noise, Astronomical researchers typically use the g, r, and i bands corresponding to the R, G, and B channels of the image to synthesize photometric images. Generally, different bands cannot be directly synthesized. If three bands are directly synthesized, the image of different bands may not be aligned. Therefore, this paper adopts the RGB multi-band image synthesis software written by He Zhendong et al. to synthesize images in g, r and i bands. This method effectively avoids the problem that images in different bands cannot be aligned. The pixel of each photometry image in this paper is 2048×1489.1.2 Data tailoringThis paper first clipped the target image, image clipping can use image segmentation tools to solve this problem, this paper uses Python to achieve this process. In the process of clipping, we convert the right ascension and declination of the source in the star list into pixel coordinates on the photometric image through the coordinate conversion formula, and determine the specific position of the source through the pixel coordinates. The coordinates are regarded as the center point and clipping is carried out in the form of a rectangular box. We found that the input image size affects the experimental results. Therefore, according to the target size of the source, we selected three different cutting sizes, 40×40, 60×60 and 80×80 respectively. Through experiment and analysis, we find that convolutional neural network has better learning ability and higher accuracy for data with small image size. In the end, we chose to divide the surface source galaxies, point source quasars, and stars into 40×40 sizes.1.3 Division of training and test dataIn order to make the algorithm have more accurate recognition performance, we need enough image samples. The selection of training set, verification set and test set is an important factor affecting the final recognition accuracy. In this paper, the training set, verification set and test set are set according to the ratio of 8:1:1. The purpose of verification set is used to revise the algorithm, and the purpose of test set is used to evaluate the generalization ability of the final algorithm. Table 1 shows the specific data partitioning information. The total sample size is 34,000 source images, including 11543 galaxy sources, 11967 star sources, and 10490 quasar sources.1.4 Data preprocessingIn this experiment, the training set and test set can be used as the training and test input of the algorithm after data preprocessing. The data quantity and quality largely determine the recognition performance of the algorithm. The pre-processing of the training set and the test set are different. In the training set, we first perform vertical flip, horizontal flip and scale on the cropped image to enrich the data samples and enhance the generalization ability of the algorithm. Since the features in the celestial object source have the flip invariability, the labels of galaxies, stars and quasars will not change after rotation. In the test set, our preprocessing process is relatively simple compared with the training set. We carry out simple scaling processing on the input image and test input the obtained image.

Search
Clear search
Close search
Google apps
Main menu