Three-dimensional (3D) point clouds are the most direct and effective data form for studying plant structure and morphology. In point cloud studies, the segmentation of individual plants into organs directly determines the accuracy of organ-level phenotype estimation and the reliability of 3D plant reconstruction. However, highly accurate, automatic, and robust point cloud segmentation approaches for plants are unavailable, so high-throughput segmentation of many shoots remains challenging. Although deep learning could feasibly solve this issue, software tools for annotating 3D point clouds to construct training datasets are lacking. In this paper, a top-down point cloud segmentation algorithm for maize shoots based on optimal transportation distance is proposed, and on this basis a point cloud annotation toolkit for maize shoots, Label3DMaize, is developed. The toolkit was applied to semi-automatically segment and annotate point clouds of maize shoots at different growth stages through a series of operations, including stem segmentation, coarse segmentation, fine segmentation, and sample-based segmentation. It takes about 4 to 10 minutes to segment a maize shoot, and only 10%-20% of that time if coarse segmentation alone is required. Fine segmentation is more detailed than coarse segmentation, especially at organ connection regions, and the accuracy of coarse segmentation reaches 97.2% of that of fine segmentation. Label3DMaize integrates point cloud segmentation algorithms with manual interactive operations to realize semi-automatic point cloud segmentation of maize shoots at different growth stages. The toolkit provides a practical data annotation tool for further segmentation research based on deep learning and is expected to promote automatic point cloud processing of various plants.
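As a hedged illustration only (not the Label3DMaize implementation), the sketch below shows how shoot points might be softly assigned to a few organ "seed" centroids with an entropic optimal-transport (Sinkhorn) step; the seed coordinates, the squared-Euclidean cost, the uniform organ masses, and the regularization value are all assumptions made for demonstration.

```python
import numpy as np

def sinkhorn_assign(points, seeds, eps=0.05, n_iter=200):
    """Softly assign each point to one of K organ seeds via entropic OT.

    points: (N, 3) array of shoot points; seeds: (K, 3) array of seed centroids.
    Returns an (N,) array of seed indices (argmax of the transport plan rows).
    """
    # Squared Euclidean cost, normalized so eps is scale-free.
    cost = ((points[:, None, :] - seeds[None, :, :]) ** 2).sum(-1)
    cost = cost / cost.max()
    K = np.exp(-cost / eps)                       # Gibbs kernel
    a = np.full(len(points), 1.0 / len(points))   # uniform mass on points
    b = np.full(len(seeds), 1.0 / len(seeds))     # uniform mass on organs (an assumption)
    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(n_iter):                       # Sinkhorn iterations
        u = a / (K @ v)
        v = b / (K.T @ u)
    plan = u[:, None] * K * v[None, :]            # transport plan, shape (N, K)
    return plan.argmax(axis=1)

# Toy usage: three point clusters and three seed centroids.
rng = np.random.default_rng(0)
pts = np.concatenate([rng.normal(c, 0.1, size=(50, 3)) for c in np.eye(3)])
organ_ids = sinkhorn_assign(pts, np.eye(3))
```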
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Manual annotation for human action recognition with content semantics using 3D Point Cloud (3D-PC) in industrial environments consumes a lot of time and resources. This work aims to recognize, analyze, and model human actions to develop a framework for automatically extracting content semantics. The main contributions of this work are: 1. the design of a multi-layer structure of various DNN classifiers to detect and extract humans and dynamic objects from 3D-PC precisely, 2. empirical experiments with over 10 subjects to collect datasets of human actions and activities in an industrial setting, 3. the development of an intuitive GUI to verify human actions and their interaction activities with the environment, 4. the design and implementation of a methodology for automatic sequence matching of human actions in 3D-PC. All these procedures are merged into the proposed framework and evaluated in one industrial use case with flexible patch sizes. Comparing the new approach with standard methods has shown that the annotation process can be accelerated by a factor of 5.2 through automation.
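The work's exact sequence-matching method is not spelled out here, so the following is only a hedged sketch of one standard option, dynamic time warping (DTW), applied to two sequences of per-frame feature vectors (e.g. joint positions or embeddings extracted from the 3D-PC); the feature choice and sequence lengths are assumptions.

```python
import numpy as np

def dtw_distance(seq_a, seq_b):
    """Classic O(n*m) dynamic time warping between two feature sequences.

    seq_a: (n, d) and seq_b: (m, d) arrays of per-frame features.
    Returns the accumulated alignment cost (lower = more similar actions).
    """
    n, m = len(seq_a), len(seq_b)
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(seq_a[i - 1] - seq_b[j - 1])  # local frame distance
            acc[i, j] = d + min(acc[i - 1, j],      # insertion
                                acc[i, j - 1],      # deletion
                                acc[i - 1, j - 1])  # match
    return acc[n, m]

# Toy usage: a reference action template vs. a time-stretched observation.
t = np.linspace(0, 1, 40)
template = np.stack([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)], axis=1)
observed = template[np.clip((np.arange(55) * 40) // 55, 0, 39)]  # slower replay
print(dtw_distance(template, observed))
```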
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Point-wise annotation was conducted on the input point clouds to prepare a labeled dataset for segmenting the different sorghum plant organs. The leaf, stem, and panicle of each sorghum plant were manually labeled as 0, 1, and 2, respectively, using the segment module of the CloudCompare software.
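A hedged loading sketch follows, assuming the labeled points were exported from CloudCompare as whitespace-delimited text with columns x, y, z, label; the file name, column order, and any extra color or scalar columns are assumptions that should be checked against the actual export.

```python
import numpy as np

# Hypothetical export path and column layout: x y z label (0 = leaf, 1 = stem, 2 = panicle).
data = np.loadtxt("sorghum_plant_01_labeled.txt")
xyz = data[:, :3]
labels = data[:, 3].astype(int)

# Per-organ point counts as a quick sanity check of the annotation.
for organ_id, organ_name in enumerate(["leaf", "stem", "panicle"]):
    print(organ_name, int((labels == organ_id).sum()))
```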
The 3D Point Cloud Annotation Services market has emerged as a pivotal segment within the realms of computer vision, artificial intelligence, and geospatial technologies, addressing the increasing demand for accurate data interpretation across various industries. As enterprises strive to leverage 3D data for enhance
The Data Labeling Tools market is experiencing robust growth, driven by the escalating demand for high-quality training data in artificial intelligence (AI) and machine learning (ML) applications. The market's expansion is fueled by the increasing adoption of AI across various sectors, including automotive, healthcare, and finance, which necessitates vast amounts of accurately labeled data for model training and improvement. Technological advancements in automation and semi-supervised learning are streamlining the labeling process, improving efficiency and reducing costs, further contributing to market growth. A key trend is the shift towards more sophisticated labeling techniques, including 3D point cloud annotation and video annotation, reflecting the growing complexity of AI applications. Competition is fierce, with established players like Amazon Mechanical Turk and Google LLC coexisting with innovative startups offering specialized labeling solutions. The market is segmented by type of data labeling (image, text, video, audio), annotation method (manual, automated), and industry vertical, reflecting the diverse needs of different AI projects. Challenges include data privacy concerns, ensuring data quality and consistency, and the need for skilled annotators, all of which affect overall market growth and require continuous innovation and strategic investment.

Despite these challenges, the Data Labeling Tools market shows strong potential for continued expansion. The forecast period (2025-2033) anticipates a significant increase in market value, fueled by ongoing technological advancements, wider adoption of AI across various sectors, and a rising demand for high-quality data. The market is expected to witness increased consolidation as larger players acquire smaller companies to strengthen their market position and technological capabilities. Furthermore, the development of more sophisticated and automated labeling tools will continue to drive efficiency and reduce costs, making these tools accessible to a broader range of users and further fueling market growth. We anticipate that the focus on improving the accuracy and speed of data labeling will be paramount in shaping the future landscape of this dynamic market.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The PFuji-Size dataset [2] includes the 3D point clouds of 6 Fuji apple trees containing a total of 615 apples and an additional 25 apples scanned in laboratory conditions. Structure-from-motion and multi-view stereo techniques were used to generate the 3D point clouds of the captured scene. Apple locations and ground truth diameter annotations are provided for assessing fruit detection and size estimation algorithms.
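For context on how the diameter ground truth might be used, here is a hedged sketch (not part of PFuji-Size itself) that estimates an apple's diameter from the points of a single segmented fruit with a linear least-squares sphere fit; the input point array is hypothetical.

```python
import numpy as np

def sphere_fit_diameter(pts):
    """Estimate a sphere diameter from an (N, 3) fruit point cluster.

    Uses the algebraic identity x^2 + y^2 + z^2 = 2ax + 2by + 2cz + k,
    solved by linear least squares; returns (diameter, center).
    """
    A = np.hstack([2 * pts, np.ones((len(pts), 1))])
    b = (pts ** 2).sum(axis=1)
    sol, *_ = np.linalg.lstsq(A, b, rcond=None)
    center, k = sol[:3], sol[3]
    radius = np.sqrt(k + (center ** 2).sum())
    return 2 * radius, center

# Toy usage: noisy points on a sphere of diameter 0.08 m (an 8 cm apple).
rng = np.random.default_rng(1)
d = rng.normal(size=(500, 3))
d /= np.linalg.norm(d, axis=1, keepdims=True)
pts = 0.04 * d + rng.normal(scale=0.001, size=(500, 3))
print(sphere_fit_diameter(pts)[0])  # approximately 0.08
```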
Attribution-NonCommercial 3.0 (CC BY-NC 3.0): https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
UA_L-DoTT (University of Alabama’s Large Dataset of Trains and Trucks) is a collection of camera images and 3D LiDAR point cloud scans from five different data sites. Four of the data sites targeted trains on railways and the last targeted trucks on a four-lane highway. Low-light conditions were present at one of the data sites, showcasing unique differences between the individual sensors' data. The final data site utilized a mobile platform, which created a large variety of viewpoints in images and point clouds. The dataset consists of 93,397 raw images, 11,415 corresponding labeled text files, 354,334 raw point clouds, 77,860 corresponding labeled point clouds, and 33 timestamp files. These timestamps correlate images to point cloud scans via POSIX time. The data was collected with a sensor suite consisting of five different LiDAR sensors and a camera, providing various viewpoints and features of the same targets due to the variance in operational characteristics of the sensors. The inclusion of both raw and labeled data allows users to get started immediately with the labeled subset, or to label additional raw data as needed. This large dataset is beneficial to any researcher interested in machine learning using cameras, LiDARs, or both.
The full dataset is too large (~1 TB) to be uploaded to Mendeley Data. Please see the attached link for access to the full dataset.
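Since images and point cloud scans are correlated via POSIX timestamps, a hedged matching sketch is shown below; the timestamp arrays are hypothetical stand-ins for whatever the dataset's timestamp files contain, and the file parsing itself is omitted.

```python
import numpy as np

def match_by_timestamp(image_times, cloud_times, max_gap=0.05):
    """For each image timestamp, find the index of the nearest point cloud scan.

    image_times, cloud_times: 1-D arrays of POSIX times (seconds).
    Returns an array of indices into cloud_times, with -1 where the gap
    exceeds max_gap seconds.
    """
    order = np.argsort(cloud_times)
    sorted_times = cloud_times[order]
    pos = np.searchsorted(sorted_times, image_times)
    pos = np.clip(pos, 1, len(sorted_times) - 1)
    # Choose the closer of the two neighbouring scans.
    left, right = sorted_times[pos - 1], sorted_times[pos]
    pick_left = np.abs(image_times - left) <= np.abs(image_times - right)
    nearest = np.where(pick_left, pos - 1, pos)
    gaps = np.abs(sorted_times[nearest] - image_times)
    matched = order[nearest]
    return np.where(gaps <= max_gap, matched, -1)

# Toy usage with made-up POSIX times.
imgs = np.array([1.70e9 + 0.03, 1.70e9 + 0.11])
scans = np.array([1.70e9, 1.70e9 + 0.10, 1.70e9 + 0.20])
print(match_by_timestamp(imgs, scans))
```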
Our project (STPLS3D) aims to provide a large-scale aerial photogrammetry dataset with synthetic and real annotated 3D point clouds for semantic and instance segmentation tasks.
Although various 3D datasets with different functions and scales have been proposed recently, it remains challenging for individuals to complete the whole pipeline of large-scale data collection, sanitization, and annotation (e.g., semantic and instance labels). Moreover, the created datasets usually suffer from extremely imbalanced class distribution or partial low-quality data samples. Motivated by this, we explore the procedurally synthetic 3D data generation paradigm to equip individuals with the full capability of creating large-scale annotated photogrammetry point clouds. Specifically, we introduce a synthetic aerial photogrammetry point cloud generation pipeline that takes full advantage of open geospatial data sources and off-the-shelf commercial packages. Unlike generating synthetic data in virtual games, where the simulated data usually have limited gaming environments created by artists, the proposed pipeline simulates the reconstruction process of the real environment by following the same UAV flight pattern on a wide variety of synthetic terrain shapes and building densities, which ensures similar quality, noise patterns, and diversity to real data. In addition, the precise semantic and instance annotations can be generated fully automatically, avoiding the expensive and time-consuming manual annotation process. Based on the proposed pipeline, we present a richly annotated synthetic 3D aerial photogrammetry point cloud dataset, termed STPLS3D, with more than 16 km² of landscapes and up to 18 fine-grained semantic categories. For verification purposes, we also provide a parallel dataset collected from four areas in the real environment.
The AI Data Labeling Services market is experiencing rapid growth, driven by the increasing demand for high-quality training data to fuel advancements in artificial intelligence. The market, estimated at $10 billion in 2025, is projected to witness a robust Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching a substantial market size. This expansion is fueled by several key factors. The automotive industry leverages AI data labeling for autonomous driving systems, while healthcare utilizes it for medical image analysis and diagnostics. The retail and e-commerce sectors benefit from improved product recommendations and customer service through AI-powered chatbots and image recognition. Agriculture is employing AI data labeling for precision farming and crop monitoring. Furthermore, the increasing adoption of cloud-based solutions offers scalability and cost-effectiveness, bolstering market growth. While data security and privacy concerns present challenges, the ongoing development of innovative techniques and the rising availability of skilled professionals are mitigating these restraints. The market is segmented by application (automotive, healthcare, retail & e-commerce, agriculture, others) and type (cloud-based, on-premises), with cloud-based solutions gaining significant traction due to their flexibility and accessibility. Key players like Scale AI, Labelbox, and Appen are actively shaping market dynamics through technological innovations and strategic partnerships. The North American market currently holds a significant share, but regions like Asia Pacific are poised for substantial growth due to increasing AI adoption and technological advancements.

The competitive landscape is dynamic, characterized by both established players and emerging startups. While larger companies possess substantial resources and experience, smaller, agile companies are innovating with specialized solutions and niche applications. Future growth will likely be influenced by advancements in data annotation techniques (e.g., synthetic data generation), increasing demand for specialized labeling services (e.g., 3D point cloud labeling), and the expansion of AI applications across various industries. The continued development of robust data governance frameworks and ethical considerations surrounding data privacy will play a critical role in shaping the market's trajectory in the coming years. Regional growth will be influenced by factors such as government regulations, technological infrastructure, and the availability of skilled labor. Overall, the AI Data Labeling Services market presents a compelling opportunity for growth and investment in the foreseeable future.
The proposed dataset, termed PC-Urban (Urban Point Cloud), is captured with an Ouster LiDAR sensor with 64 channels. The sensor is installed on an SUV that drives through the downtown of Perth, Western Australia (WA), Australia. The dataset comprises over 4.3 billion points captured for 66K sensor frames. The labelled data is organized as registered and raw point cloud frames, where the former contains a varying number of registered consecutive frames. We provide 25 class labels in the dataset covering 23 million points and 5K instances. Labelling is performed with PC-Annotate and can easily be extended by end-users employing the same tool. The data is organized into unlabelled and labelled 3D point clouds. The unlabelled data is provided in .PCAP file format, which is the direct output format of the used Ouster LiDAR sensor. Raw frames are extracted from the recorded .PCAP files in the form of Ply and Excel files using the Ouster Studio software. Labelled 3D point cloud data consists of registered or raw point clouds. A labelled point cloud is a combination of Ply, Excel, Labels and Summary files. A Ply file contains the X, Y, Z values along with color information. An Excel file contains the X, Y, Z values, Intensity, Reflectivity, Ring, Noise, and Range of each point; these attributes can be useful for semantic segmentation using deep learning algorithms. The Label and Label Summary files have been explained in the previous section. One gigabyte of raw data contains nearly 1,300 raw frames, whereas 66,425 frames are provided in the dataset, each comprising 65,536 points. Hence, 4.3 billion points captured with the Ouster LiDAR sensor are provided. Annotation of 25 general outdoor classes is provided, which include car, building, bridge, tree, road, letterbox, traffic signal, light-pole, rubbish bin, cycles, motorcycle, truck, bus, bushes, road sign board, advertising board, road divider, road lane, pedestrians, side-path, wall, bus stop, water, zebra-crossing, and background. With the released data, a total of 143 scenes are annotated, including both raw and registered frames.
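A hedged reading sketch for one labelled frame follows; it assumes the third-party plyfile and pandas packages, that the Excel column headers match the attribute names listed above, and hypothetical file names. The Label and Summary file formats documented by PC-Annotate are not parsed here.

```python
import numpy as np
import pandas as pd
from plyfile import PlyData  # pip install plyfile; pandas needs openpyxl for .xlsx

# Hypothetical files belonging to one labelled PC-Urban frame.
ply = PlyData.read("frame_000123.ply")
v = ply["vertex"]
xyz = np.stack([v["x"], v["y"], v["z"]], axis=1)

# Per-point attributes from the matching Excel file (column names assumed).
attrs = pd.read_excel("frame_000123.xlsx")

# Combine geometry and attributes into one table for downstream training,
# assuming the Ply and Excel rows describe the same points in the same order.
frame = pd.DataFrame(xyz, columns=["x", "y", "z"])
frame["intensity"] = attrs["Intensity"].to_numpy()
frame["reflectivity"] = attrs["Reflectivity"].to_numpy()
print(frame.head())
```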
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set contains:
5 annotated point clouds of real Chenopodium alba plants obtained from multi-view 2D camera imaging. Annotations consist of 5 classes: leaf blade, petiole, apex, main stem, and branch. The .txt files contain both 3D coordinates and annotations; .ply files with the raw 3D point data (without annotations) are also provided.
24 annotated point clouds of virtual Chenopodium alba plants generated by an L-system simulation program. Annotations consist of 3 classes: leaf blade, petiole, and stem. The 3D coordinates and annotations are in separate .txt files.
These files have been used in a companion paper.
https://doi.org/10.17026/fp39-0x58
While modern deep learning algorithms for semantic segmentation of airborne laser scanning (ALS) point clouds have achieved considerable success, the training process often requires a large number of labelled 3D points. Pointwise annotation of 3D point clouds, especially for large-scale ALS datasets, is extremely time-consuming. Weak supervision, which requires only a small annotation effort while allowing networks to achieve comparable performance, is an alternative solution. Assigning a weak label to a subcloud, a group of points, is an efficient annotation strategy. With the supervision of subcloud labels, we first train a classification network that produces pseudo labels for the training data. Then the pseudo labels are taken as the input to a segmentation network, which gives the final predictions on the testing data. As the quality of the pseudo labels determines the performance of the segmentation network on testing data, we propose an overlap region loss and an elevation attention unit for the classification network to obtain more accurate pseudo labels. The overlap region loss, which considers the nearby subcloud semantic information, is introduced to enhance the awareness of the semantic heterogeneity within a subcloud. The elevation attention helps the classification network to encode more representative features for ALS point clouds. For the segmentation network, in order to effectively learn representative features from inaccurate pseudo labels, we adopt a supervised contrastive loss that uncovers the underlying correlations of class-specific features. Extensive experiments on three ALS datasets demonstrate the superior performance of our model to the baseline method (Wei et al., 2020).
Date Submitted: 2023-11-21
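For reference, a minimal NumPy sketch of the generic supervised contrastive loss (in the spirit of Khosla et al.) is given below; the paper may use a different formulation, temperature, or feature normalisation, so treat this only as an illustration of pulling same-(pseudo-)label features together.

```python
import numpy as np

def supervised_contrastive_loss(features, labels, tau=0.1):
    """Generic SupCon loss over a batch of feature vectors.

    features: (N, D) array; labels: (N,) integer (possibly pseudo) labels.
    Each anchor is contrasted against all other samples; same-label samples
    act as positives.
    """
    z = features / np.linalg.norm(features, axis=1, keepdims=True)
    sim = (z @ z.T) / tau                       # cosine similarities / temperature
    n = len(labels)
    not_self = ~np.eye(n, dtype=bool)
    # Row-wise log-softmax over all non-self samples.
    row_max = np.where(not_self, sim, -np.inf).max(axis=1, keepdims=True)
    exp_sim = np.exp(sim - row_max) * not_self
    log_prob = sim - row_max - np.log(exp_sim.sum(axis=1, keepdims=True))
    positives = (labels[:, None] == labels[None, :]) & not_self
    n_pos = positives.sum(axis=1)
    has_pos = n_pos > 0                         # skip anchors with no positives
    per_anchor = -(log_prob * positives).sum(axis=1)[has_pos] / n_pos[has_pos]
    return per_anchor.mean()

# Toy usage with random features and three pseudo classes.
rng = np.random.default_rng(0)
feats = rng.normal(size=(32, 16))
lbls = rng.integers(0, 3, size=32)
print(supervised_contrastive_loss(feats, lbls))
```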
The SemanticRail3D dataset consists of a total of 438 .laz point clouds of railway environments. Each point cloud covers a track length of approximately 200 metres, and in total the dataset has approximately 2.8 billion points. The labels cover a total of 11 classes and also include the track position of each railway line, as well as instance segmentation.
See Readme file for more information.
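A hedged loading sketch with the laspy package is shown below; whether the class and instance labels live in the standard classification field or in extra dimensions depends on how SemanticRail3D was written, so check the Readme and the point format before relying on the field names. The tile name is hypothetical.

```python
import numpy as np
import laspy  # pip install "laspy[lazrs]" to get .laz (LAZ) decompression support

las = laspy.read("railway_tile_0001.laz")       # hypothetical tile name
xyz = np.vstack([las.x, las.y, las.z]).T        # scaled coordinates, shape (N, 3)

# Candidate label sources: the standard classification field or extra dimensions.
dims = list(las.point_format.dimension_names)
print("point format dimensions:", dims)
if "classification" in dims:
    classes, counts = np.unique(np.asarray(las.classification), return_counts=True)
    print(dict(zip(classes.tolist(), counts.tolist())))
```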
The Argoverse 2 Lidar Dataset is a collection of 20,000 scenarios with lidar sensor data, HD maps, and ego-vehicle pose. It does not include imagery or 3D annotations. The dataset is designed to support research into self-supervised learning in the lidar domain, as well as point cloud forecasting.
The dataset is divided into train, validation, and test sets of 16,000, 2,000, and 2,000 scenarios. This supports a point cloud forecasting task in which the future frames of the test set serve as the ground truth. Nonetheless, we encourage the community to use the dataset broadly for other tasks, such as self-supervised learning and map automation.
All Argoverse datasets contain lidar data from two out-of-phase 32-beam sensors rotating at 10 Hz. While this can be aggregated into 64-beam frames at 10 Hz, it is also reasonable to think of it as 32-beam frames at 20 Hz. Furthermore, all Argoverse datasets contain raw lidar returns with per-point timestamps, so the data does not need to be interpreted in quantized frames.
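To illustrate the 10 Hz vs. 20 Hz interpretation, a small hedged sketch is shown below that bins raw per-point timestamps into frames of a chosen period; the timestamp array is a placeholder for whatever the Argoverse 2 tooling returns, and no Argoverse API calls are made here.

```python
import numpy as np

def bin_points_into_frames(point_times_ns, period_s):
    """Group per-point timestamps (nanoseconds) into frames of a fixed period.

    Returns an integer frame index per point; period_s = 0.1 reproduces the
    10 Hz "64-beam" view, period_s = 0.05 the 20 Hz "32-beam" view.
    """
    t = (point_times_ns - point_times_ns.min()) / 1e9   # seconds from sweep start
    return np.floor(t / period_s).astype(int)

# Toy usage: one second of points with uniformly spread timestamps.
times_ns = np.sort(np.random.default_rng(0).integers(0, int(1e9), size=10_000))
print(len(np.unique(bin_points_into_frames(times_ns, 0.1))))   # ~10 frames
print(len(np.unique(bin_points_into_frames(times_ns, 0.05))))  # ~20 frames
```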
S3DIS comprises 6 colored 3D point clouds from 6 large-scale indoor areas, along with semantic instance annotations for 12 object categories (wall, floor, ceiling, beam, column, window, door, sofa, table, chair, bookcase, and board).
The Stanford Large-Scale 3D Indoor Spaces (S3DIS) dataset is composed of the colored 3D point clouds of six large-scale indoor areas from three different buildings, covering approximately 935, 965, 450, 1700, 870, and 1100 square meters (6020 square meters in total). The areas show diverse properties in architectural style and appearance and mainly include office areas, educational and exhibition spaces; conference rooms, personal offices, restrooms, open spaces, lobbies, stairways, and hallways are commonly found therein. The point clouds are automatically generated using the Matterport scanner, without any manual intervention. The dataset also includes semantic instance annotations on the point clouds for 12 semantic elements: structural elements (ceiling, floor, wall, beam, column, window, and door) and commonly found items and furniture (table, chair, sofa, bookcase, and board).
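A hedged reading sketch follows, assuming the commonly distributed folder layout Area_X/<room>/Annotations/<class>_<id>.txt with one "x y z r g b" line per point; verify the layout against the actual release before use.

```python
from pathlib import Path
import numpy as np

def load_room_instances(room_dir):
    """Load per-instance point arrays and class names for one S3DIS room.

    Assumes room_dir contains an 'Annotations' folder with files like chair_1.txt.
    Returns a list of (class_name, (N, 6) array of x y z r g b) tuples.
    """
    instances = []
    for txt in sorted(Path(room_dir, "Annotations").glob("*.txt")):
        class_name = txt.stem.split("_")[0]   # e.g. 'chair' from 'chair_1'
        instances.append((class_name, np.loadtxt(txt)))
    return instances

# Hypothetical usage; adjust the path to the downloaded dataset.
# for name, pts in load_room_instances("Area_1/office_1"):
#     print(name, pts.shape)
```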
The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. It covers over 6,000 m² and contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in the form of both regular and 360° equirectangular images) as well as camera information. It also includes registered raw and semantically annotated 3D meshes and point clouds. In addition, the dataset contains the raw RGB and depth imagery along with the corresponding camera information per scan location. The dataset enables the development of joint and cross-modal learning models and potentially unsupervised approaches utilizing the regularities present in large-scale indoor spaces.
In more detail, the dataset is collected in 6 large-scale indoor areas that originate from 3 different buildings of mainly educational and office use. For each area, all modalities are registered in the same reference system, yielding pixel-to-pixel correspondences among them. In a nutshell, the presented dataset contains a total of 70,496 regular RGB and 1,413 equirectangular RGB images, along with their corresponding depths, surface normals, semantic annotations, global XYZ images in OpenEXR format, and camera metadata. It also contains the raw sensor data, which comprises 18 HDR RGB and depth images (6 looking forward, 6 towards the top, 6 towards the bottom) along with the corresponding camera metadata for each of the 1,413 scan locations, yielding a total of 25,434 RGBD raw images. In addition, we provide whole-building 3D reconstructions as textured meshes, as well as the corresponding 3D semantic meshes. It also includes the colored 3D point cloud data of these areas, with a total of 695,878,620 points, which have been previously presented in the Stanford Large-Scale 3D Indoor Spaces Dataset (S3DIS).
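As a small hedged illustration of working with the 360° equirectangular images, the sketch below converts pixel coordinates into unit ray directions; the exact axis and orientation conventions used by 2D-3D-S may differ, so treat the mapping as a generic one.

```python
import numpy as np

def equirect_pixel_to_ray(u, v, width, height):
    """Map equirectangular pixel coordinates to unit ray directions.

    u, v: pixel column/row arrays; width, height: image size in pixels.
    Convention here (an assumption): longitude spans [-pi, pi) left to right,
    latitude spans [pi/2, -pi/2] top to bottom, z points up.
    """
    lon = (np.asarray(u) + 0.5) / width * 2 * np.pi - np.pi
    lat = np.pi / 2 - (np.asarray(v) + 0.5) / height * np.pi
    return np.stack([np.cos(lat) * np.cos(lon),
                     np.cos(lat) * np.sin(lon),
                     np.sin(lat)], axis=-1)

# Toy usage: rays for the four corners of a 2048x1024 panorama.
corners_u = np.array([0, 2047, 0, 2047])
corners_v = np.array([0, 0, 1023, 1023])
print(equirect_pixel_to_ray(corners_u, corners_v, 2048, 1024))
```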
The repository is related to CRBeDaSet (http://dx.doi.org/10.17632/9nvn27yw62.2) and extends it with new data. Moreover, it contains the results of CRBeDaSet processing. The description of the dataset and its evaluation are presented in the paper: Gabara, G.; Sawicki, P. CRBeDaSet: A Benchmark Dataset for High Accuracy Close Range 3D Object Reconstruction. Remote Sens. 2023, 15, 1116. https://doi.org/10.3390/rs15041116
The first part is related to computations using different COTS software. The second part includes the comparison of TLS and image-based point clouds. The third part presents the results of different detectors and descriptors used on CRBeDaSet images.
Folders contain the following data:
1. Application processing results - the results of CRBeDaSet processing using COTS software
- high resolution images of dense point clouds
- high resolution images of 3D mesh models
2. C2C_TLS_ImageBasedPC - the comparison of TLS and image-based point clouds
- high resolution images of 3D point clouds for four elevations
3. MatchingResults - the usage of different detectors and descriptors on CRBeDaSet images
- results for four elevations (4 x 2 images) using 12 detectors and 12 descriptors (high resolution).
Files contain the following data:
1. CRBeDaSet_ Description of folders and files.
2. CRBeDaSet_merged_coordinate.rar - CRBeDaSet_PC_merged.pts compressed with WinRAR. RAW scan stations merged using Leica Cyclone™ SCAN software in the object coordinate system (the same coordinate system as used in the image-based reconstruction); a minimal .pts reading sketch follows this file list.
3. CRBeDaSet_merged_filtered_annotated.rar - CRBeDaSet_PC_merged.pts filtered and annotated using LoD3 class topology (compressed with WinRAR).
4. CRBeDaSet_RAW_SCAN_stations.zip - RAW scan stations (compressed with WinRAR).
5. remotesensing-15-01116.pdf - article with the CRBeDaSet description and evaluation.
6. Figures.zip - high-resolution figures presented in the article.
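Assuming CRBeDaSet_PC_merged.pts follows the common Leica PTS text convention (a point count on the first line followed by "x y z intensity r g b" rows), a hedged reading sketch is given below; the column set should be confirmed against the file itself, and the merged cloud is large, so only a sample is read here.

```python
import numpy as np

def read_pts(path, max_points=None):
    """Read a text .pts point cloud (assumed layout: count line, then point rows).

    Typical Leica PTS rows are 'x y z intensity r g b'; confirm the actual
    column count by inspecting the first data line.
    """
    with open(path, "r") as f:
        n = int(f.readline().split()[0])           # declared number of points
        rows = n if max_points is None else min(n, max_points)
        data = np.loadtxt(f, max_rows=rows)
    return data

# Hypothetical usage on the merged, annotated cloud (reads only a sample).
# sample = read_pts("CRBeDaSet_PC_merged.pts", max_points=100_000)
# print(sample.shape)
```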
The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. It covers over 6,000 m² collected in 6 large-scale indoor areas that originate from 3 different buildings. It contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in forms of both regular and 360° equirectangular images) as well as camera information. It also includes registered raw and semantically annotated 3D meshes and point clouds. The dataset enables development of joint and cross-modal learning models and potentially unsupervised approaches utilizing the regularities present in large-scale indoor spaces.
Global Generative AI in Data Labeling Solution and Services is segmented by Application (Autonomous driving, NLP, Medical imaging, Retail AI, Robotics), Type (Text Annotation, Image/Video Tagging, Audio Labeling, 3D Point Cloud Labeling, Synthetic Data Generation) and Geography (North America, LATAM, West Europe, Central & Eastern Europe, Northern Europe, Southern Europe, East Asia, Southeast Asia, South Asia, Central Asia, Oceania, MEA)
https://dataverse.no/api/datasets/:persistentId/versions/1.1/customlicense?persistentId=doi:10.18710/HSMJLL
The dataset comprises the pretraining and testing data for our work: Terrain-Informed Self-Supervised Learning: Enhancing Building Footprint Extraction from LiDAR Data with Limited Annotations. The pretraining data consists of images corresponding to the Digital Surface Models (DSM) and Digital Terrain Models (DTM) of Norway, with a ground resolution of 1 meter, using the UTM 33N projection. The primary data source for this dataset is the Norwegian Mapping Authority (Kartverket), which has made the data freely available on its website under the CC BY 4.0 license (source: https://hoydedata.no/, license terms: https://creativecommons.org/licenses/by/4.0/). The DSM and DTM models are generated from 3D LiDAR point clouds collected through periodic aerial campaigns. During these campaigns, the LiDAR sensors capture data with a maximum offset of 20 degrees from nadir. Additionally, a subset of the data also includes building footprints/labels created using the OpenStreetMap (OSM) database. Specifically, building footprints extracted from the OSM database were rasterized to match the grid of the DTM and DSM models. These rasterized labels are made available under the Open Database License (ODbL) in compliance with the OSM license requirements. We hope this dataset facilitates various applications in geographic analysis, remote sensing, and machine learning research.
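As a hedged pre-processing sketch (not part of the dataset itself), the snippet below derives a normalized DSM (nDSM = DSM - DTM), a common terrain-informed input for building footprint extraction; it assumes the DSM and DTM tiles are aligned GeoTIFFs readable with rasterio, and the file names are hypothetical.

```python
import numpy as np
import rasterio  # pip install rasterio

# Hypothetical, co-registered 1 m tiles covering the same extent.
with rasterio.open("dsm_tile.tif") as src:
    dsm = src.read(1).astype("float32")
    profile = src.profile
with rasterio.open("dtm_tile.tif") as src:
    dtm = src.read(1).astype("float32")

# Normalized DSM: height above terrain; buildings and vegetation stand out.
ndsm = dsm - dtm
ndsm = np.clip(ndsm, 0, None)   # clamp small negative artefacts to zero

profile.update(dtype="float32", count=1)
with rasterio.open("ndsm_tile.tif", "w", **profile) as dst:
    dst.write(ndsm, 1)
```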