96 datasets found

d
TagX Data Annotation | Automated Annotation | AI-assisted labeling with...
datarade.ai
Updated Aug 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TagX (2022). TagX Data Annotation | Automated Annotation | AI-assisted labeling with human verification | Customized annotation | Data for AI & LLMs [Dataset]. https://datarade.ai/data-products/data-annotation-services-for-artificial-intelligence-and-data-tagx
Explore at:
.json, .xml, .csv, .xls, .txtAvailable download formats
Dataset updated
Aug 14, 2022
Dataset authored and provided by
TagX
Area covered
Egypt, Georgia, Sint Eustatius and Saba, Estonia, Comoros, Saint Barthélemy, Guatemala, Cabo Verde, Lesotho, Central African Republic
Description
TagX data annotation services are a set of tools and processes used to accurately label and classify large amounts of data for use in machine learning and artificial intelligence applications. The services are designed to be highly accurate, efficient, and customizable, allowing for a wide range of data types and use cases.

The process typically begins with a team of trained annotators reviewing and categorizing the data, using a variety of annotation tools and techniques, such as text classification, image annotation, and video annotation. The annotators may also use natural language processing and other advanced techniques to extract relevant information and context from the data.

Once the data has been annotated, it is then validated and checked for accuracy by a team of quality assurance specialists. Any errors or inconsistencies are corrected, and the data is then prepared for use in machine learning and AI models.

TagX annotation services can be applied to a wide range of data types, including text, images, videos, and audio. The services can be customized to meet the specific needs of each client, including the type of data, the level of annotation required, and the desired level of accuracy.

TagX data annotation services provide a powerful and efficient way to prepare large amounts of data for use in machine learning and AI applications, allowing organizations to extract valuable insights and improve their decision-making processes.
i
15M+ Images | AI Training Data | Annotated imagery data for AI | Object &...
data.imagedatasets.ai
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Image Datasets, 15M+ Images | AI Training Data | Annotated imagery data for AI | Object & Scene Detection | Global Coverage [Dataset]. https://data.imagedatasets.ai/products/2m-images-annotated-imagery-data-full-exif-data-object-image-datasets
Explore at:
Dataset authored and provided by
Image Datasets
Area covered
Czechia, Israel, Gabon, Singapore, Senegal, Martinique, Marshall Islands, Gambia, Brazil, Belize
Description
A comprehensive dataset of 15M+ images sourced globally, featuring full EXIF data, including camera settings and photography details. Enriched with object and scene detection metadata, this dataset is ideal for AI model training in image recognition, classification, and segmentation.
d
15M+ Images | AI Training Data | Annotated imagery data for AI | Object &...
datarade.ai
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Image Datasets, 15M+ Images | AI Training Data | Annotated imagery data for AI | Object & Scene Detection | Global Coverage [Dataset]. https://datarade.ai/data-products/2m-images-annotated-imagery-data-full-exif-data-object-image-datasets
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset authored and provided by
Image Datasets
Area covered
Albania, United States Minor Outlying Islands, New Zealand, Malta, Mexico, Chad, Anguilla, Qatar, Brunei Darussalam, Georgia
Description
This dataset features over 15,000,000 high-quality images sourced from photographers worldwide. Designed to support AI and machine learning applications, it provides a diverse and richly annotated collection of imagery.

Key Features: 1. Comprehensive Metadata: the dataset includes full EXIF data, detailing camera settings such as aperture, ISO, shutter speed, and focal length. Additionally, each image is pre-annotated with object and scene detection metadata, making it ideal for tasks like classification, detection, and segmentation. Popularity metrics, derived from engagement on our proprietary platform, are also included.

Unique Sourcing Capabilities: the images are collected through a proprietary gamified platform for photographers. Competitions focused on flower photography ensure fresh, relevant, and high-quality submissions. Custom datasets can be sourced on-demand within 72 hours, allowing for specific requirements such as particular flower species or geographic regions to be met efficiently.

Global Diversity: photographs have been sourced from contributors in over 100 countries, ensuring a vast array of flower species, colors, and environmental settings. The images feature varied contexts, including natural habitats, gardens, bouquets, and urban landscapes, providing an unparalleled level of diversity.

High-Quality Imagery: the dataset includes images with resolutions ranging from standard to high-definition to meet the needs of various projects. Both professional and amateur photography styles are represented, offering a mix of artistic and practical perspectives suitable for a variety of applications.

Popularity Scores Each image is assigned a popularity score based on its performance in GuruShots competitions. This unique metric reflects how well the image resonates with a global audience, offering an additional layer of insight for AI models focused on user preferences or engagement trends.

I-Ready Design: this dataset is optimized for AI applications, making it ideal for training models in tasks such as image recognition, classification, and segmentation. It is compatible with a wide range of machine learning frameworks and workflows, ensuring seamless integration into your projects.

Licensing & Compliance: the dataset complies fully with data privacy regulations and offers transparent licensing for both commercial and academic use.

Use Cases 1. Training AI systems for plant recognition and classification. 2. Enhancing agricultural AI models for plant health assessment and species identification. 3. Building datasets for educational tools and augmented reality applications. 4. Supporting biodiversity and conservation research through AI-powered analysis.

This dataset offers a comprehensive, diverse, and high-quality resource for training AI and ML models, tailored to deliver exceptional performance for your projects. Customizations are available to suit specific project needs. Contact us to learn more!
D
Data Annotation and Collection Services Report
marketresearchforecast.com
doc, pdf, ppt
Updated Mar 9, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Forecast (2025). Data Annotation and Collection Services Report [Dataset]. https://www.marketresearchforecast.com/reports/data-annotation-and-collection-services-30703
Explore at:
doc, ppt, pdfAvailable download formats
Dataset updated
Mar 9, 2025
Dataset authored and provided by
Market Research Forecast
License
https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The Data Annotation and Collection Services market is experiencing robust growth, driven by the increasing adoption of artificial intelligence (AI) and machine learning (ML) across diverse sectors. The market, estimated at $10 billion in 2025, is projected to achieve a Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching approximately $45 billion by 2033. This significant expansion is fueled by several key factors. The surge in autonomous driving initiatives necessitates high-quality data annotation for training self-driving systems, while the burgeoning smart healthcare sector relies heavily on annotated medical images and data for accurate diagnoses and treatment planning. Similarly, the growth of smart security systems and financial risk control applications demands precise data annotation for improved accuracy and efficiency. Image annotation currently dominates the market, followed by text annotation, reflecting the widespread use of computer vision and natural language processing. However, video and voice annotation segments are showing rapid growth, driven by advancements in AI-powered video analytics and voice recognition technologies. Competition is intense, with both established technology giants like Alibaba Cloud and Baidu, and specialized data annotation companies like Appen and Scale Labs vying for market share. Geographic distribution shows a strong concentration in North America and Europe initially, but Asia-Pacific is expected to emerge as a major growth region in the coming years, driven primarily by China and India's expanding technology sectors. The market, however, faces certain challenges. The high cost of data annotation, particularly for complex tasks such as video annotation, can pose a barrier to entry for smaller companies. Ensuring data quality and accuracy remains a significant concern, requiring robust quality control mechanisms. Furthermore, ethical considerations surrounding data privacy and bias in algorithms require careful attention. To overcome these challenges, companies are investing in automation tools and techniques like synthetic data generation, alongside developing more sophisticated quality control measures. The future of the Data Annotation and Collection Services market will likely be shaped by advancements in AI and ML technologies, the increasing availability of diverse data sets, and the growing awareness of ethical considerations surrounding data usage.
d
750K+ Furniture Images | AI Training Data | Object Detection Data |...
datarade.ai
Updated Dec 22, 2007
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Image Datasets (2007). 750K+ Furniture Images | AI Training Data | Object Detection Data | Annotated imagery data | Global Coverage [Dataset]. https://datarade.ai/data-products/500k-furniture-images-object-detection-data-full-exif-da-image-datasets
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Dec 22, 2007
Dataset authored and provided by
Image Datasets
Area covered
Tunisia, Suriname, Slovakia, Liberia, Fiji, Burkina Faso, Indonesia, Uruguay, Haiti, Saint Kitts and Nevis
Description
This dataset features over 750,000 high-quality images of furniture sourced from photographers worldwide. Designed to support AI and machine learning applications, it provides a diverse and richly annotated collection of flower imagery.

Key Features: 1. Comprehensive Metadata: the dataset includes full EXIF data, detailing camera settings such as aperture, ISO, shutter speed, and focal length. Additionally, each image is pre-annotated with object and scene detection metadata, making it ideal for tasks like classification, detection, and segmentation. Popularity metrics, derived from engagement on our proprietary platform, are also included.

Unique Sourcing Capabilities: the images are collected through a proprietary gamified platform for photographers. Competitions focused on flower photography ensure fresh, relevant, and high-quality submissions. Custom datasets can be sourced on-demand within 72 hours, allowing for specific requirements such as particular flower species or geographic regions to be met efficiently.

Global Diversity: photographs have been sourced from contributors in over 100 countries, ensuring a vast array of flower species, colors, and environmental settings. The images feature varied contexts, including natural habitats, gardens, bouquets, and urban landscapes, providing an unparalleled level of diversity.

High-Quality Imagery: the dataset includes images with resolutions ranging from standard to high-definition to meet the needs of various projects. Both professional and amateur photography styles are represented, offering a mix of artistic and practical perspectives suitable for a variety of applications.

Popularity Scores Each image is assigned a popularity score based on its performance in GuruShots competitions. This unique metric reflects how well the image resonates with a global audience, offering an additional layer of insight for AI models focused on user preferences or engagement trends.

I-Ready Design: this dataset is optimized for AI applications, making it ideal for training models in tasks such as image recognition, classification, and segmentation. It is compatible with a wide range of machine learning frameworks and workflows, ensuring seamless integration into your projects.

Licensing & Compliance: the dataset complies fully with data privacy regulations and offers transparent licensing for both commercial and academic use.

Use Cases 1. Training AI systems for plant recognition and classification. 2. Enhancing agricultural AI models for plant health assessment and species identification. 3. Building datasets for educational tools and augmented reality applications. 4. Supporting biodiversity and conservation research through AI-powered analysis.

This dataset offers a comprehensive, diverse, and high-quality resource for training AI and ML models, tailored to deliver exceptional performance for your projects. Customizations are available to suit specific project needs. Contact us to learn more!
Data from: Region-based Annotation Data of Fire Images for Intelligent...
zenodo.org
data.niaid.nih.gov
zip
Updated Jan 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wahyono; Andi Dharmawan; Agus Harjoko; Chrystian; Faisal Dharma Adhinata; Wahyono; Andi Dharmawan; Agus Harjoko; Chrystian; Faisal Dharma Adhinata (2022). Region-based Annotation Data of Fire Images for Intelligent Surveillance System [Dataset]. http://doi.org/10.5281/zenodo.5574537
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5574537
Dataset updated
Jan 23, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Wahyono; Andi Dharmawan; Agus Harjoko; Chrystian; Faisal Dharma Adhinata; Wahyono; Andi Dharmawan; Agus Harjoko; Chrystian; Faisal Dharma Adhinata
Description
This data presents fire segmentation annotation data on 12 commonly used and publicly available “VisiFire Dataset” videos from http://signal.ee.bilkent.edu.tr/VisiFire/. This annotations dataset was obtained by per-frame, manual hand annotation over the fire region with 2,684 total annotated frames. Since this annotation provides per-frame segmentation data, it offers a new and unique fire motion feature to the existing video, unlike other fire segmentation data that are collected from different still images. The annotations dataset also provides ground truth for segmentation task on videos. With segmentation task, it offers better insight on how well a machine learning model understood, not only detecting whether a fire is present, but also its exact location by calculating metrics such as Intersection over Union (IoU) with this annotations data. This annotations data is a tremendously useful addition to train, develop, and create a much better smart surveillance system for early detection in high-risk fire hotspots area.
Image segmentations produced by BAMF under the AIMI Annotations initiative
zenodo.org
data.niaid.nih.gov
zip
Updated May 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jeff Van Oss; Jeff Van Oss; Gowtham Krishnan Murugesan; Gowtham Krishnan Murugesan; Diana McCrumb; Diana McCrumb; Rahul Soni; Rahul Soni (2024). Image segmentations produced by BAMF under the AIMI Annotations initiative [Dataset]. http://doi.org/10.5281/zenodo.10081112
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.10081112
Dataset updated
May 24, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Jeff Van Oss; Jeff Van Oss; Gowtham Krishnan Murugesan; Gowtham Krishnan Murugesan; Diana McCrumb; Diana McCrumb; Rahul Soni; Rahul Soni
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Imaging Data Commons (IDC)(https://imaging.datacommons.cancer.gov/) [1] connects researchers with publicly available cancer imaging data, often linked with other types of cancer data. Many of the collections have limited annotations due to the expense and effort required to create these manually. The increased capabilities of AI analysis of radiology images provides an opportunity to augment existing IDC collections with new annotation data. To further this goal, we trained several nnUNet [2] based models for a variety of radiology segmentation tasks from public datasets and used them to generate segmentations for IDC collections.

To validate the models performance, roughly 10% of the predictions were manually reviewed and corrected by both a board certified radiologist and a medical student (non-expert). Additionally, this non-expert looked at all the ai predictions and rated them on a 5 point Likert scale .

This record provides the AI segmentations, Manually corrected segmentations, and Manual scores for the inspected IDC Collection images.

File Overview

breast-fdg-pet-ct.zip

Segment Description: FDG-avid lesions in breast from FDG PET/CT scans QIN-Breast

IDC Collection: QIN-Breast

Links: model weights, github

kidney-ct.zip

Segment Description: Kidney, Tumor, and Cysts from contrast enhanced CT scans

IDS Collection: TCGA-KIRC

Links: model weights, github

liver-ct.zip

Segment Description: Liver from CT scans

IDC Collection: TCGA-LIHC

Links: model weights, github

liver-mr.zip

Segment Description: Liver from T1 MRI scans

IDC Collection: TCGA-LIHC

Links: model weights, github

lung-ct.zip

Segment Description: Lung and Nodules (3mm-30mm) from CT scans

IDC Collections:

Anti-PD-1-Lung

LUNG-PET-CT-Dx

NSCLC Radiogenomics

RIDER Lung PET-CT

TCGA-LUAD

TCGA-LUSC

Links: model weights 1, model weights 2, github

lung-fdg-pet-ct.zip

Segment Description: Lungs and FDG-avid lesions in the lung from FDG PET/CT scans

IDC Collections:

ACRIN-NSCLC-FDG-PET

Anti-PD-1-Lung

LUNG-PET-CT-Dx

NSCLC Radiogenomics

RIDER Lung PET-CT

TCGA-LUAD

TCGA-LUSC

Links: model weights, github

prostate-mr.zip

Segment Description: Prostate from T2 MRI scans

IDC Collection: ProstateX

Links: model weights, github

Likert Score Definition:

5 Strongly Agree - Use-as-is (i.e., clinically acceptable, and could be used for treatment without change)

4 Agree - Minor edits that are not necessary. Stylistic differences, but not clinically important. The current segmentation is acceptable

3 Neither agree nor disagree - Minor edits that are necessary. Minor edits are those that the review judges can be made in less time than starting from scratch or are expected to have minimal effect on treatment outcome

2 Disagree - Major edits. This category indicates that the necessary edit is required to ensure correctness, and sufficiently significant that user would prefer to start from the scratch

1 Strongly disagree - Unusable. This category indicates that the quality of the automatic annotations is so bad that they are unusable.

Zip File Folder Structure

Each zip file in the collection correlates to a specific segmentation task. The common folder structure is

ai-segmentations-dcm This directory contains the AI model predictions in DICOM-SEG format for all analyzed IDC collection files

qa-segmentations-dcm This directory contains manual corrected segmentation files, based on the AI prediction, in DICOM-SEG format. Only a fraction, ~10%, of the AI predictions were corrected. Corrections were performed by radiologist (rad*) and non-experts (ne*)

qa-results.csv CSV file linking the study/series UIDs with the ai segmentation file, radiologist corrected segmentation file, radiologist ratings of AI performance.
d
TagX - 10,000+ Car damage images with annotation | Car insurance &...
datarade.ai
Updated Jul 22, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TagX (2023). TagX - 10,000+ Car damage images with annotation | Car insurance & inspection | Global coverage with custom annotations [Dataset]. https://datarade.ai/data-products/10-000-car-damage-images-with-annotation-car-insurance-i-tagx
Explore at:
Dataset updated
Jul 22, 2023
Dataset authored and provided by
TagX
Area covered
Ascension and Tristan da Cunha, Virgin Islands (U.S.), Bonaire, Rwanda, Faroe Islands, Poland, Guinea-Bissau, Myanmar, Congo, Senegal
Description
We collect images of Damaged cars from around the world and create a custom annotations on those images for our customers. Annotations can be customized as per requirement, both Polygon annotation and Bounding box annotations are possible. The collection can also be customized with the desired country of origin, color, model, and Damage types.

Customers can order images from a single car from 1 to 8 angles.

The dataset consists of images in JPEG and PNG formats.

A curated dataset of damaged cars having dents, scratches, bends, cracks, and totaled cars.

The dataset has proven to be suitable for training Artificial Intelligence algorithms to detect type damages in a car by analyzing a picture. The model developed with this dataset is highly sought after in the Insurance industry.
D
Data Annotation Platform Report
marketresearchforecast.com
doc, pdf, ppt
Updated Mar 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Forecast (2025). Data Annotation Platform Report [Dataset]. https://www.marketresearchforecast.com/reports/data-annotation-platform-30706
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Mar 9, 2025
Dataset authored and provided by
Market Research Forecast
License
https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global data annotation platform market is experiencing robust growth, driven by the increasing demand for high-quality training data across diverse sectors. The market's expansion is fueled by the proliferation of artificial intelligence (AI) and machine learning (ML) applications in autonomous driving, smart healthcare, and financial risk control. Autonomous vehicles, for instance, require vast amounts of annotated data for object recognition and navigation, significantly boosting demand. Similarly, the healthcare sector leverages data annotation for medical image analysis, leading to advancements in diagnostics and treatment. The market is segmented by application (Autonomous Driving, Smart Healthcare, Smart Security, Financial Risk Control, Social Media, Others) and annotation type (Image, Text, Voice, Video, Others). The prevalent use of cloud-based platforms, coupled with the rising adoption of AI across various industries, presents significant opportunities for market expansion. While the market faces challenges such as high annotation costs and data privacy concerns, the overall growth trajectory remains positive, with a projected compound annual growth rate (CAGR) suggesting substantial market expansion over the forecast period (2025-2033). Competition among established players like Appen, Amazon, and Google, alongside emerging players focusing on specialized annotation needs, is expected to intensify. The regional distribution of the market reflects the concentration of AI and technology development in specific geographical regions. North America and Europe currently hold a significant market share due to their robust technological infrastructure and early adoption of AI technologies. However, the Asia-Pacific region, particularly China and India, is demonstrating rapid growth potential due to the burgeoning AI industry and expanding digital economy. This signifies a shift in market dynamics, as the demand for data annotation services increases globally, leading to a more geographically diverse market landscape. Continuous advancements in annotation techniques, including the use of automated tools and crowdsourcing, are expected to reduce costs and improve efficiency, further fueling market growth.
F
Bahasa Shopping List OCR Image Dataset
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bahasa Shopping List OCR Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/bahasa-shopping-list-ocr-image-dataset
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/data-license-agreementhttps://www.futurebeeai.com/data-license-agreement
Dataset funded by
FutureBeeAI
Description
What’s Included
Introducing the Bahasa Shopping List Image Dataset - a diverse and comprehensive collection of handwritten text images carefully curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the Bahasa language.
Dataset Contain & Diversity:
Containing more than 2000 images, this Bahasa OCR dataset offers a wide distribution of different types of shopping list images. Within this dataset, you'll discover a variety of handwritten text, including sentences, and individual item name words, quantity, comments, etc on shopping lists. The images in this dataset showcase distinct handwriting styles, fonts, font sizes, and writing variations.
To ensure diversity and robustness in training your OCR model, we allow limited (less than three) unique images in a single handwriting. This ensures we have diverse types of handwriting to train your OCR model on. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible Bahasa text.
The images have been captured under varying lighting conditions, including day and night, as well as different capture angles and backgrounds. This diversity helps build a balanced OCR dataset, featuring images in both portrait and landscape modes.
All these shopping lists were written and images were captured by native Bahasa people to ensure text quality, prevent toxic content, and exclude PII text. We utilized the latest iOS and Android mobile devices with cameras above 5MP to maintain image quality. Images in this training dataset are available in both JPEG and HEIC formats.
Metadata:
In addition to the image data, you will receive structured metadata in CSV format. For each image, this metadata includes information on image orientation, country, language, and device details. Each image is correctly named to correspond with the metadata.
This metadata serves as a valuable resource for understanding and characterizing the data, aiding informed decision-making in the development of Bahasa text recognition models.
Update & Custom Collection:
We are committed to continually expanding this dataset by adding more images with the help of our native Bahasa crowd community.
If you require a customized OCR dataset containing shopping list images tailored to your specific guidelines or device distribution, please don't hesitate to contact us. We have the capability to curate specialized data to meet your unique requirements.
Additionally, we can annotate or label the images with bounding boxes or transcribe the text in the images to align with your project's specific needs using our crowd community.
License:
This image dataset, created by FutureBeeAI, is now available for commercial use.
Conclusion:
Leverage this shopping list image OCR dataset to enhance the training and performance of text recognition, text detection, and optical character recognition models for the Bahasa language. Your journey to improved language understanding and processing begins here.
d
Re-ID Data | 600,000 ID | CCTV Data |Computer Vision Data| Identity Data
datarade.ai
Updated Dec 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2023). Re-ID Data | 600,000 ID | CCTV Data |Computer Vision Data| Identity Data [Dataset]. https://datarade.ai/data-products/nexdata-re-id-data-60-000-id-image-video-ai-ml-train-nexdata
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Dec 8, 2023
Dataset authored and provided by
Nexdata
Area covered
Bolivia (Plurinational State of), Turkmenistan, Sri Lanka, Cuba, Luxembourg, Ecuador, Portugal, Trinidad and Tobago, Russian Federation, United Arab Emirates
Description
Specifications Data size : 60,000 ID

Population distribution : the race distribution is Asians, Caucasians and black people, the gender distribution is male and female, the age distribution is from children to the elderly

Collecting environment : including indoor and outdoor scenes (such as supermarket, mall and residential area, etc.)

Data diversity : different ages, different time periods, different cameras, different human body orientations and postures, different ages collecting environment

Device : surveillance cameras, the image resolution is not less than 1,9201,080

Data format : the image data format is .jpg, the annotation file format is .json

Annotation content : human body rectangular bounding boxes, 15 human body attributes

Quality Requirements : A rectangular bounding box of human body is qualified when the deviation is not more than 3 pixels, and the qualified rate of the bounding boxes shall not be lower than 97%;Annotation accuracy of attributes is over 97%

About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data.These ready-to-go Identity Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/computervision?source=Datarade
Sentinel-2 KappaZeta Cloud and Cloud Shadow Masks
zenodo.org
data.niaid.nih.gov
pdf, zip
Updated Jul 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marharyta Domnich; Marharyta Domnich; Kaupo Voormansik; Olga Wold; Fariha Harun; Indrek Sünter; Heido Trofimov; Anton Kostiukhin; Mihkel Järveoja; Kaupo Voormansik; Olga Wold; Fariha Harun; Indrek Sünter; Heido Trofimov; Anton Kostiukhin; Mihkel Järveoja (2024). Sentinel-2 KappaZeta Cloud and Cloud Shadow Masks [Dataset]. http://doi.org/10.5281/zenodo.5095024
Explore at:
zip, pdfAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5095024
Dataset updated
Jul 18, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Marharyta Domnich; Marharyta Domnich; Kaupo Voormansik; Olga Wold; Fariha Harun; Indrek Sünter; Heido Trofimov; Anton Kostiukhin; Mihkel Järveoja; Kaupo Voormansik; Olga Wold; Fariha Harun; Indrek Sünter; Heido Trofimov; Anton Kostiukhin; Mihkel Järveoja
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
General information

The dataset consists of 4403 labelled subscenes from 155 Sentinel-2 (S2) Level-1C (L1C) products distributed over the Northern European terrestrial area. Each S2 product was oversampled at 10 m resolution for 512 x 512 pixels subscenes. 6 L1C S2 products were labelled fully. Among other 149 S2 products the most challenging ~10 subscenes per product were selected for labelling. In total the dataset represents 4403 labelled Sentinel-2 subscenes, where each sub-tile is 512 x 512 pixels at 10 m resolution. The dataset consists of around 30 S2 products per month from April to August and 3 S2 products per month for September and October. Each selected L1C S2 product represents different clouds, such as cumulus, stratus, or cirrus, which are spread over various geographical locations in Northern Europe.

The classification pixel-wise map consists of the following categories:

0 – MISSING: missing or invalid pixels;

1 – CLEAR: pixels without clouds or cloud shadows;

2 – CLOUD SHADOW: pixels with cloud shadows;

3 – SEMI TRANSPARENT CLOUD: pixels with thin clouds through which the land is visible; include cirrus clouds that are on the high cloud level (5-15km).

4 – CLOUD: pixels with cloud; include stratus and cumulus clouds that are on the low cloud level (from 0-0.2km to 2km).

5 – UNDEFINED: pixels that the labeler is not sure which class they belong to.

The dataset was labelled using Computer Vision Annotation Tool (CVAT) and Segments.ai. With the possibility of integrating active learning process in Segments.ai, the labelling was performed semi-automatically.

The dataset limitations must be considered: the data is covering only terrestrial region and does not include water areas; the dataset is not presented in winter conditions; the dataset represent summer conditions, therefore September and October contain only test products used for validation. Current subscenes do not have georeferencing, however, we are working towards including them in next version.

More details about the dataset structure can be found in README.

Contributions and Acknowledgements

The data were annotated by Fariha Harun and Olga Wold. The data verification and Software Development was performed by Indrek Sünter, Heido Trofimov, Anton Kostiukhin, Marharyta Domnich, Mihkel Järveoja, Olga Wold. Methodology was developed by Kaupo Voormansik, Indrek Sünter, Marharyta Domnich.
We would like to thank Segments.ai annotation tool for instant and an individual customer support. We are grateful to European Space Agency for reviews and suggestions. We would like to extend our thanks to Prof. Gholamreza Anbarjafari for the feedback and directions.
The project was funded by European Space Agency, Contract No. 4000132124/20/I-DT.
c
Annotations for ACRIN-HNSCC-FDG-PET-CT Collection
cancerimagingarchive.net
stage.cancerimagingarchive.net
csv, dicom, n/a
Updated Dec 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Cancer Imaging Archive (2022). Annotations for ACRIN-HNSCC-FDG-PET-CT Collection [Dataset]. http://doi.org/10.7937/JVGC-AQ36
Explore at:
n/a, csv, dicomAvailable download formats
Unique identifier
https://doi.org/10.7937/JVGC-AQ36
Dataset updated
Dec 23, 2022
Dataset authored and provided by
The Cancer Imaging Archive
License
https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/
Time period covered
Nov 13, 2023
Dataset funded by
National Cancer Institutehttp://www.cancer.gov/
Description
This dataset contains image annotations derived from the NCI Clinical Trial "ACRIN-HNSCC-FDG-PET-CT (ACRIN 6685)”. This dataset was generated as part of an NCI project to augment TCIA datasets with annotations that will improve their value for cancer researchers and AI developers.
Annotation Protocol
For each patient, all scans were reviewed to identify and annotate the clinically relevant time points and sequences/series. Scans were initially annotated by an international team of radiologists holding MBBS degrees or higher, which were then reviewed by US-based board-certified radiologists to ensure accuracy. In a typical patient all available time points were annotated. The following annotation rules were followed:

PERCIST criteria was followed for PET imaging. Specifically, the lesions estimated to have the most elevated SUVmax were annotated.

RECIST 1.1 was otherwise generally followed for MR and CT imaging. A maximum of 5 lesions were annotated per patient scan (timepoint); no more than 2 per organ. The same 5 lesions were annotated at each time point. Lymph nodes were annotated if >1 cm in short axis. Other lesions were annotated if >1 cm. If the primary lesion is < 1 cm, it was still annotated.

Three-dimensional segmentations of lesions were created in the axial plane. If no axial plane was available, lesions were annotated in the coronal plane.

MRIs were annotated using the T1-weighted axial post contrast sequence, fat saturated if available.

CTs were annotated using the axial post contrast series. If not available, the non contrast series were annotated.

PET/CTs were annotated on the CT and attenuation corrected PET images.

If the post contrast CT was performed the same day as the PET/CT, the non contrast CT portion of the PET/CT was not annotated.

Lesions were labeled separately.

The volume of each annotated lesion was calculated and reported in cubic centimeters [cc] in the Annotation Metadata CSV.

Seed points were automatically generated, but reviewed by a radiologist.

A “negative” annotation was created for any exam without findings.

At each time point:

A seed point (kernel) was created for each segmented structure. The seed points for each segmentation are provided in a separate DICOM RTSTRUCT file.

SNOMED-CT “Anatomic Region Sequence” and “Segmented Property Category Code Sequence” and codes were inserted for all segmented structures.

“Tracking ID” and “Tracking UID” tags were inserted for each segmented structure to enable longitudinal lesion tracking.

Imaging time point codes were inserted to help identify each annotation in the context of the clinical trial assessment protocol.

“Clinical Trial Time Point ID” was used to encode time point type using one of the following strings as applicable: “pre-dose” or “post-chemotherapy”.

Content Item in “Acquisition Context Sequence” was added containing "Time Point Type" using Concept Code Sequence (0040,A168) selected from:

(255235001, SCT, “Pre-dose”) (in this trial, both the CT/MRI and PET/CT, while being different timepoints, are pre-treatment)

Important supplementary information and sample code

A spreadsheet containing key details about the annotations is available in the Data Access section below.

A Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and the REST API to access these data can be found in the Additional Resources section below.
n
103,282-Images Driver Behavior Annotation Data
m.nexdata.ai
Updated Nov 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2023). 103,282-Images Driver Behavior Annotation Data [Dataset]. https://m.nexdata.ai/datasets/computervision/1033
Explore at:
Dataset updated
Nov 2, 2023
Dataset provided by
nexdata technology inc
Authors
Nexdata
Variables measured
Accuracy, Data size, Annotation, Population, Collection time, Desensitization, Image parameter, Collection device, Collection diversity, Collection environment
Description
103,282-Images Driver Behavior Annotation Data. The data includes multiple ages, multiple time periods and behaviors (Dangerous behaviors, Fatigue behaviors, Visual movement behaviors). In terms of annotation, 72 facial landmarks (including pupils), face attributes, gesture bounding boxes, seatbelt bounding boxes, pupil landmarks and behavior categories were annotated in the data. This data can be used for tasks such as driver behavior analysis.
I
Image Data Labeling Service Report
archivemarketresearch.com
doc, pdf, ppt
Updated Feb 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Image Data Labeling Service Report [Dataset]. https://www.archivemarketresearch.com/reports/image-data-labeling-service-30906
Explore at:
doc, ppt, pdfAvailable download formats
Dataset updated
Feb 16, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The Image Data Labeling Service market is expected to experience significant growth over the next decade, driven by the increasing demand for annotated data for artificial intelligence (AI) applications. The market is expected to grow from USD XXX million in 2025 to USD XXX million by 2033, at a CAGR of XX%. The growth of the market is attributed to the growing adoption of AI in various industries, including IT, automotive, healthcare, and financial services. The growing use of computer vision and machine learning algorithms for tasks such as object detection, image classification, and facial recognition has led to a surge in demand for annotated data. Image data labeling services provide the labeled data that is essential for training these algorithms. The market is expected to be further driven by the increasing availability of cloud-based services and the adoption of automation tools for image data labeling. Additionally, the growing awareness of the importance of data quality for AI applications is expected to drive the adoption of image data labeling services.
D
Data Annotation Service Report
datainsightsmarket.com
doc, pdf, ppt
Updated Feb 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data Insights Market (2025). Data Annotation Service Report [Dataset]. https://www.datainsightsmarket.com/reports/data-annotation-service-1928464
Explore at:
pdf, ppt, docAvailable download formats
Dataset updated
Feb 8, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
Market Overview: The global data annotation service market is projected to reach a valuation of USD XXX million by 2033, expanding at a CAGR of XX% from 2025 to 2033. The surging demand for accurate and annotated data for artificial intelligence (AI) and machine learning (ML) models is driving the market growth. The increasing adoption of AI across various industries, including healthcare, manufacturing, and finance, is fueling the need for high-quality data annotation services. Market Dynamics and Key Players: Key drivers of the data annotation service market include the growing demand for automated processes, the rise of IoT devices generating massive data, and advancements in AI technology. However, the high cost of data annotation and concerns over data privacy pose challenges. The market is segmented into application areas (government, enterprise, others) and annotation types (text, image, others). Notable companies operating in the market include Appen Limited, CloudApp, Cogito Tech LLC, and Deep Systems. Regional markets include North America, Europe, Asia Pacific, and the Middle East & Africa. The study period spans from 2019 to 2033, with 2025 as the base year and a forecast period from 2025 to 2033.
n
70,846 Images – Human Face Segmentation Data
nexdata.ai
m.nexdata.ai
Updated Mar 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2025). 70,846 Images – Human Face Segmentation Data [Dataset]. https://www.nexdata.ai/datasets/computervision/945?source=Huggingface
Explore at:
Dataset updated
Mar 14, 2025
Dataset provided by
nexdata technology inc
Authors
Nexdata
Variables measured
Accuracy, Data size, Data diversity, Image Parameter, Annotation content, Collection environment, Population distribution
Description
Human Face Segmentation Data from 70,846 Images. Pure color backgrounds, interior and exterior scene types are all included in the data. Both males and females are included in the data. Asian, Black, and Caucasian races are represented in the race distribution. The age ranges from young children to elderly people. Simple and complex facial expressions can be found in the data (large-angle tilt of face, closing eye, glower, pucker, opening mouth, etc.). We used pixel-by-pixel segmentation annotations to annotate the human face, the five sense organs, the body, and appendages. The information can be applied to tasks like facial Recon Related Tasks
P
Side Profile Tires Dataset Dataset
paperswithcode.com
Updated Sep 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Side Profile Tires Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/side-profile-tires-dataset
Explore at:
Dataset updated
Sep 18, 2024
Description
Description:

👉 Download the dataset here

This dataset consists of meticulously annotated images of tire side profiles, specifically designed for image segmentation tasks. Each tire has been manually labeled to ensure high accuracy, making this dataset ideal for training machine learning models focused on tire detection, classification, or related automotive applications.

The annotations are provided in the YOLO v5 format, leveraging the PyTorch framework for deep learning applications. The dataset offers a robust foundation for researchers and developers working on object detection, autonomous vehicles, quality control, or any project requiring precise tire identification from images.

Download Dataset

Data Collection and Labeling Process:

Manual Labeling: Every tire in the dataset has been individually labeled to guarantee that the annotations are highly precise, significantly reducing the margin of error in model training.

Annotation Format: YOLO v5 PyTorch format, a highly efficient and widely used format for real-time object detection systems.

Pre-processing Applied:

Auto-orientation: Pixel data has been automatically oriented, and EXIF orientation metadata has been stripped to ensure uniformity across all images, eliminating issues related to

image orientation during processing.

Resizing: All images have been resized to 416×416 pixels using stretching to maintain compatibility with common object detection frameworks like YOLO. This resizing standardizes the image input size while preserving visual integrity.

Applications:

Automotive Industry: This dataset is suitable for automotive-focused AI models, including tire quality assessment, tread pattern recognition, and autonomous vehicle systems.

Surveillance and Security: Use cases in monitoring systems where identifying tires is crucial for vehicle recognition in parking lots or traffic management systems.

Manufacturing and Quality Control: Can be used in tire manufacturing processes to automate defect detection and classification.

Dataset Composition:

Number of Images: [Add specific number]

File Format: JPEG/PNG

Annotation Format: YOLO v5 PyTorch

Image Size: 416×416 (standardized across all images)

This dataset is sourced from Kaggle.
R
Custom Yolov7 On Kaggle On Custom Dataset
universe.roboflow.com
zip
Updated Jan 29, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Owais Ahmad (2023). Custom Yolov7 On Kaggle On Custom Dataset [Dataset]. https://universe.roboflow.com/owais-ahmad/custom-yolov7-on-kaggle-on-custom-dataset-rakiq/dataset/2
Explore at:
zipAvailable download formats
Dataset updated
Jan 29, 2023
Dataset authored and provided by
Owais Ahmad
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Person Car Bounding Boxes
Description
Custom Training with YOLOv7 🔥

Some Important links

Model Inference🤖

🚀Training Yolov7 on Kaggle

Weight and Biases 🐝

HuggingFace 🤗 Model Repo

Contact Information

Name - Owais Ahmad

Phone - +91-9515884381

Email - owaiskhan9654@gmail.com

Portfolio - https://owaiskhan9654.github.io/

Objective

To Showcase custom Object Detection on the Given Dataset to train and Infer the Model using newly launched YoloV7.

Data Acquisition

The goal of this task is to train a model that can localize and classify each instance of Person and Car as accurately as possible.

Link to the Downloadable Dataset

from IPython.display import Markdown, display display(Markdown("../input/Car-Person-v2-Roboflow/README.roboflow.txt"))

Custom Training with YOLOv7 🔥

In this Notebook, I have processed the images with RoboFlow because in COCO formatted dataset was having different dimensions of image and Also data set was not splitted into different Format. To train a custom YOLOv7 model we need to recognize the objects in the dataset. To do so I have taken the following steps:

Export the dataset to YOLOv7

Train YOLOv7 to recognize the objects in our dataset

Evaluate our YOLOv7 model's performance

Run test inference to view performance of YOLOv7 model at work

📦 YOLOv7

https://raw.githubusercontent.com/Owaiskhan9654/Yolo-V7-Custom-Dataset-Train-on-Kaggle/main/car-person-2.PNG" width=800>

Image Credit - jinfagang

Step 1: Install Requirements

!git clone https://github.com/WongKinYiu/yolov7 # Downloading YOLOv7 repository and installing requirements %cd yolov7 !pip install -qr requirements.txt !pip install -q roboflow

Downloading YOLOV7 starting checkpoint

!wget "https://github.com/WongKinYiu/yolov7/releases/download/v0.1/yolov7.pt"

import os import glob import wandb import torch from roboflow import Roboflow from kaggle_secrets import UserSecretsClient from IPython.display import Image, clear_output, display # to display images print(f"Setup complete. Using torch {torch._version_} ({torch.cuda.get_device_properties(0).name if torch.cuda.is_available() else 'CPU'})")

https://camo.githubusercontent.com/dd842f7b0be57140e68b2ab9cb007992acd131c48284eaf6b1aca758bfea358b/68747470733a2f2f692e696d6775722e636f6d2f52557469567a482e706e67">

I will be integrating W&B for visualizations and logging artifacts and comparisons of different models!

YOLOv7-Car-Person-Custom

try: user_secrets = UserSecretsClient() wandb_api_key = user_secrets.get_secret("wandb_api") wandb.login(key=wandb_api_key) anonymous = None except: wandb.login(anonymous='must') print('To use your W&B account, Go to Add-ons -> Secrets and provide your W&B access token. Use the Label name as WANDB. Get your W&B access token from here: https://wandb.ai/authorize') wandb.init(project="YOLOvR",name=f"7. YOLOv7-Car-Person-Custom-Run-7")

Step 2: Assemble Our Dataset

https://uploads-ssl.webflow.com/5f6bc60e665f54545a1e52a5/615627e5824c9c6195abfda9_computer-vision-cycle.png" alt="">

In order to train our custom model, we need to assemble a dataset of representative images with bounding box annotations around the objects that we want to detect. And we need our dataset to be in YOLOv7 format.

In Roboflow, We can choose between two paths:

Convert an existing Coco dataset to YOLOv7 format. In Roboflow it supports over 30 formats object detection formats for conversion.

Uploading only these raw images and annotate them in Roboflow with Roboflow Annotate.

Version v2 Aug 12, 2022 Looks like this.

https://raw.githubusercontent.com/Owaiskhan9654/Yolo-V7-Custom-Dataset-Train-on-Kaggle/main/Roboflow.PNG" alt="">

user_secrets = UserSecretsClient() roboflow_api_key = user_secrets.get_secret("roboflow_api")

rf = Roboflow(api_key=roboflow_api_key) project = rf.workspace("owais-ahmad").project("custom-yolov7-on-kaggle-on-custom-dataset-rakiq") dataset = project.version(2).download("yolov7")

Step 3: Training Custom pretrained YOLOv7 model

Here, I am able to pass a number of arguments: - img: define input image size - batch: determine
F
Odia Handwritten Sticky Notes OCR Image Dataset
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Odia Handwritten Sticky Notes OCR Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/odia-sticky-notes-ocr-image-dataset
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/data-license-agreementhttps://www.futurebeeai.com/data-license-agreement
Dataset funded by
FutureBeeAI
Description
What’s Included
Introducing the Odia Sticky Notes Image Dataset - a diverse and comprehensive collection of handwritten text images carefully curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the Odia language.
Dataset Contain & Diversity:
Containing more than 2000 images, this Odia OCR dataset offers a wide distribution of different types of sticky note images. Within this dataset, you'll discover a variety of handwritten text, including quotes, sentences, and individual words on sticky notes. The images in this dataset showcase distinct handwriting styles, fonts, font sizes, and writing variations.
To ensure diversity and robustness in training your OCR model, we allow limited (less than three) unique images in a single handwriting. This ensures we have diverse types of handwriting to train your OCR model on. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible Odia text.
The images have been captured under varying lighting conditions, including day and night, as well as different capture angles and backgrounds. This diversity helps build a balanced OCR dataset, featuring images in both portrait and landscape modes.
All these sticky notes were written and images were captured by native Odia people to ensure text quality, prevent toxic content, and exclude PII text. We utilized the latest iOS and Android mobile devices with cameras above 5MP to maintain image quality. Images in this training dataset are available in both JPEG and HEIC formats.
Metadata:
In addition to the image data, you will receive structured metadata in CSV format. For each image, this metadata includes information on image orientation, country, language, and device details. Each image is correctly named to correspond with the metadata.
This metadata serves as a valuable resource for understanding and characterizing the data, aiding informed decision-making in the development of Odia text recognition models.
Update & Custom Collection:
We are committed to continually expanding this dataset by adding more images with the help of our native Odia crowd community.
If you require a customized OCR dataset containing sticky note images tailored to your specific guidelines or device distribution, please don't hesitate to contact us. We have the capability to curate specialized data to meet your unique requirements.
Additionally, we can annotate or label the images with bounding boxes or transcribe the text in the images to align with your project's specific needs using our crowd community.
License:
This image dataset, created by FutureBeeAI, is now available for commercial use.
Conclusion:
Leverage this sticky notes image OCR dataset to enhance the training and performance of text recognition, text detection, and optical character recognition models for the Odia language. Your journey to improved language understanding and processing begins here.

Facebook

Twitter

Click to copy link

Link copied

Cite

TagX (2022). TagX Data Annotation | Automated Annotation | AI-assisted labeling with human verification | Customized annotation | Data for AI & LLMs [Dataset]. https://datarade.ai/data-products/data-annotation-services-for-artificial-intelligence-and-data-tagx

TagX Data Annotation | Automated Annotation | AI-assisted labeling with human verification | Customized annotation | Data for AI & LLMs

Explore at:

.json, .xml, .csv, .xls, .txtAvailable download formats

Dataset updated

Aug 14, 2022

Dataset authored and provided by

TagX

Area covered

Egypt, Georgia, Sint Eustatius and Saba, Estonia, Comoros, Saint Barthélemy, Guatemala, Cabo Verde, Lesotho, Central African Republic

Description

TagX data annotation services are a set of tools and processes used to accurately label and classify large amounts of data for use in machine learning and artificial intelligence applications. The services are designed to be highly accurate, efficient, and customizable, allowing for a wide range of data types and use cases.

The process typically begins with a team of trained annotators reviewing and categorizing the data, using a variety of annotation tools and techniques, such as text classification, image annotation, and video annotation. The annotators may also use natural language processing and other advanced techniques to extract relevant information and context from the data.

Once the data has been annotated, it is then validated and checked for accuracy by a team of quality assurance specialists. Any errors or inconsistencies are corrected, and the data is then prepared for use in machine learning and AI models.

TagX annotation services can be applied to a wide range of data types, including text, images, videos, and audio. The services can be customized to meet the specific needs of each client, including the type of data, the level of annotation required, and the desired level of accuracy.

TagX data annotation services provide a powerful and efficient way to prepare large amounts of data for use in machine learning and AI applications, allowing organizations to extract valuable insights and improve their decision-making processes.

Clear search

Close search

Google apps

Main menu

TagX Data Annotation | Automated Annotation | AI-assisted labeling with...

15M+ Images | AI Training Data | Annotated imagery data for AI | Object &...

15M+ Images | AI Training Data | Annotated imagery data for AI | Object &...

Data Annotation and Collection Services Report

750K+ Furniture Images | AI Training Data | Object Detection Data |...

Data from: Region-based Annotation Data of Fire Images for Intelligent...

Image segmentations produced by BAMF under the AIMI Annotations initiative

File Overview

breast-fdg-pet-ct.zip

kidney-ct.zip

liver-ct.zip

liver-mr.zip

lung-ct.zip

lung-fdg-pet-ct.zip

prostate-mr.zip

Likert Score Definition:

Zip File Folder Structure

TagX - 10,000+ Car damage images with annotation | Car insurance &...

Data Annotation Platform Report

Bahasa Shopping List OCR Image Dataset

What’s Included

Re-ID Data | 600,000 ID | CCTV Data |Computer Vision Data| Identity Data

Sentinel-2 KappaZeta Cloud and Cloud Shadow Masks

Annotations for ACRIN-HNSCC-FDG-PET-CT Collection

Annotation Protocol

Important supplementary information and sample code

103,282-Images Driver Behavior Annotation Data

Image Data Labeling Service Report

Data Annotation Service Report

70,846 Images – Human Face Segmentation Data

Side Profile Tires Dataset Dataset

Custom Yolov7 On Kaggle On Custom Dataset

Custom Training with YOLOv7 🔥

Some Important links

Contact Information

Objective

To Showcase custom Object Detection on the Given Dataset to train and Infer the Model using newly launched YoloV7.

Data Acquisition

Custom Training with YOLOv7 🔥

📦 YOLOv7

Step 1: Install Requirements

Downloading YOLOV7 starting checkpoint

Step 2: Assemble Our Dataset

Version v2 Aug 12, 2022 Looks like this.

Step 3: Training Custom pretrained YOLOv7 model

Odia Handwritten Sticky Notes OCR Image Dataset

What’s Included

TagX Data Annotation | Automated Annotation | AI-assisted labeling with human verification | Customized annotation | Data for AI & LLMs