100+ datasets found

HQ-Edit-data-demo
huggingface.co
Updated Jun 28, 2024
Cite
UCSC-VLAA (2024). HQ-Edit-data-demo [Dataset]. https://huggingface.co/datasets/UCSC-VLAA/HQ-Edit-data-demo
Explore at:
Croissant. Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 28, 2024
Dataset authored and provided by
UCSC-VLAA
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset Card for HQ-EDIT

HQ-Edit is a high-quality instruction-based image editing dataset with 197,350 edits in total. Unlike prior approaches that rely on attribute guidance or human feedback to build datasets, we devise a scalable data collection pipeline leveraging advanced foundation models, namely GPT-4V and DALL-E 3. HQ-Edit’s high-resolution images, rich in detail and accompanied by comprehensive editing prompts, substantially enhance the capabilities of existing image editing… See the full description on the dataset page: https://huggingface.co/datasets/UCSC-VLAA/HQ-Edit-data-demo.
Video Editing Services in Brazil - 1,690 Verified Listings Database
poidata.io
csv, excel, json
Updated Jul 2, 2025
Cite
Poidata.io (2025). Video Editing Services in Brazil - 1,690 Verified Listings Database [Dataset]. https://www.poidata.io/report/video-editing-service/brazil
Explore at:
Available download formats: csv, json, excel
Dataset updated
Jul 2, 2025
Dataset provided by
Poidata.io
Area covered
Brazil
Description
Comprehensive dataset of 1,690 video editing services in Brazil as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Intended for market research, lead generation, competitive analysis, and business intelligence. A complimentary sample is available for download to evaluate data quality and completeness.
APE Shared Task WMT17: Human Post-edits Test Data EN-DE
lindat.cz
live.european-language-grid.eu
+2more
Updated Oct 16, 2017
Cite
Marco Turchi; Rajen Chatterjee; Matteo Negri (2017). APE Shared Task WMT17: Human Post-edits Test Data EN-DE [Dataset]. https://lindat.cz/repository/xmlui/handle/11372/LRT-2483
Explore at:
Dataset updated
Oct 16, 2017
Authors
Marco Turchi; Rajen Chatterjee; Matteo Negri
License
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21
Description
Human post-edited test sentences for the WMT 2017 Automatic Post-Editing task. The set consists of 2,000 German sentences from the IT domain, already tokenized. Source and target segments can be downloaded from: https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-2133. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Data from: Post-edited and error annotated machine translation corpus PErr...
live.european-language-grid.eu
binary format
Updated May 23, 2016
Cite
(2016). Post-edited and error annotated machine translation corpus PErr 1.0 [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/8212
Explore at:
Available download formats: binary format
Dataset updated
May 23, 2016
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
The PE²rr corpus contains source language texts from different domains along with their automatically generated translations into several morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. The main advantage of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not.
Processed bystander editing data
figshare.com
bin
Updated Jun 12, 2020
Cite
Max Shen (2020). Processed bystander editing data [Dataset]. http://doi.org/10.6084/m9.figshare.10678097.v1
Explore at:
Available download formats: bin
Unique identifier
https://doi.org/10.6084/m9.figshare.10678097.v1
Dataset updated
Jun 12, 2020
Dataset provided by
Figshare (http://figshare.com/)
Authors
Max Shen
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Python 3 pickled dictionaries. Keys are target site names; values are pandas DataFrames in which each row is a unique editing outcome, with one column per substrate nucleotide plus a frequency column.
Video Editing Services in Rhode Island, United States - 25 Verified Listings...
poidata.io
csv, excel, json
Updated Jul 12, 2025
Cite
Poidata.io (2025). Video Editing Services in Rhode Island, United States - 25 Verified Listings Database [Dataset]. https://www.poidata.io/report/video-editing-service/united-states/rhode-island
Explore at:
Available download formats: csv, json, excel
Dataset updated
Jul 12, 2025
Dataset provided by
Poidata.io
Area covered
Rhode Island, United States
Description
Comprehensive dataset of 25 video editing services in Rhode Island, United States as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Intended for market research, lead generation, competitive analysis, and business intelligence. A complimentary sample is available for download to evaluate data quality and completeness.
Data from: Learning to Edit Interactive Machine Learning Notebooks
zenodo.org
bin
Updated Jun 23, 2025
Cite
Bihui Jin; Jiayue Wang; Pengyu Nie (2025). Learning to Edit Interactive Machine Learning Notebooks [Dataset]. http://doi.org/10.5281/zenodo.15716537
Explore at:
Available download formats: bin
Unique identifier
https://doi.org/10.5281/zenodo.15716537
Dataset updated
Jun 23, 2025
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Bihui Jin; Jiayue Wang; Pengyu Nie
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Machine learning (ML) developers frequently use interactive computational notebooks, such as Jupyter notebooks, to host code for data processing and model training. Notebooks provide a convenient tool for writing ML pipelines and interactively observing outputs. However, maintaining notebooks, e.g., to add new features or fix bugs, can be challenging due to the length and complexity of the ML pipeline code. Moreover, there is no existing benchmark related to developer edits on notebooks.
In this paper, we present early results of the first study on learning to edit ML pipeline code in notebooks using large language models (LLMs). We collect the first dataset of 48,398 notebook edits derived from 20,095 revisions of 792 ML-related GitHub repositories. Our dataset captures granular details of file-level and cell-level modifications, offering a foundation for understanding real-world maintenance patterns in ML pipelines. We observe that the edits on notebooks are highly localized. Although LLMs have been shown to be effective on general-purpose code generation and editing, our results reveal that the same LLMs, even after finetuning, have low accuracy on notebook editing, demonstrating the complexity of real-world ML pipeline maintenance tasks. Our findings emphasize the critical role of contextual information in improving model performance and point toward promising avenues for advancing LLMs' capabilities in engineering ML code.
Video Editing Services in New Mexico, United States - 30 Verified Listings...
poidata.io
csv, excel, json
Updated Jul 9, 2025
Cite
Poidata.io (2025). Video Editing Services in New Mexico, United States - 30 Verified Listings Database [Dataset]. https://www.poidata.io/report/video-editing-service/united-states/new-mexico
Explore at:
Available download formats: csv, excel, json
Dataset updated
Jul 9, 2025
Dataset provided by
Poidata.io
Area covered
New Mexico, United States
Description
Comprehensive dataset of 30 video editing services in New Mexico, United States as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Intended for market research, lead generation, competitive analysis, and business intelligence. A complimentary sample is available for download to evaluate data quality and completeness.
NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 01/01/2018 -...
healthdata.gov
application/rdfxml +5
Updated Apr 8, 2022
Cite
(2022). NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 01/01/2018 - aaa7-r9su - Archive Repository [Dataset]. https://healthdata.gov/dataset/NCCI-Procedure-to-Procedure-Edits-PTP-Quarter-Begi/966m-c3td
Explore at:
Available download formats: xml, application/rssxml, csv, tsv, json, application/rdfxml
Dataset updated
Apr 8, 2022
Description
This dataset tracks the updates made on the dataset "NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 01/01/2018" as a repository for previous versions of the data and metadata.
Video Editing Services in Phitsanulok, Thailand - 2 Verified Listings...
poidata.io
csv, excel, json
Updated Jun 28, 2025
Cite
Poidata.io (2025). Video Editing Services in Phitsanulok, Thailand - 2 Verified Listings Database [Dataset]. https://www.poidata.io/report/video-editing-service/thailand/phitsanulok
Explore at:
Available download formats: csv, json, excel
Dataset updated
Jun 28, 2025
Dataset provided by
Poidata.io
Area covered
Phitsanulok, Thailand
Description
Comprehensive dataset of 2 video editing services in Phitsanulok, Thailand as of June 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Intended for market research, lead generation, competitive analysis, and business intelligence. A complimentary sample is available for download to evaluate data quality and completeness.
CGBE - processed editing efficiency data
figshare.com
txt
Updated Jun 29, 2021
Cite
Max Shen (2021). CGBE - processed editing efficiency data [Dataset]. http://doi.org/10.6084/m9.figshare.12275654.v1
Explore at:
Available download formats: txt
Unique identifier
https://doi.org/10.6084/m9.figshare.12275654.v1
Dataset updated
Jun 29, 2021
Dataset provided by
Figshare (http://figshare.com/)
Authors
Max Shen
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
CSVs containing designed sgRNA-target sites and base editing outcomes across multiple replicates.
NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 04/01/2020 -...
healthdata.gov
application/rdfxml +5
Updated Apr 8, 2022
Cite
(2022). NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 04/01/2020 - akta-g454 - Archive Repository [Dataset]. https://healthdata.gov/dataset/NCCI-Procedure-to-Procedure-Edits-PTP-Quarter-Begi/rxa6-2khy
Explore at:
Available download formats: csv, xml, application/rssxml, json, tsv, application/rdfxml
Dataset updated
Apr 8, 2022
Description
This dataset tracks the updates made on the dataset "NCCI Procedure to Procedure Edits (PTP) Quarter Beginning 04/01/2020" as a repository for previous versions of the data and metadata.
Hospital Cost Report Edited Data Print Image: 2010
gimi9.com
healthdata.gov
+1more
Updated Mar 11, 2013
Cite
(2013). Hospital Cost Report Edited Data Print Image: 2010 [Dataset]. https://gimi9.com/dataset/ny_cf7i-99p5/
Explore at:
Dataset updated
Mar 11, 2013
License
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The Institutional Cost Report (ICR) is a uniform report completed by New York hospitals to report income, expenses, assets, liabilities, and statistics to the Department of Health (DOH). Under DOH regulations, (Part 86-1.2), Article 28 hospitals are required to file financial and statistical data with DOH annually. The data filed is part of the ICR and is received electronically through a secured network. This data is used to develop Medicaid rates, assist in the formulation of reimbursement methodologies, and analyze trends. This dataset includes the print image of the edited data. The ICR is a comprehensive compilation of exhibits that have been modified over time that users should consider when using the ICR dataset. It is possible that data is updated subsequent to posting on this website; therefore the data could become obsolete. To get the details related to the exhibits and data elements, please refer to the blank ICR form, the ICR Table of Contents, the ICR Instructions and the Glossary of Terms, Acronyms, and Abbreviations which are in the Supporting Information section of this site. The data posted as edited contains desk edit adjustments by DOH personnel. In 2009, this information was not audited; however effective with the 2010 ICR, all ICRs will be audited by a Certified Public Accounting Firm annually.
Data from: PartialEdit: Identifying Partial Deepfakes in the Era of Neural...
zenodo.org
application/gzip, csv
Updated May 27, 2025
Cite
You Zhang; Baotong Tian; Lin Zhang; Zhiyao Duan (2025). PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing [Dataset]. http://doi.org/10.5281/zenodo.15519188
Explore at:
Available download formats: application/gzip, csv
Unique identifier
https://doi.org/10.5281/zenodo.15519188
Dataset updated
May 27, 2025
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
You Zhang; Baotong Tian; Lin Zhang; Zhiyao Duan
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is part of the dataset we curated based on VCTK to study partial speech deepfake detection in the era of neural speech editing. For more details, please refer to our Interspeech 2025 paper: "PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing".

In the paper, we curated four subsets: E1: VoiceCraft, E2: SSR-Speech, E3: Audiobox-Speech, and E4: Audiobox. Adhering to Audiobox's license, we cannot release the E3 and E4 subsets.

The folder structure is as follows:

PartialEdit/
├── PartialEdit_E1E2.csv
├── E1/
│   ├── p225/
│   │   ├── p225_001_edited_partial_16k.wav
│   │   ├── p225_002_edited_partial_16k.wav
│   │   └── ...
│   ├── p231/
│   │   ├── p231_001_edited_partial_16k.wav
│   │   ├── p231_002_edited_partial_16k.wav
│   │   └── ...
│   └── ...
├── E1-Codec/
│   └── (same structure as E1)
├── E2/
│   └── (same structure as E1)
├── E2-Codec/
│   └── (same structure as E1)
└── modified_txt/
    ├── p225/
    │   ├── p225_001_modified.txt
    │   ├── p225_002_modified.txt
    │   ├── p225_003_modified.txt
    │   └── ...
    ├── p231/
    │   ├── p231_001_modified.txt
    │   ├── p231_002_modified.txt
    │   └── ...
    └── ...

This is version 1.0, and we will include links to the paper and demo page soon.

The `PartialEdit_E1E2.csv` file contains information about the edited regions in each audio file. Each row has the following columns:

- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.

If there are two edited regions within a file, the row format expands to include:

- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `start of the second edited region (s)`: The starting time (in seconds) of the second edited region.
- `end of the second edited region (s)`: The ending time (in seconds) of the second edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.
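
The two row layouts described above (four fields for one edited region, six fields for two) can be read with a small parser. The following is a minimal sketch, not the authors' tooling: it assumes the rows carry no header line and that the fields are comma-separated in the order listed. The `EditedFile` type, the function names, and the sample rows are hypothetical and made up for illustration.

```python
import csv
import io
from typing import NamedTuple


class EditedFile(NamedTuple):
    filename: str
    regions: list[tuple[float, float]]  # (start, end) pairs in seconds
    total_duration: float


def parse_row(row: list[str]) -> EditedFile:
    """Parse one row: filename, one or two (start, end) pairs, total duration."""
    if len(row) not in (4, 6):
        raise ValueError(f"expected 4 or 6 fields, got {len(row)}: {row!r}")
    values = [float(v) for v in row[1:]]
    times, total = values[:-1], values[-1]
    # Pair up consecutive values as (start, end) regions.
    regions = [(times[i], times[i + 1]) for i in range(0, len(times), 2)]
    return EditedFile(row[0], regions, total)


def parse_csv(text: str) -> list[EditedFile]:
    """Parse CSV text, skipping empty lines. Assumes there is no header row."""
    return [parse_row(r) for r in csv.reader(io.StringIO(text)) if r]


# Made-up sample rows for illustration only (not taken from the dataset).
SAMPLE = (
    "p225_001_edited_partial_16k.wav,0.50,1.25,3.00\n"
    "p225_002_edited_partial_16k.wav,0.10,0.40,1.50,2.00,4.00\n"
)
parsed = parse_csv(SAMPLE)
for item in parsed:
    print(item.filename, item.regions, item.total_duration)
```

If the real file does start with a header line, `parse_row` raises `ValueError` on it, so the first row would need to be skipped before parsing.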

To make sure the download is complete, you can check the MD5 checksums with the following command:

md5sum *
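
Where `md5sum` is not available, the same check can be sketched in Python with the standard `hashlib` module. This is a minimal sketch under stated assumptions: the function names are hypothetical, and the `expected` mapping stands for reference checksums you have obtained yourself (for example, from the dataset's hosting page); none are given in this description.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def md5_of_directory(directory: Path) -> dict[str, str]:
    """Map each regular file directly inside `directory` to its MD5 digest."""
    return {
        p.name: md5_of_file(p)
        for p in sorted(directory.iterdir())
        if p.is_file()
    }


def find_mismatches(directory: Path, expected: dict[str, str]) -> list[str]:
    """List file names that are missing or whose digest differs from `expected`."""
    actual = md5_of_directory(directory)
    return [name for name, md5 in expected.items() if actual.get(name) != md5]
```

Like `md5sum *`, this only covers files at the top level of the directory; checking the extracted audio folders would need a recursive walk such as `Path.rglob`.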
OmniEdit-Filtered-1.2M
huggingface.co
Updated Nov 12, 2024
Cite
TIGER-Lab (2024). OmniEdit-Filtered-1.2M [Dataset]. https://huggingface.co/datasets/TIGER-Lab/OmniEdit-Filtered-1.2M
Explore at:
Croissant. Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 12, 2024
Dataset authored and provided by
TIGER-Lab
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
OmniEdit

In this paper, we present OMNI-EDIT, an omnipotent editor that handles seven different image editing tasks with any aspect ratio seamlessly. Our contribution is fourfold: (1) OMNI-EDIT is trained using supervision from seven different specialist models to ensure task coverage. (2) We use importance sampling based on the scores provided by large multimodal models (like GPT-4o) instead of CLIP-score to improve the data quality. 📃Paper | 🌐Website |… See the full description on the dataset page: https://huggingface.co/datasets/TIGER-Lab/OmniEdit-Filtered-1.2M.
APE Shared Task WMT17: Human Post-edits Test Data DE-EN
lindat.mff.cuni.cz
live.european-language-grid.eu
+1more
Updated Oct 17, 2017
Cite
Marco Turchi; Rajen Chatterjee; Matteo Negri (2017). APE Shared Task WMT17: Human Post-edits Test Data DE-EN [Dataset]. https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-2485?show=full
Explore at:
Dataset updated
Oct 17, 2017
Authors
Marco Turchi; Rajen Chatterjee; Matteo Negri
License
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21
Description
Human post-edited test sentences for the WMT 2017 Automatic Post-Editing task. The set consists of 2,000 English sentences from the IT domain, already tokenized. Source and target segments can be downloaded from: https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-2132. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Data from: RNA A-to-I editing, environmental exposure, and human diseases
tandf.figshare.com
xlsx
Updated Jun 9, 2023
Cite
Akin Cayir (2023). RNA A-to-I editing, environmental exposure, and human diseases [Dataset]. http://doi.org/10.6084/m9.figshare.16552991.v1
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.6084/m9.figshare.16552991.v1
Dataset updated
Jun 9, 2023
Dataset provided by
Taylor & Francis
Authors
Akin Cayir
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Epigenetic modifications have gained attention because they can potentially be changed by environmental stimuli and can be associated with adverse health outcomes. The epitranscriptome field has begun to attract attention in several respects, since RNA modifications have been linked with critical biological processes and implicated in diseases. Several RNA modifications have been identified as reversible, indicating the dynamic nature of modifications that can be altered by environmental cues. Currently, more than 150 RNA modifications are known, in different organisms and on different bases, involving various chemical groups. RNA editing, one such RNA modification, occurs after transcription and results in an RNA sequence that differs from its corresponding DNA sequence. Emerging evidence reveals the functions of RNA editing as well as the association between RNA editing and diseases. However, the RNA editing field is still maturing and needs more empirical evidence with regard to disease and toxicology. Thus, this review aims to provide the current evidence-based studies on RNA editing modifying genes for genotoxicity and cancer. The review presents the association between environmental xenobiotic exposure and RNA editing modifying genes and focuses on the association between the expression of RNA editing modifying genes and cancer. Furthermore, we discuss future directions for scientific studies in the area of RNA modifications, especially in the RNA editing field, and provide a knowledge-based framework for further studies.
Data from: The majority of transcripts in the squid nervous system are...
datadryad.org
zenodo.org
zip
Updated Jan 13, 2015
Cite
Shahar Alon; Sandra C. Garrett; Erez Y. Levanon; Sara Olson; Brenton R. Graveley; Joshua J. C. Rosenthal; Eli Eisenberg (2015). The majority of transcripts in the squid nervous system are extensively recoded by A-to-I RNA editing [Dataset]. http://doi.org/10.5061/dryad.2hv7d
Explore at:
Available download formats: zip
Unique identifier
https://doi.org/10.5061/dryad.2hv7d
Dataset updated
Jan 13, 2015
Dataset provided by
Dryad
Authors
Shahar Alon; Sandra C. Garrett; Erez Y. Levanon; Sara Olson; Brenton R. Graveley; Joshua J. C. Rosenthal; Eli Eisenberg
Time period covered
2015
Description
Supplementary File 1: A text file in FASTA format with the constructed squid coding sequences. (EisenbergSI_Data1.txt)
Supplementary File 2: A spreadsheet with all the A-to-G modification sites detected in the coding regions of the squid, along with their number of supporting reads in all the tissues studied. (EisenbergSI_Table1.xlsx)
SelfTargeting h5ad data encoded into 912 classes
figshare.com
application/x-gzip
Updated Jul 19, 2023
Cite
Wergillius Zheng (2023). SelfTargeting h5ad data encoded into 912 classes [Dataset]. http://doi.org/10.6084/m9.figshare.22807238.v5
Explore at:
Available download formats: application/x-gzip
Unique identifier
https://doi.org/10.6084/m9.figshare.22807238.v5
Dataset updated
Jul 19, 2023
Dataset provided by
Figshare (http://figshare.com/)
Authors
Wergillius Zheng
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Filenames are formatted as: CellType_Repeat_Coding.h5ad

Each h5ad file contains:

- event frequency, stored in adata.X
- event description, saved in adata.var
- metadata for each Guide-Target pair, saved in adata.obs

To download and unzip the file:

- copy the link address
- within a terminal: wget --no-check-certificate LINK
- within a terminal: mv 2 2.zip
- within a terminal: unzip 2.zip
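
The CellType_Repeat_Coding.h5ad naming convention described above can be split into its three parts before the files are opened. This is a minimal sketch using only the standard library: the `H5adName` type, the `parse_h5ad_name` function, and the example file name are hypothetical. It also assumes that only the cell-type part may itself contain underscores, which the description does not confirm.

```python
from pathlib import Path
from typing import NamedTuple


class H5adName(NamedTuple):
    cell_type: str
    repeat: str
    coding: str


def parse_h5ad_name(filename: str) -> H5adName:
    """Split 'CellType_Repeat_Coding.h5ad' into its three components."""
    path = Path(filename)
    if path.suffix != ".h5ad":
        raise ValueError(f"not an .h5ad file: {filename!r}")
    # Split from the right so that underscores in the cell type are preserved.
    parts = path.stem.rsplit("_", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected CellType_Repeat_Coding.h5ad, got {filename!r}")
    return H5adName(*parts)


# Hypothetical file name, for illustration only.
print(parse_h5ad_name("K562_rep1_onehot.h5ad"))
```

The parsed fields can then be used to group files by cell type or replicate before loading them with an h5ad reader.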
Video Editing Services in Illinois, United States - 327 Verified Listings...
poidata.io
csv, excel, json
Updated Jul 11, 2025
Cite
Poidata.io (2025). Video Editing Services in Illinois, United States - 327 Verified Listings Database [Dataset]. https://www.poidata.io/report/video-editing-service/united-states/illinois
Explore at:
Available download formats: csv, excel, json
Dataset updated
Jul 11, 2025
Dataset provided by
Poidata.io
Area covered
Illinois, United States
Description
Comprehensive dataset of 327 video editing services in Illinois, United States as of July 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Intended for market research, lead generation, competitive analysis, and business intelligence. A complimentary sample is available for download to evaluate data quality and completeness.
