12 datasets found

Data from: raw_data
huggingface.co
Updated Nov 26, 2025
Cite
target-bench (2025). raw_data [Dataset]. https://huggingface.co/datasets/target-bench/raw_data
Dataset updated
Nov 26, 2025
Dataset authored and provided by
target-bench
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Please temporarily download the raw data from: https://drive.google.com/drive/folders/1ATaUy2VKGyvh6DzuvkMm_DSSAvOEgdH0?usp=drive_link We will upload it here as soon as possible. Please note that the full raw data requires approximately 270 GB of storage.
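Before starting a download of this size, it may help to confirm that the target drive has enough room. The following is a minimal sketch using only the Python standard library. The 270 GB figure comes from the description above; the function name `has_room_for`, the decimal-gigabyte convention, and the checked path are assumptions made for illustration.

```python
import shutil

REQUIRED_GB = 270  # approximate size quoted in the dataset description


def has_room_for(path: str, required_gb: float = REQUIRED_GB) -> bool:
    """Return True if the filesystem holding `path` has at least `required_gb` free.

    Assumes decimal gigabytes (1 GB = 10**9 bytes); the description does not
    say which unit convention it uses.
    """
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * 10**9


if __name__ == "__main__":
    # "." is a placeholder; point this at the intended download directory.
    print("Enough free space:", has_room_for("."))
```

Leaving some headroom above the quoted figure is sensible, since archives usually need extra space while being unpacked.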
SWE-Bench Coding Tasks Dataset
kaggle.com
zip
Updated Oct 3, 2025
Cite
Unidata (2025). SWE-Bench Coding Tasks Dataset [Dataset]. https://www.kaggle.com/datasets/unidpro/fermatix-swe-bench
Available download formats: zip (146556 bytes)
Dataset updated
Oct 3, 2025
Authors
Unidata
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0) (https://creativecommons.org/licenses/by-nc-nd/4.0/)
License information was derived automatically
Description
SWE-Bench Dataset

The dataset comprises 8,712 files across 6 programming languages, featuring verified tasks and benchmarks for evaluating coding agents and language models. It introduces new benchmarks with real-world coding tasks, providing datasets for software engineering problems and tests. It builds upon the original swe-bench by evaluating repository-level challenges and scoring performances.

By utilizing this dataset with its multi-language test sets and golden patches, researchers and developers can advance their understanding of large language models and developer tools, comparing their performance on real software engineering challenges.

Specifically engineered for evaluating advanced coding and software development, SWE-Bench Dataset supports research in code generation, automated patching, and fixing GitHub issues.

💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

Example of the data

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F27063537%2F6876a1091e5e4e12d330177c6ec3a0e6%2F1.PNG?generation=1759494538704549&alt=media

The dataset provides a robust foundation for achieving higher accuracy in code generation and advancing automated software development tools, which are essential for improving developer productivity and software quality.

🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects
Data from: Storage of cuttings before and after grafting influences survival...
scielo.figshare.com
jpeg
Updated Jun 3, 2023
Cite
Rafael Henrique Pertille; Marcos Robson Sachet; Marieli Teresinha Guerrezi; Chaiane Renata Grigolo; Idemir Citadin (2023). Storage of cuttings before and after grafting influences survival and vigor of vine grafts [Dataset]. http://doi.org/10.6084/m9.figshare.14291514.v1
Available download formats: jpeg
Unique identifier
https://doi.org/10.6084/m9.figshare.14291514.v1
Dataset updated
Jun 3, 2023
Dataset provided by
SciELO (http://www.scielo.org/)
Authors
Rafael Henrique Pertille; Marcos Robson Sachet; Marieli Teresinha Guerrezi; Chaiane Renata Grigolo; Idemir Citadin
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
ABSTRACT Several grafting methods have been developed, and bench grafting with stratification is the most widely used technique, except in Brazil, where it is still being adapted. The objective of this study was to evaluate how long plant material can be stored before grafting and the optimum temperature for stratification. Cultivar 'Paulsen 1103' was used as rootstock and 'Niagara Rosada' as scion cultivar. The storage period treatments were 0, 30, 60 and 90 days at a temperature of 3 °C and 95% relative humidity. After the storage period, the branches were removed from the cold chamber, grafted, and then placed at 19 °C and 24 °C for stratification. After 21 days of stratification, the vine grafts were planted in commercial substrate and left to grow for 160 days. The vine cuttings of cultivars Niagara and Paulsen 1103 can be stored in a cold chamber at 3 °C for 90 days and, during this period, bench grafting can be performed at any time. However, the vines from cuttings stored in the cold chamber for more than 30 days have better growth. It is recommended to stratify the vine grafts at 19 °C.
WebGen-Bench
kaggle.com
zip
Updated May 1, 2025
Cite
Zimu Lu (2025). WebGen-Bench [Dataset]. https://www.kaggle.com/datasets/zimulu/webgen-bench/discussion
Available download formats: zip (940119 bytes)
Dataset updated
May 1, 2025
Authors
Zimu Lu
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) (https://creativecommons.org/licenses/by-nc-sa/4.0/)
License information was derived automatically
Description
WebGen-Bench

WebGen-Bench was created to benchmark the ability of LLM-based agents to generate websites from scratch. The dataset is introduced in the paper "WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch". It contains 101 instructions and 647 test cases. It also has a training set of 6667 instructions, named WebGen-Instruct.

The evaluation code, the training code, and the data are released at WebGen-Bench (GitHub).

Categories

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F11818724%2Ff21b9227c91890850d045450adbb8528%2F2025-05-08%20213306.png?generation=1746711223375173&alt=media

Data Curation and Testing Pipelines

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F11818724%2Fc48f3086fc2c1dc95e5bd99511cd559d%2F2025-05-08%20213431.png?generation=1746711286481320&alt=media

Citation

If you find our project useful, please cite:

@misc{lu2025webgenbenchevaluatingllmsgenerating,
  title={WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch},
  author={Zimu Lu and Yunqiao Yang and Houxing Ren and Haotian Hou and Han Xiao and Ke Wang and Weikang Shi and Aojun Zhou and Mingjie Zhan and Hongsheng Li},
  year={2025},
  eprint={2505.03733},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2505.03733}
}
DrVD-Bench
kaggle.com
huggingface.co
zip
Updated May 22, 2025
Cite
Tianhong Zhou (2025). DrVD-Bench [Dataset]. https://www.kaggle.com/datasets/tianhongzhou/drvd-bench/versions/4
Available download formats: zip (7024171213 bytes)
Dataset updated
May 22, 2025
Authors
Tianhong Zhou
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?


This repository is the official implementation of the paper: DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?

Introduction

Vision–language models (VLMs) exhibit strong zero-shot generalization on natural images and show early promise in interpretable medical image analysis. However, existing benchmarks do not systematically evaluate whether these models truly reason like human clinicians or merely imitate superficial patterns.
To address this gap, we propose DrVD-Bench, the first multimodal benchmark for clinical visual reasoning. DrVD-Bench consists of three modules: Visual Evidence Comprehension, Reasoning Trajectory Assessment, and Report Generation Evaluation, comprising 7,789 image–question pairs.
Our benchmark covers 20 task types, 17 diagnostic categories, and five imaging modalities—CT, MRI, ultrasound, X-ray, and pathology. DrVD-Bench mirrors the clinical workflow from modality recognition to lesion identification and diagnosis.
We benchmark 19 VLMs (general-purpose & medical-specific, open-source & proprietary) and observe that performance drops sharply as reasoning complexity increases. While some models begin to exhibit traces of human-like reasoning, they often rely on shortcut correlations rather than grounded visual understanding. DrVD-Bench therefore provides a rigorous framework for developing clinically trustworthy VLMs.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F26863010%2F57e63a57a9aa29bd8e502dbfeb16e834%2Fcover_image.jpeg?generation=1747904850177313&alt=media

Quick Start

Prepare Environment

pip3 install -r requirements.txt

Obtain DeepSeek API Key

Report generation uses DeepSeek to extract report keywords. Models with weaker instruction-following ability can also use DeepSeek to extract answers from their outputs.
You can apply for an API key on the DeepSeek platform.
For more details, please refer to the official documentation: DeepSeek API Docs.

Obtain Model Outputs

Download the dataset from Kaggle or Hugging Face.

Run inference with your model and append the results to the model_response field in the corresponding files.

model_response format requirements

visual_evidence_qa.jsonl / independent_qa.jsonl: Single letter A / B / C …

joint_qa.jsonl: List containing only letters, separated by commas, e.g., ['B','D','A']

report_generation.jsonl: Full string
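The format requirements above can be checked before computing metrics. The sketch below is one possible validator, not part of the DrVD-Bench repository. It assumes that option letters are single uppercase characters, that the joint answer is stored as an actual list (not a string representation of one), and that lists and reports must be non-empty. The function name `is_valid_response` is hypothetical.

```python
import re

_LETTER = re.compile(r"[A-Z]")  # assumption: options are single uppercase letters

SINGLE_CHOICE_FILES = ("visual_evidence_qa.jsonl", "independent_qa.jsonl")


def is_valid_response(task_file: str, response) -> bool:
    """Check a model_response value against the format listed for its task file."""
    if task_file in SINGLE_CHOICE_FILES:
        # Single letter such as "A", "B", "C"
        return isinstance(response, str) and _LETTER.fullmatch(response) is not None
    if task_file == "joint_qa.jsonl":
        # List containing only letters, e.g. ['B', 'D', 'A']
        return (
            isinstance(response, list)
            and len(response) > 0
            and all(isinstance(x, str) and _LETTER.fullmatch(x) is not None for x in response)
        )
    if task_file == "report_generation.jsonl":
        # Full report as one string
        return isinstance(response, str) and response.strip() != ""
    raise ValueError(f"Unknown task file: {task_file}")


if __name__ == "__main__":
    print(is_valid_response("joint_qa.jsonl", ["B", "D", "A"]))
    print(is_valid_response("visual_evidence_qa.jsonl", "The answer is B"))
```

Responses that fail a check like this are candidates for the mapping script described later in this section.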

Inference Example Using Qwen-2.5-VL-72B API

The Qwen-2.5-VL-72B API can be obtained on the Alibaba Cloud Bailian platform.

· task - joint_qa.jsonl

~~~bash
python qwen2.5vl_example.py \
  --API_KEY="your_qwen_api_key" \
  --INPUT_PATH="/path/to/joint_qa.jsonl" \
  --OUTPUT_PATH="/path/to/result.jsonl" \
  --IMAGE_ROOT='path/to/benchmark/data/root' \
  --type="joint"
~~~

· other tasks

~~~bash
python qwen2.5vl_example.py \
  --API_KEY="your_qwen_api_key" \
  --INPUT_PATH="/path/to/qa.jsonl" \
  --OUTPUT_PATH="/path/to/result.jsonl" \
  --IMAGE_ROOT='path/to/benchmark/data/root' \
  --type="single"
~~~

Mapping Script

Intended for models with weaker instruction-following ability: if your model cannot produce outputs in the format above, you can use the following script to extract option answers from the model_response field:

~~~bash
python map.py \
  --API_KEY="your_deepseek_api_key" \
  --INPUT_FILE="/path/to/model_result.jsonl" \
  --OUTPUT_FILE="/path/to/model_result_mapped.jsonl"
~~~

Compute Metrics

task - visual_evidence_qa.jsonl / independent_qa.jsonl

python compute_choice_metric.py --json_path="/path/to/results.jsonl" --type='single'

task - joint_qa.jsonl

python compute_choice_metric.py --json_path="/path/to/results.jsonl" --type='joint'

task - report_generation.jsonl

python report_generation_metric.py --API_KEY='your_deepseek_api_key' --JSON_PATH='/path/to/results.jsonl'

Contact

Tianhong Zhou ·
Data from Transnational Mod Languages (09-2018)/05 TML Website/TML Website...
data.wu.ac.at
docx
Updated Oct 2, 2018
Cite
Arts (2018). Data from Transnational Mod Languages (09-2018)/05 TML Website/TML Website Storage/5 TML NEWS/A Bench on the road [Dataset]. https://data.wu.ac.at/schema/data_bris_ac_uk_data_/ODNjNTc3MGItNjFkNi00MGE3LTg2NmMtZDNmMWIzYTNmODdi
Available download formats: docx (41503.0)
Dataset updated
Oct 2, 2018
Dataset provided by
Arts
License
http://www.nationalarchives.gov.uk/doc/non-commercial-government-licence/non-commercial-government-licence.htm
Description
Data from Transnationalizing Modern Languages (09-2018)

Transnationalizing Modern Languages: Mobility, Identity and Translation in Modern Italian Cultures (TML) (funded by the AHRC under the ‘Translating Cultures’ theme, 2014-17)

PI Charles Burdett, University of Bristol. CIs Jenny Burns (Warwick), Loredana Polezzi (Warwick/Cardiff), Derek Duncan (St Andrews), Margaret Hills de Zarate (QMU)

RAs: Barbara Spadaro (Bristol), Carlo Pirozzi (St Andrews), Marco Santello (Warwick), Naomi Wells (Warwick), Luisa Percopo (Cardiff)

PhD students: Iacopo Colombini (St Andrews), Georgia Wall (Warwick)

Below is a short description of the project. Within the repository, there is a longer description of TML and each folder is accompanied by an explanatory text.

The project investigates practices of linguistic and cultural interchange within communities and individuals and explores the ways in which cultural translation intersects with linguistic translation in the everyday lives of people. The project has used as its primary object of enquiry the 150-year history of Italy as a nation state and its patterns of emigration and immigration. TML has concentrated on a series of exemplary cases, representative of the geographic, historical and linguistic map of Italian mobility. Focussing on the cultural associations that each community has formed, it examines the wealth of publications and materials that are associated with these organizations.

Working closely with researchers from across Modern Languages, the project has sought to demonstrate the principle that language is most productively apprehended in the frame of translation and the national in the frame of the transnational. TML is contributing to the development of a new framework for the disciplinary field of MLs, one which puts the interaction of languages and cultures at its core.

The principles of co-production and co-research lie at the core of the project and TML has worked closely with a very extensive range of partners. It has worked closely with Castlebrae and Drummond Community High Schools and with cultural associations across the world. The project exhibition, featuring the research of the project and including the work of photographer Mario Badagliacca, was curated by Viviana Gravano and Giulia Grechi of Routes Agency. Project events in the UK have drawn on the expertise of Rita Wilson (Monash), the writer Shirin Ramzanali Fazel and all members of the Advisory Board. The project, in close collaboration with the University of Namibia (UNAM) and the Phoenix Project (Cardiff), has been followed by ‘TML: Global Challenges’.
OST-Bench
kaggle.com
zip
Updated Aug 2, 2025
Cite
Jingli Lin (2025). OST-Bench [Dataset]. https://www.kaggle.com/datasets/jinglilin/ost-bench/code
Available download formats: zip (27480542142 bytes)
Dataset updated
Aug 2, 2025
Authors
Jingli Lin
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22426573%2Fcf73fe47ffc203579aaf448ac99e60ab%2Fbenchmark_samples.png?generation=1747364225904852&alt=media

This is the OST dataset from our work, OST-Bench: an Online Spatio-temporal Scene Understanding benchmark.

The OST_bench_v0.json file contains 1.4k multi-turn sessions (10k samples in total) for evaluation; the OST_bench_training_v0.json file contains 7k multi-turn sessions (50k samples in total) for training. Each multi-turn session represents an agent's exploration and contains a set of multi-turn dialogues. Each sample conforms to the following dictionary format:

{
    scan_id(str): The scan ID,
    system_prompt(str): The system prompt to input to the model,
    user_message(str): A list of multi-turn dialogue.
}

The "user_message" field contains the multi-turn dialogues with the agent. Each turn is represented as a dictionary in the following format, and each path in "image_paths" refers to an image in the img directory:

{
    turn_id(int): the index of the turn, acting as a timestamp,
    type(str): the subtype of the question,
    origin_question(str): the original version of the question,
    answer(str): the answer to the question,
    option(list[str]): the options,
    image_paths(list[str]): a list of new observations,
    prompt(str): the prompt input into the model together with the new observations for each turn,
}

Our data samples include diverse question types, covering three major categories (Agent State, Agent Visible Info and Agent-object Spatial Relationship) and four distinct question formats (Judgement / Counting / Temporal-localization / Estimation). For details on how to use this data for model evaluation, please refer to our code repository.
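The two dictionary formats above can be written down as typed structures, which makes it easier to walk the turns of a session. This is a sketch under stated assumptions, not code from the OST-Bench repository. The description annotates `user_message` as `str` but also calls it a list of turn dictionaries; the sketch assumes the latter. The class names, the helper `count_turns_by_type`, and the sample values are hypothetical.

```python
from typing import TypedDict


class Turn(TypedDict):
    turn_id: int            # index of the turn, acts as a timestamp
    type: str               # subtype of the question
    origin_question: str
    answer: str
    option: list[str]
    image_paths: list[str]  # new observations, relative to the img directory
    prompt: str


class Session(TypedDict):
    scan_id: str
    system_prompt: str
    user_message: list[Turn]  # assumption: a list of turns, not a plain string


def count_turns_by_type(sessions: list[Session]) -> dict[str, int]:
    """Count how many turns of each question subtype appear across sessions."""
    counts: dict[str, int] = {}
    for session in sessions:
        for turn in session["user_message"]:
            counts[turn["type"]] = counts.get(turn["type"], 0) + 1
    return counts


if __name__ == "__main__":
    # Made-up sample for illustration only; real values come from the JSON files.
    demo: list[Session] = [{
        "scan_id": "demo_scan",
        "system_prompt": "You are an agent exploring a scene.",
        "user_message": [{
            "turn_id": 0, "type": "demo_type", "origin_question": "q",
            "answer": "A", "option": ["A", "B"], "image_paths": ["img/0.jpg"],
            "prompt": "p",
        }],
    }]
    print(count_turns_by_type(demo))
```

How the sessions are arranged at the top level of each JSON file is not stated in the description, so the loading step is left out here.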

The evaluation images are stored in img.zip, while the training images are split into two parts: img_train_part.zip and img_train_part.z01.
Battery Test Bench Report
datainsightsmarket.com
doc, pdf, ppt
Updated Aug 20, 2025
Cite
Data Insights Market (2025). Battery Test Bench Report [Dataset]. https://www.datainsightsmarket.com/reports/battery-test-bench-1575343
Available download formats: pdf, doc, ppt
Dataset updated
Aug 20, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The Battery Test Bench market is experiencing robust growth, driven by the burgeoning electric vehicle (EV) industry and the increasing demand for energy storage solutions. The market's expansion is fueled by stringent government regulations promoting EV adoption globally, alongside the continuous advancements in battery technology requiring rigorous testing procedures. This necessitates sophisticated test benches capable of evaluating various battery parameters, including performance, safety, and lifespan under diverse operating conditions. While precise market sizing data wasn't provided, considering the rapid growth of the EV sector and the crucial role of battery testing, a reasonable estimate for the 2025 market size could be placed in the range of $2.5 billion to $3 billion. Assuming a conservative Compound Annual Growth Rate (CAGR) of 15% over the forecast period (2025-2033), the market is projected to reach a significant size by 2033, driven by factors like increasing EV production, grid-scale energy storage deployments, and expanding research and development in battery technologies. Key market segments include those catering to different battery chemistries (Li-ion, solid-state, etc.), testing types (cell, module, pack), and application areas (EVs, grid storage, portable electronics). Major players like FEV, HORIBA, Simpro, Chroma ATE, and others are actively investing in R&D and expanding their product portfolios to capitalize on this expanding market. However, challenges remain, including the high cost of advanced testing equipment and the need for standardized testing protocols across different regions. Furthermore, the relatively longer lead times for customized testing solutions can pose a constraint to immediate market expansion. Nevertheless, the long-term outlook remains positive, given the continued growth of the EV and renewable energy sectors, driving demand for robust and reliable battery test benches for years to come.
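The description gives a 2025 estimate and a growth rate but leaves the 2033 figure unstated. Under the standard compound-growth formula, value after n years = base × (1 + rate)^n, the implied range can be worked out. The sketch below assumes 2025 is the base year (so 8 compounding years to 2033) and takes the $2.5 billion to $3 billion range and the 15% CAGR directly from the paragraph above; the function name is hypothetical.

```python
def project_market_size(base_value: float, cagr: float, years: int) -> float:
    """Compound a base value forward: base * (1 + cagr) ** years."""
    return base_value * (1.0 + cagr) ** years


if __name__ == "__main__":
    years = 2033 - 2025  # assumption: 2025 is the base year, giving 8 periods
    for base_billion in (2.5, 3.0):
        projected = project_market_size(base_billion, 0.15, years)
        print(f"${base_billion:.1f}B in 2025 -> ${projected:.2f}B in 2033")
```

With these inputs the projection comes out at roughly $7.6 billion to $9.2 billion for 2033. That range is only as reliable as the report's own estimates, which it describes as assumptions.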
Induction Motor Data Set
kaggle.com
zip
Updated Jul 21, 2024
Cite
Marius Stender (2024). Induction Motor Data Set [Dataset]. https://www.kaggle.com/datasets/stender/induction-motor-data-set/data
Available download formats: zip (1149315364 bytes)
Dataset updated
Jul 21, 2024
Authors
Marius Stender
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4739409%2Fbb6a5e7edd4e7dc2e50e22ce5454042e%2Finbox_3819783_abe43385305442deceb5a0019e62e1ea_UPB_LEA_Headder_300dpi.png?generation=1587388001472129&alt=media

Context

The data set comprises sensor data collected from a typical induction motor drive deployed on a test bench. Measurements from the electrical, thermal and mechanical domains are included. The test bench measurements were collected at the LEA department of Paderborn University.

Content

The data set comprises approximately 262 hours of test bench measurements in the complete operating range of the exemplary drive system.

Detailed description

A comprehensive description of the data set can be found in the following paper (freely available):

https://www.researchgate.net/publication/382249181_Data_Set_Description_Induction_Motor_with_Thermal_Sensing_in_Rotor_and_Stator

Acknowledgements

This work was funded by the German Research Foundation (DFG) under the reference number 389029890.

Links

Department of Power Electronics and Electrical Drives, Paderborn University, Germany

License

Data files © Original Authors
Electric Motor Temperature
kaggle.com
zip
Updated Apr 26, 2021
Cite
Kirgsn (2021). Electric Motor Temperature [Dataset]. https://www.kaggle.com/datasets/wkirgsn/electric-motor-temperature/code
Available download formats: zip (122212134 bytes)
Dataset updated
Apr 26, 2021
Authors
Kirgsn
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) (https://creativecommons.org/licenses/by-sa/4.0/)
License information was derived automatically
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F3819783%2Fabe43385305442deceb5a0019e62e1ea%2FUPB_LEA_Headder_300dpi.png?generation=1585167925099468&alt=media

UPDATE 26.04.2021

All data is deanonymized now. Moreover, 17 additional measurement profiles were added, expanding the dataset from 138 hours to 185 hours of records.

Context

The data set comprises sensor data collected from a permanent magnet synchronous motor (PMSM) deployed on a test bench. The PMSM represents a German OEM's prototype model. Test bench measurements were collected by the LEA department at Paderborn University.

Content

All recordings are sampled at 2 Hz. The data set consists of multiple measurement sessions, which can be distinguished from each other by column "profile_id". A measurement session can be between one and six hours long.
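Because every row is one sample at 2 Hz, the length of a measurement session follows directly from its row count: duration in seconds = rows / 2. The sketch below groups a sequence of "profile_id" values and converts counts to hours. It assumes one row per sample with no gaps; the function name and the example IDs are made up for illustration.

```python
from collections import Counter

SAMPLE_RATE_HZ = 2.0  # sampling rate stated in the dataset description


def session_hours(profile_ids) -> dict:
    """Map each profile_id to its session length in hours, from row counts."""
    counts = Counter(profile_ids)
    return {pid: n / SAMPLE_RATE_HZ / 3600.0 for pid, n in counts.items()}


if __name__ == "__main__":
    # Hypothetical column: 7200 rows for one session, 14400 rows for another.
    column = [4] * 7200 + [6] * 14400
    print(session_hours(column))
```

In practice the column would be read from the CSV file, for example with the `csv` module or pandas, and passed in as-is.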

The motor is excited by hand-designed driving cycles denoting a reference motor speed and a reference torque. Currents in d/q-coordinates (columns "i_d" and "i_q") and voltages in d/q-coordinates (columns "u_d" and "u_q") are a result of a standard control strategy trying to follow the reference speed and torque. Columns "motor_speed" and "torque" are the resulting quantities achieved by that strategy, derived from set currents and voltages.

Most driving cycles denote random walks in the speed-torque-plane in order to imitate real world driving cycles to a more accurate degree than constant excitations and ramp-ups and -downs would.

Acknowledgements

Several publications leveraged the setup of the PMSM in the Paderborn University Lab:

Temperature Estimation on the same motor but different data

Determination of rotor temperature for an interior permanent magnet synchronous machine using a precise flux observer

Investigation of Long Short-Term Memory Networks to Temperature Prediction for Permanent Magnet Synchronous Motors

Real-Time Capable Methods to Determine the Magnet Temperature of Permanent Magnet Synchronous Motors — A Review

Improved Fusion of Permanent Magnet Temperature Estimation Techniques for Synchronous Motors Using a Kalman Filter

Fusion of a lumped-parameter thermal network and speed-dependent flux observer for PM temperature estimation in synchronous machines

Fusion of direct and indirect temperature estimation techniques for permanent magnet synchronous motors

Sensitivity analysis of a permanent magnet temperature observer for PM synchronous machines using the Monte Carlo method

Observing the Permanent-Magnet Temperature of Synchronous Motors Based on Electrical Fundamental Wave Model Quantities

Global Identification of a Low-Order Lumped-Parameter Thermal Network for Permanent Magnet Synchronous Motors

Glocal identification methods for low-order lumped parameter thermal networks used in permanent magnet synchronous motors

Temperature Estimation based on this dataset

Please cite the following publication if you intend to use this dataset for your own publications:

Estimating Electric Motor Temperatures with Deep Residual Machine Learning. Code available on GitHub upb-lea/deep-pmsm

Inspiration

The most interesting target features are rotor temperature ("pm"), stator temperatures ("stator_*") and torque. Especially rotor temperature and torque are not reliably and economically measurable in a commercial vehicle.

Being able to have strong estimators for the rotor temperature helps the automotive industry to manufacture motors with less material and enables control strategies to utilize the motor to its maximum capability. A precise torque estimate leads to more accurate and adequate control of the motor, reducing power losses and eventually heat build-up.

Other electric motor related data sets

Voltage converter data

Currents and motor speed per switching state

Torque characteristics
Inverter Data Set
kaggle.com
zip
Updated Aug 6, 2020
Cite
Marius Stender (2020). Inverter Data Set [Dataset]. https://www.kaggle.com/stender/inverter-data-set
Available download formats: zip (23656681 bytes)
Dataset updated
Aug 6, 2020
Authors
Marius Stender
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4739409%2Fbb6a5e7edd4e7dc2e50e22ce5454042e%2Finbox_3819783_abe43385305442deceb5a0019e62e1ea_UPB_LEA_Headder_300dpi.png?generation=1587388001472129&alt=media

Context

The data set comprises sensor data collected from a typical combined system of an inverter, an induction motor, and a control system, deployed on a test bench. Test bench measurements were collected by the LEA department at Paderborn University.

An inverter is a power electronic component with transistors (read 'switches') that determine how the battery voltage (the so-called DC-link voltage) is applied to the three phase circuits of the electric motor. At each discrete point in time, the control unit decides the switching states of the inverter according to some control strategy.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F642765%2Fff29577cce2f7018785f91a5d1a3805c%2FScreenshot%20from%202020-10-27%2011-39-16.png?generation=1603795173235540&alt=media

Content

The data set comprises approximately 235 thousand samples in the complete operating range of an exemplary drive system.

Rows follow no particular order.

Inspiration

The most important aspect of an electric vehicle from a marketing and engineering perspective is its efficiency and, thus, achievable range. For this, it is essential to avoid over-dimensioning of the drive train, i.e. applying more and heavier metal packs to increase its thermal capabilities. If the motor is controlled inefficiently through the inverter, there'll be superfluous power losses, i.e. heat build-up, which eventually leads to electric power derating during operation and, crucially, early depletion of the battery.

Precise phase voltage information is mandatory in order to enable an accurate, efficient or high dynamic control performance of electric motor drives, especially if a torque-controlled operation is considered. However, most electrical drives do not measure the phase voltages online due to their cost implications, and, therefore, these have to be estimated by inverter models.

Because of various nonlinear switching effects partly at nanosecond scale, an analytical white-box modeling approach is hardly feasible in a control context. Hence, data-driven inverter models seem favorable for this purpose.

Since the control utilizes pulse width modulation (PWM), the mean phase voltages for each PWM interval are the targets of the inverter models.
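To make the target quantity concrete: the mean voltage over one PWM interval is the time-weighted average of the instantaneous voltage during that interval. The sketch below computes it for a piecewise-constant waveform. It is an idealized illustration and not the data set's own method; it ignores the nonlinear switching effects mentioned above, which are the reason data-driven models are considered in the first place. The function name and the example numbers are assumptions.

```python
def mean_voltage(segments) -> float:
    """Time-weighted mean of a piecewise-constant voltage.

    `segments` is an iterable of (duration_seconds, voltage_volts) pairs that
    together cover one PWM interval.
    """
    segments = list(segments)
    total_time = sum(duration for duration, _ in segments)
    if total_time <= 0:
        raise ValueError("PWM interval must have positive duration")
    return sum(duration * voltage for duration, voltage in segments) / total_time


if __name__ == "__main__":
    # Hypothetical ideal case: 100 microsecond interval, upper switch on for
    # 30 percent of it, 560 V DC-link voltage, 0 V for the rest.
    print(mean_voltage([(30e-6, 560.0), (70e-6, 0.0)]))
```

In this ideal case the result reduces to duty cycle times DC-link voltage. A measured mean would differ from that value because of the switching effects the paragraph above describes.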

Detailed description

A comprehensive description of the data set can be found in the following paper (freely available): Data Set Description: Three-Phase IGBT Two-Level Inverter for Electrical Drives DOI: 10.13140/RG.2.2.23335.37280

Acknowledgements

This work was funded by the German Research Foundation (DFG) under the reference number 389029890.

Links

Department of Power Electronics and Electrical Drives, Paderborn University, Germany
VM Images for Deduplication
kaggle.com
zip
Updated Jan 23, 2025
Cite
Sreeharsha Udayashankar (2025). VM Images for Deduplication [Dataset]. https://www.kaggle.com/datasets/sreeharshau/vm-deb-fast25
Available download formats: zip (41374998680 bytes)
Dataset updated
Jan 23, 2025
Authors
Sreeharsha Udayashankar
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
This is a dataset of VM images from the VMware marketplace, mainly intended for use within data deduplication projects. This dataset is compatible with the DedupBench framework on GitHub.

Note that this is the full DEB dataset used in our VectorCDC paper from FAST 2025. Please cite our paper using the citation below if you are using this dataset.

Udayashankar, S., Baba, A. and Al-Kiswany, S., 2025, February. VectorCDC: Accelerating Data Deduplication with Vector Instructions. In 23rd USENIX Conference on File and Storage Technologies (FAST '25). USENIX Association.

@inproceedings {305256,
  author = {Sreeharsha Udayashankar and Abdelrahman Baba and Samer Al-Kiswany},
  title = {{VectorCDC}: Accelerating Data Deduplication with Vector Instructions},
  booktitle = {23rd USENIX Conference on File and Storage Technologies (FAST 25)},
  year = {2025},
  isbn = {978-1-939133-45-8},
  address = {Santa Clara, CA},
  pages = {513--522},
  url = {https://www.usenix.org/conference/fast25/presentation/udayashankar},
  publisher = {USENIX Association},
  month = feb
}
