Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Arabic End-of-Utterance Dataset
Dataset Description
This dataset contains 5,000 Arabic samples for End-of-Utterance (EOU) detection, designed specifically for Saudi-dialect conversational AI applications.

Purpose: Train models to detect when a speaker has finished their conversational turn in Arabic dialogue.
Dataset Statistics
Attribute        Value
Total Samples    5,000
Real SADA22      531 (10.6%)
Synthetic        4,469 (89.4%)
EOU Samples      3,655 (73.1%)
…

See the full description on the dataset page: https://huggingface.co/datasets/HossamEL-Dein/arabic-eou-dataset.
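The percentages in the statistics above can be re-derived from the raw counts. The following is a minimal sketch, not part of the dataset's own tooling: the `share` helper is hypothetical, and the non-EOU count is inferred under the assumption that every sample carries a binary EOU / non-EOU label, which the truncated table does not confirm.

```python
# Counts copied from the Dataset Statistics table.
TOTAL = 5000
REAL_SADA22 = 531
SYNTHETIC = 4469
EOU = 3655


def share(count: int, total: int = TOTAL) -> float:
    """Return `count` as a percentage of `total`, rounded to one decimal place."""
    return round(100 * count / total, 1)


# The real and synthetic subsets together account for every sample.
assert REAL_SADA22 + SYNTHETIC == TOTAL

# Assumption: labels are binary, so the remaining samples are non-EOU.
non_eou = TOTAL - EOU

print(f"Real SADA22: {share(REAL_SADA22)}%")
print(f"Synthetic:   {share(SYNTHETIC)}%")
print(f"EOU:         {share(EOU)}%")
print(f"Non-EOU (inferred): {non_eou} ({share(non_eou)}%)")
```

If the binary-label assumption holds, the classes are imbalanced toward EOU, which may be worth accounting for when choosing evaluation metrics.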