100+ datasets found

h
Total-Text-Dataset
huggingface.co
datasetninja.com
+2more
Updated Apr 30, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yunus Serhat Bıçakçı (2024). Total-Text-Dataset [Dataset]. https://huggingface.co/datasets/yunusserhat/Total-Text-Dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 30, 2024
Authors
Yunus Serhat Bıçakçı
Description
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind. Original github repo; https://github.com/cs-chan/Total-Text-Dataset Forked repo; https://github.com/yunusserhat/Total-Text-Dataset
R
Total Text Dataset
universe.roboflow.com
zip
Updated Feb 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Text Spotting (2025). Total Text Dataset [Dataset]. https://universe.roboflow.com/text-spotting/total-text-dataset
Explore at:
zipAvailable download formats
Dataset updated
Feb 28, 2025
Dataset authored and provided by
Text Spotting
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Text Bounding Boxes
Description
Total Text Dataset

## Overview Total Text Dataset is a dataset for object detection tasks - it contains Text annotations for 1,255 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Number of text messages sent in the U.S. 2004-2014
statista.com
Updated Dec 10, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2015). Number of text messages sent in the U.S. 2004-2014 [Dataset]. https://www.statista.com/statistics/215776/mobile-messaging-volumes-in-the-us/
Explore at:
Dataset updated
Dec 10, 2015
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2004 - 2014
Area covered
United States
Description
This statistic shows mobile messaging volumes in the U.S. for selected years between 2004 and 2014. In 2010, approximately ***** billion messages were sent in total, up from ** billion in 2004.

U.S. mobile messaging volumes - additional information

A total of around *** trillion text messages were sent in the United States in 2012, marking an almost tenfold increase on the figure from 2006. A further ** million MMS messages were sent in the country in 2012, an increase from * million in 2006. In 2013, the United States was the country with the highest average number of text messages sent per month and per mobile connection. Over *** messages were sent monthly per mobile connection in the United States, in comparison to *** in the United Kingdom and *** in Germany.

The most active age group for sending and receiving text messages in the United States were those aged 18 to 29, as ** percent of respondents said that they did use mobile messaging in 2013. By comparison, only ** percent of those aged 65 and older said that they used their mobile phone for text messaging in 2013.

Rather than using a mobile phone’s integrated text messaging service, many users are opting for third party apps to communicate. As of January 2015, mobile messaging service WhatsApp had around 700 million monthly active users, marking double the amount of users it had in October 2013. Within the U.S. market, iOS and Android users spent a total of 680 million minutes on WhatsApp in February 2013, with those aged between 25 and 34 years most likely to use the service in 2014.
Text Retrieval Conference (TREC) Total Recall collections
catalog.data.gov
data.nist.gov
Updated May 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2024). Text Retrieval Conference (TREC) Total Recall collections [Dataset]. https://catalog.data.gov/dataset/text-retrieval-conference-trec-total-recall-collections
Explore at:
Dataset updated
May 15, 2024
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description
This data was used in the TREC 2015 and 2016 total recall track. The goal of the total recall track was to help develop retrieval systems tuned to retrieving ALL relevant information, as opposed to common web search engines where one good answer could be sufficient.
Number of text messages sent in the U.S. 2005-2021
statista.com
Updated Jul 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Number of text messages sent in the U.S. 2005-2021 [Dataset]. https://www.statista.com/statistics/185879/number-of-text-messages-in-the-united-states-since-2005/
Explore at:
Dataset updated
Jul 8, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
United States
Description
In 2021, mobile users in the United States sent roughly 2 trillion SMS or MMS messages. Following a sharp drop off in 2012, the number of SMS and MMS messages sent in the U.S. has generally increased over the past several years to another peak in 2020, during the COVID pandemic, at 2.2 trillion SMS or MMS messages.
Total number of SMS and MMS messages sent in Turkey Q1 2019- Q1 2024
statista.com
Updated Jul 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Total number of SMS and MMS messages sent in Turkey Q1 2019- Q1 2024 [Dataset]. https://www.statista.com/statistics/1316954/turkey-number-of-sms-and-mms-sent/
Explore at:
Dataset updated
Jul 10, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Turkey
Description
The total number of SMS and MMS messages sent in Turkey mostly presented a diminishing trend, with some fluctuations from the 1st quarter of 2019 to the first quarter of 2024. The number of SMS messages sent went down to nearly *** billion in the first quarter of 2024 from **** billion in the first quarter of 2019. However, the number of MMS messages sent increased in the first quarter of 2024, and amounted to nearly ** million.
W
Total Secondary Schools Text Book for Selected Subjects
cloud.csiss.gmu.edu
csv, json, rdf, xml
Updated Oct 3, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Open Africa (2016). Total Secondary Schools Text Book for Selected Subjects [Dataset]. http://cloud.csiss.gmu.edu/dataset/7b88a4c8-c00e-478c-a34f-d2e5f9aeff09
Explore at:
xml, json, csv, rdfAvailable download formats
Dataset updated
Oct 3, 2016
Dataset provided by
Open Africa
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The Ministry of Educations' - Basic Education Statistical Booklet captures national statistics for the Education Sector in totality. This dataset details the number of English, Kiswahili, Maths, Biology, Chemistry and Physics subjects text books across the 47 counties. Source - The Ministry of Educations, Basic Education Statistical Booklet, Table 84: Total Secondary Schools Text Book for Selected Subjects
t
Total-Text dataset - Dataset - LDM
service.tib.eu
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Total-Text dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/total-text-dataset
Explore at:
Dataset updated
Dec 3, 2024
Description
The Total-Text dataset contains the text of various shapes, including horizontal, multi-orientational, and curved.
Open Text total equity 2020-2024
statista.com
Updated Oct 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Open Text total equity 2020-2024 [Dataset]. https://www.statista.com/statistics/1533663/open-text-total-equity/
Explore at:
Dataset updated
Oct 29, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Canada
Description
The total equity of Open Text with headquarters in Canada amounted to *** billion U.S. dollars in 2024. The reported fiscal year ends on June 30.Compared to the earliest depicted value from 2020 this is a total increase by approximately **** billion U.S. dollars. The trend from 2020 to 2024 shows, however, that this increase did not happen continuously.
h
total-text
huggingface.co
Updated Sep 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Louisa Lin (2024). total-text [Dataset]. https://huggingface.co/datasets/green-luigi/total-text
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 21, 2024
Authors
Louisa Lin
Description
green-luigi/total-text dataset hosted on Hugging Face and contributed by the HF Datasets community
P
COCO-Text Dataset
paperswithcode.com
Updated May 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andreas Veit; Tomas Matera; Lukas Neumann; Jiri Matas; Serge Belongie (2024). COCO-Text Dataset [Dataset]. https://paperswithcode.com/dataset/coco-text
Explore at:
Dataset updated
May 6, 2024
Authors
Andreas Veit; Tomas Matera; Lukas Neumann; Jiri Matas; Serge Belongie
Description
The COCO-Text dataset is a dataset for text detection and recognition. It is based on the MS COCO dataset, which contains images of complex everyday scenes. The COCO-Text dataset contains non-text images, legible text images and illegible text images. In total there are 22184 training images and 7026 validation images with at least one instance of legible text.
E
Total-Text-Dataset
live.european-language-grid.eu
Updated Dec 30, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2019). Total-Text-Dataset [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/5203
Explore at:
Dataset updated
Dec 30, 2019
License
https://opensource.org/licenses/BSD-3-Clausehttps://opensource.org/licenses/BSD-3-Clause
Description
This dataset consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
f
Illustrative text conversations excerpts between public health officials and...
plos.figshare.com
xls
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Richard T. Lester; Matthew Manson; Muhammed Semakula; Hyeju Jang; Hassan Mugabo; Ali Magzari; Junhong Ma Blackmer; Fanan Fattah; Simon Pierre Niyonsenga; Edson Rwagasore; Charles Ruranga; Eric Remera; Jean Claude S. Ngabonziza; Giuseppe Carenini; Sabin Nsanzimana (2025). Illustrative text conversations excerpts between public health officials and patients (cases and contacts) during the COVID-19 pandemic in Rwanda. [Dataset]. http://doi.org/10.1371/journal.pdig.0000625.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pdig.0000625.t003
Dataset updated
Jan 15, 2025
Dataset provided by
PLOS Digital Health
Authors
Richard T. Lester; Matthew Manson; Muhammed Semakula; Hyeju Jang; Hassan Mugabo; Ali Magzari; Junhong Ma Blackmer; Fanan Fattah; Simon Pierre Niyonsenga; Edson Rwagasore; Charles Ruranga; Eric Remera; Jean Claude S. Ngabonziza; Giuseppe Carenini; Sabin Nsanzimana
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Rwanda
Description
“…” indicates excluded text for brevity. Message types include: ‘(S):’–a ‘system’ message containing customizable content with automated sending; ‘(P):’–a ‘patient’ message containing patient remarks in free text form; and ‘(C):’–a ‘clinician’ message containing clinician remarks in free text form.
Z
The Threatening English Language (TEL) Corpus
data.niaid.nih.gov
explore.openaire.eu
Updated Sep 17, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gales, Tammy (2022). The Threatening English Language (TEL) Corpus [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6815670
Explore at:
Dataset updated
Sep 17, 2022
Dataset provided by
Nini, Andrea
Gales, Tammy
Symonds, Ellen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
TEL is the Threatening English Language corpus. It is a collection of 309 written texts compiled from the publicly-available portion of CTARC (the Communicated Threat Assessment Research Corpus, compiled by Tammy Gales), MFT (the Malicious Forensic Texts corpus, compiled by Andrea Nini), and the written portion of CoJO (the Corpus of Judicial Opinions, compiled by Julia Muschalik). Additional texts are from ForensicLing.com (the forensic linguistic data site hosted by Tammy Gales and Dakota Wing). Basic metadata is supplied for each text where known from the original case research. We wish to thank our graduate student fellows who helped compile the texts and metadata: Nicole Harris, Annina van Riper, Zara Rabinko, and Zachary Boudreaux.

Total texts: 309 Total estimated authors: 203 Total word count: 54,167

METADATA KEY

TG = Tammy Gales (public portion of CTARC) AN = Andrea Nini (MFT) JM = Julia Muschalik (written portion of CoJo) FL = ForensicLing.com (Tammy Gales and Dakota Wing)

Name###_## = file name, case number, text number within case File name might be threat recipient or author; remaining info is about the author, where known
SMS Co., Ltd. total equity 2020 to 2023
statista.com
Updated Mar 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). SMS Co., Ltd. total equity 2020 to 2023 [Dataset]. https://www.statista.com/statistics/1573798/sms-co-ltd-total-equity/
Explore at:
Dataset updated
Mar 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Japan
Description
The total equity of SMS Co., Ltd. with headquarters in Japan amounted to 44.28 billion Japanese yen in 2023. The reported fiscal year ends on March 31.Compared to the earliest depicted value from 2020 this is a total increase by approximately 21.62 billion Japanese yen. The trend from 2020 to 2023 shows, furthermore, that this increase happened continuously.
China: amount of text messages by month June 2018
statista.com
Updated Jul 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). China: amount of text messages by month June 2018 [Dataset]. https://www.statista.com/statistics/278205/china-amount-of-text-messages/
Explore at:
Dataset updated
Jul 9, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jun 2017 - Jun 2018
Area covered
China
Description
The graph shows the monthly amount of text messages in China from ********* to *********. In *********, about ***** billion text messages had been sent in China. Text messaging in China – additional information

There has been a significant decline in text messaging after the total number of text messages sent in China peaked in 2012 at *** billion. The decrease is even more noticeable in terms of text messages sent per person, taking account of the increasing number of registered mobile users in China. The reason for the continuous decline in text messaging is quite obvious; due to the growing popularity of smartphones and mobile internet, Chinese mobile users are preferring mobile messaging apps to share information. The usage of mobile message apps is almost universal among Chinese smartphone users; around ** percent of iPhone users in China are using WeChat, for example, the most popular Chinese messaging app developed by Tencent. As of the second quarter of 2015, the number of monthly active WeChat users has reached approximately *** million. Mobile message apps like WeChat are gaining rapid traction among Chinese users because they offer more than an alternative to texting. Voice messaging, also known as “push-to-talk”, was the most commonly used function of WeChat in 2014. One reason may be that Chinese language is relatively hard to type, so voice messaging could take full advantage in keeping the users hands free and saving a considerable amount of time. Besides, mobile message apps in China are even more appealing due to the inclusion of social media features: As of ********, about ** percent of WeChat users had used “moment”, a sharing feature allowing people to exchange stories, photos and short videos among their circle of friends. Moreover, China’s instant messaging apps like WeChat are expanding their services in sectors such as gaming, commercial promoting, online shopping, and even banking.
h
toxi-text-3M
huggingface.co
Updated Dec 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fred Zhang (2023). toxi-text-3M [Dataset]. https://huggingface.co/datasets/FredZhang7/toxi-text-3M
Explore at:
Dataset updated
Dec 12, 2023
Authors
Fred Zhang
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This is a large multilingual toxicity dataset with 3M rows of text data from 55 natural languages, all of which are written/sent by humans, not machine translation models. The preprocessed training data alone consists of 2,880,667 rows of comments, tweets, and messages. Among these rows, 416,529 are classified as toxic, while the remaining 2,463,773 are considered neutral. Below is a table to illustrate the data composition:

Toxic Neutral Total

multilingual-train-deduplicated.csv… See the full description on the dataset page: https://huggingface.co/datasets/FredZhang7/toxi-text-3M.
P
WOS Hierarchical Text Classification Dataset
paperswithcode.com
Updated Nov 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jaco du Toit; Herman Redelinghuys; Marcel Dunaiski (2024). WOS Hierarchical Text Classification Dataset [Dataset]. https://paperswithcode.com/dataset/wos-hierarchical-text-classification
Explore at:
Dataset updated
Nov 27, 2024
Authors
Jaco du Toit; Herman Redelinghuys; Marcel Dunaiski
Description
The WOS Hierarchical Text Classification are three dataset variants created from Web of Science (WOS) title and abstract data categorised into a hierarchical, multi-label class structure. The aim of the sampling and filtering methodology used was to create well-balanced class distributions (at chosen hierarchical levels). Furthermore, the WOS_JTF variant was also created with the aim to only contain publication data such that their class assignments results is classes instances that semantically more similar.

The three dataset variants have the following properties: 1. WOS_JT comprises 43,366 total samples (train=30356, dev=6505, test=6505) and only uses the journal-based classifications as labels. 2. WOS_CT comprises 65,200 total samples (train=45640, dev=9780, test=9780) and only uses citation-based classifications as labels. 3. WOS_JTF comprises 42,926 total samples (train=30048, dev=6439, test=6439) and uses a filtered set of papers based on journal and citation classification.

The dataset is available at:

https://huggingface.co/datasets/marcelsun/wos_hierarchical_multi_label_text_classification

Dataset details: *.json: - concatenated title and abstract mapped to a list each associated class label.

depth2label.pt: dictionary where: - key = depth of classification hierarchy. - value = list of classes associated with depth.

path_list.pt: - list of tuples for every edge between classes in the hierarchical classification. This specifies the acyclic graph.

slot.pt: dictionary where: - key = label_id of parent class - value = label_ids of children classes

value2slot.pt: dictionary where: - key = label_id - value = label_id of parent class

value_dict.pt: dictionary where: - key = label_id - value = string representation of class.
f
Hyperparameter settings of classification model.
plos.figshare.com
xls
Updated Oct 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lu Xiao; Qiaoxing Li; Qian Ma; Jiasheng Shen; Yong Yang; Danyang Li (2024). Hyperparameter settings of classification model. [Dataset]. http://doi.org/10.1371/journal.pone.0305095.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0305095.t006
Dataset updated
Oct 18, 2024
Dataset provided by
PLOS ONE
Authors
Lu Xiao; Qiaoxing Li; Qian Ma; Jiasheng Shen; Yong Yang; Danyang Li
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Text classification, as an important research area of text mining, can quickly and effectively extract valuable information to address the challenges of organizing and managing large-scale text data in the era of big data. Currently, the related research on text classification tends to focus on the application in fields such as information filtering, information retrieval, public opinion monitoring, and library and information, with few studies applying text classification methods to the field of tourist attractions. In light of this, a corpus of tourist attraction description texts is constructed using web crawler technology in this paper. We propose a novel text representation method that combines Word2Vec word embeddings with TF-IDF-CRF-POS weighting, optimizing traditional TF-IDF by incorporating total relative term frequency, category discriminability, and part-of-speech information. Subsequently, the proposed algorithm respectively combines seven commonly used classifiers (DT, SVM, LR, NB, MLP, RF, and KNN), known for their good performance, to achieve multi-class text classification for six subcategories of national A-level tourist attractions. The effectiveness and superiority of this algorithm are validated by comparing the overall performance, specific category performance, and model stability against several commonly used text representation methods. The results demonstrate that the newly proposed algorithm achieves higher accuracy and F1-measure on this type of professional dataset, and even outperforms the high-performance BERT classification model currently favored by the industry. Acc, marco-F1, and mirco-F1 values are respectively 2.29%, 5.55%, and 2.90% higher. Moreover, the algorithm can identify rare categories in the imbalanced dataset and exhibit better stability across datasets of different sizes. Overall, the algorithm presented in this paper exhibits superior classification performance and robustness. In addition, the conclusions obtained by the predicted value and the true value are consistent, indicating that this algorithm is practical. The professional domain text dataset used in this paper poses higher challenges due to its complexity (uneven text length, relatively imbalanced categories), and a high degree of similarity between categories. However, this proposed algorithm can efficiently implement the classification of multiple subcategories of this type of text set, which is a beneficial exploration of the application research of complex Chinese text datasets in specific fields, and provides a useful reference for the vector expression and classification of text datasets with similar content.
C
Replication Data for: Text Messaging Versus Postal Reminders to Improve...
dataverse.csuc.cat
tsv, txt
Updated Feb 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Núria Vives; Núria Vives; Gemma Binefa; Gemma Binefa; Noemie Travier; Noemie Travier; Albert Farre; Albert Farre; Jon Aritz Panera; Jon Aritz Panera; Berta Casas; Berta Casas; Carmen Vidal; Carmen Vidal; Gemma Ibañez-Sanz; Gemma Ibañez-Sanz; Montse Garcia; Montse Garcia; M-TICS research group; M-TICS research group (2025). Replication Data for: Text Messaging Versus Postal Reminders to Improve Participation in a Colorectal Cancer Screening Program: Randomized Controlled Trial [Dataset]. http://doi.org/10.34810/data1713
Explore at:
txt(4616), tsv(2821457)Available download formats
Unique identifier
https://doi.org/10.34810/data1713
Dataset updated
Feb 4, 2025
Dataset provided by
CORA.Repositori de Dades de Recerca
Authors
Núria Vives; Núria Vives; Gemma Binefa; Gemma Binefa; Noemie Travier; Noemie Travier; Albert Farre; Albert Farre; Jon Aritz Panera; Jon Aritz Panera; Berta Casas; Berta Casas; Carmen Vidal; Carmen Vidal; Gemma Ibañez-Sanz; Gemma Ibañez-Sanz; Montse Garcia; Montse Garcia; M-TICS research group; M-TICS research group
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset funded by
Instituto de Salud Carlos III
Description
The study aimed to assess the effectiveness of text messages as a replacement for routine postal reminders in a fecal immunochemical test (FIT) based colorectal cancer screening program in Catalonia. For that purpose, a randomized controlled trial was conducted. Study population: individuals aged 50 to 69 invited to screening who had not completed FIT within six weeks. The intervention group (n=12,167) received a text message reminder, and the control group (n=12,221) used the standard procedure (reminder letter). The primary outcome was a participation rate within 18 weeks of the invitation. The trial was discontinued, and a recovery strategy was implemented by sending a reminder letter to non-participant individuals from the intervention group. We performed a final analysis to determine the impact of the recovery strategy. Results: Interim analysis (n=7095) showed a lower participation rate among nonparticipants within six weeks in the text message group compared to the control group (16.4% vs. 20.9%, OR 0.71, 95% CI 0.63–0.81). A total of 7591 non-participants in the text message group received a second reminder by letter, reaching a participation rate of 23%. Final analysis (n=24,388) showed that the intervention group, which received two reminders, had higher participation than the control group (29.3% vs. 26.5%, OR 1.16, 95% CI 1.09–1.23). Our attempt to replace reminder letters with text messages was unsuccessful, but receiving two reminders significantly increased participation rates among non-participants within six weeks compared to one postal reminder. Additional research is essential to determine the best timing and frequency of reminders to boost participation without being intrusive in their choice of participation

Facebook

Twitter

Click to copy link

Link copied

Cite

Yunus Serhat Bıçakçı (2024). Total-Text-Dataset [Dataset]. https://huggingface.co/datasets/yunusserhat/Total-Text-Dataset

Total-Text-Dataset

yunusserhat/Total-Text-Dataset

Explore at:

246 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Apr 30, 2024

Authors

Yunus Serhat Bıçakçı

Description

Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind. Original github repo; https://github.com/cs-chan/Total-Text-Dataset Forked repo; https://github.com/yunusserhat/Total-Text-Dataset

Clear search

Close search

Google apps

Main menu

Total-Text-Dataset

Total Text Dataset

Total Text Dataset

Number of text messages sent in the U.S. 2004-2014

Text Retrieval Conference (TREC) Total Recall collections

Number of text messages sent in the U.S. 2005-2021

Total number of SMS and MMS messages sent in Turkey Q1 2019- Q1 2024

Total Secondary Schools Text Book for Selected Subjects

Total-Text dataset - Dataset - LDM

Open Text total equity 2020-2024

total-text

COCO-Text Dataset

Total-Text-Dataset

Illustrative text conversations excerpts between public health officials and...

The Threatening English Language (TEL) Corpus

SMS Co., Ltd. total equity 2020 to 2023

China: amount of text messages by month June 2018

toxi-text-3M

WOS Hierarchical Text Classification Dataset

Hyperparameter settings of classification model.

Replication Data for: Text Messaging Versus Postal Reminders to Improve...

Total-Text-Dataset

yunusserhat/Total-Text-Dataset