1 dataset found

f
Linguistic features: Extracted features.
plos.figshare.com
csv
Updated May 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He (2025). Linguistic features: Extracted features. [Dataset]. http://doi.org/10.1371/journal.pone.0324270.s003
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0324270.s003
Dataset updated
May 22, 2025
Dataset provided by
PLOS ONE
Authors
Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Impairment in the semantic domain is prominent in Alzheimer’s Disease (AD). We analyzed spontaneous speech in English from 148 people with probable AD (pAD) and 143 controls, and aimed to replicate these findings in a smaller Greek dataset of 28 controls and 26 pAD patients, using different language models comparatively. Static models (fastText) represented non-contextual meaning via encoding words as static vectors, while contextual models (BERT) represented the contextual meanings sensitive to syntactic structure. These models calculated semantic similarity at two levels: local similarity (between adjacent words/tokens) and global similarity (across all word/token pairs). Generative contextual models (Mistral) additionally quantified token probability within context, thereby indicating the unexpectedness in speech progression. Given that contextual meaning is syntactically sensitive, we introduced averaged dependency distance as an indicator for formal syntactic complexity. Moreover, bimodal models were introduced to evaluate how speech reflected picture-based stimuli. Results showed significant increases in global semantic similarity in the pAD group, as measured by both fastText and BERT models, which co-occurred with enlarged picture-speech semantic distance and increased in speech perplexity. Only the fastText-based global semantic similarity, which captured the contraction in conceptual semantic space, correlated with the overall cognitive decline in the AD populations. These findings together indicates that semantic space changes in AD differed across different forms of meanings and thus points to the necessity of distinguishing these forms to raveling the underlying mechanism.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He (2025). Linguistic features: Extracted features. [Dataset]. http://doi.org/10.1371/journal.pone.0324270.s003

Linguistic features: Extracted features.

Explore at:

2 scholarly articles cite this dataset (View in Google Scholar)

csvAvailable download formats

Unique identifier

https://doi.org/10.1371/journal.pone.0324270.s003

Dataset updated

May 22, 2025

Dataset provided by

PLOS ONE

Authors

Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Impairment in the semantic domain is prominent in Alzheimer’s Disease (AD). We analyzed spontaneous speech in English from 148 people with probable AD (pAD) and 143 controls, and aimed to replicate these findings in a smaller Greek dataset of 28 controls and 26 pAD patients, using different language models comparatively. Static models (fastText) represented non-contextual meaning via encoding words as static vectors, while contextual models (BERT) represented the contextual meanings sensitive to syntactic structure. These models calculated semantic similarity at two levels: local similarity (between adjacent words/tokens) and global similarity (across all word/token pairs). Generative contextual models (Mistral) additionally quantified token probability within context, thereby indicating the unexpectedness in speech progression. Given that contextual meaning is syntactically sensitive, we introduced averaged dependency distance as an indicator for formal syntactic complexity. Moreover, bimodal models were introduced to evaluate how speech reflected picture-based stimuli. Results showed significant increases in global semantic similarity in the pAD group, as measured by both fastText and BERT models, which co-occurred with enlarged picture-speech semantic distance and increased in speech perplexity. Only the fastText-based global semantic similarity, which captured the contraction in conceptual semantic space, correlated with the overall cognitive decline in the AD populations. These findings together indicates that semantic space changes in AD differed across different forms of meanings and thus points to the necessity of distinguishing these forms to raveling the underlying mechanism.

Clear search

Close search

Google apps

Main menu