1 dataset found
  1. f

    Linguistic features: Extracted features.

    • plos.figshare.com
    csv
    Updated May 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He (2025). Linguistic features: Extracted features. [Dataset]. http://doi.org/10.1371/journal.pone.0324270.s003
    Explore at:
    csvAvailable download formats
    Dataset updated
    May 22, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Impairment in the semantic domain is prominent in Alzheimer’s Disease (AD). We analyzed spontaneous speech in English from 148 people with probable AD (pAD) and 143 controls, and aimed to replicate these findings in a smaller Greek dataset of 28 controls and 26 pAD patients, using different language models comparatively. Static models (fastText) represented non-contextual meaning via encoding words as static vectors, while contextual models (BERT) represented the contextual meanings sensitive to syntactic structure. These models calculated semantic similarity at two levels: local similarity (between adjacent words/tokens) and global similarity (across all word/token pairs). Generative contextual models (Mistral) additionally quantified token probability within context, thereby indicating the unexpectedness in speech progression. Given that contextual meaning is syntactically sensitive, we introduced averaged dependency distance as an indicator for formal syntactic complexity. Moreover, bimodal models were introduced to evaluate how speech reflected picture-based stimuli. Results showed significant increases in global semantic similarity in the pAD group, as measured by both fastText and BERT models, which co-occurred with enlarged picture-speech semantic distance and increased in speech perplexity. Only the fastText-based global semantic similarity, which captured the contraction in conceptual semantic space, correlated with the overall cognitive decline in the AD populations. These findings together indicates that semantic space changes in AD differed across different forms of meanings and thus points to the necessity of distinguishing these forms to raveling the underlying mechanism.

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He (2025). Linguistic features: Extracted features. [Dataset]. http://doi.org/10.1371/journal.pone.0324270.s003

Linguistic features: Extracted features.

Related Article
Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
csvAvailable download formats
Dataset updated
May 22, 2025
Dataset provided by
PLOS ONE
Authors
Hong Jiang; Zhengwei Chen; Yu Liu; Chun Yang; Xiaofeng Yuan; Rui He
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Impairment in the semantic domain is prominent in Alzheimer’s Disease (AD). We analyzed spontaneous speech in English from 148 people with probable AD (pAD) and 143 controls, and aimed to replicate these findings in a smaller Greek dataset of 28 controls and 26 pAD patients, using different language models comparatively. Static models (fastText) represented non-contextual meaning via encoding words as static vectors, while contextual models (BERT) represented the contextual meanings sensitive to syntactic structure. These models calculated semantic similarity at two levels: local similarity (between adjacent words/tokens) and global similarity (across all word/token pairs). Generative contextual models (Mistral) additionally quantified token probability within context, thereby indicating the unexpectedness in speech progression. Given that contextual meaning is syntactically sensitive, we introduced averaged dependency distance as an indicator for formal syntactic complexity. Moreover, bimodal models were introduced to evaluate how speech reflected picture-based stimuli. Results showed significant increases in global semantic similarity in the pAD group, as measured by both fastText and BERT models, which co-occurred with enlarged picture-speech semantic distance and increased in speech perplexity. Only the fastText-based global semantic similarity, which captured the contraction in conceptual semantic space, correlated with the overall cognitive decline in the AD populations. These findings together indicates that semantic space changes in AD differed across different forms of meanings and thus points to the necessity of distinguishing these forms to raveling the underlying mechanism.

Search
Clear search
Close search
Google apps
Main menu