Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk

Sarah E. Morgan; Kelly Diederen; Petra E. Vértes; Samantha H. Y. Ip; Bo Wang; Bethany Thompson; Arsime Demjaha; Andrea De Micheli; Dominic Oliver; Maria Liakata; Paolo Fusar-Poli; Tom J. Spencer; Philip McGuire

doi:10.1038/s41398-021-01722-y

Back to publications

Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk

Sarah E. Morgan, Kelly Diederen, Petra E. Vértes, Samantha H. Y. Ip, Bo Wang, Bethany Thompson, Arsime Demjaha, Andrea De Micheli, Dominic Oliver, Maria Liakata, Paolo Fusar-Poli, Tom J. Spencer, Philip McGuire

Translational Psychiatry, 11(630), 2021.

Abstract

Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total $N = 54$). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.

Links

Cite this Paper

BibTeX


@Article{publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech,
  title = 	 {Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk},
  author = 	 {Morgan, Sarah E. and Diederen, Kelly and Vértes, Petra E. and Ip, Samantha H. Y. and Wang, Bo and Thompson, Bethany and Demjaha, Arsime and De Micheli, Andrea and Oliver, Dominic and Liakata, Maria and Fusar-Poli, Paolo and Spencer, Tom J. and McGuire, Philip},
  journal =      {Translational Psychiatry},
  year = 	 {2021},
  volume = 	 {11},
  number =       {630},
  doi = 	 {10.1038/s41398-021-01722-y},
  url = 	 {/publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech.html},
  abstract = 	 {Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total $N = 54$). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.}
}

Endnote

%0 Journal Article
%T Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk
%A Sarah E. Morgan
%A Kelly Diederen
%A Petra E. Vértes
%A Samantha H. Y. Ip
%A Bo Wang
%A Bethany Thompson
%A Arsime Demjaha
%A Andrea De Micheli
%A Dominic Oliver
%A Maria Liakata
%A Paolo Fusar-Poli
%A Tom J. Spencer
%A Philip McGuire
%J Translational Psychiatry
%D 2021	
%F publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech
%R 10.1038/s41398-021-01722-y
%U /publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech.html
%V 11
%N 630
%X Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total $N = 54$). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.

RIS


TY  - JOUR
TI  - Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk
AU  - Sarah E. Morgan
AU  - Kelly Diederen
AU  - Petra E. Vértes
AU  - Samantha H. Y. Ip
AU  - Bo Wang
AU  - Bethany Thompson
AU  - Arsime Demjaha
AU  - Andrea De Micheli
AU  - Dominic Oliver
AU  - Maria Liakata
AU  - Paolo Fusar-Poli
AU  - Tom J. Spencer
AU  - Philip McGuire
DA  - 2021/12/13	
ID  - publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech
VL  - 11
IS  - 630
DO  - 10.1038/s41398-021-01722-y
UR  - /publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech.html
AB  - Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total $N = 54$). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.
ER  -

APA


Morgan, S.E., Diederen, K., Vértes, P.E., Ip, S.H.Y., Wang, B., Thompson, B., Demjaha, A., De Micheli, A., Oliver, D., Liakata, M., Fusar-Poli, P., Spencer, T.J. & McGuire, P.. (2021). Natural Language Processing markers in First Episode Psychosis and People at Clinical High-risk. Translational Psychiatry 11(630) doi:10.1038/s41398-021-01722-y Available from /publications/assessing-psychosis-risk-using-quantitative-markers-of-disorganised-speech.html.