TY - JOUR
T1 - Collecting language, speech acoustics, and facial expression to predict psychosis and other clinical outcomes
T2 - strategies from the AMP® SCZ initiative
AU - Accelerating Medicines Partnership® Schizophrenia (AMP® SCZ)
AU - Bilgrami, Zarina R.
AU - Castro, Eduardo
AU - Agurto, Carla
AU - Liebenthal, Einat
AU - Ennis, Michaela
AU - Baker, Justin T.
AU - Scott, Isabelle
AU - Colton, Beau Luke
AU - Cho, Kang Ik K.
AU - Li, Linying
AU - Tamayo, Zailyn
AU - Henecks, Mara
AU - Rahimi Eichi, Habiballah
AU - Henry, Tae’lar
AU - Addington, Jean
AU - Alameda, Luis K.
AU - Arango, Celso
AU - Breitborde, Nicholas J.K.
AU - Broome, Matthew R.
AU - Cadenhead, Kristin S.
AU - Calkins, Monica E.
AU - Chen, Eric Yu Hai
AU - Choi, Jimmy
AU - Conus, Philippe
AU - Cornblatt, Barbara A.
AU - Ellman, Lauren M.
AU - Fusar-Poli, Paolo
AU - Gaspar, Pablo A.
AU - Gerber, Carla
AU - Glenthøj, Louise Birkedal
AU - Horton, Leslie E.
AU - Hui, Christy
AU - Kambeitz, Joseph
AU - Kambeitz-Ilankovic, Lana
AU - Keshavan, Matcheri S.
AU - Kim, Sung Wan
AU - Koutsouleris, Nikolaos
AU - Kwon, Jun Soo
AU - Langbein, Kerstin
AU - Mamah, Daniel
AU - Diaz-Caneja, Covadonga M.
AU - Mathalon, Daniel H.
AU - Mittal, Vijay A.
AU - Nordentoft, Merete
AU - Pearlson, Godfrey D.
AU - Perez, Jesus
AU - Perkins, Diana O.
AU - Powers, Albert R.
AU - Kahn, Rene S.
AU - Corcoran, Cheryl M.
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Speech-based detection of early psychosis is progressing at a rapid pace. Within this evolving field, the Accelerating Medicines Partnership® in Schizophrenia (AMP® SCZ) is uniquely positioned to deepen our understanding of how language and related behaviors reflect early psychosis. We begin with detailed standard operating procedures (SOPs) that govern every stage of collection. These SOPs specify how to elicit speech, capture facial expressions, and record acoustics in synchronized audio–video files—both on-site and through remote platforms. We then explain how we chose our sampling tasks, hardware, and software, and how we built streamlined pipelines for data acquisition, aggregation, and processing. Robust quality-assurance and quality-control (QA/QC) routines, along with standardized interviewer training and certification, ensure data integrity across sites. Using natural language processing parsers, large language models, and machine-learning classifiers, we analyzed Data Release 3.0 to uncover systematic grammatical markers of psychosis risk. Speakers at clinical high risk (CHR) produced more referential language but fewer adjectives, adverbs, and nouns than community controls (CC), a pattern that replicated across sampling tasks. Some effects were task-specific: CHR participants showed elevated use of complex syntactic embeddings in two elicitation conditions but not the third, underscoring the importance of the language sampling task. Together, these results demonstrate how computational linguistics can turn everyday speech into a scalable, objective biomarker, paving the way for earlier and more precise detection of psychosis. Video Link: https://vimeo.com/1112291965?fl=pl&fe=sh
AB - Speech-based detection of early psychosis is progressing at a rapid pace. Within this evolving field, the Accelerating Medicines Partnership® in Schizophrenia (AMP® SCZ) is uniquely positioned to deepen our understanding of how language and related behaviors reflect early psychosis. We begin with detailed standard operating procedures (SOPs) that govern every stage of collection. These SOPs specify how to elicit speech, capture facial expressions, and record acoustics in synchronized audio–video files—both on-site and through remote platforms. We then explain how we chose our sampling tasks, hardware, and software, and how we built streamlined pipelines for data acquisition, aggregation, and processing. Robust quality-assurance and quality-control (QA/QC) routines, along with standardized interviewer training and certification, ensure data integrity across sites. Using natural language processing parsers, large language models, and machine-learning classifiers, we analyzed Data Release 3.0 to uncover systematic grammatical markers of psychosis risk. Speakers at clinical high risk (CHR) produced more referential language but fewer adjectives, adverbs, and nouns than community controls (CC), a pattern that replicated across sampling tasks. Some effects were task-specific: CHR participants showed elevated use of complex syntactic embeddings in two elicitation conditions but not the third, underscoring the importance of the language sampling task. Together, these results demonstrate how computational linguistics can turn everyday speech into a scalable, objective biomarker, paving the way for earlier and more precise detection of psychosis. Video Link: https://vimeo.com/1112291965?fl=pl&fe=sh
UR - https://www.scopus.com/pages/publications/105026920939
U2 - 10.1038/s41537-025-00669-z
DO - 10.1038/s41537-025-00669-z
M3 - Article
AN - SCOPUS:105026920939
SN - 2334-265X
VL - 11
JO - Schizophrenia
JF - Schizophrenia
IS - 1
M1 - 125
ER -