THE USAGE OF NATURAL LANGUAGE PROCESSING METHODS TO DETECT THE SYMPTOMS OF MENTAL ILLNESS

Authors

  • S. D. Pogorilyy Taras Shevchenko National University of Kyiv
  • A. A. Kramov Taras Shevchenko National University of Kyiv

DOI:

https://doi.org/10.11603/mie.1996-1960.2020.1.11125

Keywords:

detection of the symptoms of mental illness, natural language processing, incoherence model, tangentiality model, semantic representation of the text's elements, classification model

Abstract

Background. The process of the detection of the symptoms of mental illness is a complicated task that requires the appropriate level of the qualification of a specialist to solve it. One part of the diagnostics of such diseases is the analysis of the patient's speech. Alogia (the poverty of speech), the lack of the persistent focus on a topic, incoherent speech, permanent usage of metaphors can indicate the availability of appropriate symptoms. Thus, it is necessary to apply the different automated methods of the estimation of the patient's speech in order to detect some deviations from defined statistical data. Such methods fall into the category of natural language processing. Taking into account the lack of unified structure, the availability of ambiguous terms, the tasks of natural language processing cannot be solved with the usage of defined algorithms. Search for regularities and the detection of the connection between the text's elements are performed using the methods of machine learning: regression models, decision trees, deep learning (multilayer neural networks). Thus, it is advisable to consider state-of-the-art methods, based on different methods of machine learning, to detect the symptoms of mental illness by analyzing the patient's speech. The purpose of the work is the following: to perform the comparative analysis of different state-of-the-art methods of the detection of the symptoms of mental illness based on the methods of natural language processing; to make the experimental examination of the effectiveness of the proposed method based on the analysis of the connectivity of the text's elements.

Materials and methods. Results. According to the analysis of state-of-the-art methods, the semantic coherence is the main feature of a text to predict mental illness. Two different models based on the estimation of semantic coherence are considered: tangentiality model and incoherence model. The main idea of the tangentiality model consists in the detection of the persistent deviation of the topic of an answer from a question. A text is divided into windows — sets of words with a fixed length. Each window and question are represented as vectors using a pre-trained semantic embedding model — LSA. The similarity between a window and the question is calculated as the cosine distance between corresponding vectors. Using the set of calculated distances, linear regression is built. The steeper slope of a line indicates the deviation of the thoughts of a speaker from the whole topic of a conversation. In comparison to the tangentiality model, the incoherence model processes a text at the level of sentences. All sentences are represented as the average vector interpretation of its words; each word is represented as a vector using a pre-trained semantic embedding model. Then three different features are calculated to form a feature vector: minimum first-order coherence (minimum similarity between two sentences that is estimated as a cosine distance between corresponding vectors), maximum sentence length, and the frequency of the usage of additional uninformative words. This dataset is used to build a convex hull classifier that divides interviews of healthy and ill people. The key disadvantage of both mentioned models is the neglect of the repeats of phrases within a text. Moreover, such repeats can complicate the classification process. In order to solve it, the different combination of state-of-the-art semantic embedding models (Word2Vec, Sent2Vec, GloVe) with frequency algorithms (TF-IDF, SIF) can be used. The disadvantage of such an approach is the dependency on an additional corpus to calculate statistical data about the frequency of words' usage. As for the effectiveness of each model for different languages, it depends on the collected dataset and the unique features of a separate language. Except for the semantic coherence, other linguistic characteristics

can be taken into account to form a feature vector: linguistic complexity, linguistic density, syntactic complexity. Each of these characteristics can be represented with the corresponding set of metrics. Moreover, the frequent usage of ambiguous pronouns may also be taken into account because it can indicate the disorganization of the thoughts of a speaker.

The proposed method based on the graph of the consistency of phrases allows estimating the connectivity of a text — its cohesion. It takes into account the availability of coreferent objects and common terms within a text. The effectiveness of the suggested method was compared with other features of a text using pre-trained classification models. The results obtained can indicate that the proposed method may be used to calculate the connectivity feature for a model that predicts a mental illness.

Conclusions. As the main criteria to distinguish the texts of healthy and ill persons, the semantic coherence is used. The estimation of the semantic coherence is performed in the following models: tangentiality model and incoherence model. It is advisable to perform the semantic representation of the text's elements (sentences for the incoherence model and windows for the tangentiailty model) using the combination of different semantic embedding models with statistical algorithms (TF-IDF, SIF) in order to take into account permanent repeats of phrases. As for the effectiveness of the mentioned models for different languages, it depends on the semantic embedding model and the properties of a certain language.

In order to increase the accuracy of the classification model, other linguistic features should be taken into account: lexical density, lexical and syntactic complexity, connectivity. The method based on the graph of the consistency of phrases has been proposed to take into account the connectivity of a text. The experimental examination of the effectiveness of the proposed method in comparison with other features has been verified. The results obtained can indicate the expediency of the usage of the proposed method to increase the accuracy of a prediction model.

References

Altamura, C., Goikolea, J. (2008). Differential diagnoses and management strategies in patients with schizophrenia and bipolar disorder. Neuropsychiatric Disease and Treatment, 4 (1), 311-7. DOI: https://doi.org/10.2147/NDT.S2703

Pogoriliy, S. D., Kramov, A. A. (2018). Metod rozrakhunku koherentnosti ukrayins'koho tekstu. [Method of calculating the coherence of the Ukrainian text]. Reyestratsiya, zberihannya i obrobka danykh (Registration, storage and data processing), 20:4, 6475. [In Ukrainian].

Elvevag, B., Foltz, P. W., Weinberger, D. R., Goldberg T. E. et. al. (2007). Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophrenia research, 93 (1-3), 304-16. DOI: https://doi.org/10.1016/j.schres.2007.03.001

Bedi, G., Carrillo, F., Cecchi, G. A. et. al. (2015). Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophrenia, 1, 1-20. DOI: https://doi.org/10.1038/npjschz.2015.30

Mikolov, T., Sutskever, I., Chen, K. et. al. (2013). Distributed representations of words and phrases and their compositionality. Proceedings of the 26th International Conference on Neural Information Processing Systems, 3111-9.

Mikolov, T., Le Q. (2014). Distributed representations of sentences and documents. International Conference on Machine Learning, 1188-96.

Pennington, J., Socher, R., Manning, C. D. (2014). Global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 1532-43. DOI: https://doi.org/10.3115/v1/D14-1162

Peters, M., Neumann, M., Iyyer M. et. al. (2018). Deep Contextualized Word Representations. Proceedings of DOI: https://doi.org/10.18653/v1/N18-1202

the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2227-37.

Devlin, J., Chang, M., Lee, K. et. al. (2029). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT 2019, 4171-86.

Iter, D., Yoon, J., Jurafsky, D. (2018). Automatic Detection of Incoherent Speech for Diagnosing Schizophrenia. Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 136-46. DOI: https://doi.org/10.18653/v1/W18-0615

Pogorilyy, S. D., Kramov, A. A. (2019). Coreference Resolution Method Using a Convolutional Neural Network. Proceedings of 2019 IEEE International Conference on Advanced Trends in Information Theory, 397-401. DOI: https://doi.org/10.1109/ATIT49449.2019.9030596

Voleti, R., Woolridge, S., Liss, J. M. et. al. (2019). Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder. Proc. Interspeech 2019, 1433-7. DOI: https://doi.org/10.21437/Interspeech.2019-2960

Covington, M. A., McFall, J. D. (2010). Cutting the Gordian Knot: The Moving-Average Type—Token Ratio (MATTR). Journal of Quantitative Linguistics, 17 (2), 94-100. DOI: https://doi.org/10.1080/09296171003643098

Honore, A. (1979). Some Simple Measures of Richness of Vocabulary. Association for Literary and Linguistic Computing Bulletin, № 7 (2), 172-7.

Bucks, R. S. (2000). Analysis of spontaneous, conversational speech in dementia of Alzheimer type: Evaluation of an objective technique for analysing lexical performance. Aphasiology, 14 (1), 71-91. DOI: https://doi.org/10.1080/026870300401603

Johansson, V. (2018). Lexical Diversity and Lexical Density in Speech and Writing: A Developmental Perspective. Working Papers in Linguistics, 53, 61-79.

Just, S., Haegert, E., Koranova, N. (2019). Coherence models in schizophrenia. Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, 126-36. DOI: https://doi.org/10.18653/v1/W19-3015

Panicheva, P., Litvinova, T. (2019). Semantic Coherence in Schizophrenia in Russian Written Texts. 25th Conference of Open Innovations Association (FRUCT), 241-9. DOI: https://doi.org/10.23919/FRUCT48121.2019.8981535

Kramov, A. (2020). Evaluating text coherence based on the graph of the consistency of phrases to identify symptoms of schizophrenia. Data Recording, Storage & Processing, 22 (1).

Andreasen, N. C. (1986). Scale for the assessment of thought, language, and communication (TLC). Schizophrenia bulletin, 12 (3), 473-82. DOI: https://doi.org/10.1093/schbul/12.3.473

Seeman, M. V., Cole, H. J. (1977). The effect of increasing personal contact in schizophrenia. Comprehensive psychiatry, 18 (3), 283-93. DOI: https://doi.org/10.1016/0010-440X(77)90024-4

English Speeches with English Subtitles — English Speeches. URL: https://www.englishspeecheschannel. com/english-speeches.

Published

2020-06-22

How to Cite

Pogorilyy, S. D., & Kramov, . A. A. . (2020). THE USAGE OF NATURAL LANGUAGE PROCESSING METHODS TO DETECT THE SYMPTOMS OF MENTAL ILLNESS. Medical Informatics and Engineering, (1), 8–16. https://doi.org/10.11603/mie.1996-1960.2020.1.11125

Issue

Section

Articles