Speech and Multimodal Interfaces Laboratory

Publications

2023

Ushakov I.B., Bubeev Yu.A., Syrkin L.D., Karpov A.A., Polyakov A.V., Ivanov A.V., Usov V.M. Remote tele-counseling in primary healthcare for screening of anxiety-depressive disorders with a feedback loop from the patient // System analysis and control in biomedical systems. 2023. vol. 22. no. 4. pp. 140-153.
Kashevnik A.M., Karpov A.A., Bubeev Yu.A., Usov V.M., Ivanov A.V. Systems for Detecting Fatigue While Simulating Operator Activity of Cosmonauts // Manned Spaceflight. 2023. no. 4(49). pp. 106-121.
Povolotskaia A., Karpov A. Modern problems of automatic speech recognition: detection and analysis of extra-linguistic vocalizations in spontaneous conversational speech // In Proc. of the 8th Interdisciplinary Seminar “Analysis of Conversational Russian Speech” AR3-2023. St. Petersburg. 2023. pp. 44-50.
Dvoynikova A. An analytical review of multimodal data corpora for emotion recognition // Almanac of scientific works of young scientists of ITMO University. 2023. Vol. 1. pp. 251-256.
Dvoynikova A. Analyzing the engagement and emotions of virtual communication interlocutors // Proceedings of the XII Congress of Young Scientists of ITMO University. 2023. Vol. 2. pp. 185-190.
Karpov A.A. Intellectual systems for organization of multimodal human-machine interaction // Collection of abstracts of the Russian forum “Microelectronics 2023”. 2023. pp. 550-551.

2022

Ryumina E. Automatic recognition of emotional speech from video information // Almanac of scientific works of young scientists of ITMO University. 2023. Vol. 1. pp. 324-329.
Ryumina E., Dresvyanskiy D., Karpov A. In Search of a Robust Facial Expressions Recognition Model: A Large-Scale Visual Cross-Corpus Study // Neurocomputing. Elsevier. 2022. Vol. 514. pp. 435-450.
Dresvyanskiy D., Ryumina E., Kaya H., Markitantov M., Karpov A., Minker W. End-to-End Modeling and Transfer Learning for Audiovisual Emotion Recognition in-the-Wild // Multimodal Technologies and Interaction. 2022. Vol. 6. ID 11.
Siegert I., Hillmann S., Weiss B., Szczuka J. M., Karpov A. Editorial: Towards Omnipresent and Smart Speech Assistants // Frontiers in Computer Science. 2022. Vol. 4. pp. 1-3.
Velichko A., Markitantov M., Kaya H., Karpov A. Complex Paralinguistic Analysis of Speech: Predicting Gender, Emotions and Deception in a Hierarchical Framework // Proceedings of International Conference INTERSPEECH-2022. 2022. pp. 4735-4739.
Dresvyanskiy D., Sinha Y., Busch M.s, Siegert I., Karpov A., Minker W. DyCoDa: A Multi-modal Data Collection of Multi-user Remote Survival Game Recordings // Lecture Notes in Computer Science, SPECOM-2022. India. 2022. Vol. 13721. pp. 163-177.
Mamontov D., Minker W., Karpov A. Self-Configuring Genetic Programming Feature Generation in Affect Recognition Tasks // Lecture Notes in Computer Science, SPECOM-2022, India. 2022. Vol. 13721. pp. 464-476.
Krebbers D., Kaya H., Karpov A. Multi-level Fusion of Fisher Vector Encoded BERT and Wav2vec 2.0 Embeddings for Native Language Identification // Lecture Notes in Computer Science, SPECOM-2022, India. 2022. Vol. 13721. pp. 391-403.
Prasanna Mahadeva S.R., Karpov A., Samudravijaya K., Agrawal Shyam S. SPECOM 2022 Preface // Lecture Notes in Computer Science, SPECOM-2022, India. 2022. Vol. 13721. pp. v-vi