Speech and Multimodal Interfaces Laboratory

Publications

2019

Verkholyak O., Fedotov D., Kaya H., Zhang Y., Karpov A. Hierarchical Two-Level Modelling of Emotional States in Spoken Dialog Systems. In Proc. 44th IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP-2019, Brighton, UK, 2019, pp. 6700-6704.
Kaya H., Fedotov D., Dresvyanskiy D., Doyran M., Mamontov D., Markitantov M., Akdag Salah A., Kavcar E., Karpov A., Salah A.A. Predicting depression and emotions in the cross-roads of cultures, para-linguistics, and non-linguistics. In Proc. 9th ACM International Audio/Visual Emotion Challenge and Workshop AVEC’19, Nice, France, 2019, ACM, New York, NY, USA, 9 pages.
Ryumin D., Ivanko D., Kagirov I., Axyonov A., Karpov A., Zelezny M. Human-Robot Interaction with Smart Shopping Trolley using Sign Language: Data Collection. In Proc. 2019 IEEE International Conference on Pervasive Computing and Communications Workshops, PerCom Workshops 2019, Kyoto, Japan, 2019, pp. 949-954.
Akhtiamov O., Siegert I., Karpov A., Minker W. Cross-Corpus Data Augmentation for Acoustic Addressee Detection. In Proc. 20th Annual SIGdial Meeting on Discourse and Dialogue SIGDIAL-2019, Stockholm, Sweden, 2019, pp. 274-283.
Fedotov D., Kim B., Karpov A., Minker W. Time-Continuous Emotion Recognition Using Spectrogram Based CNN-RNN Modelling // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 93-102.
Yu J., Markov K., Karpov A. Speaking Style Based Apparent Personality Recognition // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 540-548.
Verkholyak O.V., Kaya H., Karpov A.A. Modeling short-term and long-term dependencies of the speech signal for paralinguistic emotion classification // SPIIRAS Proceedings, Issue 62, № 1, 2019, pp. 30-56.
Ivanko D.V., Ryumin D.A., Karpov A.A., Zhelezny M. Investigation of the influence of high-speed video data on the recognition accuracy of audiovisual speech // Informatsionno-Upravliaiushchie Sistemy [Information and Control Systems], No. 2, 2019, pp. 26-34.
Fedotov D.V., Verkholyak O.V., Karpov A.A. Contextual continuous recognition of emotions in Russian speech using recurrent neural networks. Proceedings of the 8th Interdisciplinary Seminar “Analysis of Conversational Russian Speech” AR3-2019, St. Petersburg, St. Petersburg State University, 2019, pp. 96-99.
Ryumin D., Kagirov I., Ivanko D., Axyonov A., Karpov A. Automatic detection and recognition of 3D manual gestures for human-machine interaction // International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 42(2/W12), 2019, pp. 179-183.
Ivanko D., Ryumin D., Karpov A. Automatic lip-reading of hearing impaired people // International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives 42(2/W12), 2019, pp. 97-101.
Kipyatkova I. LSTM-Based Language Models for Very Large Vocabulary Continuous Russian Speech Recognition System // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 219-226.
Markovnikov N., Kipyatkova I. Investigating Joint CTC-Attention Models for End-to-End Russian Speech Recognition // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 337-347.
Markitantov M., Verkholyak O. Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 327-336.
Kagirov I., Ryumin D., Axyonov A. Method for Multimodal Recognition of One-Handed Sign Language Gestures Through 3D Convolution and LSTM Neural Networks // Lecture Notes in Computer Science, Springer LNAI 11658, SPECOM 2019, 2019, pp. 191-200.