Speech and Multimodal Interfaces Laboratory

Publications

2022

Axyonov A., Kagirov I., Ryumin D. A method of multimodal machine sign language translation for natural human-computer interaction // Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2022. Vol. 22. No. 3. pp. 585-593.

Dvoynikova A. A., Kagirov I. A., Karpov A. A. Analytical review of methods for automatic detection of user engagement in virtual communication // Information and Control Systems. 2022. No. 5(120). pp. 12-22.

Markitantov M., Ryumina E., Ryumin D., Karpov A. Biometric Russian Audio-Visual Extended MASKS (BRAVE-MASKS) Corpus: Multimodal Mask Type Recognition Task // Proceedings of 23rd International Conference INTERSPEECH-2022. Korea. 2022. pp. 1756-1760.

Ivanko D., Ryumin D., Kashevnik A., Axyonov A., Kitenko A., Lashkov I., Karpov A. DAVIS: Driver’s Audio-Visual Speech recognition // Proceedings of 23rd International Conference INTERSPEECH-2022. Korea. 2022. pp. 1141-1142.

Ivanko D., Ryumin D., Kashevnik A., Axyonov A., Karpov A. Visual Speech Recognition in a Driver Assistance System // Proceedings of 30th European Signal Processing Conference EUSIPCO-2022. Belgrade, Serbia. 2022. pp. 1131-1135.

Ivanko D., Axyonov A., Ryumin D., Kashevnik A., Karpov A. RUSAVIC Corpus: Russian Audio-Visual Speech in Cars // Proceedings of 13th Language Resources and Evaluation Conference LREC-2022. France. 2022. pp. 1555-1559.

Ivanko D., Kashevnik A., Ryumin D., Kitenko A., Axyonov A., Lashkov I., Karpov A. MIDriveSafely: Multimodal Interaction for Drive Safely // Proceedings of 24th ACM International Conference on Multimodal Interaction ICMI-2022. India. 2022. pp. 733-735.

Dvoynikova A., Markitantov M., Ryumina E., Uzdiaev M., Velichko A., Kagirov I., Kipyatkova I., Lyakso E., Karpov A. An analysis of automatic techniques for recognizing human's affective states by speech and multimodal data // Proceedings of 24th International Congress on Acoustics ICA-2022. Korea. 2022. pp. 22-33.

Ryumina E., Ivanko D. Emotional Speech Recognition Based on Lip-Reading // Lecture Notes in Computer Science, SPECOM-2022, India. 2022. Vol. 13721. pp. 616-625.

Kipyatkova I. Investigation of Transfer Learning for End-to-End Russian Speech Recognition // Lecture Notes in Computer Science, SPECOM-2022, India. 2022. Vol. 13721. pp. 349-357.

Velichko A. A speech singnal analysis methodfor automatic aggression detection in colloquial speech // Proceedings of vsu, series: systems analysis and information technologies. 2022. No. 4. pp. 180-188.

Letenkov M., Iakovlev R., Markitantov M., Ryumin D., Karpov A. Application of training data synthesis methods for recognition of partially hidden faces in images // Journal of instrument engineering. 2022. No. 65(11). pp. 842-850.

Kagirov I., Ryumin D. Russian Sign Language Database for Clinical Use: Data and Annotation Peculiarities // Vestnik NSU. Series: Linguistics and Intercultural Communication. 2022. No. 20(3). pp. 90-108.

Ivanko D., Ryumin D., Markitantov M. End-to-end Visual Speech Recognition for Human-Robot Interaction // Proceedings of IV International Scientific Conference MIP: Engineering-IV-2022: Modernization, Innovations, Progress: Advanced Technologies in Material Science, Mechanical and Automation Engineering. 2022. pp. 82-90.

Dvoynikova A. Cough recognition using spectrogram analysis // Almanac of scientific works of young scientists of ITMO University. 2022. Vol. 2. pp. 230-234.