Speech and Multimodal Interfaces Laboratory

Publications

2018

Verkhodanova V.O., Shapranov V.V., Kipyatkova I.S., Karpov A.A. Automatic detection of vocalized hesitations in Russian speech. Voprosy Jazykoznanija, 2018, No. 6, pp. 104–118. (in Russian)

Ivanko D.V., Fedotov D.V., Karpov A. A. Accuracy increase for automatic visual Russian speech recognition: viseme classes optimization. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2018, vol. 18, no. 2, pp. 346–349

Markovnikov N.M., Kipyatkova I.S. An Analytic Survey of End-to-End Speech Recognition Systems // SPIIRAS Proceedings. 2018. Issue 3(58). pp. 77-110.

Karpov A., Mporas I. Speech Communication Integrated with Other Modalities (Editorial) // Journal on Multimodal User Interfaces, Springer, Vol. 12, № 4, 2018, pp. 271-272.

Karpov A.A., Yusupov R.M. Multimodal Interfaces of Human-Computer Interaction // Herald of the Russian Academy of Sciences, Springer, Vol. 88, No. 1, 2018, pp. 67-74.

Ivanko D., Karpov A., Fedotov D., Kipyatkova I., Ryumin D., Ivanko Dm., Minker W., Zelezny M. Multimodal Speech Recognition: Increasing Accuracy using High Speed Video Data // Journal on Multimodal User Interfaces, Springer, Vol. 12, № 4, 2018, pp. 319-328.

Karpov A.A., Yusupov R.M. Multimodal Interfaces of Human-Computer Interaction // Herald of the Russian Academy of Sciences, Springer, Vol. 88, No. 2, 2018, pp. 146-155.

Kaya H., Karpov A. Efficient and Effective Feature Normalization Strategies for Cross-Corpus Acoustic Emotion Recognition // Neurocomputing. Elsevier, Vol. 275, 2018, pp. 1028-1034.

2017

Kipyatkova I. Development and research of neural network hybrid acoustic models for the Russian speech recognition system. Materials of the XXII St. Petersburg Assembly of Young Scientists and Specialists, 2017, p. 201.

Saveliev A. Development of a configuration method for the optimal arrangement of heterogeneous modules of the IoT network. Materials of the XXII St. Petersburg Assembly of Young Scientists and Specialists, 2017, p. 143.

Velichko A., Sokolov B., Karpov A., Budkov V. A brief review of the methods used in paralinguistic analysis of speech. Collection of reports of the 70th international student scientific conference GUAP. Part 2. Technical Sciences, St. Petersburg: GUAP, 2017, pp. 51-53.

Syrkin L., Zuykova A., Karpov A., Usov V. Application of an alternative method of communication for everyday interaction of a person with reduced physical capacity and a robot assistant. Proceedings of the Conference "Cognitive Research at the Present Stage" KISE-2017, Kazan, 2017.

Verkholyak O., Karpov A. Combining utterance-level and frame-level feature representations for emotion classification from speech. In Proc. IEEE International Symposium "Video and Audio Signal Processing in the Context of Neurotechnologies", SPCN-2017, 2017, pp. 31.

Tampel I., Karpov A. Automatic speech recognition. Tutorial - Spb: University of ITMO, 2017, 152 p.

Karasev E., Saveliev A., Malov D. Managing audio and video streams in peer-to-peer videoconferencing applications. Proceedings of the 10th Multi-Conference MCU-2017, vol. 3, 2017, pp. 94-96.