Speech and Multimodal Interfaces Laboratory

Publications

2017

Budkov V., Saveliev A., Basov O., Ronzhin A. Corpus of Russian speech for the study of the truth of the transmitted message // Proceedings of the 7th Interdisciplinary Workshop "Analysis of Conversational Russian Speech" АР3 - 2017, St. Petersburg, 2017, pp. 21-25.
Kryuchkov B., Karpov A., Usov V., Chertopolokhov V. Multi-level monitoring of the gesture control of a mobile robot with out-of-ship activities on the lunar surface. Proceedings of the XIX International Conference "Problems of Control and Modeling in Complex Systems" PUMSS-2017, Samara, 2017, pp. 153-159.
Velichko A., Budkov V., Karpov A. Analytical Survey of Computational Paralinguistic Systems for Automatic Recognition of Deception in Human Speech // Informatsionno-Upravliaiushchie Sistemy, No. 5, 2017, pp. 30-41.
Kipyatkova I., Karpov A. Research of neural network models of the Russian language for automatic speech recognition systems // Avtomatika i Telemekhanika, vol. 78, No. 5, 2017, pp. 110-122.
Kryuchkov B., Usov V., Tchertopolokhov V., Ronzhin A., Karpov A. Simulation of the “cosmonaut-robot” system interaction on the lunar surface based on methods of machine vision and computer graphics. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. In Proc. ISPRS International Workshop “Photogrammetric and computer vision techniques for video Surveillance, Biometrics and Biomedicine” PSBB-2017, Moscow, 2017, pp. 129-133.
Ryumin D., Karpov A. Parametric representation of the speaker’s lips for multimodal sign language and speech recognition. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. In Proc. ISPRS International Workshop “Photogrammetric and computer vision techniques for video Surveillance, Biometrics and Biomedicine” PSBB-2017, Moscow, 2017, pp. 155-161.
Vatamaniuk I., Budkov V., Kipyatkova I., Karpov A. Methods and Algorithms of Audio-Video Signal Processing for Analysis of Indoor Human Activity. In: Favorskaya M., Jain L. (eds.) Computer Vision in Control Systems-4. Intelligent Systems Reference Library, Springer, vol. 136, 2018, pp. 139-173.
Verkholyak O., Karpov A. Combined feature representation for emotion classification from Russian speech. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 68-73.
Markovnikov N., Kipyatkova I., Karpov A., Filchenkov A. Deep neural networks in Russian language recognition. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Springer, Communications in Computer and Information Science, vol. 789, pp. 54-67.
Pugachev A., Akhtiamov O., Karpov A., Minker W. Deep Learning for Acoustic Addressee Detection in Spoken Dialogue Systems. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 45-53.
Kryuchkov B., Syrkin L., Usov V., Ivanko D., Ivanko Dm. Using Augmentative and Alternative Communication for Human Robot Interaction during Maintaining Habitability of a Lunar Base. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 95-104.
Gruber I., Hlaváč M., Železný M., Karpov A. Facing Face Recognition with ResNet: Round One. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 67-74.
Hlaváč M., Gruber I., Železný M., Karpov A. Semi-automatic Facial Key-point Dataset Creation. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 662-668.
Akhtiamov O., Pugachev A., Karpov A., Sidorov M., Minker W. Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 152-161.
Ivanko D., Karpov A., Kipyatkova I., Ryumin D., Saveliev A., Budkov V., Ivanko Dm., Železný M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 757-766.