Speech and Multimodal Interfaces Laboratory

Publications

2017

Ryumin D., Karpov A. Parametric representation of the speaker’s lips for multimodal sign language and speech recognition. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. In Proc. ISPRS International Workshop “Photogrammetric and computer vision techniques for video Surveillance, Biometrics and Biomedicine” PSBB-2017, Moscow, 2017, pp. 155-161.
More
Vatamaniuk I., Budkov V., Kipyatkova I., Karpov A. Methods and Algorithms of Audio-Video Signal Processing for Analysis of Indoor Human Activity. In: Favorskaya M., Jain L. (eds.) Computer Vision in Control Systems-4. Intelligent Systems Reference Library, Springer, vol. 136. 2018, pp. 139-173.
More
Verkholyak O., Karpov A. Combined feature representation for emotion classification from Russian speech. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 68-73.
More
Markovnikov N., Kipyatkova I., Karpov A., Filchenkov A. Deep neural networks in Russian language recognition. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Springer, Communications in Computer and Information Science, vol. 789, pp. 54-67.
More
Pugachev A., Akhtiamov O., Karpov A., Minker W. Deep Learning for Acoustic Addressee Detection in Spoken Dialogue Systems. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 45-53.
More
Kryuchkov B., Syrkin L., Usov V., Ivanko D., Ivanko Dm. Using Augmentative and Alternative Communication for Human Robot Interaction during Maintaining Habitability of a Lunar Base. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 95–104.15.
More
Gruber I., Hlaváč M., Železný M., Karpov A. Facing Face Recognition with ResNet: Round One. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 67-74.
More
Hlaváč M., Gruber I., Železný M., Karpov A. Semi-automatic Facial Key-point Dataset Creation. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 662-668.
More
Akhtiamov O., Pugachev A., Karpov A., Sidorov M., Minker W. Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 152-161.
More
Ivanko D., Karpov A., Kipyatkova I., Ryumin D., Saveliev A., Budkov V., Ivanko Dm., Železný M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 757-766.
More
Verkhodanova V., Shapranov V., Kipyatkova I. Hesitations in Spontaneous Speech: Acoustic Analysis and Detection. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 398-406.
More
Kipyatkova I. Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS 10458, 2017, pp. 362-369.
More
Ryumin D., Karpov A. Towards Automatic Recognition of Sign Language Gestures using Kinect 2.0. In Proc. 19th International Conference on Human-Computer Interaction HCII-2017, Vancouver, Canada, Springer LNCS vol. 10278, 2017, pp. 89-104.
More
Akhtiamov O., Sidorov M., Karpov A., Minker W. Speech and Text Analysis for Multimodal Addressee Detection in Human-Human-Computer Interaction. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 2521-2525.
More
Kaya H., Karpov A. Introducing Weighted Kernel Classifiers for Handling Imbalanced Paralinguistic Corpora: Snoring, Addressee and Cold. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 3527-3531.
More