Speech and Multimodal Interfaces Laboratory

Publications

2017

Gruber I., Hlaváč M., Železný M., Karpov A. Facing Face Recognition with ResNet: Round One. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 67-74.
Hlaváč M., Gruber I., Železný M., Karpov A. Semi-automatic Facial Key-point Dataset Creation. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 662-668.
Akhtiamov O., Pugachev A., Karpov A., Sidorov M., Minker W. Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 152-161.
Ivanko D., Karpov A., Kipyatkova I., Ryumin D., Saveliev A., Budkov V., Ivanko Dm., Železný M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 757-766.
Verkhodanova V., Shapranov V., Kipyatkova I. Hesitations in Spontaneous Speech: Acoustic Analysis and Detection. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 398-406.
Kipyatkova I. Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 362-369.
Ryumin D., Karpov A. Towards Automatic Recognition of Sign Language Gestures using Kinect 2.0. In Proc. 19th International Conference on Human-Computer Interaction HCII-2017, Vancouver, Canada, Springer LNCS vol. 10278, 2017, pp. 89-104.
Akhtiamov O., Sidorov M., Karpov A., Minker W. Speech and Text Analysis for Multimodal Addressee Detection in Human-Human-Computer Interaction. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 2521-2525.
Kaya H., Karpov A. Introducing Weighted Kernel Classifiers for Handling Imbalanced Paralinguistic Corpora: Snoring, Addressee and Cold. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 3527-3531.
Basov O., Kipyatkova I., Saveliev A. Multimodal Subscriber Interfaces for Infocommunication Systems // Computing and Informatics, Slovak Academy of Sciences, Vol. 36, 2017, pp. 908-924. (WoS, Scopus SJR=0,253, Q3).
Petrovsky A., Wan W., Rosa-Zurera M., Karpov A. Signal Processing Platforms and Algorithms for Real-life Communications and Listening to Digital Audio // Journal of Electrical and Computer Engineering, Hindawi, Volume 2017, 2017, Article ID 2913236. (WoS, Scopus SJR=0,168, Q3).
Kaya H., Salah A., Karpov A., Frolova O., Grigorev A., Lyakso E. Emotion, Age, and Gender Classification in Children's Speech by Humans and Machines // Computer Speech and Language, Elsevier, 2017, Vol. 46, pp. 268-283. (WoS JCR=1,900, Scopus SJR=0,168, Q1).
Kipyatkova I., Karpov A. A Study of Neural Network Russian Language Models for Automatic Continuous Speech Recognition Systems // Automation and Remote Control, Springer, Vol. 78, No. 5, 2017, pp. 858-867. (WoS JCR=0,492, Scopus SJR=0,34, Q2).

2016

Tampel I., Karpov A. Automatic Speech Recognition: tutorial book – St. Petersburg, ITMO University, 2016, 138 p.
Karpov A., Kryuchkov B., Ronzhin A., Usov V. Designing human-robot interaction in a united team of cosmonauts and autonomous mobile robots on the lunar surface. In Proc. 26th International Conference “Extreme Robotics (ER-2016)”, St. Petersburg, Russia, 2016, pp. 71-75. (in Eng.)