Speech and Multimodal Interfaces Laboratory

Publications

2017

Hlaváč M., Gruber I., Železný M., Karpov A. Semi-automatic Facial Key-point Dataset Creation. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 662-668.
Akhtiamov O., Pugachev A., Karpov A., Sidorov M., Minker W. Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 152-161.
Ivanko D., Karpov A., Kipyatkova I., Ryumin D., Saveliev A., Budkov V., Ivanko Dm., Železný M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 757-766.
Verkhodanova V., Shapranov V., Kipyatkova I. Hesitations in Spontaneous Speech: Acoustic Analysis and Detection. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 398-406.
Kipyatkova I. Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 362-369.
Ryumin D., Karpov A. Towards Automatic Recognition of Sign Language Gestures using Kinect 2.0. In Proc. 19th International Conference on Human-Computer Interaction HCII-2017, Vancouver, Canada, Springer LNCS vol. 10278, 2017, pp. 89-104.
Akhtiamov O., Sidorov M., Karpov A., Minker W. Speech and Text Analysis for Multimodal Addressee Detection in Human-Human-Computer Interaction. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 2521-2525.
Kaya H., Karpov A. Introducing Weighted Kernel Classifiers for Handling Imbalanced Paralinguistic Corpora: Snoring, Addressee and Cold. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 3527-3531.
Basov O., Kipyatkova I., Saveliev A. Multimodal Subscriber Interfaces for Infocommunication Systems // Computing and Informatics, Slovak Academy of Sciences, Vol. 36, 2017, pp. 908-924. (WoS, Scopus SJR=0,253, Q3).
Petrovsky A., Wan W., Rosa-Zurera M., Karpov A. Signal Processing Platforms and Algorithms for Real-life Communications and Listening to Digital Audio // Journal of Electrical and Computer Engineering, Hindawi, Volume 2017, 2017, Article ID 2913236. (WoS, Scopus SJR=0,168, Q3).
Kaya H., Salah A., Karpov A., Frolova O., Grigorev A., Lyakso E. Emotion, Age, and Gender Classification in Children's Speech by Humans and Machines // Computer Speech and Language, Elsevier, 2017, Vol. 46, pp. 268-283. (WoS JCR=1,900, Scopus SJR=0,168, Q1).
Kipyatkova I., Karpov A. A Study of Neural Network Russian Language Models for Automatic Continuous Speech Recognition Systems // Automation and Remote Control, Springer, Vol. 78, No. 5, 2017, pp. 858-867. (WoS JCR=0,492, Scopus SJR=0,34, Q2).

2016

Tampel I., Karpov A. Automatic Speech Recognition: tutorial book – St. Petersburg, ITMO University, 2016, 138 p.
Karpov A., Kryuchkov B., Ronzhin A., Usov V. Designing human-robot interaction in a united team of cosmonauts and autonomous mobile robots on the lunar surface. In Proc. 26th International Conference “Extreme Robotics (ER-2016)”, St. Petersburg, Russia, 2016, pp. 71-75. (in Eng.)
Karpov A. Multimodal recognition of Russian speech using audio and video information. Scientific works of the participants of the competition “Young scientists of the University ITMO”, St. Petersburg, 2016, pp. 132-138.