Speech and Multimodal Interfaces Laboratory

Publications

2017

Verkholyak O., Karpov A. Combined feature representation for emotion classification from Russian speech. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 68-73.
More
Markovnikov N., Kipyatkova I., Karpov A., Filchenkov A. Deep neural networks in Russian language recognition. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Springer, Communications in Computer and Information Science, vol. 789, pp. 54-67.
More
Pugachev A., Akhtiamov O., Karpov A., Minker W. Deep Learning for Acoustic Addressee Detection in Spoken Dialogue Systems. In Proc. 6th International Conference on Artificial Intelligence and Natural Language AINL-2017, St. Petersburg, Communications in Computer and Information Science, Springer, vol. 789, pp. 45-53.
More
Kryuchkov B., Syrkin L., Usov V., Ivanko D., Ivanko Dm. Using Augmentative and Alternative Communication for Human Robot Interaction during Maintaining Habitability of a Lunar Base. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 95–104.15.
More
Gruber I., Hlaváč M., Železný M., Karpov A. Facing Face Recognition with ResNet: Round One. In Proc. 2nd International Conference on Interactive Collaborative Robotics ICR-2017, Hatfield, UK, Springer LNCS vol. 10459, 2017, pp. 67-74.
More
Hlaváč M., Gruber I., Železný M., Karpov A. Semi-automatic Facial Key-point Dataset Creation. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 662-668.
More
Akhtiamov O., Pugachev A., Karpov A., Sidorov M., Minker W. Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 152-161.
More
Ivanko D., Karpov A., Kipyatkova I., Ryumin D., Saveliev A., Budkov V., Ivanko Dm., Železný M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 757-766.
More
Verkhodanova V., Shapranov V., Kipyatkova I. Hesitations in Spontaneous Speech: Acoustic Analysis and Detection. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS vol. 10458, 2017, pp. 398-406.
More
Kipyatkova I. Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition. In Proc. 19th International Conference on Speech and Computer SPECOM-2017, Hatfield, UK, Springer LNCS 10458, 2017, pp. 362-369.
More
Ryumin D., Karpov A. Towards Automatic Recognition of Sign Language Gestures using Kinect 2.0. In Proc. 19th International Conference on Human-Computer Interaction HCII-2017, Vancouver, Canada, Springer LNCS vol. 10278, 2017, pp. 89-104.
More
Akhtiamov O., Sidorov M., Karpov A., Minker W. Speech and Text Analysis for Multimodal Addressee Detection in Human-Human-Computer Interaction. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 2521-2525.
More
Kaya H., Karpov A. Introducing Weighted Kernel Classifiers for Handling Imbalanced Paralinguistic Corpora: Snoring, Addressee and Cold. In Proc. INTERSPEECH-2017, Stockholm, Sweden, ISCA, 2017, pp. 3527-3531.
More
Basov O., Kipyatkova I., Saveliev A. Multimodal Subscriber Interfaces for Infocommunication Systems // Computing and Informatics, Slovak Academy of Sciences, Vol. 36, 2017, pp. 908-924. (WoS, Scopus SJR=0,253, Q3).
More
Petrovsky A., Wan W., Rosa-Zurera M., Karpov A. Signal Processing Platforms and Algorithms for Real-life Communications and Listening to Digital Audio // Journal of Electrical and Computer Engineering, Hindawi, Volume 2017, 2017, Article ID 2913236. (WoS, Scopus SJR=0,168, Q3).
More