Speech and Multimodal Interfaces Laboratory

Publications

2018

Velichko A., Budkov V., Kagirov I., Karpov A. Comparative Analysis of Classification Methods for Automatic Deception Detection in Speech. In Proc. 20th International Conference on Speech and Computer SPECOM-2018, Leipzig, Germany, Springer, LNAI vol. 11096, 2018, pp. 737-746.
Fedotov D., Kaya H., Karpov A. Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup. In Proc. 20th International Conference on Speech and Computer SPECOM-2018, Leipzig, Germany, Springer, LNAI vol. 11096, 2018, pp. 155-165.
Kaya H., Fedotov D., Yesilkanat A., Verkholyak O., Zhang Y., Karpov A. LSTM based Cross-corpus and Cross-task Acoustic Emotion Recognition. In Proc. 19th International Conference INTERSPEECH-2018, Hyderabad, India, ISCA, 2018, pp. 521-525.
More
Vatamaniuk I.V., Budkov V.Y., Kipyatkova I.S., Karpov A.A. Methods and Algorithms of Audio-Video Signal Processing for Analysis of Indoor Human Activity. In: Favorskaya M., Jain L. (eds.) Computer Vision in Control Systems-4. Intelligent Systems Reference Library, vol. 136. Springer, 2018, pp. 139-173.
More
Verkhodanova V.O., Shapranov V.V., Kipyatkova I.S., Karpov A.A. Automatic detection of vocalized hesitations in Russian speech. Voprosy Jazykoznanija, 2018, No. 6, pp. 104–118. (in Russian)
More
Ivanko D.V., Fedotov D.V., Karpov A. A. Accuracy increase for automatic visual Russian speech recognition: viseme classes optimization. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2018, vol. 18, no. 2, pp. 346–349
More
Markovnikov N.M., Kipyatkova I.S. An Analytic Survey of End-to-End Speech Recognition Systems // SPIIRAS Proceedings. 2018. Issue 3(58). pp. 77-110.
More
Karpov A., Mporas I. Speech Communication Integrated with Other Modalities (Editorial) // Journal on Multimodal User Interfaces, Springer, Vol. 12, № 4, 2018, pp. 271-272.
More
Karpov A.A., Yusupov R.M. Multimodal Interfaces of Human-Computer Interaction // Herald of the Russian Academy of Sciences, Springer, Vol. 88, No. 1, 2018, pp. 67-74.
More
Ivanko D., Karpov A., Fedotov D., Kipyatkova I., Ryumin D., Ivanko Dm., Minker W., Zelezny M. Multimodal Speech Recognition: Increasing Accuracy using High Speed Video Data // Journal on Multimodal User Interfaces, Springer, Vol. 12, № 4, 2018, pp. 319-328.
More
Karpov A.A., Yusupov R.M. Multimodal Interfaces of Human-Computer Interaction // Herald of the Russian Academy of Sciences, Springer, Vol. 88, No. 2, 2018, pp. 146-155.
More
Kaya H., Karpov A. Efficient and Effective Feature Normalization Strategies for Cross-Corpus Acoustic Emotion Recognition // Neurocomputing. Elsevier, Vol. 275, 2018, pp. 1028-1034.
More

2017

Kipyatkova I. Development and research of neural network hybrid acoustic models for the Russian speech recognition system. Materials of the XXII St. Petersburg Assembly of Young Scientists and Specialists, 2017, p. 201.
Saveliev A. Development of a configuration method for the optimal arrangement of heterogeneous modules of the IoT network. Materials of the XXII St. Petersburg Assembly of Young Scientists and Specialists, 2017, p. 143.
Velichko A., Sokolov B., Karpov A., Budkov V. A brief review of the methods used in paralinguistic analysis of speech. Collection of reports of the 70th international student scientific conference GUAP. Part 2. Technical Sciences, St. Petersburg: GUAP, 2017, pp. 51-53.