Speech and Multimodal Interfaces Laboratory

Publications

2020

Kipyatkova I., Karpov A. Class-based LSTM Russian Language Model with Linguistic Information // Proceedings of the 12th Conference on Language Resources and Evaluation LREC-2020. 2020. pp. 2470–2474.
Dvoynikova A., Verkholyak O., Karpov A. Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian // Lecture Notes in Computer Science, Springer LNAI 12335, SPECOM 2020. 2020. pp. 136-144.
Kipyatkova I., Markovnikov N. Experimenting with Attention Mechanisms in Joint CTC-Attention Models for Russian Speech Recognition // Lecture Notes in Computer Science, Springer LNAI 12335, SPECOM 2020. 2020. pp. 214–222.
Markitantov M. Transfer Learning in Speaker’s Age and Gender Recognition // Lecture Notes in Computer Science, Springer LNAI 12335, SPECOM 2020. 2020. pp. 326-335.
Ivanko D., Ryumin D., Karpov A. An Experimental Analysis of Different Approaches to Audio–Visual Speech Recognition and Lip-Reading // Proceedings of 15th International Conference on Electromechanics and Robotics "Zavalishin's Readings" ZR-2020. 2020. pp. 197-209.
Gundelakh F., Stankevich L., Kapralov N., Ekimovskii J. Cyber-Physical System Control Based on Brain-Computer Interface. In Proc. International Conference on Cyber-Physical Systems and Control CPS&C 2019. Lecture Notes in Networks and Systems, Springer. 2020. Vol. 95. pp. 458-469.
More
Ryumina E., Karpov A. Facial Expression Recognition using Distance Importance Scores Between Facial Landmarks // CEUR Workshop Proceedings, 30th International Conference on Computer Graphics and Machine Vision GraphiCon-2020, vol. 2744, 2020, paper 32.
More
Velichko A., Karpov A. A Study of Data Scarcity Problem for Automatic Detection of Deceptive Speech Utterances // CEUR Workshop Proceedings, 3rd International Conference on R. Piotrowski's Readings in Language Engineering and Applied Linguistics PRLEAL-2019, vol. 2552, 2020, pp. 38-46.
More
Dvoynikova A., Verkholyak O., Karpov A. Analytical review of methods for identifying emotions in text data // CEUR Workshop Proceedings, 3rd International Conference on R. Piotrowski's Readings in Language Engineering and Applied Linguistics PRLEAL-2019, vol. 2552, 2020, pp. 8-21.
More
Kipyatkova I., Karpov А. A comparative study of neural network architectures for end-to-end speech recognition system // Journal of Instrument Engineering. 2020, Vol. 63, No. 11, pp. 1027-1033.
Axyonov A., Ivanko D., Lashkov I., Ryumin D., Kashevnik A., Karpov A. A methodology of multimodal corpus creation for audio-visual speech recognition in assistive transport systems // Informatization and Communication, 2020, no. 5, pp. 87-93.
Markitantov M., Karpov А. Automatic human age and gender recognition using time-delay neural networks based on acoustic features // Proceedings of III All-Russian Acoustic Conference, St. Petersburg, 2020, pp. 374-380.
Kipyatkova I., Markovnikov N. A Study of Methods for Improving End-to-End Speech Recognition System at Lack of Training Data // Proceedings of III All-Russian Acoustic Conference, St. Petersburg, 2020, pp. 361-367.
Axyonov А., Ryumin D., Kagirov I., Ivanko D., Karpov A. A technique for hand landmarks detection for contactless gesture-based human-machine interaction // Proceedings of 31st International Scientific and Technological Conference «Extreme Robotics», St. Petersburg, 2020, pp. 34-36.
Mikhajlyuk М., Karpov А., Kryuchkov B., Usov V., Dovzhenko V. Voice control of service robots under conditions of possible limitations of human motor functions in space flight // Proceedings of the XII All-Russian scientific-technical conference "Robotics and artificial intelligence", 2020, pp. 197-201.