Speech and Multimodal Interfaces Laboratory

Publications

2020

Velichko A., Karpov A. A Study of Data Scarcity Problem for Automatic Detection of Deceptive Speech Utterances // CEUR Workshop Proceedings, 3rd International Conference on R. Piotrowski's Readings in Language Engineering and Applied Linguistics PRLEAL-2019, vol. 2552, 2020, pp. 38-46.
More
Dvoynikova A., Verkholyak O., Karpov A. Analytical review of methods for identifying emotions in text data // CEUR Workshop Proceedings, 3rd International Conference on R. Piotrowski's Readings in Language Engineering and Applied Linguistics PRLEAL-2019, vol. 2552, 2020, pp. 8-21.
More
Kipyatkova I., Karpov А. A comparative study of neural network architectures for end-to-end speech recognition system // Journal of Instrument Engineering. 2020, Vol. 63, No. 11, pp. 1027-1033.
Axyonov A., Ivanko D., Lashkov I., Ryumin D., Kashevnik A., Karpov A. A methodology of multimodal corpus creation for audio-visual speech recognition in assistive transport systems // Informatization and Communication, 2020, no. 5, pp. 87-93.
Markitantov M., Karpov А. Automatic human age and gender recognition using time-delay neural networks based on acoustic features // Proceedings of III All-Russian Acoustic Conference, St. Petersburg, 2020, pp. 374-380.
Kipyatkova I., Markovnikov N. A Study of Methods for Improving End-to-End Speech Recognition System at Lack of Training Data // Proceedings of III All-Russian Acoustic Conference, St. Petersburg, 2020, pp. 361-367.
Axyonov А., Ryumin D., Kagirov I., Ivanko D., Karpov A. A technique for hand landmarks detection for contactless gesture-based human-machine interaction // Proceedings of 31st International Scientific and Technological Conference «Extreme Robotics», St. Petersburg, 2020, pp. 34-36.
Mikhajlyuk М., Karpov А., Kryuchkov B., Usov V., Dovzhenko V. Voice control of service robots under conditions of possible limitations of human motor functions in space flight // Proceedings of the XII All-Russian scientific-technical conference "Robotics and artificial intelligence", 2020, pp. 197-201.
Dvoynikova A., Verkholyak O., Karpov A. Sentiment-analysis of spoken language using a method based on tonal dictionaries // Almanac of scientific works of young scientists of ITMO University. 2020. vol. 3. pp. 75-80.
Ryumina E. A method for extracting informative video features for emotion recognition // Almanac of scientific works of ITMO University young scientists. 2020, vol. 3. pp. 151-155.
Axyonov A., Ryumina E. Analytical review of modern methods of face detection // Almanac of scientific works of ITMO University young scientists. 2020, vol. 3. pp. 12-19.
Markitantov M. Analytical survey of audiovisual speech corpora for automatic speaker’s age recognition // Almanac of scientific works of young scientists of the University ITMO. 2020, vol. 3, pp. 124-128.
Verkholyak O., Karpov A. Chapter 4 "Automatic analysis of emotionally-colored speech" in a monograph "Child speech portrait with typical and atypical development" / Lyakso E., Frolova O., Grechaniy S., Matveev Yu., Verkholyak O., Karpov A. / St. Petersburg: Publishing and Printing Association of Higher Educational Institutions, 2020, 204 p. ISBN 978-5-91155-096-7.
Ivanko D., Ryumin D., Kipyatkova I., Axyonov A., Karpov A. Lip-reading Using Pixel-based and Geometry-based Features for Multimodal Human-Robot Interfaces // Smart Innovation, Systems and Technologies, Springer, vol. 154, Zavalishin’s Readings 2019, 2020, pp. 477-486.
More
Ryumin D., Ivanko D., Kagirov I., Axyonov A., Karpov A. Vision-Based Assistive Systems for Deaf and Hearing Impaired People // In: Favorskaya M., Jain L. (eds) Computer Vision in Advanced Control Systems-5, Intelligent Systems Reference Library, Springer, vol. 175, 2020, pp. 197-224.
More