Speech and Multimodal Interfaces Laboratory

Publications

2016

Savelyev A., Somenkov N. Architecture of client-side application of peering videoconferencing // In Proc. XXIX International Scientific Conference "Mathematical Methods in Engineering and Technology" (MMTT-29), St. Petersburg, 2016, pp 176-180.
Karpov A., Kryuchkov B., Ronzhin A., Usov V. Designing human-robot interaction in a united team of cosmonauts and autonomous mobile robots on the lunar surface. In Proc. 26th International Conference “Extreme Robotics (ER-2016)”, St. Petersburg, 2016, pp. 76-80 (in Rus).
Ivanko D., Karpov A. The use of high-speed video camera in the problems of human-computer interaction // In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 801-806.
Ryumin D., Karpov A. Automated hand gesture recognition system using the Kinect sensor // In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 838-846.
Kipyatkova I. Automatic recognition of continuous Russian speech using acoustic models based on deep neural networks. In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 807-814.23.
Struev D., Bondareva N., Budkov V., Basov O., Ronzhin A. A conceptual model multimodal user interface of the subscriber terminal // Scientific News of the Belgorod State University. Series: Economics. Informatics. No 23 (244), Issue 40, 2016, pp. 156-164. (VAK; RSCI impact factor – 0,261).
Levonevskiy D., Vatamaniuk I., Saveliev A., Denisov A. Corporate Information System of User Service as a Component of Cyber-Physical Intellectual Space // Journal of Instrument Engineering. Vol. 59, No 11, 2016, pp. 15–23. (VAK; RSCI impact factor –0,282).
Basov O., Kipyatkova I., Saveliev A., Saitov I. Polymodal Information Encoding Models // Information and Control Systems, No 2(81), 2016, pp. 68-73. (VAK; RSCI impact factor – 0,502).
Saveliev A. Algorithms of Data Processing in Supervised Accounts of a Videoconferencing System // Information and Control Systems, No 3(82), 2016, pp. 906–913. (VAK; RSCI impact factor – 0,502).
Ivanko D., Kipyatkova I., Ronzhin A., Karpov A. Analysis of Multimodal Fusion Techniques for Audio-Visual Speech Recognition // Scientific and Technical Journal of Information Technologies, Mechanics and Optics, Vol. 16, No 3, 2016, pp. 387-401. (VAK; RSCI impact factor – 0,285).
Karpov A., Kaya H., Salah A. State-of-the-art Tasks and Achievements of Paralinguistic Speech Analysis Systems // Scientific and Technical Journal of Information Technologies, Mechanics and Optics, Vol. 16, No 4, 2016, pp. 581–592. (VAK; RSCI impact factor – 0,285).
Ivanko D., Karpov A. An Analysis of Perspectives for Using High-Speed Cameras in Processing Dynamic Video Information // SPIIRAS Proceedings, Vol. 44, No 1, 2016, pp. 98-113. (VAK; RSCI impact factor – 0,359).
Kipyatkova I., Karpov A. Variants of Deep Artificial Neural Networks for Speech Recognition Systems // SPIIRAS Proceedings, Vol. 49, No 6, 2016, pp. 80-103. (VAK; RSCI impact factor – 0,359).
Ronzhin Al., Vatamaniuk I., Zelezny M. Implementation of Face Recognition Methods as a First Step for Human Behaviour Analysis in Intelligent Room. In Proc. 24th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision WSCG-2016 (poster proc.), Pilsen, Czech Republic, CSRN 2603, 2016, pp. 61-64.
Saveliev A., Saitov S., Vatamaniuk I., Basov O., Shilov N. Neural Network System for Monitoring State of a Optical Telecommunication System. In Proc. International Conference on Next Generation Wired/Wireless Networking NEW2AN-2016. Springer LNCS, Vol. 9870, 2016, pp. 39-49.