Speech and Multimodal Interfaces Laboratory

Publications

2016

Karpov A. Multimodal recognition of Russian speech using audio and video information. Scientific works of the participants of the competition "Young scientists of the University ITMO”, St. Petersburg, 2016, pp. 132-138.
Kryuchkov B., Usov V., Karpov A. An ontological approach for designing interactive virtual environments for a visual representation of planned actions during dialogue controlling a robot-assistant on-board of the ISS. In Proc. VI International Conference "Open Semantic Technologies for Intelligent Systems" OSTIS-2016, Minsk, Belarus, 2016, pp. 477-482.
Malakhov S., Karpov A., Sirkin L., Usov V. The approaches for compensation of defects of polymodal perception at persons with deep visual impairment by means of high information and communication technologies. In Proc. 7th International Conference on Cognitive Science, Svetlogorsk, Russia, 2016, pp 406-407.
Verkhodanova V., Shapranov V., Karpov A. Filled pauses and lengthenings detection using machine learning techniques. In Proc. 7th Workshop on Experimental Linguistics ExLing-2016, St. Petersburg, Russia, 2016, pp. 167-170.
Savelyev A., Somenkov N. Architecture of client-side application of peering videoconferencing // In Proc. XXIX International Scientific Conference "Mathematical Methods in Engineering and Technology" (MMTT-29), St. Petersburg, 2016, pp 176-180.
Karpov A., Kryuchkov B., Ronzhin A., Usov V. Designing human-robot interaction in a united team of cosmonauts and autonomous mobile robots on the lunar surface. In Proc. 26th International Conference “Extreme Robotics (ER-2016)”, St. Petersburg, 2016, pp. 76-80 (in Rus).
Ivanko D., Karpov A. The use of high-speed video camera in the problems of human-computer interaction // In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 801-806.
Ryumin D., Karpov A. Automated hand gesture recognition system using the Kinect sensor // In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 838-846.
Kipyatkova I. Automatic recognition of continuous Russian speech using acoustic models based on deep neural networks. In Proc. 9th Conference "Information Technologies in Control" ITC-2016, St. Petersburg, 2016, pp. 807-814.23.
Struev D., Bondareva N., Budkov V., Basov O., Ronzhin A. A conceptual model multimodal user interface of the subscriber terminal // Scientific News of the Belgorod State University. Series: Economics. Informatics. No 23 (244), Issue 40, 2016, pp. 156-164. (VAK; RSCI impact factor – 0,261).
Levonevskiy D., Vatamaniuk I., Saveliev A., Denisov A. Corporate Information System of User Service as a Component of Cyber-Physical Intellectual Space // Journal of Instrument Engineering. Vol. 59, No 11, 2016, pp. 15–23. (VAK; RSCI impact factor –0,282).
Basov O., Kipyatkova I., Saveliev A., Saitov I. Polymodal Information Encoding Models // Information and Control Systems, No 2(81), 2016, pp. 68-73. (VAK; RSCI impact factor – 0,502).
Saveliev A. Algorithms of Data Processing in Supervised Accounts of a Videoconferencing System // Information and Control Systems, No 3(82), 2016, pp. 906–913. (VAK; RSCI impact factor – 0,502).
Ivanko D., Kipyatkova I., Ronzhin A., Karpov A. Analysis of Multimodal Fusion Techniques for Audio-Visual Speech Recognition // Scientific and Technical Journal of Information Technologies, Mechanics and Optics, Vol. 16, No 3, 2016, pp. 387-401. (VAK; RSCI impact factor – 0,285).
Karpov A., Kaya H., Salah A. State-of-the-art Tasks and Achievements of Paralinguistic Speech Analysis Systems // Scientific and Technical Journal of Information Technologies, Mechanics and Optics, Vol. 16, No 4, 2016, pp. 581–592. (VAK; RSCI impact factor – 0,285).