Laboratory of Speech and Multimodal Interfaces

Publications

2016

Ivanko D.V., Karpov A.A. An Analysis of the Prospects for Using High-Speed Cameras to Recognize Dynamic Video Information // SPIIRAS Proceedings. Issue 44, No. 1, 2016, pp. 98-113. (VAK, RSCI, impact factor: 0.359).
Kipyatkova I.S., Karpov A.A. Varieties of Deep Artificial Neural Networks for Speech Recognition Systems // SPIIRAS Proceedings. Issue 49, No. 6, 2016, pp. 80-103. (VAK, RSCI, impact factor: 0.359).
Ronzhin A., Vatamaniuk I., Zelezny M. Implementation of Face Recognition Methods as a First Step for Human Behaviour Analysis in Intelligent Room. In Proc. 24th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision WSCG-2016 (poster proc.), Pilsen, Czech Republic, CSRN 2603, 2016, pp. 61-64.
Saveliev A., Saitov S., Vatamaniuk I., Basov O., Shilov N. Neural Network System for Monitoring State of a Optical Telecommunication System. In Proc. International Conference on Next Generation Wired/Wireless Networking. Springer LNCS, 2016, pp. 39-49.
Gruber I., Hlaváč M., Hrúz M., Železný M., Karpov A. An Analysis of Visual Faces Datasets. In Proc. ICR-2016, Budapest, Hungary, Springer LNCS, Vol. 9812, 2016, pp. 18-26.
Verkhodanova V., Ronzhin Al., Kipyatkova I., Ivanko D., Karpov A., Železný M. HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 338-345.
Vatamaniuk I., Levonevskiy D., Saveliev A., Denisov A. Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 588-595.
Kipyatkova I., Karpov A. DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 246-253.
Verkhodanova V., Shapranov V. Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech Using SVM. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 224-231.
Karpov A., Ronzhin Al., Kipyatkova I., Ronzhin A., Verkhodanova V., Saveliev A., Zelezny M. Bimodal Speech Recognition Fusing Audio-Visual Modalities. In Proc. HCII-2016, Toronto, Canada, Springer LNCS, Vol. 9732, 2016, pp. 170-179.
Kaya H., Karpov A., Salah A. Robust Acoustic Emotion Recognition based on Cascaded Normalization and Extreme Learning Machines. In Proc. 13th International Symposium on Neural Networks ISNN-2016, St. Petersburg, Russia, Springer LNCS, Vol. 9719, 2016, pp. 115-123.
Kipyatkova I., Karpov A. Language Models with RNNs for Rescoring Hypotheses of Russian ASR. In Proc. 13th International Symposium on Neural Networks ISNN-2016, St. Petersburg, Russia, Springer LNCS, Vol. 9719, 2016, pp. 418-425.
Kaya H., Karpov A. Fusing Acoustic Feature Representations for Computational Paralinguistics Tasks. In Proc. INTERSPEECH-2016, San Francisco, USA, 2016, pp. 2046-2050.
Karpov A., Kipyatkova I., Zelezny M. Automatic Technologies for Processing Spoken Sign Languages. In Proc. SLTU-2016, Procedia Computer Science. Elsevier, Vol. 81, 2016, pp. 201-207.
Verkhodanova V., Shapranov V. Experiments on detection of voiced hesitations in Russian spontaneous speech // Journal of Electrical and Computer Engineering. Hindawi, Volume 2016, Article ID 2013658.