Speech and Multimodal Interfaces Laboratory

Publications

2016

Gruber I., Hlaváč M., Hrúz M., Železný M., Karpov A. An Analysis of Visual Faces Datasets. In Proc. 1st International Conference on Interactive Collaborative Robotics ICR-2016, Budapest, Hungary, Springer LNCS, Vol. 9812, 2016, pp. 18-26.

Verkhodanova V., Ronzhin Al., Kipyatkova I., Ivanko D., Karpov A., Železný M. HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 338-345.

Vatamaniuk I., Levonevskiy D., Saveliev A., Denisov A. Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 588-595.

Kipyatkova I., Karpov A. DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi. In Proc. SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 246-253.

Verkhodanova V., Shapranov V. Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech Using SVM. In Proc. 18th International Conference on Speech and Computer SPECOM-2016, Budapest, Hungary, Springer LNCS, Vol. 9811, 2016, pp. 224-231.

Karpov A., Ronzhin Al., Kipyatkova I., Ronzhin A., Verkhodanova V., Saveliev A., Zelezny M. Bimodal Speech Recognition Fusing Audio-Visual Modalities. In Proc. 18th International Conference on Human-Computer Interaction HCII-2016, Toronto, Canada, Springer LNCS, Vol. 9732, 2016, pp. 170-179.

Kaya H., Karpov A., Salah A. Robust Acoustic Emotion Recognition based on Cascaded Normalization and Extreme Learning Machines. In Proc. 13th International Symposium on Neural Networks ISNN-2016, St. Petersburg, Russia, Springer LNCS, Vol. 9719, 2016, pp. 115-123.

Kipyatkova I., Karpov A. Language Models with RNNs for Rescoring Hypotheses of Russian ASR. In Proc. 13th International Symposium on Neural Networks ISNN-2016, St. Petersburg, Russia, Springer LNCS, Vol. 9719, 2016, pp. 418-425. (Scopus SJR – 0,252).

Kaya H., Karpov A. Fusing Acoustic Feature Representations for Computational Paralinguistics Tasks. In Proc. INTERSPEECH-2016, San Francisco, USA, 2016, pp. 2046-2050. (Scopus SJR – 0,275).

Karpov A., Kipyatkova I., Zelezny M. Automatic Technologies for Processing Spoken Sign Languages. In Proc. SLTU-2016, Indonesia. Procedia Computer Science. Elsevier, Vol. 81, 2016, pp. 201-207. (Scopus SJR – 0,314).

Verkhodanova V., Shapranov V. Experiments on detection of voiced hesitations in Russian spontaneous speech // Journal of Electrical and Computer Engineering. Hindawi, USA, Volume 2016, 2016, Article ID 2013658. (Scopus SJR – 0,225).

2015

Verkhodanova V.O. Analysis and detection of phonational speech disfluencies on the material of continues Russian speech of different types and styles // In Proc. XX St. Petersburg Assembly of young scientists and specialists. 2015. p. 9.

Karpov A.A., Means of speech and multimodal human-computer interaction for assistive information technologies // Speech Technology, No 3-4, 2014, pp. 48-59.

Kryuchkov B.I., Karpov A.A., Usov V.M. Organization of voice interaction between a human operator and an anthropomorphic mobile robot for maintaining spatial orientation in weightlessness. // Proceedings of the XVII International Conference on "Complex Systems: Control and Modeling Problems", Samara: Samara Scientific Centre RAS, Russia, 2015, pp. 522-527.

Ushakov I.B., Polyakov A.V., Karpov A.A., Usov V.M. Medical robotics as a new stage of designing on-board trainers and biotechnical systems for the space station.// In Proc. International Scientific and Technological Conference «Extreme Robotics»– St. Petersburg: Publishing house "Polytechnic-service", 2015. pp. 47-51.