Speech and Multimodal Interfaces Laboratory

Axyonov Alexander Alexandrovich

Position: Senior researcher
Qualification: PhD

Publications

2024

Ryumin D., Axyonov A., Ryumina E., Ivanko D., Kashevnik A., Karpov A. Audio–visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems // Expert Systems with Applications. 2024. vol. 252. ID 124159.
Axyonov A., Ryumin D., Ivanko D., Kashevnik A., Karpov A. Audio-Visual Speech Recognition In-The-Wild: Multi-Angle Vehicle Cabin Dataset and Attention-Based Approach // In Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2024. pp. 8195-8199.
Ivanko D., Ryumin D., Axyonov A., Kashevnik A., Karpov A. OpenAV: Bilingual Dataset for Audio-Visual Voice Control of a Computer for Hand Disabled People // Lecture Notes in Computer Science, SPECOM-2024. 2024. vol. 15299. pp. 163-173.

2023

Axyonov A.A., Ryumina E.V., Ryumin D.A., Ivanko D.V., Karpov A.A. Neural network-based method for visual recognition of driver’s voice commands using attention mechanism // Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2023. vol. 23. no. 4. pp. 767–775.
Ryumin D., Ivanko D., Axyonov A. Cross-Language Transfer Learning Using Visual Information for Automatic Sign Gesture Recognition // The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2023. vol. XLVIII. pp. 209–216.
Ivanko D., Ryumina E., Ryumin D., Axyonov A., Kashevnik A., Karpov A. EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition // In Proc. of the 25th International Conference on Speech and Computer SPECOM-2023. Lecture Notes in Computer Science. LNAI. 2023. vol. 14338. pp. 18–31.

2022

Axyonov A., Ryumin D., Kashevnik A., Ivanko D., Karpov A. Method for visual analysis of driver's face for automatic lip-reading in the wild // Computer Optics. 2022. vol. 46. no. 6. pp. 955-962.