Speech and Multimodal Interfaces Laboratory

RUSAVIC Corpus: Russian Audio-Visual Speech in Cars

Access to the corpus:

This corpus is available to the public. Permission to use, but not to reproduce or distribute, our corpus is granted to all researchers, provided that the following steps are followed:

  • Send an email to Alexandr Axyonov (axyonov.a@iias.spb.su) to request a download link for this corpus and a password to access its files. Your email MUST be sent from a valid university account and MUST contain the following text:

    1. Subject: Application to download the RUSAVIC corpus
    2. Name: <your first and last name>
    3. Affiliation: <University where you work>
    4. Department: <your department>
    5. Position: <your job title>
    6. Email: <must be your email address at the above-mentioned institution>
    
    I have read and agree to the terms and conditions specified in the RUSAVIC corpus webpage. 
    This corpus will only be used for research purposes. 
    I will not make any part of this corpus available to a third party. 
    I will not sell any part of this corpus or make any profit from its use.
    
  • If you use the data mentioned above, you MUST cite the following papers:

    Ivanko D., Axyonov A., Ryumin D., Kashevnik A., Karpov A. RUSAVIC Corpus: Russian Audio-Visual Speech in Cars // In Proc. of the 13th Language Resources and Evaluation Conference (LREC). 2022. pp. 1555–1559.

    Axyonov A., Ryumin D., Ivanko D., Kashevnik A., Karpov A. Audio-Visual Speech Recognition In-The-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-Based Method // In Proc. of ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing. 2024. pp. 8195–8199, doi: 10.1109/ICASSP48485.2024.10448048.

    or, in BibTeX format:

    @inproceedings{ivanko2022rusavic,
      title={RUSAVIC Corpus: Russian audio-visual speech in cars},
      author={Ivanko, Denis and Axyonov, Alexandr and Ryumin, Dmitry and Kashevnik, Alexey and Karpov, Alexey},
      booktitle={Proceedings of the Thirteenth Language Resources and Evaluation Conference},
      pages={1555--1559},
      year={2022}
    }
    
    @inproceedings{axyonov2024audio,
      title={Audio-Visual Speech Recognition In-The-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-Based Method},
      author={Axyonov, Alexandr and Ryumin, Dmitry and Ivanko, Denis and Kashevnik, Alexey and Karpov, Alexey},
      booktitle={ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
      pages={8195--8199},
      year={2024},
      organization={IEEE},
      doi={10.1109/icassp48485.2024.10448048}
    }