Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
| dc.contributor.author | Hu, Yuchen | |
| dc.contributor.author | Li, Ruizhe | |
| dc.contributor.author | Chen, Chen | |
| dc.contributor.author | Zou, Heqing | |
| dc.contributor.author | Zhu, Qiushi | |
| dc.contributor.author | Chng, Eng Siong | |
| dc.contributor.institution | University of Aberdeen.Computing Science | en |
| dc.contributor.institution | University of Aberdeen.Computational Linguistics at Aberdeen | en |
| dc.date.accessioned | 2023-07-11T15:49:01Z | |
| dc.date.available | 2023-07-11T15:49:01Z | |
| dc.date.issued | 2023-05-16 | |
| dc.description | 12 pages, 5 figures, Accepted by IJCAI 2023 | en |
| dc.format.extent | 3479694 | |
| dc.identifier | 251076102 | |
| dc.identifier | f4a6c527-9e8f-4185-8bb0-a2ea511fc60f | |
| dc.identifier.citation | Hu, Y, Li, R, Chen, C, Zou, H, Zhu, Q & Chng, E S 2023 'Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition' ArXiv. https://doi.org/10.48550/arXiv.2305.09212 Focus to learn more | en |
| dc.identifier.doi | 10.48550/arXiv.2305.09212 Focus to learn more | |
| dc.identifier.other | ArXiv: http://arxiv.org/abs/2305.09212v1 | |
| dc.identifier.other | ORCID: /0000-0003-2512-845X/work/138534414 | |
| dc.identifier.uri | https://hdl.handle.net/2164/21191 | |
| dc.language.iso | eng | |
| dc.publisher | ArXiv | |
| dc.subject | eess.AS | en |
| dc.subject | cs.CV | en |
| dc.subject | cs.MM | en |
| dc.subject | cs.SD | en |
| dc.title | Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition | en |
| dc.type | Preprint | en |
Files
Original bundle
1 - 1 of 1
