Cardon, Rémi
[UCL]
Pham, Tran Hanh Trang
[UCL]
Zakhia Doueihi, Julien
[UCL]
François, Thomas
[UCL]
The present work studies the contribution of move structure to automatic genre identification. This concept, well known in other branches of genre analysis, seems to have found little application in natural language processing. We describe how we collect a corpus of French-language tourism websites and annotate it with move structure. We then conduct experiments on automatic genre identification with this corpus. Our results show that our approach to informing a model with move structure can improve its performance on automatic genre identification and reduce the need for annotated data and computational power.
Bibliographic reference
Cardon, Rémi ; Pham, Tran Hanh Trang ; Zakhia Doueihi, Julien ; François, Thomas. Contribution of Move Structure to Automatic Genre Identification: An Annotated Corpus of French Tourism Websites. LREC-COLING 2024 (Torino, Italia, from 20/05/2024 to 25/05/2024). In: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, ELRA and ICCL: Torino, Italia, 2024, p. 3916-3926
Permanent URL
http://hdl.handle.net/2078.1/289319