Ruan, William
[UCL]
Pircalabelu, Eugen
[UCL]
Human communication as a whole is a complex system composed of sound and vision patterns, specifically constructed to convey meaning between people. This stands in stark contrast to the language spoken by machines. A bridge between the two systems can be built with the aid of word embedding algorithms, which aim to capture the semantic and syntactic meaning of words through real-valued vectors. This thesis is grounded in a rather recent field and aims to shed some light on the mathematical background of a specific statistical approach to constructing such embeddings, while comparing it to modern neural network models. In particular, we test these models on real-world data consisting of articles related to the Russo-Ukrainian war. The outcomes of these comparisons are examined, as are the limitations of the study and future directions.


Bibliographic reference: Ruan, William. Word Embeddings using Canonical Correlation Analysis. Faculté des sciences, Université catholique de Louvain, 2022. Prom. : Pircalabelu, Eugen.
Permanent URL: http://hdl.handle.net/2078.1/thesis:38059