Abstract: Visual Question Answering (VQA), a challenging field combining computer vision and natural language processing, is finding applications in critical real-world scenarios. This paper ...