Abstract: The proposed paper examines enhancements in Visual Question Answering (VQA) by systematically tuning hyperparameters and utilizing advanced image and text encoders. The study particularly ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results