Towards Reliable Large Vision-Language Models
Title: Towards Reliable Large Vision-Language Models
DNr: Berzelius-2024-249
Project Type: LiU Berzelius
Principal Investigator: Xixi Liu <xixil@chalmers.se>
Affiliation: Chalmers tekniska högskola
Duration: 2024-07-21 – 2025-02-01
Classification: 10207
Abstract
Large Vision-Language Models (LVLMs) are widely used as foundation models for many computer vision tasks. A well-known problem with LVLMs, however, is their tendency to hallucinate: they often identify objects that are not present in the image. This project aims to mitigate these hallucinations and thereby improve the reliability of LVLMs.