Towards Reliable Large Vision-Language Models

System

NSC Web

Front Page

Getting Access

Support Email

support@nsc.liu.se

Feedback

Give Feedback

Towards Reliable Large Vision-Language Models

Title:	Towards Reliable Large Vision-Language Models
DNr:	Berzelius-2024-249
Project Type:	LiU Berzelius
Principal Investigator:	Xixi Liu <xixil@chalmers.se>
Affiliation:	Chalmers tekniska högskola
Duration:	2024-07-21 – 2025-02-01
Classification:	10207
Keywords:

Abstract

Large Vision-Language Models (LVLMs) are extensively utilized as foundational models in numerous computer vision tasks. However, a notorious problem with LVLMs is their tendency to hallucinate, meaning they often identify objects that are not present in the image. This project aims to mitigate these hallucinations and enhance the reliability of LVLMs.

National Supercomputer Centre at Linköping University

Abstract