Skip to content

ACL'24: Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

Notifications You must be signed in to change notification settings

DEEP-PolyU/MAIL_ACL24

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 

Repository files navigation

MAIL_ACL24

Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

Pre-processing

In this part, we:

  • Transform images into captions then Scene Graphs;
  • Extract the topic entities in the questions;
  • Construct the Concept Graphs based on previous outputs.

Localization of MiniGPT-4

Please check the implementation details in Localized MiniGPT-4.

About

ACL'24: Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published