RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Project page for RLHF-V

The code is borrowed from VPGTrans and MiniGPT-4. Thanks for the excellent code!

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
demos		demos
images		images
js		js
static		static
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
index.html		index.html

Provide feedback