We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Could you tell me how many tokens you used to represent the whole 3D scene before sending them to LLM? Thanks a lot.
The text was updated successfully, but these errors were encountered:
We represent a 3D scene with 32 learnable tokens following BLIP2 and InstructBlip https://github.com/Open3DA/LL3DA/blob/main/models/ll3da/captioner.py#L183. We introduce 8 additional tokens for each visual prompt.
BLIP2
InstructBlip
Sorry, something went wrong.
No branches or pull requests
Could you tell me how many tokens you used to represent the whole 3D scene before sending them to LLM?
Thanks a lot.
The text was updated successfully, but these errors were encountered: