This repository contains the code supporting the LLaVA base model for use with Autodistill.
LLaVA is a multimodal language model with object detection capabilities. You can use LLaVA with Autodistill for object detection. Learn more about LLaVA 1.5, the most recent version of LLaVA at the time this package was released.
Read the full Autodistill documentation.
Read the LLaVA Autodistill documentation.
To use LLaVA with Autodistill, install the following dependency:
pip3 install autodistill-llava
from autodistill_llava import LLaVA
from autodistill.detection import CaptionOntology

# define an ontology to map class names to our LLaVA prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = LLaVA(
    ontology=CaptionOntology(
        {
            "a forklift": "forklift"
        }
    )
)

# label every .jpeg image in the ./context_images folder
base_model.label("./context_images", extension=".jpeg")
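The {caption: class} ontology format described in the comments above can be illustrated with a plain dictionary. The sketch below does not import Autodistill or LLaVA and is not how the library handles ontologies internally. The helper names (`prompts`, `classes`, `caption_to_class`) and the second entry, "a wooden pallet", are assumptions added only for illustration.

```python
# Plain-Python illustration of the {caption: class} ontology format.
# Nothing here calls autodistill; all helper names are hypothetical.

ontology = {
    "a forklift": "forklift",       # entry taken from the example above
    "a wooden pallet": "pallet",    # hypothetical second entry
}


def prompts(ontology):
    """Return the captions, i.e. the text that would be sent to the base model."""
    return list(ontology.keys())


def classes(ontology):
    """Return the labels that would appear in the annotations, de-duplicated in order."""
    return list(dict.fromkeys(ontology.values()))


def caption_to_class(ontology, caption):
    """Look up the label saved for a given caption."""
    return ontology[caption]


print(prompts(ontology))
print(classes(ontology))
print(caption_to_class(ontology, "a forklift"))
```

The captions can be more descriptive than the class names, so you can adjust the prompt wording without changing the labels written to your dataset.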
This model is licensed under an Apache 2.0 License.
We love your input! Please see the core Autodistill contributing guide to get started. Thank you 🙏 to all our contributors!