Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Develop autotagging datalake processing #155

Open
8 tasks
SoniaGrh opened this issue Aug 14, 2024 · 0 comments · May be fixed by #167
Open
8 tasks

Develop autotagging datalake processing #155

SoniaGrh opened this issue Aug 14, 2024 · 0 comments · May be fixed by #167
Assignees

Comments

@SoniaGrh
Copy link
Collaborator

SoniaGrh commented Aug 14, 2024

As a Data Scientist,
I want to develop a processing tool that automatically tags images in the datalake using a Visual Language Model (VLM) based on a predefined list of tags
So that we can efficiently categorize and search through the images in the datalake.

Acceptance Criteria:

  • The tool takes as input either the entire datalake or a specified list of images from the datalake.
  • The tool accepts a list of tags (e.g., "day", "night", "autumn", "winter", "summer", "spring").
  • The tool runs the VLM on each image with the appropriate prompts to assign the most relevant tag from the provided list.
  • The assigned tags are accurately associated with each image in the datalake.

Definition of Done:

  • All acceptance criteria are met.
  • Code is reviewed and approved.
  • The tool is tested and validated with a variety of images and tag lists.
  • Documentation is updated to reflect the new tagging process.
@SoniaGrh SoniaGrh self-assigned this Aug 14, 2024
@SoniaGrh SoniaGrh linked a pull request Sep 26, 2024 that will close this issue
@SoniaGrh SoniaGrh linked a pull request Sep 26, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment