Consolidating AgentState metadata #814

JackUrb · 2022-07-06T22:12:51Z

Overview

To unblock @Etesam913 on #791, this PR standardizes a flow for reading and writing metadata from the top-level AgentState class

After much (internal) deliberation on what should go in the main AgentState class and what should still be fragmented out into the subclasses, I decided that only metadata would be considered a first-class feature of AgentState's (rather than all data types) but that the save+load flow would be standardized and updated towards the direction of #567. Happy to revisit this in discussions, but I couldn't find any other clean way to allow new agent types to be flexible with how they handle their data.

Implementation

New functionality:

Some AgentState abstract methods have been moved behind wrappers such that I can put the save+load lifecycle functions inside
AgentState.save_metadata and AgentState.load_metadata have been created as ways to handle IO for the metadata, while direct edits can be made to the AgentState.metadata object (a dataclass of type _AgentStateMetadata).

Towards future design, this allows the update_metadata method to exist inside of the base AgentState class rather than needing implementations throughout the similar classes.

pringshia

Thanks for putting this together!

Have a few minor questions but overall looks good.

pringshia · 2022-07-07T12:10:01Z

mephisto/abstractions/blueprints/remote_procedure/remote_procedure_agent_state.py

+            # Backwards compatibility for times
+            if "start_time" in state:
+                self.metadata.task_start = state["start_time"]
+                self.metadata.task_end = state["end_time"]


thanks for this migration consideration

pringshia · 2022-07-07T12:16:56Z

mephisto/abstractions/_subcomponents/agent_state.py

+    Class to track the first-class feature fields of info about an AgentState.
+
+    AgentState subclasses may choose to track additional metadata, but should
+    attach to the agent state subclass directly.


what is meant by "attach" here? is that some Python specific terminology?

Ah I just mean use their own attributes, this is leftover commentary from when I intended to standardize more. I'll update

pringshia · 2022-07-07T12:20:27Z

mephisto/abstractions/_subcomponents/agent_state.py

+
+    task_start: Optional[float] = None
+    task_end: Optional[float] = None
+    # TODO other metadata fields can be initialized


I was imagining that adding the tips plugin would be as simple as doing yarn add mephisto-worker-addons and using it without having to modify the back-end code much. Would this require end users to update the python code to use different subclasses for AgentStateMetadata?

Also what if they wanted to compose different plugins? For example, let's say in the future we introduce a plugin for gamification or leaderboards. It would be nice if adding those plugins just worked with the metadata storage, without having to define and compose the dataclasses.

Not sure if this is the right solution, but just wanted to articulate the use case.

Valid concern, but I'm wary of projecting too far here. Additional plugins could piggyback on metadata by adding directly to this data class on functionality we feel is standard or general enough. Anything additional or in development should probably just store on an AgentState+Blueprint if the functionality is too complex to simply be in this metadata dataclass.

pringshia · 2022-07-07T12:24:22Z

mephisto/abstractions/blueprints/abstract/static_task/static_agent_state.py

+            if "times" in self.state:
+                assert isinstance(self.state["times"], dict)
+                self.metadata.task_start = self.state["times"]["task_start"]
+                self.metadata.task_end = self.state["times"]["task_end"]


tbh I'm happy keeping the times within the "times" top level field, as library managed metadata... and the new metadata field for plugins and extensions to use. it might make some of the compatibility stuff here easier to manage though i see you've already handled it. I don't have a strong opinion on this though

I think the cost of standardization here for these simple fields is overall worth it. I likely should have done this before.

Etesam913

Looks good, I don't see any problems and it ran as expected.

One thing that would make sense to think about (I could do this when I merge into add-tips-example):
There isn't a way to update the metadata generally. As of now this isn't a problem because the metadata only has a task_start and task_end field. There is no reason to update these outside of init and submit.

However, if there is a feature like Tips or something else, it might not make too much sense to create an update method for each new field in metadata.

Something like below can be created in agent_state.py:

def update_metadata(self, property_name: str, property_value):
    if property_name not in self.metadata:
        """
        put some message here to tell user that the metadata field does not exist 
        (could help user know that they have to add a field to the _AgentStateMetadata) 
        """
        return

    else:
        self.metadata[property_name] = property_value

JackUrb · 2022-07-07T14:38:53Z

@Etesam913 noted in the PR description:

Towards future design, this allows the update_metadata method to exist inside of the base AgentState class rather than needing implementations throughout the similar classes.

Definitely would make sense to create an update_metadata method that is able to handle more complex alterations when provided with a metadata update packet's data.

Rebasing #791 on this should let you clean up the code a bit while keeping relatively the same design.

pringshia · 2022-07-07T14:43:18Z

There isn't a way to update the metadata generally

Agreed on the general edit part, that's sort of what I was sort of getting at with my comment here.

Also, given (from the OP):

while direct edits can be made to the AgentState.metadata object

I think the idea is that edits are to be made directly to the metadata object and then persisted via a call to AgentState.save_metadata?

JackUrb · 2022-07-07T14:44:55Z

Correct - the AgentState class semantics rely on explicit save_data and load_data calls for persisting vs overwriting from disk, and so I had the metadata attribute follow these semantics as well.

I only didn't implement update_metadata as I was leaving it to be introduced in #791

codecov-commenter · 2022-07-07T14:48:02Z

Codecov Report

Merging #814 (8b22ec1) into main (c776529) will increase coverage by 0.55%.
The diff coverage is 65.92%.

@@            Coverage Diff             @@
##             main     #814      +/-   ##
==========================================
+ Coverage   64.37%   64.92%   +0.55%     
==========================================
  Files         107      107              
  Lines        9178     9195      +17     
==========================================
+ Hits         5908     5970      +62     
+ Misses       3270     3225      -45

Impacted Files	Coverage Δ
mephisto/data_model/agent.py	`80.25% <0.00%> (+4.45%)`	⬆️
...s/remote_procedure/remote_procedure_agent_state.py	`41.53% <28.57%> (+5.39%)`	⬆️
...eprints/abstract/static_task/static_agent_state.py	`37.20% <38.46%> (+4.85%)`	⬆️
.../blueprints/parlai_chat/parlai_chat_agent_state.py	`30.35% <41.66%> (+2.18%)`	⬆️
mephisto/abstractions/databases/local_database.py	`89.83% <60.00%> (-1.67%)`	⬇️
mephisto/abstractions/database.py	`86.18% <68.75%> (-0.81%)`	⬇️
...ephisto/abstractions/_subcomponents/agent_state.py	`84.11% <89.36%> (+6.17%)`	⬆️
...o/abstractions/blueprints/mock/mock_agent_state.py	`92.30% <100.00%> (+8.43%)`	⬆️
mephisto/operations/client_io_handler.py	`85.47% <100.00%> (ø)`
mephisto/data_model/unit.py	`77.59% <0.00%> (-0.55%)`	⬇️
... and 2 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c776529...8b22ec1. Read the comment docs.

Consolidating some agent-state metadata

2ff4560

JackUrb requested a review from pringshia July 6, 2022 22:12

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 6, 2022

JackUrb requested a review from Etesam913 July 6, 2022 22:13

pringshia reviewed Jul 7, 2022

View reviewed changes

pringshia approved these changes Jul 7, 2022

View reviewed changes

Etesam913 approved these changes Jul 7, 2022

View reviewed changes

Comment update

8b22ec1

JackUrb merged commit f2a4d62 into main Jul 7, 2022

JackUrb deleted the agent-state-consolidate branch July 7, 2022 17:54

JackUrb mentioned this pull request Jul 22, 2022

Fixing bug in ParlAI agent state from 1.0.2 #859

Merged

snyk-bot mentioned this pull request Mar 6, 2023

[Snyk] Upgrade rc-slider from 8.7.1 to 10.1.1 Benjamin-KY/Mephisto#6

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidating AgentState metadata #814

Consolidating AgentState metadata #814

JackUrb commented Jul 6, 2022

pringshia left a comment •

edited

Loading

pringshia Jul 7, 2022

pringshia Jul 7, 2022

JackUrb Jul 7, 2022

pringshia Jul 7, 2022

JackUrb Jul 7, 2022

pringshia Jul 7, 2022

JackUrb Jul 7, 2022

Etesam913 left a comment •

edited

Loading

JackUrb commented Jul 7, 2022 •

edited

Loading

pringshia commented Jul 7, 2022 •

edited

Loading

JackUrb commented Jul 7, 2022 •

edited

Loading

codecov-commenter commented Jul 7, 2022

Consolidating AgentState metadata #814

Consolidating AgentState metadata #814

Conversation

JackUrb commented Jul 6, 2022

Overview

Implementation

pringshia left a comment • edited Loading

Choose a reason for hiding this comment

pringshia Jul 7, 2022

Choose a reason for hiding this comment

pringshia Jul 7, 2022

Choose a reason for hiding this comment

JackUrb Jul 7, 2022

Choose a reason for hiding this comment

pringshia Jul 7, 2022

Choose a reason for hiding this comment

JackUrb Jul 7, 2022

Choose a reason for hiding this comment

pringshia Jul 7, 2022

Choose a reason for hiding this comment

JackUrb Jul 7, 2022

Choose a reason for hiding this comment

Etesam913 left a comment • edited Loading

Choose a reason for hiding this comment

JackUrb commented Jul 7, 2022 • edited Loading

pringshia commented Jul 7, 2022 • edited Loading

JackUrb commented Jul 7, 2022 • edited Loading

codecov-commenter commented Jul 7, 2022

Codecov Report

pringshia left a comment •

edited

Loading

Etesam913 left a comment •

edited

Loading

JackUrb commented Jul 7, 2022 •

edited

Loading

pringshia commented Jul 7, 2022 •

edited

Loading

JackUrb commented Jul 7, 2022 •

edited

Loading