Add support for loading data from pandas dataframe #18

agarwl · 2023-01-18T17:25:44Z

Right now, we only support loading data from numpy arrays. It would be nice to have a helper function that converts a DataFrame of scores to numpy arrays. Some initial code sketching what this might look like:

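import numpy as np

def get_all_return_values(df):
  # Map each game to an array of shape (num_runs, num_steps).
  games = list(df['game'].unique())
  return_vals = {}
  for game in games:
    game_df = df[df['game'] == game]
    # One list of per-step scores per run; the run column is assumed to be
    # 'run_number', and rows must already be ordered by step.
    arr = game_df.groupby('run_number')['normalized_score'].apply(list).values
    return_vals[game] = np.stack(arr, axis=0)
  return return_vals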

def convert_to_matrix(x):
  return np.stack([x[k] for k in sorted(x.keys())], axis=1)

## Usage
# Array of shape (num_runs, num_games, num_steps)
all_normalized_scores = convert_to_matrix(get_all_return_values(score_df))

The above code assumes a pandas DataFrame with columns `run_number`, `game` and `normalized_score`, containing the scores for all steps (in an ordered manner).
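To make the expected layout concrete, here is a minimal sketch on toy data. The game names, score values, and the `run_col` parameter are made up for illustration, and the helper is a lightly restructured variant of the code above, not a finished implementation.

```python
import numpy as np
import pandas as pd


def get_all_return_values(df, run_col="run_number"):
    """Map each game to an array of shape (num_runs, num_steps)."""
    return_vals = {}
    for game, game_df in df.groupby("game"):
        # One list of per-step scores per run; rows are assumed ordered by step.
        per_run = game_df.groupby(run_col)["normalized_score"].apply(list)
        return_vals[game] = np.stack(list(per_run), axis=0)
    return return_vals


def convert_to_matrix(x):
    """Stack per-game arrays into shape (num_runs, num_games, num_steps)."""
    return np.stack([x[k] for k in sorted(x.keys())], axis=1)


# Hypothetical toy data: 2 runs, 3 games, 4 steps, one row per step.
rows = []
for run in range(2):
    for game in ["alien", "breakout", "pong"]:
        for step in range(4):
            rows.append({
                "run_number": run,
                "game": game,
                "normalized_score": 0.1 * step + run,
            })
score_df = pd.DataFrame(rows)

all_normalized_scores = convert_to_matrix(get_all_return_values(score_df))
print(all_normalized_scores.shape)
```

Note that `np.stack` requires every run to have the same number of steps for a given game, and every game to have the same number of runs; ragged data would need padding or truncation first.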


stefanbschneider · 2023-07-05T14:18:27Z

Hi, just to better understand the assumed structure of the DataFrame: do we have one row for each step?
Are these all the steps during evaluation (not training) on all the tasks?

And we'd assume separate DataFrames for each approach, which are each read separately by get_all_return_values()?
E.g., to construct the required dict for computing performance profiles.

agarwl · 2023-07-12T21:46:00Z

Yeah, for performance profiles, the data frames contain per-step results from evaluation (obtained during the course of training).

For aggregate metrics, we use the final performance, so that corresponds to evaluation results at the final step or a pre-specified step.
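As a rough sketch of the final-step case: assuming the scores are already in a `(num_runs, num_games, num_steps)` array as in the first post, the aggregate-metric input is a slice along the last axis. The `scores_at_step` helper, the stand-in array, and the `"my_algorithm"` key are hypothetical names used only for illustration.

```python
import numpy as np

# Hypothetical stand-in for an array of shape (num_runs, num_games, num_steps).
num_runs, num_games, num_steps = 3, 2, 5
all_normalized_scores = np.arange(
    num_runs * num_games * num_steps, dtype=float
).reshape(num_runs, num_games, num_steps)


def scores_at_step(scores, step=-1):
    """Return the (num_runs, num_games) slice at one evaluation step.

    step=-1 selects the final step; any other index selects a
    pre-specified step.
    """
    return scores[:, :, step]


final_scores = scores_at_step(all_normalized_scores)

# Assumed downstream format: a dict keyed by algorithm name.
score_dict = {"my_algorithm": final_scores}
print(final_scores.shape)
```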

agarwl added the enhancement (New feature or request) and help wanted (Extra attention is needed) labels Jan 18, 2023

agarwl pinned this issue Jan 18, 2023

agarwl added the good first issue (Good for newcomers) label Jan 21, 2023
