Skip to content

Commit

Permalink
update to the NeurIPS 2023 blog
Browse files Browse the repository at this point in the history
  • Loading branch information
Max Zhdanov authored and Max Zhdanov committed Nov 30, 2023
1 parent 5ceb9cb commit a1b77a5
Show file tree
Hide file tree
Showing 2 changed files with 29 additions and 0 deletions.
29 changes: 29 additions & 0 deletions _posts/2023-11-20-neurips.md
Original file line number Diff line number Diff line change
Expand Up @@ -185,6 +185,35 @@ date: 2023-11-20 8:00:00+0100

<hr style="margin-top: 20px;">

<table>
<tr>
<td style="width: 50%;">
<strong style="font-size: 20px;">Learning Dynamic Attribute-factored World Models for Efficient Multi-object Reinforcement Learning</strong><br>
<em>Fan Feng, Sara Magliacane</em><br>

<span style="font-size: 14px; display: block; margin-bottom: 10px;">
<a href="https://arxiv.org/abs/2307.09205" style="text-decoration: none; color: #007bff;">Paper</a> &middot;
<a href="https://arxiv.org/pdf/2307.09205.pdf" style="text-decoration: none; color: #007bff;">PDF</a> &middot;
<!-- GitHub repo to be updated later -->
</span>

<strong>Abstract:</strong> In many reinforcement learning tasks, the agent has to learn to interact with many objects of different types and generalize to unseen combinations and numbers of objects. Often a task is a composition of previously learned tasks (e.g. block stacking). These are examples of compositional generalization, in which we compose object-centric representations to solve complex tasks. Recent works have shown the benefits of object-factored representations and hierarchical abstractions for improving sample efficiency in these settings. In this paper, we introduce the Dynamic Attribute FacTored RL (DAFT-RL) framework. In DAFT-RL, we leverage object-centric representation learning to extract objects from visual inputs, classify them in classes, and infer their latent parameters. For each class of object, we learn a class template graph that describes the dynamics and reward of an object of this class factorized according to its attributes. We also learn an interaction pattern graph that describes how objects of different classes interact with each other at the attribute level. Through these graphs and a dynamic interaction graph that models the interactions between objects, we learn a policy that can then be directly applied in a new environment by just estimating the interactions and latent parameters. We evaluate DAFT-RL in three benchmark datasets and show our framework outperforms the state-of-the-art in generalizing across unseen objects with varying attributes and latent parameters, as well as in the composition of previously learned tasks. <br>

<span style="font-size: 14px; font-weight: bold; color: #6c757d;">
#ReinforcementLearning #ObjectCentricRepresentations
</span>

</td>
<td style="width: 50%;">
<!-- Image placeholder since no image link is provided -->
<img src="/assets/neurips2023/dyn_attr.png" alt="Graphical Abstract" style="height: auto; max-height: 100%; width: auto; max-width: 100%; margin-left: 5%;">
</td>
</tr>
</table>

<hr style="margin-top: 20px;">





Expand Down
Binary file added assets/neurips2023/dyn_attr.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit a1b77a5

Please sign in to comment.