Skip to content

Commit

Permalink
'qmix_learner_x(#1)'
Browse files Browse the repository at this point in the history
  • Loading branch information
baijinqiu committed Dec 25, 2023
1 parent da08dc3 commit e7662ab
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions docs/source/documents/api/learners/marl/qmix.rst
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
QMIX_Learner
=====================================

xxxxxx.
An implementation of the QMIX (Monotonic value function factorisation for deep multi-agent reinforcement learning) algorithm.

.. raw:: html

Expand Down Expand Up @@ -33,7 +33,7 @@ PyTorch
.. py:function::
xuance.torch.learners.multi_agent_rl.qmix_learner.QMIX_Learner.update(sample)

xxxxxx.
Update the QMIX learner based on the sampled experience from the experience replay buffer.

:param sample: A dictionary contianing necessary experience data that is sampled from experience replay buffer.
:type sample: dict
Expand All @@ -43,7 +43,7 @@ PyTorch
.. py:function::
xuance.torch.learners.multi_agent_rl.qmix_learner.QMIX_Learner.update_recurrent(sample)

xxxxxx.
Update the QMIX learner for recurrent architectures based on the sampled experience from the experience replay buffer.

:param sample: A dictionary contianing necessary experience data that is sampled from experience replay buffer.
:type sample: dict
Expand Down Expand Up @@ -78,7 +78,7 @@ TensorFlow
.. py:function::
xuance.tensorflow.learners.multi_agent_rl.qmix_learner.QMIX_Learner.update(sample)

xxxxxx.
Update the QMIX learner based on the sampled experience from the experience replay buffer.

:param sample: A dictionary contianing necessary experience data that is sampled from experience replay buffer.
:type sample: dict
Expand Down Expand Up @@ -113,7 +113,7 @@ MindSpore
.. py:function::
xuance.mindspore.learners.multi_agent_rl.qmix_learner.QMIX_Learner.update(sample)

xxxxxx.
Update the QMIX learner based on the sampled experience from the experience replay buffer.

:param sample: A dictionary contianing necessary experience data that is sampled from experience replay buffer.
:type sample: dict
Expand Down

0 comments on commit e7662ab

Please sign in to comment.