'masac_learner_x(#1)'
baijinqiu committed Dec 25, 2023
1 parent 4135878 commit da053b0
Showing 1 changed file with 16 additions and 13 deletions.
29 changes: 16 additions & 13 deletions docs/source/documents/api/learners/marl/masac.rst
@@ -1,7 +1,7 @@
MASAC_Learner
=====================================

-xxxxxx.
+An implementation of the Multi-Agent Soft Actor-Critic (MASAC) algorithm.

.. raw:: html

@@ -33,11 +33,12 @@ PyTorch
.. py:function::
xuance.torch.learners.multi_agent_rl.masac_learner.MASAC_Learner.update(sample)

-xxxxxx.
+Update the MASAC agent with a batch of training samples.

-:param sample: xxxxxx.
-:type sample: xxxxxx
-:return: The infomation of the training.
+:param sample: A dictionary containing training samples, including observations, actions, next observations, rewards,
+    terminals, agent masks, and agent IDs.
+:type sample: dict
+:return: The information of the training.
:rtype: dict

.. raw:: html
@@ -68,11 +69,12 @@ TensorFlow
.. py:function::
xuance.tensorflow.learners.multi_agent_rl.masac_learner.MASAC_Learner.update(sample)

-xxxxxx.
+Update the MASAC agent with a batch of training samples.

-:param sample: xxxxxx.
-:type sample: xxxxxx
-:return: The infomation of the training.
+:param sample: A dictionary containing training samples, including observations, actions, next observations, rewards,
+    terminals, agent masks, and agent IDs.
+:type sample: dict
+:return: The information of the training.
:rtype: dict

.. raw:: html
@@ -103,11 +105,12 @@ MindSpore
.. py:function::
xuance.mindspore.learners.multi_agent_rl.masac_learner.MASAC_Learner.update(sample)

-xxxxxx.
+Update the MASAC agent with a batch of training samples.

-:param sample: xxxxxx.
-:type sample: xxxxxx
-:return: The infomation of the training.
+:param sample: A dictionary containing training samples, including observations, actions, next observations, rewards,
+    terminals, agent masks, and agent IDs.
+:type sample: dict
+:return: The information of the training.
:rtype: dict

.. raw:: html
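
The ``sample`` argument documented in these hunks is described only in prose, so the following is a minimal sketch of what such a dictionary might look like, not the layout the library necessarily uses. The key names (``obs``, ``actions``, ``obs_next``, ``rewards``, ``terminals``, ``agent_mask``, ``agent_ids``), the ``[batch, agent, feature]`` nesting, and both helper functions are assumptions made for illustration; the commit itself names the fields but not their keys or shapes.

```python
import random

# Hypothetical key names: the documentation lists the fields in prose only.
REQUIRED_KEYS = (
    "obs", "actions", "obs_next", "rewards",
    "terminals", "agent_mask", "agent_ids",
)


def make_dummy_sample(batch_size, n_agents, obs_dim, act_dim, seed=0):
    """Build a nested-list sample with an assumed [batch, agent, feature] layout."""
    rng = random.Random(seed)

    def rand(*shape):
        # Recursively build a nested list of uniform random floats.
        if len(shape) == 1:
            return [rng.uniform(-1.0, 1.0) for _ in range(shape[0])]
        return [rand(*shape[1:]) for _ in range(shape[0])]

    def const(value):
        # One scalar feature per agent per batch entry.
        return [[[value] for _ in range(n_agents)] for _ in range(batch_size)]

    # One-hot agent identifiers, repeated for every batch entry.
    one_hot = [[1.0 if i == j else 0.0 for j in range(n_agents)]
               for i in range(n_agents)]

    return {
        "obs": rand(batch_size, n_agents, obs_dim),
        "actions": rand(batch_size, n_agents, act_dim),
        "obs_next": rand(batch_size, n_agents, obs_dim),
        "rewards": rand(batch_size, n_agents, 1),
        "terminals": const(0.0),   # no episode has ended
        "agent_mask": const(1.0),  # every agent is active
        "agent_ids": [[row[:] for row in one_hot] for _ in range(batch_size)],
    }


def check_sample(sample):
    """Verify that all assumed keys exist and share batch and agent dimensions."""
    missing = [key for key in REQUIRED_KEYS if key not in sample]
    if missing:
        raise KeyError(f"sample is missing keys: {missing}")
    batch_size = len(sample["obs"])
    n_agents = len(sample["obs"][0])
    for key in REQUIRED_KEYS:
        value = sample[key]
        if len(value) != batch_size or any(len(row) != n_agents for row in value):
            raise ValueError(f"field {key!r} does not match [batch, agent] layout")
    return batch_size, n_agents
```

A check of this kind could run before a dictionary is handed to ``update(sample)``, so that a missing field is reported by name rather than failing somewhere inside the learner.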