I'm curious if noise takes effect in search_best_move #68

shengkelong · 2023-06-16T18:05:04Z

I noticed that the policy is set to noise in "expand_node", but "update_policy", which is called during inference (in "process_mini_batch"), overwrites the policy with the network output. As a result, there seems to be no randomness at all outside of self-play games.

CGLemon · 2023-06-17T02:44:07Z

The noise is not set in expand_node(). What is set there is only a tentative policy, which is replaced by the NN policy in process_mini_batch(). So you are right: the MCTS process is not random.
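
To illustrate the replacement described above, here is a toy sketch. It is not TamaGo's code: `ToyNode` and its uniform placeholder are hypothetical, and only the method name `update_policy` is borrowed from the thread. It shows why nothing assigned at expansion time survives once the network result arrives.

```python
class ToyNode:
    """Hypothetical stand-in for a search-tree node; not TamaGo's class."""

    def __init__(self, num_moves):
        # Tentative policy assigned at expansion time (a uniform placeholder here).
        self.policy = [1.0 / num_moves] * num_moves

    def update_policy(self, nn_policy):
        # Overwrites the tentative values entirely; nothing from expansion is kept.
        self.policy = list(nn_policy)


node = ToyNode(num_moves=3)
tentative = list(node.policy)

# Later, the batched network evaluation returns and replaces the placeholder.
node.update_policy([0.7, 0.2, 0.1])
```

After the call, `node.policy` holds only the network values, so any randomness in the tentative policy would have no effect on the search.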

kobanium · 2023-06-20T18:08:22Z

CGLemon is right. There is little randomness when TamaGo runs as a normal MCTS player. If you want to add randomness to TamaGo, I can modify it so that it can run like AlphaZero (Dirichlet noise and move generation from the distribution of visit counts).
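
For reference, the two AlphaZero-style mechanisms mentioned above could look roughly like the sketch below. This is an assumption-laden illustration, not TamaGo's implementation: the function names, the `temperature` parameter, and the plain-list representation are all hypothetical, and the defaults `alpha=0.03` and `epsilon=0.25` are the values commonly cited for Go in AlphaZero. Only the standard library is used.

```python
import random


def add_dirichlet_noise(policy, alpha=0.03, epsilon=0.25, rng=random):
    """Return (1 - epsilon) * policy + epsilon * Dirichlet(alpha) noise."""
    # A Dirichlet sample is a vector of Gamma(alpha, 1) draws normalised to sum to 1.
    gammas = [rng.gammavariate(alpha, 1.0) for _ in policy]
    total = sum(gammas)
    if total == 0.0:
        # With a very small alpha every draw can underflow to 0; skip the noise then.
        return list(policy)
    return [(1.0 - epsilon) * p + epsilon * g / total
            for p, g in zip(policy, gammas)]


def sample_move_from_visits(visits, temperature=1.0, rng=random):
    """Pick a move index with probability proportional to visits ** (1 / temperature)."""
    weights = [v ** (1.0 / temperature) for v in visits]
    return rng.choices(range(len(visits)), weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed only so the example is repeatable
root_policy = [0.5, 0.3, 0.2]
noisy_policy = add_dirichlet_noise(root_policy, rng=rng)
chosen = sample_move_from_visits([120, 45, 35], rng=rng)
```

In this sketch the noise would be applied once to the root node's policy before the search starts, and the sampled move would replace the usual "most visited child" choice. Both steps make repeated games from the same position diverge.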

kobanium added the question Further information is requested label Jun 20, 2023
