Hi, first of all, thank you for making this available to the world. I am curious about the motivation for having a partition number. I think I am confused about its role and how it relates to the segments (= strata?).
As I understand it, the buffer consists of partitions, and each partition consists of segments whose length is the batch size.
In addition, after initializing ExperienceReplay with my setup, I get the error ValueError: empty range for randrange() (2,2, 0) in the loop below:
    for n in range(1, self.batch_size + 1):
        index = random.randint(distribution['strata_ends'][n] + 1,
                               distribution['strata_ends'][n + 1])
        rank_list.append(index)
This happens because my distribution['strata_ends'] was set up incorrectly.
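To make the failure concrete, here is a minimal sketch that reproduces the same kind of error outside the library. The helper name sample_from_strata and both dictionaries are made up for illustration; only the randint call mirrors the loop quoted above. When two consecutive entries of strata_ends are equal, the lower bound passed to randint exceeds the upper bound and a ValueError is raised.

```python
import random


def sample_from_strata(strata_ends, batch_size):
    """Draw one rank per stratum, mirroring the quoted loop (hypothetical helper)."""
    ranks = []
    for n in range(1, batch_size + 1):
        low = strata_ends[n] + 1    # first rank inside stratum n
        high = strata_ends[n + 1]   # last rank inside stratum n
        ranks.append(random.randint(low, high))
    return ranks


# Strictly increasing boundaries: every stratum holds at least one rank.
good = {1: 0, 2: 2, 3: 5, 4: 9}
# Boundaries 2 and 3 coincide, so stratum 2 is empty (illustrative values).
bad = {1: 0, 2: 1, 3: 1, 4: 9}

ranks = sample_from_strata(good, 3)

try:
    sample_from_strata(bad, 3)
    failed = False
except ValueError:
    # randint(2, 1) has no valid values to draw from.
    failed = True
```

With the second dictionary, the call for n = 2 becomes random.randint(2, 1), which matches the (2,2, 0) reported in the error message.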
Due to the particular way the algorithm was implemented (w.r.t dist_index & partition_number), I discovered there are a few constraints we need to impose on the hyperparameters. Here are assertion statements I set up for my project.
    import math

    # this must be an integer (otherwise, KeyError or ValueError may be thrown)
    dist_index = config['learn_start'] / config['size'] * config['partition_num']
    assert math.floor(dist_index) == math.ceil(dist_index), "this must be an integer"
    assert config['learn_start'] * config['partition_num'] >= config['size'], "another glitch"
I found it useful to initialize every hyperparameter as a power of 2. (Technically, powers of any other integer, such as 10, should work as well.)
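The two constraints above can also be checked with integer arithmetic only, which avoids floating-point rounding in the division. This is a sketch under my own assumptions: check_replay_config is a hypothetical helper, not part of the library, and I have not verified that the library's internal arithmetic matches it exactly.

```python
def check_replay_config(size, learn_start, partition_num):
    """Validate the hyperparameter constraints described above (hypothetical helper).

    Returns the integer value of learn_start / size * partition_num.
    """
    scaled = learn_start * partition_num
    # learn_start / size * partition_num is a whole number exactly when
    # learn_start * partition_num is divisible by size.
    if scaled % size != 0:
        raise ValueError("learn_start * partition_num must be a multiple of size")
    # For positive inputs the divisibility check already implies this one;
    # it only fires when learn_start or partition_num is zero.
    if scaled < size:
        raise ValueError("learn_start * partition_num must be at least size")
    return scaled // size


# Powers of 2 satisfy both constraints (values chosen for illustration).
dist_index = check_replay_config(size=1024, learn_start=256, partition_num=8)

try:
    check_replay_config(size=1000, learn_start=256, partition_num=8)
    rejected = False
except ValueError:
    rejected = True
```

Here 256 * 8 = 2048 is a multiple of 1024, so the first call returns 2, while 2048 is not a multiple of 1000 and the second call is rejected.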
The root cause is that cdf[index] < step evaluates to False in the early strata, so the index is never incremented and distribution['strata_ends'] ends up with repeated values.
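The sketch below is my simplified reconstruction of this behaviour, not the library's actual code. It assumes rank-based priorities proportional to rank ** -alpha; the names build_strata_ends and has_empty_stratum, and the value alpha = 0.7, are assumptions for illustration. It walks the CDF in steps of 1 / batch_size, and when the first rank alone carries more probability mass than one step, the comparison against step is False several times in a row, so consecutive boundaries coincide.

```python
def build_strata_ends(n, alpha, batch_size):
    """Split ranks 1..n into batch_size strata of roughly equal probability mass."""
    weights = [rank ** -alpha for rank in range(1, n + 1)]
    total = sum(weights)

    # cdf[i] is the cumulative probability of ranks 1..i+1.
    cdf = []
    running = 0.0
    for w in weights:
        running += w / total
        cdf.append(running)

    strata_ends = {1: 0, batch_size + 1: n}
    step = 1.0 / batch_size
    index = 1
    for s in range(2, batch_size + 1):
        # Advance only while the cumulative mass is still below the target.
        while index < n and cdf[index - 1] < step:
            index += 1
        strata_ends[s] = index
        step += 1.0 / batch_size
    return strata_ends


def has_empty_stratum(strata_ends, batch_size):
    """True if some stratum contains no rank, which would break randint."""
    return any(strata_ends[s + 1] <= strata_ends[s]
               for s in range(1, batch_size + 1))


# Few ranks relative to the batch size: the early boundaries repeat.
small = build_strata_ends(n=4, alpha=0.7, batch_size=8)
# Many ranks relative to the batch size: every stratum is non-empty.
large = build_strata_ends(n=1000, alpha=0.7, batch_size=4)
```

In the small case, boundaries 2 and 3 both land on rank 1, which leads to a randint(2, 1) call like the one in the error above. If the library behaves similarly, this would explain why the number of stored experiences per distribution has to be large relative to the batch size.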
Is this expected? Would you have any advice on what I am doing wrong?
My ER set up is: