-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathresults.txt
140 lines (138 loc) · 12.1 KB
/
results.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
Hyper-parameter tuning results from running 5 train & simulates for 27 different combinations of epsilon, discount factor, and learning rate.
RESULTS: 10/21/22 2 AM - 12 PM
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.1% | W: 41567, D: 28125, L: 30308
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.1% | W: 49210, D: 20548, L: 30242
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.1% | W: 44652, D: 24029, L: 31319
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.1% | W: 38394, D: 38932, L: 22674
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.1% | W: 34417, D: 34444, L: 31139
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.2% | W: 32115, D: 34796, L: 33089
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.2% | W: 34150, D: 42482, L: 23368
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.2% | W: 23586, D: 36881, L: 39533
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.2% | W: 56377, D: 17733, L: 25890
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.2% | W: 30591, D: 30782, L: 38627
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.3% | W: 32652, D: 34598, L: 32750
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.3% | W: 31035, D: 29746, L: 39219
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.3% | W: 31288, D: 36141, L: 32571
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.3% | W: 28029, D: 42501, L: 29470
epsilon: 0.1%, discount factor: 0.2%, learning rate: 0.3% | W: 57095, D: 18728, L: 24177
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.1% | W: 55623, D: 21250, L: 23127
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.1% | W: 32010, D: 35319, L: 32671
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.1% | W: 35753, D: 27443, L: 36804
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.1% | W: 38708, D: 23416, L: 37876
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.1% | W: 10516, D: 74953, L: 14531
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.2% | W: 35058, D: 26188, L: 38754
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.2% | W: 22844, D: 35291, L: 41865
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.2% | W: 39163, D: 30420, L: 30417
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.2% | W: 33287, D: 41279, L: 25434
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.2% | W: 51618, D: 21147, L: 27235
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.3% | W: 49833, D: 34459, L: 15708
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.3% | W: 53756, D: 17358, L: 28886
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.3% | W: 26205, D: 46944, L: 26851
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.3% | W: 55521, D: 18741, L: 25738
epsilon: 0.1%, discount factor: 0.4%, learning rate: 0.3% | W: 51222, D: 32618, L: 16160
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.1% | W: 44857, D: 25863, L: 29280
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.1% | W: 42031, D: 21830, L: 36139
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.1% | W: 48335, D: 20252, L: 31413
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.1% | W: 36392, D: 34386, L: 29222
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.1% | W: 50327, D: 17664, L: 32009
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.2% | W: 17889, D: 67348, L: 14763
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.2% | W: 38991, D: 26354, L: 34655
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.2% | W: 50879, D: 24137, L: 24984
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.2% | W: 40554, D: 22912, L: 36534
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.2% | W: 57583, D: 20336, L: 22081
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.3% | W: 28199, D: 32807, L: 38994
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.3% | W: 44609, D: 20101, L: 35290
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.3% | W: 45396, D: 22653, L: 31951
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.3% | W: 50182, D: 23559, L: 26259
epsilon: 0.1%, discount factor: 0.6%, learning rate: 0.3% | W: 18449, D: 65520, L: 16031
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.1% | W: 24236, D: 56794, L: 18970
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.1% | W: 32511, D: 42147, L: 25342
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.1% | W: 48551, D: 26028, L: 25421
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.1% | W: 35880, D: 31290, L: 32830
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.1% | W: 40115, D: 23610, L: 36275
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.2% | W: 23002, D: 46422, L: 30576
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.2% | W: 36130, D: 43882, L: 19988
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.2% | W: 37870, D: 37992, L: 24138
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.2% | W: 33686, D: 48539, L: 17775
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.2% | W: 38698, D: 30485, L: 30817
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.3% | W: 34135, D: 40684, L: 25181
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.3% | W: 31577, D: 24027, L: 44396
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.3% | W: 33053, D: 43726, L: 23221
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.3% | W: 37827, D: 23391, L: 38782
epsilon: 0.2%, discount factor: 0.2%, learning rate: 0.3% | W: 28679, D: 43236, L: 28085
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.1% | W: 37517, D: 30691, L: 31792
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.1% | W: 39008, D: 26377, L: 34615
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.1% | W: 54242, D: 20834, L: 24924
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.1% | W: 48200, D: 29439, L: 22361
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.1% | W: 19119, D: 61294, L: 19587
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.2% | W: 48673, D: 21620, L: 29707
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.2% | W: 29133, D: 32136, L: 38731
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.2% | W: 34660, D: 42582, L: 22758
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.2% | W: 22158, D: 35651, L: 42191
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.2% | W: 36097, D: 30755, L: 33148
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.3% | W: 30400, D: 39904, L: 29696
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.3% | W: 38006, D: 27715, L: 34279
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.3% | W: 33208, D: 35777, L: 31015
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.3% | W: 20704, D: 59019, L: 20277
epsilon: 0.2%, discount factor: 0.4%, learning rate: 0.3% | W: 34368, D: 31851, L: 33781
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.1% | W: 43483, D: 27021, L: 29496
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.1% | W: 43021, D: 30364, L: 26615
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.1% | W: 22316, D: 31642, L: 46042
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.1% | W: 41262, D: 34845, L: 23893
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.1% | W: 39344, D: 29427, L: 31229
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.2% | W: 13557, D: 47025, L: 39418
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.2% | W: 34253, D: 35469, L: 30278
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.2% | W: 35476, D: 26742, L: 37782
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.2% | W: 39182, D: 31301, L: 29517
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.2% | W: 32905, D: 41611, L: 25484
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.3% | W: 26421, D: 47125, L: 26454
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.3% | W: 38237, D: 26465, L: 35298
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.3% | W: 41033, D: 25271, L: 33696
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.3% | W: 56757, D: 18781, L: 24462
epsilon: 0.2%, discount factor: 0.6%, learning rate: 0.3% | W: 32765, D: 31086, L: 36149
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.1% | W: 41817, D: 26237, L: 31946
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.1% | W: 37180, D: 30531, L: 32289
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.1% | W: 51964, D: 23824, L: 24212
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.1% | W: 39132, D: 31186, L: 29682
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.1% | W: 38936, D: 26488, L: 34576
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.2% | W: 35268, D: 25111, L: 39621
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.2% | W: 49501, D: 18351, L: 32148
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.2% | W: 33922, D: 26841, L: 39237
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.2% | W: 46514, D: 22009, L: 31477
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.2% | W: 31766, D: 28814, L: 39420
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.3% | W: 49331, D: 20544, L: 30125
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.3% | W: 39505, D: 23110, L: 37385
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.3% | W: 33427, D: 37779, L: 28794
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.3% | W: 31683, D: 34984, L: 33333
epsilon: 0.3%, discount factor: 0.2%, learning rate: 0.3% | W: 40658, D: 39087, L: 20255
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.1% | W: 21474, D: 42433, L: 36093
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.1% | W: 36170, D: 37217, L: 26613
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.1% | W: 56366, D: 19943, L: 23691
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.1% | W: 29049, D: 35511, L: 35440
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.1% | W: 57397, D: 17044, L: 25559
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.2% | W: 29731, D: 40070, L: 30199
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.2% | W: 49234, D: 22410, L: 28356
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.2% | W: 31071, D: 32571, L: 36358
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.2% | W: 32405, D: 38889, L: 28706
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.2% | W: 34642, D: 37497, L: 27861
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.3% | W: 23237, D: 37461, L: 39302
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.3% | W: 36428, D: 31203, L: 32369
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.3% | W: 40895, D: 27180, L: 31925
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.3% | W: 29885, D: 34852, L: 35263
epsilon: 0.3%, discount factor: 0.4%, learning rate: 0.3% | W: 38263, D: 27799, L: 33938
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.1% | W: 24754, D: 40001, L: 35245
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.1% | W: 37541, D: 26567, L: 35892
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.1% | W: 31750, D: 42628, L: 25622
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.1% | W: 32327, D: 31363, L: 36310
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.1% | W: 36312, D: 31702, L: 31986
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.2% | W: 36892, D: 37277, L: 25831
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.2% | W: 30567, D: 29586, L: 39847
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.2% | W: 39033, D: 36264, L: 24703
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.2% | W: 38740, D: 27584, L: 33676
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.2% | W: 14057, D: 38804, L: 47139
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.3% | W: 45440, D: 27296, L: 27264
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.3% | W: 34180, D: 26154, L: 39666
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.3% | W: 48393, D: 23069, L: 28538
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.3% | W: 29251, D: 40711, L: 30038
epsilon: 0.3%, discount factor: 0.6%, learning rate: 0.3% | W: 35111, D: 38165, L: 26724
FURTHER RESULTS: ?