Skip to content

Commit

Permalink
regenerate static site
Browse files Browse the repository at this point in the history
  • Loading branch information
danvk committed Nov 29, 2024
1 parent 48428b2 commit 7af2224
Show file tree
Hide file tree
Showing 6 changed files with 46 additions and 34 deletions.
2 changes: 1 addition & 1 deletion data/lat-lon-to-ids.json

Large diffs are not rendered by default.

6 changes: 6 additions & 0 deletions data/self-hosted-sizes.txt
Original file line number Diff line number Diff line change
Expand Up @@ -28,23 +28,29 @@
1113275,760,253
1113276,760,252
1113277,760,252
1113278,760,254
1113279,760,253
1113280,760,258
1113281,760,254
1113282,760,257
1113283,760,261
1113284,760,252
1113285,760,257
1113286,760,262
1113287,760,259
1113288,760,264
1113289,760,261
1113290,760,260
1113291,760,259
1113292,760,256
1113293,760,257
1113294,760,259
1113295,760,253
1113296,760,264
1113297,760,261
1113298,760,262
1113299,760,258
1113300,760,259
1113301,760,256
1113302,760,254
1229128,579,760
Expand Down
45 changes: 24 additions & 21 deletions test/geocoding-stats.txt
Original file line number Diff line number Diff line change
@@ -1,20 +1,22 @@
-- Finalizing fifth --
Fifth Avenue: 77 claimed
-- Finalizing title-cross --
titles matched: 0
alt titles matched: 0
total matches: 31270
counters: [('boro-int', 30503), ('title', 30096), ('alt_title', 1174), ('at-int', 288), ('num-prefix', 271), ('between', 208)]
grid: 25380 (31113 attempts)
Grid statistics:
Counts: [('exact', 13920), ('dir strip', 12957), ('exact: str', 759), ('interpolated', 176), ('unclaimed', 136), ('extrapolated', 105), ('cursed', 7), ('exact_grid', 3)]
Counts: [('exact', 13872), ('dir strip', 12952), ('exact: str', 759), ('interpolated', 176), ('unclaimed', 135), ('extrapolated', 105), ('cursed', 7), ('exact_grid', 3)]
Unknown avenues: Counter({'13': 38, 'Broadway': 9, '36': 7, '19': 6, '85': 5, '62': 3, '53': 2, '88': 2, '93': 2, '95': 2, '22': 2, '98': 2, '57': 1, '49': 1, '26': 1, '46': 1, '44': 1, '96': 1})
Unknown streets: Counter({'212': 11, '193': 10, '145': 6, '129': 3, '213': 3, '157': 2, '139': 2, '143': 2, '192': 2, '142': 2, '181': 2, '130': 2, '159': 1, '174': 1, '208': 1})
Unknown streets: Counter({'212': 11, '193': 10, '145': 6, '129': 3, '213': 3, '157': 2, '139': 2, '143': 2, '192': 2, '142': 2, '181': 2, '130': 2, '174': 1, '208': 1})
google: 1700
boro mismatch: 434
failures: 3593
Google geocoder stats:
Cache misses: 0
Cache files hit: 7313
[('google: intersection - fail', 8242), ('google: intersection - success', 2217), ('google: address - success', 1660), ('google: intersection - boro mismatch', 902), ('google: address - fail', 138), ('google: address - boro mismatch', 79), ('cursed', 10)]
Cache files hit: 7300
[('google: intersection - fail', 8235), ('google: intersection - success', 2217), ('google: address - success', 1656), ('google: intersection - boro mismatch', 901), ('google: address - fail', 137), ('google: address - boro mismatch', 79), ('cursed', 10)]
-- Finalizing title-address --
address matches: 625
patterns: [('street_pound', 426), ('num_street', 199)]
Expand All @@ -23,24 +25,24 @@ Google geocoder stats:
failures: 16
Google geocoder stats:
Cache misses: 0
Cache files hit: 7313
[('google: intersection - fail', 8242), ('google: intersection - success', 2217), ('google: address - success', 1660), ('google: intersection - boro mismatch', 902), ('google: address - fail', 138), ('google: address - boro mismatch', 79), ('cursed', 10)]
Cache files hit: 7300
[('google: intersection - fail', 8235), ('google: intersection - success', 2217), ('google: address - success', 1656), ('google: intersection - boro mismatch', 901), ('google: address - fail', 137), ('google: address - boro mismatch', 79), ('cursed', 10)]
-- Finalizing gpt --
GPT POI: 14556
GPT address: 2795
GPT intersection: 10062
grid: 2540 (8290 attempts)
GPT POI: 14441
GPT address: 2760
GPT intersection: 9985
grid: 2487 (8228 attempts)
Grid statistics:
Counts: [('exact', 13920), ('dir strip', 12957), ('exact: str', 759), ('interpolated', 176), ('unclaimed', 136), ('extrapolated', 105), ('cursed', 7), ('exact_grid', 3)]
Counts: [('exact', 13872), ('dir strip', 12952), ('exact: str', 759), ('interpolated', 176), ('unclaimed', 135), ('extrapolated', 105), ('cursed', 7), ('exact_grid', 3)]
Unknown avenues: Counter({'13': 38, 'Broadway': 9, '36': 7, '19': 6, '85': 5, '62': 3, '53': 2, '88': 2, '93': 2, '95': 2, '22': 2, '98': 2, '57': 1, '49': 1, '26': 1, '46': 1, '44': 1, '96': 1})
Unknown streets: Counter({'212': 11, '193': 10, '145': 6, '129': 3, '213': 3, '157': 2, '139': 2, '143': 2, '192': 2, '142': 2, '181': 2, '130': 2, '159': 1, '174': 1, '208': 1})
google: 1577
boro mismatch: 538
failures: 4771
Unknown streets: Counter({'212': 11, '193': 10, '145': 6, '129': 3, '213': 3, '157': 2, '139': 2, '143': 2, '192': 2, '142': 2, '181': 2, '130': 2, '174': 1, '208': 1})
google: 1573
boro mismatch: 537
failures: 4763
Google geocoder stats:
Cache misses: 0
Cache files hit: 7313
[('google: intersection - fail', 8242), ('google: intersection - success', 2217), ('google: address - success', 1660), ('google: intersection - boro mismatch', 902), ('google: address - fail', 138), ('google: address - boro mismatch', 79), ('cursed', 10)]
Cache files hit: 7300
[('google: intersection - fail', 8235), ('google: intersection - success', 2217), ('google: address - success', 1656), ('google: intersection - boro mismatch', 901), ('google: address - fail', 137), ('google: address - boro mismatch', 79), ('cursed', 10)]
-- Finalizing special --
Special cases: [('Columbus Circle', 25), ('China Daily News', 23), ('Squatters: Camp Thomas Paine', 6), ('Mt. Sinai', 3), ('St. John the Divine', 1)]
-- Finalizing subjects --
Expand All @@ -60,12 +62,13 @@ POI/subject geocoding:
250 n_title_island
501 n_title_park
-- Final stats --
77 fifth
27080 title-cross
600 title-address
4117 gpt
4060 gpt
58 special
1932 subjects
33787 (total)
33807 (total)
Dropped w/ no date: 0
Unique lat/longs: 10597
Total photographs: 33787
Unique lat/longs: 10669
Total photographs: 33807
4 changes: 2 additions & 2 deletions test/random200-geocoded.txt
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
104536 failed n/a n/a
104780 failed n/a n/a
104881 failed n/a n/a
1113261 gpt (40.75096, -73.982726) Fifth Avenue and West 38th St
1113271 gpt (40.756659, -73.978557) Fifth Avenue and West 47th St
1113261 fifth (40.7512051, -73.9824083) Fifth Avenue
1113271 fifth (40.7566585, -73.9784762) Fifth Avenue
1231043 failed n/a n/a
1238854 failed n/a n/a
1507627 title-cross (40.750907, -73.980677) Manhattan: Madison Avenue - 39th Street,
Expand Down
17 changes: 10 additions & 7 deletions test/random200.logs.txt
Original file line number Diff line number Diff line change
@@ -1,12 +1,14 @@
Filtered to 200/41463 records with --ids_filter (200)
-- Finalizing fifth --
Fifth Avenue: 2 claimed
-- Finalizing title-cross --
titles matched: 0
alt titles matched: 0
total matches: 150
counters: [('title', 148), ('boro-int', 148), ('alt_title', 2), ('num-prefix', 1), ('at-int', 1)]
grid: 126 (150 attempts)
Grid statistics:
Counts: [('dir strip', 73), ('exact', 63), ('exact: str', 2), ('unclaimed', 2), ('extrapolated', 1)]
Counts: [('dir strip', 73), ('exact', 61), ('exact: str', 2), ('unclaimed', 2), ('extrapolated', 1)]
Unknown avenues: Counter({'19': 2})
Unknown streets: Counter()
google: 7
Expand All @@ -27,12 +29,12 @@ Google geocoder stats:
Cache files hit: 50
[('google: intersection - fail', 40), ('google: intersection - success', 12), ('google: address - success', 6), ('google: intersection - boro mismatch', 2)]
-- Finalizing gpt --
GPT POI: 60
GPT address: 11
GPT intersection: 52
grid: 13 (43 attempts)
GPT POI: 58
GPT address: 10
GPT intersection: 49
grid: 11 (41 attempts)
Grid statistics:
Counts: [('dir strip', 73), ('exact', 63), ('exact: str', 2), ('unclaimed', 2), ('extrapolated', 1)]
Counts: [('dir strip', 73), ('exact', 61), ('exact: str', 2), ('unclaimed', 2), ('extrapolated', 1)]
Unknown avenues: Counter({'19': 2})
Unknown streets: Counter()
google: 8
Expand All @@ -56,9 +58,10 @@ POI/subject geocoding:
2 n_title_bridge
2 n_title_island
-- Final stats --
2 fifth
133 title-cross
3 title-address
21 gpt
19 gpt
1 special
9 subjects
167 (total)
6 changes: 3 additions & 3 deletions test/site-stats.txt
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
Missing popular: ['721912f-b']
NYPL items on site: 33787
Unique photos on site: 49081
NYPL items on site: 33807
Unique photos on site: 49101
Text-less photos: 1380
Unique lat/lngs: 10597
Unique lat/lngs: 10669
Orphaned popular photos: 1 / 54

0 comments on commit 7af2224

Please sign in to comment.