Skip to content

Commit

Permalink
Merge branch 'rpb-154-missingLabels' of https://github.com/hbz/rpb in…
Browse files Browse the repository at this point in the history
  • Loading branch information
fsteeg committed Apr 17, 2024
2 parents 55fb38b + 61f1be6 commit 4763f0d
Show file tree
Hide file tree
Showing 18 changed files with 18,756 additions and 16,116 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -22,4 +22,5 @@ RPB-Export_HBZ_SWN.txt
RPB-Export_HBZ_Syst.txt
RPB-Export_HBZ_ZSS.txt
conf/RPBEXP/*.ZIP
conf/strapi-export.tar.gz
nohup.out*
8 changes: 8 additions & 0 deletions conf/map-gnd-person-to-label.fix
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
if any_contain("gndIdentifier", "Keine")
reject()
end

replace_all("preferredName", "\\/\\s(ca\\.|um)?-?\\s?\\d.+$", "")
trim("preferredName")

retain("gndIdentifier", "preferredName")
8 changes: 8 additions & 0 deletions conf/map-gnd-person-to-label.flux
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
FLUX_DIR + "output/rppd-export.jsonl"
| open-file
| as-lines
| decode-json(recordPath="data")
| fix(FLUX_DIR + "map-gnd-person-to-label.fix")
| encode-csv(includeheader="true", noquotes="true",separator="\t")
| write(FLUX_DIR + "maps/gndId-to-label.tsv")
;
8 changes: 8 additions & 0 deletions conf/map-rppd-to-label.fix
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
unless any_contain("gndIdentifier", "Keine")
reject()
end

replace_all("preferredName", "\\/\\s(ca\\.|um)?-?\\s?\\d.+$", "")
trim("preferredName")

retain("rppdId", "preferredName")
8 changes: 8 additions & 0 deletions conf/map-rppd-to-label.flux
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
FLUX_DIR + "output/rppd-export.jsonl"
| open-file
| as-lines
| decode-json(recordPath="data")
| fix(FLUX_DIR + "map-rppd-to-label.fix")
| encode-csv(includeheader="true", noquotes="true",separator="\t")
| write(FLUX_DIR + "maps/rppdId-to-label.tsv")
;
Loading

0 comments on commit 4763f0d

Please sign in to comment.