Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Custom marker mismatch between source project and target project when translating #602

Open
mmartin9684-sil opened this issue Dec 2, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@mmartin9684-sil
Copy link
Collaborator

There are cases where the source project defines custom markers that are not defined in the target project. Copying these markers into the draft text may cause issues with the target project if the custom marker is not defined in the target project. In some cases, the custom marker is not correctly recognized during parsing, and the draft text includes the opening marker but does not include the marker content nor closing marker.

As an example, the BHNSB translation defines an "extended footnote" marker ("\ef", "\ef*"). When books from the BHNSB are translated into a target language, the opening "\ef" marker is retrained in the draft that is created, while the content of the marker and the closing marker is removed. The BHNSB text for Ruth 1:22 has an extended footnote:

\v 17 Pale utakapofia hapo nitakufa nami, \q1 na papo hapo nitazikwa; \q1 Mwenyezi-Mungu anipe adhabu kali kama nikitenganishwa nawe \q1 isipokuwa tu kwa kifo.”\ef f \fr 1:16-17 \ft Aya hizi zimewekwa katika muundo wa utenzi au shairi. Muhimu katika utenzi huu ni tamko la Ruthu la kujiweka mmoja na jumuiya ya Waisraeli.\ef*

which gets translated as follows in the draft verse:

\v 17 Palu' pawaalu'a waa ngapaniwaa, no sikilwa. Nnu'ngu' anipei ukumu ngali' mana naabu'kana na wenga, ila kwa kiwo bai." \q1 \q1 \q1 \ef

(Note the opening marker ("\ef") with no marker content and closing marker.)

@ddaspit ddaspit added the bug Something isn't working label Dec 3, 2024
@ddaspit
Copy link
Collaborator

ddaspit commented Dec 3, 2024

We will have a similar issue in Serval as well.

@ddaspit
Copy link
Collaborator

ddaspit commented Dec 3, 2024

@mmartin9684-sil Are you okay with us stripping out custom markers entirely when this occurs?

@ddaspit ddaspit moved this from 🆕 New to 🔖 Ready in SIL-NLP Research Dec 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: 🔖 Ready
Development

No branches or pull requests

2 participants