Barcode scanning fails with "Unknown encoding" for ISO-8859-1 encoded data matrix #218

dspoeri · 2021-01-10T18:54:07Z

The official German medication plan data matrix ("BMP", Bundeseinheitlicher Medikationsplan) expects data to be encoded with ISO-8859-1. If the data contains a German umlaut, Google Vision barcode scanning fails with an "Unknown encoding" error.

Scanning the following data matrix reproduces the bug:

This bug sadly renders Google Vision barcode scanning useless for the mentioned use case.

Two suggested solutions:

- accept encodings other than ASCII and UTF-8
- provide access to the raw data through a byte array
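
To illustrate why the encoding matters here, the following is a minimal sketch using only the JDK, not ML Kit. It assumes the scanner tries to interpret the payload as UTF-8; the thread does not confirm what the library does internally. The class name `Latin1VsUtf8`, the helper `strictDecode`, and the sample text are made up for this example.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

public class Latin1VsUtf8 {
    /** Strictly decodes the bytes; returns null if they are not valid in the given charset. */
    public static String strictDecode(byte[] bytes, Charset charset) {
        try {
            return charset.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes))
                    .toString();
        } catch (CharacterCodingException e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // Hypothetical sample text with an umlaut; in ISO-8859-1 the umlaut is the single byte 0xE4.
        String original = "T\u00E4gl.";
        byte[] latin1 = original.getBytes(StandardCharsets.ISO_8859_1);

        // 0xE4 starts a three-byte UTF-8 sequence, but the next byte is not a continuation byte,
        // so strict UTF-8 decoding rejects the data.
        System.out.println(strictDecode(latin1, StandardCharsets.UTF_8) == null);

        // Decoding the same bytes as ISO-8859-1 restores the original text.
        System.out.println(original.equals(strictDecode(latin1, StandardCharsets.ISO_8859_1)));
    }
}
```

Either suggested solution would avoid this: with a configurable charset the library could decode correctly, and with untouched raw bytes the caller could do the decoding.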

GarryKelly · 2021-01-12T15:28:18Z

Just saw this; it is similar to an issue reported last year, which I commented on in #44 (comment). Unfortunately, it meant the library just didn't work for our use cases. It's a shame, as it's an excellent library otherwise.

I agree with the suggested solutions. It would be wonderful for the library either to support the ISO-8859-1 character set as an option, or to provide access to the scanned data as a byte array without going through any character set conversion. Either option would allow reading of all barcodes.

I noticed that a new version, com.google.firebase:firebase-ml-vision-barcode-model:16.1.2, was released later in 2020, but I haven't had time to check whether it provides that access.

ivan200 · 2021-01-13T10:37:51Z

At least com.google.mlkit:barcode-scanning:16.1.0
contains barcode.rawBytes

Returns raw bytes as it was encoded in the barcode.
Returns null if the raw bytes can not be determined.

So I think you can write:
return String(barcode.rawBytes, StandardCharsets.ISO_8859_1)

dspoeri · 2021-01-13T14:57:40Z

> At least com.google.mlkit:barcode-scanning:16.1.0
> contains barcode.rawBytes
>
> Returns raw bytes as it was encoded in the barcode.
> Returns null if the raw bytes can not be determined.
>
> so I think you can make
> return String(barcode.rawBytes, StandardCharsets.ISO_8859_1)

It doesn't help:
rawBytes returns a 16-byte array that represents the string "Unknown encoding".
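
For anyone who still wants to try the raw-bytes route, here is a rough sketch of a defensive decode. It takes a plain `byte[]` (for example, whatever `barcode.rawBytes` returned) and treats both `null` and the 16-byte "Unknown encoding" placeholder described above as "no usable data". The class and method names are hypothetical, and the placeholder check is only a workaround based on the behaviour reported in this thread, not documented library behaviour.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class RawBytesGuard {
    // The 16-byte placeholder observed in this thread (assumption: it is plain ASCII).
    private static final byte[] PLACEHOLDER =
            "Unknown encoding".getBytes(StandardCharsets.US_ASCII);

    /** Returns the ISO-8859-1 text, or null if the bytes are missing or only the placeholder. */
    public static String decodeLatin1OrNull(byte[] rawBytes) {
        if (rawBytes == null || Arrays.equals(rawBytes, PLACEHOLDER)) {
            return null;
        }
        // ISO-8859-1 maps every byte value to exactly one character, so this cannot fail.
        return new String(rawBytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        byte[] umlaut = {0x54, (byte) 0xE4}; // 'T' followed by a-umlaut in ISO-8859-1
        System.out.println("T\u00E4".equals(decodeLatin1OrNull(umlaut)));
        System.out.println(decodeLatin1OrNull(PLACEHOLDER) == null);
        System.out.println(decodeLatin1OrNull(null) == null);
    }
}
```

One caveat: a barcode whose real content is exactly the text "Unknown encoding" would also be rejected by this check.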

cs-googler · 2021-04-05T17:10:07Z

Hi, we are working on a fix internally.

pke · 2023-03-14T15:47:45Z

So @cs-googler, how is the internal fix going? How about letting the user specify the encoding via BarcodeScannerOptions?

mattemyoo · 2024-12-09T13:47:32Z

@cs-googler

Hello. We are also trying to scan barcodes that include letters with umlauts.

When we are scanning a barcode that contains "ä", it becomes a "d". Even in the rawBytes array, we are getting the value 100, which corresponds to ASCII character "d".

@pke Did you get any new information about this?

mattemyoo · 2024-12-09T14:11:33Z

So, I did some research, and it seems like the type that you are using only covers the first 128 values (0 to 127); for anything above that, the value starts from 0 again.

For example, the character "µ" has the code 181 in ISO-8859-1, but in the rawBytes I am getting the value 53 (ASCII character "5", and 181 - 128 = 53).

Another example is the inverted exclamation mark ("¡", code 161 in ISO-8859-1). The actual raw byte I am receiving is 33 (161 - 128 = 33), which is the ASCII character "!".
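
The numbers reported here are consistent with the top bit of each byte being dropped somewhere along the way. The sketch below only reproduces that arithmetic; it is not taken from the library, and whether this is the real cause is an assumption. `HighBitLoss` and `dropHighBit` are names invented for the example.

```java
public class HighBitLoss {
    /** Clears bit 7, keeping only the low seven bits of a byte value. */
    public static int dropHighBit(int value) {
        return value & 0x7F;
    }

    public static void main(String[] args) {
        // ISO-8859-1 codes for a-umlaut (228), micro sign (181), inverted exclamation mark (161).
        int[] latin1Codes = {0xE4, 0xB5, 0xA1};
        for (int code : latin1Codes) {
            int observed = dropHighBit(code);
            System.out.println(code + " -> " + observed + " ('" + (char) observed + "')");
        }
        // Prints:
        // 228 -> 100 ('d')
        // 181 -> 53 ('5')
        // 161 -> 33 ('!')
    }
}
```

If this is what happens, the loss cannot be undone on the caller's side: after masking, 228 and a genuine "d" (100) are indistinguishable.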

cs-googler added the enhancement New feature or request label Apr 5, 2021

miworking3 assigned zhouyiself Aug 30, 2022
