Fix writing multi-byte characters to tar archive #627

TadCordle · 2018-07-16T16:06:03Z

Fixes #626. Looks like we were using string length instead of byte array length.
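For context, a minimal sketch of why the two lengths differ. This is not the actual jib code; the class and method names are made up for illustration. `String.length()` counts UTF-16 code units, while the size recorded for a tar entry has to match the number of bytes written after encoding, so any non-ASCII content makes the two diverge. The sample string uses Unicode escapes so the result doesn't depend on the source file encoding.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of jib.
public class ByteLengthDemo {

  /** Number of UTF-16 code units, which is what String.length() reports. */
  public static int charLength(String s) {
    return s.length();
  }

  /** Number of bytes produced when the string is encoded as UTF-8. */
  public static int utf8Length(String s) {
    return s.getBytes(StandardCharsets.UTF_8).length;
  }

  public static void main(String[] args) {
    // "h\u00e9llo \u65e5\u672c": 8 chars, but two of them need 3 bytes and one needs 2 in UTF-8.
    String contents = "h\u00e9llo \u65e5\u672c";

    System.out.println("String.length(): " + charLength(contents)); // 8
    System.out.println("UTF-8 byte count: " + utf8Length(contents)); // 13
  }
}
```

If the entry size were set from the first number, the archive would declare 8 bytes while 13 are written, which is the kind of mismatch described above.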

loosebazooka · 2018-07-16T17:02:45Z

Can we use codePointCount? Not saying we should, just curious.
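A quick sketch for comparison, assuming UTF-8 is the encoding being written (the class name here is hypothetical). `codePointCount` counts Unicode code points, which is still not a byte count: a code point takes anywhere from 1 to 4 bytes in UTF-8, so it would only agree with the encoded length for pure ASCII.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of jib.
public class CodePointDemo {

  /** Returns {UTF-16 code units, Unicode code points, UTF-8 bytes} for the given string. */
  public static int[] counts(String s) {
    return new int[] {
      s.length(),
      s.codePointCount(0, s.length()),
      s.getBytes(StandardCharsets.UTF_8).length
    };
  }

  public static void main(String[] args) {
    // ASCII, a 2-byte character, a 3-byte character, and a surrogate pair (4 bytes in UTF-8).
    String[] samples = {"abc", "\u00e9", "\u65e5", "\uD83D\uDE00"};
    for (String s : samples) {
      int[] c = counts(s);
      System.out.printf("chars=%d codePoints=%d utf8Bytes=%d%n", c[0], c[1], c[2]);
    }
  }
}
```

Only the third number is safe to use as the size of the encoded data.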

briandealwis

addEntry() seems to be called by toTarballBlob() to add what's described as a blob. Why don't we just pass the file contents as byte[], or better yet as ByteSource, and let the caller deal with the encoding?

If we used ByteSource then we could get rid of the other addEntry(TarArchiveEntry) and use MoreFiles.asByteSource(filePath).

TadCordle · 2018-07-16T21:22:18Z

So it looks like the main problem has been fixed, but using multibyte characters in tar archive entry header names on Windows might still be sketchy (e.g. adding non-ASCII filenames might not work). Multibyte file contents should work on all platforms, though. I'll keep looking into it, but feel free to review this now so we can get the fix out.

coollog · 2018-07-16T22:32:42Z

@briandealwis Blob is our own minimal interface that is essentially a ByteSource, but you make a good point. Can you file a separate issue for that, since it's not directly related to this fix?

coollog · 2018-07-17T01:06:37Z

@TadCordle oh we should add a CHANGELOG entry for this

chanseokoh · 2018-07-17T02:37:31Z

Cool, just in case, have you tested dockerBuild and buildTar?

TadCordle · 2018-07-17T02:43:56Z

Yeah, tested both with the same args used in the original issue.

Fix writing multi-byte characters to tar archive

9c2c054

TadCordle requested review from briandealwis, coollog and chanseokoh July 16, 2018 16:06

TadCordle and others added 4 commits July 16, 2018 12:06

Merge branch 'master' of github.com:GoogleContainerTools/jib into fix-2byte

63774e4

Merge branch 'master' into fix-2byte

9dd3ac9

Windows?

2df4cef

Merge branch 'fix-2byte' of github.com:GoogleContainerTools/jib into fix-2byte

985e685

briandealwis reviewed Jul 16, 2018

TadCordle added 3 commits July 16, 2018 15:34

Try something

15d6c7a

Maybe it's the source file encoding

c52cce8

Move multi-byte chars out of code

28ce5b6

TadCordle added the PR: Not Ready label Jul 16, 2018

TadCordle added 3 commits July 16, 2018 16:10

Try byte array comparison again to see what's wrong

1467086

Experimenting

9abeda3

Is it just the header file name?

3682e70

TadCordle removed the PR: Not Ready label Jul 16, 2018

coollog approved these changes Jul 16, 2018

Merge branch 'master' into fix-2byte

f70ec70

TadCordle merged commit 4d1b5de into master Jul 16, 2018

TadCordle deleted the fix-2byte branch July 16, 2018 23:04

briandealwis mentioned this pull request Jul 17, 2018

Simplify TarStreamBuilder#addEntry() to deal with bytes #630

Closed
