Different Zip programs expect filenames with non-ASCII characters to be encoded in different charsets. For example, to make files with Cyrillic filenames visible in Windows Explorer ("compressed folders"), I have to encode them in cp866. But that makes them unreadable by other archivers, such as 7-Zip and Info-ZIP, which expect the windows-1251 charset (or the Unicode field, see below).
A solution was proposed by the Info-ZIP developers: it allows the filename to be specified in Unicode (UTF-8) in an extra header field. This field is called the Unicode Path Extra Field (0x7075) in the .ZIP File Format Specification (http://www.pkware.com/documents/casestudies/APPNOTE.TXT).
Many archivers (including the two mentioned above) support this feature. It is also used by Gmail's "download all attachments" to encode attachments with non-ASCII names. Such filenames are therefore specified twice: first in the regular header field in some 8-bit charset (for old software), and second in the extra field.
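To make the layout concrete, here is a minimal sketch of how such an extra field could be built. It assumes the layout given in APPNOTE.TXT for the Info-ZIP Unicode Path Extra Field: header ID 0x7075, data size, a version byte (currently 1), the CRC-32 of the regular (8-bit) filename field, and then the UTF-8 name, with all integers little-endian. The function name `unicode_path_extra_field` and the sample filename are made up for illustration; this is not mod_zip code.

```python
import struct
import zlib


def unicode_path_extra_field(legacy_name: bytes, unicode_name: str) -> bytes:
    """Build an Info-ZIP Unicode Path Extra Field (header ID 0x7075).

    legacy_name  -- the bytes stored in the regular filename header field
    unicode_name -- the same filename as a Unicode string
    (Hypothetical helper, named here only for illustration.)
    """
    utf8_name = unicode_name.encode("utf-8")
    # The CRC-32 of the regular filename field lets a reader detect that
    # the regular name was changed without the extra field being updated.
    name_crc = zlib.crc32(legacy_name) & 0xFFFFFFFF
    # Payload: version (1 byte, currently 1), NameCRC32 (4 bytes), UTF-8 name.
    payload = struct.pack("<BI", 1, name_crc) + utf8_name
    # Extra field header: ID (2 bytes) and payload size (2 bytes).
    return struct.pack("<HH", 0x7075, len(payload)) + payload


# Example: a Cyrillic filename stored as cp866 in the regular header field.
name = "привет.txt"
legacy = name.encode("cp866")
field = unicode_path_extra_field(legacy, name)
```

The same bytes would go into the extra field of both the local file header and the central directory entry for that file, while the regular filename field keeps the 8-bit encoding for older software.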
It would be great if mod_zip could support Unicode Paths.