Add option to unzip to detect the encoding of filenames
The ZIP file format http://
When extracting files such files on current Linux distributions, the unzip tool fails to create any files as the filename is invalid. GUI utilities such as file-roller (the default compressed file manager of GNOME and subseqeuntly Ubuntu) fails to extract those files because they use the "unzip" utility to handle Zip files.
A. References
a. Summary of discussion on Zip encoding issues at Linux-utf8 mailing list: http://
b. RFE: handle non-ascii filenames in archive properly: https:/
filenames in non-UTF-8 encodings are not handled correctly: http://
c. Autoconvert filename encoding to UTF-8: https:/
d. Another Windows zipfile encoding problem, with patch: http://
e. Unzip 5.52 with custom patch: http://
f. Bug report with patch from AltLinux (RUS), https:/
g. Launchpad report, with custom fix: https:/
Blueprint information
- Status:
- Not started
- Approver:
- None
- Priority:
- Undefined
- Drafter:
- Simos Xenitellis
- Direction:
- Needs approval
- Assignee:
- None
- Definition:
- New
- Series goal:
- None
- Implementation:
-
Unknown
- Milestone target:
- None
- Started by
- Completed by
Related branches
Sprints
Whiteboard
I just found out -Jun07- (reference [g] above) that Ubuntu has recently started using a patch for unzip (originated from AltLinux) that offers the option to select the source encoding.
Specifically,
+ -O CHARSET specify a character encoding for DOS, Windows and OS/2 archives\n\
+ -I CHARSET specify a character encoding for UNIX and other archives\n\n";
and
export UNZIP="-O CP866"
export ZIPINFO="-O CP866" (to be used in system configuration files).
It looks that this blueprint will close soon; an unresolved issue that remains is to pass the change upstream to the InfoZip project.
----------
Wrote a patch for unzip fixing this issue:
https:/
The same patch for p7zip:
https:/