[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] Save 3 byte utf8 url
From: |
bes |
Subject: |
[Bug-wget] Save 3 byte utf8 url |
Date: |
Thu, 7 Feb 2013 17:06:47 +0300 |
Hi,
i found some bug in wget with interpreting and save percent-encoding 3 byte
utf8 url
example:
1. Create url with "—". This is U+2014 (EM DASH). Percent-encoding UTF-8 is
"%E2%80%94"
2. Try wget it: wget "http://example.com/abc—d" or wget "
http://example.com/abc%E2%80%94d" directly
3. Wget save this URL to file "abc\342%80%94d". Expected is
"abc%E2%80%94d". This is a bug.
$ wget -V
----------------------------------------
GNU Wget 1.14 built on linux-gnu.
+digest +https -ipv6 -iri +large-file -nls -ntlm +opie +ssl/openssl
Wgetrc:
/etc/wgetrc (system)
Compile: x86_64-pc-linux-gnu-gcc -DHAVE_CONFIG_H
-DSYSTEM_WGETRC="/etc/wgetrc"
-DLOCALEDIR="/usr/share/locale" -I. -I../lib -I../lib -DHAVE_LIBSSL
-O2 -march=native -pipe -fomit-frame-pointer
Link: x86_64-pc-linux-gnu-gcc -DHAVE_LIBSSL -O2 -march=native -pipe
-fomit-frame-pointer -Wl,-O1 -Wl,--as-needed -lssl -lcrypto -lpcre
-lrt ftp-opie.o openssl.o ../lib/libgnu.a
--------------------
- [Bug-wget] Save 3 byte utf8 url,
bes <=