wget-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

wget2 | URL parser does unwanted transformations of URL (#598)


From: saur0n (@saur0n)
Subject: wget2 | URL parser does unwanted transformations of URL (#598)
Date: Wed, 27 Apr 2022 07:32:06 +0000


saur0n created an issue: https://gitlab.com/gnuwget/wget2/-/issues/598



When parsing an URL, in additional to transformations that are specified by 
standard (https://www.ietf.org/rfc/rfc2396.txt), IRI parser makes attempt to do 
transformations of XML entities, like """ and "&". This is not 
correct, because XML and HTML entities are part of XML/HTML standard and not 
part of URL standard.

>From my point of view, [part of code that works with HTML 
>entities](https://gitlab.com/gnuwget/wget2/-/blob/master/libwget/iri.c#L295) 
>should be removed from URL parser.

-- 
Reply to this email directly or view it on GitLab: 
https://gitlab.com/gnuwget/wget2/-/issues/598
You're receiving this email because of your account on gitlab.com.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]