Found a ruby bug in the URI class, what do I do?

Victor_Pereira · 27 August 2009 14:00

Hi,

i have to open an uri with open-uri but URI is raising me an error that
an URL is an invalid uri:

p URI.split("http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}"\)

URI::InvalidURIError: bad URI(is not URI?):
http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}
from
/System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/uri/common.rb:436:in
`split'
from (irb):8

That's a strange ID but valid. I even checked the RFC 2396 and {} is not
a reserved character.

Best regards,

VP

···

from :0
--
Posted via http://www.ruby-forum.com/\.

Michael_Fellinger1 · 27 August 2009 14:31

Hi,

i have to open an uri with open-uri but URI is raising me an error that
an URL is an invalid uri:

p URI.split("http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}"\)

URI::InvalidURIError: bad URI(is not URI?):
http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}
from
/System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/uri/common.rb:436:in
`split'
from (irb):8
from :0

That's a strange ID but valid. I even checked the RFC 2396 and {} is not
a reserved character.

Best regards,

Just file a bug at http://redmine.ruby-lang.org/ so it won't be forgotten.
If you also supply a patch against the uri lib, you get bonus points

···

On Thu, Aug 27, 2009 at 4:00 PM, Victor Pereira<pelasaco@gmail.com> wrote:

VP

--
Michael Fellinger
CTO, The Rubyists, LLC
972-996-5199

Rob_Biedenharn1 · 27 August 2009 16:07

Hi,

i have to open an uri with open-uri but URI is raising me an error that
an URL is an invalid uri:

p URI.split("http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}"\)

URI::InvalidURIError: bad URI(is not URI?):
http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}
from
/System/Library/Frameworks/Ruby.framework/Versions/1.8/usr/lib/ruby/1.8/uri/common.rb:436:in
`split'
from (irb):8
from :0

That's a strange ID but valid. I even checked the RFC 2396 and {} is not
a reserved character.

But it is part of the 'unwise' set defined by RFC 2396 in:
  "2.4.3. Excluded US-ASCII Characters
  Although they are disallowed within the URI syntax, we include
  here a description of those US-ASCII characters that have been
  excluded and the reasons for their exclusion."
http://www.faqs.org/rfcs/rfc2396.html

Just because a URI works in any (or even all) browser(s) doesn't mean that it conforms to the standard for URIs.

I think that URI.split is doing the right thing.

URI.escape() might be a workaround for you:

u="http://xpto.com/index.asp?

action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"
=> "http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247\}"

URI.escape(u)

=> "http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"

u1=URI.parse(URI.escape(u))

=> #<URI::HTTP:0x38cbd8 URL:http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247} >

u1.query

=> "action=showproduct&id=%7BD3E21D33-6DFF-4355-9324-AE1395CEB247%7D"

URI.unescape(u1.query)

=> "action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"

-Rob

Best regards,

Just file a bug at http://redmine.ruby-lang.org/ so it won't be forgotten.
If you also supply a patch against the uri lib, you get bonus points

VP

--
Michael Fellinger
CTO, The Rubyists, LLC
972-996-5199

Rob Biedenharn http://agileconsultingllc.com
Rob@AgileConsultingLLC.com

···

On Aug 27, 2009, at 10:31 AM, Michael Fellinger wrote:

On Thu, Aug 27, 2009 at 4:00 PM, Victor Pereira<pelasaco@gmail.com> > wrote:

Victor_Pereira · 28 August 2009 15:40

Rob,

to be in the 'unwise' doesn't mean
that it not conforms to the standard for URIs.

I'm escaping and it works, but in my option it could be handled by the
lib.

VP

Rob Biedenharn wrote:

···

On Aug 27, 2009, at 10:31 AM, Michael Fellinger wrote:

URI::InvalidURIError: bad URI(is not URI?):
That's a strange ID but valid. I even checked the RFC 2396 and {}
is not
a reserved character.

But it is part of the 'unwise' set defined by RFC 2396 in:
  "2.4.3. Excluded US-ASCII Characters
  Although they are disallowed within the URI syntax, we include
  here a description of those US-ASCII characters that have been
  excluded and the reasons for their exclusion."
RFC 2396 - Uniform Resource Identifiers (URI): Generic Syntax (RFC2396)

Just because a URI works in any (or even all) browser(s) doesn't mean
that it conforms to the standard for URIs.

I think that URI.split is doing the right thing.

URI.escape() might be a workaround for you:

> u="http://xpto.com/index.asp?
action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"
=> "http://xpto.com/index.asp?
action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"
> URI.escape(u)
=>
"http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}
"
> u1=URI.parse(URI.escape(u))
=> #<URI::HTTP:0x38cbd8
URL:http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}
>
> u1.query
=> "action=showproduct&id=%7BD3E21D33-6DFF-4355-9324-AE1395CEB247%7D"
> URI.unescape(u1.query)
=> "action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"

-Rob

--
Michael Fellinger
CTO, The Rubyists, LLC
972-996-5199

Rob Biedenharn http://agileconsultingllc.com
Rob@AgileConsultingLLC.com

--
Posted via http://www.ruby-forum.com/\.

Rob_Biedenharn1 · 28 August 2009 16:12

Sorry, Victor, but yes it does.

Quoting from the end of RFC 2396 section 2.4.3:

[quote]
Other characters are excluded because gateways and other transport agents are known to sometimes modify such characters, or they are used as delimiters.
unwise = "{" | "}" | "|" | "\" | "^" | "[" | "]" | "`"
Data corresponding to excluded characters must be escaped in order to be properly represented within a URI.
[/quote]

So that last sentence pretty much nails it. Curly braces "...must be escaped in order to be properly represented within a URI."

So you'll have to continue to escape them yourself. It wouldn't be right for the URI class to depart from the standard here.

-Rob

···

On Aug 28, 2009, at 11:40 AM, Victor Pereira wrote:

Rob,

to be in the 'unwise' doesn't mean
that it not conforms to the standard for URIs.

I'm escaping and it works, but in my option it could be handled by the
lib.

VP

Rob Biedenharn wrote:

On Aug 27, 2009, at 10:31 AM, Michael Fellinger wrote:

URI::InvalidURIError: bad URI(is not URI?):
That's a strange ID but valid. I even checked the RFC 2396 and {}
is not
a reserved character.

But it is part of the 'unwise' set defined by RFC 2396 in:
"2.4.3. Excluded US-ASCII Characters
Although they are disallowed within the URI syntax, we include
here a description of those US-ASCII characters that have been
excluded and the reasons for their exclusion."
RFC 2396 - Uniform Resource Identifiers (URI): Generic Syntax (RFC2396)

Just because a URI works in any (or even all) browser(s) doesn't mean
that it conforms to the standard for URIs.

I think that URI.split is doing the right thing.

URI.escape() might be a workaround for you:

> u="http://xpto.com/index.asp?
action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"
=> "http://xpto.com/index.asp?
action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"
> URI.escape(u)
=>
"http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}
"
> u1=URI.parse(URI.escape(u))
=> #<URI::HTTP:0x38cbd8
URL:http://xpto.com/index.asp?action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}

> u1.query
=> "action=showproduct&id=%7BD3E21D33-6DFF-4355-9324-AE1395CEB247%7D"
> URI.unescape(u1.query)
=> "action=showproduct&id={D3E21D33-6DFF-4355-9324-AE1395CEB247}"

-Rob

--
Michael Fellinger
CTO, The Rubyists, LLC
972-996-5199

Rob Biedenharn http://agileconsultingllc.com
Rob@AgileConsultingLLC.com

--
Posted via http://www.ruby-forum.com/\.

Rob Biedenharn http://agileconsultingllc.com
Rob@AgileConsultingLLC.com
+1 513-295-4739
Skype: rob.biedenharn

Topic		Replies	Views
Open-uri bug ruby-talk	5	105	1 March 2008
Does URI.escape have a bug? ruby-talk	4	560	8 November 2015
URI::InvalidURIError when using curly brackets ruby-talk	4	117	27 June 2006
Is this an open-uri bug? ruby-talk	0	107	1 November 2004
Are square brackets not allowed in a URI? ruby-talk	2	125	14 August 2008

Found a ruby bug in the URI class, what do I do?

Related topics