W3C home > Mailing lists > Public > www-archive@w3.org > February 2013

Re: Information on HTML 5

From: Ian Hickson <ian@hixie.ch>
Date: Thu, 14 Feb 2013 05:42:21 +0000 (UTC)
To: Bjoern Hoehrmann <derhoermi@gmx.net>
cc: www-archive@w3.org
Message-ID: <Pine.LNX.4.64.1302140529380.3956@ps20323.dreamhostps.com>
On Wed, 13 Feb 2013, Bjoern Hoehrmann wrote:
>
> Pages like http://www.rfc-editor.org/info/rfc4329 unfortunately have 
> code like `<title>Information on RFC&nbsp4329</title>` currently, which 
> interestingly shows up exactly like that in Google search results

Not for me; do you have a sample (Google) URL showing this?

This page seems to have no "nbsp" in the titles:

https://www.google.com/search?rls=en&q=http://www.rfc-editor.org/info/rfc4329&ie=UTF-8&oe=UTF-8#hl=en&client=safari&tbo=d&rls=en&sclient=psy-ab&q=http:%2F%2Fwww.rfc-editor.org%2Finfo%2Frfc4329&oq=http:%2F%2Fwww.rfc-editor.org%2Finfo%2Frfc4329&gs_l=serp.3...157993.157993.4.158293.1.1.0.0.0.0.86.86.1.1.0.les%3B..0.0...1c.2.3.psy-ab.N0gVqvbVekM&pbx=1&bav=on.2,or.r_gc.r_pw.r_cp.r_qf.&bvm=bv.42452523,d.cGE&fp=d5bbde17dd6cca0a&biw=1397&bih=1323


> even though browsers treat the kaput reference as `&nbsp;`.

As per the spec.


> Surely at least if documents switch on the right doctype mode, Google 
> will use a standards compliant HTML parser? Maybe this will suffice to 
> find out in a while...

There's only one parser per the standard, it defines how you parse pages 
regardless of DOCTYPE (there's maybe two things in the parser I think 
that are affected by the precise DOCTYPE).


> http://www.websitedev.de/temp/information-on-html-5.html

I'm unable to find this page in Google's index, unfortunately, so cannot 
test it there.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Thursday, 14 February 2013 05:42:44 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 14 February 2013 05:42:45 GMT