- From: Ian Hickson <ian@hixie.ch>
- Date: Thu, 14 Feb 2013 05:42:21 +0000 (UTC)
- To: Bjoern Hoehrmann <derhoermi@gmx.net>
- cc: www-archive@w3.org
On Wed, 13 Feb 2013, Bjoern Hoehrmann wrote: > > Pages like http://www.rfc-editor.org/info/rfc4329 unfortunately have > code like `<title>Information on RFC 4329</title>` currently, which > interestingly shows up exactly like that in Google search results Not for me; do you have a sample (Google) URL showing this? This page seems to have no "nbsp" in the titles: https://www.google.com/search?rls=en&q=http://www.rfc-editor.org/info/rfc4329&ie=UTF-8&oe=UTF-8#hl=en&client=safari&tbo=d&rls=en&sclient=psy-ab&q=http:%2F%2Fwww.rfc-editor.org%2Finfo%2Frfc4329&oq=http:%2F%2Fwww.rfc-editor.org%2Finfo%2Frfc4329&gs_l=serp.3...157993.157993.4.158293.1.1.0.0.0.0.86.86.1.1.0.les%3B..0.0...1c.2.3.psy-ab.N0gVqvbVekM&pbx=1&bav=on.2,or.r_gc.r_pw.r_cp.r_qf.&bvm=bv.42452523,d.cGE&fp=d5bbde17dd6cca0a&biw=1397&bih=1323 > even though browsers treat the kaput reference as ` `. As per the spec. > Surely at least if documents switch on the right doctype mode, Google > will use a standards compliant HTML parser? Maybe this will suffice to > find out in a while... There's only one parser per the standard, it defines how you parse pages regardless of DOCTYPE (there's maybe two things in the parser I think that are affected by the precise DOCTYPE). > http://www.websitedev.de/temp/information-on-html-5.html I'm unable to find this page in Google's index, unfortunately, so cannot test it there. -- Ian Hickson U+1047E )\._.,--....,'``. fL http://ln.hixie.ch/ U+263A /, _.. \ _\ ;`._ ,. Things that are impossible just take longer. `._.-(,_..'--(,_..'`-.;.'
Received on Thursday, 14 February 2013 05:42:44 UTC