[Bug 9071] Handling of "[" in between-doctype-public-and-system-identifiers-state may not be ideal

http://www.w3.org/Bugs/Public/show_bug.cgi?id=9071





--- Comment #7 from Ian 'Hixie' Hickson <ian@hixie.ch>  2010-02-20 22:27:21 ---
I searched for pages matching these regexps in the Google index (basically any
valid-looking doctype that contains an internal subset):

  /<!doctype\s+html\s+public\s+"[^"]+"\s*\[|)/i
  /<!doctype\s+html\s+public\s+'[^']+'\s*\[|)/i
  /<!doctype\s+html\s+system\s+"[^"]+"\s*\[|)/i
  /<!doctype\s+html\s+system\s+'[^']+'\s*\[|)/i
  /<!doctype\s+html\s+public\s+"[^"]+"\s+"[^"]+"\s*\[|)/i
  /<!doctype\s+html\s+public\s+'[^']+'\s+'[^']+'\s*\[|)/i
  /<!doctype\s+html\s+public\s+"[^"]+"\s+'[^']+'\s*\[|)/i
  /<!doctype\s+html\s+public\s+'[^']+'\s+"[^"]+"\s*\[|)/i


A lot of the pages that have an internal subset are from the "epages"
e-commerce system:

http://www.btowstore.com/epages/Store3.sf/?ObjectPath=/Shops/Store3.Shop2250
http://cavaweb.es/epages/eb3502.sf/es_ES/?ObjectPath=/Shops/eb3502/Products/CH6
http://shop.cmsme.de/epages/62030644.sf/de_DE/?ObjectPath=./Categories/TERMINE
http://www.kissen-studio.de/
http://www.lahjamaailma.fi/Ukki-on-kova-jaetkae
http://www.legalize-love.com/


The list-of-companies.org site has a DOCTYPE they use on all their pages, but
presumably it is better in quirks mode since half of their other pages start
with something immediately before the DOCTYPE:

http://www.list-of-companies.org/Details/11311550/United_States/I-J_Auto_Shop/
http://fr.list-of-companies.org/Details/11311550/United_States/I-J_Auto_Shop/
http://www.list-of-companies.org/Details/10086450/China/Guangli_Machinery_Xinhui_Co_Ltd_/
http://gl.list-of-companies.org/Details/10086450/China/Guangli_Machinery_Xinhui_Co_Ltd_/


Here are some others I found:

http://www.dolomitiinfo.com/bellavista-cadore-comelico-superiore.t4i35371f1.aspx
http://www.dfa.ie/home/index.aspx?id=80791


I also found an actual XHTML page! (There's a bunch of them at this site.)

http://www.hindawi.com/floats/384010/figures/2008.384010.fig11.xht


-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.

Received on Saturday, 20 February 2010 22:27:23 UTC