Re: International Search Engine Submission

From: Erik van der Poel <erik@netscape.com>
Date: Tue, 08 Feb 2000 13:28:16 -0800
Message-ID: <38A08A70.BD378F0A@netscape.com>
To: Suzanne Topping <stopping@rochester.rr.com>
CC: www <www-international@w3.org>, nelocsig <nelocsig@egroups.com>
I have a question about the search engines. Besides accepting Web sites
that are submitted to them, these search engine companies usually(?)
run robots (crawlers) that automatically find Web sites and index them
for searching.

I'm wondering whether those crawlers deal with such character encodings
as ISO-2022-JP, where the byte values of '<', '>' and '&' (0x3C, 0x3E
and 0x26) can appear inside two-byte characters, and so don't have the
same meaning as the HTML characters.

In other words, do the crawlers deal with ISO-2022-JP? Or do they fail
to parse such documents, thereby failing to follow any of the URLs in
them?
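
As a rough illustration of the concern above, here is a minimal Python
sketch. It is not taken from any real crawler. The sample markup, the
function name ascii_mode_offsets and the reduction to just two escape
sequences (ESC $ B for JIS X 0208, ESC ( B for ASCII) are assumptions
made for the example; real ISO-2022-JP text can use other escape
sequences as well. The sketch compares a naive byte scan for '<' with a
scan that tracks the current shift state.

```python
# Illustrative sketch only; names and sample data are hypothetical.
ESC_TO_JIS0208 = b"\x1b$B"  # switch to two-byte JIS X 0208 characters
ESC_TO_ASCII = b"\x1b(B"    # switch back to one-byte ASCII


def ascii_mode_offsets(raw, target):
    """Return offsets of byte value `target` seen while in ASCII mode.

    Simplification: only the two escape sequences above are recognized.
    """
    offsets = []
    in_two_byte = False
    i = 0
    while i < len(raw):
        if raw.startswith(ESC_TO_JIS0208, i):
            in_two_byte = True
            i += len(ESC_TO_JIS0208)
        elif raw.startswith(ESC_TO_ASCII, i):
            in_two_byte = False
            i += len(ESC_TO_ASCII)
        elif in_two_byte:
            i += 2  # skip one two-byte character as a unit
        else:
            if raw[i] == target:
                offsets.append(i)
            i += 1
    return offsets


# Two JIS X 0208 characters whose bytes are 0x3C 0x21 and 0x3E 0x26,
# i.e. the same byte values as the ASCII text "<!" and ">&".
kanji_run = ESC_TO_JIS0208 + b"<!>&" + ESC_TO_ASCII
raw = b'<a href="/ja/">' + kanji_run + b"</a>"

naive_count = raw.count(b"<")                       # counts every 0x3C byte
aware = ascii_mode_offsets(raw, ord("<"))           # only real markup
decoded_count = raw.decode("iso2022_jp").count("<")  # decode first, then scan

print(naive_count, len(aware), decoded_count)
```

The naive count includes the 0x3C byte that belongs to a two-byte
character, while the shift-aware scan and the decode-then-scan approach
both see only the '<' characters that are really markup. A crawler that
scans raw bytes without tracking the escape sequences could therefore
misread where tags begin and end.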

Received on Tuesday, 8 February 2000 16:32:15 UTC