W3C home > Mailing lists > Public > public-wai-rd@w3.org > November 2012

Re: indexing RDWG symposium papers in scientific search engines

From: Shawn Henry <shawn@w3.org>
Date: Fri, 09 Nov 2012 07:15:41 -0600
Message-ID: <509D01FD.80109@w3.org>
To: Shadi Abou-Zahra <shadi@w3.org>, RDWG <public-wai-rd@w3.org>, Simon Harper <simon.harper@manchester.ac.uk>
CC: daniel.poell@jku.at
Thanks, Daniel!

I've added this to the wiki at <http://www.w3.org/WAI/RD/wiki/Indexing>

I've made a few suggested changes for the meta for papers:
* added date format: <meta name="citation_publication_date" content="[symposium date yyyy/mm/dd]" />
* deleted <meta name="citation_online_date" content="[paper online date]" />  (doesn't seem important)
* changed the journal title: <meta name="citation_journal_title" content="W3C WAI Research and Development Symposiums" />

Simon, Shadi, all: Should we go ahead and add this to the papers we have?

~Shawn


On 10/25/2012 4:46 AM, Shadi Abou-Zahra wrote:
> Dear RDWG,
>
> Daniel Pöll has done some excellent research for us on how to best get our symposium papers better indexed in scientific search engines.
>
> Below is a summary of the main findings, please let us know if you have any further thoughts or comments on these findings:
>
>
> #1. Search Engines
>
> It seems that Google Scholar and Microsoft Academics are the largest search engine crawlers. There are several others though most seem to be focused on particular domains and others need to be manually pointed to the papers in order to index them. A list of search engines in here:
>  - <http://en.wikipedia.org/wiki/List_of_academic_databases_and_search_engines>
>
>
> #2. Metadata Formats
>
> Apparently Dublin Core is not as widely supported for this use as we'd initially thought. Some of the more widely supported metadata formats seem to be:
>  - Highwire Press Tags
>  - Eprints Tags
>  - BE Press Tags
>  - PRISM Tags
>
> Of these Highwire seems to be more widely used and documented. It also seems that both Google Scholar and Microsoft Academics support it.
>
>
> #3. Paper Requirements
>
> The guidelines for Google Scholar (which also seem to be supported by Microsoft Academics) do not have a strong impact on our current paper structure. It seems we only need to add some <meta> elements to the HTML code to reflect at least the:
>  - Title of the document
>  - Year of publishing
>  - At least one of the Author´s names
>
> Some useful resources found include:
>  - <http://scholar.google.com/intl/en/scholar/inclusion.html>
>  - <http://www.monperrus.net/martin/accurate+bibliographic+metadata+and+google+scholar>
>
>
> #4. Symposium Papers
>
> We suggest the following <meta> elements to be added to the current HTML for the symposium papers, to get them better indexed:
>  - <meta name="citation_title" content="[paper title]" />
>  - <meta name="citation_author" content="[author, multiple allowed]" />
>  - <meta name="citation_publication_date" content="[symposium date]" />
>  - <meta name="citation_online_date" content="[paper online date]" />
>  - <meta name="citation_conference_title" content="[symposium name]" />
>  - <meta name="citation_journal_title" content="W3C WAI Research and Development Working Group (RDWG) Notes" />
>  - <meta name="citation_technical_report_institution" content="W3C Web Accessibility Initiative (WAI)" />
>
>
> Regards,
>   Shadi
>
Received on Friday, 9 November 2012 13:15:50 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 9 November 2012 13:15:51 GMT