Re: indexing RDWG symposium papers in scientific search engines

For the tc4r template <http://www.w3.org/WAI/RD/2012/text-customization/template.html>, we changed the reference listing a bit so authors wouldn't erroneously include the words before the reference. They are put inside an "<ol>" list as directed in [1], so we should be OK on that.

~Shawn

On 10/25/2012 5:02 AM, Markel Vigo wrote:
> Great job. Also, regarding the references in the extended abstract, the document [1] says that if we are using HTML, we should employ<ol>  (we have now<li>) and use standard reference format. To do so, we should remove the word "proceedings","report", "journal" before each reference.
>
> regards
>
>
> [1] http://scholar.google.com/intl/en/scholar/inclusion.html#indexing
>
> Markel Vigo
> ---------------------------------------------------------------------------
> University of Manchester (UK)
> Web Ergonomics Lab - Information Management Group
>
> PS: I check my email at 9AM and 5PM BST. If you require a faster response please include the word [fast!] in the subject line.
>
> On 25 Oct 2012, at 10:46, Shadi Abou-Zahra<shadi@w3.org>  wrote:
>
>> Dear RDWG,
>>
>> Daniel Pöll has done some excellent research for us on how to best get our symposium papers better indexed in scientific search engines.
>>
>> Below is a summary of the main findings, please let us know if you have any further thoughts or comments on these findings:
>>
>>
>> #1. Search Engines
>>
>> It seems that Google Scholar and Microsoft Academics are the largest search engine crawlers. There are several others though most seem to be focused on particular domains and others need to be manually pointed to the papers in order to index them. A list of search engines in here:
>> -<http://en.wikipedia.org/wiki/List_of_academic_databases_and_search_engines>
>>
>>
>> #2. Metadata Formats
>>
>> Apparently Dublin Core is not as widely supported for this use as we'd initially thought. Some of the more widely supported metadata formats seem to be:
>> - Highwire Press Tags
>> - Eprints Tags
>> - BE Press Tags
>> - PRISM Tags
>>
>> Of these Highwire seems to be more widely used and documented. It also seems that both Google Scholar and Microsoft Academics support it.
>>
>>
>> #3. Paper Requirements
>>
>> The guidelines for Google Scholar (which also seem to be supported by Microsoft Academics) do not have a strong impact on our current paper structure. It seems we only need to add some<meta>  elements to the HTML code to reflect at least the:
>> - Title of the document
>> - Year of publishing
>> - At least one of the Author´s names
>>
>> Some useful resources found include:
>> -<http://scholar.google.com/intl/en/scholar/inclusion.html>
>> -<http://www.monperrus.net/martin/accurate+bibliographic+metadata+and+google+scholar>
>>
>>
>> #4. Symposium Papers
>>
>> We suggest the following<meta>  elements to be added to the current HTML for the symposium papers, to get them better indexed:
>> -<meta name="citation_title" content="[paper title]" />
>> -<meta name="citation_author" content="[author, multiple allowed]" />
>> -<meta name="citation_publication_date" content="[symposium date]" />
>> -<meta name="citation_online_date" content="[paper online date]" />
>> -<meta name="citation_conference_title" content="[symposium name]" />
>> -<meta name="citation_journal_title" content="W3C WAI Research and Development Working Group (RDWG) Notes" />
>> -<meta name="citation_technical_report_institution" content="W3C Web Accessibility Initiative (WAI)" />
>>
>>
>> Regards,
>>   Shadi
>>
>> -- 
>> Shadi Abou-Zahra - http://www.w3.org/People/shadi/
>> Activity Lead, W3C/WAI International Program Office
>> Evaluation and Repair Tools Working Group (ERT WG)
>> Research and Development Working Group (RDWG)
>>
>
>

Received on Thursday, 25 October 2012 12:11:09 UTC