Re: [bug] Domain

I am going to start working on Domain corrections now. I will hopefully
have it posted by the time i leave work or in the morning.

Leroy


On 5 December 2012 10:23, Leroy Finn <finnle@tcd.ie> wrote:

> Yeah this one is on my to do list and I will be working on this today.
>
> Leroy
>
>
> On 5 December 2012 01:56, Fredrik Liden <fliden@enlaso.com> wrote:
>
>> Hi Leroy, ****
>>
>> ** **
>>
>> I checked some bug fixes that Pablo had reported earlier plus a reference
>> to the wrong rules file.****
>>
>> ** **
>>
>> Notes:                                  ****
>>
>> **-       **Xml file 3 is missing.****
>>
>> **-       **Can you review the mapping of html 2-3 and xml 4-7. The
>> current values cannot be found in any of the mappings. Not sure if you want
>> to change the mappings or the domain value.****
>>
>> ** **
>>
>> See the differences below:****
>>
>> **-       **Could we perhaps call the compiled list “domains” instead of
>> domain, not to confuse it with the value the domainPointer is pointing to
>> (“domain” if we follow the logic of removing the Pointer part in the right
>> hand side attribute names)?****
>>
>> **-       **Maybe we don’t need to display the original “domain” and
>> “domainMapping” just the “domains” which contains the compiled list of
>> domains, to make sure the algorithm works?****
>>
>> **-       **See the missing “Pointer” string.****
>>
>> ** **
>>
>> Cheers,****
>>
>> Fredrik****
>>
>> ** **
>>
>> Left base: Suggested****
>>
>> Right base: Current
>>
>> File: domain\html\domain1htmloutput.txt   ****
>>
>> 12****
>>
>> /html/body[1]        domains="automotive"****
>>
>> <>** **
>>
>> 12****
>>
>> /html/body[1]        domain="automotive"****
>>
>> 13****
>>
>> /html/body[1]/p[1]        domains="automotive"****
>>
>>  ****
>>
>> 13****
>>
>> /html/body[1]/p[1]        domain="automotive"****
>>
>>
>>
>> File: domain\html\domain2htmloutput.txt   ****
>>
>> 12****
>>
>> /html/body[1]        domains="auto"****
>>
>> <>** **
>>
>> 12****
>>
>> /html/body[1]        domain="automotive"        domainMapping="automotive
>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>
>> 13****
>>
>> /html/body[1]/p[1]        domains="auto"****
>>
>>  ****
>>
>> 13****
>>
>> /html/body[1]/p[1]        domain="automotive"        domainMapping="automotive
>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>
>>
>>
>> File: domain\html\domain3htmloutput.txt   ****
>>
>> 11****
>>
>> /html/body[1]        domains="sports"****
>>
>> <>** **
>>
>> 11****
>>
>> /html/body[1]        domain="sports"        domainMapping="'sports law'
>> law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law'
>> law"****
>>
>> 12****
>>
>> /html/body[1]/p[1]        domains="sports"****
>>
>>  ****
>>
>> 12****
>>
>> /html/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>> law' law"****
>>
>> 13****
>>
>> /html/body[1]/p[2]        domains="sports"****
>>
>>  ****
>>
>> 13****
>>
>> /html/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>> law' law"****
>>
>>
>>
>> File: domain\xml\domain1xmloutput.txt   ****
>>
>> 6****
>>
>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>
>> <>** **
>>
>> 6****
>>
>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>
>>  ****
>>
>> 11****
>>
>> /doc/body[1]        domains="automotive"****
>>
>> <>** **
>>
>> 11****
>>
>> /doc/body[1]        domain="automotive"****
>>
>> 12****
>>
>> /doc/body[1]/p[1]        domains="automotive"****
>>
>>  ****
>>
>> 12****
>>
>> /doc/body[1]/p[1]        domain="automotive"****
>>
>>
>>
>> File: domain\xml\domain2xmloutput.txt   ****
>>
>>  ****
>>
>>  ****
>>
>> -+****
>>
>> 6****
>>
>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>
>>  ****
>>
>> 7****
>>
>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>
>> +-****
>>
>>  ****
>>
>>  ****
>>
>>  ****
>>
>> 12****
>>
>> /doc/body[1]        domains="auto"****
>>
>> <>** **
>>
>> 12****
>>
>> /doc/body[1]        domain="automotive"        domainMapping="automotive
>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>
>> 13****
>>
>> /doc/body[1]/p[1]        domains="auto"****
>>
>>  ****
>>
>> 13****
>>
>> /doc/body[1]/p[1]        domain="automotive"        domainMapping="automotive
>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>
>>
>>
>> File: domain\xml\domain4xmloutput.txt   ****
>>
>> 9****
>>
>> /text/body[1]        domains="sports"****
>>
>> <>** **
>>
>> 9****
>>
>> /text/body[1]        domain="sports"        domainMapping="'sports law'
>> law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law'
>> law"****
>>
>> 10****
>>
>> /text/body[1]/p[1]        domains="sports"****
>>
>>  ****
>>
>> 10****
>>
>> /text/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>> law' law"****
>>
>> 11****
>>
>> /text/body[1]/p[2]        domains="sports"****
>>
>>  ****
>>
>> 11****
>>
>> /text/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>> law' law"****
>>
>>
>>
>> File: domain\xml\domain5xmloutput.txt   ****
>>
>>  ****
>>
>>  ****
>>
>> -+****
>>
>> 10****
>>
>> /text/head[1]/its:rules[2]/its:domainRule[1]/@domain****
>>
>>  ****
>>
>> 11****
>>
>> /text/head[1]/its:rules[2]/its:domainRule[1]/@domainPointer****
>>
>> +-****
>>
>>  ****
>>
>>  ****
>>
>>  ****
>>
>> 19****
>>
>> /text/body[1]        domains="sports"****
>>
>> <>** **
>>
>> 19****
>>
>> /text/body[1]        domain="sports"        domainMapping="'sports law'
>> law,'labor law' law,'contract law' law,'competition law' law,'tort law' law
>> "****
>>
>> 20****
>>
>> /text/body[1]/p[1]        domains="sports"****
>>
>>  ****
>>
>> 20****
>>
>> /text/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort
>> law' law"****
>>
>> 21****
>>
>> /text/body[1]/p[2]        domains="sports"****
>>
>>  ****
>>
>> 21****
>>
>> /text/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort
>> law' law"****
>>
>> 22****
>>
>> /text/body[1]/span[1]        domains="law"****
>>
>>  ****
>>
>> 22****
>>
>> /text/body[1]/span[1]        domain="law"        domainMapping="'Amateur
>> Sports Law' 'sports law'"****
>>
>>
>>
>> File: domain\xml\domain6xmloutput.txt   ****
>>
>>  ****
>>
>>  ****
>>
>> -+****
>>
>> 8****
>>
>> /text/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>
>>  ****
>>
>> 9****
>>
>> /text/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>
>> +-****
>>
>>  ****
>>
>>  ****
>>
>>  ****
>>
>> 15****
>>
>> /text/body[1]/p[1]        domains="literature"****
>>
>> <>** **
>>
>> 15****
>>
>> /text/body[1]/p[1]        domain="literature"        domainMapping="'Classical
>> literature',english"****
>>
>> 16****
>>
>> /text/body[1]/p[1]/span[1]        domains="literature"****
>>
>>  ****
>>
>> 16****
>>
>> /text/body[1]/p[1]/span[1]        domain="literature"
>> domainMapping="'Classical literature',english"****
>>
>> 17****
>>
>> /text/body[1]/p[1]/span[2]        domains="literature"****
>>
>>  ****
>>
>> 17****
>>
>> /text/body[1]/p[1]/span[2]        domain="literature"
>> domainMapping="'Classical literature',english"****
>>
>>
>>
>> File: domain\xml\domain7xmloutput.txt   ****
>>
>> 10****
>>
>> /text/body[1]/p[1]        domains="literature"****
>>
>> <>** **
>>
>> 10****
>>
>> /text/body[1]/p[1]        domain="literature"        domainMapping="'Classical
>> literature',english"****
>>
>> 11****
>>
>> /text/body[1]/p[1]/span[1]        domains="literature"****
>>
>>  ****
>>
>> 11****
>>
>> /text/body[1]/p[1]/span[1]        domain="literature"
>> domainMapping="'Classical literature',english"****
>>
>> 12****
>>
>> /text/body[1]/p[1]/span[2]        domains="literature"****
>>
>>  ****
>>
>> 12****
>>
>> /text/body[1]/p[1]/span[2]        domain="literature"
>> domainMapping="'Classical literature',english"****
>>
>> ** **
>>
>
>

Received on Wednesday, 5 December 2012 16:31:36 UTC