Re: [bug] Domain

Hey Fredrik,

I was just fixing the parser up for this. I should have it fully corrected
first thing in the morning. Then I will move onto terminology updates.

Leroy


On 5 December 2012 16:31, Leroy Finn <finnle@tcd.ie> wrote:

> I am going to start working on Domain corrections now. I will hopefully
> have it posted by the time i leave work or in the morning.
>
> Leroy
>
>
> On 5 December 2012 10:23, Leroy Finn <finnle@tcd.ie> wrote:
>
>> Yeah this one is on my to do list and I will be working on this today.
>>
>> Leroy
>>
>>
>> On 5 December 2012 01:56, Fredrik Liden <fliden@enlaso.com> wrote:
>>
>>> Hi Leroy, ****
>>>
>>> ** **
>>>
>>> I checked some bug fixes that Pablo had reported earlier plus a
>>> reference to the wrong rules file.****
>>>
>>> ** **
>>>
>>> Notes:                                  ****
>>>
>>> **-       **Xml file 3 is missing.****
>>>
>>> **-       **Can you review the mapping of html 2-3 and xml 4-7. The
>>> current values cannot be found in any of the mappings. Not sure if you want
>>> to change the mappings or the domain value.****
>>>
>>> ** **
>>>
>>> See the differences below:****
>>>
>>> **-       **Could we perhaps call the compiled list “domains” instead
>>> of domain, not to confuse it with the value the domainPointer is pointing
>>> to (“domain” if we follow the logic of removing the Pointer part in the
>>> right hand side attribute names)?****
>>>
>>> **-       **Maybe we don’t need to display the original “domain” and
>>> “domainMapping” just the “domains” which contains the compiled list of
>>> domains, to make sure the algorithm works?****
>>>
>>> **-       **See the missing “Pointer” string.****
>>>
>>> ** **
>>>
>>> Cheers,****
>>>
>>> Fredrik****
>>>
>>> ** **
>>>
>>> Left base: Suggested****
>>>
>>> Right base: Current
>>>
>>> File: domain\html\domain1htmloutput.txt   ****
>>>
>>> 12****
>>>
>>> /html/body[1]        domains="automotive"****
>>>
>>> <>** **
>>>
>>> 12****
>>>
>>> /html/body[1]        domain="automotive"****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[1]        domains="automotive"****
>>>
>>>  ****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[1]        domain="automotive"****
>>>
>>>
>>>
>>> File: domain\html\domain2htmloutput.txt   ****
>>>
>>> 12****
>>>
>>> /html/body[1]        domains="auto"****
>>>
>>> <>** **
>>>
>>> 12****
>>>
>>> /html/body[1]        domain="automotive"        domainMapping="automotive
>>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[1]        domains="auto"****
>>>
>>>  ****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[1]        domain="automotive"        domainMapping="automotive
>>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>>
>>>
>>>
>>> File: domain\html\domain3htmloutput.txt   ****
>>>
>>> 11****
>>>
>>> /html/body[1]        domains="sports"****
>>>
>>> <>** **
>>>
>>> 11****
>>>
>>> /html/body[1]        domain="sports"        domainMapping="'sports law'
>>> law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law'
>>> law"****
>>>
>>> 12****
>>>
>>> /html/body[1]/p[1]        domains="sports"****
>>>
>>>  ****
>>>
>>> 12****
>>>
>>> /html/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>>> law' law"****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[2]        domains="sports"****
>>>
>>>  ****
>>>
>>> 13****
>>>
>>> /html/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>>> law' law"****
>>>
>>>
>>>
>>> File: domain\xml\domain1xmloutput.txt   ****
>>>
>>> 6****
>>>
>>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>>
>>> <>** **
>>>
>>> 6****
>>>
>>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>>
>>>  ****
>>>
>>> 11****
>>>
>>> /doc/body[1]        domains="automotive"****
>>>
>>> <>** **
>>>
>>> 11****
>>>
>>> /doc/body[1]        domain="automotive"****
>>>
>>> 12****
>>>
>>> /doc/body[1]/p[1]        domains="automotive"****
>>>
>>>  ****
>>>
>>> 12****
>>>
>>> /doc/body[1]/p[1]        domain="automotive"****
>>>
>>>
>>>
>>> File: domain\xml\domain2xmloutput.txt   ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> -+****
>>>
>>> 6****
>>>
>>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>>
>>>  ****
>>>
>>> 7****
>>>
>>> /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>>
>>> +-****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> 12****
>>>
>>> /doc/body[1]        domains="auto"****
>>>
>>> <>** **
>>>
>>> 12****
>>>
>>> /doc/body[1]        domain="automotive"        domainMapping="automotive
>>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>>
>>> 13****
>>>
>>> /doc/body[1]/p[1]        domains="auto"****
>>>
>>>  ****
>>>
>>> 13****
>>>
>>> /doc/body[1]/p[1]        domain="automotive"        domainMapping="automotive
>>> auto, medical medicine, 'criminal law' law, 'property law' law"****
>>>
>>>
>>>
>>> File: domain\xml\domain4xmloutput.txt   ****
>>>
>>> 9****
>>>
>>> /text/body[1]        domains="sports"****
>>>
>>> <>** **
>>>
>>> 9****
>>>
>>> /text/body[1]        domain="sports"        domainMapping="'sports law'
>>> law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law'
>>> law"****
>>>
>>> 10****
>>>
>>> /text/body[1]/p[1]        domains="sports"****
>>>
>>>  ****
>>>
>>> 10****
>>>
>>> /text/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>>> law' law"****
>>>
>>> 11****
>>>
>>> /text/body[1]/p[2]        domains="sports"****
>>>
>>>  ****
>>>
>>> 11****
>>>
>>> /text/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort
>>> law' law"****
>>>
>>>
>>>
>>> File: domain\xml\domain5xmloutput.txt   ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> -+****
>>>
>>> 10****
>>>
>>> /text/head[1]/its:rules[2]/its:domainRule[1]/@domain****
>>>
>>>  ****
>>>
>>> 11****
>>>
>>> /text/head[1]/its:rules[2]/its:domainRule[1]/@domainPointer****
>>>
>>> +-****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> 19****
>>>
>>> /text/body[1]        domains="sports"****
>>>
>>> <>** **
>>>
>>> 19****
>>>
>>> /text/body[1]        domain="sports"        domainMapping="'sports law'
>>> law,'labor law' law,'contract law' law,'competition law' law,'tort law' law
>>> "****
>>>
>>> 20****
>>>
>>> /text/body[1]/p[1]        domains="sports"****
>>>
>>>  ****
>>>
>>> 20****
>>>
>>> /text/body[1]/p[1]        domain="sports"        domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort
>>> law' law"****
>>>
>>> 21****
>>>
>>> /text/body[1]/p[2]        domains="sports"****
>>>
>>>  ****
>>>
>>> 21****
>>>
>>> /text/body[1]/p[2]        domain="sports"        domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort
>>> law' law"****
>>>
>>> 22****
>>>
>>> /text/body[1]/span[1]        domains="law"****
>>>
>>>  ****
>>>
>>> 22****
>>>
>>> /text/body[1]/span[1]        domain="law"
>>> domainMapping="'Amateur Sports Law' 'sports law'"****
>>>
>>>
>>>
>>> File: domain\xml\domain6xmloutput.txt   ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> -+****
>>>
>>> 8****
>>>
>>> /text/head[1]/its:rules[1]/its:domainRule[1]/@domain****
>>>
>>>  ****
>>>
>>> 9****
>>>
>>> /text/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer****
>>>
>>> +-****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>>  ****
>>>
>>> 15****
>>>
>>> /text/body[1]/p[1]        domains="literature"****
>>>
>>> <>** **
>>>
>>> 15****
>>>
>>> /text/body[1]/p[1]        domain="literature"        domainMapping="'Classical
>>> literature',english"****
>>>
>>> 16****
>>>
>>> /text/body[1]/p[1]/span[1]        domains="literature"****
>>>
>>>  ****
>>>
>>> 16****
>>>
>>> /text/body[1]/p[1]/span[1]        domain="literature"
>>> domainMapping="'Classical literature',english"****
>>>
>>> 17****
>>>
>>> /text/body[1]/p[1]/span[2]        domains="literature"****
>>>
>>>  ****
>>>
>>> 17****
>>>
>>> /text/body[1]/p[1]/span[2]        domain="literature"
>>> domainMapping="'Classical literature',english"****
>>>
>>>
>>>
>>> File: domain\xml\domain7xmloutput.txt   ****
>>>
>>> 10****
>>>
>>> /text/body[1]/p[1]        domains="literature"****
>>>
>>> <>** **
>>>
>>> 10****
>>>
>>> /text/body[1]/p[1]        domain="literature"        domainMapping="'Classical
>>> literature',english"****
>>>
>>> 11****
>>>
>>> /text/body[1]/p[1]/span[1]        domains="literature"****
>>>
>>>  ****
>>>
>>> 11****
>>>
>>> /text/body[1]/p[1]/span[1]        domain="literature"
>>> domainMapping="'Classical literature',english"****
>>>
>>> 12****
>>>
>>> /text/body[1]/p[1]/span[2]        domains="literature"****
>>>
>>>  ****
>>>
>>> 12****
>>>
>>> /text/body[1]/p[1]/span[2]        domain="literature"
>>> domainMapping="'Classical literature',english"****
>>>
>>> ** **
>>>
>>
>>
>

Received on Wednesday, 5 December 2012 17:28:11 UTC