- From: Leroy Finn <finnle@tcd.ie>
- Date: Thu, 6 Dec 2012 10:30:50 +0000
- To: Fredrik Liden <fliden@enlaso.com>
- Cc: Multilingual Web LT-TESTS Public <public-multilingualweb-lt-tests@w3.org>
- Message-ID: <CAMYWBwukqSRvUa=1pSkoQCeUJLUgwbf0v10_qebCAsJDEHOhAQ@mail.gmail.com>
Hey Fredrik,

I have the updates made to Domain now. Going to do the minor corrections you
emailed to me last night and then start on the other categories that need
updates like Provenance, Disambiguation, MT Confidence, LQI and Terminology.

Thanks for all the help,
Leroy

On 6 December 2012 00:15, Fredrik Liden <fliden@enlaso.com> wrote:

> Thanks for the quick updates Leroy. I’ll go through a few more categories
> then regress the latest changes.
>
> Fredrik
>
> *From:* Leroy Finn [mailto:finnle@tcd.ie]
> *Sent:* Wednesday, December 05, 2012 9:28 AM
> *To:* Fredrik Liden
> *Cc:* Multilingual Web LT-TESTS Public
> *Subject:* Re: [bug] Domain
>
> Hey Fredrik,
>
> I was just fixing the parser up for this. I should have it fully corrected
> first thing in the morning. Then I will move onto terminology updates.
>
> Leroy
>
> On 5 December 2012 16:31, Leroy Finn <finnle@tcd.ie> wrote:
>
> I am going to start working on Domain corrections now. I will hopefully
> have it posted by the time I leave work or in the morning.
>
> Leroy
>
> On 5 December 2012 10:23, Leroy Finn <finnle@tcd.ie> wrote:
>
> Yeah this one is on my to do list and I will be working on this today.
>
> Leroy
>
> On 5 December 2012 01:56, Fredrik Liden <fliden@enlaso.com> wrote:
>
> Hi Leroy,
>
> I checked some bug fixes that Pablo had reported earlier plus a reference
> to the wrong rules file.
>
> Notes:
>
> - Xml file 3 is missing.
> - Can you review the mapping of html 2-3 and xml 4-7. The current
>   values cannot be found in any of the mappings. Not sure if you want to
>   change the mappings or the domain value.
>
> See the differences below:
>
> - Could we perhaps call the compiled list “domains” instead of
>   domain, not to confuse it with the value the domainPointer is pointing to
>   (“domain” if we follow the logic of removing the Pointer part in the right
>   hand side attribute names)?
> - Maybe we don’t need to display the original “domain” and
>   “domainMapping” just the “domains” which contains the compiled list of
>   domains, to make sure the algorithm works?
> - See the missing “Pointer” string.
>
> Cheers,
> Fredrik
>
> Left base: Suggested
> Right base: Current
>
> File: domain\html\domain1htmloutput.txt
>
> 12 /html/body[1] domains="automotive"
> <>
> 12 /html/body[1] domain="automotive"
>
> 13 /html/body[1]/p[1] domains="automotive"
> 13 /html/body[1]/p[1] domain="automotive"
>
> File: domain\html\domain2htmloutput.txt
>
> 12 /html/body[1] domains="auto"
> <>
> 12 /html/body[1] domain="automotive" domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"
>
> 13 /html/body[1]/p[1] domains="auto"
> 13 /html/body[1]/p[1] domain="automotive" domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"
>
> File: domain\html\domain3htmloutput.txt
>
> 11 /html/body[1] domains="sports"
> <>
> 11 /html/body[1] domain="sports" domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> 12 /html/body[1]/p[1] domains="sports"
> 12 /html/body[1]/p[1] domain="sports" domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> 13 /html/body[1]/p[2] domains="sports"
> 13 /html/body[1]/p[2] domain="sports" domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> File: domain\xml\domain1xmloutput.txt
>
> 6 /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer
> <>
> 6 /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain
>
> 11 /doc/body[1] domains="automotive"
> <>
> 11 /doc/body[1] domain="automotive"
>
> 12 /doc/body[1]/p[1] domains="automotive"
> 12 /doc/body[1]/p[1] domain="automotive"
>
> File: domain\xml\domain2xmloutput.txt
>
> -+
> 6 /doc/head[1]/its:rules[1]/its:domainRule[1]/@domain
> 7 /doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer
> +-
>
> 12 /doc/body[1] domains="auto"
> <>
> 12 /doc/body[1] domain="automotive" domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"
>
> 13 /doc/body[1]/p[1] domains="auto"
> 13 /doc/body[1]/p[1] domain="automotive" domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"
>
> File: domain\xml\domain4xmloutput.txt
>
> 9 /text/body[1] domains="sports"
> <>
> 9 /text/body[1] domain="sports" domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> 10 /text/body[1]/p[1] domains="sports"
> 10 /text/body[1]/p[1] domain="sports" domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> 11 /text/body[1]/p[2] domains="sports"
> 11 /text/body[1]/p[2] domain="sports" domainMapping="'sportslaw' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"
>
> File: domain\xml\domain5xmloutput.txt
>
> -+
> 10 /text/head[1]/its:rules[2]/its:domainRule[1]/@domain
> 11 /text/head[1]/its:rules[2]/its:domainRule[1]/@domainPointer
> +-
>
> 19 /text/body[1] domains="sports"
> <>
> 19 /text/body[1] domain="sports" domainMapping="'sports law' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"
>
> 20 /text/body[1]/p[1] domains="sports"
> 20 /text/body[1]/p[1] domain="sports" domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"
>
> 21 /text/body[1]/p[2] domains="sports"
> 21 /text/body[1]/p[2] domain="sports" domainMapping="'sportslaw' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"
>
> 22 /text/body[1]/span[1] domains="law"
> 22 /text/body[1]/span[1] domain="law" domainMapping="'Amateur Sports Law' 'sports law'"
>
> File: domain\xml\domain6xmloutput.txt
>
> -+
> 8 /text/head[1]/its:rules[1]/its:domainRule[1]/@domain
> 9 /text/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer
> +-
>
> 15 /text/body[1]/p[1] domains="literature"
> <>
> 15 /text/body[1]/p[1] domain="literature" domainMapping="'Classical literature',english"
>
> 16 /text/body[1]/p[1]/span[1] domains="literature"
> 16 /text/body[1]/p[1]/span[1] domain="literature" domainMapping="'Classical literature',english"
>
> 17 /text/body[1]/p[1]/span[2] domains="literature"
> 17 /text/body[1]/p[1]/span[2] domain="literature" domainMapping="'Classical literature',english"
>
> File: domain\xml\domain7xmloutput.txt
>
> 10 /text/body[1]/p[1] domains="literature"
> <>
> 10 /text/body[1]/p[1] domain="literature" domainMapping="'Classical literature',english"
>
> 11 /text/body[1]/p[1]/span[1] domains="literature"
> 11 /text/body[1]/p[1]/span[1] domain="literature" domainMapping="'Classical literature',english"
>
> 12 /text/body[1]/p[1]/span[2] domains="literature"
> 12 /text/body[1]/p[1]/span[2] domain="literature" domainMapping="'Classical literature',english"
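For reference, the compiled "domains" value suggested in the quoted diff could be derived roughly as in the sketch below. This is not the test suite's parser: the function names `parse_domain_mapping` and `compile_domains` are hypothetical, and the sketch assumes that a mapping is a comma-separated list of "source target" pairs, that values containing spaces are single-quoted, that no quoted value contains a comma, and that source values are compared case-insensitively. A value with no matching pair is passed through unchanged.

```python
import shlex


def parse_domain_mapping(mapping):
    """Turn "automotive auto, 'criminal law' law" into a lookup table.

    Assumption: pairs are separated by commas, and quoted values never
    contain a comma themselves.
    """
    table = {}
    for pair in mapping.split(","):
        # shlex.split honours single quotes, so "'criminal law' law"
        # becomes ["criminal law", "law"].
        parts = shlex.split(pair)
        if len(parts) != 2:
            # Not a "source target" pair (e.g. a lone value); skip it.
            continue
        source, target = parts
        # Assumption: source values are matched case-insensitively.
        table.setdefault(source.lower(), target)
    return table


def compile_domains(domain_value, mapping=""):
    """Map each domain value through the table and return the compiled list."""
    table = parse_domain_mapping(mapping)
    compiled = []
    for raw in domain_value.split(","):
        value = raw.strip()
        if not value:
            continue
        # Values without a mapping entry are kept as they are.
        mapped = table.get(value.lower(), value)
        if mapped not in compiled:
            compiled.append(mapped)
    return ", ".join(compiled)


auto_mapping = ("automotive auto, medical medicine, "
                "'criminal law' law, 'property law' law")
print(compile_domains("automotive", auto_mapping))  # auto
print(compile_domains("sports", "'sports law' law, 'labor law' law"))  # sports
```

Under these assumptions, "automotive" compiles to "auto" because a pair exists for it, while "sports" stays "sports" because only 'sports law' is mapped, which is the situation the note about values not being found in any of the mappings describes.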
Received on Thursday, 6 December 2012 10:31:25 UTC