[bug] Domain

Hi Leroy,

I checked some bug fixes that Pablo had reported earlier plus a reference to the wrong rules file.

Notes:

-       Xml file 3 is missing.

-       Can you review the mapping of html 2-3 and xml 4-7. The current values cannot be found in any of the mappings. Not sure if you want to change the mappings or the domain value.

See the differences below:

-       Could we perhaps call the compiled list "domains" instead of domain, not to confuse it with the value the domainPointer is pointing to ("domain" if we follow the logic of removing the Pointer part in the right hand side attribute names)?

-       Maybe we don't need to display the original "domain" and "domainMapping" just the "domains" which contains the compiled list of domains, to make sure the algorithm works?

-       See the missing "Pointer" string.

Cheers,
Fredrik

Left base: Suggested
Right base: Current

File: domain\html\domain1htmloutput.txt
12

/html/body[1]        domains="automotive"

<>

12

/html/body[1]        domain="automotive"

13

/html/body[1]/p[1]        domains="automotive"



13

/html/body[1]/p[1]        domain="automotive"



File: domain\html\domain2htmloutput.txt
12

/html/body[1]        domains="auto"

<>

12

/html/body[1]        domain="automotive"        domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"

13

/html/body[1]/p[1]        domains="auto"



13

/html/body[1]/p[1]        domain="automotive"        domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"



File: domain\html\domain3htmloutput.txt
11

/html/body[1]        domains="sports"

<>

11

/html/body[1]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"

12

/html/body[1]/p[1]        domains="sports"



12

/html/body[1]/p[1]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"

13

/html/body[1]/p[2]        domains="sports"



13

/html/body[1]/p[2]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"



File: domain\xml\domain1xmloutput.txt
6

/doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer

<>

6

/doc/head[1]/its:rules[1]/its:domainRule[1]/@domain



11

/doc/body[1]        domains="automotive"

<>

11

/doc/body[1]        domain="automotive"

12

/doc/body[1]/p[1]        domains="automotive"



12

/doc/body[1]/p[1]        domain="automotive"



File: domain\xml\domain2xmloutput.txt




-+

6

/doc/head[1]/its:rules[1]/its:domainRule[1]/@domain



7

/doc/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer

+-







12

/doc/body[1]        domains="auto"

<>

12

/doc/body[1]        domain="automotive"        domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"

13

/doc/body[1]/p[1]        domains="auto"



13

/doc/body[1]/p[1]        domain="automotive"        domainMapping="automotive auto, medical medicine, 'criminal law' law, 'property law' law"



File: domain\xml\domain4xmloutput.txt
9

/text/body[1]        domains="sports"

<>

9

/text/body[1]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"

10

/text/body[1]/p[1]        domains="sports"



10

/text/body[1]/p[1]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"

11

/text/body[1]/p[2]        domains="sports"



11

/text/body[1]/p[2]        domain="sports"        domainMapping="'sports law' law, 'labor law' law, 'contract law' law, 'competition law' law,'tort law' law"



File: domain\xml\domain5xmloutput.txt




-+

10

/text/head[1]/its:rules[2]/its:domainRule[1]/@domain



11

/text/head[1]/its:rules[2]/its:domainRule[1]/@domainPointer

+-







19

/text/body[1]        domains="sports"

<>

19

/text/body[1]        domain="sports"        domainMapping="'sports law' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"

20

/text/body[1]/p[1]        domains="sports"



20

/text/body[1]/p[1]        domain="sports"        domainMapping="'sports law' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"

21

/text/body[1]/p[2]        domains="sports"



21

/text/body[1]/p[2]        domain="sports"        domainMapping="'sports law' law,'labor law' law,'contract law' law,'competition law' law,'tort law' law"

22

/text/body[1]/span[1]        domains="law"



22

/text/body[1]/span[1]        domain="law"        domainMapping="'Amateur Sports Law' 'sports law'"



File: domain\xml\domain6xmloutput.txt




-+

8

/text/head[1]/its:rules[1]/its:domainRule[1]/@domain



9

/text/head[1]/its:rules[1]/its:domainRule[1]/@domainPointer

+-







15

/text/body[1]/p[1]        domains="literature"

<>

15

/text/body[1]/p[1]        domain="literature"        domainMapping="'Classical literature',english"

16

/text/body[1]/p[1]/span[1]        domains="literature"



16

/text/body[1]/p[1]/span[1]        domain="literature"        domainMapping="'Classical literature',english"

17

/text/body[1]/p[1]/span[2]        domains="literature"



17

/text/body[1]/p[1]/span[2]        domain="literature"        domainMapping="'Classical literature',english"



File: domain\xml\domain7xmloutput.txt
10

/text/body[1]/p[1]        domains="literature"

<>

10

/text/body[1]/p[1]        domain="literature"        domainMapping="'Classical literature',english"

11

/text/body[1]/p[1]/span[1]        domains="literature"



11

/text/body[1]/p[1]/span[1]        domain="literature"        domainMapping="'Classical literature',english"

12

/text/body[1]/p[1]/span[2]        domains="literature"



12

/text/body[1]/p[1]/span[2]        domain="literature"        domainMapping="'Classical literature',english"

Received on Wednesday, 5 December 2012 01:57:34 UTC