- From: Graham Klyne <Graham.Klyne@MIMEsweeper.com>
- Date: Fri, 08 Mar 2002 18:06:13 +0000
- To: Dave Beckett <dave.beckett@bristol.ac.uk>
- Cc: RDF Core <w3c-rdfcore-wg@w3.org>
Dave, Generally this looks good. I have a couple of nits and comments, but don't feel strongly about the resolution: At 05:38 PM 3/8/02 +0000, Dave Beckett wrote: >This closes action: > 2002-02-26#12 DaveB Propose n-triples changes to represent the > new form of rdf literals. > in F2F minutes http://www.w3.org/2001/sw/RDFCore/20020225-f2f/ > >During the F2F I was actioned to come up with a syntax to support the >structured literal form we agreed. I discussed this with >Dan Connolly what we could do and we agreed something like the >following was sufficient: > > xml("<b>foo</b>") XML content, no language > xml("<b>foo</b>", "en") XML content, language given "en" > > "chat" Unicode string, no language > "chat"-en Unicode string, language given as "en" It would feel more consistent to me to have: xml("<b>foo</b>"-en) XML content, language given "en" >Features: > * Makes all existing literals legal Good!!! > * Provides only one way to encoded the literal-structures > and so in that sense is canonical. Also good - simple-minded applications may still do string comparison, right? >In order to try this out, I've implemented the above in my N-Triples >parser, and it works just fine. > >Issues: > 1. "chat"-en might not be good enough if languages can contain > whitespace or other things (I need to check the RFCs) > Solution if this is needed: > "chat"-"en" Frpm RFC 3066: [[[ 2.1 Language tag syntax The language tag is composed of one or more parts: A primary language subtag and a (possibly empty) series of subsequent subtags. The syntax of this tag in ABNF [RFC 2234] is: Language-Tag = Primary-subtag *( "-" Subtag ) Primary-subtag = 1*8ALPHA Subtag = 1*8(ALPHA / DIGIT) ]]] so on that basis, quotes are not needed. > 2. Want one way to describe all literal structures: > Solution: > literal(unicode string value, unicode string language, boolean isXML) > and define the abbreviated forms in terms of that I don't see any value in this. > 3. I assume "chat" != "chat"-"" (need to check language RFCs) > Solution if this is needed: > Restrict the language string to always 1+ chars According to RFC 3066, a language tag may not be empty so this case shouldn't arise. I think it would be consistent with suggestions for xml:lang to have a blank tag value mean no language tag. Then: "chat" == "chat"-"" #g ------------------------------------------------------------ Graham Klyne MIMEsweeper Group Strategic Research <http://www.mimesweeper.com> <Graham.Klyne@MIMEsweeper.com>
Received on Friday, 8 March 2002 13:07:13 UTC