- From: LeVan,Ralph <levan@oclc.org>
- Date: Tue, 7 May 2002 10:31:20 -0400
- To: "Z39.50 LISTSERV" <www-zig@w3.org>
Okay, I'll just come right out and say it. I hate the idea of double-quotes inside of strings! I'm sure I'm just being old fashioned, but it feels like a parsing nightmare. Whatever happened to the good old backslash to escape things? I have no problems with a proposal that 123?4\56 means that the five is a literal and not a digit and should be ignored by the question-mark processor. But I don't like 123?4"5"6. Sorry. Ralph "No sir, I don't like it!" The Horse from Ren and Stimpy > -----Original Message----- > From: Mark Reichert [mailto:markr@sirs.com] > Sent: Tuesday, May 07, 2002 7:56 AM > To: Z39.50 LISTSERV > Subject: Re: CCL proposal (quotes) > > > I knew I had it around somewhere. Z39.58-1992 is/was clear > on this matter. > > 7.7.2.1 > > ... > When ? is immediately followed by a positive integer, it > shall be used to > indicate a limited range of characters to be masked, from > zero up to and > including the specified integer.... To search embedded > numbers, restoration > marks are required. See Section 7.7.7. > > 7.7.7 > > In order to use a reserved command word, abbreviation, > symbol, or operator > as a search word, double quotation marks, " ", shall be used > to restore its > literal meaning.... > > FIND 0?10"5" // ten zeroes followed by a five (my example) > FIND C?"14" // word beginning with C, ending in 14 (from > Z39.58 appendix) > > # has no interaction with digits: Multiple #s shall be used > to indicate > that a precise number of characters greater than one and qual > to the number > of # symbols are to be masked (7.7.2.2). > > The standard never offered an explicit explanation/example of > restoring ", > but presumably by 7.7.2.1... > > FIND """Some text in quotes""" > > There is no mention of the more typical "" escaping. > > The portion of a <search_term> that corresponds to restoration is: > > {<restoration><word>[<space><word>]...<restoration>} > > <restoration> ::= [<space>}<">[<space>] > <space> ::= < >[< >}... > <word> ::= {<char>|<var_mask>|<exact_mask>}... > <var_mask> ::= <?>[<positive_integer>] > <exact_mask> ::= <#>[<#>]... > <char> ::= <any_searchable_char> > <any_searchable_char> ::= any character locally defined as searchable > > > ----- Original Message ----- > > > Not making it to the ZIG, someone sent me some private mail > indicating > > that Ralph's proposed single digit after '?' change got accepted > > and possibly no-one mentioned my counter double quotes suggestion. > > Fair enough, if you don't turn up you have less influence. > > > > Just thought I would have a last bash at a compromise with the idea > > that if the CCL regexp is changing, may as well try and get as many > > changes in as possible in one hit rather than change it again later. > > > > To repeat the problem I currently have with the CCL regexp is that > > you cannot specify '?' or '#' as literal text (ie, release their > > special meaning). So even if there is now allowed only to be a > > single digit after '?', while the spec is being changed is it worth > > allowing double quotes ('"') to be used to release special chars > > anyway? This would allow 'find all terms starting with "#"'. > > At present, you cannot do this with the CCL regexp. Normally > > regexp's have release mechanism ( \ for regexp-1 I believe). > > CCL uses " as a release mechanism so seemed the natural thing > > to use in the CCL regexp (rather than \ which in CCL has no > > special meaning). > > > > It seems an oversight not to allow searching for serial numbers etc > > using patterns. > > > > #41434 > > #53423 > > > > If people have to change their CCL regexp implementation anyway, > > I would rather do both changes at the same time and make it possible > > to search for all possible characters. > > > > I wonder also if the Z39.58/CCL regexp attribute needs to be renamed > > to indicate that it no longer conforms to CCL. I don't actually have > > a copy of Z39.58, but if its anything like the ISO version of CCL > > the spec is so woolly that it isn't funny! The formal grammar is > > given by examples only, and the examples contradict themselves > > in places! (Mind you, the copy I have of ISO8777 is pretty old now > > so maybe its been improved.) Not stressed, just thought it was the > > correct time to at least ask the question. > > > > Alan > > -- > > Alan Kent (mailto:ajk@mds.rmit.edu.au, > http://www.mds.rmit.edu.au/~ajk/) > > Project: TeraText Technical Director, InQuirion Pty Ltd > (www.inquirion.com) > > Postal: Multimedia Database Systems, RMIT, GPO Box 2476V, > Melbourne 3001. > > Where: RMIT MDS, Bld 91, Level 3, 110 Victoria St, Carlton 3053, VIC > Australia. > > Phone: +61 3 9925 4114 Reception: +61 3 9925 4099 Fax: > +61 3 9925 4098 >
Received on Tuesday, 7 May 2002 10:31:28 UTC