- From: <gdespain@elm.az05.bull.com>
- Date: Thu, 11 Apr 96 08:54:00 MST
- To: g.despain@az05.bull.com, magnusm@maths.lth.se
- Cc: www-logging@w3.org
visit-id. Yes, this is session id. user-id, described-userid, and customer-id, as far as the log file is concerned, could all be the same. Recognizing their differences is driven mostly by how the id will be used, and is mostly an issue external to the log file issues. The differences, as applicable to the log file are: the string specific to a "userid" can be totally determined by the server, for example, random numbers. They need not be unique between different web sites. They will never be used for anything other than associating with each other multiple log records that span multiple sessions at a single web site. "described userid" is a little-bit-more than "userid" because it allows association of demographics with users. Again, they need not be unique between different web sites. This association could be accomplished either by embedding the demographics in the log file (e.g., logging the cookie) or by having a separate file recording the demographics. Of course, logging the demographic information would increase the size of the logs. In either case, there is no association with the values of "described userid" beyond relating visits and sessions with demographics. These associations can be totally managed by the server software itself. "customer-id" implies the ability to associate the id with something external to the server envirionment. They may need to be unique across multiple web sites. One way of handling this could be to use the "described userid" facility, but include the external name as though it were one of the demographic attributes. Now, all of that restated, the important issues to understand from the logging viewpoint are: 1. The logging file needs a mechanism to identify users, 2. The mechanism should take into account the potential different uses of the identification, 3. The identification tokens have different scopes of viability, 4. It would be useful, when demographic information is wanted and available, to have a standard way of relating that demographic information to the log file entries. ---------- From: magnusm To: Gerald L. Despain Cc: www-logging Subject: Re: Logging content sugestions Date: Thursday, April 11, 1996 4:09PM This is a response to a posting that I did not see any response to. On Thu, 14 Mar 1996, Gerald L. Despain wrote: > 1. In the extended log format it would be useful to allow a directive to specify the > identity of the site for which the log was produced. This would facilitate the > prepartion of analysis reports on aggregations of logs from different web sites. I think that including general information in the preamble of the log is a good idea. > 4. It would be useful to allow logging of various forms of user or session id, > including the ideas as defined in the working draft on session-id. In general, there > are the following kinds of "id" that may be available and logged: > > visit-id > Persists only for a single session. No other user information associated with the id. Is this the session id? It would be appropriate. > > user-id > Persists between sessions. Allows recognition of when the same person returns to the > site. No other user information associated with the id. > > customer-id > Persists between sessions. Allows the identification of the individual > in external databases, such as a customer database. > 1. I think that user-id and customer-id can be combined into one. If you need a specific id for the customer database, that should fall outside of the log. customer-id usually fall as an inside company id and should be kept as such. If you need to grab a specific customer-id, I suggest a secure link. Either have a log-in page or extend the cookie mechanism in order to let the client system define a user-id and customer-id cookie for that site. Any which way, I think you should keep the customer-id out of the log. 2. How are we defining a user-id. A random number is not guaranteed to be unique so ones has to be provided. Is that id supposed to be site unique or global. > described-user-id > Persists only for a single session. Associates some demographic information with the > user. Does not allow for identification of any personal identification such as name > or address. > > identified-user-id > Persists between sessions. Associates some demographic information with the user. > Does not allow for identification of any personal identification such as name or > address. > I believe that the information provided by *-user-id can be maintained by the site outside of the log. The deomgraphic information can be stored externally and tied to the user-id. When the log is exported, the user-id can be replaced by or augmented with the demographic information. The idea is for the site to ask "if the user would like to give demographic info" once for an unknown user-id and not do it under the table. magnus ---------------------- Magnus Mengelbier tel. Sweden + (0)40-29 39 12 Mathematical Statistics e-mail magnusm@maths.lth.se Lund University, Sweden magnusm@ibm.net
Received on Thursday, 11 April 1996 12:10:29 UTC