Prepublication Public Sector Information (was: Data Mining / Cloud Computing)

I have posted a proof of concept for the special case of prepublication (XML under construction) which specifies a Public License Target (CC-BY-SA, etc., there is a pick list)  and includes a target Privacy Policy too).  While these are given as an informative "Target Audience", the document instance reserves all rights.  This "Prepublication Archive" is one stand-alone file (XHTML 1.1).  Even in a BYOD (Bring Your Own Device) world, this should improve accountability without harming transparancy.  Private Entities do not have Inspector Generals, and Jeanne's presentation on NASA mentioned that it helps if collaborators all are working from the same rule book.

The sample "Archive" is here:http://www.rustprivacy.org/2012/xsd/pii/  You can view the document source to see the exact syntax used.

The "Public" schema (for the XML) is http://www.rustprivacy.org/2012/xsd/pii/pii.xsd and various examples can be found in http://www.rustprivacy.org/2012/xsd/pii/psi.zip  All XML is set up to allow for easy automated redaction of comments and tagged marginalia as well as local, remote or no validation.  The schema implements pick lists (enumerations) for both the Targets (i.e. License and Privacy Policy).  It is possible to include a check off list approach to a privacy policy, or other specific Agency Publication Requirements (see psi-block.xml example)


--Gannon


________________________________
 From: Gannon Dick <gannon_dick@yahoo.com>
To: eGov IG (Public) <public-egov-ig@w3.org> 
Sent: Friday, November 2, 2012 5:40 PM
Subject: Data Mining / Cloud Computing
 



I found these two articles, to say the least, jarring[1,2].

Local Governments, small organizations and individuals have little say in the eventual resolution of these policies.  However, the W3C specifications and validation can be used as a guide for off-line authoring of documents too.  The tools go far beyond beyond simple spell-checking.  While they are rarely in routine use because they cost Enterprise Web Servers precious milliseconds, they are well worth the time to a scholar or researcher working off-line to preserve confidentiality or without access at a remote location.

Using Xerces[3], I've been able to validate (entirely off-line) XML Documents in the following
 formats:  StratML, JATS Journal Publishing (ANSI), JATS Article Authoring (ANSI), MODS (US LOC), DWML (NOAA) as well as embedded versions in XHTML 1.1.  As soon as I am able to publish examples I will (along with a zip file containing the interlinked XSD Sxhema.  This method is not as "good" as DTD's, but it is much smaller and gets the basics right.

--Gannon


[1] http://www.washingtonpost.com/business/economy/agencies-that-use-google-services-at-risk-of-data-mining-tech-group-says/2012/11/01/3244d6b6-237c-11e2-ac85-e669876c6a24_story.html
[2] (latest commentary) http://safegov.org/
[3] http://xerces.apache.org/

Received on Sunday, 4 November 2012 22:02:09 UTC