- From: Ivan Herman via GitHub <sysbot+gh@w3.org>
- Date: Sat, 09 Apr 2016 02:31:47 +0000
- To: public-annotation@w3.org
iherman has just created a new issue for https://github.com/w3c/web-annotation: == Privacy Interest Group (PING) review == (This review came in via [email](https://lists.w3.org/Archives/Public/public-annotation/2016Apr/0027.html), sent by Greg Norcie <gnorcie@cdt.org>. I have copied the text to the issue with only formatting changes. IH.) Hi all, Ivan Herman reached out to PING to share a trio of documents relating to the Web Annotation model: * The Web Annotation Protocol[1] * The Web Annotation Vocabulary[2] * The Web Annotation Data Model[3] Together, these documents propose a way for “annotation servers” to be set up, which can manage and store annotations about websites. To start off, I wanted to list off some high level takeaways I gathered. I have also included a run through of the PING privacy questionnaire[4] I developed. 1. Annotations, like all other internet traffic should probably be sent via HTTPS. The IETF has termed pervasive monitoring as an “attack[4], recommending all traffic be sent over HTTPS to avoid said attack. Similarly, the United States CIO has stated that “All browsing activity should be considered private and sensitive. An HTTPS-Only standard will eliminate inconsistent, subjective determinations across agencies regarding which content or browsing activity is sensitive in nature”. [5] 2. I wasn’t clear reading this spec: Are annotation servers always controlled by the operators of a given site? Or can one annotation server annotate any website? Regardless, there there be an opt out mechanism, similar to a robots.txt on a standard web page? I especially worry about the issue of harassment, which has been raised with other annotation services like Genius[7]. 3. Finally, I feel it’s important that there be mechanisms to edit and delete annotations. Annotation servers should not be “write only”. In other contexts such as on Facebook[8], users often regret the data they upload - I expect that the annotation servers will have similar incidents. * [1] https://www.w3.org/TR/2016/WD-annotation-protocol-20160331/ * [2] https://www.w3.org/TR/2016/WD-annotation-vocab-20160331/ * [3] https://www.w3.org/TR/2016/WD-annotation-model-20160331/ * [4] https://gregnorc.github.io/ping-privacy-questions/ * [5] http://www.w3.org/2001/tag/doc/web-https * [6] https://https.cio.gov/ * [7] http://www.dailydot.com/technology/genius-annotations-online-harrassment/ * [8] “I regretted the minute I pressed share”: A Qualitative Study of Regrets on Facebook http://cups.cs.cmu.edu/soups/2011/proceedings/a10_Wang.pdf In addition to these high level takeaways, below I have walked through the PING Privacy Questionnaire and included my responses. I encourage other standards developers to consider using the self questionnaire - and I welcome feedback on how this questionnaire can better help spec authors perform privacy audits: * Does this specification have a "Privacy Considerations" section? * Not currently. * Does this specification collect personally derived data? * No. Users could put personal data in a tag if they chose, but that is not something the spec specifically asks for or encourages. * Does this specification generate personally derived data, and if so how will that data be handled? * No, this standard does not directly generate identifiable information such as audio or video. * Does this standard allow an origin direct access to a user’s location, and if so is that information minimized? * No, the Annotation Protocol does not collect location data. * How should this specification work in the context of a user agent’s "incognito" mode? * The same as without, assuming the server is accessed via the browser. * Is it possible to spoof/fake the data being generated for privacy purposes? * I assume users could use a proxy, VPN, or Tor to access the annotation server. * Does the standard utilize data that is personally-derived, i.e. derived from the interaction of a single person, or their device or address? * No. * Does the data record contain elements that would enable re-correlation when combined with other datasets through the property of intersection (commonly known as "fingerprinting")? * However I would like to point out that PING has previously discussed sensor-specific question that can get at cross-device or cross-UA signaling. (The Vibration API). Can I get a volunteer to submit a pull request to add language that would add language to capture this threat model to the existing questionnaire? * Is the user likely to know if information is being collected? * Yes, users must expressly navigate to and utilize the annotation server. * Can the user easily, preferably through an element of the GUI, revoke consent granted to a particular feature? * Again, not clear if users will have the ability to delete/edit annotations. Hopefully there will be a discussion on this feature - users often regret posts on social media[8], and it’s important they be able to delete their posts. * Once consent has been given, is there a mechanism whereby it can be automatically revoked after a reasonable, or user configurable, period? * I’m not 100% clear, but I would hope that users can delete their annotations if they choose to do so. * Does this standard utilize strong end to end encryption? * I see no mention of using HTTPS in this standard. I’d like to see language added that Annotation servers must use TLS. Please view or discuss this issue at https://github.com/w3c/web-annotation/issues/204 using your GitHub account
Received on Saturday, 9 April 2016 02:31:49 UTC