W3C home > Mailing lists > Public > public-xg-socialweb@w3.org > August 2010

RE: Bruce Schneier's "A Revised Taxonomy of Social Networking Dat a"

From: Len Bullard <len.bullard@uai.com>
Date: Tue, 10 Aug 2010 10:52:07 -0500
Message-ID: <76758090F8686C47A44B6FF52514A1D309F575B1@hermes.uai.int>
To: Mischa Tuffield <mischa.tuffield@garlik.com>
Cc: Public SWXG <public-xg-socialweb@w3.org>
Good for Tim.  I look forward to reading that.

 

We've known since the earliest days of building networked hypermedia that
there would be these problems with semantic inference.   It is a problem of
any broadcast system where frequency and amplitude are used to push a signal
and induce a sign.

 

Thanks, Mischa.

 

len

 

-----Original Message-----
From: Mischa Tuffield [mailto:mischa.tuffield@garlik.com] 
Sent: Tuesday, August 10, 2010 9:01 AM
To: Len Bullard
Cc: Public SWXG
Subject: Re: Bruce Schneier's "A Revised Taxonomy of Social Networking Dat
a"

 

 

On 10 Aug 2010, at 14:58, Mischa Tuffield wrote:





Len,

 

<snip/> 

 

On 10 Aug 2010, at 14:39, Len Bullard wrote:





"Derived data is data about you that is derived from all the other data. For
example, if 80 percent of your friends self-identify as gay, you're likely
gay yourself."

 

Or you're a theatre major.

 

"news articles you access (and what that says about your political
leanings)"

 

Or you're a poli-sci major or even an honest journalist trying to get the
panoramic view instead of the snapshot.

 

Dangerous.  Stupid.  Worthy of derision and scorn.

 

I guess the point was that is it POSSIBLE to make such generalisations based
social networking data. The ability to make such generalisations was also
highlighted in a court case verses Netflix recently [3] (re: problems with
the anonymization of data). From my na´ve point of view, this type of
generalisation is the only business model employed by social networking
sites at the moment, profiling does happen. I mean profiling happens by
virtue of people's names, and indeed it is sad that this happens in our
world, but ... 

 

see reference [1] re: profiling based on people's names...





 

FYI : timbl was awesome enough to stand up in the UK parliament against
Phorm and their deep packet inspection (DPI) strategy to user profiling, and
released a personal note entitled "No Snooping" which is also worth a
read[2].

 

Mischa

 

[1] http://www.wired.com/techbiz/people/magazine/15-06/ps_transparency
<http://www.wired.com/techbiz/people/magazine/15-06/ps_transparency> 

[2] http://www.w3.org/DesignIssues/NoSnooping.html
<http://www.w3.org/DesignIssues/NoSnooping.html> 

[3] http://www.theregister.co.uk/2009/12/21/netflix_privacy_flap/
<http://www.theregister.co.uk/2009/12/21/netflix_privacy_flap/> 





 

len

 

-----Original Message-----
From: public-xg-socialweb-request@w3.org
<mailto:public-xg-socialweb-request@w3.org>
[mailto:public-xg-socialweb-request@w3.org] On Behalf Of Mischa Tuffield
Sent: Tuesday, August 10, 2010 8:13 AM
To: Public SWXG
Subject: Bruce Schneier's "A Revised Taxonomy of Social Networking Data"

 

Hello, 

 

Bruce Schneier blogged [1] about his revision to his taxonomy of social
networking data today. He has identified the following characteristics for
social networking data: 

 

"

*         Service data is the data you give to a social networking site in
order to use it. Such data might include your legal name, your age, and your
credit-card number.

 

*         Disclosed data is what you post on your own pages: blog entries,
photographs, messages, comments, and so on.

 

 

*         Entrusted data is what you post on other people's pages. It's
basically the same stuff as disclosed data, but the difference is that you
don't have control over the data once you post it -- another user does.

 

 

*         Incidental data is what other people post about you: a paragraph
about you that someone else writes, a picture of you that someone else takes
and posts. Again, it's basically the same stuff as disclosed data, but the
difference is that you don't have control over it, and you didn't create it
in the first place.

 

 

*         Behavioral data is data the site collects about your habits by
recording what you do and who you do it with. It might include games you
play, topics you write about, news articles you access (and what that says
about your political leanings), and so on.

 

 

*         Derived data is data about you that is derived from all the other
data. For example, if 80 percent of your friends self-identify as gay,
you're likely gay yourself.

" - Bruce's blog [1]

 

 

The addition he made to the taxonomy was the inclusion of the 6th field
"Derived Data", which is probably due to work undertaken at MIT, referred to
as project Gaydar, [2] where they showed that they could analyse public
facebook profile pages to identify people's sexual preferences. 

 

So, finally, I am rather interested in fifth class identified above as :
"Behavioral data", which I feel is being abused by this new phenomenon that
is the facebook like button, and how it is being plastered all over the web.
I feel that it is just another tool for the company to gather,
surreptitiously, behavioural data about their users, in a method which is
analogous to "opt-out" based marketing which we are seemingly eradicating
from printed forms. I have ranted about this on my blog [3].

 

Anyways, bye for now, 

 

Mischa 

 

[1] http://www.schneier.com/blog/archives/2010/08/a_taxonomy_of_s_1.html
<http://www.schneier.com/blog/archives/2010/08/a_taxonomy_of_s_1.html> 

[2]
http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/2611/
2302
<http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/2611
/2302> 

[3] http://mmt.me.uk/blog/2010/07/30/the-facebook-like-button/
<http://mmt.me.uk/blog/2010/07/30/the-facebook-like-button/> 

___________________________________

Mischa Tuffield PhD

Email: mischa.tuffield@garlik.com <mailto:mischa.tuffield@garlik.com> 

Homepage - http://mmt.me.uk/ <http://mmt.me.uk/> 

Garlik Limited, 1-3 Halford Road, Richmond, TW10 6AW
+44(0)845 645 2824  http://www.garlik.com/ <http://www.garlik.com/> 
Registered in England and Wales 535 7233 VAT # 849 0517 11
Registered office: Thames House, Portsmouth Road, Esher, Surrey, KT10 9AD

 

This email and any files transmitted with it are confidential and intended
solely for the use of the individual or entity to whom they are addressed.
If you have received this email in error please notify the sender. This
message contains confidential information and is intended only for the
individual named. If you are not the named addressee you should not
disseminate, distribute or copy this e-mail.

 

___________________________________

Mischa Tuffield PhD

Email: mischa.tuffield@garlik.com <mailto:mischa.tuffield@garlik.com> 

Homepage - http://mmt.me.uk/ <http://mmt.me.uk/> 

Garlik Limited, 1-3 Halford Road, Richmond, TW10 6AW
+44(0)845 645 2824  http://www.garlik.com/ <http://www.garlik.com/> 
Registered in England and Wales 535 7233 VAT # 849 0517 11
Registered office: Thames House, Portsmouth Road, Esher, Surrey, KT10 9AD

 

 

___________________________________

Mischa Tuffield PhD

Email: mischa.tuffield@garlik.com <mailto:mischa.tuffield@garlik.com> 

Homepage - http://mmt.me.uk/ <http://mmt.me.uk/> 

Garlik Limited, 1-3 Halford Road, Richmond, TW10 6AW
+44(0)845 645 2824  http://www.garlik.com/ <http://www.garlik.com/> 
Registered in England and Wales 535 7233 VAT # 849 0517 11
Registered office: Thames House, Portsmouth Road, Esher, Surrey, KT10 9AD

 

This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify the sender. This message contains confidential information and is intended only for the individual named. If you are not the named addressee you should not disseminate, distribute or copy this e-mail.
Received on Tuesday, 10 August 2010 15:51:58 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 10 August 2010 15:51:58 GMT