{"id":234,"date":"2008-10-08T23:32:14","date_gmt":"2008-10-09T06:32:14","guid":{"rendered":"http:\/\/www.talesfromthe.net\/jon\/?p=234"},"modified":"2008-10-08T23:32:14","modified_gmt":"2008-10-09T06:32:14","slug":"berkman-center-doesnt-bother-to-consult-with-privacy-experts-before-publishing-1700-students-facebook-data","status":"publish","type":"post","link":"https:\/\/2024.thenexus.today\/index.php\/2008\/10\/08\/berkman-center-doesnt-bother-to-consult-with-privacy-experts-before-publishing-1700-students-facebook-data\/","title":{"rendered":"Berkman Center researcher publishes 1700 students&#8217; Facebook data: &#8220;We did not consult w\/ privacy experts on how to do this, but we did think long and hard &#8230;.&#8221;"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignright\" style=\"margin-left: 5px; margin-right: 5px; float: right;\" src=\"http:\/\/farm1.static.flickr.com\/225\/503165914_a680a56c77.jpg\" alt=\"facebook logo\" width=\"150\" height=\"56\" \/>I think I&#8217;ll let others tell the story for me &#8230;<\/p>\n<p>September 25:<\/p>\n<blockquote><p>In collaboration with Harvard sociology graduate students Kevin Lewis and Marco Gonzalez, and with UCLA professor <a href=\"http:\/\/www.sscnet.ucla.edu\/soc\/faculty\/wimmer\/\">Andreas Wimmer<\/a> and Harvard professor <a href=\"http:\/\/www.wjh.harvard.edu\/soc\/faculty\/christakis\/\">Nicholas Christakis<\/a>, Berkman Fellow <a href=\"http:\/\/www.jasonkaufman.net\/\">Jason Kaufman<\/a> has made available a first wave of Facebook.com data through the <a href=\"http:\/\/thedata.org\/\">Dataverse Network Project<\/a>.<\/p>\n<p>The dataset comprises machine-readable files of virtually all the information posted on approximately 1,700 FB profiles by an entire cohort of students at an anonymous, northeastern American university.<\/p>\n<p>&#8212; <a href=\"http:\/\/cyber.law.harvard.edu\/node\/4682 \">Tastes, Ties, and Time: Facebook data release<\/a>, Berkman Center for Internet and Society, Harvard University<\/p><\/blockquote>\n<p>September 29:<\/p>\n<blockquote><p>The \u00e2\u20ac\u0153non-identifiability\u00e2\u20ac\u009d of such a dataset is up for debate&#8230;.\u00c2\u00a0 According to the authors, the collection of the dataset was approved by the IRB, Facebook and the individual college.\u00c2\u00a0 The dissemination of the dataset appears to be approved by the IRB.<\/p>\n<p>&#8212; <a href=\"http:\/\/fstutzman.com\/2008\/09\/29\/facebook-datasets-and-private-chrome\/\">Facebook Datasets and Private Chrome<\/a>, Fred Stutzman, <em>Unit Structures<\/em><\/p><\/blockquote>\n<p><!--more-->September 30:<\/p>\n<blockquote><p>Of course, this sounds like an <a href=\"http:\/\/michaelzimmer.org\/2006\/08\/09\/aol-search-log-profiles-unmasked\/\" target=\"_blank\" rel=\"noopener\">AOL-search-data-release-style privacy disaster<\/a> waiting to happen.<\/p>\n<p>&#8212;<a href=\"http:\/\/michaelzimmer.org\/2008\/09\/30\/on-the-anonymity-of-the-facebook-dataset\/\"> On the \u00e2\u20ac\u0153Anonymity\u00e2\u20ac\u009d of the Facebook Dataset<\/a>, Michael Zimmer,<em> michaelzimmer.org<\/em><\/p><\/blockquote>\n<p>October 2:<\/p>\n<blockquote><p>I think it\u00e2\u20ac\u2122s hard to imagine that some of this anonymity wouldn\u00e2\u20ac\u2122t be breached with some of the participants in the sample. For one thing, some nationalities are only represented by one person.<\/p>\n<p>&#8212; Eszter Hargittai, <a href=\"http:\/\/fstutzman.com\/2008\/09\/29\/facebook-datasets-and-private-chrome\/#comment-2489\">in a comment<\/a> on <em>Unit Structures<\/em><\/p>\n<p>We did not consult w\/ privacy experts on how to do this, but we did think long and hard about what and how this should be done.<\/p>\n<p>&#8212; Jason Kauffman, <a href=\"http:\/\/michaelzimmer.org\/2008\/09\/30\/on-the-anonymity-of-the-facebook-dataset\/#comment-156276\">as a comment on michaelzimmer.org <\/a><\/p><\/blockquote>\n<p>OK, OK, I&#8217;ve held my tongue long enough.\u00c2\u00a0 The arrogant attitude of &#8220;we&#8217;re smart and we thought about it so we didn&#8217;t bother to ask the experts&#8221; is a well-known recipe for disaster in privacy (or security or software engineerig or &#8230;).\u00c2\u00a0 People like Cynthia Dwork of Microsoft Research and Latanya Sweeney of Carnegie-Mellon University have been studying data anonymization and reidentification for years; this stuff is <em>hard<\/em>.\u00c2\u00a0\u00c2\u00a0 How can the Berkman Center not know that?\u00c2\u00a0 And how can Facebook and Harvard be so cavalier as to share data with a research team with an attitude like this?<\/p>\n<p>October 3:<\/p>\n<blockquote><p>Well, I\u00e2\u20ac\u2122m pretty sure this \u00e2\u20ac\u0153anonymous, northeastern American university\u00e2\u20ac\u009d is <strong><a href=\"http:\/\/www.college.harvard.edu\/\" target=\"_blank\" rel=\"noopener\">Harvard College<\/a>.<\/strong> And I didn\u00e2\u20ac\u2122t even have to download the dataset to figure it out. Here\u00e2\u20ac\u2122s how.<\/p>\n<p>&#8212; <a href=\"http:\/\/michaelzimmer.org\/2008\/10\/03\/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college\/ \">More On the \u00e2\u20ac\u0153Anonymity\u00e2\u20ac\u009d of the Facebook Dataset &#8211; It\u00e2\u20ac\u2122s Harvard College<\/a>, Michael Zimmer,<em> michaelzimmer.org<\/em><\/p><\/blockquote>\n<p>See, I told you this stuff is hard.<\/p>\n<p>October 7:<\/p>\n<blockquote><p>In the comments, Jason Kaufman implies that the data really isn&#8217;t that private, asking what could go wrong, and why would someone post it to Facebook expecting it to remain private.<\/p>\n<p>I have just one question on all of this.  If the data isn&#8217;t private, why did they attempt to anonymize it?<\/p>\n<p>I believe they attempted to anonymize it because it&#8217;s fairly obvious that the data is private, and releasing it with names obviously attached would be pretty shocking.<\/p>\n<p>&#8212; <a href=\"http:\/\/www.emergentchaos.com\/archives\/2008\/10\/facebook_twofaced_over_da.html\">Researchers Two-Faced on the Facebook Data Release<\/a>, Adam Shostack, <em>Emergent Chaos<\/em><\/p><\/blockquote>\n<p>Yeah, really.<\/p>\n<blockquote><p>The original research mission (to collect and analyze a set with proper safeguards) was within bounds; the follow-up distribution is the element that clearly poses risk.<\/p>\n<p>&#8212; <a href=\"http:\/\/fstutzman.com\/2008\/10\/07\/facebook-dataset-identified\/\">Facebook Dataset Identified<\/a>, Fred Stuzman, <em>Unit Structures<\/em><\/p><\/blockquote>\n<p>Well, except it turns out that the original research mission also clearly posed risk: for example, the proper safeguard might not be in place.\u00c2\u00a0 Did the IRB (Institutional Review Board) look at this?\u00c2\u00a0 Did Facebook and Harvard?<\/p>\n<p>Fred goes on to make the excellent point that the researchers should have convened a panel to discuss before releasing the information, and suggests as a potential takeaway &#8220;Research that pushes the boundaries of technology and privacy provide IRB\u00e2\u20ac\u2122s with unique challenges.&#8221;\u00c2\u00a0 True enough, and his post and the comments &#8212; along with all the other ones I&#8217;ve linked\u00c2\u00a0 to &#8212; are well worth reading.<\/p>\n<p>But it seems to me that this is letting the Berkman Center, Facebook, and Harvard off the hook a little too easily.\u00c2\u00a0 They just put information about 1700 students, at least some of whom (and probably most) are likely to be identifiable, up on the internet &#8230; without even asking their permission.<\/p>\n<p>It&#8217;s late at night and so maybe I&#8217;m feeling irritable but I find myself asking questions like: In what universe is this supposed to be okay?<\/p>\n<blockquote><p>The Berkman Center&#8217;s mission is to explore and understand cyberspace; to study its development, dynamics, norms, and standards; and to assess the need or lack thereof for laws and sanctions.<\/p>\n<p>&#8212; the <a href=\"http:\/\/cyber.law.harvard.edu\/about\">Berkman Center&#8217;s mission statement<\/a><\/p><\/blockquote>\n<p>The Berkman Center recently hosted a conference and gala on <a href=\"http:\/\/cyber.law.harvard.edu\/events\/berkmanat10\">The Future of the Internet<\/a>.\u00c2\u00a0 People look to them as authorities.\u00c2\u00a0 Is this the future they want to create?<\/p>\n<p>As far as I know, none of the <a href=\"http:\/\/cyber.law.harvard.edu\/people\">Berkman Center faculty<\/a> have weighed in on this yet.\u00c2\u00a0 It&#8217;ll be interesting to hear what Yochai Benkler, William Fisher, Charles Nesson, John Palfrey, Jonathan Zittrain, John Deighton, Jack Goldsmith, Alexander Keysser, Charles Ogletree and Stuart Scheiber have to say about what this episode says about the  &#8220;need or lack thereof for laws and sanctions.&#8221;<\/p>\n<p>And in terms of understanding, given the potential for gender-, race- and culture-based differences in attitudes towards privacy, I&#8217;m also looking forward to what they &#8212; and others &#8212; think about how events might have been influenced by the Berkman Center&#8217;s, and the research team&#8217;s, diversity.\u00c2\u00a0\u00c2\u00a0 Or lack thereof.<\/p>\n<p>jon<\/p>\n<p style=\"text-align: center;\"><em>Facebook graphic <a href=\"http:\/\/www.flickr.com\/photos\/ajc1\/503165914\/\">from AJC1&#8217;s flickr site<\/a>, licensed <a href=\"http:\/\/scienceoftheinvisible.blogspot.com\/2008\/07\/using-creative-commons-images-from.html\">under Creative Commons<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I think I&#8217;ll let others tell the story for me &#8230; September 25: In collaboration with Harvard sociology graduate students Kevin Lewis and Marco Gonzalez, and with UCLA professor Andreas Wimmer and Harvard professor Nicholas Christakis, Berkman Fellow Jason Kaufman has made available a first wave of Facebook.com data through the Dataverse Network Project. The [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8,13],"tags":[113,141,148,275],"class_list":["post-234","post","type-post","status-publish","format-standard","hentry","category-privacy","category-social-computing","tag-diversity","tag-facebook","tag-feminism","tag-privacy"],"_links":{"self":[{"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/posts\/234","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/comments?post=234"}],"version-history":[{"count":0,"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/posts\/234\/revisions"}],"wp:attachment":[{"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/media?parent=234"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/categories?post=234"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/2024.thenexus.today\/index.php\/wp-json\/wp\/v2\/tags?post=234"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}