Zuckerberg said what about privacy? Researchers create archive to find out

The new 'Zuckerberg Files' archive contains over 100 full-text transcripts

Facebook CEO Mark Zuckerberg sometimes speaks quickly and his statements on Internet privacy are not always clear, so researchers have created an archive to collect everything the executive has said publicly, aimed at gaining a better understanding of where the company stands on privacy.

The University of Wisconsin-Milwaukee is hosting the Zuckerberg Files, a digital treasure trove containing over 100 full-text transcripts and about 50 video files documenting Zuckerberg's public statements for scholars to download and analyze. The statements include Zuckerberg-authored blog posts, company presentations, and print and video interviews going as far back as 2004. One of the archive's earliest entries is an article from the Harvard Crimson newspaper, in which the then-Harvard student spoke about a file-sharing service he was developing, called Wirehog.

The archive is in its early stages, but its developers have ambitious goals. One of the biggest is to investigate how Facebook's CEO approaches issues surrounding user privacy and how his public statements have changed over the years, and to decipher more of the company's thinking behind new products for sharing content like photos and status updates.

Take Facebook's introduction of its News Feed in 2006 and the ensuing user backlash. In the aftermath, Zuckerberg essentially told people to calm down, said Michael Zimmer, the lead administrator behind the archive.

Even though the archive is collecting information that is already public, when looked at as a whole it can help detect changes in how Zuckerberg talks about sharing and privacy, Zimmer said. It can give "a better sense of his perspective" and the company's too, he said. And even if the statements come off as PR or corporate branding, new information could still be gained about how Zuckerberg has shifted the company's message over the years, Zimmer said.

What isn't said could be interesting too -- Facebook doesn't use the word "privacy" very often. Instead the concept is usually framed in terms of "user controls," or "openness." Software like NVivo could be used to analyze this type of rhetoric in the archive's text transcripts, Zimmer said. Then, the insights could give better information to the public about how Facebook really works, he said.
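
As a rough illustration of this kind of analysis, the sketch below counts how often a few framing terms appear in a piece of text. It is only a minimal example in Python and does not represent how NVivo or the archive's researchers work; the term list, the `count_terms` function and the sample sentence are all hypothetical and made up for illustration.

```python
import re
from collections import Counter

# Hypothetical list of framing terms, taken from the examples in the article.
TERMS = ("privacy", "user controls", "openness")

def count_terms(text, terms=TERMS):
    """Count case-insensitive, whole-phrase occurrences of each term."""
    lowered = text.lower()
    counts = Counter()
    for term in terms:
        # Allow any run of whitespace between the words of a phrase.
        pattern = r"\b" + r"\s+".join(map(re.escape, term.split())) + r"\b"
        counts[term] = len(re.findall(pattern, lowered))
    return counts

# Made-up sample text, not a quotation from the archive.
sample = (
    "We are giving people more user controls. "
    "Openness matters, and user controls help."
)

for term, n in count_terms(sample).items():
    print(f"{term}: {n}")
```

Run over a set of transcripts grouped by year, counts like these could hint at whether one framing has displaced another over time, though a real study would need far more careful coding of context.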

The archive's bibliographic data and metadata are openly available to anyone, but access to the full-text transcripts and video files is limited to scholars doing research in a relevant area. Zimmer didn't want to get caught in a legal gray area by re-posting, say, copyrighted Wall Street Journal articles about Facebook.

But for researchers, everything's acceptable under fair use principles of copyright law, he said.

One interesting finding so far in the archive's data is the amount of time that Zuckerberg has spent making presentations in places like Brazil and in Europe to promote Facebook's platform to developers, Zimmer said. Bringing the Internet, and Facebook, to more people across the globe is high on the agenda for the company right now, partly through its Internet.org campaign.

For Zuckerberg, "there are more presentations now on broader social issues," Zimmer said. "That's a change."

Zach Miners covers social networking, search and general technology news for IDG News Service. Follow Zach on Twitter at @zachminers. Zach's e-mail address is zach_miners@idg.com
