Opinion

Paul Smalera

All your Tumblr are belong to Them

Apr 18, 2012 14:43 EDT

Forget Instagram’s billion-dollar payday. Forget IPOs, past and future, from Facebook, Groupon, LinkedIn and the like. And ignore, please, the online ramblings of attention-hungry venture capitalists and narcissistic Silicon Valley journalists with the off-putting habit of making their inside-baseball sound like the World Series. Their stories, to paraphrase Shakespeare, are tales told by idiots, full of sound and fury, but signifying very little about the impact of technology on most of our lives. (Sure, some of their tales are about great fortunes, but those are only for a select few; to summon the Oracle of Omaha rather than the Bard of Avon, only a fool ever equated price with value.) Their one-in-a-million windfalls are just flashes in the pan. Or, actually, they are solitary data points, meaningless when devoid of context.

That context is here. It’s come, in part, because of the cunningly simple social and curatorial tools that media companies like Twitter, Tumblr, Facebook and Pinterest give away to their users. But making sense of our social world is only possible with the the tools and technology behind what we call Big Data. The massive information collections spawned by our digital world are too big to address directly, so smart scientists have used fast computers to carve the data into real knowledge. This is how Big Data is already changing the way the world works.

But Big Data is young; though there are hundreds of accessible data sets already, there are still many more chaotic stores of information its tools can tame. Take, for example, social media: Yesterday, social media API company Gnip announced that it is providing customers with all of Tumblr’s data, what in techspeak is called the firehose. What Gnip and competitors like DataSift are providing to customers are Social Big Data firehoses that can be perfectly filtered into gently babbling brooks lined with digital gold nuggets. When the tech media wonder out loud how social companies will ever make a buck – sifting the gold out of their user-generated content is a huge piece of the puzzle.

At Gnip, Tumblr joins Twitter, WordPress, Disqus and the Chinese microblogging service Sina Weibo as the latest tree in a forest of Social Big Data accessible via API. A well-written API can transform a jumble of numbers into a perfectly organized multiplication table – on the order of millions or even billions of complex data pieces. (See this recent Economist visualization of the data record of a single tweet for more context.)

The data pieces are valuable, but not solely because they help advertisers sell more widgets: In an email, Gnip Chief Operating Officer Chris Moody explained one of the coolest uses of data his company has enabled may have actually helped firefighters do their job better: “During the 4 Mile Canyon Fire in Boulder in 2010, [Gnip customer] VisionLink was able to provide fire crews and managers a realtime view into what was happening on the ground by layering geo-tagged Tweets and Flickr images onto a Google map of the area.”

It’s not just maps, photos and geo locations that number crunchers crave. Tumblr, after all, is a blog network full of cat photos, animated GIFs and other tomfoolery. Yet last year its already booming traffic grew an additional 300 percent. As the Web comic XKCD noted a day before Gnip’s announcement, the proper noun “Tumblr” is perhaps six months away from surpassing “blogging” in online searches, much the way “Google” became synonymous with the verb “search” a decade earlier. Tumblr users know that the site’s tools are heavily oriented toward sharing and signaling, as opposed to pure content creation. On Tumblr, users can of course write posts and upload photos, but they can also follow other users, “heart” each others’ posts and reblog posts they want to share with their own followers – and each action takes little more than one click. All these actions are trackable, all of them indicate some sort of sentiment or preference and all of them are discrete chunks of data in Tumblr’s massive data store – now joining the 90 billion pieces of social data Gnip is already delivering on a monthly basis.

If all of this sounds like a grand plot to aggregate, sift and derive insight into social behavior from all sorts of data that unsuspecting users unwittingly upload – well, it is. But those are the trade-offs that users agree to when they sign up with social-media websites, right when they check that box agreeing to the terms of service.

And it’s not just marketers and advertisers who pore over the data – the Guardian bought a slice of real-time social media from DataSift to help inform its coverage of the 2011 London riots. Joining journalists who want to understand social media with the help of big data are governments, NGOs, charities and non-profits, among all sorts of other organizations. For the first time ever, all these groups can access the data that can help them answer questions they’ve been asking for decades. In industry, science and government, many data pools have been available for several years now.

In a way, social media, despite being cutting-edge, was behind the curve. It had to grow up a little more, out of niche status, before its data set was big enough to provide meaningful insight into the way we live online. It looks as if that day is just about here. (Interestingly, Facebook, oft-criticized for privacy lapses, remains a mostly dark data pool; the company will slice and dice user data for advertisers and app makers but keeps its firehose closely guarded.)

So, yes, to adapt an old Internet meme, All your Tumblr (and much of your other social history) are belong to Big Data. But that’s not necessarily bad, if all the power of social media belongs to all of us. Whether that’s a worthwhile trade-off all depends on how we use that power, and what we finally do with all that data.

PHOTO: A Massachusetts firefighter hauls a firehose into position. REUTERS/Jim Bourg

COMMENT

“But that’s not necessarily bad, if all the power of social media belongs to all of us.”

Yeah, well, all that power doesn’t belong to all of us, it belongs to a very small group of collectors and sellers. And that is necessarily bad.

Posted by WeWereWallSt | Report as abusive

What real Internet censorship looks like

Feb 27, 2012 13:44 EST

Lately Internet users in the U.S. have been worried about censorship, copyright legalities and data privacy. Between Twitter’s new censorship policy, the global protests over SOPA/PIPA and ACTA and the outrage over Apple’s iOS allowing apps like Path to access the address book without prior approval, these fears have certainly seemed warranted. But we should also remember that Internet users around the world face far more insidious limitations and intrusions on their Internet usage — practices, in fact, that would horrify the average American.

Sadly, most of the rest of the world has come to accept censorship as a necessary evil. Although I recently argued that Twitter’s censorship policy at least had the benefit of transparency, it’s still an unfortunate cost of doing global business for a company born and bred with the freedoms of the United States, and founded by tech pioneers whose opportunities and creativity stem directly from our Constitution. Yet by the standards of dictatorial regimes, Internet users in countries like China, Syria and Iran should consider themselves lucky if Twitter’s relatively modest censorship program actually keeps those countries’ governments from shutting down the service. As we are seeing around the world, chances are, unfortunately, it won’t.

Consider the freedoms — or lack thereof — Internet users have in Iran. Since this past week, some 30 million Iranian users have been without Internet service thanks to that country’s blocking of the SSL protocol, right at the time of its parliamentary elections. SSL is what turns “http” — the basic way we access the Web — into “https”, which Gmail, your bank, your credit card company and thousands of other services use to secure data. SSL provides data encryption so that only each end point — your browser and the Web server you’re logging into — can decrypt and access the data contained therein.

By blocking SSL, Iran has crippled Tor, a program that enables Internet users to anonymize not just their content but their physical location as well. Tor is a very common workaround for users in totalitarian regimes to access Twitter, Gmail, Facebook and other services. It’s hard to come up with an apt analogy for Iran’s unprecedented blockage — it’s not just that the letters you send are read by the Post Office and photocopied for their records, it’s that the Post Roads themselves have been closed off, so you can’t even send a letter in the first place. That’s the net effect of blocking SSL in Iran.

The hacking group Anonymous has brought down all kinds of websites in protest, mostly over copyright, in the U.S. and Europe. I don’t advocate their targeting any country’s servers for retribution, but where is the outrage or public demonstration or media attention over the denials of Iranians’ basic freedoms to communicate, via the Internet?

Unfortunately, it’s still too easy for Internet companies and even the Internet’s founding fathers to dismiss the importance of the tools they created in fostering free and open public dialogue, especially in places like Iran. Recently, legendary engineer and Google Vice-President Vint Cerf published a New York Times op-ed entitled “Internet Access is Not a Human Right,” where he wrote: “Internet access is always just a tool for obtaining something else more important.” How wrong he is. Cerf’s line of thinking eviscerates the Internet — the wonder of the modern world he helped build. Cerf argues that humans have the right to “lead healthy, meaningful lives,” including having “freedom from torture or freedom of conscience.” Yet, we live in the 21st century: It’s hard to see how, among people whose economies are developed enough to afford them communication devices, Cerf would excuse governments that curtail their citizens’ freedom and right to use the ultimate communications tool — the global network of the Internet. In fact, in underdeveloped parts of the world, the cost to have a cell phone that connects to the Web can be quite affordable.

I’m not arguing semantics here — if our society excludes the Internet from the fundamental rights of human communication, we also excuse totalitarian regimes like Iran’s from any repercussions when it comes to blocking that avenue of human contact. It’s a dangerous compromise to make in a world that only gets more digital with each passing day. And it also conveniently excuses the free world from having to do much of anything about it. We wouldn’t forgive Iran if it threw 30 million citizens into solitary confinement — so why would we ignore it when the Iranian government effectively cuts the entire population off from the outside world, to stifle their voices during a critical electoral cycle?

The U.S. and the free world have often engaged in global humanitarian missions in cases of genocide, famine and natural disaster. At what point will the deprivation of freedom of communication warrant such an intervention? The U.S. is already on guard itself against hacking attempts from Russia and China — intrusions by both rogue and government-sponsored actors — so how long will we tolerate countries’ depriving their citizens of Internet access?

It’s a tricky question, with no easy answer. However, contemplating it may prepare us for the possibility that the world’s first cyber-war could be fought not to cut off a country’s Internet hookup, but to restore it. After all, the Obama administration’s State Department petitioned Twitter to stay online during one recent Iranian uprising and has used the service to communicate with citizens there during another. Iran has now essentially shut down Twitter with its SSL blocking. Will the U.S. respond? If we do, we will set a precedent that calls into question the rights of any government to silence its citizens on a global communications network, putting us into thorny conflicts with China and other 21st century frenemies. But if we don’t, we are condoning the silencing of dissent and turning our backs on a century-long pledge to foster democracy wherever it might flourish — even if it’s online.

Photo: Technicians monitor data flow in the control room of an internet service provider in Tehran. Picture taken February 15, 2011. REUTERS/Caren Firouz

COMMENT

The example of Iran is well taken in this article, but I would like to add one: I lived and taught in Zhuhai, China, from August 2007 to July 2009. As an expatriate, I didn’t seem to have my computer monitored and censored very much, but my students at United International College surely did.
We take our freedoms for granted. I don’t any more. I know what it is like to live in a country where “freedom of expression” is a sham. We shouldn’t let that happen here, which doesn’t mean condoning criminal activities on the net, but it does mean a conscious guarding of freedom of speech.

Posted by dwilliams3 | Report as abusive

The piracy of online privacy

Feb 10, 2012 13:28 EST

Online privacy doesn’t exist. It was lost years ago. And not only was it taken, we’ve all already gotten used to it. Loss of privacy is a fundamental tradeoff at the very core of social networking. Our privacy has been taken in service of the social tools we so crave and suddenly cannot live without. If not for the piracy of privacy, Facebook wouldn’t exist. Nor would Twitter. Nor even would Gmail, Foursquare, Groupon, Zynga, etc.

And yet people keep fretting about losing what’s already gone. This week, like most others of the past decade, has brought fresh new outrages for privacy advocates. Google, which a few weeks ago changed its privacy policy to allow the company to share your personal data across as many as 60 of its products, was again castigated this week for the changes. Except this time, the shouts came in the form of a lawsuit. The Electronic Privacy Information Center sued the FTC to compel it to block Google’s changes, saying they violated a privacy agreement Google signed less than a year ago.

Elsewhere, social photography app Path was caught storing users’ entire iPhone address books on their servers and have issued a red-faced apology. (The lesser-known app Hipster committed the same sin and also offered a mea culpa.) And Facebook’s IPO has brought fresh concerns that Mark Zuckerberg will find creative new ways to leverage user data into ever more desirable revenue-generating products.

This is the way we’re private now. It’s ludicrous for anyone who loves the Internet to expect otherwise. How else are these services supposed to exist — let alone make any money? Theft or misuse of private user data is a crime, certainly. But no social web app — not one — can work without intense analytics performed on the huge data sets that users provide to them voluntarily (you did read the terms of service agreement…right?).

And the issue compounds when people connect one site to another. By linking their Twitter to their Facebook to their Google+ to their Foursquare to their Zynga to their Instagram to their iOS, users are consolidating their lives, and in the process making them more attractive to marketers. While Facebook, Twitter and other services have made attempts to warn users about hitting the “connect” button, many of us hit that button with reckless abandon, without a thought of who’s slavering on the other side.

The reason social media and digital information companies want that data is because of what we refuse to give them: money. No one wants to pay for the privilege of chatting with their friends or using a coupon, and to this day, no one has to: Go ahead, ring their doorbell or pick up the free coupon book from your front stoop. But if you want to chat using Facebook or Gmail, or you want to buy a groupon for an 80 percent-off Botox service, you will have to tell those companies who you are. And those companies will use that information to tailor their offerings to you, increasing your value as a user and a customer. They will slice their data sets into a million different pieces and show those pieces to people — advertisers — who will pay them money for the privilege of using their service. They’ll use it to get to you.

This is an update on an old media model. Magazines and newspapers for decades could only guess at the readership of their product and the demographic of their customers. But now social and new media demand to — and can — know exactly who you are before they agree to let you use their free services. Even email newsletter services like the increasingly hot Thrillist – which might innocuously start you on their service by asking only for your simple email address — deploy click trackers, pixel trackers and other online data-gathering techniques to start to put together a picture of you as a user, both individually and in aggregate. A deceased magazine like Spy could only dream of that kind of intel.

Without such strategies, social web companies like these couldn’t exist. Every user has a choice when it comes to privacy, sure. But the second people sign up for Gmail, Facebook, Mint or Gilt Group, they have reaffirmed their willingness to be a mouse that the cats will chase. And these cats need mice. Otherwise they will starve. So they do their best to hide their intentions. Indeed, as a longtime and well-known advocate for the transformative power of technology recently told me, true believers like Mark Zuckerberg actively stake out radical positions on privacy, then talk about them as if they were natural or normal. It’s not too much different from the political process, where the best way to effect a shift in society is to present it as if it were a fait accompli and then expend energy actually moving the levers of power toward the shift rather than wasting time arguing with people about the implications.

That’s what the last five years have been about. Mark Zuckerberg, for example, wanted a collection of all our Facebook actions called an Open Graph to be part of our lives. Now, lo and behold, it is.

Today, we straddle two extremes — the offline and the online. Each comes with its own expectations and realities of what privacy is. Offline, your thoughts and actions are your own. Online, sharing one thing leads to a spiral effect where you are soon sharing everything. Offline gives us privacy, that near-mystical quality of ownership that we covet. Online gives us the extreme power of social tools, allowing us to glide around the formerly linear data of our existence. Want your private Rolodex? Fine. Want your location-aware, instant-coupon, friend-tagging iPhone app? That’s fine too. But on the Internet, you don’t get to have both. There is no longer any middle ground. As in politics, the Internet has become a place for extremists.

Photo: People wear masks during the “Freiheit Statt Angst” (Freedom instead of Fear) protest calling for the protection of digital data privacy in Berlin, September 10, 2011. The masks were handed out by the organizers who asked participants of the rally to wear them for the media to symbolise what call the individual’s right to remain anonymous on the internet. REUTERS/Thomas Peter

COMMENT

It is utter rubbish to claim that we cannot have a better internet than the one we have right now. Most users simply do not really understand what and how much they have given up. And what has been given up unknowingly can be reclaimed. That is what laws are for.

Europe has a much better system, and much more privacy. And we can improve on that, without wrecking the essentials of the internet.

Posted by txgadfly | Report as abusive

Twitter’s censorship is a gray box of shame, but not for Twitter

Jan 28, 2012 20:09 EST

Twitter’s announcement this week that it was going to enable country-specific censorship of posts is arousing fury around the Internet. Commentators, activists, protesters and netizens have said it’s “very bad news” and claim to be “#outraged”. Bianca Jagger, for one, asked how to go about boycotting Twitter, on Twitter, according to the New York Times. (Step one might be… well, never mind.) The critics have settled on #TwitterBlackout: all day on Saturday the 28th, they promised to not tweet, as a show of protest and solidarity with those who might be censored.

Here’s the thing: Like Twitter itself, it’s time for the Internet, and its chirping classes, to grow up. Twitter’s policy and its transparency pledge with the censorship watchdog Chilling Effects is the most thoughtful, honest and realistic policy to come out of a technology company in a long time. Even an unsympathetic reading of the new censorship policy bears that out.

To understand why, let’s unpack the policy a bit: First, Twitter has strongly implied it will not remove content under this policy. If that doesn’t sound like a crucial distinction from outright censorship, it is. Taking the new policy with existing ones, the only time Twitter says it will ever remove a tweet altogether is in response to a DMCA request. The DMCA may have its own flaws, but it is a form of censorship that lives separately from the process Twitter has outlined in this recent announcement. Where the DMCA process demands a deletion of copyright-infringing content, Twitter’s censorship policy promises no such takedown: it promises instead only to withhold censored content from the country where the content has been censored. Nothing else.

To be sure, that’s censorship of a kind, but compared to the industry censorship even Americans have long lived with — take the Motion Picture Association of America, which still censors films based on dubious standards of taste and morality — it’s positively enlightened. And it never permanently destroys or pre-empts content, the way the MPAA does.

Further, for a country to censor content, it has to make a “valid and properly scoped request from an authorized entity” to Twitter, which will then decide what to do with the request. Twitter will also make an effort to notify users whose content is censored about what happened and why, and even give them a method to challenge the request. According to Twitter’s post, a record of the action will also be filed to the Chilling Effects website. The end result of a successful request is that the tweet or user in question is replaced by a gray box that notifies other readers inside the censoring country that the Tweet has been censored:

 

 

 

 

They’re gray boxes of shame alright, but not for the user, or for Twitter. It’s instead a bright signal to a country’s online citizens that their government is limiting their free speech. While the Egypt uprisings were powerful and in some part powered by Twitter, I can easily imagine a world where a censored tweet becomes the ultimate protest symbol; one that unfortunately deprives the protesters of content, but sends the message to protesters that their worst fears are right, and they ought not give up their fight.

The press organization Reporters Without Borders has sent a letter of protest to Twitter chairman Jack Dorsey, which is surprising considering the power of the gift that Dorsey has just given them. While some reporters get themselves on the ground to report from say, Syria, nothing can stop others in the U.S. or any other country from following the tweets of Syrian protesters, even if the Syrian government requests and is granted censorship of tweets within that country.

That’s the second important note: Twitter has made no mention of disabling users’ ability to tweet or of deleting a user because their tweets have been censored. Syria or some other country may choose to take down its communications grid or try to block access to Twitter, but short of such an action, it can’t stop tweets from reaching the outside world under this policy. In fact Twitter has strengthened its case to remain online in countries where free speech is threatened, possibly providing protesters with a valuable tool that would otherwise have been preemptively shut down.

If a government does engage in a cat-and-mouse game of blocking access, remember that nowhere else is the playing field more level between authorities and insurgents than online. Workarounds for Twitter blocks already exist, such as proxy servers that spoof the identity of users and their country of origin, and alternative access points (APIs) to reach the Twitter service.

Finally, reputation matters. Twitter has engendered much goodwill in the tech and international communities by its sterling behavior in both worlds. This is the company that put off a server upgrade to keep the tweets flowing from the Iran uprising in 2009, at the request of the U.S. State Department. It’s a company that’s managed to play by the rules while also leveling the playing field of communication as no other service has since Alexander Bell’s telephone. There’s nothing about this announcement that smacks of any change in policy or attitude; rather it seems like an honest attempt to abide by country-specific rules of law, while also exposing the power of those laws to citizens in countries where freedoms have been abridged. (Forbes as an example, mentions it is illegal to insult a French bureaucrat. One can imagine the uprising in France if the government tried to censor a Tweet insulting Sarkozy or one of his ministers, which would presumably lead to a rapid re-writing of that law.)

As long as no country can ever make a claim to censor a tweet on a worldwide basis, that tweet will exist somewhere on Twitter’s servers, and someone will be able to see it. By laying down clear rules for country-specific censorship, Twitter has implicitly stated that no government, company or individual has the power to eradicate a tweet it doesn’t like from the face the Earth. Twitter has laid down the rules by which it will hold countries accountable, and by which it will hold itself accountable, at least when it comes to censorship.

They are so fair as to be without precedent, and if they are violated, the world presumably will be able to see the hypocrisy in an instant. That’s a maturity that many — governments, corporations, and yes, sniping tweeters — have rarely shown when it comes to censorship or privacy policies. (Hello, SOPA, PIPA, ACTA, DMCA, Facebook and the rest!)

Besides, if Twitter were as evil as its critics would have us believe, would we be able to see the results of the ongoing #TwitterBlackout? If we are living in a world where corporations have more power than government, I’ll take that level of transparency from a new media company, every day.

COMMENT

This argument assumes a lot. To your point, do we need gray boxes to tell us which countries stifle online speech, or *new protest symbols in a sea of them? The overriding issue is the lack of full and complete transparency in Twitter’s methods and capabilities to censor tweets. For example:

- What does a “valid and properly scoped request from an authorized entity” mean?
- What information is required — a court order, a phone call?
- Who is considered an authorized entity — anyone familiar with a country’s law?
- Are members of a country’s news media or press exempt from censorship?
- Can requests for censorship be submitted in bulk, by keyword or by user?
- What is the criteria used for censoring a tweet? Is it only law?
- Is there a deliberation process? If so, what happens to the content during that time?
- Can tweets *coming into a country* be censored from view within that same country?
- Is any part of the technical act involved in censoring a tweet an automated process?
- When will requests be posted to Chilling Effects? Before, after, and if after, how long?

Take this quote from Twitter, also referenced above:

“…Upon receipt of requests to withhold content we will promptly notify affected users, *unless we are legally prohibited from doing so*, and clearly indicate to viewers when content has been withheld.”

- Could Twitter be legally prohibited from sharing a censorship request at all?
- Could Twitter be legally prohibited from indicating content was ever withheld?

It’s not clear, and presents a slippery slope potentially frightening to some. Questioning censorship practices is important and necessary, and these are basic questions I would expect a journalist to be asking. But instead, you’re promoting gray boxes as protest symbols… Forgive me, but I’m confused. Twitter should be 100% clear with its methods and capabilities. Instead, it’s translucent at best.

Posted by michsineath | Report as abusive
  •