Welcome to P2PNET.net - The original daily p2p and digital news site. Always First!
Register | Login
RIAA News
Cool Stuff
MPAA News
Games / Consoles
News
Music
Movies
TV
Open Source
Mobiles
Advertising
Product News
P2P
Off Topic
Freedom
Politics
Interviews
Security
DRM
Links
Kids and Kartels
Search: 
Search
 
Web P2PNET   
Search: 
Search
Torrent Site Tracker
MP3Rocket
 
Add real-time p2pnet headlines to YOUR site ! Click here to download our newsfeed code

Last useful data for a long time?

p2pnet.net News:- AOL and Sony BMG have something common: they’re both enmeshed in major debacles which aren’t likely to go away anytime soon.

Sony’s was the appalling rootkit DRM farce and AOL’s was the disastrous leakage of search material.

Ed Felten has an interesting take on the latter situation. >>>>>>>>>>>>>>>>>>>>>>>>

Great, Now They’ll Never Give Us Data
By Ed FeltenFreedom to Tinker

Today’s New York Times has an interesting article by Katie Hafner on AOL’s now-infamous release of customers’ search data.

AOL’s goal in releasing the data was to help researchers by giving them realistic data to study. Today’s technologies, such as search engines, have generated huge volumes of information about what people want online and why. But most of this data is locked up in the data centers of companies like AOL, Google, and eBay, where researchers can’t use it. So researchers have been making do with a few old datasets. The lack of good data is certainly holding back progress in this important area. AOL wanted to help out by giving researchers a better dataset to work with.

Somebody at AOL apparently thought they had “anonymized” the data by replacing the usernames with meaningless numbers. That was a terrible misjudgement – if there is one thing we have learned from the AOL data, it is that people reveal a lot about themselves in their search queries. Reporters have identified at least two of the affected AOL users by name, and finding and publishing embarrassing search sequences has become a popular sport.

The article quotes some prominent researchers, including Jon Kleinberg, saying they’ll refuse to work with this data on ethical grounds. I don’t quite buy that there is an ethical duty to avoid research uses of the data. If I had a valid research use for it, I’m pretty sure I could develop my own guidelines for using it without exacerbating the privacy problem. If I had had something to do with inducing the ill-fated release of the data, I might have an obligation to avoid profiting from my participation in the release. But if the data is out there due to no fault of mine, and the abuses that occur are no fault of mine, why shouldn’t I be able to use the data responsibly, for the public good?

Researchers know that this incident will make companies even more reluctant to release data, even after anonymizing it. If you’re a search-behavior expert, this AOL data may be the last useful data you see for a long time – which is all the more reason to use it.

Most of all, the AOL search data incident reminds us of the complexity of identity and anonymity online. It should have been obvious that removing usernames wasn’t enough to anonymize the data. But this is actually a common kind of mistake – simplistic distinctions between “personally identifiable information” and other information pervade the policy discussion about privacy. The same error is common in debates about big government data mining programs – it’s not as easy as you might think to enable data analysis without also compromising privacy.

In principle, it might have been possible to transform the search data further to make it safe for release. In practice we’re nowhere near understanding how to usefully depersonalize this kind of data. That’s an important research problem in itself, which needs its own datasets to work on. If only somebody had released a huge mass of poorly depersonalized data …





Please help p2pnet to overcome the Kazaa / Hemming
libel lawsuit. Every penny counts. Canada’s antiquated

defamation law chills online freedom of speech .


p2pnet newsfeeds for your site.

rss feed: http://p2pnet.net/p2p.rss
Mobile – http://p2pnet.net/index-wml.php

HOME

2 Responses to “Last useful data for a long time?”

  1. Reader's Write Says:

    Who needs AOL’s data? All you have to do is do a regular search through Google, Yahoo! or some other search engine of your choice and find out all sorts of things about them.

    hmmmm….that gives me an idea ^_^… does the name Nikki mean anything to ya? BUWAAHAHAHAHAHAHAHA!

  2. Reader's Write Says:

    Yes! Yeah! Go, Team, Go!

    Does the name Nikki mean anything to me? Hmmmmmm.

    Let’s see, is that the jolly old guy who hangs out a lot around Christmas Time? No, I didn’t think so. Maybe it’s that Russian Premier who had such a soft spot in his heart for Disneyland. No? Hmmmmmmm.

    The only other Nikki I can think of is some freakish weirdo down under who doesn’t take kindly to calumny, even when it’s honest and truthful. So, I’m certain you wouldn’t be thinking of mining data on that old one-horse open sleigh. I’m sure she’s as pure as the driven snow, which caused Napoleon’s army to shrivel up and drag their hangdog butts back to Gaie Paree, thus turning all that pure snow into unmanageable slush. Come to think of it, you might indeed by talking about that particular Nikki.

    For some people, calumny is the only thing they understand or appreciate. They get so much of it that it becomes a comforting regular part of their lives. A sort of “raison d’etre”, if you will.

    But, then, I’m an American Citizen, and we Yanks enjoy a certain modicum of freedom of speech. Long live data mining and the age of Information!

    Just think, if she hadn’t decided to sick her legal dogs on that guy who runs and operates P2Pnet.net, no one would ever have heard of her. So, there she goes: biting the hand that feeds her celebrity.

    (Oh, well, dogs of a feather … or does it go that way?)

Leave a Reply

Please no Spam, flaming (attacking others), trolling, and posting off-topic. Thanks.

    Advertisements
TekSavvy


Remove Spyware with AntiSpyware for Windows®