{"id":456,"date":"2006-08-08T15:17:35","date_gmt":"2006-08-08T15:17:35","guid":{"rendered":"http:\/\/paslongtemps.net\/pl2\/2006\/08\/08\/aol-releases-search-logs-from-500000-users\/"},"modified":"2006-08-08T15:17:35","modified_gmt":"2006-08-08T15:17:35","slug":"aol-releases-search-logs-from-500000-users","status":"publish","type":"post","link":"https:\/\/paslongtemps.net\/blog\/2006\/08\/08\/aol-releases-search-logs-from-500000-users\/","title":{"rendered":"AOL Releases Search Logs from 500,000 Users"},"content":{"rendered":"<p>In a burst of naive generosity, AOL has made public a huge database revealing the web searches made by more than 500,000 of its users from March to May 2006; see here: <a href=\"http:\/\/www.ugcs.caltech.edu\/~dangelo\/aol-search-query-logs\/\">http:\/\/www.ugcs.caltech.edu\/~dangelo\/aol-search-query-logs\/<\/a><\/p>\n<p>The entries are anonymized, meaning that each user&#8217;s screen name is replaced by an ID number. Since people egosurf and search for addresses, it is probably possible to identify some of them, which pushed AOL to quickly take the database down. It can of course still be found on various mirrors. A daydream for ill-intentioned web marketers? No doubt visualization interfaces for this data will bloom across the web in the coming days. For now there are a few ad hoc stories, such as that of <a href=\"http:\/\/consumerist.com\/consumer\/privacy\/aol-user-927-illuminated-192502.php\">number 927<\/a>, who takes an interest in broken legs and the sex lives of cartoon characters&#8230;<\/p>\n<p>This data is gripping, fascinating, frightening. 
It is biased by the fact that it comes from AOL, whose users are not necessarily very computer-savvy. It confirms a few well-known impressions: people use search engines to get back to URLs they can&#8217;t be bothered to remember (the most searched keyword is, ironically, google), but also a great deal for more or less avowable purposes.<\/p>\n<p>Here, at random, are a few excerpts I pulled from it; hold on tight.<\/p>\n<p><i>Hello, my number is 3399:<\/i><br \/>\nplayboy background for myspace<br \/>\npic of a black rose<br \/>\nplayboy cursors for myspace<br \/>\nlisten to sexy lady by mc magic<br \/>\nneed to lose weight in my stomach how can i do that<br \/>\ni&#8217;m looking for a song called sexy lady<br \/>\npoems for someone u miss<br \/>\nstupid videos<br \/>\nsad poems about breaking up with your love<br \/>\nmake a slideshow for myspce<br \/>\npocket bikes in austin and round rock texas<br \/>\ncd fulkes lizards online<br \/>\nhow do you french kiss<\/p>\n<p>\n<i>Hello, my number is 2282:<\/i><br \/>\nwwwkingkong<br \/>\nhezekia come out the closet<br \/>\nwww.find my dad.com<br \/>\nwww.findafriend.com<br \/>\nwwwmissing.com<br \/>\nwwwyahoo.com<br \/>\nwwwmissingloveone.com<br \/>\nout of the closet<br \/>\nwwwmissingpersons.com<br \/>\nchat<br \/>\nwwwdaedbeatdad.com<\/p>\n<p>\n<i>Hello, my number is 2708:<\/i><br \/>\ngoogle<br \/>\nwetcircle.com<br \/>\nmakehimpay.net<br \/>\njizzhut<br \/>\nsexy tops<br \/>\nfree gay information<br \/>\nrevenge tactics<br \/>\nthong dancewear<br \/>\nfree email addresses<br \/>\nencyclopedia of revenge<br \/>\nhow to stop loving someone<br \/>\nverizon.net<br \/>\nfree stuff<br \/>\nweird free thingsto send someone<\/p>\n<p>\n<i>Hello, my number is 2761:<\/i><br \/>\ntomora tonioli<br \/>\nwholesale lobster tails<br \/>\nnorco real estate<br \/>\ncorona real estate<br \/>\namore 
cheesecakes<br \/>\nangels<br \/>\nwho&#8217;s who<br \/>\ntonioli<br \/>\ncheesecakes<br \/>\nbox of lobster tails<br \/>\nbox of frozen large lobster tails<br \/>\nwholesale xl twin mattresses<br \/>\nwholesale and discount lobster tails<br \/>\nwho&#8217;s who in real estate<\/p>\n<p>\n<i>Hello, my number is 3745:<\/i><br \/>\nmatch.com<br \/>\nyahoo<br \/>\nmatchcom<br \/>\nmatch.<br \/>\nmatch<br \/>\ngoogle.com<br \/>\nhelp aol.com<br \/>\npepper<br \/>\nhelpaol.com<\/p>\n<p>\n<i>Hello, my number is 479:<\/i><br \/>\nwto history<br \/>\ncitation machine<br \/>\nkant theory of knowledge<br \/>\nsouth suburban college<br \/>\nallegory of the cave<br \/>\nnip tuck season 3 dvd<br \/>\ndictionary<br \/>\nproofs for the existence of god<br \/>\nbose car decal<br \/>\ncranial nerves<br \/>\ncals<br \/>\ngerman translation<br \/>\nexistence precedes essence<br \/>\nsartre man surrounded by absurdity<\/p>\n<p>\nI&#8217;ll stop here; I feel like a voyeur. Reading <a href=\"http:\/\/battellemedia.com\/archives\/000063.php\">this analysis written by John Battelle 3 years ago<\/a>, I realize that what we have here is a piece of the <em>database of intentions<\/em>. I couldn&#8217;t have put it better myself:<\/p>\n<blockquote><p>\nThe Database of Intentions is simply this: The aggregate results of every search ever entered, every result list ever tendered, and every path taken as a result. It lives in many places, but three or four places in particular hold a massive amount of this data (ie MSN, Google, and Yahoo). This information represents, in aggregate form, a place holder for the intentions of humankind &#8211; a massive database of desires, needs, wants, and likes that can be discovered, supoenaed, archived, tracked, and exploited to all sorts of ends. Such a beast has never before existed in the history of culture, but is almost guaranteed to grow exponentially from this day forward. 
This artifact can tell us extraordinary things about who we are and what we want as a culture. And it has the potential to be abused in equally extraordinary fashion.\n<\/p><\/blockquote>\n<p><img decoding=\"async\" src=\"http:\/\/paslongtemps.net\/resources\/images\/tof\/aol.jpg\" alt=\"\" \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In a burst of naive generosity, AOL has made public a huge database revealing the web searches made by more than 500,000 of its users from March to May 2006; see here: http:\/\/www.ugcs.caltech.edu\/~dangelo\/aol-search-query-logs\/ The entries are anonymized, meaning that each user&#8217;s screen name is replaced by an [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-456","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/posts\/456","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/comments?post=456"}],"version-history":[{"count":0,"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/posts\/456\/revisions"}],"wp:attachment":[{"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/media?parent=456"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/categories?post=456"},{"taxonomy":"post_tag"
,"embeddable":true,"href":"https:\/\/paslongtemps.net\/blog\/wp-json\/wp\/v2\/tags?post=456"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}