Archive for November, 2007

Sell me tags, Twine!

How much would, say, the New York Times have to pay to have the entirety of its newspaper analyzed and annotated every day?

The question is not hypothetical.

The librarians could go home, and fancy machine learning and natural language processing could step in and start extracting entities and tagging content. Hi, did you know Bill Clinton is William Jefferson Clinton but not Senator Clinton?! Hey there, eh, did you know that Harlem is in New York City?! Oh, ya, did you know that Republicans and Democrats are politicians, who are the silly people running around playing something called politics?!

Twine could tell you all that. Well, they say they can, but they won’t invite me to their private party! And maybe the librarians wouldn’t have to go home. Maybe they could monitor (weave?) the Twine and help it out when it falls down (frays?).

I want to buy Twine’s smarts, its fun tags. I’d pay a heckuva lot for really precociously smart annotation! They say, after all, that it will be an open platfrom from which we can all export our data. Just, please, bloat out all my content with as much metadata as you can smartly muster! Por favor, sir! You are my tagging engine—now get running!

What if Twine could tag all the news that’s fit to read? It would be a fun newspaper. Maybe I’d subscribe to all the little bits of content tagged both “Barack Obama” and “president.” Or maybe I’d subscribe to all the local blog posts and newspaper articles and videos tagged “Harlem” and “restaurant”—but only if those bits of content were already enjoyed by one of my two hundred closest friends in the world.

I’d need a really smart and intuitive interface to make sense of this new way of approaching the news. Some online form of newsprint just wouldn’t cut it. I’d need a news graph, for sure.

See TechCrunch’s write-up, Read/Write Web’s, and Nick Carr’s too.

PS. Or I’ll just build my own tagging engine. It’ll probably be better because I can specifically build it to reflect the nature of news.


Josh Young's Facebook profile

What I’m thinking

  • Srsly, @twitter, you're trying to be friendly and serendipitous, but I want a stranger in my feed only with a obvious intro from a friend. 23 minutes ago
  • @zseward Really, imho, who won the bet turns out to say more about google than the NYT, blogs, or wikipedia. 2 hours ago
  • @zseward I'm familiar with the bet. I agree wikipedia beats both blogs and NYT. How does "algorithm" illuminate wikipedia's process? 2 hours ago
  • @Jakewk Well, @cshirky brought up "algorithmic authority" earlier today at #kmedia. I'm sure video is forthcoming. 2 hours ago
  • "The list is the origin of culture." Lists "make infinity comprehensible." http://j.mp/2p1F4T Umberto Eco, folks. Here all year. 3 hours ago
  • Legitimate processes give rise to authority. But "algorithm" is an overheated word here. "Algorithmic authority" a lyrical misnomer. #kmedia 4 hours ago

What I'm saving.

RSS What I’m reading.

  • Diderot on Information Overload 2009 September 3
    Denis Diderot, "Encyclopédie" (1755) As long as the centuries continue to unfold, the number of books will grow continually, and one can predict that a time will come when it will be almost as difficult to learn anything from books as from the direct study of the whole universe. It will be almost as convenient to search for some bit of truth concea […]
    Stowe Boyd
  • OPENING ALERT: Macbar 2009 September 23
    Shared by joshyoung Why? Why the macaroni-shaped bowls?! With duck confit mac and cheese, things were looking so bright! And now I'm crestfallen, repelled by the kitsch. 54 Prince St., Soho Phone: 212-226-8877 Status: Open now. PR reps announce that the long delayed macaroni shaped sister to Soho scenster restaurant Delicatessen opens today on 54 Prince […]
    (author unknown)
  • Facebook release Tornado and it’s not based on Twisted? 2009 September 12
    Image: Jay Smith To their great credit, Facebook have just open-sourced more of their core software. This time it’s Tornado, an asynchronous web server written in Python. Surely that can only mean one thing: Tornado is based on Twisted. Right? Incredibly, no. Words fail me on this one. I’ve spent some hours today trying to put my thoughts into order so I cou […]
    terry
  • Calling All Librarians 2009 September 19
    I just received a copy of Inside Larry and Sergey's Brain by Richard Brandt. I'm generally pretty bad at reading this type of book and getting a review out in a timely manner, so this time I'm going to try my best to write a review of the first chapter.However, before I get that far, I've formed an impression based on the opening analogy. […]
    Matthew Hurst
  • Zakta – a new way to organize web knowledge 2009 September 14
    After the WhizBang!Labs implosion, I worked for Intelliseek (BuzzMetrics, Neilsen) where Sundar Kadayam was the CTO. Since leaving Nielsen, Sundar has been busy working on a new idea called Zakta. The site combines the functionality of web search, wiki and social features with the goal of simplifying the discovery, extraction and maintenance of knowledge dis […]
    Matthew Hurst
  • Tornado powering this blog 2009 September 15
    This blog is now running off of Tornado on App Engine. Tornado is an open source version of the web server and tools that power FriendFeed. I'm really excited that this code was open sourced, working with this server has been a pleasure at FriendFeed and I'm looking forward to seeing how developers will use it and contribute to it. I haven't p […]
    benjamin.golub
  • Workshop on Information in Networks (WIN) 2009 August 27
    For those of you interested in the study of networked data, I would like to bring your attention to the "Workshop on Information in Networks (WIN)", a workshop organized by my colleagues Sinan Aral, Foster Provost, and Arun Sundararajan. It will take place on September 25-26, 2009. From the description:The purpose of WIN is to bring together leadin […]
    Panos Ipeirotis
  • How Twitter works in theory 2009 August 15
    It is said that an economist is someone who sees something that works in practice and wonders whether it works in theory. Twitter clearly works in practice - and if you want practical advice, watch Laura Fitton's Tech talk at Google, or read her Twitter for Dummies. I've learned a lot from talking to her and others about this phenomenon, and I want […]
    Kevin Marks
  • FluidDB has launched! 2009 August 25
    In case you missed it, FluidDB has (finally) launched. I wont be blogging here about FluidDB or Fluidinfo, though will continue to post personal things and of course random bits of code that seem interesting (and small) enough to warrant mention. I have yet another Twisted snippet coming up, though I’m not sure when I’ll get there. We’re all exhausted and th […]
    terry
  • The Raging Debate Over The Link Economy 2009 August 16
    Arnon Mishkin wrote a post last Thursday on paidContent called “The Fallacy Of The Link Economy” that has been generating a lot of discussion, so I figured I’d join in the free-for-all. First, let me try to reduce each person’s argument to a direct quote that best sums up his position. Arnon Mishkin: The vast majority of the value gets captured by aggregator […]
    Daniel Tunkelang