ThreadsML reborn
A few years back Steve Yost, Ben Hammersley, David Weinberger, Danny Ayers, Shelly Powers, Jason Shellen, Jay F, Mark Carey, Jon Lebkowsky and myself (and others) participated in a mail list/conversation about the notion of a standard for representing conversations.
We called it ThreadsML and the web site and archive of the mail list are still around.
I seem to remember that a message board that David Weinberger used throughout the nineties had “gone away” and he was quite pissed about losing all those archived ‘conversations‘.
So I found myself on this list, and we knocked around various principles, requirements, usage scenarios and specifications - which when built - would enable all sorts of new kinds of interaction and ‘fluid conversations‘ via the web.
Well look what happened!
It happened WITHOUT ThreadsML!
But like any good new idea, it’s ownership and control will be tussled and fought over by various early stage vendors who are first to the game. Sound like Twitter, FriendFeed, CoComment and Disqus to you?
ThreadsML is exactly what is needed now. And lots of inter-connected servers running ThreadsML, with a DNS-like backbone striping and copying these conversations as they happen in real-time.
But before I get too far - let me stand back and explain what ThreadsML is - in an abstract and get some historical perspective as well. I’ll start off with some background:
- Way back when conversations would start as mail list threads. Endless going back and forth with email messages, conversations were difficult to follow - but spirited and lively anyway! One would create a ‘group’ in Outlook to keep track of all the folks on the thread and we’d ‘manually’ maintain the conversation.
- Then blog comments came along and conversation morphed, just as they were morphing with IM, chat rooms and IRC back channels. We also got a feature called ‘trackback - and blog search engines could also inform us if anyone had picked up on a ‘meme’ you posted and started an unofficial conversation. A blog conversation.
- One could also follow a specific meme or keyword or phrase - and watch it birth itself throughout the blogosphere. Our decentralized world was evolving.
- Several years passed in this state until Twitter came along and track. Now we could Twit what we were doing and one could follow specific people’s Twits which naturally led to all sorts of conversations, both long and short.
- FriendFeed put an interesting twist on all this - enabling us to comment on ANY conversation that flowed through FriendFeed. So we’re now watching Robert Scoble and others perfect this new way of conversing - in a decentralized manner.
- Now the point comes up “who owns these conversations?“ Call them fluid - call them liquid - they’re happening completely free-for-all in the open marketplace - just like standards are SUPPOSED to evolve.
- Steve Gillmor has pointed out that the TinyURL facility has also given us independence of vendors and the ability to cannoize our conversations. His ecosystem and nervous system metaphors may confuse some - but this is a complex chess game we’re playing here - and one which needs all sorts of ’savants’ preaching to their interested choirs.
- So in that spirit I’d like to add to this discussion the notion that a standard that can represent a conversation - a persistent re-entrant conversation - can help unite all of these memes flying around the decentralized meshed web. A standard is possible and in fact is staring us in the face.
ThreadsML exists as a web site that David Weinberger has been paying for - for almost 5 years now. We created a spec for ThreadsML (back then) and we’ve been waiting for this day to arrive - convinced that it’s need would become apparent when - well what is happening today happens. And it DID happen!
The need for an open standard to unite disparate kinds of conversations, threads, IM sessions, comments, tracks, mail lists, etc. These conversations get started on all sorts of service, in many different ways.
The following is my interpretation of what ThreadsML is - apologies to anyone if I get it wrong.
Technically a ThreadsML server would be a threaded object store - which would enable anyone to contribute into any thread node and attach media to any node. A DNS-like sub-system would unite LOTS of these servers together and all sorts of glue code would convert proprietary threaded data structures and protocols into ThreadsML and back.
So ThreadsML is more than just some schema. Its a system that unites conversations and stores them as ‘memes‘.
David Recordon and the DiSO project seem to be working on something similar to ThreadsML.
The original ideas are still there - waiting for anyone to build them.
We clearly need a neutral place:
- which is owned and controlled by no one party (NEA)
- which can house these conversations
- convert them into other forms
- enable re-entrant contributions - later
- stop spam
- enable one to attach media to any node of the conversation
- and keep it all interactive and two-way!
That’s what the dream of ThreadsML was all about.
Here’s a historical artifact I created - back in 2003 - about ThreadsML:

Little did I realize that ‘near term’ meant “wait five years”.
![]()


June 22nd, 2008 at 2:01 pm
[...] che gestiscono conversazioni. Marc Canter, che era stato tra i promotori già nel 2002, propone di riprendere in mano il progetto. Credo anch’io che ci sia un grande bisogno di uno standard di questo tipo per liberare i [...]
June 22nd, 2008 at 5:21 pm
[...] Canter has revived an idea a bunch of us were talking about five years ago in a conversation instigated by David [...]
June 22nd, 2008 at 5:23 pm
Hey, Marc, great to see this post! I’ve wondered from time to time how we lost track of the threadsml discussion. I agree that the question now is, who’ll pick it up?
June 22nd, 2008 at 10:06 pm
What’s the state of ThreadsML? I’m having a hard time finding a concise wiki page, spec, blog post, etc explaining the in depth technical side of how it worked and how you might implement it.
June 23rd, 2008 at 3:02 am
[...] « ThreadsML reborn [...]
June 23rd, 2008 at 3:52 am
An idea who’s time has finally come?
In the interim, a lot of related ground has been covered by SIOC, and there’s some interesting stuff happening around debategraph:
http://sioc-project.org/
http://www.debategraph.org/
Unfortunately Trackback seems to have fallen by the wayside, but I guess we’ve now got Twitter…
June 23rd, 2008 at 3:54 am
SIOC (http://en.wikipedia.org/wiki/SIOC) seems the closest thing around to an implementation of the Threads-ML ideas. Maybe it would be worth a detailed check-list comparison of SIOC vs the goals for threads-ml, to see what is still todo?
June 23rd, 2008 at 9:23 am
All,
Yes, SIOC does cater for Threaded Discussion exposed as a Linked Data Graph (as noted by Dan already). Basically, there is work underway to bring IBIS [1] and other Discourse oriented efforts into the Web (naturally as Linked Data).
The neat thing here is that ThreadML and all of these efforts can co-exist, the transformation from ThreadML to a Linked Data oriented Discourse Space is very low cost.
Thus, whether we call it ThreadML, Discourse Web, DebateGraph etc.. Let’s make Discourse Discovery and Participation on the Web much easier than it is today.
The Blogosphere, Wikisphere, Picturescape, Boardscape etc. have always been about huge discussion spaces in my eyes
Of course, SPAM impeded the initial bootstrap, so any rival has to demonstrate robust machine level handling of the SPAM challenge.
Links:
1. http://hyperdata.org/xmlns/ibis/index.htm
Kingsley
June 23rd, 2008 at 10:44 am
[...] ThreadsML reborn [...]
June 27th, 2008 at 9:02 am
[...] Turns out SIOC is very similar to ThreadsML [...]
July 1st, 2008 at 3:54 pm
Marc,
SIOC accomodates ThreasML in a natural way via the Discussion Data Space
SIOC provides a nice vehicle for modeling data containers (Data Spaces) on the Web. At the end of the day, every container (Blog Space, Wiki Space, Bookmark Space, Photo Space) is a host of items, and each item can be of one or more types e.g Posts, Wikiwords, Bookmarks, Photos etc (basically the way we modelling everyting in the real world mentally).
FOAF not only covers our Social Networks, it also covers other “Objects of Sociality” via enable connectiion via the things we’ve made and exposed via the Web (e.g Blog Posts, Shared Bookmarks, Wikiwords, Photos etc..).
The Bibliographic Ontology, add additional granulaity to Document modelling and when used in conjunction with RDFa, you end up with a real power in relation to describing what existing web pages are about in structured form.
As you can see, shared vocabularies / ontologies / schemas are gettting clearer now that “Linked Data” is providing a mechanism for demonstrating practical utility, which in of itself provides better demonstration of the larger Semantic Web vision.
Links:
1. http://bibliontology.com/ - Bibliographic Ontology
2. http://virtuoso.openlinksw.com/presentations/DataPortability_and_DataSpaces/DataPortability_and_DataSpaces.html - Example of RDFa and Bibliographic Ontology in action (*best seen via a Linked Data aware user agent such as the plugin below*)
3. http://virtuoso-images.s3.amazonaws.com/rdfb.xpi - a Firefox 2.x or 3.x plugin that exposes an new option to the Web Browsing in the form of a new “View | Linked Data Sources” main / context menu option to existing “View | Page Source” and “View | Page Info” options
Kingsley Idehen