Itâs Tag Tuesday, weâre sitting in the shadow of the bay bridge after leaving Gordon Bierch for less noisy venues (i.e. some stone steps on the Embracaderro). Kevin Marks from Technorati is talking about tagging, imagine that! The following is an attempt at summarizing what followed, it wasnât easy catching everything that was going on, so give me a break, eh? Iâll try to note who said what, please drop me a note or a comment with any corrections or additions.
Kevin Marks:
Technorati tags are distributed unlike other peopleâs (delicious, flickr, etc.) the advantage is that the user has better control.
Initial experiment: New Yearâs resolution. Failed because people linked to pages they thought were cool instead of using actual resolutions. New version (using rel=â?tagâ?) is more explicit.
Why tags in blogs fail: people get too clever. Make special database fields for tags and Technorati cannot connect it to the actual blog post. Embed in HTML, keep it simple and it all works.
Questions:
Matt Mullenweg: How many tags come from categories and how many explicit?
Kevin: Most Technorati tags still come from categories not from people actually tagging posts.
Next stepped up Stewart Butterfield (from flickr/Yahoo):
Sort by date is not as good unless big events show up (when it becomes very timely). flickr is coming up with a smarter sorting algorithm to determine how interesting a photo is, trying to show âgoodâ? photos. (Note: there are people jogging on next to us, very amusing.) It takes time to get ranking this way. Timeliness vs. quality becomes an issue.
Tag spam is an issue. Gets easier when you own the entire system (gives you more info) rather than someone like yahoo or Technorati.
Kevin: we still get good information and can do quite a lot of filtering.
Kevin: Spammer meta-meta data might be useful. Gives you information on relation between tags, example: a person, who was spamming with real-estate information, had enough posts to create a relation between âSan Joseâ and âSan Franciscoâ tags.
Both Technorati and flickr are planning a future for geo-tagging.
Flickr is planning a separate table for geo-tagging information, this enables special handling for geographic information, allowing them to calculate distances, etc.
Tantek: Geo tagging is hard. Using coordinates ended up in a âmirrorâ? of the US superimposed over Mongolia (because people confuse longitude and latitude).
Stewart: This becomes an easier problem when you have the resources of Yahoo to solve the problem. Itâs possible to use landmarks, roads, etc. to let people find the right location.
Ryan King: Technorati’s developersâ page has a page on geo-tagging (updated. thank you Tantek and Ryan for the URL). GPS info has little relevance as it doesnât include scale and radius of interest. Thereâs an artificial precision to latitude and longitude information because most people use a simple system to get coordinates of the city theyâre in which gives a location in the middle of the city.
We need a geo-tag thatâs one click verifiable.
A microformat, hopefully, is coming.
Evo from Jet-Eye: do you expect that people will start using auto-tagging, thereby flooding the Folksonomy with popular information?
Stewart: Thereâs always tension between free form and auto-tagging. Flickr supports auto-tagging for photos sent by email (popular example is âcameraphoneâ?). Basically, any info on a photo is better than what we have right now.
Statistics:
Kevin: Technorati has 1.2 million unique tags out of 40 million tags. But these numbers are influenced by Technoratiâs use of blog categories.
Stewart: flickr has 0.5 million unique tags out of 40 million tags. Numbers get better if you look at tags that are repeated (Note: maybe because this eliminates spelling mistakes and personal Taxonomies?)
Stewart: A common mistake, repeated by flickr was using space separated tags. Bay to breakers ended up creating âbayâ? and âbreakersâ? tags. Comma separated is more familiar to most people and we can be even smarter than that!
Final note: Thereâs apparently a 7.0 earth quake some 300 miles away from the California coast and we should all go home or face the wrath of the ensuing tsunami. A small part of the group reconvenes at a private residence (names and addresses withheld to protect the innocent) where I promptly ignore everyone and work on this here blog post.
Update: Kevin has his presentation up.
Update2: Scott Beale posted more pictures.