After a brief hiatus last week as Trent and Steve were otherwise indisposed, the DataPortability: In-Motion Podcast is back at half strength. Steve is still MIA, but joining Trent in the virtual studio is Bob Ngu, Founder of Jiggyme.com, a video aggregation startup that is beginning to focus specifically on technology videos.
Bob has been an active contributor to the DataPortability Project since March, and was highlighted in the project’s May report. The spotlight was shined on his DataPortability: In the Wild blog series. In this series, Bob outlines his discussions with various people involved with data portability. Among the areas he’s covered so far include:
There is a lot of focus in the DataPortability Project about making it easier to access user data. Another aspect to data portability, in general, is an analogous set of activities around enabling other data on the web to be more machine accessible. A few groups have been approaching this issue in various ways, many of whom work under the umbrella of the Semantic Web community. One subset of people focusing their efforts on this are taking what they call the Linked Data approach.
At a recent Cambridge SemWeb Gathering at MIT, Kingsley Idehen, CEO of OpenLink Software and a founder of the DBPedia project, had a great term for where he sees himself within the greater context of people working on these issues:
I like to say that I belong to the Semantic Web Community, but I’m a member of the Linked Data Tribe.
I found this concept of a tiered relationship and allegiance illuminating. Talking about it with him, he makes a distinction between the community as a whole and the fact that he focuses on a specific set of actionable efforts. It has been this sense of “what can be done right now” that has helped build upon what others are doing to move toward the goals of the community as a whole.
For example, I recently discussed how microformat markup could benefit the Semantic Web with Danny Ayers, an RDF/SemWeb guru working for Talis. Similarly, Ivan Herman gave a talk at the gathering about how to leverage RDFa within the context of an existing XHTML web page. Both examples are stepping stones in the direction of truly portable data on the web, and something that Kingsley considers the “data substrate” upon which Linked Data representations can be built.
To that end, I’m on a mini crusade to encourage developers to take the extra few minutes required to consider how their display layers can expose their content with effective markup. Rather than everyone having to learn OWL, RDF, and SPARQL before any progress can be made, there are some simple steps that will catalyze further steps. It’s really not that hard, and even if you’re not a developer you can mark up your own blogs and pages with microformats to provide search engines with much-needed context to describe your content.
NOTE:I’m purposefully not diving too deep here into the real “meat” of Linked Data. Instead, I hope you’ll spend a couple clicks checking out the simplicity of what can be done to help build the “data substrate”.
I’ve struck up a conversation with Yihong Ding about taking small steps toward a more effective Semantic Web and an idea popped into my head that seemed worth jotting down. In order to get folks moving in the right direction perhaps something small like a reader of microformats… we could call it a “MicroReader” … might be interesting. It could be something that sits in your browser (eg. a Firefox add-on) and does nothing more than display a list of detected microformat tags in the page being read.
While not a lot of folks are leveraging microformats (from what I can tell), it might be a way to increase awareness of their utility. It’s possible, for example, that it’d be a poor man’s way to encapsulate some of the salient points the author wanted to convey. We all know people don’t read every word of an article, but rather glance at bullet points, captions, charts, etc. This might be just the ticket to easily display the “bullet points” of a post.
For example, if the MicroReader detected a known format (eg. “hreview“) it could automatically generate a summary panel with the information. With only a few sites actively leveraging the format now, it’d be of little value, but if Yahoo Tech is using them, could mom-n-pop be around the corner?
I’m not really sure if that gets us closer to a semantifying the web, but it’d be kinda’ neat.
If we’re all moving toward a more connected set of tools for communication with hopes of a better Web 3.0, how’re we gonna’ get there? Getting everyone to agree on a single standard seems like a pipedream, but what can we do in the meantime? From what I can tell, it seems relatively easy to chat up the concept of Microformats.
I bumped into this post from Tom Johnson which seemed to sum it up well:
The idea of microformats and the semantic web sound cool. And I’m looking forward to the day when microformats are widely adopted. But if microformats are so useful, why hasn’t Google come out with a microformats search yet? Why aren’t microformats being baked into the core structure of WordPress and other blogging platforms?
Not many people are using the structured blogging plugins, and those that do use it mainly to autoformat their posts. I even heard in a recent interview with Matt Mullenweg, the WordPress lead, that there are no current plans to develop structured blogging microformats into the WordPress code.
Oddly enough, Jason Kolb made a similar comment in a recent post:
The only technology that would really be necessary to make this work is to embed microformats in site text itself. I’m really not sure why this hasn’t taken off yet, it seems like a no-brainer to me. What I’m talking about, and I’ve actually posted some working examples of this before, is to surround chunks of text from a weblog post or text published to a public site with microformat markup so that it can be extracted as meaningful data.
It seems like a simple enough first step toward the semantic web thing. Like these two cats, I’m relatively surprised microformatting hasn’t been embraced, but I do believe the value chain still seems to be missing a couple links. There probably need to be a couple of successes (like a popular microformat tagging/retrieval tool) before the masses jump on board.
For my part in this digital village, I’m going to actively explore more microformatting opportunities. More if it develops.