Archive for the “RDF” Category

I’ve mentioned a few initiatives and projects related to the development of Web 3.0/Semantic Web on this blog where librarians are, or could be, involved. But the big question is, how? Finding out how to get a seat at the table can sometimes be difficult. Here’s a few ways to get involved no matter your level of expertise or location:

Data Portability: Get Involved

From mailing lists to action groups, the Data Portability Project has a number of ways that you can keep up to date with news, share advice and ideas, and participate in high-level technical discussions about the project.

FOAF Developer Center

Participate on the wiki, mailing list or the busy IRC channels for the Friend of a Friend (FOAF) specification project.

Open Knowledge Foundation: Participate

Sign up to the announcements email list, or give your time to volunteering to one of OKF’s many interesting projects.

Linking Open Data Community Project

Mailing lists, meetings, projects, and lists of people interested in Open Data.

Upgrade your sites

Help spread the use of microformats, OpenID, RDF and other new standards and formats by including them in your plans to upgrade your website. The websites mentioned all have guides to getting started. For example, Microformats outlines a five-minute guide to adding your first microformat on your site. Keep an eye on DiSo (Distributed Social Networking applications), which is building plugins using these tools and others for Wordpress, Drupal, and other widely used platforms. There’s some work for me to do in adding to this website!

Comments 3 Comments »

Web sites and applications burst on the scene out of nowhere, attract massive usage and undergo continual improvements to make them better. We wonder how we ever got along without them, until they get bought out, put up access or paywalls, or just disappear.

Libraries have long been concerned with preserving information for the future, and increasingly that includes digital information and websites (for example, Pandora at the National Library of Australia which archives everything from blogs to the 2000 Games site).

So where do they intersect? And how can we take a more proactive approach to design for sustainability rather than saving retrospectively? The Semantic Web is all about linking, openness, and relationships between data. In some ways the Semantic Web is, in my view, how we will move towards a more Sustainable Web.

What might the Sustainable Web be?

Adapting the Triple Bottom Line approach to sustainability, web developers and those who create data could take a lifecycle approach to how they create, manage and produce sites and information. When planning a new website, dataset or service, in addition to deciding on purpose, standards and features, you could also include a statement about how you would -

  • Distribute the data if you were no longer maintaining the site (using a LOCKSS principle, perhaps?)
  • Migrate to future standards
  • Ensure that your site is indexed in the Internet Archive (all pages and data, not just the index)
  • Give people ownership of their data (if you’re running an online service where people store or save information) so they can get it out when they want, or own it if the site closes or the terms of service changes significantly (eg, in the instance of a buyout).

Depending on what type of site it is, there may be governance and political impacts now or in the future. If you’re running a scientific research portal, how might changes in government policy affect the site? What obligations might be imposed on sharing or accessing the data you provide?

Using open standards as the backbone

A starting point is to use open standards. In addition to W3C standards most of us already know (like HTML and CSS), we can extend this to Semantic Web standards like OWL and RDF. Adherance to standards allows information to be interpreted correctly, exchanged, and migrated to newer standards in the future. Standards may also make it easier to hand datsets over to someone else or distribute copies to keep it accessible. It’s a key part of understanding the potential of the Semantic Web according this summary of a talk by Nova Spivack at last week’s The Next Web -

“The semantic web is not so much about “semantics” as it is set of open standards defined at W3C. The semantic web approach builds on open standard meta data which is in line with previous presentations that supported the open data approach. The idea is that everyone profits from everyone’s metadata. The semantic web is a compromise in making the data smarter and the software smarter. It is the best of both worlds.”

Keeping data usable

Over the past two years, libraries, museums, companies and other organisations have set up pages in Facebook, mySpace and other social networking sites. In some libraries, this is the work of an emerging technologies specialist, in others it’s an added role for an individual that may or not be sustained if that person leaves or changes job focus.

Whatever the situation, it’s not the best use of time to have to create a new profile and create networks in every service. This is where a move towards data standards and portability is a plus. Being able to move data between and in/out of these services saves time and sustains online networks and communities. Data Portability is one of the major projects looking at these issues. According to Chris Saad from the project, “The new innovation platform is data” and this is certainly true if looking at things from a Semantic Web point of view.

Libraries and the sustainable web

A recent article in Interactions stresses the importance of designing for sustainability of content on the web - the authors note that libraries and other cultural insitutitions will be at the heart of these efforts,

“Digital technology makes it possible to extend the walls of the archive beyond a single space or person, as well as ensure preservation and acccess in locations around the world [...] Libraries, museums, and archives will need to collaborate with business interests to build lasting social structures that are sustainable over time.” (Churchill E, Ubois J, 2008)

Libraries have played a significant role in participating in a variety of digital and web preservation projects over the years, but what’s the next step? How do we get more involved in conversations that take place in business?

———–
Churchill, E, Ubois J. 2008. Designing for Digital Archives. Interactions. March/April 2008. Retrieved from: http://interactions.acm.org/content/?p=1089 (full text via ACM Portal)

Comments 3 Comments »

Read/Write Web reports that Yahoo will begin indexing RDF and a range of Microformats. This is big news. It’s the first big sign that established, large players in search are including the Semantic Web. The Semantic Web has been under development for years, and now we are starting to get to the top of the mountain and see over it to where we can go with all these new standards and tools.

Yahoo has outlined what they will be supporting initially, and it’s a pretty big list -

In the coming weeks, we’ll be releasing more detailed specifications that will describe our support of semantic web standards. Initially, we plan to support a number of microformats, including hCard, hCalendar, hReview, hAtom, and XFN. Yahoo! Search will work with the web community to evolve the vocabulary framework for embedding structured data. For starters, we plan to support vocabulary components from Dublin Core, Creative Commons, FOAF, GeoRSS, MediaRSS, and others based on feedback. And, we will support RDFa and eRDF markup to embed these into existing HTML pages. Finally, we are announcing support for the OpenSearch specification, with extensions for structured queries to deep web data sources.

Yahoo has had some support for this already (check the review of Operator below) , but perhaps this will be just the push many website developers need to include Microformats in their sites.

Microformats?

They are a structured way to make use of open content on the Internet. If you would like to see what Microformats have the potential to do, check out the Microformats site. A practical way to try this out is to install the Operator plugin for Firefox. When enabled, if you browse a site that makes use of, for example, hCard, you can right click and download the data or use the toolbar to perform lookups on other sites. Having used Operator, in some ways Yahoo’s announcement about their support for the Semantic Web and Microformats is not such a surprise, as many of the sites mentioned below are Yahoo or Yahoo-owned.

Using Operator

Here I am on Twitter - using Operator I can export and save my hCard contact information, bookmark, or, here’s the really nifty part, add it to my contacts (if I was using Yahoo! Contacts, that is).

Operator for Firefox, in Twitter

If I’m in Upcoming and see something I want to go to, I can add it to my Google or Yahoo calendar. Here’s an event I’m actually going to :) -

Upcoming with Operator

With a click it’s in my Google Calendar -

Google calendar with Operator

So why is this cool? I didn’t have to type a thing, or run a search. With Microformats I can extract information out of webpages, find related stuff, save details for later, in a really easy and time saving way.

How might libraries make use of it? How about quick downloading of records and citations out of catalogues and databases into other sites, an article or even into your mobile phone without having to go through painful export options? One click to store? I’d like to see that.

Tags: , ,

Comments 2 Comments »

WikiProfessional

Via the RIN blog, Jan Velterop recently of Springer is heading to KnewCo, which is behind WikiProfessional. WikiProfessional is designed for researchers and scientists to interact using a database and wiki with a Semantic Web foundation -

The core functionality of this workspace is a newly invented “Semantic-Wiki”. The Semantic-Wiki connects the new relational MediaWiki software (Wiki editing capability in a relational database structure) to the computational text analysis technology, called the Knowlet™

They’re starting out with a large dataset from Medline, which is a great idea to avoid the problem of having no examples to work in new websites.

To get started, you create a desktop, where you can view Knowlet and Expertise workspaces. This is really getting to the trust aspect of the Semantic Web. One thing that is slightly disconcerting is being confronted with another self in the expertise space - my name is a fairly common one, and the system tries to find me in Medline.

WikiProfessional

If I was one of the authors listed, I could select articles and identify them with me. Other traditional databases have tried to do this, such as CSA’s Scholar’s Universe. Now if WikiProfessional makes this type of thing portable and linkable to other IDs, such as OpenID, then I think it would have a lot of value.

No doubt there will be many more features added to WikiProfessional as it makes the move out of Alpha, so it’s worth keeping an eye on.

Calais

From Reuters, Calais is all about making data richer by semantically tagging it. There’s a bit of questioning as to what’s in it for Reuters, but I can imagine that for journalists it would make sorting and identifying important news stories and other content easier.

If you want to try it for yourself and skip the API signup check out Calais Text Tagger. Feed it some text and it will give you tagged RDF in return. I fed it the last post on the blog, and got back:

Continent: Africa
IndustryTerm: so many services, web projects, social web, energy
Company: CNN, Google
Person: Ethan Zuckerman
Technology: mobile phones, PDF, SMS

Comments No Comments »

The W3C recently took the wraps off a new query language for the Semantic Web, SPARQL. According to Computer World, this should give the Semantic Web a big push forward. According to the article,

SPARQL is designed to be used at the scale of the Web to support queries over distributed data sources independent of format. It also can be used for mashing up Web 2.0 data.

Semantic Web applications are starting to surface as well, with the creation of Twine (in invite-only beta - if you have one let me know) and DBpedia. The emergence of end-user apps, and not just languages and specifications is a clear sign that the pace of development is beginning to accelerate.

Comments No Comments »