Author Archives: vhk10

Delicious shoots itself in the foot

I’m a great fan of social bookmarking and have a large (1300+) collection of bookmarks on Delicious, all with some text copied from the page or my own annotation.  Delicious has its weak points; my personal pet hate was searching, for which you had to submit your search term exactly. Even then it didn’t always find a bookmark you knew was there.

It seems to be the season for revamps. Today, soon after Facebook’s much-derided effort, and in the wake of being purchased by YouTube’s founders, Delicious unveiled its new look.  It’s hard to see how this is any sort of improvement.  It is now impossible to search your own bookmarks. Since only about half my tags are now visible, it’s not really clear how I navigate my own bookmarks at all.  ‘Managing your network’ lists ‘profiles that follow you’, except that they aren’t – they are the ones I’m following.  There’s no obvious way to follow anyone else, or indeed to find out who is following me.   I don’t get shown my own tags when I tag a new bookmark, so I can’t tell if I’m creating a new tag category unnecessarily. It is now only possible to see 10 bookmarks at a time. You can only see a selection of comments made by other users for the same page.  And so on.  They are very proud of their new ‘stacking’ facility (‘Join now and get stacking’ exhorts the home page), which allows you to collect bookmarks together, but I want my tag collection back!

The cynic in me wonders whether the introduction of stacks and the difficulty in managing one’s own personal bookmark collection are a less-than-subtle way of pushing YouTube content (or advertisers’ websites) at the user.  This could be done, for example via the ‘Featured Stacks’ on the homepage (current themes of these include surfing, doughnuts, dog costumes, Las Vegas showgirls and ‘nutrition tips’. I don’t want any of these! I want my tags and I want to be able to search my own bookmarks!)

Fortunately, I copied my bookmarks over to Diigo earlier this year, so I have access to most of them. I have exported my current collection to Diigo too, but have been warned there will be a delay before they are online: ‘in the last day or so, tens of thousands of users have entrusted us with millions of precious bookmarks collected over the years… ‘.  Now I wonder why that is! (Having said that, they did upload my bookmarks within a few minutes).

Save Floppy!

This is in fact the title of a book in the Oxford Reading Tree (Floppy is a dog).  But while the floppy disk itself is all but obsolete, it is still associated with the act of saving by its use as an icon.

Today I went to a course on our TopDesk incident management software, and among a battery of icons on the screen was a picture of a floppy disk one had to click in order to save changes.  I wonder whether this association of imagery will long outlive the floppy disk itself – rather as old-fashioned phones with a dial and a big handset still symbolise ‘telephone’, and the warning for an unguarded level crossing is a sign depicting a steam train?

Pure demonstration

I attended the following demonstration:

‘ … an opportunity to view the potential ‘research information system’ for [Bristol] University…. The Danish company Atira will be demonstrating their product ‘Pure’.’

Because of flaky wi-fi at the venue, we weren’t able to have the full demo, and in particular the Research Excellence Framework couldn’t be demonstrated.  This system is intended to replace IRIS as a place for collecting details of research outputs; it can also handle personal profiles, details of projects and funding, and include a repository of full-text articles.

It seemed to work nicely, and the interface for merging duplicate records from standard databases (allowing you to choose the most detailed version of each field) was impressive.  But I couldn’t help wondering how well our current practices would transfer to this system.  For example, how can salary details within projects be kept confidential? Who can see personal details in your profile?

The most interesting aspect to me was the potential replacement of the University’s ROSE repository (based on DSpace) though the demo only touched on this in passing.  This looked rather like an add-on to the bibliographic storage system and it wasn’t clear how much control there is over metadata, or whether a licence can be stored alongside the repository items, as with DSpace.  On the other hand, there is some advantage in integrating a repository with the database of information about research outputs, as it is rather opaque who is behind ROSE.  (Someone at another institution recently thought it might be me, on the grounds that I’d posted to a DSpace forum!)

Linked Data and Libraries 2011

July 14 2011

British Library, London

Building on a similar but smaller event last year which I didn’t attend, this attracted about 120 delegates, including a number from Germany, the Netherlands and Scandinavia. (It would have been good to have had a list of delegates as I’m sure there were people known to me by name but not by sight).  I  think that most were from libraries rather than IT departments, and from the academic sector.

After a brief keynote address by Dame Lynne Brindley from the British Library, Richard Wallis of Talis talked about how the use of Linked Data is spreading and why libraries are a natural place to use it: essentially because they are long practised at adhering to standards and at describing their holdings.  Adrian Stevenson of UKOLN reported on a LOD-LAM summit which dealt with such matters as vocabularies and provenance of metadata (which were recurring themes).  Phil John of Capita introduced Prism, which has been developed for discovering library resources at the University of Winchester.

Among the most interesting talks were the ‘lightning talks’ which preceded and followed lunch, and which were focussed on practical issues with Linked Data.  Carsten Kessler described how the University of Münster links information on courses, buildings, research databases and bus routes!  Jerry Persons from Stanford offered a report on a recent Linked Data workshop there which raised another recurrent theme: the tension between getting data/metadata exactly right and getting a working service up and running.  He leant towards the second of these, saying ‘scruffy works’ and ‘build for the way the world is’.

Antoine Isaac described the work of the W3C Library Linked Data Group on standards relating to Linked Data.  Neil Wilson of the British Library talked about the linked data version of the British National Bibliography, and in particular the process of converting MARC records to RDF.  The final talk, provocatively entitled ‘The Record is Dead’, was by Rob Styles of Talis, who explained how traditional cataloguing methods fail to match the way people talk and think, and lose valuable detail.  ‘Records don’t have relationships’.  While this was less technically focussed than some of the other talks, it pointed the way to possible uses for Linked Data and was an appropriate lead into the closing summary from Richard Wallis.

Overall, I felt that the day gave a useful view of the current state of play with regard to the use of Linked Data in the library world.  Phil John made the point that traditional OPACs don’t deal well with special collections, an observation relevant to some of ILRT’s current work.  I also learnt about how librarians talk when they’re together (there were several MARC-related jokes that went over my head!)  I was however left with the impression that while everyone present agreed that Linked Data was a good thing, we are all still feeling around for ways to use it.

Presentations from Linked Data and Libraries 2011

 

Puzzles, e-cards and Rover – engaging the user

(Another post dating from 2010, copied from the ‘Coffee with ILRT’ blog)

ILRT has developed a web site, Hidden Lives Revealed, for the Children’s Society, with information about the Society’s work and its children’s homes in the late 19th and early 20th centuries. The site was originally funded by the National Lottery and is intended to appeal to a range of ages and to be used as a teaching resource in schools. With this in mind, it has always included a range of ‘activities’ to engage, amuse and inform visitors to the site. Which of these have been most successful and why?

Pictorial puzzle: a rebus with names of 11 British towns to work  out.

A pictorial puzzle: name the towns

By far the most successful activity, in terms of numbers of visits, is the Virtual Children’s Home – a ground plan of two floors of an imaginary but typical home run by the Society. Clicking on a room on the plan brings up information about what that room would have been used for and some photographs. This activity is part of the teaching role of the site and not purely recreational. It is also the fifth highest result of a Google search on ‘children’s home’, which generates the vast majority of visits. The low ‘bounce rate’ shows that, having come across the page, most visitors stay for a look round the virtual home rather than moving away from the site at once.

Another popular activity is a page of interactive pictorial puzzles, taken from old editions of the Children’s Society’s magazines. It turns out that this page is currently the top result of a Google search on the phrase ‘Pictorial Puzzles’! But again the majority of visitors didn’t leave the page at once. A similar page of crossword puzzles is much less visited.

Less used are the section about the ‘Rover League’ (a club for children which appeared in one of the Society’s magazines), enhanced with barking noises, and the downloadable screensavers consisting of collections of photographs from the site. It is impossible to tell how much the screensavers are actually in use, although there do seem to be a couple of downloads a month by ‘real’ users. It’s probable that few people now lift screensavers from websites, preferring either to use what comes already installed on their machine or to customise their own.

an example of an e-card

An example of an e-card

Finally, many of the archive photographs on the site may be sent as an ‘E-card’. The user adds a message and details of the recipient, who is then contacted and told the URL where the message and photograph are waiting to be viewed. E-cards have proved steadily popular; on average three or four are sent in a week. A spot check shows that some are sent to the sender, possibly as a quick way of ‘bookmarking’ an interesting image, while others are sent to or by people with a family connexion to one of the homes. The e-card user is likely already to have spent some time browsing the site.

What do we conclude from this? It seems there are three things which are likely to make an activity popular: conveying interesting information, allowing the user to append their own content and a high position in Google’s search rankings!

(I have used the Web server logs for the site and Google Analytics to assist in the preparation of this article).

Useful links

That most secretive of animals, your website audience

Unicode for the web

(expanded version of old article from Coffee with ILRT blog, which I’ve given a new home)

The bad old days

Anyone who remembers the Web in the 1990’s may also remember the limited range of characters which could be used in text displayed on the Web for writing languages other than English. Some characters with accents (and other diacritical marks such as cedillas, tildes and umlauts) for French, German, Italian, Spanish and Scandinavian languages were available, but there was no simple way of displaying any others.

For the benefit of the computer, each possible character had a numeric equivalent, known as an ASCII code. Plain unadorned letters for writing English with, together with numbers and punctuation, were a universal standard, covered by the range of ASCII codes from 0-127. Codes in the ASCII range 128-255 were reserved for the simple accents mentioned in the previous paragraph. So the HTML á – using the ASCII code 225 – produces the character a; the more memorable á is an alternative way to do the same thing.

However, you didn’t have to go far to find characters in use which could not be easily rendered this way on a Web page. There was no standard way of producing Welsh ŵ and ŷ, for example. Nor was there simple way of making text available in non-Roman scripts such as those required to write Greek, Russian, Hebrew, Arabic, Hindi, Mandarin and many other languages.

It was possible to have a local variant of ASCII codes 128-255 installed in order to display a particular language, at the cost of making the commoner accented characters described above illegible. It was also still impossible to display characters from different non-Roman scripts on the same page. (For example, biblical scholars could not display Hebrew and Greek side by side). It was not unusual for text in non-Roman scripts to be put on Web pages as a scanned image of a printed text.

Unicode arrives

This situation was clearly untenable and the Unicode standard was developed as a universally-agreed way of representing all scripts in use. Unicode includes the ASCII codes mentioned above, but expands the range of codes way beyond ASCII, more than enough to accommodate all characters in all scripts in use. Each possible character or ‘glyph’ in any script (including those with accents etc.) is given a number (there are now over 100,000 of these). Their position in Unicode is known as a ‘Unicode point’. Consecutive Unicode points are grouped into ‘ranges’, defining all the characters needed for a particular script.

A look at the list of Unicode character ranges (e.g. http://www.alanwood.net/unicode/#links) illustrates the diversity of human writing systems, ancient and modern: everything from Babylonian cuneiform to the many scripts in current use in India can be found there. Some ranges are devoted to symbols, such as musical and mathematical notation, and even representations of I Ching hexagrams, dominoes and mah-jongg tiles! (You may notice that the code is given in both decimal (base 10) and its hex (base 16) equivalent; decimal is the one you will need
when writing for the Web).

But I still can’t read it!

If you look at some of the Unicode resource pages, chances are that some characters won’t be interpreted correctly on your screen. E.g.: ? (for those of you who don’t have a Mongolian font installed.) They will probably appear as boxes (in Mozilla Firefox, the box will contain letters and numbers which are the hex code for that Unicode point). This means that your system doesn’t support those particular Unicode ranges. Windows offers support for some additional languages; go to the Control Panel, choose ‘Regional and Language Options’ and you can install support for ‘complex script and right-to-left’ languages (Arabic, Armenian, Georgian, Hebrew, the Indic languages, Thai and Vietnamese) and East Asian languages (Chinese/Japanese/Korean).

You will also of course need to have an appropriate font available. The languages just mentioned are included in the Unicode versions of standard fonts such as Arial, but if you still can’t read the script you may need to install a font that includes it. A list of some from non-commercial sources can be found at http://www.alanwood.net/unicode/fonts.html.

Writing for the Web with Unicode

Odd words or single characters can be inserted on to a Web page without the need for such software. For individual characters, the HTML is “&#” followed by the decimal version of the number in Unicode, followed by “;”. Several examples of this have already appeared in this article. So the Welsh characters mentioned above can be written as ŵ and ŷ. For more extended typing, Unicode keyboards are available for some of the commoner scripts. For example, Microsoft Global allows you to choose a language in an Office application and then type away in Chinese, Korean or Japanese.

Remember though that a word in non-Roman characters may not be legible to your readers if they don’t have an appropriate font and consider whether it might be better to (for example) give a transliteration of the word into Roman characters in addition or instead of it, or omit it altogether. It is also important to specify in the head part of the page that UTF-8 (the now universally understood Unicode standard) is what you are using, with the following tag: <meta http-equiv=”Content-Type” content=”text/html; charset=UTF-8;”>. At ILRT we work to this standard, essential in a world where the Internet crosses all linguistic borders.

Some further reading:

What can social bookmarking do for you?

(reposted from ‘Coffee with ILRT’)

What is social bookmarking and why would I use it?

Social bookmarking is a way of creating and collecting bookmarks on a website rather than within your browser. This means you can access your bookmarks from anywhere, classify and annotate them, and share them with other people. You can also see how popular your website is and what others are saying about it.

The most popular social bookmarking site is Delicious, which is the focus of this article. Social bookmarking sites tend to be coy about how many users they have, but Delicious passed the two million mark in 2007.

The basics

At the heart of Delicious is a database of URIs (web addresses) which have been bookmarked by users. You can search and browse this even if you are not a registered user of Delicious yourself. When you join Delicious and start bookmarking pages within it, the information you supply is added to this database, as well as being included in your own personal collection of bookmarks.

To get started, you set up a user account with a username of your choice and a password. You can start to create your collection of bookmarks from scratch, or you can populate your collection of bookmarks by importing existing bookmarks from your browser into your Delicious account. (Later, you can also export your Delicious bookmark collection to a file as a backup).

You can bookmark a page by following Delicious’ ‘Save a new bookmark’ link, or install a Delicious plugin on the toolbar of your browser which will bookmark the current Web page in the browser when you click on it. When you bookmark a page, you are invited to tag the bookmark with one or more tags of your own choosing, and to annotate it with a note. These notes can simply be some descriptive text taken from the site, or you can write your own. Many Delicious users don’t bother to write notes, but they can be very helpful when you later search or review your bookmarks. You may want to make some bookmarks private so that they are not visible to the wider world: for example, ‘work in progress’ on your current projects.

Configuring your bookmarks

The visual interface of Delicious is much more informative and configurable than the way bookmarks are displayed in browser menus. You can see at a glance how many bookmarks there are for each tag in your collection and group several related tags together into a cluster. You can also display your tags as a ‘cloud’. This method of bookmarking keeps bookmarks sorted by categories which mean something to you, but doesn’t divide them inflexibly into a hierarchy of folders. Unexpected connexions are free to come to the surface and rigid boundaries (such as between work and non-work) can be broken down.

screenshot of Delicious illustrating bookmarked web sites and annotations

An example of a page of annotated bookmarks on Delicious

As I mentioned before, there are buttons, bookmarklets and toolbars which can added to your browser so that you can bookmark with one click, and a variety of tools which will allow you to link your activity on Delicious to Facebook, Twitter, and other social networking sites.

The social side

So far this may not sound very ‘social’.  However, the combined public bookmark collections of Delicious users build up into a large collection of annotated Web pages, which you can consult even if you haven’t registered as a Delicious user. Once you register and have a Delicious username, you become part of a ‘network’ where you can follow other users whose bookmarks interest you, or they can in turn follow you. It’s rather like the way users can follow one another in Twitter. In particular, you can share new bookmarks with a group of friends or colleagues who also use Delicious, as several of us do at ILRT.

As well as organising your own bookmarks, social bookmarking can also be useful for getting feedback about websites which you run. If you are responsible for a public website, it may be worth taking a look to see which pages on it have been bookmarked on Delicious, how many times they have been bookmarked, and how Delicious users have tagged and annotated them.

Some other social bookmarking sites

The main alternative to Delicious is Simpy. This has some features in common with social networking sites such as Facebook: you can write notes and join groups of like-minded people to share bookmarks with them. It offers better and more flexible searching than Delicious; however, it has a much smaller number of users and so is less useful as a knowledge base. It is also more vulnerable to spam than Delicious is. StumbleUpon allows you to tag and make a collection of your favourite places on the Web, but its main purpose is to create a database of sites which have been recommended by its users, which it can in turn recommend to others interested in the same topics; Digg performs a similar function for news items.

Explore the bookmark collections of some members of the ILRT’s Internet Development team:

Ben Hayes – Web Designer – http://delicious.com/thesheep
Kieren Pitts – Senior Analyst/Programmer – http://delicious.com/kierenpitts
Virginia Knight – Senior Technical Researcher – http://delicious.com/cmvhk
Matt Baker – UNIX Systems Administrator – http://delicious.com/never147