More Access is Better

August 19th, 2008 by Roy

One of my RLG colleagues today brought us a question from an institution that was considering their options for what to do with a large mass of digitized content they were planning to create. The question was basically this: would they be better off just making it accessible themselves and letting the search engines guide people there, or join up with a large aggregation such as the World Digital Library?

This is certainly a good question, and one worth considering carefully, since it is a foundational question that has numerous ramifications down the road. But all things being equal (and they aren’t necessarily so stay tuned for more on this), more access is better.

That is, I would neither put all of my eggs in a “local only” basket nor in a “one big aggregation” basket, but both if at all possible. That is, retain control over your own stuff, but also syndicate it out to other places such as the World Digital Library, if that floats your boat, and other places that make sense as well. The one sticking point here is that you will need to know what is required to play well with those other locations and factor that into your planning. So if I were them, I would find out what the World Digital Library would want from me, as I would with any other aggregator I wanted to play with. Then I would do something locally that allows me to easily spin out the various versions required. I think this model provides the most flexibility and sustainability going forward than relying on any single solution, no matter what it is. Plus I would get the added benefit of being in many places at once.

So as I alluded to above, all things are often not equal, and here are some of the differentiating factors. It will likely take additional effort to make your metadata and/or content comply with the needs of aggregators. Depending on your local situation, this could require a significant investment (although I would argue that if it does you were probably planning on doing something locally that is not as flexible as it should be). Another is that if the aggregator, such as the World Digital Library, wishes to host the content as well (and not just the metadata), then you will have split usage statistics. But before deciding you can’t handle being in more places at once I would urge careful consideration of the benefits and drawbacks. The easier it is for people to find your content the more it will be valued. People can’t appreciate stuff they can’t even find.

Related posts:
  • None Found

Mid August news round up

August 15th, 2008 by Merrilee

Because it’s Friday (and because I have a cold!), this is just a round up of bits I’ve been meaning to blog about. They are piling up, and I figure it’s better to get out even a little bit on each, rather than try to find the time to blog about each one in depth.

Jennifer and I did our webinar yesterday (Assessing the impact of special collections) — I blogged about this earlier this month. I wanted to let you know that our slides are in Slideshare. The webinar itself will be posted later in the month, after some vacations. I’ll have more to say about the discussion, and the results of the poll later. For those of you who took the poll, thanks very much!

There was an interesting story on NPR about reCAPTCHA. Last summer, I blogged about our use of reCAPTCHA for validating the comments on this blog. Over time, these small efforts have added up. Something like 1.3 billion word, which adds up to enough text to fill up more than 17,600 books. Beneficiaries have been the New York Times and the Internet Archive / Open Content Alliance. So comment away — you are helping to do great things.

Finally, I was sad to hear that the Party Copyright Blog was shutting down — but even more disturbed that it subsequently vanished. Fortunately, the “voice of the people” was heeded, and the blog was mostly restored. It’s a reminder of the fleeting nature of information on the web and the importance of preserving valuable resources.

Related posts:
  • None Found

WRAP up

August 14th, 2008 by John

The description of our newly launched Workflows in Research Assessment Program (WRAP) is now up on our website. WRAP fits within our Supporting New Modes of Scholarship theme. We will be looking at new and changing workflows for libraries in support of research assessment by institutions. In countries with strong research assessment programmes, such as the UK (whose Research Excellence Framework is developing) and Australia, where research assessment is done nationally as a way of dividing up large sums of government funding, libraries have been involved for some years in providing the bibliographic data upon which assessment is partly based. With the arrival of digital repositories, their role has become potentially stronger, but also in some ways more difficult. The Open Access agenda, for example, can conflict with institutional requirements for selective showcasing of their research output, and for rigorous identification of institutional affiliation (in the UK, as Lorcan Dempsey pointed out recently, even senior university managers have become interested in authority control and identifiers). In territories where national research funding is less of a driver, there are nonetheless pressures growing on institutions to manage the information about their own research production more effectively and efficiently, in support of improvements to assessment at individual level (tenure decision-making), and in other competitive contexts, such as public relations and knowledge commercialisation.

Constance attended the ARL Library Assessment Conference in Seattle last week. She drew our attention to a very relevant presentation by Patricia Brennan of Thomson Reuters which considers the library’s role in the assessment of research in universities. It looks at the drivers for research assessment, including reputation management, funding and tenure review. It touches also upon institutional rankings and league tables, and it attempts to identify roles for the library in this increasingly important activity.

One of her slides revealed the fascinating diversity in international approaches to research assessment:

I was reminded of a slide I’d seen a couple of years ago by the late Professor Sir Gareth Roberts, President of Wolfson College, Oxford, on the future shape of the UK’s Research Assessment Exercise. Professor Roberts had been involved in advising the (then) Australian Department of Education, Science and Training (DEST) on the introduction of its assessment framework, and he produced this sparser slide to show international approaches, with the horizontal axis here measuring ‘Influence on Funding Decision’ rather than ‘Adoption of Metrics’.

It is interesting to see how attention to this issue has extended to include the many countries which now feature in Patricia Brennan’s more recent chart, and also how Australia has moved into the upper right quadrant, suggesting that research assessment has intensified there in recent years. The Netherlands is also in a different quadrant, though due to the different horizontal measure: it is a world leader in research metrics and its universities are intensively reviewed, but there is as yet no strong correlation between these features and research funding decisions, though there is some evidence that this is under review.

Work on library workflows in research development and publishing has been undertaken in several places in recent years. We are particularly interested in the Mellon-funded Multi-Dimensional Framework study undertaken by the University of Minnesota Libraries in 2006, which identified ‘primitives’ in research behaviours around which new support services could be built by libraries. With the focus on research assessment intensifying for universities in many countries, what new behaviours are evident, and are the associated workflows yet optimised? If the library is becoming a research publisher, what workflows follow from that? How does the library’s control of information on research outputs integrate with other institutional research information held in university data marts and data warehouses in business intelligence contexts? We are currently assembling the expertise from within our Partnership and beyond to help us analyse these workflows - as they are currently, and as they may become over time.

Related posts:
  • None Found

LIBER, Equality, Fraternization

August 8th, 2008 by John

This year’s annual LIBER conference was held in Istanbul’s Koç University - a private university on the outskirts of the city. Ricky and I attended, and joined European librarians in jovial mood. A lot of attention in the European research library community at present is on Europeana, the European digital library which is being built collaboratively with the assistance of large-scale digitisation funding from member states. Elisabeth Niggemann, Director General of the Deutsche Nationalbibliothek and Chair of the Conference of European Librarians, updated the conference on its progress to date. A final prototype is expected in November this year, which will provide access to 2m items. When finally launched to the public, it should have 6m items. But there are some concerns about the pace of development. Elisabeth Niggemann felt that the current snail’s pace of digitisation, being done only as project money is released bit-by-bit by European national governments, is not sustainable. And the attempt to portalise European digital content is already straining under the Google effect. Europeana began life as a European response to the Google Books project, but there is a growing acceptance now that European digital content needs to be found in Google services. Eisabeth Niggemann remarked in the Q&A that ”Europeana is just a method to attract the Googles of this earth.’

Ricky gave a presentation on mass digitisation of special collections material, based on the Shifting Gears work which she and Jen Schaffner reported last year. She gave a lucid account of the arguments for a quantity approach. Her paper went down well, and is to be published in LIBER Quarterly. Other European conferences are likely to feature Programs speakers on the same topic, and I will lead a workshop on it at the RLUK autumn meeting in Leeds.

Among other highlights was a presentation from the academic perspective by Professor Sijbolt Noorda (Chairman of the Dutch Association of Universities). We thought digitisation would save us money; now we know it’s about investment. Universities should be paying for their own libraries’ investments in this area, and not leaving it to external bodies. He went on to talk about the role of digitisation in the management of university reputations, making a bold statement which I noted because it provides a useful epigraph for our newly launched Workflows in Research Assessment program: ‘The essential game of each university is the reputation game; the essential game of each researcher is competition.’ Publishers make possible that game, but publishing is changing and becoming something universities do - with library involvement. We must be bolder in this, and prepared to experiment more in collaborative ways.

There was a session on web archiving. Fifteen European national libraries are now harvesting their national domains, with a further nine at the planning stage. The view of the Bibliothèque Nationale de France (BnF) is that legal deposit treats all publications as equal. Thus they sample rather than select. French legislation means that they don’t need advance permission before harvesting - though access is only available from within the BnF. In the UK, by contrast, permission is required. This approach, confessed John Tuck of the BL, is not sustainable. Less than 1% of the total domain would be captured in 10 years.

Our OCLC EMEA colleague Janet Lees presented on ‘Moving Metadata Upstream: Early Outcomes from the OCLC Next Generation Cataloguing Pilot’. She located the origins of the ‘Next Generation’ work in the previous generation, with the vision of Fred Kilgour, founder of OCLC. The pilot has a couple of mantras: ‘Metadata should be acquired early and once. Metadata should be made to work harder and smarter’.

A presentation which also gave me heart was that of Dr Ralf Schimmer of the Max Planck Digital Library, who updated us on SCOAP, the initiative in high energy physics (HEP) which is seeking to convert the whole commercial journal literature to author-pays open access. A lot of funding has now been obtained from funders worldwide, and the collaborative effort is inspiring. Consortial regions and entire countries are being asked to assess their collective spend on HEP journals, and to divert it to the initiative. A tender will be issued, and publishers who respond will be obliged to unbundle their HEP subscriptions from their other offerings. If successful, this will be an important breakthrough in allowing the consumer to influence the terms of the offer. It would also offer an example to other disciplines which could open some powerful new directions in scholarly publishing along the lines proposed by Professor Noorda.

Sadly, I had to leave the conference early, and so missed the session on metrics and research assessment. I also missed the conference dinner which took place on the Bosphorus. For more details on either, ask Ricky!

Related posts:
  • None Found

Libraries and archives: cows of a different color?

August 8th, 2008 by Merrilee

Two interesting (and contrasting) blog postings popped into my feed reader today.

The first blog posting is from colleague Andrew Pace (the second in his “sacred cow” series). (Why don’t you have a “cow” category, Andrew? Or a “have a cow” category, for that matter.) For those of you who don’t like clicking links, I’ll summarize: circulation rules are ridiculously complicated, and putting restrictions — on what categories of users can use which materials, when, and for how long — serves no one.

I read this and I thought, well, special collections don’t function that way. Hmmm.

Five minutes later, I read the second blog posting from the Accidental Archivist. Again, to summarize, north American archivists are very welcoming and helpful to users of all stripes. He goes on to contrast North American behavior with European behavior. I have little experience in the European archival theater, so I can’t comment. If you have some observations, please leave him a comment!

I don’t think I have anything smart to say about the differences between libraries and archives in this regard. Maybe it’s that academic libraries’ constituencies are more clearly defined. Special collections (even in an academic context) consider their audience to be the scholars who are interested in the “stuff.” If they have bothered to turn up at your doorstep, they belong.

Related posts:
  • None Found

EAD @ 10 and the future of archival description

August 8th, 2008 by Merrilee

As I’ve mentioned previously, I’ve been involved in planning the upcoming EAD @ 10 symposium. The full agenda has been posted and registration is now open. RLG Partners can register for free, email me for details. For the rest of you, early bird registration closes on the Wednesday the 13th.

I’ve been helping to coordinate the “Into the Future” portion of the program, panel discussions on what the next 10 years of archival description will bring. We have 5 great speakers confirmed: Mary Elings, Mark Matienzo, Michelle Light, Jeanne Kramer-Smyth and Kathy Wisser.

Yesterday, Kris Kiesling and I had a great phone call with most of the panelist, and in a little more than 30 minutes we had raised several “big questions” about what will influence the future of archival description. I’m going to list these below, and invite you to add your questions to the comments here, or, add to the (unofficial) SAA Wiki page for EAD @ 10. Panelists will each address a few of these questions, so if you have a question about the future for our crystal-ball-gazers, let’s hear it!

Factors that might influence the future of archival description.

  • What will be the impact of MPLP/Greene-Meissner. If we are moving to a model of less processing up front, and perhaps more iterative processing down the line (“just-in-time description?”) what will be the impact on description or descriptive workflow?
  • Specifically thinking about EAD, that standard was developed to support a range of descriptive practices. With 10 years of encoding under our belts, can we imagine identifying what people actually do in terms of markup? Determine what descriptive elements are most useful both from and end user and management point of view? Using this information to move to a much tighter version of EAD?
  • As different metadata creation centers (libraries, archives, museums, digital library production, institutional repository, etc.) come into closer contact with one another, what will be the influence of these communities on one another?
  • Will some aspects of descriptions (names, places, subject terms, etc.) become networked? If so, what will be in the impact on archival description. Would netwokred elements of description allow for some automatic identification of terms, rather than implicit markup?
  • Will archival description move away from a document-centered model? What are the possibilities for moving to other models of representation? Shifting away from hierarchical structures and towards relational structures, for example.
  • What will be the impact of user contributed metadata?
  • What will be the influence of electronic records?
  • Will metadata creation tools lead us in new directions, or do they simply model current practices?
  • It can be argued that EAD has changed archival descriptive practice. What standards will change our practices going forward?
  • It can be argued that technology (specifically, the advent of the web) has changed our approach towards archival descriptive practice. What new technologies will change our practice going forward?
  • How will our evolving understanding of end user needs shape archival descriptive practice?
Related posts:
  • None Found

Programs viennent Ă  Paris!

August 7th, 2008 by John

RLG Programs will run its first European Partner Meeting in Paris on 5-6 November 2008. We felt that, although several of our European Partners are represented at our Annual Meetings, for many of them a venue in Europe is more realistic. Paris was chosen not simply for the obvious reasons, but also because its transport links are excellent, and - for our many UK Partners - it is of course one of the nicer cities available via a short rail journey. The Meeting will be a more compact version of the Annual Meeting, developing and reporting on some of the new areas which were surfaced in Philadelphia, but running as a plenary sharing of activities and ideas through a single day.

We will begin, however, by getting to know each other better at an Evening Reception on Thursday 5 November, in order to allow a full day of presentations and discussion on Thursday 6th. Both Jim Michalko and Lorcan Dempsey will attend and provide the broad context in the first session. We will then work in plenary through a number of key theme areas, with presentations from Program Officers, and also from some of our European Partners. Delegates who wish to digest the messages of our Meeting along with some of the city’s other charms may then decide to stay on for le week-end.

Naturally, we hope for a strong attendance from our European Partners - but Partners from any country are welcome to register if they wish to join us! Details are on our website.

Related posts:
  • None Found

Measuring, in special collections

August 4th, 2008 by Merrilee

Following the Digitization and the Humanities Symposium, Jennifer Schaffner and I wrote a brief report, The Impact of Digitizing Special Collections on Teaching and Scholarship: Reflections on a Symposium about Digitization and the Humanities (PDF — 10 pages). The report acts as a summary of the symposium, and also gives some calls to action.

One of the topics that came up for discussion during the symposium was use metrics. Here’s a snippet from the report.

A major take-away point from the symposium was: know your scholars. The RLG Programs partners
engaged with the call for measuring and evaluating the impact of special collections on research.
Alice Schreyer (University of Chicago) spoke of the plusses and minuses of existing metrics in
special collections, and called for unified practices…. [Paul] Courant had a slightly different view… a provost is looking for a few top
faculty members to say that special collections (digital or not) are valuable for research. Without
faculty voices, all the metrics we have may be worthless. Ultimately, what matters is that scholars
find and use primary materials, and view the library (and its collections) as a valuable resource.

Call for action:
Although we need evidence of the impact of digitization and the unique collections themselves,
quantitative metrics aren’t enough. We must make sure libraries and archives both measure use of
special collections and work with faculty to demonstrate their value for excellent research.

As a follow up to this discussion, Jennifer and I will be hosting a Webinar on Thursday August 14th (8 am Pacific daylight time). Our Webinars are opportunities for us to discuss ongoing work in RLG Programs — this one will be a little different because we’d like to explore the topic with those of you who are out in the trenches. Do you need use metrics? If so, what? Why? Would data would you like to be collecting, and what keeps you from collecting it?

In preparing for the Webinar, I read a not-yet-published paper by Elizabeth Yakel and Elizabeth Goldman. They analyze interviews done with those who “measure” in archives. Then they categorized why institutions measure. Out of curiosity, and in advance of our Webinar, please take the poll below (this is completely non-scientific, and will only be used to help shape the discussion). I’ll share the results later. Please also note that you may need to scroll over the questions in order to see the whole thing — all part of the grand experiment).


Quizzes by Quibblo.com

If you are interested in joining the Webinar and participating in this conversation, please send me an email (proffitm@oclc.org) and I will email you the information.

Related posts:
  • None Found

It’s still: Think Global, Act Local

July 31st, 2008 by GĂĽnter

For the last 5 years, my friend Mary Elings from the Bancroft Library and I have made a trip to upstate New York in the summer to teach a one week intensive 3-credit graduate class for the iSchool at Syracuse University (IST677 – Digitization in libraries, archives and museums). Every year, preparing for and teaching the class invites us to pause and reflect on how much has changed in just 12 months in the field of digitizing collections. While the main pillars in the outline of our from-the-cradle-to-the-grave syllabus remain, what we say about each topic changes considerably from year to year. Not surprisingly, we increasingly feel that we need to both impart current practice, while at the same time emphasizing new thinking in the field which challenges business as usual.

Luckily enough, Mary and I as a tag-team are well poised to take on that challenge. Mary predominantly reflects the local point of view of the professional who has to get things done in the here-and-now, while I predominantly reflect a global point of view of somebody who can afford to think about how things should adapt to the realities of our networked information economy. When we quibble, it isn’t just a disagreement, but an educational moment illustrating the times we live in; when it all comes together, it should read like the old bumper sticker as an exhortation to “Think global, act local,” as Mary pointed out to the students.

What follows is an impressionistic glance at the areas where I see a shift in what we talk about in class – in most instances, these are trends which I think will become more pronounced as we continue to teach the class.

  • Creating digital collections – we used to emphasize high-end digitization using digital camera backs; now we’re starting to talk about the potential advent of mass digitization for rare and unique materials.
  • Describing the collection – five years ago, the biggest wow-factor in this area was showing students that the use of XML allows institutions to repurpose collections in a variety of interfaces and makes content portable; while this still impresses them, we’re now also incorporating ideas about how to better leverage authorities (for example through Terminology Services), and we discuss the role of user contributed content. As in all other areas, we use Shifting Gears [pdf] to point to the balance between what’s commensurate, and what’s overkill.
  • Disclosing the collection – we used to emphasize contributions to LAM community-built aggregations such as statewide digitization programs, licensed resources, subject based aggregations or OAI portals; now we’re starting to spend much more time thinking through the implications of the Flickr Commons or pointers to special collections on Wikipedia – in short, leveraging the infrastructure around us which we (as a community) haven’t built and don’t control. (In revisiting my slides about the community-built aggregations, I realized that a good number of them have had to weather major transitions lately, and some didn’t survive, which indicates that we still don’t know how to sustain those resources.)
  • Using digital collections – we used to emphasize the studies (such as this Wesleyan/NITLE report) which show that faculty by and large don’t teach from licensed or locally built collections; now we are starting to think more about what role the library could play, for example in helping faculty manage their own personal collections.
  • Preserving digital collections – while we still talk quite a bit about the usual suspect acronym soup (OAIS, METS, PREMIS, NISO Z39.87), we’re now also considering issues of the economics of digital preservation (as exemplified by the work of the NSF Blue Ribbon taskforce). We also acknowledge the impact of mass digitization on digital preservation, which leads to a re-thinking along the lines of Oya Rieger’s recommendations in a recent CLIR report, such as Seek Compromise to Balance Preservation and Access Requirements and Reassess Digitization Requirements for Archival Images.
  • In this way, the class acknowledges that established work processes are likely to continue while we explore new ways of serving our audiences. It is a balancing act professionals must manage, whether new to or veterans of the library profession. By bringing these issues to our class, we hope to encourage our students to think of libraries, archives and museums as a field rife with possibilities for those with creative minds!

    Related posts:

    Benchmarking Network Performance: Measures and Behaviors

    July 24th, 2008 by Constance

    Over the last year, GĂĽnter and Ricky have been examining models of collaboration across the cultural heritage community, working with RLG partners to identify the most significant obstacles and incentives to effective library-archive-museum partnerships. This is part of a program of work exploring cross-domain convergence in organizational structures and service requirements. GĂĽnter has reported on some of this work here (May ‘08), here (November ‘07) and here (July ‘07). A final report from this project is expected soon. It should help us to understand how cooperation contributes to improved institutional performance, including greater discoverability of collections, better integration of functions, and increased operational efficiencies. All of these objectives are important to research institutions that want to participate fully in the network information economy.

    In a related vein, Dennis recently queried members of the long-running SHARES inter-lending partnership about the hallmarks of ‘high-performance’ sharing partners. A new Working Group on High Performing Lenders has drafted a survey to

    learn what qualities [SHARES participants] value in a supplier and which … partners consistently display those characteristics

    I was interested to see that the criteria under consideration include some social behaviors and delivery options that respond to expectations that have been shaped (or at least sharpened) by the larger network environment. These include [with my annotations]:

    • Quality of scanning/copying [surrogates should meet or exceed quality of original] 
    • Willingness to supply rare or hard-to-find materials [if content is discoverable, it is assumed to be available]
    • Quality of holdings data in WorldCat [supply chains rely on accurate and reliable disclosure]

    The library’s ability to meet end-user expectations is dependent upon the performance of its service providers — including its inter-lending partners. This is increasingly true in an environment where ‘local’ holdings are assumed to be continuous with the collective collection of library content. The larger information network imposes a set of collaborative imperatives that cross organizational, institutional and geographic boundaries.

    All this has led me to wonder, as the ARL Library Assessment Conference (Seattle, 4-7 August) approaches, about the kinds of metrics we use to assess the value and performance of cultural heritage institutions, especially those that serve the research community. My colleague John MacColl attended a recent meeting of representatives from some of the organizations involved with with establishing and monitoring research library metrics, including ARL (123 institutions in North America), SCONUL (172 institutions the UK) and CAUL (41 institutions in Australia). There is evidently some interest in harmonizing these measures so that research libraries can begin to benchmark their collections and services against global indices. This suggests that libraries are seeking to situate themselves in an expanding network of information service providers in which performance standards reflect common operational requirements and social norms.

    There are a couple of measures in the current array of library performance indicators that can be used to assess inter-institutional cooperation on a system-wide scale. Resource sharing statistics provide a gauge of network participation and institutional co-dependence. Annual expenditures on shared infrastructure are another useful index of collaboration. Since the mid 1990’s the National Center for Educational Statistics has tracked US library investments in ‘bibliographic utilities, networks and consortia’ – an interesting combination of social and technological support systems — as part of overall library operational expenditures. ARL added this category to its statistical measures in 2004. An acknowledgement, perhaps, that institutional performance and achievement in the research library sector is increasingly dependent upon collaboration.

    In other sectors — notably supply chain management — the tangible benefits of cooperation have been studied more carefully. [References] Some fascinating work has been done to model instruments for measuring collaboration amongst “chain members” to enable reliable benchmarking and performance assessment. [Simatupang, T. M., & Sridharan, R. (2005).] The LAM community might benefit from a similar assessment framework, which acknowledges the multi-dimensional character of cooperation (information sharing, decision synchronization, alignment of incentives) and highlights its real operational value.

    Related posts: