Institutional repositories – are they a waste of money?

Added by Pippasmart on 15 October 2010 13:20

17 comments

 There are no tags for this entry

Everyone seems to have their own repository these days. Not only groups of interested researchers (e.g. ArXiv.org) and individual institutions (e.g. Southampton University) but even departments within universities (e.g. University of Oxford Mathematical Institute). I can understand many of the arguments for such repositories. However I wonder if developing such large numbers of repositories is truly the right way forward.

At the recent RIN seminar on the future of scholarly publishing there were interesting talks on UKPMC and the University of Liverpool. UKPMC is doing some innovative things: building good discovery interfaces and capturing background data as well as theses and other unpublished works. Where they have links with universities (such as Liverpool) they can then deposit articles onto the university repository.

This got me thinking about the duplication of articles, sitting on various repositories around the world, bringing with them the possibility of version problems and duplication of effort and cost. Are universities wasting their money? The cost of developing, building and maintaining an institutional repository is not small. According to Alma Swan’s recent report for the JISC, Modelling Scholarly Communication Options, the cost of running a repository ranges from £26k to £209k. Figures for development range from “free”(!) to several hundred thousand dollars.

However, cost seems to be no deterrent and the number of repositories is growing rapidly. As I write this, ROAR (the Registry of Open Access Repositories) lists 1925 repositories, of which the vast majority are operated by single institutions or departments. The materials that they hold duplicate to some extent the holdings of larger discipline-based sites such as RePEc, ArXiv.org and PMC.

I can see various problems with this fragmented model quite apart from the duplication issues; for instance, types of files accepted for deposit (not fully XML-encoded) and lack of interoperability (ROAR cannot obtain data from a large number that they index). In the longer term it is likely that although technologies will (hopefully) allow for more integration, on-going maintenance and development of each repository will lead to greater diversification – providing solutions for individual institutions but potential complications for the interoperability of such systems. There may also be funding problems when a small institution suffers budget cuts that leads to reduced collection and upload activity.

At present there is a debate taking place about the correct model for ArXiv. Last year ArXiv.org announced that it currently cost $400k p.a. to run, and was expecting this to rise to $500k within the next two years. Along with this announcement they asked for participating institutions to help share the cost. There are questions about whether ArXiv should be expected to capture everything (and allow institutions to harvest their content) or whether it is more sustainable for individual institutions to capture their own content and allow ArXiv to harvest it. ArXiv is very cost-effective; however the cost needs to be shared in some way if it is to be sustainable.

And what is the role for discipline-based repositories such as UKPMC? If a repository is to serve information-seekers then these are far more useful than (smaller) individual cross-discipline repositories. However they suffer from a lack of access to the authors of non-mandated materials such as theses where institutional libraries are better placed to advocate and coerce deposit.

In these straightened times, a collaborative model may be a better way to minimise costs. It seems to me that local capture onto a centralised repository which can then provide institution-based or discipline-based “windows” to its content is surely the way forward, with the actual content residing in one place, correctly captured, tagged and future-proofed. Such models do exist (e.g. CLACSO in Latin America, and HAL in France), but they appear to be the exception rather than the rule. Nevertheless, this approach should benefit from cost-savings and avoid duplication of effort whilst ensuring a high level of technical development.

Perhaps one day…

 

Pippa Smart, pippa.smart@gmail.com, www.pspconsulting.org

Pippa Smart is a publishing consultant working with editorial groups and publishers.

 

Comments

Anonymous (not verified) said on 14 May 2012 at 6:21pm:

Very important discussion indeed! I trust you and believe your thoughts bcoz institutional repositories efforts ensuring a high level of technical development. So I can’t assume they are waste of money. Thanks! @ storm cases

Anonymous (not verified) said on 03 May 2012 at 9:27am:

 

Yazılım Mühendisliği, kaliteli yazılım mühendislerine oluşan yoğun talep üzerine Bilgisayar

Mühendisliği çatısı altında 2008 yılında kurulan yeni bir programdır. Yazılım Mühendisliği, yazılım sistemlerinin mühendislik prensipleri çerçevesinde tasarımını, üretimini ve işletilmesini kapsayan bir bilim dalıdır. Yazılım Mühendisliği Programı  öğrencileri, bu uygulamalı bilim dalında en güncel bilgiye ulaşma ve yoğun çalışmalar yürütebilme imkanına sahiptirler. Bilgisayar Mühendisliği Bölümü, bilgisayar ve yazılım mühendislikleri anabilim dallarına bağlı birçok araştırma alanı ve değişik ülkelerden gelen, tecrübeli bilim adamını bünyesinde barındırmaktadır.

 

 

Anonymous (not verified) said on 29 April 2012 at 10:31am:

How can one prove that money for motivation? Money and recognition have always been popular offerings when trying to create motivation for great work.

 

 

Anonymous (not verified) said on 26 April 2012 at 8:02am:

Bestech believes that it has an edge in the field of Real Estate Development because the group has the combined strength of conceptualization with 100% implementation and delivery.
Park view Grand Spa
park view ananda gurgaon
bestech ananda project
palm gardens

Anonymous (not verified) said on 22 April 2012 at 2:48pm:



Favorite post having such an fantastic and useful informative content. describing good blogging concepts as well as basics that are very much useful in skilled content writing as well
http://www.ukppiclaims.org.uk

Anonymous (not verified) said on 30 March 2012 at 6:18pm:

I am exploring this topic for a report I have to write. You have good content and I would like to know if you have any other posts about this?   Tennesseechocolates.com

Anonymous (not verified) said on 22 March 2012 at 2:22pm:

I guess that this will be the main issue. I am sure that you will have good results in the future if you will continue in the same manner. Great job! printed t shirts

Anonymous (not verified) said on 16 February 2012 at 3:50pm:

I’ve noticed a new breed of personal repositories that have been getting a lot of press recently, and this all started from a website called pinterest, which is a bit like an online repository that allows its users to store things that they find while surfing and want to make a note while sharing it with the pinterest community. For example, Fun Playhouses has a pinterest sharing images of playhouses and garden games.

Anonymous (not verified) said on 15 February 2012 at 3:28pm:

Interesting point of view. I, too, feel that thanks to the internet we are becoming more and more superficial; but then this shouldn’t surprise us a bit: how otherwise manage the huge inflow of information that we must face each day? refinance mortgage

Anonymous (not verified) said on 06 February 2012 at 11:09am:

it is an interesting article… you should post more often this kind of articles. yacht charter France

Anonymous said on 08 November 2010 at 3:36pm:

Great idea.  It comes about 5 years too late, though.

Anonymous said on 02 November 2010 at 3:49pm:

The arguments for and against Institutional Repositories are many and varied. Personally, I think the root argument has been missed: The web pages that we see (and call Institutional Repositories) are advertorials for institutions. There is a separate (background) task, doing a completely different thing, which relates to the collection & preservation of the research done by an organisation…. information which needs to be collected, collated, catalogued (and lots of other ‘C’ words too).

The argument about duplication of information is also spurious: the whole concept of the Internet is about redundency and multiple copies (how many sites list the lyrics for modern songs, for example.) We need to stop fighting the tide, and accept that multiple copies will exist…. and that researchers will move from Institution to Institution.

Given the political nature of research institutions, there is never going to be a single accepted structure for “a deposit” (heck, if we can’t get Dublin Core right, we’re stuffed when if comes to merging Articles, Book Chapters, Research Data, and Learning Objects!) - so any Grand Idea to have a uniform, but distributed, repository is doomed before it’s even conceived!

Once we accept that there will be multiple copies of the same thing expressed in different ways, and in multiple locations, we can build an infrastructure that will allow us all to share research in a sesible and coherant manner.

 

Anonymous said on 20 October 2010 at 9:23pm:

It’s nice to see an independent champion of UKPMC. I think that major science funders should really get their act together and take the lead on this issue. Collectively, they would have the power to enforce a consistent set of standards.

Anonymous said on 20 October 2010 at 11:36am:

Speakiing on behalf of UKPMC, we do not see our ourselves competing with institutional repositories for content; on the contrary they are all welcome to harvest our metadata and our OA content.  We are participating in the JISC-funded Repository Junction Project which is developing a tool which will allow deposit into multiple repositories simultaneously which should reduce the admininstrative overhead on the depositor.

Phil Vaughan

Programme Manager, UKPMC

Anonymous said on 19 October 2010 at 3:05pm:

Our school repository (for which I am the repository manager) provides a comprehensive collection of our research over the last decade, and takes a few hours a month of my effort to administer. in fact, that was one of the constraints imposed by our management: that the repository should not detract anyone from their job of doing actual research!

Making something central does not magically ensure that money is saved, nor that information is “correctly” captured, tagged or future-proofed.

Anonymous said on 19 October 2010 at 10:04am:

I can see the intial attraction of a single centralised repository that is theoretically more efficient. But there are a number of problems. Firstly the difficulty of getting all the institutions in the UK to agree to the design of the repository, the policies for deposit etc. Secondly, even if the money could be found to establish this, can the ongoing funding be guaranteed? If its a collaborative venture, agreements can break down in the longer term. If its top sliced central funding, well we are all aware of the vulnerability of that model in the current climate. In my opinion, the organisations that have the most likliehood of continuity in higher education are the universities and colleges themselves. Therefore, a model which has the institution as the basic unit and content is then harvested and presented in different spaces stands a better chance of long term sustainability.

Jackie Wickham

Anonymous said on 15 October 2010 at 8:45pm:

Thinking long term in scholarly communications, duplication and redundancy are essential. Multiple repositories are good for us. Encourage every author to deposit in their local (or domain) repository (and link back to the original, of course). The only slight disadvantage of this as far as I can see is some loss of Googlejuice. But the long term advantages are far higher. And let’s remember, the marginal costs of the repository model are VERY low!

 

Post new comment

  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong><em> <br> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd> <h2> <h3> <h4> <h5> <p>
  • Adds typographic refinements.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
Image CAPTCHA
Enter the characters shown in the image.

In this section

Share this



© Research Information Network 2005–2009