12 Responses to “Repository architecture #83”

  1. Rachel Heery says:

    There was some discussion on the day as to how far one should label workflow and services around the source repository as being ‘repository’ workflow or ‘repository’ services – both at institutional and wider level. I think there are some benefits in articulating these associated workflows and services more generically as ‘research information systems’ and ’scholarly communication’. This might go some way to acknowledge the flexibilty and differences between the various approaches out there.

  2. [...] the morning about how such an architecture might look – links to his slides are available from his blog entry about the day. As Paul points out, there was a fair amount of discussion about repository content [...]

  3. [...] Walk kicked off the day by presenting his work on a next generation architecture for repositories. His presentation started off with a number of starting principles and moved on to some diagrams [...]

  4. I wouldn’t say reusability is over-rated as a virtue. I would say that reusability is really, really hard. In any case I think we frame the discussion differently these days. Originally reusability was so that you could maximize your ROI. Now we’re more concerned with flexibility, to maximize your… hmm… Change On Investment? So instead of “design & build an architecture that supports reusability”, let’s say “design & build an architecture that is flexible”. Flexibility Oriented Architecture?

  5. Chris Rusbridge says:

    Three thoughts. I just don’t get the Technorati versus Slideshare reference. Technorati sucks! I don’t know if this is because it’s by reference, or just that its business model is so broken that it can’t afford sufficient grunt to make it work, but it loses things, known searches fail, multi-page search results loop… I wouldn’t use it at all except for the “authority” figure (which all of the above makes doubtful. Slideshare, OTOH, just works.

    Second, I agree that most of the data that are in repositories are in subject repositories. Likewise most of the papers that are in repositories are in IRs (perhaps with the honourable exception of Arxiv). But most data are not in repositories at all. (Likewise most papers.) Most data cannot go into subject repositories because there are not enough of them, because the business model for subject repositories is fragile in the extreme. The business model for IRs, however, looks more robust the more content they can capture. So I think we need to find more ways of getting our data into sustainable IRs.

    Thirdly, (I think I catch your support for this by implication but not explicitly) getting the stuff in involves both incentives (in making science easier) and integration with researcher workflow. If they have to come back and do it later, I believe it just won’t happen. It’s why I was suggesting the (possibly over-complicated) research repository system idea in my blog post…

  6. paul says:

    Chris,
    three quick responses:
    First: I agree that Technorati does not always deliver…. I think the authority figure is utterly suspect – my own feeble ‘authority’ figure seems to change with the weather….
    However, while the execution is not great, I still think the model is a good one. The problem which Technorati tries to solve is considerable – it allows the creation of content to be remain completely distributed, unlike Slideshare.

    Second: I agree with you – I believe the institution is well placed manage this – we should see this activity grow as institutional workflows become established and smarter.

    Three: yes – rather than giving overt incentives to ‘deposit in the repository’, give the incentive to ‘use our fabulous workflow tools’ – which happen to deposit resources as appropriate.

  7. Three points arising from Paul’s presentation and the comments:

    1 On subject vs institutional repositories, *some* of the content (papers) is in subject Rs, of course. But the real question is: what about all the rest? Where is that to become OA if 100% is the target?
    2 Paul’s idea of ’source’ repositories is good if it means getting the repository interfaces closer to the end-users, perhaps school or dept. repositories, with a central services approach institutionally.
    3 Complexity appears to be rife. Data, metadata, workflows, etc. – all need to be worked out to ensure value always exceeds cost. We cannot afford to burden IRs with unconstrained expectations, responsibilities, and costs, especially in these uncertain economic times.

  8. paul says:

    Regarding point 3 – as Jim Downing suggests, the best we might do is to move the complexity around. I suggest moving it away from the source repository – but the institutional repository may have to pick up much of this.

    But yes – those expectations should be managed!

  9. Paul, may I borrow that screenshot (with credit, of course) for a presentation I am giving?

  10. paul says:

    Dorothea,
    Be my guest :-)

    I’d be interested to see how you use it…..

    Paul

  11. [...] – bookmarked by 2 members originally found by rickenriquericky on 2008-08-09 Repository architecture #83 http://blog.paulwalk.net/2008/07/07/repository-architecture-83/ – bookmarked by 3 members [...]

  12. [...] I also agree with the main point which is about the direction of the flow. Indeed, I have previously characterized the institutional repository as being, or more usually containing, the source repository. However, [...]