Towards an open library of relational metadata: the experience of RePEc Research Papers in Economics - PowerPoint PPT Presentation

About This Presentation
Title:

Towards an open library of relational metadata: the experience of RePEc Research Papers in Economics

Description:

We need to make the concepts coming of free software more a part of our business. ... of the Internet allowed me to download free software without effort ... – PowerPoint PPT presentation

Number of Views:75
Avg rating:3.0/5.0
Slides: 44
Provided by: kric2
Learn more at: https://openlib.org
Category:

less

Transcript and Presenter's Notes

Title: Towards an open library of relational metadata: the experience of RePEc Research Papers in Economics


1
Towards an open library of relational metadata
the experience of RePEc (Research Papers in
Economics)
  • Thomas Krichel
  • 2003-11-07

2
who is me?
  • I was an economist.
  • I was a leisure digital librarian.
  • NetEc since 1993
  • RePEc since 1997
  • I am "just another Perl hacker"
  • I am a visionary
  • but I'm not like St. John the Baptist

3
who is he?
4
he is "St. IGNUicus"
  • A humoristic creation of Richard M. Stallman
    (RMS)
  • RMS is the father of the free software movement
  • a geek
  • a visionary
  • St. IGNUicus shows an emphasis on the moral case
    for free software, rather than the business case

5
moral case and business case
  • Other folks in the free software movement avoid
    the "f" word
  • free can mean cheap
  • cheap can mean bad
  • They stress the business case of free software
  • They use the term "open source software", (OSS)

6
RMS and us
  • Amen, I tell you we librarians need to learn
    more from the OSS movement.
  • We need to make the concepts coming of free
    software more a part of our business.
  • Let us look at a key concept free software.

7
free software according to RMS
  • Free software comes with four freedoms
  • The freedom to run the software, for any purpose
  • The freedom to study how the program works, and
    adapt it to your needs
  • The freedom to redistribute copies so you can
    help your neighbor
  • The freedom to improve the program, and release
    your improvements to the public, so that the
    whole community benefits

8
what has this to do with us?
  • Just replace free software with free information.
    Libraries are about free information.
  • But the analogy is not quite as simple.
  • When we talk about free information, we usually
    mean things that we can freely read (download).
    free as in 0
  • We do not usually mean free information as
    information we are free to do things with. Free
    as in freedom.

9
moral and business
  • There is a moral case for free information.
  • We rely on it.
  • There is a business case for free information.
  • We need to make our own.

10
we rely on the moral case
  • The citizen should be informed
  • Individuals in the organization should have free
    access
  • This is how we justify resources given to us.
  • Often, members of the community who pay get
    privileged access.

11
from moral case to business case
  • To form the business case for free information,
    think of "free information" as "freedom to do
    things" rather than 0.
  • Thus libraries can make a crucial business case
    for them as agents who transform information.
  • Recall that there are whole industries out there
    that produces free information.

12
Now for something different
  • RePEc is an example for an Open Library.
  • An Open Library is loosely defined an application
    of the OSS principles to libraries.
  • vague
  • in the making
  • but has some history
  • Looking at RePEc will fix ideas.

13
History
  • It started with me as a research assistant an in
    the Economics Department of Loughborough
    University of Technology in 1990.
  • a predecessor of the Internet allowed me to
    download free software without effort
  • but academic papers had to be gathered in a
    painful way

14
CoREJ
  • published by HMSO
  • Photocopied lists of contents tables recently
    published economics journal received at the
    Department of Trade and Industry
  • Typed list of the recently received working
    papers received by the University of Warwick
    library
  • The latter was the more interesting.

15
working papers
  • early accounts of research findings
  • published by economics departments
  • in universities
  • in research centers
  • in some government offices
  • in multinational administrations
  • disseminated through exchange agreements
  • important because of 4 year publishing delay

16
1991-1992
  • I planned to circulate the Warwick working paper
    list over listserv lists
  • I argued it would be good for them
  • increase incentives to contribute
  • increase revenue for ILL
  • After many trials, Warwick refused.
  • During the end of that time, I was offered a
    lectureship, and decided to get working on my own
    collection.

17
1993 BibEc and WoPEc
  • Fethy Mili of Université de Montréal had a good
    collection of papers and gave me his data.
  • I put his bibliographic data on a gopher and
    called the service "BibEc"
  • I also gathered the first ever online electronic
    working papers on a gopher and called the service
    "WoPEc".

18
NetEc consortium
  • BibEc printed papers
  • WoPEc electronic papers
  • CodEc software
  • WebEc web resource listings
  • JokEc jokes
  • HoPEc
  • a lot of Ec!

19
WoPEc to RePEc
  • WoPEc was a catalog record collection
  • WoPEc remained largest web access point
  • but getting contributions was tough
  • In 1996 I wrote basic architecture for RePEc.
  • ReDIF
  • Guildford Protocol

20
1996 RePEc principle
  • Many archives
  • archives offer metadata about digital objects
    (mainly working papers)
  • One database
  • The data from all archives forms one single
    logical database despite the fact that it is held
    on different servers.
  • Many services
  • users can access the data through many
    interfaces.
  • providers of archives offer their data to all
    interfaces at the same time. This provides for an
    optimal distribution.

21
RePEc is based on 330 archives
  • WoPEc
  • EconWPA
  • DEGREE
  • S-WoPEc
  • NBER
  • CEPR
  • US Fed in Print
  • IMF
  • OECD
  • MIT
  • University of Surrey
  • CO PAH

22
to form a 209k item dataset
  • 119,000 working papers
  • 87,000 journal articles
  • 1,000 software components
  • 600 book and chapter listings
  • 3,500 author contact and publication
    listings
  • 7,300 institutional contact listings

23
RePEc is used in many services
  • BibEc and WoPEc
  • Decomate Z39.50 service
  • EconPapers
  • NEP New Economics Papers
  • Inomics
  • RePEc author service
  • IDEAS
  • RuPEc
  • EDIRC
  • LogEc

24
describes documents
  • Template-Type ReDIF-Paper 1.0
  • Title Dynamic Aspect of Growth and Fiscal Policy
  • Author-Name Thomas Krichel
  • Author-Person RePEcper1965-06-05thomas_kriche
    l
  • Author-Email T.Krichel_at_surrey.ac.uk
  • Author-Name Paul Levine
  • Author-Email P.Levine_at_surrey.ac.uk
  • Author-WorkPlace-Name University of Surrey
  • Classification-JEL C61 E21 E23 E62 O41
  • File-URL ftp//www.econ.surrey.ac.uk/
    pub/RePEc/sur/surrec/surrec9601.pdf
  • File-Format application/pdf
  • Creation-Date 199603
  • Revision-Date 199711
  • Handle RePEcsursurrec9601

25
describes persons (HoPEc)
  • template-type ReDIF-Person 1.0
  • name-full MANKIW, N. GREGORY
  • name-last MANKIW
  • name-first N. GREGORY
  • handle RePEcper1984-06-16N__GREGORY_MANKIW
  • email ngmankiw_at_harvard.edu
  • homepagehttp//post.economics.harvard.edu/faculty
    /
  • mankiw/mankiw.html
  • workplace-institution RePEcedideharus
  • workplace-institution RePEcedinberrus
  • Author-Article RePEcaeaaecrevv76y1986i4p
    676-91
  • Author-Article RePEcaeaaecrevv77y1987i3p
    358-74
  • Author-Article RePEcaeaaecrevv78y1988i2p
    173-77
  • .

26
describes institutions
  • Template-Type ReDIF-Institution 1.0
  • Primary-Name University of Surrey
  • Primary-Location Guildford
  • Secondary-Name Department of Economics
  • Secondary-Phone (01483) 259380
  • Secondary-Email economics_at_surrey.ac.uk
  • Secondary-Fax (01483) 259548
  • Secondary-Postal Guildford, Surrey GU2 5XH
  • Secondary-Homepage
  • http//www.econ.surrey.ac.uk/
  • Handle RePEcedidesuruk

27
what do open libraries do?
  • Identify records
  • Relate identified records
  • These actions require human control.
  • They prepare for assessment of performance.

28
key to success
  • Have a small group of volunteers
  • Disseminate as widely as possible
  • Demonstrate to authors and institutions that it
    works for them.
  • institutional registration
  • author registration

29
institutional registration
  • It started by one sad geezer making a list of
    departments that have a web site.
  • I persuaded him that his data would be more
    widely used if integrated into the RePEc
    database.
  • Now he is a happy geezer and one of our three
    crucial volunteers.

30
author registration
  • It started when funding allowed us to hire a
    crazy programmer to write an author registration
    system.
  • system went online as "HoPEc" in late 2000.
  • has been renamed "RePEc author service" (RAS)
  • recent grant from OSI allows for a rewrite and
    expansion.

31
(No Transcript)
32
RePEc author service
  • RePEc document data has author names as strings.
  • The authors register with RAS to list contact
    details and identify the papers they wrote.
  • This is classic access control, but done by the
    authors.
  • In a ranking of 800 most important economists,
    400 are registered with RAS.

33
authors' incentives
  • Authors perceive the registration as a way to
    achieve common advertising for their papers.
  • Author records are used to aggregate usage logs
    across RePEc user services for all papers of an
    author.
  • Stimulates a "I am bigger than you are"
    mentality. Size matters!

34
KEY idea 1
  • RePEc attracts a community of users and
    contributors
  • The community itself is the focus of attention
  • RePEc describes the living rather than the dead.
  • Forget about documents!

35
KEY idea 2
  • Forget about users!
  • Disseminate widely
  • Users will come through Google anyway.
  • And Google loves RePEc services
  • puts RePEc services top when the query consists
    of the name of an author

36
open library idea serials data
  • Serial level information is a crucial component
    of academic library data.
  • Idea build and maintain free serial records.
  • Two ways to build
  • Use volunteers and collect in a decentralized
    way.
  • Make an expensive central collection, disseminate
    well, charge for record changes later.

37
another open library idea law
  • Much of the legal texts are de jure free.
  • De facto there are two companies who have
    comprehensive collections and charge a lot of
    money for the free information bundled with
    proprietary information.
  • Our moral case calls for a replacement!
  • (it will also create jobs for us)

38
free legal open library
  • Have all laws and cases
  • online as text
  • identified related
  • Have citation metadata, so that legal citations
    can verified be while composing case data.
  • Registration procedure to verify the integrity of
    data.

39
open library idea II drugs
  • Collect data on the composition of all drugs
  • drugs composition reported by drug companies,
    using open archives
  • drug components documented by the governments,
    using an open archive
  • Open library brings the two together!

40
Am I crazy?
  • Money does not make the world go round. Ideas do.
  • When RMS proposed a free replacement for UNIX in
    the early 80s, most people dismissed the idea.
  • Today it is reality!
  • Similarly, when I started to work on RePEc a
    totally free and improved AI dataset in 1993,
    nobody gave it a high probability to succeed.
  • It is a reality!

41
obstacles to open libraries
  • lack of imagination entrepreneurship
  • inability to form alliances
  • user-centered thinking
  • document-centered thinking
  • technical competence required
  • OAI PMH
  • XML and XML Schema
  • Unicode
  • the "C" word

42
what I do for open libraries
  • Create an open library for library science the
    rclis (reckless) dataset.
  • Create a supporting organization
  • the open library society.
  • co-workers welcome!

43
http//openlib.org/home/krichel
  • Thank you for your attention!
Write a Comment
User Comments (0)
About PowerShow.com