Title: Subjectspecific international services in Physics Eberhard R' Hilf, H' Stamerjohanns, and Thomas Sev
1Subject-specific international services in
Physics Eberhard R. Hilf, H. Stamerjohanns,
and Thomas SeveriensInstitute for Science
Networking physnet.uni-oldenburg.de/hilf
3. September Duisburg, Germany Workshop
International Interdisciplinary Open Archives
and Subject specific services in Mathematics and
Physics.
2Content of talk
- I Why subject-specific services?
- II Open Archive Distributed in Physics
- III International embedding and organizatio
3Part I Why subject-specific services?
- Knowledge repository requirements
- Restricted
- Complete
- Professional
- Research-driven
- Additional subsject-specific services
41. Why restrict knowledge basis?
- Higher ratio of relevant information retrieved
- Less missunderstanding different meanings and
content for same word in different fields
Test 1
52. Why complete repository?
- Prime research needs
- instant (Web, no delay) information of all
relevant new results - complete information fom anywhere in the world
- One stop service despite a multitude of
distributed heterogenous repositories.
Consequences for financing concepts
63. Why professional content repository
- Researchers need mostly information from their
professional colleagues. - Researchers can act only in their subject-field
as referees, quality filters for the wider
public, comment and select. - The Web allows for a multilevel professional
quality management for all heterogenous purposes
74. Why research driven repository?
- Authors have the highest motivation to be read,
to get their documents distributed and archived. - Author communication communities are
subject-specific. - Scientists understand only their
subject-colleagues - Research is organized most often in
subject-specific topical institutes
8Part II Distributed Open Archive for Physics OAD
- Vision of the ultimate subject-specific Open
Archive - All departments worldwide as prime, complete,
open free repositories - Secondary virtual add-on services use these
- Quality filters
- Collections
- topical archives
9Present incomplete realization
- All worldwide departments
- Few cooperate by local quality filters yet
- Few comply with metadata (1000 of 40000
documents) - Few give explicit open access (keep authors
rights)
PhysNet
10Completeness of data in a heterogenous world
- Free locally posted documents PhysDoc
- Free archived theses Depts, Univs., DDB,..
- Free preprint repositories ArXiv
- Free fulltext journals
- Free research lab docs CERN, ANL, ..
- University Publishers
- Journals of Natl. Societies APS, IoPP
- Commercial journals
11OAD Physics Project 2001
- Oai compliant service provider for
- PhysDoc 1.000 out of 40.000
- ArXiv
- IoPP
- APS
- PhysDiss European
- NDLDT 2001
- Cornell, CERN, MIT Oai-compliant Document
providers
show
12Part III Organizing international distributed
repositories
13Oai Cooperation of repositories
Oai lists
- Data providers comply with Oai
- Yes, if they are not service providers
Departments - Yes, if they are free access providers
ArXiv - Subtle, if national society publishers APS,
IoPP - No, if commercial publishers Elsevier,..
scirus
Cut throat competition of service providers with
best service for same documents
Commercial publ. collect free access documents
14Political and Funding Policy
- Effective services for research
- Money to libraries per No of accessible documents
- Multiple access ways TibOrder vs others
- Regulations for hiring scientists to Universities
- Funding selforganization of research communities
- University publishers as regular prime research
outlet - Fund IuK research to professionalize content
search
15Subject-specific National Port of Entry
- German Physical Society DPG plan
- Cooperative project of partners FIZ, TIB, ISN
KFP
Rescue boat syndrom?
16International Networking
- No bias policy no single society allowed to
dominate - Funding policy each society finds ist own funds
- Broker policy democratic network of brokers
DFN-Project - Department cooperation
- Operator
- Quality filters select what to enter PhysNet
- Metadata for documents
- Home page for document lists
- University publishers (vetting and archiving)
- National entry points for Oai.
PhysNet Charter
17(No Transcript)
18Joint project VT-ISN
- Funded under a new scheme jointly by NSF and DFG
(German Science Foundation) - One application, one refereeing body, one funding
scheme - Thus one team, one final intelligent Online
service suited to be adapted to any language and
any field. - Started 1.March 2001
19 Searching - Retrieving in the past age
Multiple costs for Providers
Inconvenience for the user!
20Activitities in e-Archives
- Universities / Univ. Libraries
- OPUS (Stuttgart ...)
- Eldorado (Dortmund)
- e-Lib (Osnabrück)
- MILESS (Essen)
- COPACABANA (Oldenburg) ...
- Regional Bibliographic Utility Systems
- PiCarta GBV Göttingen
- BSZ-Media Server Baden-Würtemberg
- DigiBib North-Rhine-Westfalia ...
- National Projects
- GlobalInfo (BMBF) Metadata rdf XML
- DissOnline (DFG) all fields
- Virtual Subject-based Library (DFG)
- Institution
- owned
- publications
- are mostly
- Dissertations
- Teaching
- old digitized Material
21A dicussion in the train
- Scientist
- Did not know what services we were deprived of
- Librarian
- Assumed to know what services are good for
science
The young Elsevier......
did ask the scientists What new services are
needed?
22Principles for Document Services for the Sciences
- Must be scalable 1 Bill. Docs in
Physics/a - Distributed data bases author
controlled - Free distribution exclusive authors
right - Worldwide accepted Metadata standards DC
- Free access to all research results ownership
- Comply with needs of scientists
- Competitive add-on services
- To serve what customers want, not what they ask
for.
23 PhysNet, a field specific service
www.eps.org/PhysNet
Headed by EPS
controlled by its Action Commmittee
on Publication and Scientific
Communication
24The Concept of PhysNet
- Crawl across all distributed Physics Departments
- Same Metadata as Math-Net IMU,
EPS - Distributed Gatherers locally
allow/deny !! - Distributed Brokers no nation to
dominate - Agreements for an unbiased distributed system
Charter - Distributed manpower at present 1 Mill.
/a - Serve all types of information
25To cope with about
Departments worldwide Physicists
Harvest gatherer
- 3000 distributed repositories
- their local documents and document lists
- numerous distrib. gatherers
- numerous brokers
- No central repository
SOIF DC
Harvest Broker
26- PhysDep
- Linklist Seachengine
- approved by National Societies
- businessmodel
- administrational inform.
- distributed gatherers 26
- search depth 2-full
- acceptance 500/day 400/day
PhysDoc publications distributed
gatherers 3 search depth special
27Present Status (April 2001)
- About 40 local, regional, national gatherers
- Brokers at US, DE, Russia, Hungary, France, UK,
DK, India, Japan, Australia, .., EPS
DFN-Project - 39.000 documents and document lists
- MyMetaMaker author tool to add DCmetadata with
Mathematics (IMU) and Physics (EPS). - Distributed physicists/institutes homepages
system with DCmetadata jointly with Math. - 30.000 page impressions per month ...
Online skim through
28A field specific professional service has to meet
the expectation of a quality serviceThe
service should not contain everything but only
material certified by physicists to be relevant
and good physics.
Thus we need certification levels. PhysNet has
but just one what is on Physics Departments
webservers
29 Scholarly Publishing, Vetting and Peer
Reviewing, Metadata, and Archivingin the past
age
Some e-prints free for the community!
A lot of work for the author!
DocumentMetadata
High prices for the library!
Exclusive rightsfor the publisher!
30What refereeing do we need ?
- Instant publishing
before refereeing - Time stamp for prime research before
refereeing - Archiving of relevant information
- Competitive parallel) refereeing
- Multilevel refereeing
- Full information published to be fair to referees
- Open refereeing signed Annotation instead of
advice - Voluntary refereeing to be a pleasure for
referees
31The role of University Libraries
- Be Oai-Database Provider of complete local
Information - Assure free full text access of all research
material - Assure correct metadata usage (by training or
adding) - Do handshake with National Archives
- Be Oai-Service Provider of complete local
information - Vetting system with the local department
scientists - Train users to pick from the multitude of
Oai-service Prov.
32Scenario for Tomorrow OAi Data and Service
Providers including Vetting to Peer Reviewing
DocumentMetadata
Multi-level Peer Rev.
Data Provider
Service Provider
33Vetting and reviewing at German Universities
- Cooperation of universities in North Germany
- (Hamburg, Oldenburg, Bremen, Kiel, Rostock,
Greifswald) - in evaluation of online teaching and research
- in usage and production of multimedia
- in e-publishing and establishing a joint
university press for e-publications - pilot project of Hamburg Oldenburg
- Local vetting with department scientists and
library - peer reviewing between different universities
- shared functions (work flow system, marketing
...) - separate functions (business model, financing
...) - Æ virtual university press of an open and growing
number of online and peer-reviewed university
presses
34 1. Discussion (workshops, meetings, ..)2.
Concept (free access, a multitude of data
providers and service providers but one
internationally to be accepted standard)3.
Software and workforce sharing.
The concept of the Open Archive Initiative OAi
35PhysNet, MareNet, PhysDis Math-Net comply right
from their beginning in 1995/6 the concept of the
Open Archive Initiative
36A success story Dissertations Online in Germany
- Workflow and Metadata from Author to Department,
Library, National Archive - All fields, all Universities
- One scheme for DC-Metadata
- Local Archives, national providing
- Formal rules for all.
Online skim sthrough
37- DissOnline.de Retrieval interface TheO
- http//www.iwi-iuk.org/dienste/TheO/
- using Dublin Core Set for Theses and Diss.
- Work of Bahne, Törner, Schwänzl, Plümer,..
38The role of Service Providers
39Scenario of Tomorrow Types of Searching
Retrieving offers
- Competition by
- quality of add-ons
- level of refereeing
- quality of contents
- specialization
- depth of search
- size
- comfort of retrieval
- level of integration
- local focus
- ...
40Implementing OAI at German Universities
- DINI (http//www.dini.org)
- German Initiative for Networked Information
- carries out guidance for implementations all over
Germany - develop a strategy to cover German universities
(libraries with document servers) - Aim
- Serving a distributed archive network
- Setting up a contact point for OAI in Germany
41PhysNet as Oai Data Provider and Server
42Any Oai Data Provider
Harvest gatherer
DC-converter
SQL DB MySQL
Any Oai Service Provider
OAI-Data Provider
OAi Broker
OAI-Harvester
43DataProvider Implementation 8. March 2001
Skim through
Service Provider Implementation 13. April 2001,
11.30 am VT-time
44Collections to be Represented in Oai-PhysDoc
- PhysDoc
- Distributed document Database for Physics
worldwide - using HARVEST as Retrieval mechanism
- University document servers
- North German Univ. superstructure
- DissOnline.org Physics part
- Physics part of NDLTD
- Arxiv, MIT, .. Physics part
45OAI _Identify
46OAI Implementation
- modified HARVEST holds SOIF and DC metadata in
local text files - storage size no problem
- decision to convert data offline and store
structured data in SQL database (mysql) - use DC when possible, otherwise map SOIF to DC
47OAI Implementation
documents
documents
documents
HARVEST
SQL DB
normalize metadata
OAI Server
48OAI Implementation
- software written in PHP
- protocol
- easy because it uses modified implementation of
HU Berlin - metadata converter
- maps SOIF to DC
- converts different DC representations to one
common one
49Future work
- improve metadata converter
- improve summarizers
- closer look at different DC representations
- tell people to use metadata
- OAI workshops
- ease production of metadata
50DC-Standards and Sets of OAi
- German National Library
- DC7
- TheO-Duisburg
- OCLC-NDLDT
- Math-Net Worldwide Int.Math.Union
- Phys-Net Worldwide EPS
Differences Html 2 , 4 XML, rdf
51Advantages of the new Scenario
Less work for the author!
Most e-prints free for the community!
Immediate publication!
Lower costs for the library!
Open multi-level peer reviewing!
Easy integration of metadata into existing
services!
Less printed journals but more accessible
e-publications!
Value-added servicesby different providers!
Non-exclusive rights for the publisher!
OAi Napster for the SciencesRichard Sietmann
in ct 6/2001, S. 78
52Departments worldwide Physicists
Harvest gatherer
DC-converter
SQL DB MySQL
SOIF DC
Harvest Broker
Marian Bypass
OAI-Data Provider
Marian for ranking
OAI-Harvester
OAi Broker
53National activities to support the OAi
- DINI German Initiative for Networked Information
- similar to CNI
- Cooperation between
- Research Libraries (DBV),
- Computer Centres (ZKI),
- Media Centres (AMH),
- Initiative of Learned Societies IuK
- DINIs Appeal to join the OAi (2000)
- Training camps for German Oai-Data Providers
54 Next steps I
- German National Library DDB set write MMM and
import (all fields) - VT ETD-Metadata write MMM and import
- Use set for import/export VT-ISN
Good for increasing No of documents Increase
acceptance Prove VT-ISN
55Next steps II
- Benefit from Research for added Intelligence at
VT for MARIAN broker - Branching queries to VT and import answers
- Install modules
- Go towards one joint broker
56Thanks for the invitation
- Virginia Tech group of E. Fox leading in
development of online digital library concepts
for any learned field and language - ISN complements in developing online services for
a specific field (Physics) and just one single
language (broken englisch).
57Next steps III
- MARIAN Bypasses for ISN-PhysDoc
- Intelligence for the Oai-Dataprovider
- Intelligence for the Oai-Serviceprovider
58Worked on add on services
Learning personalized search engine
User and his institute sever
query
DB of information link stars
Learning personalized browser engine
browsing
59VT-ISN steps IV
- Joint Representation of DFG-NSF project
- One Server, one address, one project, one crew ..
60Next steps V
- Share User statistics and Evaluation
What do Physicists want? (In a new
surrounding) Experiments and Evaluation Instead
of questionaires
61- Acceptance of a service comes by
- Bottom up just do it and spread the rumour
- Top down Charter of IMU, EPS
- To register Institutions, Departments, Graduate
Schools, Universities - 4. Joint international standards and cooperation
- 5. Work sharing (infinite workforce)
- 6. Professionalism
- let the scientists provide content
- Let the libraries, computer centers provide
service - Let the Administration levels find a way to
assure quality selection to the worldwide
services.