Ashalatha Laxminarsaiah
Scientist/Engineer - SF
Indian Space Research Organisation Headquarters
Antariksh Bhavan, New BEL Road
Bangalore - 560 094 (INDIA)
E-Mail: asha@isro.gov.in
&
Iqbalahmad U Rajgoli
Senior Scientific Assistant, Library
Indian Space Research Organisation Headquarters
Antariksh Bhavan, New BEL Road
Bangalore - 560 094 (INDIA)
E-Mail: iqbal786@isro.gov.in
Abstract:
ISRO HQ., Institutional Repository was initiated in the year 2006 using Dspace open
source software. Dspace was selected looking at its features and international
acceptability. ISRO HQ., Library had the required technical infrastructure such as
Server to install Dspace and scanner to digitize the collection. ISRO
Scientists/Engineers as information creators, collectors, consumers and
communicators generated a huge amount of information in the form of research
papers/articles, lectures, speeches and internal technical reports. Hence, the ISRO
HQ., Institutional Repository basically contains the research papers/articles, lectures
and speeches by the present and former Chairmen apart from internal technical,
scientific and general reports generated by the Scientists/Engineers. The research
papers/articles dates back to 1940s and forms a very rare and significant collection
in this repository. The repository also contains Annual Reports, Journals and E-
books published by ISRO. The 35mm documentary films generated by ISRO have
been converted into DVD video format and made available in the repository. The
repository is accessible 24/7 to the Scientists/Engineers over Intranet working in
different Centers/Units in India. It is heavily used by the Scientists/Engineers with
average hit rate of 600 to 650/day. In this paper efforts have been made to discuss
how the technical and financial issues were tackled. How the intellectual output was
collected and made accessible. This paper will be a good reading for those who are
planning to create an Institutional Repository.
1. Introduction:
1
When the world entered the digital age, a great majority of human historical
records did not make a trip from print to digital form. Literatures, films,
scientific journals, newspapers, reports (scientific and technical), corporate
documents, individual lectures and speeches delivered by eminent
personalities over centuries, needed to be adopted for computer databases.
Again it has to be arranged along with newer, born-digital materials for quick
retrieval, when it is needed and keep finding it well organised into the future.
2. About DOS/ISRO:
In the history of Indian Space programme, there are many milestones like
building its own satellites and launch vehicles. On April 28, 2008, the Polar
Satellite Launch Vehicle rocket put ten satellites into orbit during a single
launch. Chandrayaan-1 moon mission was successfully launched on October
22, 2008. This brought India into the elite group of countries to send a moon
orbiter into space. Beyond Chandrayaan-1, ISRO is planning to send an
orbiter, a lander and a rover to Moon to study the chemical and
meteorological content of the soil.
2
The Library and information services at ISRO HQ is a specialized library
among other ISRO centres libraries regarding its collection and services
catering to the needs of more than 500 patrons most of them are scientists and
top level management people and other institutions. ISRO HQ, Bangalore,
internally generates enormous amount of information in the form of
lectures/speeches delivered by eminent personalities of ISRO, documentary
films, internal reports, reprints, preprints, conference proceedings, conference
papers and etc.
ISRO/DOS libraries are of different sizes in collection, subject and budget and
located at different places in the country. In ISRO/DOS there is no nodal
agency and have decentralized management of libraries. All libraries function
independently and have access to resources to each other's collection with
Intranet connectivity among centres and Inter Library Loan facility.
Internet has changed the way research is conducted and shared, primarily by
increasing the global reach of scholarly communication (Willinsky, 2006). The
open access movement has become an increasing visible prospect for digital
collections of scholarly communications (Carpenter, 2008). The important
reasons for pursuing open access are its ability to increase the circulation of
research and strengthen the scientific claims of articles and overall quality of
research literature (Willinsky, 2006). IRs provides organisations with an
opportunity to create a central location that collects and preserves their
intellectual output in the digital format. The opportunity to share and
distribute this intellectual output is hugely significant and would serve to
benefit the repositorys contributing authors and institution itself. Hence, IRs
have emerged as the tool for successfully promoting the open access
initiatives of an organisation.
3
There are many definitions of IR available in the vast literature. Following
section discusses few important among them.
IRs are the means of collecting and providing access to diverse, locally
produced digital materials (Bailey, Coombs and Emery, 2006). Where as
Donovan and Watson (2008) defined an IR as a means of collecting the
intellectual digital output of an organisation. Wikipedia has defined IR as an
online locus for collecting and preserving in digital the intellectual output of
an institution, particularly a research institution.
4
long-term accessibility. IRs are the visible manifestation of the emerging
importance of knowledge management within an institution. The long-term
impact of IRs is likely to change many of the basic assumptions about how
intellectual output is managed by individuals, their colleagues and the
institution; and how research itself is conducted.
IRs enhances the professional visibility of the individual and raises the
prestige of an institution.
IRs lower access barriers and offer the widest possible dissemination of
scholarly communication.
IRs helps in increasing the citation rates of a particular article by
providing open access.
Centralization and storage of all types of institutional output including
unpublished literature.
Storage and access to wide range of materials.
A central archive of a researchers work.
Benefits to researchers and their institutions in terms of prestige, prizes
and grant revenues.
IRs helps in wider circulation of grey literature like technical reports,
theses/dissertations and in-house publications which are not published
for wider circulation.
8. Technical Issues:
Singh, Sagolshem and Purnima Devi (2006) felt that there is no viable long
term strategy to ensure that digital information will be readable in the future.
Not only are digital documents vulnerable to loss via media decay and
obsolescence, but they become equally inaccessible and unreadable if the
software needed to interpret them or the hardware on which that software
runs is lost or become obsolete. Digital preservation can, therefore, be seen as
the set of processes and activities that ensure the continued access to
5
information and all kinds of records, scientific and cultural heritage of an
institution existing in digital formats. Therefore, depending on the criteria like
open standards, ubiquity, stability, metadata support, viability,
interoperability, authenticity, processability and presentation the following
file formats are selected for uploading records to the ISRO HQ., IR.
PDF
GIF
JPEG
TIFF
HTML
MHTML
VOB (for DVD-video media)
MPEG
Flash Videos
Intel Xeon x-series 335 processor (3.6 GHz) based IU rack mounted
server system with 1GB RAM and RAID storage system
Everyday incremental backup. Asset store is done NFS to storage box of
100 GB capacity. Sunday full backup on tapes
HP Scanjet 5590 flatbed scanner
Adobe Acrobat 7.0 professional for scanned image processing and
watermarking
ABBYY PDF Transform for converting documents to PDF format
Red Hat Linux 9.0
Java 1.5.0
Jakarta Tomcat 5.5
Maven 2.2.1
Apache ant 1.8.0
6
There are many open source softwares available for creating and maintaining
IRs. Software such as ARNO, CDSware, Dspace, Greenstone, EPrints, Fedora,
i-TOR and MyCoRe to name a few. While developing IR at ISRO HQ., Dspace
is selected. Dspace is advanced digital repository software crated as a joint
project of MIT libraries and the Hewlett-Packard Company. Dspace digital
repository software is freely available as open source software under the
terms of the BSD distribution license. A list of important feature of Dspace is
given below:
8.4 Manpower:
7
In an organisation different sections/divisions play important role in
successfully carrying out any project or initiative. These divisions/sections
may not be directly involved in implementing the project but even
constructive opinions, suggestions and guidance is very valuable. In
building the IR at ISRO HQ, library collaborated with group of persons
who have expertise to evolve policies related to computers and software for
implementing in DOS/ISRO and software engineering, support for mission
software validation, inter-centre and intra-centre connectivity through
computer network.
Engineers have helped in installing the Dspace software and its customization
of metadata form, software maintenance, configuration, and taking database
backup regularly are also being carried out.
9. Major Challenges:
8
Government Documents. Annual reports from the year 1972 to 2009 are made
available and can be accessed by the entire ISRO community. The 35mm
documentary films generated by ISRO have been converted into DVD video
format and made available in the repository. E-books available with the print
edition and those which have got permission to host on the Intranet are also
uploaded to the IR. There are also few E-books brought out by ISRO which
are also accommodated in the IR. Seminar and conferences which are video
captured are made available in the MPEG format. Space India an in-house
publication is also scanned and uploaded for reference. There are number of
scientific, technical and general reports brought out by ISRO HQ. These
reports are being scanned and uploaded in PDF format (Fig.3). An
hyperlinking facility has been given in the Library Management Software
Libsys using the multimedia linking facility for search and its retrieval which
gives additional access point to internal reports when users carry out search
using Web OPAC. The rare photographs of Former and Present Chairmen of
ISRO are collected and converted to a Flash Movie format and archived under
their respective collections.
The major challenge was to collect the research articles and lectures/speeches
of all the Chairmen i.e., Present and Former (Fig.4). Eminent personalities
such as Dr. Vikram Sarabhai, Prof. M G K Menon, Prof. Satish Dhawan, Prof.
U R Rao, Dr. K Kasturirangan and Dr. G Madhavan Nair (former chairman of
the organization) and Dr. K Radhakrishnan, the present Chairman of ISRO
have done tremendous research in the field of space science, remote sensing,
astronomy and astrophysics. It was challenging to collect the research output
both published and grey literature in the form of articles, technical reports,
lecture notes and lectures/speeches brought out by these eminent
personalities over a period of time. The research papers/articles have been
published in the reputed national and international peer reviewed journals,
conferences, seminars and symposia. People like Prof. U R Rao, Dr. K
Kasturirangan, Dr. G Madhavan Nair and Dr. K Radhakrishnan have
preserved their intellectual output in hard copy. Print copy of the same was
scanned, edited and uploaded in the PDF format. Some of the research
articles/papers and lectures/speeches were available in the MS Word format,
those are converted to PDF format using ABBYY PDF Transform software so
as to maintain the uniformity in file formats.
The intellectual output of Dr. Vikram Sarabhai, Prof. M G K Menon and Prof.
Satish Dhawan has been extensively published in the foreign journals dating
back to 1940s and 1950s. It was very difficult to collect the research articles
which have been published in the 1940s and 1950s and which are normally
not available in the electronic format. Moreover, many of the journals where
the articles were published are discontinued by the publishers. Some of the
intellectual output of these personalities is preserved in the ISRO HQ Library.
This collection is converted to electronic format and uploaded in their
9
respective Communities in the IR. Efforts were made to visit some of the
important libraries in Bangalore with whom these eminent personalities were
associated for their research. The articles available in the databases subscribed
by these libraries are downloaded and articles available in the bound volumes
of journals are photocopied and scanned at ISRO HQ Library. Each document
uploaded to the IR is watermarked as Archived by ISRO HQ. Library using
Adobe Acrobat 7.0 Professional software.
The ISRO HQ., IR also contains a special collection called Photo Gallery. The
rare photographs of all former Chairmen of ISRO are collected and a photo
gallery is developed using Adobe Flash CS3 Professional software (Fig.5).
There were only two library professionals when IR was initiated. In addition
to routine library duties, the building of IR and collection of the intellectual
output of the organization was carried out. In the beginning, everyday around
3 to 4 hours were invested in developing the IR. After few months of working
experience with DSpace, the time spent on working on IR has come down to 2
hours per day and sufficient to build the collection and monitoring the IR
effectively. In the beginning newspaper clippings were uploaded as HTML
files. The HTML file contains multiple files embedded and the image has to
be uploaded as a separate bit stream to the IR. To avoid this cumbersome job,
it was decided to upload the newspaper clippings in the MHTML web archive
format as single file. This decision helped in saving time in uploading the
items. Earlier the Dspace was installed in the stand alone PC and recently it
has been migrated to an independent IBM server with RAID facility, to access
the IR by 24/7 with good performance. This has considerably reduced the
time taken for uploading the collection to the IR.
10
pertaining to its policies and procedures from time to time. Since these
documents carry very less data which can be used for metadata, it was
felt that the Metadata form may be customized so that only the minimum
available data regarding the document can be accommodated. With the
help of computer experts the metadata form is customized for the
collections available under the Digital Library of Government Documents
Community (Fig.6). File formats such as MHTML, VOB and FLA (Flash
Videos) were not available in the Bitstream Format Registry are included.
Small modifications to the IR homepage are made by including the
emblem of ISRO and a combined photo of different sections of Library.
Right from the inception of IR at ISRO HQ, Library professionals are working
in collecting, scanning, converting and uploading the documents. There are
plans to move IR from the Intranet to the Internet to have wider accessibility
and publicity to the general public to this valuable treasure. So far the
repository contains only the intellectual output of the former and present
chairmen of ISRO and the technical reports generated at ISRO HQ. To widen
the scope of IR it was felt to accommodate the intellectual output of brilliant
and intelligent Scientists/Engineers working in the ISRO HQ by giving them
access to upload their works to IR (self-archive).
This can be done by educating the personnel in how to upload the documents
to Dspace and by monitoring the submissions for maintaining the metadata
standards. Other method could be by developing a submission form outside
of Dspace and asking the personnel to enter minimal metadata and uploading
the article/paper/report in PDF format. Rest of the editing part can be carried
out by the library staff i.e. entering the complete metadata form and archiving
to the IR.
The issue of copyright materials and the fact that authors who publish in
journals usually sign copyright transfer forms that transfer copyright from the
author to the publisher is challenging. Although publishers will allow
depositing pre-prints or even the final print, many authors are never really
aware of their rights and check what rights they have with regard to their
published papers. Authors rights and interpreting publishers copyright
policies in respect of IRs are current areas of deep discussion. RoMEO (Rights
MEtadata for Open Archiving), a UK project, produced an important survey
of publisher copyright policies and self-archiving in 2003 which has now been
converted to a database is actively managed. As of April 2011, RoMEO has a
database of around 960 publishers copyright and self-archiving policies.
Publisher policies are given as below:
11
No archiving allowed
Allow pre-refereed version only
Allow post-refereed version only
Allow pre- and post refereed versions
Allow publishers version
Allow all versions
Not specified
Library staff is in the process of getting the copyright clearance from the
publishers, so that there should not be any issues when the IR moves from
Intranet to Internet. Wherever it is not possible to get the copyright clearance
or the archiving policies are not exclusively defined, such articles/papers will
be restricted from viewing and downloading. But an option will be given to
the users to request a copy.
11. Usage:
12
13. Conclusion:
References:
13
convention PLANNER held at Mizoram University, Aizawl during
Nov.09-10, 2006, pp.166-172.
4. Crow, R: The case of institutional repositories: a SPARC position paper,
Scholarly Publishing and Academic Resources Coalition, Washington, DC,
2002.
5. Dabholkar, Rekha, Prabhakaran, R and Kurahatti, B T: Building
institutional repository: A TIFR initiative. Paper presented at the 6 th
convention PLANNER, held at Nagaland University, Nagaland during
Nov.06-07, 2008.
6. Donovan, James M and Watson, Carol A: White Paper: Behind a law
schools decision to implement an institutional repository. Articles,
Chapters and Online Publications. Paper 15, 2008.
7. Lynch, Clifford A: Institutional repositories: essential infrastructure for
scholarship in the digital age. ARL, No.226, Feb.2003, pp.1-7.
8. Mark Ware Consulting Ltd.: Pathfinder research on web-based
repository final report. Mark Ware Consulting Ltd, UK, Jan.2004
9. Moahi, Kgomotso H: Institutional repositories: towards harnessing
knowledge for African development. Paper presented at the First
International Conference on African Digital Libraries and Archives
(ICADLA-1), Addis Ababa, Ethiopia, Jul.01-03, 2009.
10. SHERPA/RoMEO Home page: http://www.sherpa.ac.uk/romeo/
11. Singh, Surachand Kh, Sagolshem, Memori and Purnima Devi, Th:
Perspective of digital preservation: need and strategies in the digital age.
Paper presented at the 4th Convention PLANNER, held at Mizoram
University, Aizawal during Nov.09-10, 2006, pp.281-285.
12. Wikipedia: www.wikipedia.com/en/
13. Willinsky, John: Why open access to research and scholarship? The
Journal of Neuroscience, Vol.26, No.36, Sept.06, 2006, pp.280-285.
14
4. Proceedings of the National Conference on Digitisation and Digital
Preservation held at DESIDOC, New Delhi during Dec.11-12, 2008.
5. Starkman, Abbie: Institutional repositories: benefits and challenges for
libraries. Open and Libraries Class Journal, Vol.1, No.1, 2008, pp.1-10.
15
Fig.1 Home page of ISRO HQ IR showing different Communities
16
Fig.3 Internal Reports Collection
17
Fig.5 Photo Gallery of Dr. Vikram Sarabhai
18
Author Bio-Data
19