AND OPPORTUNITIES
Mathew Wyatt
Australian National Data Service
iVEC, The hub of advanced computing
26 Dick Perry Avenue, Kensington WA 6151
08 6436 8545
mat@ivec.org
Luke Edwards
eMII - Integrated Marine Observing System
iVEC, The hub of advanced computing
26 Dick Perry Avenue, Kensington WA 6151
08 6436 8958
luke@ivec.org
Mervyn Lynch
Terrestrial Ecosystem Research Network
Curtin University
Kent Street, Bentley WA 6102
08 9266 7540
m.lynch@curtin.edu.au
Abstract
Data-driven research is on the rise, which creates a vast amount of opportunity
for innovation in science, but also introduces a number of problems with regard
to information access, maintainability and discovery. Governments in Australia
and worldwide are supporting a cultural and practical change to a world of open
data, transparent government, e-democracy and eResearch. There have been
several national reports (AAS/ATSE, 2009) that have reflected on aspects of
data and information management in the earth observations (EO) area and a
number of recommendations put forward on how to address some key issues to
ensure maximum public benefit from earth observation technology and the
information it delivers. There are opportunities for remote sensing professionals
to engage with some of the national initiatives occurring at the moment in
Australia, in the context of how remote sensing can fit into the national data
and information strategy for Australia and the international community.
Initiatives of relevance include ANDS (Australian National Data Service), IMOS
(Integrated Marine Observing System) and TERN (Terrestrial Ecosystem
Research Network). With these initiatives being in their infancy, there is an
opportunity as a remote sensing and scientific community, to influence
technical and infrastructure decisions relating to data and metadata services to
ensure the most effective outcomes for the future.
1
Introduction
The world is awash in digital data, and the management, organisation, access
and preservation of this digital data is critical to the success of the knowledge
economy and society as a whole (Berman, 2008).
Though the burden and cost of managing research data outputs falls on the
shoulders of the researchers themselves. Finding, organizing, documenting,
formatting, intellectual property are yet another addition to researchers already
busy workloads. It is not surprising that a survey carried out by the Finnish
Social Science Data Archive (FSD) found that only 12% of respondents
archived their data in an institutional repository with adequate metadata (Kuula
and Borg, 2008).
In this paper we offer some of the benefits to open access data, provide
examples of government initiatives relevant to the remote sensing community
which intend to relieve the researcher of data deluge burden, and detail some
of the technologies being used to support this.
Open access to data can also enable innovative research beyond initial
observers intentions. This has been demonstrated in the bibliometric analysis
of the Hubble Space Telescope (HST) publication statistics (Lagerstrom, 2010).
In this study, the author shows that over time publications on archived HST
data eclipse the publications from authors of the original study – studies of
which enabled the original data collection.
Probably the most measurable benefit of open access to data is its affect on
increased citation rates. Following the citation history of 85 micro array trails
2
from 1999-2003, Piwowar et al. (2007) discovered that the 48% of trials which
had open data to support the publication received 85% of the citations.
Similarly in the US, the Obama administration has introduced a culture of open
government with the Open Government Initiative
(http://www.whitehouse.gov/open). The flow on affect from this has encouraged
the National Science Foundation to enforce a two-page data management and
sharing plan to be submitted with each research grant submission (Mervis,
2010).
In Australia, when applying for a research grant with the ARC or NMHRC there
is no such requirement for a data management plan or data sharing plan.
Though, the NHMRC has published the Australian Code for the Responsible
Conduct of Research (NHMRC, 2007). In which, they detail recommendations
for the ongoing management and archival of data created out of publicly funded
research, the ownership around the data, and the support of appropriate
access to data.
These trends have also given rise to a new profession – the “data scientist”. In
2005, the National Science Board of the National Science Foundation
published Long-Lived Digital Data Collections: Enabling Research and
Education in the 21st Century (NSBA, 2005), which began a dialogue about the
importance of data preservation, and introduced the issue of the care and
feeding of an emerging group they identified as “data scientists”:
3
There are a number of initiatives occurring at a national scale that are relevant
to the remote sensing community.
Using community agreed standards for metadata and data products, and
deploying standard data delivery services with API’s (Application Programming
Interfaces) is at the upmost importance for ensuring ease of uptake, usefulness
to potential users, and enabling innovative applications to be developed. It is
for this reason, that the spatial data focused NCRIS (National Collaborative
Research Infrastructure Strategy) initiatives have led by example in deploying
data delivery services according to the Open Geospatial Consortium (OGC),
Data Access Protocol (DAP), and ANZLIC/ISO19115 Geographic information
standards.
4
Services (WFS) for the delivery of geospatial information which is not a
coverage.
Conclusion
In this article we have detailed the benefit of open access to data, and that
open data is not just beneficial to the researcher of the original study, but also
beneficial to future scientific endeavours. We have shown that cultural and
practical shift to open data in science is being supported by governments
locally and internationally, and that the burden does not have to lie on the
shoulders of researchers. We have also outlined some technologies relevant
to the remote sensing community that are being utilised by national initiatives,
and that early engagement with the remote sensing community will encourage
the correct technological decisions to ensure future prosperity.
References
AAS/ATSE, 2009, An Australian Strategic Plan for Earth Observations from
Space (Australian Academy of Science/Australian Academy for Technological
Sciences and Engineering).
BERMAN, F., 2008, Got Data? A Guide to Data Preservation in the Information
Age. Communications of the ACM 51, 12: 50-56.
http://doi.acm.org/10.1145/1409360.1409376
KUULA, A., and BORG, S., 2008, Open Access to and Reuse of Research
Data – The State of the Art in Finland, (Finnish Social Science Data Archive).
LAGERSTROM, J. 2010, This is a paper presented at a conference. In World
Library And Information Congress: 76th IFLA General Conference And
5
Assembly, 10-15 August 2010, Gothenburg, Sweden.
SHARAV, V.H., 2006, Medical Journal Editor Finds Truth Hard to Track Down.
Available online at: http://www.ahrp.org/cms/content/view/16/27/ (accessed
August 2010).
PIWOWAR, H.A., and DAY R.S., FRIDSMA, D.B., 2007, Sharing Detailed
Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3):
e308. doi:10.1371/journal.pone.0000308
TANNER, L., 2010, Declaration of Open Government. Available online at:
http://agimo.govspace.gov.au/2010/07/16/declaration-of-open-government/
(accessed August 2010).
MERVIS, J., 2010, NSF to Ask Every Grant Applicant for Data Management
Plan. Available online at:
http://news.sciencemag.org/scienceinsider/2010/05/nsf-to-ask-every-grant-
applicant.html (accessed August 2010).
NHMRC, 2007, Australian Code for the Responsible Conduct of Research
(National Health and Medical Research Council).
NSBA, 2005, Long-Lived Digital Data Collections: Enabling Research and
Education in the 21st Century (National Science Board).
BELL, B., 2009, The Fourth Paradigm - Data-Intensive Scientific Discovery.
Tony Hey, Stewart Tansley, and Kristin Tolle (Ed.), pp. xii (Microsoft
Research Redmond, Washington)
ISO, 2003, ISO 19115 Geographic Information – Metadata (International
Organization for Standardisation).
ANZLIC, 2007, ANZLIC Metadata Profile (ANZLIC the Spatial Information
Council).