Research data services
Library services and resources are continually reviewed to ensure that they meet your needs as a researcher. If you have further suggestions on how we can help meet your research needs, please contact Rebecca Parker, Research Services Librarian.
In 2011, Swinburne Library and Swinburne Research worked on a joint project funded by the Australian National Data Service (ANDS). ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program and the Education Investment Fund (EIF) Super Science Initiative. This project aims to develop the infrastructure at Swinburne to help support you in managing your research-related data.
- What is this project?
- How can the Library help me with managing and promoting my research data?
- What else do I need to know about research data?
- Where can I go to find other people's research data?
- Where can I go for more information?
- Contact us
What is this project?
What is research data?
For purposes of this project, and for developing data services for researchers at Swinburne, we’ve adopted a comprehensive definition of research data (derived from several definitions of data that have been compiled by ANDS, here). Research data can be any of the inputs to—and results from—research and scholarly inquiry. Data can be facts or observations that support a theory or test a hypothesis, and may arise from experiments, observations, or computations. These can be raw data or the product of further analysis. Data may be text, numbers, images, sounds, artefacts, specimens or samples, and may exist in digital or nondigital formats. Provenance information about the data—how, when and where it was collected, and with what type of instrument—may be an essential component for understanding the data. The software code or model used to generate or analyse the data may also comprise a complete description of the data.
Under the Australian Code of Conduct for Responsible Research, there is some expectation that research data will be retained beyond the life of a project to ensure the integrity of the research. It is also becoming more common for major publishers to expect to view or even publish the data associated with a journal article. Nature is at the vanguard of this movement (here is one example; follow the link to "supplementary information"). Many research disciplines offer a different model for storing data and making it available to other researchers. In the area of earth and environmental sciences, for example, PANGAEA provides a venue for archiving, publishing and distributing georeferenced data from earth system research. Here is a sample record from PANGAEA of a dataset that is supplemental to a journal article published elsewhere, with persistent links between the article and the supplemental data.
A 2009 editorial in Nature ("Data's shameful neglect") highlights the importance of promoting openness and sharing of research data. More recently, a series of articles from the 11 Feb 2011 issue of Science was devoted to the challenges and opportunities of confronting the data deluge ("Dealing with data"). A 2011 article in PLoS ONE, "Data Sharing by Scientists: Practices and Perceptions" describes findings from an international survey of data practices. A new online open-access journal, GigaScience 'aims to revolutionize data dissemination, organization, understanding, and use' by employing a publication format 'that links standard manuscript publication with an extensive database that hosts all associated data and provides data analysis tools and cloud-computing resources.' More information about this ambitious project is available from the GigaBlog.
What is ANDS?
The Australian National Data Service (ANDS) is a DIISR-funded initiative created to establish best practice in issues of data management and curation, including research data ownership and the roles and responsibilities associated with ownership; access to research data collected and maintained with public funding; and the curation of experimental, research and published data. Many Australian universities have been provided with ANDS funding to assist them in identifying and describing local research datasets. In March 2011, The Australian published a brief article about Swinburne's ANDS-funded research data management project.
How can the Library help me with managing and promoting my research data?
How do I make my research data discoverable?
The Library expects to make records for Swinburne-created research datasets available alongside your publication records in Swinburne Research Bank. This will provide you with a citable profile for your research data and publications, and records will also be indexed by Research Data Australia, a national database of research datasets from Australian universities and research institutes. This website is expected to be a valuable future resource for researchers to assist them in discovering more about Australian research data, and in building networks between researchers, projects, data and publications. For more information, see the ANDS Data Connections Strategy (PDF).
How do I create a data management plan?
In support of Swinburne's forthcoming data management policy document, Management of research data and primary materials (to be finalised in 2011), the Library has developed some preliminary resources to assist you with data management planning. We would appreciate your feedback on these resources, which continue to be a work in progress.
-
Swinburne research data management checklist
This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
This checklist takes you through the components of an effective data management plan. If you address each item accurately, your answers will form the basis of a data management plan for your research project. Upon successful completion of the checklist, you should keep a copy along with other research project documentation.
We acknowledge prior work done at Monash, QUT and the University of Melbourne in creating data management checklists, and we appreciate their willingness to allow us to draw from their work in preparing Swinburne's checklist.
If you would like more help, there is also a range of comprehensive data management plan development tools and checklists available from other organisations and universities:
- ANDS: data management for researchers guide
- Monash University: Research data management
- ANU offers a course in data management planning:
- some information related to this course is freely available, including a 50-page data management manual (PDF)
- The UK Data Archive: Create and Manage Data; includes Managing and Sharing Data: a best practices guide for researchers
- Cambridge University Library: Support for managing research data
- Digital Curation Centre: DMP Online
- ICPSR (Inter-University Consortium for Political and Social Research: Guidelines for effective data management plans
- JISC (UK): Managing research data
- DataONE: data management plans, including a data management plan outline and a data management plan example
- QUT research data management, including a data management checklist
- University of Melbourne: Research data management for researchers, includes a research data management plan template, and example plans
- MIT Libraries: Manage your data and Data management plans
- University of Minnesota: Managing your data
- University of Wisconsin: Research data services: includes guidance on writing a data plan, managing data and sharing data.
What else do I need to know about research data?
What data should I keep?
Not all research data can, or should, be saved. But it can be difficult to discern what data has potential future value, not only for your own research program or for other researchers within your discipline, but also for researchers in other disciplines who may discover new uses for your data.
- Digital Curation Centre: How to appraise and select research data for curation
- SURF Foundation (Netherlands):
- General guidelines for selecting data for preservation (1 page summary [PDF])
- Selection of research data: guidelines for appraising and selecting research data (Detailed report, 49 pages [PDF])
- Griffith University's Schedule of Retention Periods for Research Data and Primary Materials (includes a discussion of the distinction between research data and primary materials).
What about copyright?
There are a number of issues concerning intellectual property and copyright in research data. ANDS provides some information in the guide Copyright and data. For more detailed information, please contact the Swinburne Copyright office.
Where can I store my data?
There are already a number of national storage sites and disciplinary archives for storing research data, including the Australian Research Collaboration Scheme (ARCS)'s Data Fabric. More information will be available here when Swinburne's data management policy is finalised, but we would also like to know what would be useful to you in assisting with the management of your research-related data.
Where can I go to find other people's research data?
Where can I find government data?
Government data is often freely available online. Here is a selection of useful resources:
- Australian Bureau of Statistics
- Data.Australia.gov (beta version): public information datasets from the Australian Government
- Data.gov: provides access to more than 300,000 data sets from the US federal executive branch
- Data.gov.uk (beta version): more than 5400 datasets currently available from all central government departments and a number of public sector bodies and local authorities
- Victorian Government Data Directory: a portal to datasets and data tools that exist on other government websites
- SA-NT Data Link: an interesting undertaking that links de-identified data from participants in studies that span economic, public health, education, and other policy-related studies in South Australia and the Northern Territory. Registration is required for access to the data, which may also involve a fee.
- List of European Open Data Catalogues: compiled by the Open Knowledge Foundation
- National Statistical Offices Websites: maintained by the UN Statistics Division, a comprehensive list of worldwide counterparts to the ABS
Where can I find data from NGOs and international agencies?
Non-governmental organizations and international agencies also offer free online access to their data. Here is small sample that may be useful to researchers:
- United Nations Statistics Division
- The UN Statistics Division also includes links to several types of international statistics agencies, including UN statistics programmes such as the Conference on Trade and Development (UNCTAD); specialised UN agencies' statistics programmes such the UNESCO Institute for Statistics (internationally comparable statistics on education, science and technology, culture, and communication); and other autonomous agencies' statistics programmes, including the International Energy Agency the International Monetary Fund (IMF), and Eurostat (from the statistics office of the European Union).
Where can I find data (and deposit data) relevant to my research discipline?
Some disciplines already have a practice of sharing data with their colleagues. Here is a small selection from the wide range of online discipline-specific and multidisciplinary resources, many of which allow researchers to deposit their own data:
- Astronomical data: Canadian Astronomy Data Centre
- Astronomical data: National Virtual Observatory
- Biosciences data: DRYAD: 'an international repository of data underlying peer-reviewed articles in the basic and applied biosciences'
- Earth science data: PANGAEA: publishing network for geoscientific and environmental data
- Econometrics data: Journal of Applied Econometrics Data Archive
- Economics data: National Bureau of Economic Research
- Economics data: Federal Reserve Bank of Philadelphia: Real-time data set for macroeconomists
- Multidisciplinary data: DataCite.org has compiled a comprehensive list of discipline-specific repositories, with annotations including subject area(s), restrictions (if any) against data deposit and data access, and notes about licencing.
- Multidisciplinary data: The Dataverse Network Project: an interesting open source project for publishing, citing and discovering research data. To search for datasets, follow the link to the IQSS Dataverse Network, the world's largest collection of social science research data, hosted at Harvard University's Institute for Quantitative Social Science
- Multidiscplinary data: Pew Research Center: all survey results are made available after the data has been mined for Pew reports (usually around 2 years after the survey first went out)
- Multidisciplinary data: UK Data Archive: promoted as the UK's 'largest collection of digital research data in the social sciences and humanities'
- Multidisciplinary data: Open Access Directory (Simmons College): Data repositories, a list of repositories for open data, grouped into 14 academic disciplines including astronomy, biology, chemistry, geosciences and linguistics
- Multidisciplinary data: the Digging Into Data Challenge seeks new ways of applying computationally based research methods onto large datasets. Included at the website is an ever-expanding and well-annotated list of data repositories that can be used for this challenge.
- Religion data: Association of Religion Data Archives
- Social Sciences data: Australian Data Archive (ADA)
- Social Sciences data: Inter-university Consortium for Political and Social Research (ICPSR)
- Visual arts data: VADS: the online resource for visual arts
For a more comprehensive list of repositories and databases for open data, see this Open Access Directory.
Where can I find research data produced by other universities?
Many universities have already created public-use archives for storing and sharing data that have been generated by their researchers. Here is a selection of these:
- University of Colorado at Boulder: National Snow and Ice Data Center
- University of Edinburgh: Edinburgh DataShare
- Harvard University: Henry A. Murray Research Archive
- University of Melbourne: The Household, Income and Labour Dynamics in Australia (HILDA) Survey: a household-based panel study begun in 2001
- University of Melbourne Archives holds the results of Prest's Social Survey of Melbourne, an extensive study of housing conditions in Melbourne undertaken from 1941 to 1943. A description of this survey is available here.
- Princeton University: Office of Population Research (OPR) Data Archive
- University of Southampton: eCrystals: 'all the underlying data generated during a structure determination from a single crystal x-ray diffraction experiment'
- University of Southern California: Southern California Earthquake Center
- University of York: Archaeology Data Service
Where can I go for more information?
Who else is working on this?
Many Australian universities have been funded by ANDS to develop research data management tools. If you'd like to see what others are doing, here are some research data projects at other institutions (Australian and elsewhere):
- Monash University: Research data management
- Supporting Data Management Infrastructure for the Humanities (SUDAMIH): Project outputs:
- scroll down to Bibliographies and Literature Review for a data management bibliography and personal information management literature review
- University of Southampton: Institutional Data Management Blueprint (IDMB) project.
What data processing tools are available to me?
The Australian Research Collaboration Scheme (ARCS) offers a suite of data services, including data storage; data sharing among collaborators, with controlled access to data; and data transfer. An extensive list of data visualisation tools is posted at the Digital Research Tools (DiRT) wiki. Additional data visualisation resources are also listed at the University of Wisconsin's research data services site. For a web-based data cleanser and analyser, you can try Needlebase. A review of Needlebase is available from ReadWriteWeb.
Contact us
There are still a number of issues around research data that we need to think about, such as copyright, ownership of intellectual property, confidentiality, ethics, data storage, and more. We would really appreciate your assistance with this. If you would like to help, please contact us on researchdata@swin.edu.au, or directly:
Rebecca Parker |
