projects

You are here

Full name:Concentrated Cooperation on Research Data
Start date:2020.
Participants:
Coordinator:SZTAKI

The goal of the project is to establish a Hungarian research data repository supporting the COVID-19 pandemic related research initiatives.  The research data repository serves as a data store and repository solution both for international and national uses, and provides a definite (peer-to-peer) controlled reliable data sharing platform.

The system builds on the secure data storage and well defined access rights existing in the MTA Cloud services, which is jointly maintained by SZTAKI and Wigner research institutes. 

However the repository is generic and provides useful services to researchers in any fields, we favor the support of the COVID-19 pandemic related researchers, research groups and institutes.

The development of the repository is planned in two major steps. First, we are to achieve:

  • Internationally shared, publication related dataset store.
    The publication might contain exact references to the stored research data. The publication related data entities might be uploaded into the server one-by-one or in packs, where the system unpacks them and based on the data reference, stores these entities in an organized way.
  • International research data share of a given scientific community.
    The repository features to achieve: individual and burst upload of data entities, metadata settings via form inputs and burst uploads. The data entities are identifiable, supported their browsing, search, and the search result sets might be downloaded.
  • Long term international share of Open/Closed research data.
    The planned metadata storage solution complies with the requirements of the long-time digital preservation (OAIS).

The service established in the first step will be based on the Dataverse server with extensions and add-ons to support the Hungarian research needs. In the second step the focus will be on the raw data sharing and collaborative data creation, and data maintainance. The system gives the following services right from the beginning:

  • Data citation for datasets and files.
    EndNote XML, RIS Format, or BibTeX Format.
  • OAI-PMH (Harvesting).
    Gather and expose metadata from and to other systems.
  • Support for EduID login.
  • Schema.org JSON-LD.
    Used by Google Dataset Search and other services for discoverability.
  • Versioning.
    History of changes to datasets and files are preserved.
  • Faceted search.