GEI-Repository

The GEI’s digital services are currently designed according to the different standards and levels of indexing common in the various fields, and each has different search interfaces and database solutions. The GEI repository, which is currently being restructured, will integrate and standardise data management, long-term archiving and availability of data and digital services for international educational media research.

The GEI repository will contain the metadata, retro-digitised texts and full-text versions of historic educational media as well as other source material relevant to educational media research (curricula, specialist information systems, open source secondary literature etc.). This data is primarily compiled and curated by the staff in the GEI research library in cooperation with international partners.

One part of the GEI repository is the  institutional (and specialist) repository known as ‘edu.docs’, which makes open access publications written by GEI researchers (on the subject of educational media research) available as fully digitised texts. Specific interfaces also allow the texts to be accessed through other services (LeibnizOpen, DDB).

In the medium term the repository will store and provide access to research data from internal projects, and when possible also from external projects and individual researchers, providing that data is subject to open licenses – such as those in the Creative Commons family of licenses – and pertains to the GEI’s typical research fields. A contract is concluded with the transfer of the data. Please see the relevant conditions of use for licences related to services provided by the Institute.

Data formats

In order to guarantee the compatibility and interoperability of the resources provided, all data sets are provided with comprehensive descriptions. The metadata is available in a range of widely used formats. The data records are currently also being made available in the modular CMDI format used within the CLARIN initiative.

The DIFI staff are available to answer queries and offer support with regards to preparing resources and format conversion, if required they may be able to assist at the development stage of project applications.

Search tools and data harvesting

The metadata for the data held in several of the repositories is available under OAI-PMH compliant endpoints for automatic requests (harvesting). An aggregated overview of all resources and services available in the CLARIN network can be obtained, for example from the Virtual Language Observatory. Specialist and institutional open access publications from the GEI are available to search through at http://repository.gei.de/.

Quality control

Resources are incorporated into the repository following a quality control process that checks data and metadata. Expert opinions are obtained, where necessary, to assess the quality of content and material is edited and harmonised with the repository citation style.

When evaluating material we are guided by the DFG Guidelines for Safeguarding Good Scientific Practice, the European Code of Conduct for Research Integrity produced by ALLEA (All European Academies), the GEI Guidelines for Securing Good Academic Practice as well as the Best-Practice Guidelines issued by CLARIN-D in the CLARIN-D user guide.

Measures to assure the quality of the data produced form a substantial part of the repository workflow.

Data management
All data is provided with PIDs, PURL, handles and other identifiers in order to guarantee it can be referenced in the long-term. We are currently working on a system to incorporate different versions of data.