CLARIN-DK-UCPH Repository - About and Policies



Mission Statement


The mission of CLARIN-DK is to provide easy and sustainable access for scholars in the humanities and social sciences to digital language data (in written, spoken, video or multimodal form) and to provide advanced tools for discovering, exploring, exploiting, annotating, and analyzing them. CLARIN-DK also shares knowledge on Danish language technology and resources and is the Danish node in the European CLARIN-ERIC. The objective of the CLARIN Centre at the University of Copenhagen is to fulfill the CLARIN-DK mission.

Digital archiving and easy and sustainable digital access to data resources and tools gives the scholars the possibility of developing new research methods and addressing new types of research questions. This will facilitate and enhance participation in collaborative international research.

The CLARIN Centre at the University of Copenhagen supports research data management by providing researchers with a repository (the CLARIN-DK-UCPH Repository) for storage of research data, which are findable, accessible and easy to cite using persistent identifiers. The centre provides data management consultation and support in connection with depositing and reuse of research data.

The CLARIN Centre at the University of Copenhagen is supported and financed by the Faculty of Humanities and the Department of Nordic Studies and Linguistics at the University of Copenhagen until the end of 2024.

The centre promulgates the mission of CLARIN-DK through publications, conference attendance, organization of courses and workshops.

To learn more about CLARIN-DK visit info.clarin.dk


Terms of Service

To achieve our mission statement,we set out some ground rules through the Terms of Service. By accessing or using any kind of data or services provided by the Repository, you agree to abide by the Terms contained in the above mentioned document.

Data in CLARIN-DK-UCPH Repository are made available under the licence attached to the resources. In case there is no licence, data is made freely available for access, printing and download for the purposes of non-commercial research or private study. Users must acknowledge in any publication, the Deposited Work using a persistent identifier (see Citing Data), its original author(s)/creator(s), and any publisher where applicable. Full items must not be harvested by robots except transiently for full-text indexing or citation analysis. Full items must not be sold commercially unless explicitly granted by the attached licence without formal permission of the copyright holders.


About Repository

It is like a library for linguistic data and tools.

  • Search for data and tools and easily download them.
  • Deposit the data and be sure it is safely stored, everyone can find it, use it, and correctly cite it (giving you credit)

About CLARIN-DK

See CLARIN in Denmark.


License Agreement and Contracts

CLARIN-DK-UCPH Repository distinguishes two types of licenses.

  • For every deposit, the repository enter into a standard contract with the submitter, the so-called "Distribution License Agreement", in which the repository describes the rights and duties and the submitter acknowledges that they have the right to submit the data and gives the repository the permission to distribute the data on their behalf.
  • The users who download data from the repository are bound by the conditions of the end-user license assigned to the bitstreams. Restricted data requires all users be authenticated and electronically accept the end-user license in order to access the data. A list of available end-user licenses in our repository can be found here. For submitters, there is an additional possibility of requesting custom end-user licenses to be assigned to the bitstreams during the submission workflow.

Intellectual Property Rights

As mentioned in the section License Agreement and Contracts, we require the depositor of data or tools to sign a Distribution License Agreement, which specifies that they have the right to submit the data and gives us (the repository centre) right to distribute the data on their behalf. This means that depositors are solely responsible for taking care of IPR issues before publishing data or tools by submitting them to us.
Should anyone have a suspicion that any of the datasets or tools in our repository violate Intellectual Property Rights, they should contact us immediately at our Help Desk.


Privacy and Cookie Policy

Read our Privacy Policy in order to learn how we manage personal data collected by the CLARIN-DK-UCPH Repository. Cookie Policy details how cookies are used by the CLARIN-DK-UCPH Repository.


Metadata Policy

Deposited content must be accompanied by sufficient metadata describing its content, provenance and formats in order to support its preservation and dissemination. Metadata are freely accessible and are distributed in the public domain (under CC0). However, we reserve the right to be informed about commercial usage of metadata from CLARIN-DK-UCPH Repository including a description of your use case at Help Desk.


Preservation Policy

CLARIN-DK is committed to the long-term care of items deposited in our repository and strives to adopt the current best practice in digital preservation. The CLARIN-DK centre will ensure preservation and access to the data in this repository as long as sufficient funding is available. Until the end of 2024, the Faculty of Humanities at the University of Copenhagen has agreed to fund the CLARIN-DK centre. Efforts are made to extend the funding period. In the unlikely case of withdraw of funding, an agreement on preservation of and access to data has been made with the Department of Nordic Studies and Linguistics. The department offers a timeframe of at least 5 years of hosting of the CLARIN-DK Repository. In this period the department’s Centre for Language Technology as technical system owner will provide preservation of and access to the data. In the meanwhile, agreements will be made with the University of Copenhagen or with another CLARIN ERIC centre for a permanent solution.

The legal aspects of the process of relocating data to another institution is addressed by the deposition license, Deposition License Agreement. This agreement between the depositor and the repository gives all permissions required to meet responsibilities for preservation of the data. The transfer of custody of the data provided and permission to share the data on the depositor’s behalf is covered by the deposition license too. The repository has the rights to copy, transform, and store the items, as well as to provide access to them as long as it complies with the deposition license.

Technology: Our repository uses a repository system widely used at CLARIN data centres (the CLARIN DSpace repository system with well-documented modifications). It is currently running on virtual machines at the IT department of University of Copenhagen (UCPH IT), which has the capacity to keep them up and running, also while potential negotiations to take over the hosting of either the repository or all the data in the repository are completed. In case where the data are moved to another DSpace repository within CLARIN or another repository using a PID system, the PID’s will be kept unchanged and the transfer will not affect the validity of PID references already included in publications etc., and will therefore have minimal impact on user experience. Updates and upgrades of the repository system software and server software will be carried out continuously. In general, developments in technology are followed and changes relevant to the repository system are continuously discussed in the Standing Committee for CLARIN Technical Centres, SCCTC.

Data storage and backup: UCPH-IT is responsible for storage media handling and monitoring of server infrastructure and data partitions.

File formats: During the submission workflow of data, the use of open and well-documented data formats are encouraged allowing for easy conversion into other formats. Data in proprietary formats will only be accepted when conversion cannot easily be done.

Information security: The repository system logs any changes to data resources, and any changes in metadata. Furthermore, the changes can only be performed by a small group of people, that belong to the CLARIN-DK group. Automatic checks for data formats, links, and activity are run on a weekly basis and inspected by the repository responsible. Furthermore, the servers are under surveillance including monitoring the configured network services and disk usage, and intrusion prevention software is in use on the repository server to prevent security incidents from occurring. The servers are scanned and evaluated for security issues by the Danish Computer Security Incident Response Team (DKCERT). The repository administrators respond to any reported security-related issues or incidents as quickly as possible. These initiatives focus on keeping the data and metadata safe and secure.

Suggestions from users about changes in functionality are appreciated and encouraged, e.g. users can send mails to info@clarin.dk, and the staff in the helpdesk and knowledge sharing service are ready to enter into dialogue with users about their wishes for new or improved functionality, better help pages, metadata guidance or other changes in user requirements. The preservation plan is subject to change yearly based on future experience from depositors and with technology development.