dc.creator Asmussen, Jørg
dc.creator Halskov, Jakob
dc.date.accessioned 2018-09-24T14:34:44Z
dc.date.available 2018-09-24T14:34:44Z
dc.date.issued 2011
dc.identifier.uri http://hdl.handle.net/20.500.12115/36
dc.description The DK-CLARIN Reference Corpus of General Danish was collected as part of the DK-CLARIN project, WP2.1, 2008-2011. All texts are in XML TEIP5 format (TEIP5DKCLARIN-format), with tokenisation, ePOS-tagging, sentence and paragraph segmentation, and lemmatisation. The corpus comprises 45,113,245 words.
dc.language.iso dan
dc.publisher Society for Danish Language and Literature, DSL
dc.rights CLARIN-ACA-NC
dc.rights.uri https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1
dc.rights.label ACA
dc.source.uri https://korpus.dsl.dk/clarin/
dc.subject LGP - Language for General Purposes
dc.title DK-CLARIN Reference Corpus of General Danish
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN-DK
contact.person Jørg; Asmussen; korpus@dsl.dk; Society for Danish Language and Literature, DSL
sponsor n/a; n/a; DK-CLARIN; nationalFunds;
size.info 45113245; words
files.size 1559785675
files.count 11
annotationInfo.annotationType tokenization
annotationInfo.annotationType sentence and paragraph segmentation
annotationInfo.annotationType POS-tagging
annotationInfo.annotationType lemmatization


 Files in this item

This item is for Academic Use and licensed under: CLARIN-ACA-NC (Attribution Required, Noncommercial)
Name             Size       Format           Description    MD5
corpus_1.zip     264.92 MB  application/zip  Corpus part 1  eb69aaf4eadb93001460ee18c4c29c01
corpus_2.zip     46.91 MB   application/zip  Corpus part 2  33f0a8f887471a4e59e87e81a2ed1c65
corpus_3.zip     45.73 MB   application/zip  Corpus part 3  d79bb1b62c398ba7cc5eba6f132dabc5
corpus_4.zip     64.14 MB   application/zip  Corpus part 4  6bdfaa4f74272bef663fcd2a75c9a199
corpus_5.zip     240.2 MB   application/zip  Corpus part 5  e290efea5e36103deb3e171b23b5a724
corpus_6.zip     388.47 MB  application/zip  Corpus part 6  652cfdb96dc69e64c07b4f0adddc0480
corpus_7.zip     191.65 MB  application/zip  Corpus part 7  61987904256e03be8f569a5c4c78be49
corpus_8.zip     244.85 MB  application/zip  Corpus part 8  b7c159387697e9a84612079963e9e2c4
text-format.pdf  111.77 KB  PDF              Documentation  c4c4b5f1cd83ff232c44bc7692621da7
text-header.pdf  441.71 KB  PDF              Documentation  0944f72b62da0f047d40d9d38f6a51dd
pos-design.pdf   116.7 KB   PDF              Documentation  0983f68307488156e9207c8ce7ecbddb
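
The MD5 checksums listed for each file above can be used to check that a download arrived intact. The following is a minimal sketch in Python, not a tool provided by the repository: the helper names md5_of_file and verify_downloads and the idea of keeping all files in one local directory are assumptions made for illustration. Only the file names and digests are taken from this record.

```python
import hashlib
from pathlib import Path

# Expected MD5 digests, copied from the file listing of this record.
EXPECTED_MD5 = {
    "corpus_1.zip": "eb69aaf4eadb93001460ee18c4c29c01",
    "corpus_2.zip": "33f0a8f887471a4e59e87e81a2ed1c65",
    "corpus_3.zip": "d79bb1b62c398ba7cc5eba6f132dabc5",
    "corpus_4.zip": "6bdfaa4f74272bef663fcd2a75c9a199",
    "corpus_5.zip": "e290efea5e36103deb3e171b23b5a724",
    "corpus_6.zip": "652cfdb96dc69e64c07b4f0adddc0480",
    "corpus_7.zip": "61987904256e03be8f569a5c4c78be49",
    "corpus_8.zip": "b7c159387697e9a84612079963e9e2c4",
    "text-format.pdf": "c4c4b5f1cd83ff232c44bc7692621da7",
    "text-header.pdf": "0944f72b62da0f047d40d9d38f6a51dd",
    "pos-design.pdf": "0983f68307488156e9207c8ce7ecbddb",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Hypothetical helper: map each expected file name to 'ok', 'mismatch' or 'missing'."""
    results = {}
    for name, want in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == want:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling verify_downloads with the directory that holds the downloaded files returns one status per file. A 'mismatch' most likely indicates a truncated or corrupted download, in which case fetching the file again is the usual remedy.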
