
 
dc.creator Olsen, Sussi
dc.creator Braasch, Anna
dc.creator Halskov, Jakob
dc.creator Hansen, Dorte Haltrup
dc.date.accessioned 2018-06-08T09:45:51Z
dc.date.available 2018-06-08T09:45:51Z
dc.date.issued 2011
dc.identifier.uri http://hdl.handle.net/20.500.12115/14
dc.description The texts in the Health and Medicine domain come from netpatient.dk, Søfartsstyrelsen, Sundhedsstyrelsen, regionH, Libris and Aktuel Naturvidenskab, and were collected in the DK-CLARIN project, WP2.2, 2008-2011. The corpus consists of 3,972,573 words in 3,273 files. Communicative setting (number of files): expert->expert (27), expert->advanced (40), expert->basic (3,206). All texts are in XML TEI P5 format (TEIP5DKCLARIN-format), with tokenisation, sentence and paragraph segmentation, POS-tagging, lemmatisation and termhood annotation placed in separate text-external span groups. "DK-CLARIN LSP Corpus - Health and Medicine domain" is part of the Danish DK-CLARIN LSP corpus, which consists of seven sub-corpora from the following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology.
dc.language.iso dan
dc.publisher Centre for Language Technology, NorS, University of Copenhagen
dc.publisher The Danish Language Council
dc.rights CLARIN-ACA-NC
dc.rights.uri https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1
dc.rights.label ACA
dc.subject Health
dc.title DK-CLARIN LSP Corpus - Health domain
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN-DK
contact.person Administrator; CLARIN-DK; info@clarin.dk; Centre for Language Technology, NorS, University of Copenhagen
sponsor n/a; n/a; DK-CLARIN; nationalFunds;
size.info 3972573; words
size.info 3273; files
files.size 197536844
files.count 15
annotationInfo.annotationType tokenization
annotationInfo.annotationType sentence and paragraph segmentation
annotationInfo.annotationType POS-tagging
annotationInfo.annotationType lemmatization
annotationInfo.annotationType termhood scoring


 Files in this item

 Download all files in item (188.39 MB)
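The total shown above appears to use binary units: the files.size value of 197536844 bytes corresponds to 188.39 only when divided by 2^20. A minimal sketch of that conversion follows; the function names are illustrative and not part of the repository.

```python
# Value of the files.size field in this record, in bytes.
FILES_SIZE_BYTES = 197536844


def to_mebibytes(num_bytes):
    """Convert bytes to binary megabytes (MiB, 2**20 bytes each)."""
    return num_bytes / 2**20


def to_megabytes(num_bytes):
    """Convert bytes to decimal megabytes (MB, 10**6 bytes each)."""
    return num_bytes / 10**6


print(f"{to_mebibytes(FILES_SIZE_BYTES):.2f} MiB")  # 188.39 MiB, the figure in the listing
print(f"{to_megabytes(FILES_SIZE_BYTES):.2f} MB")   # 197.54 MB in decimal units
```

Allow for roughly 198 million bytes of free disk space when downloading the complete item.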
This item is for Academic Use and licensed under CLARIN-ACA-NC (Attribution Required, Noncommercial).
Name; Size; Format; Description; MD5
AktuelNaturvidenskab.zip; 3.22 MB; application/zip; Corpus; cfe2b468bb75de8bd5532e2896ef18c4
soefartsstyrelsen.zip; 1.07 MB; application/zip; Corpus; e49627827fee177e432161a1d59df3f8
SST.zip; 39.4 MB; application/zip; Corpus; a636edde53d1e826ba6b03423e45161c
regionH.zip; 12.46 MB; application/zip; Corpus; 5b5a956362c910eab028342f183f4f77
libris_sundhed.zip; 18.53 MB; application/zip; Corpus; f803f4ae113ca140a1ffd744e8bbb43b
sundhed_dk_3.zip; 41.55 MB; application/zip; Corpus; 6c5f9b5268c1b9e024c1c989fe6ef772
netpatient.zip; 28.33 MB; application/zip; Corpus; 5c6b54208afe230115018b69e5c820f0
sundhed_dk_2.zip; 42.78 MB; application/zip; Corpus; 54adcbe6ad4842bddfa8ff1683d11afd
README_health.txt; 2.96 KB; Text file; readme; b9828b2a71471d1b9b80b9b9c68d86f1
dkclarin-health-cmdi_textCorpus.xml; 17.99 KB; XML; CMDI metadata; e0df14f4559ba3fe72ad53d86445aaa8
teiHeader.xsd; 59.88 KB; XML; Documentation; 9fc5374ad34319278f437b963454f972
text-format.pdf; 111.77 KB; PDF; Documentation; c4c4b5f1cd83ff232c44bc7692621da7
DKCLARIN_fagsprogligt_korpus_dokumentation_2011.pdf; 361.81 KB; PDF; Documentation; e1752deaa6888e2f856811c8d933e655
text-header.pdf; 375.79 KB; PDF; Schema; 47825d0010a398bf10ce1564da2a15f0
textCorpusProfile.xsd; 142.26 KB; XML; Schema; 7d6b452b88175041133ea8020e453cd8
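The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch, assuming the files have been saved under their listed names in a local directory. The directory name and the helper functions are hypothetical, and only three of the fifteen checksums are copied in for brevity.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing in this record (subset for brevity).
EXPECTED_MD5 = {
    "AktuelNaturvidenskab.zip": "cfe2b468bb75de8bd5532e2896ef18c4",
    "soefartsstyrelsen.zip": "e49627827fee177e432161a1d59df3f8",
    "README_health.txt": "b9828b2a71471d1b9b80b9b9c68d86f1",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to True (match), False (mismatch) or None (missing)."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = None
        else:
            results[name] = md5_of_file(path) == checksum.lower()
    return results
```

Calling verify_downloads("downloads") with a hypothetical local folder name returns one entry per listed file. A False result suggests a corrupted or incomplete download, and that file should be fetched again.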
