| dc.creator |
Olsen, Sussi |
| dc.creator |
Braasch, Anna |
| dc.creator |
Hansen, Dorte Haltrup |
| dc.creator |
Halskov, Jakob |
| dc.date.accessioned |
2018-05-30T09:24:58Z |
| dc.date.available |
2018-05-30T09:24:58Z |
| dc.date.issued |
2011 |
| dc.identifier.uri |
http://hdl.handle.net/20.500.12115/11 |
| dc.description |
Texts in the Economics domain come from SKAT, Finanstilsynet and Erhvervs- og Selskabsstyrelsen and were collected in the DK-CLARIN project (WP2.2), 2008-2011.
The corpus consists of 979,881 words in 64 files.
Communicative setting/Number of files: expert->expert (11), expert->advanced (1), expert->basic (52).
All texts are in TEI P5 XML format (TEIP5DKCLARIN-format), with tokenisation, POS tagging, sentence and paragraph segmentation, lemmatisation and termhood annotation placed in separate text-external span groups.
"DK-CLARIN LSP Corpus - Economics domain" is part of the Danish DK-CLARIN LSP corpus, which consists of seven sub-corpora from the following subject domains: Agriculture, Construction, Economics, Environment, Health, IT and Nanotechnology. |
| dc.language.iso |
dan |
| dc.publisher |
Centre for Language Technology, NorS, University of Copenhagen |
| dc.publisher |
The Danish Language Council |
| dc.rights |
CLARIN-ACA-NC |
| dc.rights.uri |
https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1 |
| dc.rights.label |
ACA |
| dc.subject |
Economics |
| dc.title |
DK-CLARIN LSP Corpus - Economics domain |
| dc.type |
corpus |
| metashare.ResourceInfo#ContentInfo.mediaType |
text |
| has.files |
yes |
| branding |
CLARIN-DK |
| contact.person |
Administrator; CLARIN-DK; info@clarin.dk; Centre for Language Technology, NorS, University of Copenhagen |
| sponsor |
n/a; n/a; DK-CLARIN; nationalFunds; |
| size.info |
979,881; words |
| size.info |
64; files |
| files.size |
44350375 |
| files.count |
11 |
| annotationInfo.annotationType |
tokenization |
| annotationInfo.annotationType |
lemmatization |
| annotationInfo.annotationType |
sentence and paragraph segmentation |
| annotationInfo.annotationType |
POS-tagging |
| annotationInfo.annotationType |
termhood scoring |