dc.creator |
Jongejan, Bart |
dc.date.accessioned |
2018-06-12T12:04:41Z |
dc.date.available |
2018-06-12T12:04:41Z |
dc.date.issued |
2013 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12115/19 |
dc.description |
Binary wordlists for the CST lemmatizer as suplement to the rules of the lemmatizer. Works with both tagged and untagged input.
Use: cstlemma -d NAME-OF-WORDLIST |
dc.language.iso |
dan |
dc.publisher |
Centre for Language Technology, NorS, University of Copenhagen |
dc.rights |
CLARIN-ACA-NC |
dc.rights.uri |
https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&NORED=1 |
dc.rights.label |
ACA |
dc.subject |
lemmatizer |
dc.subject |
lemmatiser |
dc.title |
Dictionary for the CST Lemmatizer |
dc.type |
lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType |
wordList |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
has.files |
yes |
branding |
CLARIN-DK |
demo.uri |
https://cst.dk/online/lemmatiser/uk/index.html |
contact.person |
Administrator; CLARIN-DK; info@clarin.dk; Centre for Language Technology, NorS, University of Copenhagen |
size.info |
593,000; wordforms |
size.info |
112,000; lemmas |
files.size |
3082968 |
files.count |
3 |