dc.creator | Hollenstein, Nora |
dc.creator | Björnsdóttir, Marina |
dc.creator | Barrett, Maria |
dc.date.accessioned | 2022-12-06T15:57:27Z |
dc.date.available | 2022-12-06T15:57:27Z |
dc.date.issued | 2022 |
dc.identifier.uri | http://hdl.handle.net/20.500.12115/48 |
dc.description | CopCo is an eye-tracking corpus tailored to both psycholinguistics and natural language processing. The goal is to investigate how various populations read Danish texts. To this end, we record the eye movements of participants reading continuous Danish texts at their own speed. The CopCo corpus contains eye movement data from native speakers of Danish, both readers without dyslexia and readers with dyslexia. Additionally, there is a set of non-native speakers. The data comprise one CSV file per participant with the computed eye-tracking metrics for each word. Each file contains the text read by the participant, with one line per word. In addition to the words, word IDs, sentence IDs and text IDs, the files contain the following eye-tracking features for each word: landing position, first fixation duration, first pass duration, go-past time, mean fixation duration, total fixation duration, number of fixations, mean saccade duration and peak saccade velocity. This project has been approved by the Ethics Commission of the Faculty of Humanities of the University of Copenhagen. |
dc.language.iso | dan |
dc.publisher | Centre for Language Technology, NorS, University of Copenhagen |
dc.relation.isreferencedby | https://aclanthology.org/2022.lrec-1.182/ |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://osf.io/ud8s5/ |
dc.subject | reading |
dc.subject | eye-tracking |
dc.subject | Danish |
dc.subject | multimodal corpora |
dc.subject | psycholinguistics |
dc.title | CopCo: The Copenhagen Corpus of Eye-Tracking Recordings from Natural Reading |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN-DK |
demo.uri | https://github.com/norahollenstein/copco-processing |
contact.person | Nora; Hollenstein; nora.hollenstein@hum.ku.dk; Centre for Language Technology, NorS, University of Copenhagen |
sponsor | University of Copenhagen; N/A; N/A; Other; |
sponsor | IT University Copenhagen; N/A; N/A; Other; |
size.info | 32; texts |
size.info | 36888; tokens |
size.info | 1943; sentences |
files.size | 103939926 |
files.count | 58 |
annotationInfo.annotationType | eye-tracking recordings |
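The description above says each participant file holds one line per word, with sentence IDs and per-word features such as total fixation duration. A minimal sketch of how such a file could be aggregated per sentence follows. The column names `sentenceId` and `total_fixation_duration`, the comma delimiter, and the sample rows are assumptions made for illustration only; the actual header names are documented in the OSF wiki and should be checked against a real file.

```python
import csv
import io
from collections import defaultdict

# Hypothetical column names; check the header of a real file and the OSF wiki.
SENTENCE_COL = "sentenceId"
DURATION_COL = "total_fixation_duration"


def total_duration_per_sentence(handle, sentence_col=SENTENCE_COL,
                                duration_col=DURATION_COL):
    """Sum a per-word duration column for each sentence ID in one file."""
    totals = defaultdict(float)
    for row in csv.DictReader(handle):
        value = row[duration_col].strip()
        if not value:
            # Assumption: an empty cell means the word received no fixation.
            continue
        totals[row[sentence_col]] += float(value)
    return dict(totals)


# Made-up rows that only imitate the one-line-per-word layout.
sample = io.StringIO(
    "word,wordId,sentenceId,total_fixation_duration\n"
    "Det,0,0,180\n"
    "er,1,0,\n"
    "fint,2,0,220\n"
    "Ja,0,1,150\n"
)
totals = total_duration_per_sentence(sample)
print(totals)
```

For a downloaded file, the same function could be called on a handle such as `open("P01.csv", newline="", encoding="utf-8")`, where the UTF-8 encoding is likewise an assumption.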
Files in this item
Total size of all files in item: 99.12 MB
This item is publicly available and licensed under Creative Commons - Attribution 4.0 International (CC BY 4.0).
- Name
- P04.csv
- Size
- 1.08 MB
- Format
- Unknown
- Description
- Extracted features P04
- MD5
- 224ce3775650a2c3512d77b1b9d3cfb2
- Name
- P03.csv
- Size
- 1.79 MB
- Format
- Unknown
- Description
- Extracted features P03
- MD5
- dbd87f147948a49d639f95c99e58c01b
- Name
- P01.csv
- Size
- 2.21 MB
- Format
- Unknown
- Description
- Extracted features P01
- MD5
- af46dc258c20432afe7a489413ae73fe
- Name
- P02.csv
- Size
- 2.33 MB
- Format
- Unknown
- Description
- Extracted features P02
- MD5
- 8568a6517e9067ad5862657a42f996c0
- Name
- P07.csv
- Size
- 2.17 MB
- Format
- Unknown
- Description
- Extracted features P07
- MD5
- 06e0a14eeee185223ff904096a4e3584
- Name
- P06.csv
- Size
- 1.85 MB
- Format
- Unknown
- Description
- Extracted features P06
- MD5
- fe1f336dcdc6a584c6ec1a7114b0256e
- Name
- P10.csv
- Size
- 2.3 MB
- Format
- Unknown
- Description
- Extracted features P10
- MD5
- 48985b962520a5af41158231e05918a8
- Name
- P09.csv
- Size
- 720.47 KB
- Format
- Unknown
- Description
- Extracted features P09
- MD5
- 1e48d38f0bc65a017e6982d49ce332e1
- Name
- P05.csv
- Size
- 2.56 MB
- Format
- Unknown
- Description
- Extracted features P05
- MD5
- df900ab091f629c22b4d284d915c4fd0
- Name
- P08.csv
- Size
- 861.14 KB
- Format
- Unknown
- Description
- Extracted features P08
- MD5
- c058ed1d1c9f945e78a488d0f23c3487
- Name
- P16.csv
- Size
- 2.15 MB
- Format
- Unknown
- Description
- Extracted features P16
- MD5
- 85b739cb404aecf4882e7a40d6f99d8e
- Name
- P15.csv
- Size
- 2.2 MB
- Format
- Unknown
- Description
- Extracted features P15
- MD5
- 54f05949141f58e03a342080b666849c
- Name
- P11.csv
- Size
- 1.95 MB
- Format
- Unknown
- Description
- Extracted features P11
- MD5
- 94edd9bad177352e4066b87400037c84
- Name
- P13.csv
- Size
- 1.47 MB
- Format
- Unknown
- Description
- Extracted features P13
- MD5
- 10ce42ea4a81b875efceff3e53a0ef22
- Name
- P21.csv
- Size
- 2.37 MB
- Format
- Unknown
- Description
- Extracted features P21
- MD5
- cc25230eceec118acfa6664f2999b040
- Name
- P19.csv
- Size
- 1.37 MB
- Format
- Unknown
- Description
- Extracted features P19
- MD5
- 87b7f0ae4c058de967492d6d6fd8fbe1
- Name
- P17.csv
- Size
- 1.17 MB
- Format
- Unknown
- Description
- Extracted features P17
- MD5
- 997b91b1ff34187ad86fc75cef18002e
- Name
- P12.csv
- Size
- 2.18 MB
- Format
- Unknown
- Description
- Extracted features P12
- MD5
- 3dc91798e371acf336bec09b61b48c24
- Name
- P26.csv
- Size
- 1.28 MB
- Format
- Unknown
- Description
- Extracted features P26
- MD5
- 9cd7621965a62f7501c5574448d64669
- Name
- P25.csv
- Size
- 1.35 MB
- Format
- Unknown
- Description
- Extracted features P25
- MD5
- be4fb65ab25d3b2b590cde1cd990c45f
- Name
- P23.csv
- Size
- 1.42 MB
- Format
- Unknown
- Description
- Extracted features P23
- MD5
- cc170c3863ee5440ededc0ef301afd41
- Name
- P20.csv
- Size
- 2.59 MB
- Format
- Unknown
- Description
- Extracted features P20
- MD5
- 735bc3a3295fe2d083921729264dd73f
- Name
- P18.csv
- Size
- 1.34 MB
- Format
- Unknown
- Description
- Extracted features P18
- MD5
- 5c5409225f4378f9e2c2502ae375acfa
- Name
- P24.csv
- Size
- 956.96 KB
- Format
- Unknown
- Description
- Extracted features P24
- MD5
- 16c6d30b325795d935cc33a6a8184982
- Name
- P29.csv
- Size
- 2.16 MB
- Format
- Unknown
- Description
- Extracted features P29
- MD5
- 3bf1afd4e607b01c36ecd04d799b1550
- Name
- P30.csv
- Size
- 1.67 MB
- Format
- Unknown
- Description
- Extracted features P30
- MD5
- 6c268e0b5b9f411388585d20ab0c8f68
- Name
- P27.csv
- Size
- 2.14 MB
- Format
- Unknown
- Description
- Extracted features P27
- MD5
- 57b7671c0da06f68667823e69899a6e6
- Name
- P28.csv
- Size
- 2.48 MB
- Format
- Unknown
- Description
- Extracted features P28
- MD5
- a8d29c3911b07465d0a6be766de88db7
- Name
- P35.csv
- Size
- 2.03 MB
- Format
- Unknown
- Description
- Extracted features P35
- MD5
- fe2297f655c37197669bc39e0c80b3f5
- Name
- P34.csv
- Size
- 1.61 MB
- Format
- Unknown
- Description
- Extracted features P34
- MD5
- ec4aa3e83ecbae65c8f2a6b41da09754
- Name
- P33.csv
- Size
- 1.31 MB
- Format
- Unknown
- Description
- Extracted features P33
- MD5
- 087ba84995bd38c4c7d7ea5a2ccee2ae
- Name
- P32.csv
- Size
- 1.86 MB
- Format
- Unknown
- Description
- Extracted features P32
- MD5
- ff91d60a9561b195493fa36088a5ca77
- Name
- P22.csv
- Size
- 1.88 MB
- Format
- Unknown
- Description
- Extracted features P22
- MD5
- 70bd7b05eebe9c629bda03b97f12ec13
- Name
- P38.csv
- Size
- 1.57 MB
- Format
- Unknown
- Description
- Extracted features P38
- MD5
- 9875cb2ecbad62cfe676987792ac28fa
- Name
- P39.csv
- Size
- 1.34 MB
- Format
- Unknown
- Description
- Extracted features P39
- MD5
- 8faf12577e0c874290cb321849889a95
- Name
- P37.csv
- Size
- 2.11 MB
- Format
- Unknown
- Description
- Extracted features P37
- MD5
- 998919d0848d9ef0bb1569311233a2fc
- Name
- P31.csv
- Size
- 1.02 MB
- Format
- Unknown
- Description
- Extracted features P31
- MD5
- fff56e9aa6c7417b5919956732376610
- Name
- P36.csv
- Size
- 1.01 MB
- Format
- Unknown
- Description
- Extracted features P36
- MD5
- 777bcbe14c0e89fcf518d670324dfdfe
- Name
- P45.csv
- Size
- 1.86 MB
- Format
- Unknown
- Description
- Extracted features P45
- MD5
- 6506c16d908db4554824fba72b1d364b
- Name
- P44.csv
- Size
- 2.67 MB
- Format
- Unknown
- Description
- Extracted features P44
- MD5
- 84540200f695657babcd1a1212979132
- Name
- P42.csv
- Size
- 354.27 KB
- Format
- Unknown
- Description
- Extracted features P42
- MD5
- 50b7dec13d49018b722a92766f91e455
- Name
- P40.csv
- Size
- 2.06 MB
- Format
- Unknown
- Description
- Extracted features P40
- MD5
- c2077650175b8136ebbba15364b573a9
- Name
- P41.csv
- Size
- 2.49 MB
- Format
- Unknown
- Description
- Extracted features P41
- MD5
- 1672176b5c9da837558828a139052d4e
- Name
- P50.csv
- Size
- 1.97 MB
- Format
- Unknown
- Description
- Extracted features P50
- MD5
- 1c05a789ec4c9a16ae63263e5367ef20
- Name
- P48.csv
- Size
- 1.46 MB
- Format
- Unknown
- Description
- Extracted features P48
- MD5
- 69cb6330722f09d83a18a502c9fa57db
- Name
- P47.csv
- Size
- 2.12 MB
- Format
- Unknown
- Description
- Extracted features P47
- MD5
- 81ee52703aed2f614920bb3f917bee4a
- Name
- P46.csv
- Size
- 2.7 MB
- Format
- Unknown
- Description
- Extracted features P46
- MD5
- bb06cfa85c81b9c485695a1125fe0fc9
- Name
- P43.csv
- Size
- 1.44 MB
- Format
- Unknown
- Description
- Extracted features P43
- MD5
- d3520c8080537a14c80c0d9ac5a64305
- Name
- P52.csv
- Size
- 1.14 MB
- Format
- Unknown
- Description
- Extracted features P52
- MD5
- 58b5ac6ebe8c48be7407f18286e0237b
- Name
- P51.csv
- Size
- 1 MB
- Format
- Unknown
- Description
- Extracted features P51
- MD5
- da25eff85d97b738ca54e98037fa38e4
- Name
- P54.csv
- Size
- 1.32 MB
- Format
- Unknown
- Description
- Extracted features P54
- MD5
- 6ad2e832262b9106d523672a9df46651
- Name
- P53.csv
- Size
- 2.21 MB
- Format
- Unknown
- Description
- Extracted features P53
- MD5
- 26f03b08d84d6150702b2f6857b1ccf6
- Name
- P49.csv
- Size
- 616.18 KB
- Format
- Unknown
- Description
- Extracted features P49
- MD5
- cdb3b590c24115d2a566fd2e85d6b8db
- Name
- README.txt
- Size
- 426 bytes
- Format
- Text file
- Description
- README
- MD5
- 0ee765c4ec928b87bbe88a3aa455d9d4
# These files contain the extracted word-level eye-tracking metrics of the CopCo corpus.
Please check https://osf.io/ud8s5/ for a full version of the data (including the raw data and a Wiki with the feature descriptions).
Please also check https://github.com/norahollenstein/copco-processing to find the code used for preprocessing and additional data analysis scripts.
Contact: Nora Hollenstein, nora.hollenstein@hum.ku.dk
- Name
- P57.csv
- Size
- 1.26 MB
- Format
- Unknown
- Description
- Extracted features P57
- MD5
- caf9c5ced7f1b86c521f12989b5bfe42
- Name
- P58.csv
- Size
- 2.46 MB
- Format
- Unknown
- Description
- Extracted features P58
- MD5
- d79e7978b62c94dd482a151de7474497
- Name
- P56.csv
- Size
- 1.79 MB
- Format
- Unknown
- Description
- Extracted features P56
- MD5
- d755b98169c6de01e34124c5e9ad879f
- Name
- P55.csv
- Size
- 2.41 MB
- Format
- Unknown
- Description
- Extracted features P55
- MD5
- 360d248545a6b0d1ee6986a1fc115df2
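Each file in the listing above comes with an MD5 checksum, which can be used to confirm that a download is intact. The sketch below shows one way to do that with Python's standard `hashlib` module. The helper names `md5_of_file` and `verify_downloads` and the directory layout are illustrative assumptions, and only two of the listed checksums are copied in; the remaining entries would be added in the same way.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Checksums copied from the file listing (two entries shown as examples).
EXPECTED_MD5 = {
    "P04.csv": "224ce3775650a2c3512d77b1b9d3cfb2",
    "README.txt": "0ee765c4ec928b87bbe88a3aa455d9d4",
}


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch' or 'missing'."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_downloads("copco")` on a hypothetical download directory named `copco` would return a dictionary that flags any file that is absent or whose content differs from the published checksum.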