RJ-funded project - documentation, issues, etc. - https://spraakbanken.gu.se/eng/l2-profiling
Status document (with links): https://docs.google.com/document/d/16JzEDFyDhbV9Updw_QYGHZJXdMbZ1LR3NTKJEsJKuqk/edit#
Several directions are pursued in this project:
This is split into several tasks:
- Manual checks of the automatic annotation of two corpora: coursebooks (COCTAILL) and essays (SweLL-pilot)
  - Annotation check guidelines: https://docs.google.com/document/d/1W9gcwRwFJ7-DsAC6cf6BHUoEivt73r-XWCV1oKS6xV8/edit#
  - COCTAILL (article): http://www.ep.liu.se/ecp/107/010/ecp14107010.pdf
  - SweLL-pilot (article): http://arxiv.org/pdf/1604.06583v1.pdf
- (possibly) Normalization of SweLL-pilot: https://spraakbanken.github.io/swell-project/Normalization_guidelines
  - SVALA tool, demo (for normalization): https://spraakbanken.gu.se/swell/dev/
- Manual checks of SenSVALex, a sense-based word list generated from the COCTAILL corpus
  - SVALex (article): https://spraakbanken.gu.se/sites/spraakbanken.gu.se/files/SVALex_LREC_cameraReady.pdf
- (in the future) Manual checks of SenSweLLex, a sense-based word list generated from SweLL-pilot
  - SweLLex (article): http://www.ep.liu.se/ecp/130/010/ecp16130010.pdf
- Manual lexicographic annotation of SenSVALex (and possibly SenSweLLex) using the LEGATO tool:
  - LEGATO: https://spraakbanken.gu.se/larkalabb/legato
  - Guidelines: https://docs.google.com/document/d/1nZOKf-54FEkjIQFnPUmZZRWqib6y7gpCuKQO-XadeqM/edit#heading=h.5rcsyvi01oc5
This is split into several tasks:
- Passives
- Prepositions
- Definiteness
- Embedded clauses?

- For Legato, use this GitHub page and the issues connected to it: https://github.com/elenavolodina/Legato
- For corpus annotation checks and for SVALex checks, use the issues on this (current) GitHub page