'SFU ReviewSP-NEG: a Spanish corpus annotated with negation for sentiment analysis a typology of negation patterns'

Jiménez Zafra, S., Taulé, M., Martín Valdivia, M.T., Martí, M.A., Ureña, L.A.
Membres autors
Language, Resources and Evaluation
Dordrecht, Springer Science+Business Media

In this paper, we present SFU ReviewSP-NEG, the first Spanish corpus annotated with negation with a wide coverage freely available. We describe the methodology applied in the annotation of the corpus including the tagset, the linguistic criteria and the inter-annotator agreement tests. We also include a complete typology of negation patterns in Spanish. This typology has the advantage that it is easy to express in terms of a tagset for corpus annotation: the types are clearly defined, which avoids ambiguity in the annotation process, and they provide wide coverage (i.e. they resolved all the cases occurring in the corpus). We use the SFU ReviewSP as a base in order to make the annotations. The corpus consists of 400 reviews, 221,866 words and 9455 sentences, out of which 3022 sentences contain at least one negation structure.