Zachycení výstavby textu v Pražském závislostním korpusu

Language: Czech
Title:Zachycení výstavby textu v Pražském závislostním korpusu
Creators:
Zikánová, Šárka; zikanova at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Poláková, Lucie; polakova at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Jínová, Pavlína; jinova at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Nedoluzhko, Anna; nedoluzko at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Rysová, Magdaléna; magdalena dot rysova at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Mírovský, Jiří; mirovsky at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Hajičová, Eva; hajicova at ufal dot mff dot cuni dot cz; ÚFAL MFF UK, Malostranské nám. 25, Praha 1, 118 00, Czech Republic
Journal or Publication Title:
Slovo a slovesnost, 76, 3, pp. 163-198
Uncontrolled Keywords:text, discourse, phenomena beyond the sentence boundary, discourse relations, discourse connectives, coreference, bridging anaphora, Prague Dependency Treebank

Abstract

Language corpus annotation schemes now cover various layers of sentence description, from morphology to semantics. Annotation projects concerning phenomena beyond the sentence boundary, however, have only recently begun to attract the attention of corpus linguists. In the present contribution, we describe a unified approach to the analysis of discourse phenomena, developed for large-scale annotation of the Czech empirical data of the Prague Dependency Treebank. This approach rests on two fundamental pillars: (i) it exploits the results of one of the first complex schemes for discourse annotation, proposed and realized in the Penn Discourse Treebank for English; (ii) it follows the Praguian Functional Generative Description and treebanking tradition, taking advantage of the tectogrammatical (underlying) layer of sentence analysis and extending it to a full discourse-level description. Our analysis concentrates on two major aspects of discourse coherence: (i) discourse relations (semantic relations between discourse segments) and discourse connectives as their lexical anchors; and (ii) coreference and so-called bridging anaphora. We present a detailed description of the annotation scheme and procedure, address individual problematic issues, and offer basic corpus statistics and an evaluation of the annotation.

Official URL: http://www.ceeol.com/aspx/issuedetails.aspx?issueid=526AD11A-AD75-46F1-A027-EA4BEDDED208&articleid=C1EA7315-EE04-4CE9-8C49-41F89717F53A

Title:Zachycení výstavby textu v Pražském závislostním korpusu
Translated title:Annotation of discourse phenomena in the Prague Dependency Treebank
Subjects:P Language and Literature > P Philology. Linguistics
Divisions:Humanities and Social Sciences > 9. Section of Humanities and Philology > Institute of the Czech Language > Slovo a slovesnost
Journal or Publication Title:Slovo a slovesnost
Volume:76
Number:3
Page Range:pp. 163-198
ISSN:0037-7031
Publisher:Czech Language Institute of the Academy of Sciences of the Czech Republic
Related URLs:
URL: http://avi.lib.cas.cz/node/82
URL Type: Publisher
ID Code:8460
Item Type:Article
Deposited On:31 Aug 2015 16:23
Last Modified:31 Aug 2015 14:23

Citation

Zikánová, Šárka; Poláková, Lucie; Jínová, Pavlína; Nedoluzhko, Anna; Rysová, Magdaléna; Mírovský, Jiří; Hajičová, Eva (2015) Zachycení výstavby textu v Pražském závislostním korpusu. Slovo a slovesnost, 76 (3). pp. 163-198. ISSN 0037-7031
