Ke klasifikaci morfologických variant

Language: Czech
Title:Ke klasifikaci morfologických variant
Creators:
Cvrček, Václav; vaclav dot cvrcek at ff dot cuni dot cz; ÚČNK FF UK, nám. Jana Palacha 2, Praha 1, 116 38, Czech Republic
Kodýtek, Vilém; vkodytek at iol dot cz; Beethovenova 18, Ústí nad Labem, 400 01, Czech Republic
Journal or Publication Title:
Slovo a slovesnost, 74, 2, pp. 139-146
Uncontrolled Keywords:corpus analysis, effect size, language production heterogeneity, morphological variation, Shannon entropy, statistical analysis

Abstract

After briefly discussing the heterogeneities inherent in language production and how they influence corpus evidence, we describe a scale for classifying individual morphological variants by their relative frequencies. The scale was recently proposed independently in Mluvnice současné češtiny (2010) (A Grammar of Contemporary Czech, hereafter GCCz), of which we are co-authors, and in Bermel & Knittl (2012). Variants with a relative frequency of (roughly) between 1% and 10% are classified by the respective authors as “sparse” and “marked”, and those occurring in (roughly) fewer than 1% of cases as “unexpected” and “isolated”. Another feature of the scale is the “equipollence” of the variants of a doublet whose relative frequencies lie (roughly) between 1/3 and 2/3 (for this criterion see also Štícha 2009). The scale in GCCz is heuristically based on Shannon entropy and is valid for synchronic, functionally equivalent variants. Recently, R. Čech (2012) claimed to have revealed “a serious statistical deficiency” in GCCz. We show that this is a misunderstanding stemming from a failure to distinguish between null-hypothesis statistical significance testing and effect-size evaluation. We end with a brief note on the structure of the resources employed in GCCz.
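
The frequency bands named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract gives the thresholds only as rough values and does not say how boundary cases are handled, so the exact cut-offs, the inclusive/exclusive choices, the combined labels, the "unlabelled" fallback and the function names `classify_variant` and `shannon_entropy` are all assumptions made for illustration. The entropy function is included only because the abstract says the scale is heuristically based on Shannon entropy; how the thresholds are derived from it is not stated here.

```python
import math


def shannon_entropy(counts):
    """Shannon entropy, in bits, of a set of variant counts (zero counts are skipped)."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)


def classify_variant(rel_freq):
    """Map a relative frequency (0..1) to a band named in the abstract.

    Thresholds are the rough values from the abstract; treating them as
    sharp cut-offs is an assumption of this sketch.
    """
    if not 0.0 <= rel_freq <= 1.0:
        raise ValueError("relative frequency must lie between 0 and 1")
    if rel_freq < 0.01:
        return "unexpected/isolated"   # roughly fewer than 1% of cases
    if rel_freq <= 0.10:
        return "sparse/marked"         # roughly between 1% and 10%
    if 1 / 3 <= rel_freq <= 2 / 3:
        return "equipollent"           # roughly between 1/3 and 2/3
    return "unlabelled"                # bands the abstract does not name


if __name__ == "__main__":
    # Hypothetical doublet counts, invented purely for illustration.
    counts = {"variant_a": 940, "variant_b": 60}
    total = sum(counts.values())
    for name, count in counts.items():
        print(name, classify_variant(count / total))
    print("entropy (bits):", shannon_entropy(list(counts.values())))
```

A doublet whose two variants are equally frequent has the maximum entropy of 1 bit, and the entropy falls towards 0 as one variant comes to dominate, which is one way to read the intuition behind an entropy-based scale.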

Official URL: http://www.ceeol.com/aspx/issuedetails.aspx?issueid=97209EE5-097C-460A-A34E-789293828DE8&articleid=94FF3105-7FC2-4EF1-8266-EFB326CE6D14

Translated title:On the classification of morphological variants
Subjects:P Language and Literature > P Philology. Linguistics
Divisions:Humanities and Social Sciences > 9. Section of Humanities and Philology > Institute of the Czech Language > Slovo a slovesnost
Journal or Publication Title:Slovo a slovesnost
Volume:74
Number:2
Page Range:pp. 139-146
ISSN:0037-7031
Publisher:Czech Language Institute of the Academy of Sciences of the Czech Republic
Related URLs:http://avi.lib.cas.cz/node/82 (URL Type: Publisher)
ID Code:7744
Item Type:Article
Deposited On:03 Jun 2013 13:11
Last Modified:03 Jun 2013 11:11

Citation

Cvrček, Václav; Kodýtek, Vilém (2013) Ke klasifikaci morfologických variant. Slovo a slovesnost, 74 (2). pp. 139-146. ISSN 0037-7031
