Variation in noun and pronoun frequencies in a sociohistorical corpus of English

Show simple item record

dc.contributor.author Säily, Tanja
dc.contributor.author Nevalainen, Terttu
dc.contributor.author Siirtola, Harri
dc.date.accessioned 2013-06-01T02:25:00Z
dc.date.available 2013-06-01T02:25:00Z
dc.date.issued 2011
dc.identifier.citation Säily , T , Nevalainen , T & Siirtola , H 2011 , ' Variation in noun and pronoun frequencies in a sociohistorical corpus of English ' , Literary and Linguistic Computing , vol. 26 , no. 2 , pp. 167-188 . https://doi.org/10.1093/llc/fqr004
dc.identifier.other PURE: 15456721
dc.identifier.other PURE UUID: a57cb249-bfdf-4d45-a12f-298a9f71ea8a
dc.identifier.other WOS: 000291063000003
dc.identifier.other Scopus: 79957862591
dc.identifier.other ORCID: /0000-0003-3088-4903/work/39874371
dc.identifier.other ORCID: /0000-0003-4407-8929/work/28758652
dc.identifier.uri http://hdl.handle.net/10138/39576
dc.description WOS:000291063000003
dc.description.abstract Many corpus linguists make the tacit assumption that part-of-speech frequencies remain constant during the period of observation. In this article, we will consider two related issues: (1) the reliability of part-of-speech tagging in a diachronic corpus and (2) shifts in tag ratios over time. The purpose is both to serve the users of the corpus by making them aware of potential problems, and to obtain linguistically interesting results. We use noun and pronoun ratios as diagnostics indicative of opposing stylistic tendencies, but we are also interested in testing whether any observed variation in the ratios could be accounted for in sociolinguistic terms. The material for our study is provided by the Parsed Corpus of Early English Correspondence (PCEEC), which consists of 2.2 million running words covering the period 1415–1681. The part-of-speech tagging of the PCEEC has its problems, which we test by reannotating the corpus according to our own principles and comparing the two annotations. While there are quite a few changes, the mean percentage of change is very small for both nouns and pronouns. As for variation over time, the mean frequency of nouns declines somewhat, while the mean frequency of pronouns fluctuates with no clear diachronic trend. However, women consistently use more pronouns than men, while men use more nouns than women. More fine-grained distinctions are needed to uncover further regularities and possible reasons for this variation. en
dc.format.extent 12
dc.language.iso eng
dc.relation.ispartof Literary and Linguistic Computing
dc.rights.uri info:eu-repo/semantics/openAccess
dc.subject 6121 Languages
dc.title Variation in noun and pronoun frequencies in a sociohistorical corpus of English en
dc.type Article
dc.contributor.organization Department of Modern Languages 2010-2017
dc.description.reviewstatus Peer reviewed
dc.relation.doi https://doi.org/10.1093/llc/fqr004
dc.relation.issn 0268-1145
dc.rights.accesslevel openAccess
dc.type.version acceptedVersion

Files in this item

Total number of downloads: Loading...

Files Size Format View
saily_et_al_2011_postprint.pdf 1.431Mb PDF View/Open

This item appears in the following Collection(s)

Show simple item record