Dataset for Temporal Analysis of English-French Cognates

Visa fullständig post



Permalänk

http://hdl.handle.net/10138/317395

Citation

Frossard , E , Coustaty , M , Doucet , A , Jatowt , A & Hengchen , S 2020 , Dataset for Temporal Analysis of English-French Cognates . in N Calzolari ... [et al.] (ed.) , Twelfth International Conference on Language Resources and Evaluation (LREC 2020) : May 11-16, 2020 PALAIS DU PHARO Marseille, France : Conference Proceedings . European Language Resources Association (ELRA) , Paris , pp. 855–859 , International Conference on Language Resources and Evaluation , Marseille , France , 11/05/2020 . < http://www.lrec-conf.org/proceedings/lrec2020/LREC-2020.pdf >

Titel: Dataset for Temporal Analysis of English-French Cognates
Författare: Frossard, Esteban; Coustaty, Mickaël; Doucet, Antoine; Jatowt, Adam; Hengchen, Simon
Medarbetare: Calzolari ... [et al.], Nicoletta
Upphovmannens organisation: Digital Humanities
Utgivare: European Language Resources Association (ELRA)
Datum: 2020-05-13
Språk: eng
Sidantal: 5
Tillhör serie: Twelfth International Conference on Language Resources and Evaluation (LREC 2020)
ISBN: 979-10-95546-34-4
Permanenta länken (URI): http://hdl.handle.net/10138/317395
Abstrakt: Languages change over time and, thanks to the abundance of digital corpora, their evolutionary analysis using computational techniques has recently gained much research attention. In this paper, we focus on creating a dataset to support investigating the similarity in evolution between different languages. We look in particular into the similarities and differences between the use of corresponding words across time in English and French, two languages from different linguistic families yet with shared syntax and close contact. For this we select a set of cognates in both languages and study their frequency changes and correlations over time. We propose a new dataset for computational approaches of synchronized diachronic investigation of language pairs, and subsequently show novel findings stemming from the cognate-focused diachronic comparison of the two chosen languages. To the best of our knowledge, the present study is the first in the literature to use computational approaches and large data to make a cross-language diachronic analysis.
Subject: 113 Computer and information sciences
6121 Languages
Referentgranskad: Ja
Licens: cc_by_nc
Användningsbegränsning: openAccess
Parallelpublicerad version: acceptedVersion
Finansierad av:
Finansierings ID: 770299


Filer under denna titel

Totalt antal nerladdningar: Laddar...

Filer Storlek Format Granska
Frossard_etal_2 ... h_French_cognates_LREC.pdf 635.1Kb PDF Granska/Öppna

Detta dokument registreras i samling:

Visa fullständig post