Discovering Lexical Similarity Using Articulatory Feature-Based Phonetic Edit Distance

Ahmed, Tafseer; Suffian, Muhammad; Muhammad Yaseen Khan,; Bogliolo, Alessandro

doi:10.1109/ACCESS.2021.3137905

Lexical Similarity (LS) between two languages uncovers many interesting linguistic insights such as phylogenetic relationship, mutual intelligibility, common etymology, and loan words. There are various methods through which LS is evaluated. This paper presents a method of Phonetic Edit Distance (PED) that uses a soft comparison of letters using the articulatory features associated with their International Phonetic Alphabet (IPA) transcription. In particular, the comparison between the articulatory features of two letters taken from words belonging to different languages is used to compute the cost of replacement in the inner loop of edit distance computation. As an example, PED gives edit distance of 0.82 between German word ‘vater’ ([fa:tər]) and Persian word ‘ ’ ([pedær]), meaning ‘father,’ and, similarly, PED of 0.93 between Hebrew word ‘ ’ ([ʃəɭam]) and Arabic word ‘ ’ ([səɭa:m], meaning ‘peace,’ whereas classical edit distances would be 4 and 2, respectively. We report the results of systematic experiments conducted on six languages: Arabic, Hindi, Marathi, Persian, Sanskrit, and Urdu. Universal Dependencies (UD) corpora were used to restrict the comparison to lists of words belonging to the same part of speech. The LS based on the average PED between pair of words was then computed for each pair of languages, unveiling similarities otherwise masked by the adoption of different alphabets, grammars, and pronunciations rules.

Discovering Lexical Similarity Using Articulatory Feature-Based Phonetic Edit Distance

Tafseer Ahmed;Muhammad Suffian;Muhammad Yaseen Khan;Alessandro Bogliolo

2022

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

Discovering Lexical Similarity Using Articulatory Feature-Based Phonetic Edit Distance

Tafseer Ahmed;Muhammad Suffian;Muhammad Yaseen Khan;Alessandro Bogliolo

2022

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)