A unifying similarity measure for automated identification of national implementations of european union directives. - Citegraph

Paper Info

Title
A unifying similarity measure for automated identification of national implementations of european union directives.

Abstract
This paper presents a unifying text similarity measure (USM) for automated identification of national implementations of European Union (EU) directives. The proposed model retrieves the transposed provisions of national law at a fine-grained level for each article of the directive. USM incorporates methods for matching common words, common sequences of words and approximate string matching. It was used for identifying transpositions on a multilingual corpus of four directives and their corresponding national implementing measures (NIMs) in three different languages : English, French and Italian. We further utilized a corpus of four additional directives and their corresponding NIMs in English language for a thorough test of the USM approach. We evaluated the model by comparing our results with a gold standard consisting of official correlation tables (where available) or correspondences manually identified by domain experts. Our results indicate that USM was able to identify transpositions with average F-score values of 0.808, 0.736 and 0.708 for French, Italian and English Directive-NIM pairs respectively in the multilingual corpus. A comparison with state-of-the-art methods for text similarity illustrates that USM achieves a higher F-score and recall across both the corpora.

Year	Venue	Field
2017	ICAIL	Data mining,English language,Similarity measure,Computer science,Directive,Implementation,European Union law,Approximate string matching,Legal information retrieval,European union
DocType	Citations	PageRank
Conference	1	0.36
References	Authors
13	11

Authors (11 rows)

Cited by (1 rows)

References (13 rows)

Name	Order	Citations	PageRank
Rohan Nanda	1	12	3.01
Luigi Di Caro	2	195	35.21
Guido Boella	3	1867	162.59
Hristo Konstantinov	4	1	0.36
Tenyo Tyankov	5	1	0.36
Daniel Traykov	6	1	0.36
Hristo Hristov	7	1	0.36
Francesco Costamagna	8	1	0.69
Llio Humphreys	9	73	8.76
Livio Robaldo	10	269	33.46
Michele Romano	11	1	0.36

1