Title
A unifying similarity measure for automated identification of national implementations of european union directives.
Abstract
This paper presents a unifying text similarity measure (USM) for automated identification of national implementations of European Union (EU) directives. The proposed model retrieves the transposed provisions of national law at a fine-grained level for each article of the directive. USM incorporates methods for matching common words, common sequences of words and approximate string matching. It was used for identifying transpositions on a multilingual corpus of four directives and their corresponding national implementing measures (NIMs) in three different languages : English, French and Italian. We further utilized a corpus of four additional directives and their corresponding NIMs in English language for a thorough test of the USM approach. We evaluated the model by comparing our results with a gold standard consisting of official correlation tables (where available) or correspondences manually identified by domain experts. Our results indicate that USM was able to identify transpositions with average F-score values of 0.808, 0.736 and 0.708 for French, Italian and English Directive-NIM pairs respectively in the multilingual corpus. A comparison with state-of-the-art methods for text similarity illustrates that USM achieves a higher F-score and recall across both the corpora.
Year
Venue
Field
2017
ICAIL
Data mining,English language,Similarity measure,Computer science,Directive,Implementation,European Union law,Approximate string matching,Legal information retrieval,European union
DocType
Citations 
PageRank 
Conference
1
0.36
References 
Authors
13
11
Name
Order
Citations
PageRank
Rohan Nanda1123.01
Luigi Di Caro219535.21
Guido Boella31867162.59
Hristo Konstantinov410.36
Tenyo Tyankov510.36
Daniel Traykov610.36
Hristo Hristov710.36
Francesco Costamagna810.69
Llio Humphreys9738.76
Livio Robaldo1026933.46
Michele Romano1110.36