Title
Full Text Document Retrieval: Hebrew Legal Texts
Abstract
A full text retrieval system was designed for the responsa literature, a large corpus of Hebrew legal cases. The unique problems of the database (a mixture of Hebrew, Aramaic and vernaculars; lack of vowels and punctuation; extreme inflection; homographs; and the existence of thousands of grammatical variants of any given keyword) dictated the development of new methods. Among them are "grammatical synthesis", which synthesizes all grammatical variants of a given keyword; "Compact KWIC", which gives the user a glimpse of the nature of a search before performing it; an effective citation index embedded in full text searches; and, in general, extensive use of both positive and negative feedback within a single search run. A number of searches performed on a relatively small database gave a recall of 100% in each case, with an average precision of 34%. A KWIC of strategic portions of retrieved documents usually enables quick disposal of non-relevant material.
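For readers unfamiliar with the term, a KWIC (keyword in context) listing shows each occurrence of a search term together with a few neighbouring words, so that non-relevant hits can be discarded at a glance. The following is a minimal, generic sketch in Python; it is not the paper's "Compact KWIC" method, and the function name `kwic`, the `width` parameter, and the English sample text are assumptions made purely for illustration.

```python
def kwic(text, keyword, width=3):
    """Return one context line per occurrence of `keyword` in `text`.

    Hypothetical helper: a generic keyword-in-context listing,
    not the "Compact KWIC" described in the abstract.
    """
    tokens = text.split()
    lines = []
    for i, tok in enumerate(tokens):
        # Compare case-insensitively, ignoring surrounding punctuation.
        if tok.strip('.,;:"()').lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


# Illustrative sample text (not taken from the responsa corpus).
sample = ("The court ruled that the contract was void. "
          "A later court upheld the contract.")

for line in kwic(sample, "contract", width=2):
    print(line)
```

Each printed line shows the matched word in brackets with up to `width` words on either side. Matching here is by exact token, so inflected forms would be missed.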
Year: 1971
Venue: SIGIR
Keywords: document retrieval, indexation, negative feedback
Field: Information retrieval, Computer science, Full text search, Hebrew, Natural language processing, Artificial intelligence, Document retrieval, Text document, Visual Word
DocType: Conference
Citations: 4
PageRank: 33.23
References: 2
Authors: 5
Name                 Order  Citations  PageRank
Yaacov Choueka       1      241        202.83
M. Cohen             2      4          33.23
J. Dueck             3      4          33.23
Aviezri S. Fraenkel  4      559        164.51
M. Slae              5      4          33.23