Abstract | ||
---|---|---|
The Web is full of documents which must be interpreted by human readers and by software agents (search engines recommender systems clustering processes etc.). Although Web standards have addressed format obfuscation by using XML schemas and stylesheets to specify unambiguous structure and presentation semantics interpretation is still hampered by the fundamental ambiguity of information in PCDATA text. Even the most easily distinguishable kinds of knowledge such as article citations and proper nouns (referring to people organisations projects products technical concepts) have to be identified by fallible post-hoc extraction processes. The WiCK project has investigated the writing process in a Semantic Web environment where knowledge services exist and actively assist the author. In this paper we discuss the need to make knowledge an explicit part of the document representation and the advantages and disadvantages of this step. |
Year | DOI | Venue |
---|---|---|
2004 | 10.1145/1030397.1030417 | ACM Symposium on Document Engineering |
Keywords | Field | DocType |
pcdata text,document representation,xml schema,knowledge service,article citation,wick project,explicit knowledge,distinguishable kind,processes etc.,semantic web environment,web standard,software agent,semantic web,noun,semantic interpretation,search engine,document structure,recommender system | World Wide Web,Information retrieval,Semantic Web Stack,Explicit knowledge,Computer science,Web standards,Document Structure Description,Presentation semantics,Semantic Web,XML schema,Social Semantic Web,Database | Conference |
ISBN | Citations | PageRank |
1-58113-938-1 | 16 | 1.06 |
References | Authors | |
17 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Leslie Carr | 1 | 159 | 18.16 |
Timothy Miles-Board | 2 | 84 | 8.61 |
Arouna Woukeu | 3 | 53 | 5.75 |
Gary Wills | 4 | 531 | 56.82 |
Wendy Hall | 5 | 2758 | 316.21 |