The paper presents a system developed on a novel approach to clustering semantically related XML documents. The novelty refers to both XML feature generation and XML document modeling: structure as well as content information is enriched with lexical ontology knowledge and analyzed, whereas the notion of tree tuple is involved to enable a transactional representation of XML documents. Experimental evaluation performed on large real datasets reveals high effectiveness of the proposed clustering system.
SemXClust: A System for Semantic XML Clustering
TAGARELLI, Andrea;Greco S.
2006-01-01
Abstract
The paper presents a system developed on a novel approach to clustering semantically related XML documents. The novelty refers to both XML feature generation and XML document modeling: structure as well as content information is enriched with lexical ontology knowledge and analyzed, whereas the notion of tree tuple is involved to enable a transactional representation of XML documents. Experimental evaluation performed on large real datasets reveals high effectiveness of the proposed clustering system.File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.