The paper presents a system developed on a novel approach to clustering semantically related XML documents. The novelty refers to both XML feature generation and XML document modeling: structure as well as content information is enriched with lexical ontology knowledge and analyzed, whereas the notion of tree tuple is involved to enable a transactional representation of XML documents. Experimental evaluation performed on large real datasets reveals high effectiveness of the proposed clustering system.

SemXClust: A System for Semantic XML Clustering

TAGARELLI, Andrea;Greco S.
2006-01-01

Abstract

The paper presents a system developed on a novel approach to clustering semantically related XML documents. The novelty refers to both XML feature generation and XML document modeling: structure as well as content information is enriched with lexical ontology knowledge and analyzed, whereas the notion of tree tuple is involved to enable a transactional representation of XML documents. Experimental evaluation performed on large real datasets reveals high effectiveness of the proposed clustering system.
2006
88-6068-018-2
semistructured data and XML; XML mining; document clustering
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/166395
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact