Semantic similarity measures (SSMs) are used to evaluate the similarity among terms of an ontology. Biological entities, e.g., gene products, are often annotated with terms extracted from existing ontologies. A common application is to find the similarity or dissimilarity among two entities through the application of SSMs to their annotations. More recently, researchers have introduced the semantic similarity networks (SSNs), i.e., edge-weighted graphs where the nodes are concepts (e.g., proteins) and each edge has an associated weight that represents the semantic similarity among related pairs of nodes. Community detection algorithms that analyze SSNs may reveal clusters of functionally associated concepts. For instance, the application of these algorithms on networks built upon of proteins may find protein complexes. SSNs have a high number of arcs with low weight. The application of classical community detection algorithms on raw networks exhibits low performance. To improve the performance of such algorithms, a possible approach is to simplify the structure of SSNs through a preprocessing step able to delete arcs likened to noise. Thus, we propose a novel preprocessing strategy to simplify SSNs implemented in an open-source tool: SSN-Analyzer. As proof of concept, we demonstrate that community detection algorithms applied to filtered (thresholded) networks, have better performances in terms of biological relevance of the results, with respect to the use of raw unfiltered networks.

Using SSN-Analyzer for analysis of semantic similarity networks PH Guzzi, M Milano, P Veltri, M Cannataro Network Modeling Analysis in Health Informatics and Bioinformatics 4 (1), 1-10

Using SSN-Analyzer for analysis of semantic similarity networks

Guzzi P;Cannataro M;Veltri Pierangelo
2015-01-01

Abstract

Semantic similarity measures (SSMs) are used to evaluate the similarity among terms of an ontology. Biological entities, e.g., gene products, are often annotated with terms extracted from existing ontologies. A common application is to find the similarity or dissimilarity among two entities through the application of SSMs to their annotations. More recently, researchers have introduced the semantic similarity networks (SSNs), i.e., edge-weighted graphs where the nodes are concepts (e.g., proteins) and each edge has an associated weight that represents the semantic similarity among related pairs of nodes. Community detection algorithms that analyze SSNs may reveal clusters of functionally associated concepts. For instance, the application of these algorithms on networks built upon of proteins may find protein complexes. SSNs have a high number of arcs with low weight. The application of classical community detection algorithms on raw networks exhibits low performance. To improve the performance of such algorithms, a possible approach is to simplify the structure of SSNs through a preprocessing step able to delete arcs likened to noise. Thus, we propose a novel preprocessing strategy to simplify SSNs implemented in an open-source tool: SSN-Analyzer. As proof of concept, we demonstrate that community detection algorithms applied to filtered (thresholded) networks, have better performances in terms of biological relevance of the results, with respect to the use of raw unfiltered networks.
2015
Using SSN-Analyzer for analysis of semantic similarity networks PH Guzzi, M Milano, P Veltri, M Cannataro Network Modeling Analysis in Health Informatics and Bioinformatics 4 (1), 1-10
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/362396
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact