This paper describes Olex, a novel method for the automatic construction of rule-based text classifiers. Olex relies on an optimization algorithm whereby a set of (both positive and negative) discriminating terms is generated for the category being learned. Such terms are then used to construct a classifier of the form "if term t"1 or ... term tn occurs in document d, and none of terms tn - 1, tn - m occurs in d, then d belongs to category c". The proposed method is simple and elegant. Despite this, the results of a systematic experimentation performed on both the REUTERS-21578 and the OHSUMED data collections show that Olex is both effective and efficient.

Learning rules with negation for text categorization

RULLO, Pasquale;
2007-01-01

Abstract

This paper describes Olex, a novel method for the automatic construction of rule-based text classifiers. Olex relies on an optimization algorithm whereby a set of (both positive and negative) discriminating terms is generated for the category being learned. Such terms are then used to construct a classifier of the form "if term t"1 or ... term tn occurs in document d, and none of terms tn - 1, tn - m occurs in d, then d belongs to category c". The proposed method is simple and elegant. Despite this, the results of a systematic experimentation performed on both the REUTERS-21578 and the OHSUMED data collections show that Olex is both effective and efficient.
2007
1595934804
Algorithms; Classification (of information); Data acquisition; Feature extraction; Learning systems; Set theory
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/180146
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 264
  • ???jsp.display-item.citation.isi??? 6
social impact