Unsupervised analytic strategies to explore large document collections

IRIS

The technological revolution of the last years allowed to process different kinds of data to study several real-world phenomena. Together with the traditional source of data, textual data became more and more critical in many research domains, proposing new challenges to scholars working with documents written in natural language. In this paper, we explain how to prepare a set of documents for quantitative analyses and compare the different approaches widely used to extract information automatically, discussing their advantages and disadvantages.

Unsupervised analytic strategies to explore large document collections

Michelangelo Misuraca;Maria Spano

2020-01-01

Abstract

The technological revolution of the last years allowed to process different kinds of data to study several real-world phenomena. Together with the traditional source of data, textual data became more and more critical in many research domains, proposing new challenges to scholars working with documents written in natural language. In this paper, we explain how to prepare a set of documents for quantitative analyses and compare the different approaches widely used to extract information automatically, discussing their advantages and disadvantages.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Codice ISBN
	
				978-3-030-52679-5
			
	Appare nelle tipologie:
	
				2.1 Contributo in volume (Capitolo o Saggio)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/304839

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

18

ND

social impact