The growth of data volume collected in urban contexts opens up to their exploitation for improving citizens’ quality-of-life and city management issues, like resource planning (water, electricity), traffic, air and water quality, public policy and public safety services. Moreover, due to the large-scale diffusion of GPS and scanning devices, most of the available data are geo-referenced. Considering such an abundance of data, a very desirable and common task is to identify homogeneous regions in spatial data by partitioning a city into uniform regions based on pollution density, mobility spikes, crimes, or on other characteristics. Density-based clustering algorithms have been shown to be very suitable to detect density-based regions, i.e. areas in which urban events occur with higher density than the remainder of the dataset. Nevertheless, an important issue of such algorithms is that, due to the adoption of global parameters, they fail to identify clusters with varied densities, unless the clusters are clearly separated by sparse regions. In this paper we provide a preliminary analysis about how hierarchical clustering can be used to discover spatial clusters of different densities, in spatial urban data. The algorithm can automatically estimate the area of data having different densities, it can automatically estimate parameters for each cluster so as to reduce the requirement for human intervention or domain knowledge.

Hierarchical Clustering of Spatial Urban Data

Cesario E.;Vinci A.;
2020-01-01

Abstract

The growth of data volume collected in urban contexts opens up to their exploitation for improving citizens’ quality-of-life and city management issues, like resource planning (water, electricity), traffic, air and water quality, public policy and public safety services. Moreover, due to the large-scale diffusion of GPS and scanning devices, most of the available data are geo-referenced. Considering such an abundance of data, a very desirable and common task is to identify homogeneous regions in spatial data by partitioning a city into uniform regions based on pollution density, mobility spikes, crimes, or on other characteristics. Density-based clustering algorithms have been shown to be very suitable to detect density-based regions, i.e. areas in which urban events occur with higher density than the remainder of the dataset. Nevertheless, an important issue of such algorithms is that, due to the adoption of global parameters, they fail to identify clusters with varied densities, unless the clusters are clearly separated by sparse regions. In this paper we provide a preliminary analysis about how hierarchical clustering can be used to discover spatial clusters of different densities, in spatial urban data. The algorithm can automatically estimate the area of data having different densities, it can automatically estimate parameters for each cluster so as to reduce the requirement for human intervention or domain knowledge.
2020
978-3-030-39080-8
978-3-030-39081-5
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/303508
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 2
social impact