In recent years, the surge in textual data has led to an information overload, particularly evident in the scientific literature domain due to the rapid pace of document publication. Researchers require tools to manage the sheer volume of publications, enabling the rapid extraction of knowledge. Automatic Text Summarization emerges as a key solution, reducing reading time and streamlining efforts by accentuating key points and generating concise summaries with the most pertinent information. Our innovative approach, named Integrated Text Summarization, aims to enhance extractive algorithms by incorporating specific features of scientific documents—such as keywords and title terms—into the sentence selection process. This method focuses on extracting significant sentences from each section of scientific papers, thereby identifying content that encapsulates the core knowledge of a publication. We will assess the effectiveness of our proposal across various research domains using a selection of scientific papers.

Extracting Knowledge from Scientific Literature with an Integrated Text Summarization Approach

Michelangelo Misuraca;
2024-01-01

Abstract

In recent years, the surge in textual data has led to an information overload, particularly evident in the scientific literature domain due to the rapid pace of document publication. Researchers require tools to manage the sheer volume of publications, enabling the rapid extraction of knowledge. Automatic Text Summarization emerges as a key solution, reducing reading time and streamlining efforts by accentuating key points and generating concise summaries with the most pertinent information. Our innovative approach, named Integrated Text Summarization, aims to enhance extractive algorithms by incorporating specific features of scientific documents—such as keywords and title terms—into the sentence selection process. This method focuses on extracting significant sentences from each section of scientific papers, thereby identifying content that encapsulates the core knowledge of a publication. We will assess the effectiveness of our proposal across various research domains using a selection of scientific papers.
2024
978-2-39061-471-5
Automatic Text Summarization; Information extraction; Scientific article features; IMRaD format
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/368479
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact