Consiglio Nazionale delle Ricerche

Tipo di prodottoArticolo in rivista
TitoloA Neural Network model for the Evaluation of Text Complexity in Italian Language: a Representation Point of View
Anno di pubblicazione2018
FormatoElettronico
Autore/iGiosué Lo Bosco, Giovanni Pilato, Daniele Schicchi
Affiliazioni autoriUniversity of Palermo, ICAR-CNR, University of Palermo
Autori CNR e affiliazioni
  • GIOVANNI PILATO
Lingua/e
  • inglese
AbstractThe goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a annotated corpus created specifically for text simplification. In this paper we further investigate on the role of the text representation, i.e. how different ways of representing the input text can affect the accuracy of the proposed system. In detail, we will use our Neural Network model for evaluating the sentence complexity using different kind of representations such as GloVe, Word2vec, FastTex and a new one based on a representation learning scheme.
Lingua abstractinglese
Altro abstract-
Lingua altro abstract-
Pagine da464
Pagine a470
Pagine totali-
RivistaProcedia computer science
Attiva dal 2010
Editore: Elsevier - Amsterdam
Paese di pubblicazione: Paesi Bassi
Lingua: inglese
ISSN: 1877-0509
Titolo chiave: Procedia computer science
Titolo proprio: Procedia computer science.
Numero volume della rivista145
Fascicolo della rivista-
DOI-
Verificato da refereeSì: Internazionale
Stato della pubblicazionePublished version
Indicizzazione (in banche dati controllate)-
Parole chiaveText Simplification, Natural Language Processing, Deep Neural Networks, Evaluation Sentence Complexity, Sentence Classification
Link (URL, URI)-
Titolo parallelo-
Licenza-
Scadenza embargo-
Data di accettazione-
Note/Altre informazioni-
Strutture CNR
  • ICAR — Istituto di calcolo e reti ad alte prestazioni
Moduli/Attività/Sottoprogetti CNR
  • DIT.AD007.014.001 : COGNITIVE ROBOTICS AND SOCIAL SENSING (CRSS)
Progetti Europei-
Allegati