Home |  English version |  Mappa |  Commenti |  Sondaggio |  Staff |  Contattaci Cerca nel sito  
Istituto di linguistica computazionale "Antonio Zampolli"

Torna all'elenco Contributi in atti di convegno anno 2012

Contributo in atti di convegno

Tipo: Contributo in atti di convegno

Titolo: Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser

Anno di pubblicazione: 2012

Formato: Elettronico

Autori: Rubino, Francesco; Frontini, Francesca; Quochi, Valeria

Affiliazioni autori: Istituto di Linguistica Computazionale "A. Zampolli", CNR, Pisa

Autori CNR:

  • FRANCESCA FRONTINI
  • VALERIA QUOCHI
  • FRANCESCO RUBINO

Lingua: inglese

Abstract: The present paper tackles the issue of PoS tag conversion within the framework of a distributed web service platform for the automatic creation of language resources. PoS tagging is now considered a "solved problem"; yet, because of the differences in the tagsets, interchange of the various PoS taggers vailable is still hampered. In this paper we describe the implementation of a PoS-tagged-corpus converter, which is needed for chaining together in a workflow the FreeLing PoS tagger for Italian and the DESR dependency parser, given that these two tools have been developed independently. The conversion problems experienced during the implementation, related to the properties of the different tagsets and of tagset conversion in general, are discussed together with the solutions adopted. Finally, the converter is evaluated by assessing the impact of conversion on the performance of the dependency parser by comparing with the outcome of the native pipeline. From this we learn that in most cases parsing errors are due to actual tagging errors, and not to conversion itself. Besides, information on accuracy loss is an important feature in a distributed environment of (NLP) services, where users need to decide which services best suit their needs

Lingua abstract: inglese

Pagine da: 2125

Pagine a: 2131

Pagine totali: 7

Titolo del volume: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)

Curatore/i del volume: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet U?ur Do?an, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis

ISBN: 9782951740877

Editore: European language resources association (ELRA), Paris (FRA)

Referee: Sì: Internazionale

Stato della pubblicazione: Published version

Indicizzato da:

  • ISI Web of Science (WOS) [000323927702032]
  • PUMA [/cnr.ilc/2012-A3-006]

Parole chiave:

  • PoS tag conversion
  • interoperability
  • NLP pipelines

URL: http://www.lrec-conf.org/proceedings/lrec2012/summaries/726.html

Congresso nome: Language Resources and Evaluation Conference 2012

Congresso luogo: Istanbul, Turchia

Congresso data: 23-25 Maggio 2012

Congresso rilevanza: Internazionale

Congresso relazione: Contributo

Strutture CNR:

Moduli:

 
Torna indietro Richiedi modifiche Invia per email Stampa
Home Il CNR  |  I servizi News |   Eventi | Istituti |  Focus