Home |  English version |  Mappa |  Commenti |  Sondaggio |  Staff |  Contattaci Cerca nel sito  
Istituto di linguistica computazionale "Antonio Zampolli"

Torna all'elenco Contributi in atti di convegno anno 2012

Contributo in atti di convegno

Tipo: Contributo in atti di convegno

Titolo: Creation of a bottom-up corpus-based ontology for Italian Linguistics

Anno di pubblicazione: 2012

Formato: Elettronico

Autori: Elisa Bianchi, Mirko Tavosanis, Emiliano Giovannetti

Affiliazioni autori: Università di Pisa, Istituto di Linguistica Computazionale "A. Zampolli" - CNR

Autori CNR:

  • EMILIANO GIOVANNETTI

Lingua: inglese

Abstract: This paper describes the steps of construction of a shallow lexical ontology of Italian Linguistics in Italian, set to be used by a meta-search engine for query refinement. The ontology was constructed with the software Protege 4.0.2 and encoded in OWL format; its construction has been carried out following the steps described in the well-known Ontology Learning From Text (OLFT) layer cake. The starting point was the automatic term extraction from a corpus of web documents concerning the domain of interest (304,000 words); as regards corpus construction, we describe the main criteria of the web documents selection and its critical points, concerning the definition of user profile and of degrees of specialisation. We then describe the process of term validation and construction of a glossary of terms of Italian Linguistics; afterwards, we outline the identification of synonymic chains and the main criteria of ontology design: top classes of ontology are Concept (containing taxonomy of concepts) and Term (containing terms of the glossary as instances), while concepts are linked through part-whole and involved-role relation, both borrowed from Wordnet. Finally, we show some examples of the application of the ontology for query refinement.

Lingua abstract: inglese

Pagine da: 2641

Pagine a: 2647

Pagine totali: 7

Titolo del volume: Language Resources and Evaluation

Editore: European Language Resources Association ELRA, Paris (FRA)

Referee: Sì: Internazionale

Stato della pubblicazione: Published version

Indicizzato da: ISI Web of Science (WOS) [000323927702118]

Parole chiave:

  • Ontologies
  • Italian Linguistics
  • Query refinement

Congresso nome: LREC 2012 - Eight International Conference on Language Resources and Evaluation

Congresso luogo: Istanbul

Congresso data: 23-25 maggio 2012

Congresso rilevanza: Internazionale

Congresso relazione: Contributo

Strutture CNR:

Moduli:

 
Torna indietro Richiedi modifiche Invia per email Stampa
Home Il CNR  |  I servizi News |   Eventi | Istituti |  Focus