Logo do repositório
 
A carregar...
Logótipo do projeto
Projeto de investigação

MEDON - Using ontologies to model data and medical procedures [PTDC/EIA/80772/2006]

Financiador

Autores

Publicações

Discovery of noun semantic relations based on sentential context analysis
Publication . Moraliyski, Rumen Valentinov; Dias, Gaël Harry Adélio André
The last years saw a surge in the statistical processing of natural language and in particular in corpus based methods oriented to language acquisition. Polysemy is pointed at as the main obstacle to many tasks in the area and to thesaurus construction in particular. This dissertation summarizes the current results of a work on automatic synonymy discovery. The accent is focused on the difficulties that spring from polysemy and on linguistically and empirically motivated means to deal with it. In particular, we propose an unsupervised method to identify word usage profiles pertinent to specific word meanings. Further, we show that the routine to verify every possibility in search of semantic relations is not only computationally expensive but is rather counterproductive. As a consequence, we propose an application of a recently developed system for paraphrases extraction and alignment so that the exhaustive search is avoided in an unsupervised manner. This led to a method, that creates short lists of pairs of words that are highly probable to be in synonymy relation. The results show that the negative impact of polysemy is significantly reduced for part of the polysemy specter that covers about two thirds of the vocabulary. Besides the increased probability to discover frequently manifested synonymy relations, paraphrase alignment proved to highlight infrequent word meanings, and to reliably identify a set of very specific semantic relations.

Unidades organizacionais

Descrição

Palavras-chave

ontologies,knowledge representation,clinical data modelling,natural language processing, Exact sciences ,Exact sciences/Computer and information sciences

Contribuidores

Financiadores

Entidade financiadora

Fundação para a Ciência e a Tecnologia, I.P.

Programa de financiamento

Concurso para Projectos de I&D em todos os Domínios Científicos - 2006

Número da atribuição

PTDC/EIA/80772/2006

ID