AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


Combining Feature-based Kernel with Tree Kernel for Extracting Relations

Author

Term

10. term

Publication year

2012

Submitted on

Pages

40

Abstract

Denne afhandling præsenterer en sammensat kernel-metode til at udtrække semantiske relationer mellem navngivne entiteter (fx personer, organisationer og steder) i tekst. Kernel-metoder er maskinlæringsteknikker, der måler lighed mellem eksempler; ved at kombinere to kerner udnytter vi deres komplementære styrker. Den første er en lineær kernel, der bruger enkle sproglige kendetegn som afstanden mellem entiteterne, deres rækkefølge og ordtyper. Den anden er en træ-kernel, der sammenligner den del af sætningens syntaktiske træ, som udgør den korteste sti mellem de to entiteter. Vi undersøger også effekten af tidligere foreslåede forbedringer, herunder kontekstfølsomhed og latente (skjulte) annoteringer. På CoNLL-2004-datasættet for relationsudtræk opnår metoden resultater på niveau med datidens bedste tilgange på samme datasæt.

This thesis presents a composite kernel method for extracting semantic relations between named entities (such as people, organizations, and places) in text. Kernel methods are machine-learning techniques that compare examples via similarity functions; by combining two kernels, we leverage complementary strengths. The first is a linear kernel over simple linguistic cues, including the distance between the entities, their order, and word types. The second is a tree kernel that compares the portion of a sentence’s syntactic parse tree that forms the shortest path between the two entities. We also examine the impact of previously proposed enhancements, including context sensitivity and latent (hidden) annotations. On the CoNLL-2004 relation extraction dataset, the method achieves performance on par with prior state-of-the-art approaches on the same data.

[This abstract was generated with the help of AI]