AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


Speeding up tensor based recommenders with clustered tag space and improving quality of recommendations with Non-negative tensor factorization

Authors

;

Term

2. term

Publication year

2011

Submitted on

Pages

30

Abstract

Social tagging hjælper brugere med at mærke og kategorisere weblinks, billeder, film m.m. Disse tags afspejler brugernes præferencer og kan bruges til at anbefale nye elementer. Moderne sociale tagging‑systemer repræsenterer forbindelserne mellem tags, brugere og elementer som en tredjeordens tensor (en tredimensionel datastruktur). Ved at faktorisere denne tensor med metoder som Higher Order Singular Value Decomposition (HOSVD) og Canonical Decomposition (CD) kan man finde skjulte mønstre og relationer. Disse systemer samler dog meget store datamængder, hvilket gør tensorfaktorisering langsom og kræver meget hukommelse. Vi foreslår tre forbedringer: (1) reducere tag‑rummet ved at klustre semantisk lignende tags, så beregningerne bliver hurtigere og bruger mindre hukommelse, samtidig med at anbefalingskvaliteten bevares; (2) inddrage personlig forhåndsviden for at øge præcisionen i tensor‑baserede anbefalere; og (3) bruge ikke‑negativ tensorfaktorisering (NTF) for at undgå negative værdier, som er svære at fortolke. Til sidst kombinerer vi disse tilgange for at forbedre både anbefalingskvalitet og beregningstid og præsenterer lovende eksperimentelle resultater.

Social tagging lets people label and organize web links, photos, movies, and more. These tags reflect user preferences and help recommenders suggest new items. Modern social tagging systems represent the connections among tags, users, and items as a third‑order tensor (a three‑dimensional data structure). By factoring this tensor with methods such as Higher Order Singular Value Decomposition (HOSVD) and Canonical Decomposition (CD), we can uncover hidden patterns and relationships. However, these systems collect very large datasets, which make tensor factorization slow and memory‑intensive. We propose three improvements: (1) reduce the tag space by clustering semantically similar tags, so computations run faster and use less memory while keeping recommendation quality; (2) incorporate personal prior knowledge to increase the precision of tensor‑based recommenders; and (3) use Non‑negative Tensor Factorization (NTF) to avoid negative values that are hard to interpret. Finally, we combine these approaches to improve both recommendation quality and computation time, and we present promising experimental results.

[This abstract was generated with the help of AI]