Speeding up tensor based recommenders with clustered tag space and improving quality of recommendations with Non-negative tensor factorization

Authors

Leginus, Martin ; Žemaitis, Valdas

Term

2. term

Education

Computer Science, Master

Publication year

2011

Submitted on

2011-05-31

Abstract

Social tagging lets people label and organize web links, photos, movies, and more. These tags reflect user preferences and help recommenders suggest new items. Modern social tagging systems represent the connections among tags, users, and items as a third‑order tensor (a three‑dimensional data structure). By factoring this tensor with methods such as Higher Order Singular Value Decomposition (HOSVD) and Canonical Decomposition (CD), we can uncover hidden patterns and relationships. However, these systems collect very large datasets, which make tensor factorization slow and memory‑intensive. We propose three improvements: (1) reduce the tag space by clustering semantically similar tags, so computations run faster and use less memory while keeping recommendation quality; (2) incorporate personal prior knowledge to increase the precision of tensor‑based recommenders; and (3) use Non‑negative Tensor Factorization (NTF) to avoid negative values that are hard to interpret. Finally, we combine these approaches to improve both recommendation quality and computation time, and we present promising experimental results.

Social tagging hjælper brugere med at mærke og kategorisere weblinks, billeder, film m.m. Disse tags afspejler brugernes præferencer og kan bruges til at anbefale nye elementer. Moderne sociale tagging‑systemer repræsenterer forbindelserne mellem tags, brugere og elementer som en tredjeordens tensor (en tredimensionel datastruktur). Ved at faktorisere denne tensor med metoder som Higher Order Singular Value Decomposition (HOSVD) og Canonical Decomposition (CD) kan man finde skjulte mønstre og relationer. Disse systemer samler dog meget store datamængder, hvilket gør tensorfaktorisering langsom og kræver meget hukommelse. Vi foreslår tre forbedringer: (1) reducere tag‑rummet ved at klustre semantisk lignende tags, så beregningerne bliver hurtigere og bruger mindre hukommelse, samtidig med at anbefalingskvaliteten bevares; (2) inddrage personlig forhåndsviden for at øge præcisionen i tensor‑baserede anbefalere; og (3) bruge ikke‑negativ tensorfaktorisering (NTF) for at undgå negative værdier, som er svære at fortolke. Til sidst kombinerer vi disse tilgange for at forbedre både anbefalingskvalitet og beregningstid og præsenterer lovende eksperimentelle resultater.

[This abstract has been rewritten with the help of AI based on the project's original abstract]

Keywords

personal tensor factorization ; clustering ; tags ; higher order singular value decomposition ; non-negative tensor factorization ; prior knowledge ; parallel

Documents

Download PDF
View record in AAU Student Projects

Other projects by the authors

Leginus, Martin:

Analysis of the recommendation systems based on the tensor factorization techniques, experiments and the proposals (2011)

Žemaitis, Valdas:

Analysis of the recommendation systems based on the tensor factorization techniques, experiments and the proposals (2011)

A master's thesis from Aalborg University

Speeding up tensor based recommenders with clustered tag space and improving quality of recommendations with Non-negative tensor factorization