Weighted Set Containment in Redundant Data Sets

Grönvall, Johan; Wermensjö, Johan

Published

2017

Authors

Grönvall, Johan

Wermensjö, Johan

Type

Master Thesis

Programme

Computer science – algorithms, languages and logic (MPALG), MSc

Abstract

Given a set family F and a query set Q, where every element has a corresponding weight, weighted set containment is the problem of finding the set C ∈ F with the largest sum of weights such that C ⊆ Q. This problem was investigated at the request of Volvo Group IT, who rely heavily on weighted set containment queries in many of their applications. In this thesis we show that weighted set containment can be solved efficiently using trie-based preprocessing when applied to redundant data sets. We show that finding the most efficient trie that represents F is NP-complete, and we introduce a number of approximation algorithms. We show through empirical testing that some of our algorithms outperform state-of-the-art methods for similar problems when applied to Volvo’s particular data set.
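
The problem statement in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's algorithm: the element ordering (plain sorted order), the names `TrieNode`, `build_trie` and `best_contained`, and the example data are all assumptions made for illustration. Each set in F is stored as a sorted path in a trie, and a query follows only those branches whose element is in Q, so sets that share a prefix are examined together.

```python
class TrieNode:
    """One trie node; a root-to-node path spells out a sorted set."""

    def __init__(self):
        self.children = {}   # element -> TrieNode
        self.is_set = False  # True if the path to this node is a set in F


def build_trie(family):
    """Insert every set of the family as a path of its sorted elements.

    Sorted order is an arbitrary choice made for this sketch; the abstract
    notes that finding the most efficient trie is NP-complete.
    """
    root = TrieNode()
    for s in family:
        node = root
        for element in sorted(s):
            node = node.children.setdefault(element, TrieNode())
        node.is_set = True
    return root


def best_contained(root, query, weights):
    """Return (weight_sum, set) for the heaviest C in F with C a subset of
    the query, or None if no set in F is contained in the query."""
    query = set(query)
    best = None
    stack = [(root, 0, ())]  # (node, weight so far, elements on the path)
    while stack:
        node, total, path = stack.pop()
        if node.is_set and (best is None or total > best[0]):
            best = (total, frozenset(path))
        for element, child in node.children.items():
            if element in query:  # prune branches that leave the query set
                stack.append((child, total + weights[element], path + (element,)))
    return best


# Hypothetical example data, not taken from the thesis.
weights = {"a": 1, "b": 2, "c": 5, "d": 10}
family = [{"a", "b"}, {"a", "c"}, {"b"}, {"a", "b", "d"}]
trie = build_trie(family)
result = best_contained(trie, {"a", "b", "c"}, weights)
```

In this example the sets {a, b} and {a, b, d} share the path a, b in the trie, which is the kind of redundancy the abstract refers to; the branch to d is skipped as soon as d is found to be missing from the query.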

Subject/keywords

Computer and Information Science

URI

https://hdl.handle.net/20.500.12380/251546

Collections

Master's theses
