Predictive Caching

dc.contributor.authorLizbat Lawrence, Nickey
dc.contributor.authorSebastian, Renjith
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data och informationstekniksv
dc.contributor.examinerSeger, Carl-Johan
dc.date.accessioned2019-12-19T09:21:38Z
dc.date.available2019-12-19T09:21:38Z
dc.date.issued2019sv
dc.date.submitted2019
dc.description.abstractThough digitization has revolutionized the entertainment industry, streaming services like Netflix, Spotify, etc. are the ones who made the content available to the users through hand-held devices. These services require an active internet connection to deliver the requested content to the user device, consuming the expensive mobile data subscriptions of the user. The aim of the thesis project is to optimize the mobile data usage by predicting the content a user is most likely to download so that it can be pre-fetched when the user’s device is connected to high bandwidth, less-expensive network. Different use cases were considered to identify the potential candidates that a user is most likely to download through mobile data subscription. First, users are highly probable to download the personalized content recommended by these services. Hence, the user behavior on personalized content was modeled using a Logistic Regression algorithm as a generic baseline approach. Second, the users tend to use multiple devices to stream content and it is very likely that they play the same content from different devices. This has a strong pre-cache potential in the context that contents viewed/listened to in one device could be used to predict the possible streaming behavior in the user’s other devices. Third, the users prefer to play contents from different playlists provided by streaming services. The third use case exploited the user behavior on playlists to predict the contents a user is likely to download in future. We employed a Gradient Boosting algorithm to model the device sync and playlist use cases. The results were evaluated using a generic evaluation metric defined solely for the purpose, and different use cases were compared. The device sync model predicted 15% of the potential savings that were identified through data analysis, whereas the playlist model predicted 30%.sv
dc.identifier.coursecodeDATX05sv
dc.identifier.urihttps://hdl.handle.net/20.500.12380/300609
dc.language.isoengsv
dc.setspec.uppsokTechnology
dc.subjectComputer sciencesv
dc.subjectengineeringsv
dc.subjectthesissv
dc.subjectmachine learningsv
dc.subjectpredictive cachingsv
dc.subjectuser behaviorsv
dc.subjectxgboostsv
dc.subjectlogistic regressionsv
dc.titlePredictive Cachingsv
dc.type.degreeExamensarbete för masterexamensv
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
CSE 19-114 Lawrence Sebastian.pdf
Storlek:
3.55 MB
Format:
Adobe Portable Document Format
Beskrivning:
CSE 19-114 Lawrence Sebastian
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
1.14 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: