A Webpage Structure Processing Algorithm - Extending the Page Tailor Toolkit

dc.contributor.authorAndrén, Lars
dc.contributor.departmentChalmers tekniska högskola / Institutionen för tillämpad informationsteknologi (Chalmers)sv
dc.contributor.departmentChalmers University of Technology / Department of Applied Information Technology (Chalmers)en
dc.date.accessioned2019-07-03T12:12:02Z
dc.date.available2019-07-03T12:12:02Z
dc.date.issued2007
dc.description.abstractResearch in user preference-based automatic processing on the web, web page content adaptation for a small screen and informative value of web pages have resulted in the design and implementation of an algorithm, called the Domain Heritage-algorithm. This algorithm extends the functionality of the Page Tailor toolkit; a program that is the result of C-Y Tsai’s thesis “Web Page Tailoring Tool for Mobile Devices”. The algorithm extending the toolkit enables automatic processing of web pages where preferences on which parts to be displayed have not been stored. The Domain Heritage-algorithm will not work unless at least one web page of the specific domain visited has been personalised previously. This extended toolkit has then been tested on ten subjects and a number of web sites. The test results were pretty much in accordance with the expectations, but the test subjects’ experience in using the Page Tailor toolkit was found to be quite influential on the rate of successful running of the algorithm. Three major conclusions are made. The first one is that too much editing of the appearance of web page content can result in loss of informative value and successful totally automatic extraction of web page content needs semantic processing. Further, XPaths has been a good choice of data for the algorithm to process as the results of the Big Oanalysis of the running time were acceptable, and that it was possible to implement the algorithm in the existing software. Finally, previous experience in usage of the Page Tailor toolkit, as well as more than one personalised web page is essential to the successful running of the Domain Heritagealgorithm.
dc.identifier.urihttps://hdl.handle.net/20.500.12380/74444
dc.language.isoeng
dc.relation.ispartofseriesMaster thesis - Technical Communication, Centre for Digital Media and higher education, Chalmers University of Technology : 2007:2
dc.setspec.uppsokHumanitiesTheology
dc.subjectInformation Technology
dc.subjectInformationsteknik
dc.titleA Webpage Structure Processing Algorithm - Extending the Page Tailor Toolkit
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster Thesisen
dc.type.uppsokH
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
74444.pdf
Storlek:
1.63 MB
Format:
Adobe Portable Document Format
Beskrivning:
Fulltext