Evaluation of Document and Search Query Processing Frameworks

dc.contributor.authorSvensson, Tobias
dc.contributor.departmentChalmers tekniska högskola / Institutionen för data- och informationsteknik (Chalmers)sv
dc.contributor.departmentChalmers University of Technology / Department of Computer Science and Engineering (Chalmers)en
dc.date.accessioned2019-07-03T13:28:55Z
dc.date.available2019-07-03T13:28:55Z
dc.date.issued2014
dc.description.abstractAs search becomes a vital cornerstone of any organization and as expectations and demands on ndability and search steadily increase, there is a need for high-performance, scalable and simple Text Processing Frameworks to implement document processing solutions. Today, there are many open source solutions available to this end. In this thesis, the processing frameworks GATE, UIMA, OpenPipeline, Hydra and Storm are analyzed and compared. We investigate the impact of parallelism and distribution on throughput and performance. Additionally, the possibilities and demands of performing Natural Language Processing tasks on real-time search queries is analyzed. The feasibility of using the processing frameworks for this task is investigated and the results are discussed. Finally, recommendations are made for which kind of system to implement for di erent use cases and improvements to existing systems are suggested.
dc.identifier.urihttps://hdl.handle.net/20.500.12380/202651
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectData- och informationsvetenskap
dc.subjectComputer and Information Science
dc.titleEvaluation of Document and Search Query Processing Frameworks
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster Thesisen
dc.type.uppsokH
local.programmeComputer science – algorithms, languages and logic (MPALG), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
202651.pdf
Storlek:
1.12 MB
Format:
Adobe Portable Document Format
Beskrivning:
Fulltext