Using machine learning and natural language processing to automatically extract information from software documentation

ÓLAFSDÓTTIR, HELENA

Using machine learning and natural language processing to automatically extract information from software documentation

dc.contributor.author	ÓLAFSDÓTTIR, HELENA
dc.contributor.department	Chalmers tekniska högskola / Institutionen för data och informationsteknik	sv
dc.contributor.examiner	Berger, Christian
dc.contributor.supervisor	Chaudron, Michel
dc.date.accessioned	2019-08-21T12:30:22Z
dc.date.available	2019-08-21T12:30:22Z
dc.date.issued	2019	sv
dc.date.submitted	2019
dc.description.abstract	Engineers face many challenges when it comes to using and maintaining software documentation. The OD3 is a vision for the future of software documentation which proposes that documentation should be generated based on user queries. There are many steps that need to be taken to create such a system. This research takes one of those necessary steps by investigating the categories of software knowledge that are contained in software documentation, automatically classifying sentences from software documentation into those sentences, and exploring methods to identify sentence relations. This analysis was conducted on one case documentation. A system, Software Documentation Supporter (SDS), was then built to explore and evaluate the results. The aim of the SDS is to support the user when navigating through long software documentation. In the system, the user can choose from a list of questions, and the software knowledge extracted from the documentation is used to answer those questions. The results were evaluated using a quantitative and a qualitative approach. As the sample size of the evaluation was small, the quantitative results did not show a significant difference in the time it took users to solve tasks using the SDS, compared to using only the documentation. The qualitative results showed that participants did feel that the SDS supported them and that it helped them navigate the documentation, however it was also clear that improvements need to be made both in regards to the method, and the design of the system.	sv
dc.identifier.coursecode	DATX05	sv
dc.identifier.uri	https://hdl.handle.net/20.500.12380/300145
dc.language.iso	eng	sv
dc.setspec.uppsok	Technology
dc.subject	software	sv
dc.subject	documentation	sv
dc.subject	architecture	sv
dc.subject	requirement	sv
dc.subject	natural language processing	sv
dc.subject	classification	sv
dc.subject	clustering	sv
dc.title	Using machine learning and natural language processing to automatically extract information from software documentation	sv
dc.type.degree	Examensarbete för masterexamen	sv
dc.type.uppsok	H

Ladda ner

Original bundle

Visar 1 - 1 av 1

Namn:: CSE 19-61 Olafsdottir.pdf
Size:: 4.99 MB
Format:: Adobe Portable Document Format
Description:

Ladda ner

License bundle

Visar 1 - 1 av 1

Namn:: license.txt
Size:: 1.14 KB
Format:: Item-specific license agreed upon to submission
Description:

Ladda ner

Samlingar

Examensarbeten för masterexamen