Mining Relations from Git Repositories- Applying Relation Extraction Technology to Git Commit Messages
dc.contributor.author | Andersson, Rikard | |
dc.contributor.department | Chalmers tekniska högskola / Institutionen för data- och informationsteknik (Chalmers) | sv |
dc.contributor.department | Chalmers University of Technology / Department of Computer Science and Engineering (Chalmers) | en |
dc.date.accessioned | 2019-07-03T13:46:02Z | |
dc.date.available | 2019-07-03T13:46:02Z | |
dc.date.issued | 2014 | |
dc.description.abstract | Text data can contain valuable information that is unavailable at a larger scale due to the unstructured nature of free text. Git repositories, and the Git commit messages within them, are one such category of unstructured text data. Relation Extraction (RE) has enjoyed success as a solution to similar problems in the generic case and in more specialized domains such as the life sciences. RE does, however, remain largely untested on text data from Git repositories. This thesis contributes to RE and Software Engineering research by testing RE solutions developed for the generic problem on the domain-specific problem of Git commit messages. An experiment is conducted in which a custom-made relation extractor is tested on hand-annotated Git commit messages drawn from popular public projects on GitHub. The results show that common RE solutions and their models cannot be directly applied to data from Git commit messages, because these messages are expressed in a highly domain-specific language. This calls for future efforts to develop domain-specific tools and models. | |
dc.identifier.uri | https://hdl.handle.net/20.500.12380/220542 | |
dc.language.iso | eng | |
dc.setspec.uppsok | Technology | |
dc.subject | Data- och informationsvetenskap | |
dc.subject | Informations- och kommunikationsteknik | |
dc.subject | Computer and Information Science | |
dc.subject | Information & Communication Technology | |
dc.title | Mining Relations from Git Repositories- Applying Relation Extraction Technology to Git Commit Messages | |
dc.type.degree | Examensarbete för masterexamen | sv |
dc.type.degree | Master Thesis | en |
dc.type.uppsok | H | |
local.programme | Software engineering and technology (MPSOF), MSc |
Download
Original bundle
- Name: 220542.pdf
- Size: 702.94 KB
- Format: Adobe Portable Document Format
- Description: Full text