Dialogue modeling using recurrent neural networks

Typ
Examensarbete för masterexamen
Master Thesis
Program
Publicerad
2016
Författare
Anderling, Viktor
Orrö, Jonathan
Modellbyggare
Tidskriftstitel
ISSN
Volymtitel
Utgivare
Sammanfattning
As the importance of computers in everyday life increases, so does the demand for better human-computer interfaces. Natural language, being our most natural form of communication, combined with man’s innate tendency of anthropomorphism, motivates the idea of a talking machine. Existing dialogue systems have the problem of being unable to answer out-of-domain questions as well as being tedious to design. While these systems are developed with hand-crafted rules, the goal of this thesis is to investigate if a dialogue system could be automatically trained to speak instead. Our aim is to test whether a model trained on a dialogue corpus can compare to existing dialogue systems. We trained a recurrent neural network using the sequenceto- sequence method, preserving the state of the model during the course of the conversation. The resulting network is end-to-end trainable. User testing was used to evaluate the model and compare it to the other dialogue systems. The final model can answer appropriately to common phrases such as greetings and valedictions. It also generates replies in correct English. However, the results do not stretch any further than that. Giving the model a more complicated input usually results in a nonsensical reply, which prevents it from having a coherent conversation with the user. We present a few hypotheses as to why we did not get better results, with suggestions on how they could be solved. We display high hopes for future work in the area and present a few suggestions of what could be the next steps.
Beskrivning
Ämne/nyckelord
Data- och informationsvetenskap, Computer and Information Science
Citation
Arkitekt (konstruktör)
Geografisk plats
Byggnad (typ)
Byggår
Modelltyp
Skala
Teknik / material