Mixture-of-Experts Architectures Through the Lens of Continual Learning

Mac Leod, Ian Coss

Mixture-of-Experts Architectures Through the Lens of Continual Learning

Ladda ner

Master Thesis MacLeod.pdf (3.93 MB)

Publicerad

2026

Författare

Mac Leod, Ian Coss

Typ

Examensarbete för masterexamen
Master's Thesis

Program

Complex adaptive systems (MPCAS), MSc

Sammanfattning

Mixture-of-experts architectures on a vision transformer backbone are compared against standard architectures for image classification in continual learning challenges with the constraints found in autonomous vehicle onboard systems and a novel routing algorithm is presented for improving MoE performance in this setting. Domain incremental learning without domain labels and class imbalanced datasets are used with continual learning and imbalanced learning metrics to describe when MoE architectures become useful and what advantages and drawbacks one should consider. Results show that MoE should be used in highly complex datasets with domain focused routing to improve the architectures natural resistance to catastrophic forgetting but with current MoE strategies, large gains are not yet realized. Suggestions for strategies to pair with MoE for continual learning are given alongside guidance for MoE training in this environment.

Ämne/nyckelord

Image classification, mixture of experts, deep learning, continual learning, domain incremental learning, new instance classification, vision transformers, geometric router

URI

https://hdl.handle.net/20.500.12380/311205

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Mixture-of-Experts Architectures Through the Lens of Continual Learning

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

Endorsement

Review

Supplemented By

Referenced By