On Optimal Feature Orderings In Bayesian NEtwork Classifiers

home › theses & projects › On Optimal Feature Orderings In Bayesian NEtwork Classifiers

On Optimal Feature Orderings In Bayesian NEtwork Classifiers

Status

Finished

Type

Master Thesis

Announcement date

01 Nov 2021

Student

Christian Oswald

Mentors

Research Areas

Intelligent Systems

Abstract

Bayesian Network Classifiers (BNCs) provide a versatile and transparent approach for supervised machine learning. There exist multiple strategies to learn such a network’s structure from data, providing additional insight on the dataset’s structure while retaining a small model size. However, learning the network’s structure becomes more difficult when it is trained on discriminative as opposed to generative objectives. Roth and Pernkopf proposed a gradient-based training method for tree-augmented naive (TAN) Bayes structures which is agnostic to the loss function specified, while learning the network’s parameters and structure simultaneously. This thesis extends their approach by answering one of their open questions regarding the ordering of input features, which had a profound impact on their results. With the order-free approach presented in this thesis, performance is at least as good as with random feature orderings used by Roth and Pernkopf. Furthermore, the versatility of Bayesian networks is leveraged by adding support for missing features, which is done through computation of exact marginal probabilities.