Unsupervised Ensemble Learning

Friday, September 21, 2018 - 11:00am to Saturday, September 22, 2018 - 11:55am

Event Calendar Category

IDSS

Speaker Name

Boaz Nadler

Affiliation

Weizmann Institute

Building and Room number

E18-304

Abstract

In various applications, crowdsourcing in particular, one is given the advice or predictions of several classifiers of unknown reliability, over multiple questions or queries. This scenario is different from standard supervised learning where classifier accuracy can be assessed from available labeled training or validation data, and raises several questions: given only the predictions of several classifiers of unknown accuracies, over a large set of unlabeled test data, is it possible to:

a) reliably rank them, and

b) construct a meta-classifier more accurate than any individual classifier in the ensemble?

In this talk we’ll show that under various independence assumptions between classifier errors, this high dimensional data hides simple low dimensional structures. Exploiting these, we will present simple spectral methods to address the above questions, and derive new unsupervised spectral meta-learners. We’ll prove these methods are asymptotically consistent when the model assumptions hold, and present their empirical success on a variety of unsupervised learning problems.