Wednesday, May 7, 2025 - 4:00pm
Event Calendar Category
LIDS & Stats Tea
Speaker Name
Eric Weine
Affiliation
LIDS / DFCI
Building and Room number
32-D650
Building and Room Number
LIDS Lounge
"A new family of Poisson non-negative matrix factorization models using the shifted log link function"
Non-negative matrix factorization (NMF) is a widely used method to find interpretable, parts-based decompositions of count data. Despite its wide use, little attention has been paid to subtle but important differences between widely used NMF models. In particular, some NMF models assume that the relationship between the latent loadings / factors and the data is additive, whereas other models assume this relationship to be primarily multiplicative. Here, we show that this difference can greatly affect the underlying representations of NMF models and their downstream interpretation. In addition, we introduce a novel Poisson non-negative matrix factorization model that can be tuned to represent either additive OR multiplicative effects. This model formalizes some heuristic approaches currently applied to single cell and text data, and allows us to prove connections between these approaches. Finally, we develop an efficient computational approximation that massively accelerates the fitting of our NMF model in the presence of sparse data.
Eric Weine is a second year PhD student in EECS co-advised by Tamara Broderick (LIDS) and Rafael Irizarry (DFCI). Prior to coming to MIT, Ericcompleted a BS and MS in statistics at the University of Chicago, where he was advised by Mary Sara McPeek and Matthew Stephens. His primary research interest is in developing improved statistical / machine learning methods to glean biological insights from modern high-throughput genomic data.