One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models
Applies sparse autoencoders to analyze text-to-image diffusion models, contributing to interpretability methods for modern generative systems.
Research
Applies sparse autoencoders to analyze text-to-image diffusion models, contributing to interpretability methods for modern generative systems.
Establishes a rigorous connection between tensor factorizations and probabilistic circuits, unifying model families and exposing new architecture-search opportunities.
Unifies overparameterized probabilistic circuit architectures and studies low-rank decompositions as a way to understand and compress expressive layers.
Supervised student research on characterizing and mitigating subliminal learning behavior in large language models.
Supervised student research investigating sink-register behavior in diffusion transformers.