Revealing and Leveraging the Visual Information in Diffusion Models
Prof. Deepti Ghadiyaram
Abstract
Speaker
AI Outline
Discussion
Revealing and Leveraging the Visual Information in Diffusion Models 
Deepti Ghadiyaram
Boston University
Generating high-quality photo-realistic and creative visual content using diffusion models is a thriving area of research. In this talk, I will focus not on the generation process, but on understanding and leveraging the rich visual semantic information represented within diffusion models. Specifically, I will present our work that uses mechanistic interpretability tools such as k-sparse autoencoders (k-SAE) to probe various layers and denoising timesteps of different diffusion architectures. Next, I will present how to uncover monosemantic interpretable concepts pertaining to safety and photographic styles and steer the generation process thereby offering more controllability to users.
JIVP Webinar Series
EURASIP Journal on Image and Video ProcessingCite as
D. Ghadiyaram (2025, June 4), Revealing and Leveraging the Visual Information in Diffusion Models
Share
Details
Listed seminar This seminar is open to all
Recorded Available to all
Video length 1:02:51
Q&A Now closed
Disclaimer The views expressed in this seminar are those of the speaker and not necessarily those of the journal