Revealing and Leveraging the Visual Information in Diffusion Models - presented by Prof. Deepti Ghadiyaram

Revealing and Leveraging the Visual Information in Diffusion Models

Prof. Deepti Ghadiyaram

Abstract

Speaker

AI Outline

Discussion

Revealing and Leveraging the Visual Information in Diffusion Models

Deepti Ghadiyaram

Boston University

Generating high-quality photo-realistic and creative visual content using diffusion models is a thriving area of research. In this talk, I will focus not on the generation process, but on understanding and leveraging the rich visual semantic information represented within diffusion models. Specifically, I will present our work that uses mechanistic interpretability tools such as k-sparse autoencoders (k-SAE) to probe various layers and denoising timesteps of different diffusion architectures. Next, I will present how to uncover monosemantic interpretable concepts pertaining to safety and photographic styles and steer the generation process thereby offering more controllability to users.

JIVP Webinar Series

EURASIP Journal on Image and Video Processing

Cite as

D. Ghadiyaram (2025, June 4), Revealing and Leveraging the Visual Information in Diffusion Models

doi.org/10.52843/cassyni.hbyrhj

Share

Timestamped permalink

Details

Listed seminar This seminar is open to all

Recorded Available to all

Video length 1:02:51

Q&A Now closed

Disclaimer The views expressed in this seminar are those of the speaker and not necessarily those of the journal

Just published a paper?

Get noticed and cited with a Cassyni seminar.