AI Control with Mechanistic Interpretability

Mar 19, 2024

I’m excited to share our recent paper on approaching AI Control through Mechanistic Interpretability. You can read the full paper here.

M.I

gerard.boxo@estudiantat.upc.edu

gboxo
uadeoif

This blog is dedicated to discussing Mechanistic Interpretability.