tim-lawson 's Collections

Single-Layer SAEs

TopK SAEs trained on the residual stream activation vectors from a single transformer layer.