This directory contains sparse autoencoders trained on activations at various points within gpt2-small, using Neel Nanda's open-source code. Each autoencoder was trained on 1B tokens from OpenWebText. A demo Colab notebook is here.

The autoencoders are named "gpt2-small_{feature_dict_size}_{point}_{layer}.pt", where:

"feature_dict_size" is the number of hidden neurons in the autoencoder
"point" is either "mlp_out" or "resid_pre"
"layer" is an integer from 0 to 11.
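
The naming scheme above can be sketched as a pair of small Python helpers that build and parse a checkpoint filename. This is only an illustration: the function names are hypothetical, the dictionary size 24576 in the usage note is an assumed example value rather than a size confirmed to exist in this directory, and the pattern is assumed to contain no whitespace.

```python
# Hypothetical helpers illustrating the naming scheme; not part of the original code.
VALID_POINTS = ("mlp_out", "resid_pre")
N_LAYERS = 12  # layers are numbered 0 to 11


def autoencoder_filename(feature_dict_size: int, point: str, layer: int) -> str:
    """Build a filename of the form gpt2-small_{feature_dict_size}_{point}_{layer}.pt."""
    if feature_dict_size <= 0:
        raise ValueError("feature_dict_size must be a positive integer")
    if point not in VALID_POINTS:
        raise ValueError(f"point must be one of {VALID_POINTS}")
    if not 0 <= layer < N_LAYERS:
        raise ValueError("layer must be an integer from 0 to 11")
    return f"gpt2-small_{feature_dict_size}_{point}_{layer}.pt"


def parse_autoencoder_filename(name: str) -> tuple:
    """Split a filename back into (feature_dict_size, point, layer)."""
    if not name.endswith(".pt"):
        raise ValueError("expected a .pt file")
    stem = name[: -len(".pt")]
    # The point itself contains an underscore, so split the prefix and size
    # from the left and the layer from the right.
    prefix, size, rest = stem.split("_", 2)
    point, layer = rest.rsplit("_", 1)
    if prefix != "gpt2-small" or point not in VALID_POINTS:
        raise ValueError(f"unrecognised filename: {name}")
    return int(size), point, int(layer)
```

For example, under the assumed dictionary size, `autoencoder_filename(24576, "mlp_out", 5)` gives `gpt2-small_24576_mlp_out_5.pt`, and `parse_autoencoder_filename` recovers the three components from that string.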
