daiwei chen
daiweichen
AI & ML interests
representation learning, foundation models, preference learning
Recent Activity
New activity
12 days ago
meta-llama/Llama-3.2-1B: Attention doesn't work for all layers except for the first layer
Organizations
Models
None public yet
Datasets
None public yet