Official AAPA release: processed training data and A-GRPO checkpoints for adversarially anchored preference alignment.
Jingleqian
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago: AAPA
updated a model 1 day ago: Jingleqian/AAPA-8B
Organizations
None yet