Einstein-v6.1-Llama3-8B-executorch / Einstein-v6.1-Llama3-8B_kv2_sdpa_xnn_qe_4_32_ctx4096.pte

Commit History

Add PTE files for context sizes 2048, 4096, 8192
e65f41b

l3utterfly commited on