model_info:
  name: anemll-DeepSeekR1-8B-ctx1024
  version: 0.2.0
  description: |
    Demonstrates running DeepSeekR1-8B on Apple Neural Engine
    Context length: 1024
    Batch size: 256
    Chunks: 8
  license: MIT
  author: Anemll
  framework: Core ML
  language: Python
  parameters:
    context_length: 1024
    batch_size: 256
    lut_embeddings: none
    lut_ffn: 6
    lut_lmhead: 6
    num_chunks: 8
    model_prefix: DeepSeek