model_info:
  name: anemll-Llama-3.2-1B-Instruct-ctx1546
  version: 0.1.1
  description: |
    Demonstrates running Llama-3.2-1B-Instruct on Apple Neural Engine
    Context length: 1546
    Batch size: 64
    Chunks: 2
  license: MIT
  author: Anemll
  framework: Core ML
  language: Python
  parameters:
    context_length: 1546
    batch_size: 64
    lut_embeddings: none
    lut_ffn: 4
    lut_lmhead: 6
    num_chunks: 2
    model_prefix: llama