Avg. Total Time
40.20s
Avg. TTFT
16.21s
Avg. Prefill TPS
363.63
Avg. Gen TPS
13.86
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
LoRA Finetune
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
7/2/2025
No Model Read Me file available.