Avg. Total Time
20.12s
Avg. TTFT
26.76s
Avg. Prefill TPS
2.04
Avg. Gen TPS
20.56
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
Merge
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
2/17/2025
No Model Read Me file available.