Avg. Total Time
48.27s
Avg. TTFT
47.35s
Avg. Prefill TPS
215.96
Avg. Gen TPS
22.00
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
LoRA Finetune
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
12/26/2024
license: llama3.3 base_model:
Rombos-LLM-70b-Llama-3.3

You know the drill by now.
Here is the paper. Have fun.