Avg. Total Time: 9.42s
Avg. TTFT: 6.59s
Avg. Prefill TPS: 559.51
Avg. Gen TPS: 25.10
Context Size: 32768
Quantization: r64
Engine: aphrodite
Creation Method: Unknown
Model Type: Llama70B
Chat Template: Llama 3
Reasoning: No
Vision: No
Parameters: 70B
Added At: 12/22/2024
BeaverAI proudly presents...
A finetune of Nvidia's Llama 3.1 Nemotron 70B

Its breath is pure and healthy. It is an immense desert, where man is never lonely, for he feels life stirring on all sides.
"Q6 is pretty good" - Bertro
*action* Dialogue *thoughts* Dialogue *narration* in 1st person PoV
Thank you Gargy!