Avg. Total Time
6.01s
Avg. TTFT
5.66s
Avg. Prefill TPS
2.83
Avg. Gen TPS
28.33
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
Merge
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
4/9/2025
thumbnail: >- https://cdn-uploads.huggingface.co/production/uploads/67c10cfba43d7939d60160ff/UqxZ1vskFE-90lvU0UJ6M.jpeg language:
A furry finetune model based on L3.3-Electra-R1-70b, elegant, yet suggestively draconic~
The following templates are recommended from the original Electra model page, adjust if needed:
Start Reply With:
'<think> OK, as an objective, detached narrative analyst, let's think this through carefully:'
Reasoning Formatting (no spaces):
'<think>''</think>'