Avg. Total Time: 15.15s
Avg. TTFT: 4.76s
Avg. Prefill TPS: 412.76
Avg. Gen TPS: 20.80
Context Size: 32768
Quantization: r64
Engine: aphrodite
Creation Method: LoRA Finetune
Model Type: Llama70B
Chat Template: Llama 3
Reasoning: No
Vision: No
Parameters: 70B
Added At: 1/13/2025

This model is a fine-tune of Llama3.1-Nemotron-70B-Instruct, designed to improve its roleplaying and story writing abilities. It improved on both while retaining the base model's intelligence, instruction following, and reasoning skills. In my general tests, I mostly preferred this model's outputs over those of Nemotron-70B-Instruct, especially for story writing, where it truly stood out.
CHARACTER CARD RESPONSE EXAMPLE:

SCENARIO/ADVENTURE TYPE CARD EXAMPLE:


❕The odd bolding and spacing in the examples above are due to the cropping. I don't know why that happens.❕
SILLYTAVERN PRESET:
I recommend using the preset I made for this model: Ppoyaa/MythoNemo-Preset
REASONING

STORYTELLING

Big thanks to mradermacher for the quants:
Static: mradermacher/MythoNemo-L3.1-70B-v1.0-GGUF
Weighted/Imatrix: mradermacher/MythoNemo-L3.1-70B-v1.0-i1-GGUF