Avg. Total Time
62.86s
Avg. TTFT
62.38s
Avg. Prefill TPS
1.74
Avg. Gen TPS
20.66
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
Merge
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
3/17/2025
thumbnail: https://cdn-uploads.huggingface.co/production/uploads/633e85093a17ab61de8d9073/FGK0qBGmELj6DEUxbbrdR.png base_model:
New merge. This an experiment to increase the "Madness" in a model. Merge is based on top UGI-Bench models (So yeah, I would think this would be benchmaxxing.)
This is the second time I'm using SCE. The previous MagicalGirl model seems to be quite happy with it.
Added KaraKaraWitch/Llama-MiraiFanfare-3.3-70B based on feedback I got from others (People generally seem to remember this rather than other models). So I'm not sure how this would play into the merge.
This is a merge of pre-trained language models created using mergekit.
Pretty interesting. As of 05/03/25, it's in the top 10th:
| Bench | Results |
|---|---|
| UGI-Score | 52.48 / 100 |
| Unruly | 3.8 / 10 |
| Internet | 5.1 / 10 |
| Society | 5.4 / 10 |
| Willing | 7 / 10 |
| NatInt | 41.86 / 100 |
| Coding | 22 |
| Politial Lean | −3.9% (Liberalism) |
This model was merged using the SCE merge method using KaraKaraWitch/Llama-3.X-Workout-70B as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: SicariusSicariiStuff/Negative_LLAMA_70B
- model: TheDrummer/Anubis-70B-v1
- model: KaraKaraWitch/Llama-MiraiFanfare-3.3-70B
- model: Black-Ink-Guild/Pernicious_Prophecy_70B
- model: LatitudeGames/Wayfarer-Large-70B-Llama-3.3
merge_method: sce
base_model: KaraKaraWitch/Llama-3.X-Workout-70B
parameters:
select_topk: 1.0
dtype: bfloat16