Avg. Total Time
23.16s
Avg. TTFT
4.55s
Avg. Prefill TPS
30.52
Avg. Gen TPS
17.90
Context Size
32768
Quantization
r64
Engine
aphrodite
Creation Method
Merge
Model Type
Llama70B
Chat Template
Llama 3
Reasoning
No
Vision
No
Parameters
70B
Added At
12/22/2024
base_model:
Now the cute anime girl has your attention
Creator: SteelSkull
Name Legend:
L3.1 = Llama 3.1
MS = Model Stock
70B = its 70B
This model is a remake of the original astoria with modern models and context sizes its goal is to merge the robust storytelling of mutiple models while attempting to maintain intelligence.
Use Llama 3 Format or meth format (llama 3 refuses to work with stepped thinking but meth works)
GGUF Quant:
- bartowski: Combined-GGUF
- mradermacher: GGUF // Imat-GGUF
MODEL_NAME = "L3.1-MS-Astoria-70b-v2"
base_model: mlabonne/Llama-3.1-70B-Instruct-lorablated
merge_method: model_stock
dtype: bfloat16
models:
- model: migtissera/Tess-3-Llama-3.1-70B
- model: NeverSleep/Lumimaid-v0.2-70B
- model: Sao10K/L3.1-70B-Euryale-v2.2
- model: ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.2
- model: nbeerbower/Llama3.1-Gutenberg-Doppel-70B
If you wish to support: