Llama-3.3-70B-Shakudo

license: llama3.3 base_model:

meta-llama/Llama-3.3-70B-Instruct

Created by Steelskull Steelskull → Support on Ko-fi

Model Information

L3.3-Shakudo-70b

Llama 3.3 Multi-Stage Merge 70b Parameters V0.8

Model Composition

Final Merge: L3.3-Shakudo-70b ▼

TheSkullery/L3.3-M1-Hydrargyrum-70B

TheSkullery/L3.3-M2-Hydrargyrum-70B
Model 1: L3.3-M1-Hydrargyrum-70B ▼

Sao10K/L3.1-70B-Hanami-x1

TheDrummer/Anubis-70B-v1

ArliAI/Llama-3.3-70B-ArliAI-RPMax-v1.4

BeaverAI/Shimmer-70B-v1a

TheDrummer/Fallen-Llama-3.3-70B-v1
Model 2: L3.3-M2-Hydrargyrum-70B ▼

Sao10K/Llama-3.3-70B-Vulpecula-r1

Sao10K/70B-L3.3-Cirrus-x1

EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0

LatitudeGames/Wayfarer-Large-70B-Llama-3.3

Sao10K/L3.3-70B-Euryale-v2.3
Base Model: L3.3-Cogmoblated-70B ▼

abacusai/Dracarys2-Llama-3.1-70B-Instruct

watt-ai/watt-tool-70B

deepcogito/cogito-v1-preview-llama-70B

TheDrummer/Anubis-70B-v1

SicariusSicariiStuff/Negative_LLAMA_70B

Ppoyaa/MythoNemo-L3.1-70B-v1.0

nbeerbower/Llama-3.1-Nemotron-lorablated-70B (Base)

Model Creation Process

L3.3-Shakudo-70b is the result of a multi-stage merging process by Steelskull, designed to create a powerful and creative roleplaying model with a unique flavor. The creation process involved several advanced merging techniques, including weight twisting, to achieve its distinct characteristics.

Stage 1: The Cognitive Foundation & Weight Twisting

The process began by creating a cognitive and tool-use focused base model, L3.3-Cogmoblated-70B. This was achieved through a `model_stock` merge of several models known for their reasoning and instruction-following capabilities. This base was built upon `nbeerbower/Llama-3.1-Nemotron-lorablated-70B`, a model intentionally "ablated" to skew refusal behaviors. This technique, known as weight twisting, helps the final model adopt more desirable response patterns by building upon a foundation that is already aligned against common refusal patterns.

Stage 2: The Twin Hydrargyrum - Flavor and Depth

Two distinct models were then created from the Cogmoblated base:

L3.3-M1-Hydrargyrum-70B: This model was merged using `SCE`, a technique that enhances creative writing and prose style, giving the model its unique "flavor." The Top_K for this merge were set at 0.22 .
L3.3-M2-Hydrargyrum-70B: This model was created using a `Della_Linear` merge, which focuses on integrating the "depth" of various roleplaying and narrative models. The settings for this merge were set at: (lambda: 1.1) (weight: 0.2) (density: 0.7) (epsilon: 0.2)

Final Stage: Shakudo

The final model, L3.3-Shakudo-70b, was created by merging the two Hydrargyrum variants using a 50/50 `nuslerp`. This final step combines the rich, creative prose (flavor) from the SCE merge with the strong roleplaying capabilities (depth) from the Della_Linear merge, resulting in a model with a distinct and refined narrative voice.

A special thank you to Nectar.ai for their generous support of the open-source community and my projects.

Additionally, a heartfelt thanks to all the Ko-fi supporters who have contributed, your generosity is deeply appreciated and helps keep this work going and the Pods spinning.

Recommended Sampler Settings

Static Temperature: 1.0 - 1.2

Min P: 0.02 - 0.025

DRY:

- Multiplier: 0.8

- Base: 1.74

- Length: 4-6

Good Starting Templates & Prompts

Hamon v1 → by @Steel > Big-picture storytelling guide with world-building focus, set dialogue/narration split, and general writing rules.

Shingane v1 → by @Steel > Simplified sysprompt based on Hamon.

Kesshin v1 → by @Steel > A Hamon rethink using a Character-focused sys prompt that tracks what characters know and how they learn things, with strict interaction rules.

Kamae TTRPG v1 → by @Steel > TTRPG Game Master framework emphasizing player agency, world consistency, and adaptive session management with mechanical integration.

Kamae lite v1 → by @Steel > Simplified sysprompt based on Kamae.

Support & Community:

Join Discord

Hourly Usage

Performance Metrics

Model Information

L3.3-Shakudo-70b

Model Information

L3.3-Shakudo-70b

Model Composition

Model Creation Process

Stage 1: The Cognitive Foundation & Weight Twisting

Stage 2: The Twin Hydrargyrum - Flavor and Depth

Final Stage: Shakudo

Recommended Sampler Settings

Good Starting Templates & Prompts

Support & Community: