Llama-3.3-70B-Progenitor-V1.1

Creative Model

View on Hugging FaceBack to Models

Hourly Usage

Performance Metrics

Avg. Total Time

11.04s

Avg. TTFT

10.15s

Avg. Prefill TPS

2.24

Avg. Gen TPS

24.70

Model Information

Context Size

32768

Quantization

r64

Engine

aphrodite

Creation Method

Merge

Model Type

Llama70B

Chat Template

Llama 3

Reasoning

No

Vision

No

Parameters

70B

Added At

2/3/2025


base_model:

  • EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
  • Sao10K/L3.1-70B-Hanami-x1
  • Sao10K/70B-L3.3-Cirrus-x1
  • TheDrummer/Anubis-70B-v1
  • nbeerbower/Llama-3.1-Nemotron-lorablated-70B
  • SicariusSicariiStuff/Negative_LLAMA_70B library_name: transformers tags:
  • mergekit
  • merge license: llama3.3

image/png

This model is part of a series of experiments in merging some of my favorite Llama models, an idea which was based on the excellent Steelskull/L3.3-MS-Nevoria-70b merge, just with a couple of extra ingredients and different merge methods. Here I tried a Della Linear merge with aggressive parameters. The results came out really nice, I really enjoy this model.

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the della_linear merge method using nbeerbower/Llama-3.1-Nemotron-lorablated-70B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: Sao10K/L3.1-70B-Hanami-x1
    parameters:
      weight: 0.20
      density: 0.7
  - model: Sao10K/70B-L3.3-Cirrus-x1
    parameters:
      weight: 0.20
      density: 0.7
  - model: SicariusSicariiStuff/Negative_LLAMA_70B
    parameters:
      weight: 0.20
      density: 0.7
  - model: TheDrummer/Anubis-70B-v1
    parameters:
      weight: 0.20
      density: 0.7
  - model: EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
    parameters:
      weight: 0.20
      density: 0.7
merge_method: della_linear
base_model: nbeerbower/Llama-3.1-Nemotron-lorablated-70B
parameters:
  epsilon: 0.2
  lambda: 1.1
dtype: bfloat16
tokenizer_source: base

Support on KO-FI <3