Llama-3.3+(3.1v3.3)-70B-Gutenberg-Doppel

Creative Model

View on Hugging FaceBack to Models

Hourly Usage

Performance Metrics

Avg. Total Time

8.39s

Avg. TTFT

34.33s

Avg. Prefill TPS

42.67

Avg. Gen TPS

N/A

Model Information

Context Size

32768

Quantization

r64

Engine

aphrodite

Creation Method

LoRA Finetune

Model Type

Llama70B

Chat Template

Llama 3

Reasoning

No

Vision

No

Parameters

70B

Added At

12/22/2024


license: llama3.1 library_name: transformers base_model:


image/png

Llama3.1-Gutenberg-Doppel-70B

mlabonne/Hermes-3-Llama-3.1-70B-lorablated finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.

Method

ORPO tuned with 2x H100 for 3 epochs.

Thank you Schneewolf Labs for the compute.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

MetricValue
Avg.35.68
IFEval (0-Shot)70.92
BBH (3-Shot)52.56
MATH Lvl 5 (4-Shot)13.75
GPQA (0-shot)12.64
MuSR (0-shot)22.68
MMLU-PRO (5-shot)41.52