crypto-drv4-rl-r5-v9

PEFT LoRA adapter for inclusionAI/DR-Venus-4B-RL, fine-tuned for crypto pump prediction: a binary Yes/No call on whether a coin makes a +15% move over 7 days.

v9 results (gate: PASSED)

Metric	v9	v8
Raw MCC	+0.2049 (CI [+0.1797, +0.2288])	+0.1560
Threshold-tuned MCC	+0.2040	—
Platt-calibrated MCC	+0.1243	—
AUC-ROC	0.694	n/a (v8 collapsed by saturation)
AUC-PR	0.278	—
Accuracy	73.8%	—
F1	0.335	—
Saturation@95	0.00%	~100% in v8
ECE	0.267 (raw) / 0.019 (Platt)	—
Brier	0.191	—
n_test	8000 (5898 coin-holdout unseen)	—
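The card does not define Saturation@95 or ECE, so the sketch below uses common definitions as an assumption: saturation is the share of predictions whose confidence max(p, 1-p) reaches 0.95, and ECE is the equal-width-bin expected calibration error on the Yes probability. The function names and the toy inputs are illustrative only and are not taken from the evaluation code.

```python
def saturation_at(probs, level=0.95):
    """Fraction of predictions with confidence max(p, 1 - p) >= level (assumed definition)."""
    return sum(max(p, 1.0 - p) >= level for p in probs) / len(probs)


def expected_calibration_error(probs, labels, n_bins=10):
    """Equal-width-bin ECE on the positive-class probability (assumed definition)."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 falls in the last bin
        bins[idx].append((p, y))
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_p = sum(p for p, _ in members) / len(members)
        frac_pos = sum(y for _, y in members) / len(members)
        ece += len(members) / len(probs) * abs(mean_p - frac_pos)
    return ece


# Toy inputs for illustration only (not model outputs).
toy_probs = [0.99, 0.98, 0.62, 0.12]
toy_labels = [1, 0, 1, 0]
toy_saturation = saturation_at(toy_probs)
toy_ece = expected_calibration_error(toy_probs, toy_labels)
```

Under these definitions, a model that answers almost every example with near-certain probability scores close to 100% on saturation regardless of accuracy, which is the failure mode the table reports for v8.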

Improvement: raw MCC is up 31.3% relative to v8 (+0.1560 -> +0.2049).
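For reference, the Matthews correlation coefficient and the relative-improvement arithmetic can be sketched as follows. The MCC formula is the standard confusion-matrix form; the helper names are hypothetical, and the card does not publish the confusion counts, so only the two reported MCC values are used as inputs.

```python
import math


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # conventional value when any marginal is empty
    return (tp * tn - fp * fn) / denom


def relative_gain(new, old):
    """Relative change of `new` over `old`, as a fraction."""
    return (new - old) / old


# Raw MCC values reported in the table above.
gain = relative_gain(0.2049, 0.1560)
print(f"relative gain: {gain:.1%}")
```

MCC ranges from -1 to +1 and stays informative under class imbalance, which matters here because only about one example in seven is a Yes.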

Why v9 beats v8

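Calibrated CE loss (label_smoothing=0.05, pos_weight=6, conf_penalty=0.01): probabilities no longer saturate (Saturation@95 falls from ~100% to 0.00%).
PEFT merge_and_unload() is called before eval, which fixes the F38 multi-GPU eval bug.
dataset_v9_v2 uses a coin-holdout split: 15% of coins never appear in train.
Post-hoc Platt scaling plus threshold tuning recovers signal: ECE drops from 0.267 (raw) to 0.019 (Platt), and threshold-tuned MCC is +0.2040.
Natural class balance of 14.3% Yes (at the +15% threshold), versus 35% Yes from oversampling in v8.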
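The card names the loss hyperparameters but not the exact formula. The sketch below is one plausible reading, stated as an assumption: binary cross-entropy on a single Yes logit, with smoothed targets, a weight on the positive term (the convention used by PyTorch's BCEWithLogitsLoss pos_weight), and an entropy bonus that penalizes over-confident outputs. The function name and the per-example form are illustrative, not the training code.

```python
import math


def _softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def calibrated_bce(logit, label, label_smoothing=0.05, pos_weight=6.0, conf_penalty=0.01):
    """Per-example loss under the assumed form described above."""
    log_p = -_softplus(-logit)      # log sigmoid(logit)
    log_not_p = -_softplus(logit)   # log (1 - sigmoid(logit))
    p = math.exp(log_p)

    # Label smoothing pulls hard 0/1 targets toward 0.5.
    target = label * (1.0 - label_smoothing) + 0.5 * label_smoothing

    # Weighted cross-entropy: the positive term is scaled by pos_weight.
    ce = -(pos_weight * target * log_p + (1.0 - target) * log_not_p)

    # Confidence penalty: subtracting entropy rewards less extreme outputs.
    entropy = -(p * log_p + (1.0 - p) * log_not_p)
    return ce - conf_penalty * entropy
```

With these settings a correct but extreme logit costs more than a correct moderate one, so the optimizer has no incentive to push probabilities to 0 or 1. A missed positive is also far more expensive than a false alarm, which compensates for the 14.3% Yes rate without oversampling.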

Usage

import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_ID = "inclusionAI/DR-Venus-4B-RL"
ADAPTER_ID = "majid2230/crypto-drv4-rl-r5-v9"

# Load the base model in bfloat16, placed automatically across available devices.
base = AutoModelForCausalLM.from_pretrained(BASE_ID, torch_dtype=torch.bfloat16, device_map="auto")
# Attach the LoRA adapter, then merge its weights into the base model before eval.
model = PeftModel.from_pretrained(base, ADAPTER_ID)
model = model.merge_and_unload()
tok = AutoTokenizer.from_pretrained(BASE_ID)

For best results, apply Platt scaling (a=0.8619474204681601, b=-1.4940341365084346) to the raw scores, then tune the decision threshold.
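A minimal sketch of that post-processing, assuming the Platt map is p = sigmoid(a * s + b) and that the raw score s is the logit margin between the Yes and No answers. The card gives only a and b; the choice of input score, the function names, and the threshold argument are assumptions, and the tuned threshold value is not stated here, so it is left as a required parameter.

```python
import math

# Platt coefficients quoted in this card.
PLATT_A = 0.8619474204681601
PLATT_B = -1.4940341365084346


def yes_margin(logit_yes, logit_no):
    """Raw score: logit margin between the Yes and No answers (assumed input)."""
    return logit_yes - logit_no


def _sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def platt_calibrate(score, a=PLATT_A, b=PLATT_B):
    """Calibrated P(Yes) = sigmoid(a * score + b)."""
    return _sigmoid(a * score + b)


def predict(score, threshold):
    """Return "Yes" when the calibrated probability reaches the tuned threshold."""
    return "Yes" if platt_calibrate(score) >= threshold else "No"
```

Under this reading, a raw margin of 0 (an undecided model) maps to a calibrated probability of about 0.18, so a cut-off of 0.5 on calibrated probabilities is much stricter than 0.5 on raw ones. That is why the threshold should be tuned on held-out data rather than left at a default.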

Recipe (locked v9)

epochs=3 lora_r=64 LR=1.5e-5 warmup=0.05 max_length=768
label_smoothing=0.05 pos_weight=6.0 conf_penalty=0.01 patience=2
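One way to pin the locked recipe in code is a frozen dataclass, sketched below. The field names are hypothetical; reading warmup=0.05 as a warmup ratio and patience=2 as early-stopping patience is an assumption, since the card does not spell either out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class V9Recipe:
    """Locked v9 hyperparameters, copied from the recipe lines above."""
    epochs: int = 3
    lora_r: int = 64
    learning_rate: float = 1.5e-5
    warmup: float = 0.05          # assumed: fraction of total steps used for warmup
    max_length: int = 768
    label_smoothing: float = 0.05
    pos_weight: float = 6.0
    conf_penalty: float = 0.01
    patience: int = 2             # assumed: early-stopping patience

    def warmup_steps(self, total_steps):
        """Warmup length in optimizer steps, under the warmup-ratio assumption."""
        return round(self.warmup * total_steps)


RECIPE = V9Recipe()
```

Freezing the dataclass makes accidental edits fail loudly, which suits a recipe described as locked.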

Part of the R5 v9 cohort: https://huggingface.co/majid2230
