Edit model card

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Companion Post: Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)

Main post: BetterDAN, AI Machiavelli & Oppo Jailbreaks vs. SOTA models & GPT2XL_RLLMv3

Related post: Coherence (and Response Time) Test

Downloads last month: 6

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Collection including migueldeguzmandev/Phi-1.5-RLLMv3-3

Phi-1.5-RLLMv3

Collection

This is a collection designed to present the ten RLLM steps/ training runs intended to improve Phi-1.5's outputs towards coherence and politeness. • 10 items • Updated May 8