LossFunctionLover/pairwise-orm-model

LossFunctionLover committed on Dec 19, 2025

Commit 68400a5 · verified · 1 Parent(s): 4b52705

Update MODEL_CARD.md

Files changed (1)

MODEL_CARD.md +7 -7

@@ -8,7 +8,7 @@ tags:
 - outcome-reward-model
 - pairwise-preference
 datasets:
-- akleshmishra/orm-pairwise-preference-pairs
+- LossFunctionLover/orm-pairwise-preference-pairs
 metrics:
 - accuracy
 pipeline_tag: text-classification
@@ -20,7 +20,7 @@ model-index:
       name: Pairwise Preference Ranking
     dataset:
       name: ORM Pairwise Preference Pairs
-      type: akleshmishra/orm-pairwise-preference-pairs
+      type: LossFunctionLover/orm-pairwise-preference-pairs
     metrics:
     - type: accuracy
       value: 96.3
@@ -38,7 +38,7 @@ model-index:
 [![Paper](https://img.shields.io/badge/Paper-ArXiv-red)](link-to-arxiv)
 [![Dataset](https://img.shields.io/badge/Dataset-HuggingFace-yellow)](https://huggingface.co/datasets/akleshmishra/orm-pairwise-preference-pairs)
-[![Code](https://img.shields.io/badge/Code-GitHub-blue)](your-github-repo)
+[![Code](https://img.shields.io/badge/Code-GitHub-blue)](https://github.com/Coder-12)
 </div>
@@ -131,7 +131,7 @@ Each pair contains:
 ### Training Configuration
 **Hyperparameters:**
-- **Base Model**: [Specify your model, e.g., "Qwen/Qwen2.5-Math-1.5B-Instruct"]
+- **Base Model**: facebook/OPT1.3b
 - **Trainable Parameters**: Scoring head only (~500K-1M params)
 - **Optimizer**: AdamW
   - Learning rate: 1e-4
@@ -343,15 +343,15 @@ If you use this model in your research, please cite:
 ## 🔗 Resources
 - 📄 **Paper**: [ArXiv](link-to-arxiv) (Coming soon)
-- 💾 **Dataset**: [HuggingFace](https://huggingface.co/datasets/akleshmishra/orm-pairwise-preference-pairs)
-- 💻 **Code**: [GitHub](your-github-repo-url)
+- 💾 **Dataset**: [HuggingFace](https://huggingface.co/datasets/LossFunctionLover/orm-pairwise-preference-pairs)
+- 💻 **Code**: [GitHub](https://github.com/Coder-12)
 - 📊 **Training Logs**: [Weights & Biases](wandb-link) (if available)
 ## 📧 Contact
 **Aklesh Mishra**
 - Email: akleshmishra7@gmail.com
-- GitHub: [@your-username](https://github.com/your-username)
+- GitHub: [@Coder-12](https://github.com/Coder-12)
 ## 📝 License