NeMo
English
nvidia
llama3.1
reward model
zhilinw commited on
Commit
057236d
·
verified ·
1 Parent(s): 1bba03e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -35,7 +35,7 @@ By accessing this model, you are agreeing to the LLama 3.1 terms and conditions
35
 
36
  ## RewardBench Primary Dataset LeaderBoard
37
 
38
- As of 27 Sept 2024, Llama-3.1-Nemotron-70B-Reward performs best Overall on RewardBench as well as with strong performance in Chat, Safety and Reasoning categories among the models below.
39
 
40
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
41
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
 
35
 
36
  ## RewardBench Primary Dataset LeaderBoard
37
 
38
+ As of 30 Sept 2024, Llama-3.1-Nemotron-70B-Reward performs best Overall on RewardBench as well as with strong performance in Chat, Safety and Reasoning categories among the models below.
39
 
40
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
41
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|