32 14 8

Hamish Ivison

hamishivi

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a dataset about 1 hour ago

ai2-adapt-dev/eurus2_ground_truth_with_random_max_length

updated a dataset about 3 hours ago

hamishivi/SimpleQA-RLVR

updated a dataset 1 day ago

hamishivi/2wiki_rlvr

View all activity

Organizations

Collections 7

Inference demo for TESS 2 model

models 34

hamishivi/s1k_seq_orig_hyper421740446762

Updated Mar 13 • 1

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt

Updated Mar 11

hamishivi/tulu-2-wildchat-326k-sft

Updated Mar 4 • 5

hamishivi/tulu-2-arena-hard-326k-sft

Updated Mar 4 • 3

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft

Updated Mar 4 • 12

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

Updated Mar 4 • 4

hamishivi/tulu-2-multitask-rrmax-326k-sft

Updated Mar 4 • 4

hamishivi/qwen2_math_tokenizer_tweaked

Updated Mar 3

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350

Updated Feb 25 • 3

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021

Updated Feb 25 • 3

datasets 50

hamishivi/SimpleQA-RLVR

Viewer • Updated about 3 hours ago • 4.33k • 152

hamishivi/2wiki_rlvr

Viewer • Updated 1 day ago • 15.3k • 7

hamishivi/tqa_rlvr

Viewer • Updated 1 day ago • 156k • 6

hamishivi/nq_rlvr

Viewer • Updated 1 day ago • 91.5k • 7

hamishivi/hotpotqa_rlvr

Viewer • Updated 1 day ago • 97.9k • 5

hamishivi/SimpleQA-RLVR-noprompt

Viewer • Updated 3 days ago • 4.33k • 19

hamishivi/simpleqa_5_actions_llama3.3_70b_it

Viewer • Updated 4 days ago • 4.33k • 15

hamishivi/simpleqa_10_actions_llama3.3_70b_it

Viewer • Updated 4 days ago • 1.03k • 12

hamishivi/GeneralThought-430K-filtered-thinker

Viewer • Updated 10 days ago • 296k • 55

hamishivi/tulu-3-sft-t3-70b-thinker

Viewer • Updated 10 days ago • 932k • 84

Hamish Ivison

AI & ML interests

Recent Activity

Organizations

Collections 7

Large-Scale Data Selection for Instruction Tuning

hamishivi/tulu-2-multitask-rrmax-326k-sft

hamishivi/rds-sels-multitask-rrmax-top326k

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

TESS 2: A Large-Scale Generalist Diffusion Language Model

Tess 2 Demo

hamishivi/tess2-v0.3

hamishivi/tess2-v0.1

Papers 11

spaces 1

Tess 2 Demo

models 34

hamishivi/s1k_seq_orig_hyper421740446762

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt

hamishivi/tulu-2-wildchat-326k-sft

hamishivi/tulu-2-arena-hard-326k-sft

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

hamishivi/tulu-2-multitask-rrmax-326k-sft

hamishivi/qwen2_math_tokenizer_tweaked

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021

datasets 50

hamishivi/SimpleQA-RLVR

hamishivi/2wiki_rlvr

hamishivi/tqa_rlvr

hamishivi/nq_rlvr

hamishivi/hotpotqa_rlvr

hamishivi/SimpleQA-RLVR-noprompt

hamishivi/simpleqa_5_actions_llama3.3_70b_it

hamishivi/simpleqa_10_actions_llama3.3_70b_it

hamishivi/GeneralThought-430K-filtered-thinker

hamishivi/tulu-3-sft-t3-70b-thinker

Hamish Ivison

AI & ML interests

Recent Activity

Organizations

Collections 7

Tess 2 Demo

Papers 11

spaces 1

Tess 2 Demo

models 34 Sort: Recently updated

datasets 50 Sort: Recently updated

models 34

datasets 50