Alan Akbik

alanakbik

AI & ML interests

None yet

Recent Activity

Organizations

flair's profile picture BigScience Workshop's profile picture

alanakbik's activity

New activity in flair/ner-english 10 days ago
reacted to PatrickHaller's post with 🔥 7 months ago
view post
Post
1910
How Robust Is Your Model in Complex Code Generation Tasks? 🤔

We've launched the PECC benchmark to challenge chat models in code generation, drawing from the Advent of Code for programming tasks and the Euler Project for math-heavy challenges. This new task tests models with problems presented in both detailed prose and concise "leet code" styles, evaluating their ability to understand and solve complex coding issues and math problem in chat-based interactions.

It seems that the Claude 3 models outperforme ChatGPT:
Model / Avg. (pass@3)
Claude 3 Haiku / 27.67
GPT-3.5-Turbo / 23.75
Mixtral-8x22B-Instruct-v0.1 / 8.35

Read our Preprint📃: PECC: Problem Extraction and Coding Challenges (2404.18766)
Look at the dataset🔎: PatrickHaller/pecc

We also got accepted at LREC-COLING '24 🎉
New activity in flair/upos-multi 9 months ago

Update Model

2
#3 opened 9 months ago by
stefan-it
New activity in flair/pos-english 12 months ago

What tokenizer is best?

1
#2 opened 12 months ago by
turian
New activity in flair/ner-english-ontonotes-large about 1 year ago

Error

8
#2 opened over 1 year ago by
vpkprasanna
New activity in flair/ner-spanish-large about 1 year ago
New activity in flair/ner-english-large about 1 year ago

error while loading

1
#2 opened about 1 year ago by
shivam2813
New activity in flair/ner-multi over 1 year ago