BigScience WG for evaluation of bias, fairness, and social impact

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Shayne authored a paper 11 days ago

Towards Best Practices for Open Datasets for LLM Training

Shayne authored a paper 7 months ago

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Shayne authored a paper 7 months ago

Entity-Based Knowledge Conflicts in Question Answering

View all activity

BigScienceBiasEval's activity

Shayne

authored a paper 11 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 13 days ago • 48

Shayne

authored 4 papers 7 months ago

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

Entity-Based Knowledge Conflicts in Question Answering

Paper • 2109.05052 • Published Sep 10, 2021

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Paper • 2301.13688 • Published Jan 31, 2023 • 8

MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering

Paper • 2007.15207 • Published Jul 30, 2020

Shayne

authored a paper 9 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

manandey

authored a paper 10 months ago

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Paper • 2303.03915 • Published Mar 7, 2023 • 6

jordiclive

authored a paper 10 months ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 42

Shayne

authored a paper 11 months ago

On the Societal Impact of Open Foundation Models

Paper • 2403.07918 • Published Feb 27, 2024 • 17

manandey

authored a paper 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 137

Shayne

authored a paper 12 months ago

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12, 2024 • 47

manandey

updated a dataset about 1 year ago

BigScienceBiasEval/crows_pairs_multilingual

Updated Jan 14, 2024 • 1.33k • 3

Shayne

authored 2 papers over 1 year ago

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 54

OctoPack: Instruction Tuning Code Large Language Models

Paper • 2308.07124 • Published Aug 14, 2023 • 29

jordiclive

authored 2 papers over 1 year ago

Control Prefixes for Parameter-Efficient Text Generation

Paper • 2110.08329 • Published Oct 15, 2021

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Paper • 2206.11249 • Published Jun 22, 2022

jaketae

authored a paper over 1 year ago

TESS: Text-to-Text Self-Conditioned Simplex Diffusion

Paper • 2305.08379 • Published May 15, 2023 • 1

manandey

authored 3 papers over 1 year ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 30

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Paper • 2202.01279 • Published Feb 2, 2022

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Paper • 2112.10508 • Published Dec 20, 2021

AI & ML interests

Recent Activity

Team members 11

BigScienceBiasEval's activity