ToxicityPrompts

university

Recent Activity

kpriyanshu256 updated a collection 1 day ago

Xuhui authored a paper 2 days ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

ToxicityPrompts's activity

kpriyanshu256

updated a collection 1 day ago

Full Data

21 items • Updated 1 day ago

Xuhui

authored a paper 2 days ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 4 days ago • 41

kpriyanshu256

updated a collection 10 days ago

Full Data

21 items • Updated 1 day ago

kpriyanshu256

updated 2 collections 16 days ago

Full Data

21 items • Updated 1 day ago

Safety-Benchmarks

12 items • Updated 16 days ago

kpriyanshu256

updated a collection 18 days ago

Full Data

21 items • Updated 1 day ago

kpriyanshu256

updated a collection 22 days ago

Full Data

21 items • Updated 1 day ago

kpriyanshu256

updated a collection 25 days ago

Full Data

21 items • Updated 1 day ago

kpriyanshu256

authored 2 papers 28 days ago

SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain

Paper • 2102.08818 • Published Feb 17, 2021

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents

Paper • 2410.13886 • Published Oct 11

maartensap

authored a paper 6 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26 • 8

devanshrj

authored a paper 7 months ago

PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

Paper • 2405.09373 • Published May 15 • 1

kpriyanshu256

authored a paper 7 months ago

PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

Paper • 2405.09373 • Published May 15 • 1

maartensap

authored a paper 9 months ago

SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13 • 20

Xuhui

authored a paper over 1 year ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 23