Merge branch 'main' of https://huggingface.co/spaces/CONDA-Workshop/Data-Contamination-Report into pr/11 ec0bb5d OSainz commited on Apr 29
GPT-3.5Turbo HumanEval Contamination based on "Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models" (#16) 6b722ae verified OSainz jupyter31 commited on Apr 29
Added Contamination Evidence on MMLU of ChatGPT/GPT4 from "Investigating data contamination in modern benchmarks for large language models" (#10) f5daf9b verified OSainz AmeyaPrabhu commited on Apr 29
Added Contamination Info on Old Models: GPT3, FLAN, GLaM, PaLM, PaLM 2 (#13) c4acbf6 verified OSainz AmeyaPrabhu commited on Apr 25
Contamination results based on "Data Contamination Quiz" (#9) 36aaa79 verified OSainz shahriargolchin commited on Apr 25
Code contamination in HumanEval and MBPP (#12) ffb0d75 verified OSainz AmeyaPrabhu commited on Apr 25
Add model-based results for MedNLI, RadNLI for GPT-3.5 and GPT-4 (#8) d57b460 verified Iker j-chim commited on Apr 23
Add data from "An Open-Source Data Contamination Report for Large Language Models" (#5) 619ed3b verified Iker vishaal27 commited on Apr 23
Add data from "Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus" (#6) 935e79b verified Iker vishaal27 commited on Apr 18