4 52

Hu Zang

zanghu

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

princeton-nlp/SWE-bench_Verified

reacted to joaogante's post with 🤗 about 1 month ago

New sampling strategy dropped in 🤗 transformers -- Min P sampling 🔥 Are you tired of having `top_k` arbitrarily discarding high-quality continuations? Or `top_p` forgetting to exclude low-probability tokens, derailing your generation? Try out the new `min_p` flag in `generate`, fresh from a PR merged today! 🥬 Min P consists of a dynamic token filter -- as opposed to Top K, which keeps the K most likely tokens, and Top P, which keeps the most likely tokens up to a fixed cumulative probability, both static filters. Min P takes a base probability (defined in the `min_p` flag) and multiplies it by the probability of the most likely token in the distribution for the next token. All tokens less likely than the resulting value are filtered. What happens with this strategy? 👉 High probability token present -> aggressive filter (we don't want to miss on that high-probability case and risk derailing generation) 👉 No high probability token present -> relaxed filter (there are many continuation possibilities that the model finds plausible) You should set `min_p` to a low value, between 0.05 and 0.1. It behaves particularly well for creative text generation when paired up with temperature > 1. Kudos to @kalomaze and @menhguin for creating this technique 🔥 Read their discussion in the original issue for benchmarks (https://github.com/huggingface/transformers/issues/27670) Copy-pasteable version of the example in the image below here: https://pastebin.com/VqXNtuxd Have fun experimenting! 😎

reacted to joaogante's post with 👍 about 1 month ago

View all activity

Organizations

None yet

zanghu's activity

liked a dataset 4 days ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 164k • 164

liked 2 Spaces about 2 months ago

1.25k

Big Code Models Leaderboard

📈

Submit code models for evaluation on benchmarks

198

BigCodeBench Leaderboard

🥇

Explore and analyze code evaluation data

liked a model 3 months ago

openai-community/gpt2

Text Generation • Updated Feb 19, 2024 • 12.5M • • 2.68k

liked 2 datasets 3 months ago

Daoguang/Multi-SWE-bench

Viewer • Updated Sep 3, 2024 • 91 • 473 • 7

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 39.3k • 110

liked a Space 10 months ago

Face Forgery Detection

📉

liked a model about 1 year ago

BAAI/bge-reranker-large

Feature Extraction • Updated May 11, 2024 • 426k • 395

liked 2 models over 1 year ago

thenlper/gte-large-zh

ChanceFocus/finma-7b-full

Text Generation • Updated Sep 14, 2023 • 572 • 20

liked 4 datasets over 1 year ago

liked 3 models over 1 year ago

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 4.31k

codellama/CodeLlama-34b-hf

Text Generation • Updated Apr 12, 2024 • 12.5k • 169

defog/sqlcoder

Text Generation • Updated Mar 1, 2024 • 1.19k • 318

liked 2 datasets over 1 year ago

OpenAssistant/oasst1

Viewer • Updated May 2, 2023 • 88.8k • 8.29k • 1.38k

tiiuae/falcon-refinedweb

Viewer • Updated Jun 20, 2023 • 968M • 28.9k • 845

liked a model over 1 year ago

Spico/Humback-M0

Text Generation • Updated Aug 18, 2023 • 21 • 3