Experimenting with "Refusal in LLMs is mediated by a single direction", applying it to CodeLlama and code tasks: https://huggingface.co/blog/monsoon-nlp/refusal-in-code-llms Refusal in Language Models Is Mediated by a Single Direction (2406.11717)
I'm working on Matryoshka embeddings for proteins 🦠🧬. While that's cooking, here are cosine distances of selected pairs from UniProt's 1024-dim embeddings, within train/test/validation splits: monsoon-nlp/protein-pairs-uniprot-swissprot
NYC models
Models finetuned on posts and responses from the /r/AskNYC subreddit
monsoon-nlp/nyc-savvy-llama2-7b-lora-adapter • Updated Apr 22
monsoon-nlp/nyc-savvy-llama2-7b • Text Generation • Updated Sep 4, 2023 • 4
monsoon-nlp/gpt-nyc • Text Generation • Updated Sep 3, 2023 • 5 • 1
monsoon-nlp/asknyc-chatassistant-format • Viewer • Updated Mar 30 • 13.4k • 10
2020 Electra Models
During a wave of monolingual models, I trained these to compete with mBERT
monsoon-nlp/hindi-bert • Feature Extraction • Updated Sep 20, 2023 • 570 • 16
monsoon-nlp/tamillion • Feature Extraction • Updated Sep 20, 2023 • 10 • 1
monsoon-nlp/bangla-electra • Updated Oct 8, 2022 • 37 • 4
monsoon-nlp/dv-wave • Updated Dec 11, 2020 • 15 • 1