arxiv:2308.09138
Domenic Rosati
domenicrosati
AI & ML interests
Natural Language Processing on Scientific Texts, Theory grounded NLP/NLU, Representation Learning, Safety, Factuality, Truth Conditions, Unverstanding and Meaning
Organizations
Papers
2
models
172
domenicrosati/beavertails_attack_meta-llama_Llama-2-7b-chat-hf_3e-5_1k
Text Generation
•
Updated
domenicrosati/adversarial_loss_lr_1e-5_model_meta-llama_Llama-2-7b-chat-hf_batch_4_epoch_4_num_layers_6
Text Generation
•
Updated
domenicrosati/adversarial_loss_lr_1e-5_attack_meta-llama_Llama-2-7b-chat-hf_4_num_layers_6_8e-5_1k
Text Generation
•
Updated
domenicrosati/adversarial_loss_lr_1e-5_attack_meta-llama_Llama-2-7b-chat-hf_4_num_layers_6_6e-5_1k
Text Generation
•
Updated
domenicrosati/adversarial_loss_lr_1e-5_attack_meta-llama_Llama-2-7b-chat-hf_4_num_layers_6_3e-5_1k
Text Generation
•
Updated
domenicrosati/deberta-v3-xsmall-beavertails-harmful-qa-classifier
Text Classification
•
Updated
domenicrosati/decoding_trust_attack_8e5
Text Generation
•
Updated
domenicrosati/decoding_trust_attack_6e5
Text Generation
•
Updated
domenicrosati/decoding_trust_attack_3e5
Text Generation
•
Updated
domenicrosati/adversarial_loss_lr_1e-5_attack_meta-llama_Llama-2-7b-chat-hf_masked_4_6e-5_1k
Text Generation
•
Updated