QA-DeBERTa-v3-large-qa_cross_attn_cls-binary
This model is a fine-tuned version of microsoft/deberta-v3-large on the saiteki-kai/Beavertails-it dataset. It achieves the following results on the evaluation set:
- Loss: 0.3285
- Accuracy: 0.8621
- Unsafe Precision: 0.8748
- Unsafe Recall: 0.8777
- Unsafe F1: 0.8763
- Unsafe Fpr: 0.1576
- Unsafe Aucpr: 0.9540
- Safe Precision: 0.8460
- Safe Recall: 0.8424
- Safe F1: 0.8442
- Safe Fpr: 0.1223
- Safe Aucpr: 0.9177
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 6e-06
- train_batch_size: 64
- eval_batch_size: 128
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 1000
- num_epochs: 10
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Unsafe Precision | Unsafe Recall | Unsafe F1 | Unsafe Fpr | Unsafe Aucpr | Safe Precision | Safe Recall | Safe F1 | Safe Fpr | Safe Aucpr |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.321 | 0.2501 | 2114 | 0.3566 | 0.8445 | 0.8897 | 0.8225 | 0.8548 | 0.1280 | 0.9423 | 0.7966 | 0.8720 | 0.8326 | 0.1775 | 0.8926 |
| 0.3383 | 0.5001 | 4228 | 0.3392 | 0.8510 | 0.8560 | 0.8804 | 0.8680 | 0.1858 | 0.9479 | 0.8443 | 0.8142 | 0.8290 | 0.1196 | 0.9049 |
| 0.3065 | 0.7502 | 6342 | 0.3281 | 0.8584 | 0.8951 | 0.8445 | 0.8690 | 0.1242 | 0.9511 | 0.8178 | 0.8758 | 0.8458 | 0.1555 | 0.9117 |
| 0.3489 | 1.0002 | 8456 | 0.3229 | 0.8597 | 0.8726 | 0.8757 | 0.8741 | 0.1604 | 0.9515 | 0.8433 | 0.8396 | 0.8415 | 0.1243 | 0.9154 |
| 0.3071 | 1.2503 | 10570 | 0.3285 | 0.8601 | 0.8733 | 0.8755 | 0.8744 | 0.1593 | 0.9527 | 0.8433 | 0.8407 | 0.8420 | 0.1245 | 0.9153 |
| 0.2817 | 1.5004 | 12684 | 0.3355 | 0.8606 | 0.8713 | 0.8793 | 0.8753 | 0.1630 | 0.9533 | 0.8468 | 0.8370 | 0.8419 | 0.1207 | 0.9178 |
| 0.2814 | 1.7504 | 14798 | 0.3285 | 0.8621 | 0.8748 | 0.8777 | 0.8763 | 0.1576 | 0.9540 | 0.8460 | 0.8424 | 0.8442 | 0.1223 | 0.9177 |
| 0.3228 | 2.0005 | 16912 | 0.3255 | 0.8636 | 0.8933 | 0.8573 | 0.8749 | 0.1285 | 0.9549 | 0.8295 | 0.8715 | 0.8500 | 0.1427 | 0.9172 |
| 0.3089 | 2.2505 | 19026 | 0.3241 | 0.8594 | 0.8680 | 0.8813 | 0.8746 | 0.1682 | 0.9537 | 0.8482 | 0.8318 | 0.8399 | 0.1187 | 0.9199 |
| 0.272 | 2.5006 | 21140 | 0.3289 | 0.8594 | 0.8669 | 0.8829 | 0.8748 | 0.1701 | 0.9542 | 0.8496 | 0.8299 | 0.8396 | 0.1171 | 0.9204 |
| 0.2778 | 2.7507 | 23254 | 0.3189 | 0.8606 | 0.8698 | 0.8815 | 0.8756 | 0.1655 | 0.9552 | 0.8488 | 0.8345 | 0.8416 | 0.1185 | 0.9219 |
Framework versions
- Transformers 4.57.3
- Pytorch 2.7.1+cu118
- Datasets 4.4.1
- Tokenizers 0.22.1
- Downloads last month
- 56
Model tree for saiteki-kai/QA-DeBERTa-v3-large-qa_cross_attn_cls-binary
Base model
microsoft/deberta-v3-largeEvaluation results
- Accuracy on saiteki-kai/Beavertails-itself-reported0.862