Update README.md
README.md
CHANGED
@@ -49,11 +49,6 @@ It independently encodes queries and documents into a shared vector space for **
 - **License:** Apache-2.0
 - **Finetuned from:** [CiscoAITeam/SecureBERT2.0-base](https://huggingface.co/CiscoAITeam/SecureBERT2.0-base)
 
-### Model Sources
-
-- **Repository:** [https://huggingface.co/CiscoAITeam/SecureBERT2.0-biencoder](https://huggingface.co/CiscoAITeam/SecureBERT2.0-biencoder)
-- **Paper:** [arXiv:2510.00240](https://arxiv.org/abs/2510.00240)
-
 ---
 
 ## Uses
@@ -137,19 +132,14 @@ print(similarity)
 
 ## Framework Versions
 
-
-
-
-
-
-
-
-
-Accelerate: 1.9.0
-
-Datasets: 3.6.0
-
-Tokenizers: 0.21.1
+* python: 3.10.10
+* sentence_transformers: 5.0.0
+* transformers: 4.52.4
+* PyTorch: 2.7.0+cu128
+* accelerate: 1.9.0
+* datasets: 3.6.0
+
+---
 
 
 ## Training Details
@@ -161,13 +151,6 @@ The model was fine-tuned on cybersecurity-specific paired-sentence data for docu
 - **Dataset Size:** 35,705 samples
 - **Columns:** `sentence_0`, `sentence_1`, `label`
 
-#### Statistics (first 1000 samples)
-
-| Field | Type | Mean Tokens | Min | Max |
-|:------|:-----|:-----------:|:---:|:---:|
-| sentence_0 | string | 20.14 | 9 | 103 |
-| sentence_1 | string | 293.14 | 3 | 934 |
-| label | float | 1.0 | 1.0 | 1.0 |
 
 #### Example Schema
 