Disobedience rate: 5% (original model: 94%)
KL divergence: 0.0204
Parameters:
direction_index = 9.55
attn.o_proj.max_weight = 1.45
attn.o_proj.max_weight_position = 10.99
attn.o_proj.min_weight = 0.18
attn.o_proj.min_weight_distance = 9.39
mlp.down_proj.max_weight = 1.31
mlp.down_proj.max_weight_position = 15.64
mlp.down_proj.min_weight = 1.24
mlp.down_proj.min_weight_distance = 7.47
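
The card does not say how these four per-component values are turned into per-layer weights. One plausible reading, offered here only as an assumption, is a linear falloff: the weight equals `max_weight` at layer position `max_weight_position`, decreases linearly to `min_weight` at a distance of `min_weight_distance` layers, and is zero beyond that. The sketch below illustrates that reading; `layer_weight`, `profile`, and the layer count of 24 are hypothetical names and values chosen for illustration, not taken from the card.

```python
# Sketch under an ASSUMED linear-falloff interpretation of the parameters.
# The function names and NUM_LAYERS are hypothetical, not from the model card.

def layer_weight(layer_index, max_weight, max_weight_position,
                 min_weight, min_weight_distance):
    """Return the assumed weight applied at a given layer index."""
    distance = abs(layer_index - max_weight_position)
    if distance > min_weight_distance:
        # Outside the affected window: the layer is left untouched.
        return 0.0
    # Linear interpolation from max_weight (distance 0)
    # down to min_weight (distance == min_weight_distance).
    return max_weight + (distance / min_weight_distance) * (min_weight - max_weight)


# Values copied from the parameter listing above.
ATTN_O_PROJ = dict(max_weight=1.45, max_weight_position=10.99,
                   min_weight=0.18, min_weight_distance=9.39)
MLP_DOWN_PROJ = dict(max_weight=1.31, max_weight_position=15.64,
                     min_weight=1.24, min_weight_distance=7.47)

NUM_LAYERS = 24  # hypothetical; the card does not state the layer count


def profile(params, num_layers=NUM_LAYERS):
    """Assumed weight for every layer index in [0, num_layers)."""
    return [layer_weight(i, **params) for i in range(num_layers)]


if __name__ == "__main__":
    for name, params in (("attn.o_proj", ATTN_O_PROJ),
                         ("mlp.down_proj", MLP_DOWN_PROJ)):
        print(name, [round(w, 2) for w in profile(params)])
```

Under this reading, the `attn.o_proj` weights would taper sharply away from layer 11, while the `mlp.down_proj` weights would stay nearly flat (1.24 to 1.31) across a window centred near layer 16.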