again GGUF?

#46 opened 3 months ago by
kalle07

.onnx support

#42 opened 6 months ago by
arjunsah21

gguf version?

#41 opened 7 months ago by
kalle07

Hosting model w/ Triton

#39 opened 9 months ago by
here4data

GPU requirements

#31 opened 12 months ago by
fedorn

ms_macro eval

#27 opened about 1 year ago by
skfrost19

Howto run nvidia/NV-Embed-v2 on GPU

2
#26 opened about 1 year ago by
hozbey

Matryoshka Embedding in v2?

1
#25 opened about 1 year ago by
dulacp

Normalising embeddings

1
#24 opened about 1 year ago by
wenhao89

Model Loading Problem

1
#19 opened about 1 year ago by
khushwant04

Max Length of 32768

1
#18 opened about 1 year ago by
hugginghugging

Is this model aligned/censored?

#17 opened about 1 year ago by
xiliny

Customized fine-tuning by users

17
#13 opened about 1 year ago by
fwj

Upload to Ollama

20
#12 opened about 1 year ago by
nonetrix

Does this work with vLLM?

#9 opened about 1 year ago by
nickandbro