llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-07 09:34:07 +00:00

Files

Daniel Bevenius 7cbec34a63 model-conversion : add device option to embd run orig model (#18386 )

This commit refactors the original model embedding script to include a
device selection option. Users can now specify the device (cpu, cuda,
mps, auto) via command-line arguments. It also refactors the code to be
more structured.

2025-12-29 13:37:02 +01:00

compare-embeddings-logits.sh

model-conversion : use CONVERTED_EMBEDDING_MODEL for embedding_verify_logits (#18079 )

2025-12-16 11:17:20 +01:00

convert-model.sh

model-conversion : add support for SentenceTransformers (#16387 )

2025-10-09 14:35:22 +02:00

modelcard.template

model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801 )

2025-09-05 04:36:23 +02:00

run-converted-model.sh

model-conversion : add support for SentenceTransformers (#16387 )

2025-10-09 14:35:22 +02:00

run-original-model.py

model-conversion : add device option to embd run orig model (#18386 )

2025-12-29 13:37:02 +01:00