Closed
Description
I tried running the GGML version of WizardCoder-15B-1.0 with llama.cpp's main, but the model fails to load with this error:
main -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 --repeat_penalty 1.2 --instruct --color --memory_f32 -m WizardCoder-15B-1.0.ggmlv3.q4_0.bin
main: build = 686 (ac3b886)
main: seed = 1686975019
ggml_init_cublas: found 1 CUDA devices:
Device 0: NVIDIA GeForce RTX 4050 Laptop GPU
llama.cpp: loading model from WizardCoder-15B-1.0.ggmlv3.q4_0.bin
error loading model: missing tok_embeddings.weight
llama_init_from_file: failed to load model