simon,
@simon@simonwillison.net avatar

Many options for running Mistral models in your terminal using LLM

I wrote about a whole bunch of different ways you can use my LLM tool to run prompts through Mistral 7B, Mixtral 8x7B and the new Mistral-medium from the terminal:
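As a minimal sketch of the hosted-API route (this assumes the llm-mistral plugin and a Mistral API key; model aliases may differ from the article's exact examples):

```shell
# Install the LLM CLI and the Mistral plugin
pipx install llm
llm install llm-mistral

# Store your Mistral API key, then run a prompt through the hosted API
llm keys set mistral
llm -m mistral-medium 'Five fun facts about otters'
```

The post covers several other routes too, including fully local ones via llama.cpp and GPT4All.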

https://simonwillison.net/2023/Dec/18/mistral/

xek,
@xek@hachyderm.io avatar

@simon FWIW, I tried following the instructions there in a fresh venv and got Error: 'gguf' is not a known model. llm --version shows 0.12, not sure if I'm missing a plugin or something that adds this.

simon,
@simon@simonwillison.net avatar

@xek Did you install the latest llm-llama-cpp plugin?

xek,
@xek@hachyderm.io avatar

@simon Ah, sorry, forgot to include that. llm plugins shows it at 0.3.

jeremybmerrill,

@xek @simon Same error, actually. (Likewise llm 0.12, llm-llama-cpp 0.3) If it's helpful, llm models also doesn't show any gguf-related output.

simon,
@simon@simonwillison.net avatar

@jeremybmerrill @xek what does "llm plugins" output?

jeremybmerrill,

@simon @xek From $ llm plugins I get llm-llama-cpp and llm-gpt4all (installed separately).

xek's suggestion of $ llm llama-cpp models returns {} but fixes the problem, and I can now run llm -m gguf ... (well, I run out of RAM, but that's close enough, and I should've anticipated it)
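For anyone else hitting this, a sketch of the workaround described above (assuming llm 0.12 and llm-llama-cpp 0.3; the model filename here is just an example):

```shell
# Listing the llama-cpp models appears to register the 'gguf' model type,
# even though it prints {} on a fresh install
llm llama-cpp models

# After that, running a local GGUF file works
llm -m gguf -o path ./mistral-7b-instruct.Q4_K_M.gguf 'Say hello'
```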

simon,
@simon@simonwillison.net avatar

@jeremybmerrill @xek OK, I'll look into that. Needing to run "models" like that is weird and shouldn't be necessary.

osma,
@osma@sigmoid.social avatar

@simon
Excellent as always! Thanks!

Minor nitpick: You say that Mistral Small beats GPT-3.5 on every metric. But in the table it has slightly lower scores for WinoGrande and MT Bench.

simon,
@simon@simonwillison.net avatar

@osma Oops, good catch, thanks! I'll update the copy.

simon,
@simon@simonwillison.net avatar

Noteworthy that Mistral 7B was released on September 26 and there are already seven LLM plugins that can execute it, either locally or via a hosted API:

llm-mistral, llm-llama-cpp, llm-gpt4all, llm-mlc, llm-replicate, llm-anyscale-endpoints, llm-openrouter

Mistral appears to be establishing itself as the default LLM alternative to OpenAI's models.

simon,
@simon@simonwillison.net avatar

Added another option: you can run Mixtral as a llamafile and then configure my LLM tool to talk to it via its OpenAI-compatible localhost API endpoint https://simonwillison.net/2023/Dec/18/mistral/#llamafile-openai
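A sketch of that configuration (this assumes a llamafile serving its OpenAI-compatible API on localhost port 8080; the model_id alias is arbitrary):

```yaml
# Add to extra-openai-models.yaml in the LLM configuration directory
# (find it with: dirname "$(llm logs path)")
- model_id: mixtral
  model_name: mixtral-8x7b-instruct
  api_base: "http://localhost:8080/v1"
```

After that, prompts can be routed to the local llamafile with llm -m mixtral '...'.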
