StuartGray, Thinking some more about using external plugins to aid & guide LLM output over longer generation, and had a crazy idea which might just be workable in Oobabooga;
A plugin that adjusts generation parameters based on a pre-defined list & inline commands.
It would work something like 3rd party tool calling with parameters, BUT on seeing [Tempo:+] in the output, generation is paused, generation params are adjusted, and then resume generation.
Not great, but def. do-able.