Aha, I misunderstood. Thanks for clarifying.
Actually, for this specific context there’s an easy solution: I reckon self-hosting would be the way to go for LLMs, if your hardware supports it. I’ve heard that a lot of the smaller models have gotten much more powerful over the last year.
https://en.m.wikipedia.org/wiki/Hanlon's_razor