Discussion about this post

User's avatar
David Thomson's avatar

You probably won’t get close to 2026 Claude Opus… but you can get close to 2025 Claude Sonnet. Certainly for general chat and using it for natural language interface to AppleScript /python/ terminal commands. Also - try the omlx models with omlx I got a decent performance bump over lm studio. On the loss of analytical power… probably not a huge amount, but what you do loose is detail. Crudely q4 has fuzzier statistics about word sequence relationships than q8. So actual specific information is fuzzier. q4 won’t be able to reconstitute specific facts as well as q8. Reasoning is broadly more robust until the errors compound too much (reasoning is more commonly seen in language use than the exact words of say, Hamlets “To be or not to be” soliloquy). That means more confabulations - which I know you love so much!

No posts

Ready for more?