Tag: ollama
Re-running my RTX 5090 LLM benchmark on Ollama 0.30
3 min read
Ollama 0.30 delivered dramatically better performance on my RTX 5090, especially for Qwen 3.6 35B-A3B.
Benchmarking LLM context scaling on a local RTX 5090 setup
5 min read
Real-world context-scaling measurements for Qwen 3.6 27B and Qwen 3.6 35B-A3B on a local RTX 5090 using Ollama.