March 20, 2026
I wonder why no one has really looked into experimental design for LLMs. Finding the best setup for your specific local instance seems like something you could solve pretty easily with experimental design techniques: vary the hosting configuration systematically and work out which settings actually matter, not just on GPU instances but on CPU instances too. I mean, it could be extremely fruitful.
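
To make the idea concrete, here is a minimal sketch of a two-level full factorial design over a few hosting knobs. Everything specific in it is an assumption for illustration: the factor names (threads, batch_size, quant_bits), their levels, and the measure_tokens_per_sec function, which returns made-up numbers so the sketch runs without a model. In practice it would be replaced by a real benchmark run.

```python
from itertools import product
from statistics import mean

# Hypothetical factors and levels; swap in whatever your local setup exposes.
FACTORS = {
    "threads": [4, 8],
    "batch_size": [1, 8],
    "quant_bits": [4, 8],
}


def measure_tokens_per_sec(config):
    """Placeholder for a real benchmark; the formula is invented for illustration."""
    return (
        10.0
        + 1.5 * config["threads"]
        + 0.5 * config["batch_size"]
        - 1.0 * config["quant_bits"]
    )


def full_factorial(factors):
    """Return every combination of factor levels as a list of config dicts."""
    names = list(factors)
    return [dict(zip(names, levels)) for levels in product(*factors.values())]


def main_effects(runs, factors):
    """Main effect of a factor: mean response at its high level minus its low level."""
    effects = {}
    for name, levels in factors.items():
        low, high = levels[0], levels[-1]
        low_mean = mean(r["y"] for r in runs if r["config"][name] == low)
        high_mean = mean(r["y"] for r in runs if r["config"][name] == high)
        effects[name] = high_mean - low_mean
    return effects


designs = full_factorial(FACTORS)
runs = [{"config": c, "y": measure_tokens_per_sec(c)} for c in designs]
effects = main_effects(runs, FACTORS)

# Rank factors by the size of their effect on throughput.
for name, effect in sorted(effects.items(), key=lambda kv: -abs(kv[1])):
    print(f"{name}: {effect:+.2f} tokens/sec")
```

With three two-level factors this is only eight runs, and the ranked main effects point at which knobs are worth tuning further. A fractional factorial design would cut the run count when there are many more factors, at the cost of confounding some interactions.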