Taalas is building “inference machines”: ASICs designed only to run models, not train them. They take finished weights and execute inference, potentially much faster and more cheaply than GPUs.
That could create a strange market dynamic. One company might spend heavily to train a model on GPUs, only for another to take the weights and run the model far more efficiently on dedicated inference hardware.
This matters because training is a huge upfront cost whose payoff usually comes later, when the model is deployed and serving users. If someone else can take the trained model and offer the same capability at much lower cost, they can undercut the original creator on price or simply earn better margins.
If you release the weights, you may be giving away the only asset that lets you recover the training cost. And if another company is better at serving those weights cheaply, the economic benefit may shift away from the team that built the model and toward the team that operates the inference infrastructure. Who’s left to pay for the training?
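A back-of-envelope model makes the asymmetry concrete. Every number below is a hypothetical placeholder, not a figure from Taalas or any lab; the point is only the structure of the competition:

```python
# Toy unit-economics sketch: a trainer who paid for the model vs. a
# weight-taker who serves the same open weights on cheaper inference
# hardware. All numbers are made-up placeholders for illustration.

TRAINING_COST = 100_000_000   # one-time training spend, paid only by the trainer ($)
GPU_COST = 2.00               # trainer's GPU serving cost ($ per 1M tokens)
ASIC_COST = 0.20              # weight-taker's ASIC serving cost ($ per 1M tokens)

def margin(price, serving_cost):
    """Gross margin per 1M tokens served at a given market price."""
    return price - serving_cost

# If the weight-taker prices just below the trainer's serving cost,
# it still earns a healthy margin while the trainer cannot match it.
price = 1.90
assert margin(price, GPU_COST) < 0    # trainer loses $0.10 per 1M tokens
assert margin(price, ASIC_COST) > 0   # weight-taker earns $1.70 per 1M tokens

# Even if the trainer could charge a $0.20 markup over its own GPU cost,
# recovering the training spend takes TRAINING_COST / margin million
# tokens -- and competition from the cheaper operator caps that markup.
tokens_to_break_even = TRAINING_COST / margin(GPU_COST + 0.20, GPU_COST)
print(f"{tokens_to_break_even:,.0f}M tokens to recover training cost")
```

Under these toy numbers the weight-taker profits at any price above $0.20 per million tokens, while the trainer needs hundreds of trillions of served tokens just to claw back the training bill.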