Mentat curated
Artificial Intelligence medium · independent

Can host, can't host performantly

A viral prediction says distrust of Anthropic will push enterprises to self-host Chinese open-weight models — but the practitioners in its own reply thread say nobody has shown that works at scale.

Xiaoyin Qu, a former Meta product manager, posted a clean thesis that traveled: US and European companies will abandon OpenAI and Anthropic and run Chinese open-weight models on their own machines — and not only to save money. Her sharper charge is a trust failure. Anthropic recently disclosed it will hold customer prompts for 30 days even under contracts that promised zero retention, citing safety, while it pushes into healthcare and legal — the same verticals its enterprise customers work in. Hand a lab your data and then watch it build your product, the argument goes, and self-hosting an open model you control starts to look like the safer bet.

"For Lindy, switching off Claude was a matter of survival." — on the ~90% cost cut

The cost half is already happening. Lindy, an automation startup, moved all of its traffic off Claude to the Chinese model DeepSeek and cut the bill on those routes by roughly 90%, calling the switch a matter of survival; the bank UBS reckons about 60% of the companies it monitors are now curbing AI spend by routing to cheaper, often Chinese, models. The trust grievance is real and the price gap is real.

What no one has shown is the part the prediction leans on hardest. The most knowledgeable replies in Qu's own thread are practitioners who would like it to be true and still doubt it: downloading open weights is easy, but serving them fast and reliably to thousands of employees is a different, unsolved engineering problem. That gap — between owning a model and operating one at scale — is where the bold call meets the wall.

The lenses

Novelty 2
Impact · breadth 3
Impact · depth 3
Actionable 1
Substance 3
Hype 3

The facts

The cost cut: Lindy moved all traffic off Claude to DeepSeek, ~90% cheaper on those routes
How wide: UBS says ~60% of monitored firms are curbing AI spend, shifting toward cheaper, often Chinese, models
The trust trigger: Anthropic disclosed 30-day prompt retention overriding zero-retention contracts
Open digg.com →
