Artificial Intelligence medium · independent

Can host, can't host performantly

A viral prediction says distrust of Anthropic will push enterprises to self-host Chinese open-weight models — but the practitioners in its own reply thread say nobody has shown that works at scale.

demo Digg · 2 min read · originally announced 28 Jun 2026

Xiaoyin Qu, a former Meta product manager, posted a clean thesis that traveled: US and European companies will abandon OpenAI and Anthropic and run Chinese open-weight models on their own machines — and not only to save money. Her sharper charge is a trust failure. Anthropic recently disclosed it will hold customer prompts for 30 days even under contracts that promised zero retention, citing safety, while it pushes into healthcare and legal — the same verticals its enterprise customers work in. Hand a lab your data and then watch it build your product, the argument goes, and self-hosting an open model you control starts to look like the safer bet.

"For Lindy, switching off Claude was a matter of survival." — on the ~90% cost cut

The cost half is already happening. Lindy, an automation startup, moved all of its traffic off Claude to the Chinese model DeepSeek and cut the bill on those routes by roughly 90%, calling the switch a matter of survival; the bank UBS reckons about 60% of the companies it monitors are now curbing AI spend by routing to cheaper, often Chinese, models. The trust grievance is real and the price gap is real.

What no one has shown is the part the prediction leans on hardest. The most knowledgeable replies in Qu's own thread are practitioners who would like it to be true and still doubt it: downloading open weights is easy, but serving them fast and reliably to thousands of employees is a different, unsolved engineering problem. That gap — between owning a model and operating one at scale — is where the bold call meets the wall.

The lenses

Novelty 2

Impact · breadth 3

Impact · depth 3

Actionable 1

Substance 3

Hype 3

The facts

The cost cutLindy moved all traffic off Claude to DeepSeek, ~90% cheaper on those routes

How wideUBS: ~60% of monitored firms curbing AI spend, shifting toward cheaper/Chinese models

The trust triggerAnthropic disclosed 30-day prompt retention overriding zero-retention contracts

Concepts

AI enterprise deployment Local inference AI economics

Open digg.com →

How this connects

Tap a node to open it

Can host, can't host performantly

The lenses

The facts

Concepts

More in Artificial Intelligence

Agent Skills

The bottleneck is a transformer

Safety's rounding error

How this connects