Coinbase Cuts AI Spending by Nearly Half Using Open-Weight Models GLM and Kimi as Defaults

According to CEO Brian Armstrong on June 29, Coinbase reduced AI spending by nearly half through optimized defaults, routing, and caching strategies rather than usage caps. The company set open-weight models including Zhipu's GLM and Moonshot's Kimi as default options via its LLM gateway, while 91% of employees have never hit usage limits. Cache hit rates improved from 5% to 60%, demonstrating the effectiveness of infrastructure-level optimization.
Disclaimer: The information on this page may come from third-party sources and is for reference only. It does not represent the views or opinions of Gate and does not constitute any financial, investment, or legal advice. Virtual asset trading involves high risk. Please do not rely solely on the information on this page when making decisions. For details, see the Disclaimer.
Comment
0/400
No comments