GLM 5.2 Ranks Second in Vending-Bench 2 Long-Term Business Simulation, Shows ~$1,000 Monthly Profit Growth

According to Andon Labs' latest Vending-Bench 2 evaluation, GLM 5.2 ranked second in a long-term business simulation test. The benchmark simulates 365 days of operations at a vending machine company: each day, the model makes inventory and pricing decisions based on financial data, which tests how coherent its decisions remain over an extended task.

Successive GLM versions showed steady, roughly linear growth, with an average monthly profit improvement of nearly $1,000 (GLM 5 averaged $4,432 and GLM 5.1 reached $5,634, a gain of $1,202). In contrast, Kimi K2.7 Code scored lower than K2.6, while MiniMax M3 improved significantly over M2.5 but remained well below both the Kimi and GLM series in overall profitability.
