Yuntianliyifei Introduces 3D Stacked Memory Architecture in Inference Chip Development

According to investor relations disclosures on May 12, Yuntianliyifei's inference chip in development adopts a GPNPU architecture as its core technology roadmap. Key technical highlights include GPGPU-level universal programming capability compatible with mainstream CUDA ecosystems, optimized NPU cores for inference efficiency, and a 3D stacked memory architecture designed to increase bandwidth and reduce access latency, breaking through the memory wall bottleneck.

The company also employs a compute modular architecture to support rack-level scale-up supernode construction for trillion and hundred-trillion-scale MoE model inference. The technology roadmap targets exponentially reducing token costs and accelerating large model application deployment.

Disclaimer: The information on this page may come from third-party sources and is for reference only. It does not represent the views or opinions of Gate and does not constitute any financial, investment, or legal advice. Virtual asset trading involves high risk. Please do not rely solely on the information on this page when making decisions. For details, see the Disclaimer.
Comment
0/400
No comments