반도체설계교육센터

IP명	A Networked Processor Array based on Heterogeneous Computing Units for AI Acceleration
Category	Digital	Application	Processors
실설계면적	4㎛ X 4㎛	공급 전압	1.8V
IP유형	Hard IP	동작속도	100MHz
검증단계	Simulation	참여공정	SS28-2501
IP개요	This paper presents a Network-on-Chip (NoC) based architecture aimed at addressing the rising demands of AI workloads such as high-performance object detection. The proposed design arranges computing, memory, and communication units in a 6×6 mesh topology, where each node is equipped with dual routers to alleviate congestion through parallel routing paths. By implementing XY-YX routing and wormhole flow control, the system balances traffic effectively, reduces latency, and ensures deadlock freedom. Additionally, a CoFEx module featuring 3×3 systolic array (SA) computing units maximizes data reuse and parallel processing for convolution operations, thus lowering power consumption. The dual-router setup allows flexible and scalable communication, while specialized modules such as the feature map setup unit (FSU) and on-chip memory interfaces streamline data movement. Simulation results confirm that parallel operations occur as intended, with results fed back to external systems through communication controllers. Overall, this architecture offers a robust, high-throughput platform that is well-suited to meet the real-time and power-efficiency requirements of next-generation AI applications.
- 레이아웃 사진 -