Logo

회원가입로그인 ENGLISH naver youtube  
search 

IP명 A Networked Processor Array based on Heterogeneous Computing Units for AI Acceleration
Category Digital Application Processors
실설계면적 4㎛ X 4㎛ 공급 전압 1.8V
IP유형 Hard IP 동작속도 100MHz
검증단계 Simulation 참여공정 SS28-2501
IP개요 This paper presents a Network-on-Chip (NoC) based architecture aimed at addressing the rising demands of AI workloads such as high-performance object detection. The proposed design arranges computing, memory, and communication units in a 6×6 mesh topology, where each node is equipped with dual routers to alleviate congestion through parallel routing paths. By implementing XY-YX routing and wormhole flow control, the system balances traffic effectively, reduces latency, and ensures deadlock freedom. Additionally, a CoFEx module featuring 3×3 systolic array (SA) computing units maximizes data reuse and parallel processing for convolution operations, thus lowering power consumption. The dual-router setup allows flexible and scalable communication, while specialized modules such as the feature map setup unit (FSU) and on-chip memory interfaces streamline data movement. Simulation results confirm that parallel operations occur as intended, with results fed back to external systems through communication controllers. Overall, this architecture offers a robust, high-throughput platform that is well-suited to meet the real-time and power-efficiency requirements of next-generation AI applications.
- 레이아웃 사진 -