The Ultimate DataFlow for Ultimate SuperComputers-on-a-Chips

09/20/2020

This article starts from the assumption that near-future 100-billion-transistor SuperComputers-on-a-Chip will include N big multi-core processors, 1000N small many-core processors, a TPU-like fixed-structure systolic array accelerator for the most frequently used machine learning algorithms needed in bandwidth-bound applications, and a flexible-structure reprogrammable accelerator for less frequently used machine learning algorithms needed in latency-critical applications.
