Tandem Processor论文解读
Tandem Processor是一种神经网络加速器,其考虑了GEMM与非GEMM计算的协同优化。 论文全称:《Tandem Processor: Grappling with Emerging Operators in Neural Networks》 论文链接https://dl.acm.org/doi/abs/10.1145/3620665.3640365
Tandem Processor是一种神经网络加速器,其考虑了GEMM与非GEMM计算的协同优化。 论文全称:《Tandem Processor: Grappling with Emerging Operators in Neural Networks》 论文链接https://dl.acm.org/doi/abs/10.1145/3620665.3640365
Network-Aware Locality Scheduling是一种借助元启发式算法的共流(Coflow)调度方法,原文链接: 论文链接https://ieeexplore.ieee.org/abstract/document/9329172
对STARNUMA论文(StarNUMA: Mitigating NUMA Challenges with Memory Pooling)的各部分概括和解释,原文链接: https://faculty.cc.gatech.edu/~adaglis/files/papers/starnuma_micro24.pdfhttps://faculty.cc.gatech.edu/~adaglis/files/papers/starnuma_micro24.pdf