ABSTRACT

In bandwidth limited computers, such as meshes and tori, it is important to achieve high bandwidth across the bisec- tion. Traditional techniques achieve bandwidth in the range of 30-70%. We show how to use barriers, in particular Inte- grated Network Barriers to achieve high bandwidth utiliza- tion which is arbitrarily close to 100%. This technique also provides low latency and fairness to processors. Moreover, it works globally and therefore is not dependent on local approximations of network traffic.