A Deterministic Fault-Tolerant and Deadlock-Free Routing Protocol in 2-D Meshes Based on Odd-Even Turn Model
Jie Wu
- Dept. of Computer Science and Engineering
A Deterministic Fault-Tolerant and Deadlock-Free Routing Protocol - - PowerPoint PPT Presentation
A Deterministic Fault-Tolerant and Deadlock-Free Routing Protocol in 2-D Meshes Based on Odd-Even Turn Model Jie Wu Dept. of Computer Science and Engineering Florida Atlantic University Boca Raton, FL 33431 jie@cse.fau.edu Table of
1.
2.
3.
4.
5.
6.
The performance of a direct network (such as 2-D
Designing a deterministic routing protocol that is
Deterministic vs. adaptive Dimension-order routing (X-Y routing) X-Y routing is not fault-tolerant and it cannot
Extended X-Y routing in 2-D meshes A novel fault model: orthogonal faulty block Fault-tolerant and deadlock-free routing without
Routing without certain constraints using VC’s
Extensions to partial adaptive routing, traffic- and
1.
2.
3.
4.
5.
6.
All non-faulty nodes are safe initially. A non-faulty node is changed to unsafe if it has two unsafe
A faulty block consists of connected unsafe and faulty
The convexity of faulty blocks facilitates a simple design
A message is divided into packets A packet is divided into flits Flits are routed through the network in pipeline
VC (Dally,1987) and VN (Linda and
Escape channel (Duato, 1995)
No deadlock-free routing can tolerate
All existing deadlock-free routing protocol
Deadlock avoidance (avoid a cycle) Four turns are disallowed in X-Y routing: two in a
Two turns are disallowed in the Turn model: one in a
Prevent the formation of the rightmost column
Once east-bound starts, no more west-bound is
Any packet is not allowed to take an EN or ES
1.
2.
3.
4.
5.
6.
Node fault only Static fault model Source and destination outside faulty blocks The destination is not a boundary node of any
No fault at the edge of a mesh The 2-D mesh is connected Four directions: North (X+), South (X-), East
Each processor (process) interacts with others in a
Each processor (process) performs exceedingly
Distance between two EFB’s:
2 (North and South) and 3 (East and West)
All nonfaulty nodes are safe initially. A nonfaulty node is changed to unsafe if
it has two unsafe or faulty neighbors that are not all in
it has an unsafe or faulty neighbor in the x dimension
In phase 1, the offset along the x dimension
In phase 2, the offset along the y dimension
Two modes: “normal” and “abnormal” Extended faulty blocks
Two cases of routing in phases 1 and 2 The extended X-Y routing in deadlock-free
Extended X-Y routing:
If the source is in an odd column and ∆x is non-
phase 1: reduce ∆x
(Normal mode) reduce ∆x to zero by sending the
(Abnormal mode) when a north-bound (south-bound)
phase 2: reduce ∆ y
Once ∆ x is reduced to zero, an NW or NE turn is
(Normal) reduce ∆y to zero by sending the packet east
(Abnormal) when a east-bound (west-bound) packet
Routing around the block is completed when ∆ x is
1.
2.
3.
4.
5.
6.
A region is orthogonal convex iff
For any horizontal or vertical line, if two nodes
Nonfaulty nodes are marked safe/unsafe in faulty
All safe nodes are marked enabled An unsafe node is initially marked disabled An disabled node is changed to the enabled status
Unsafe nodes can be remarked in the following
An unsafe node is remarked semi-faulty if it has a
An unsafe or semi-faulty node is remarked disabled if
An unsafe or semi-faulty is remarked enabled if it has
Routing along orthogonal faulty blocks
1.
2.
3.
4.
5.
6.
Partial adaptive routing
EW-routing (extended X-Y routing) WE-routing (restricted zig-zag routing)
Traffic- and adaptivity-balanced routing
Extended Rule: same as the original one
Removing constraints:
Removing assumption (2): the destination is not a
E O
The destination is an unsafe but enabled
1.
2.
3.
4.
5.
6.
A simple and efficient deterministic fault-tolerant
The approach can be applied to 2-D meshes with
The use of localized algorithms to construct
Future work: extensions to higher dimensional