WO1993003580A1 - Scalable self-routing nonblocking message switching and routing network - Google Patents

Scalable self-routing nonblocking message switching and routing network Download PDF

Info

Publication number
WO1993003580A1
WO1993003580A1 PCT/US1992/006651 US9206651W WO9303580A1 WO 1993003580 A1 WO1993003580 A1 WO 1993003580A1 US 9206651 W US9206651 W US 9206651W WO 9303580 A1 WO9303580 A1 WO 9303580A1
Authority
WO
WIPO (PCT)
Prior art keywords
switches
stage
crossbar
message
set forth
Prior art date
Application number
PCT/US1992/006651
Other languages
French (fr)
Inventor
Aloke Guha
Michael B. Atlass
Original Assignee
Honeywell Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Honeywell Inc. filed Critical Honeywell Inc.
Priority to EP92917766A priority Critical patent/EP0598030A1/en
Priority to JP5503879A priority patent/JPH06509917A/en
Publication of WO1993003580A1 publication Critical patent/WO1993003580A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/25Routing or path finding in a switch fabric
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q3/00Selecting arrangements
    • H04Q3/64Distributing or queueing
    • H04Q3/68Grouping or interlacing selector groups or stages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/10Packet switching elements characterised by the switching fabric construction
    • H04L49/101Packet switching elements characterised by the switching fabric construction using crossbar or matrix
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/15Interconnection of switching modules
    • H04L49/1515Non-blocking multistage, e.g. Clos
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/20Support for services
    • H04L49/201Multicast operation; Broadcast operation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/25Routing or path finding in a switch fabric
    • H04L49/253Routing or path finding in a switch fabric using establishment or release of connections between ports
    • H04L49/254Centralised controller, i.e. arbitration or scheduling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/40Constructional details, e.g. power supply, mechanical construction or backplane

Definitions

  • This invention relates to a self-routing, non-blocking switch system for routing messages from any one of a number of inputs to any one of a number of outputs in an efficient and cost effective manner.
  • any system or subcomponent thereof which allows for ease of routing messages from any of a large number of inputs to any particular one of a large number of outputs is important.
  • the invention described herein allows for the routing of such messages with very little overhead, at high speed.
  • the invention herein employs a return network, is capable of using different kinds of switches in the switching network, may operate in a broadcast mode, may reduce the number of switches necessary by using a Clos network or by reliance on busses and controllers, and uses a self-routing algorithm, eliminating the need for preprocessing overhead.
  • Self-routing algorithms use information contained in the message itself to route the message at the switch level to the desired location.
  • Clos networks are preferred for larger networks because they reduce the number of switches necessary to accomplish a N x N message switching system to a much smaller number than N ⁇ . See, for example, U.S. Patent No. 4,696,000, issued to Payne, which refers to the Clos network and rearrangeable networks.
  • the switches may be arranged in a standard crossbar network for small scale implementations, a Clos network or a variation of a Clos network using busses, and the types of switches and routings, dependent on the switch type used, may also vary depending on the implementation.
  • a preferred implementation employs a return network for all blocked messages that wish to address the same output node and returns them to the source or sending node. Due to the nature of the routing system itself and the construction of the switching network, essentially no overhead or preprocessing is required to route messages through the system described from the input nodes to the output nodes.
  • Fig. 1 is a two-dimensional layout of the switches in an N x N crossbar.
  • Fig. 2 is a block diagram of a trap network for removing connection request conflicts.
  • Fig. 3 is a layout of a second implementation of an n by m crossbar.
  • Fig. 4 is a layout of a n by m crossbar with an associated return network.
  • Fig. 5a is a schematic diagram of 2 x 2 crossbar switch.
  • Fig. 5b is a diagram of a gate-level implementation of the switch described in Fig. 5a.
  • Fig. 6a (i), (ii), and (iii) are three possible modes of operation of the 2 x 2 broadcast switch.
  • Fig. 6a is a diagram of a gate-level implementation of a broadcast switch.
  • Fig. 7 is an electronic-level diagram of an implementation of a crossbar switch.
  • Fig. 8 is a 36 x 36 Clos network diagram.
  • Fig. 9 is a 16 x 16 Clos network diagram.
  • Fig. 10 is a layout of a non-blocking nr x nr self-routing Clos network with return networks.
  • Figs. 11a, b and c are diagrams of the inputs and outputs of the crossbars used in Fig. 10.
  • Fig. 12 is a schematic diagram of the bus implementation of a truly non-blocking network in accord with the invention.
  • Fig. 13 is a block diagram of a finite-state machine model of the bus controller which may be used with the invention as described in Fig. 12.
  • Fig. 14 is a detailed schematic diagram of the databus controller blocks employed in one of the 2n-l databus controller blocks used with the invention as described with reference to Fig. 12.
  • Fig. 15 is a drawing of a complete 4 x 4 crossbar with associated return net.
  • This invention teaches a truly non-blocking switching network for use as a messaging system which has a plurality of stages, at least the first and last stages comprising a multiplicity of crossbars, which may be called "units".
  • Each crossbar in the first stage can switch messages having headers with source and destination addresses and an associated turn signal indicator means for each.
  • the crossbar has a number of inputs, n, and a number of outputs, m, wherein the m inputs are connected by an n X m array of 2 X 2 switches organized into logical columns and logical rows labeled 1 to m and 1 to n, respectively, and wherein said switches are connected between said n input sources and m outputs-
  • the switches may be set into a pass or exchange mode based on the coincidence of two events, first that the message header source address matches the column address of the switch, and second, that the turn signal has not already been reset by a previous switch having a matching column address.
  • Each last stage crossbar or "unit” is of similar size and organization to each said first stage crossbar, except that the last stage crossbars have m inputs and n outputs.
  • Each stage has some number r which is the number of crossbars in the first and last stages.
  • the m outputs of each of the first stage crossbars are routed through the intermediary stage or stages to the m inputs of the last stage crossbars. For each output of the first stage there is a return path for the messages sent via said first stage outputs to subsequent stages.
  • the return network operates to return messages that attempt to route themselves to outputs which are already occupied. This re-routing or routing back is automatic, and takes less time than a time-out, especially in larger networks, to inform the sending node that the message has not gotten through. (A timeout is a counter or timer that waits some increment longer than the maximum delay, and upon finding the time expired or the counter full, a new action is begun.)
  • TABs Trusted Non-Blocking Networks
  • TNBs can provide for practical implementation for high speed and high band width applications using a distributed local routing control. (They are principally oriented toward electrical switching but additional benefits can be gained using optical switching, such as no separate line being required for a return signal.)
  • the crossbar switch in general can be thought of as a square or rectangular two-dimensional array connecting a set of inputs to an equal or, if rectangular, unequal, number of outputs.
  • a crossbar has NM crosspoints and can simultaneously provide any combination of input and output one-to-one connections in a non-blocking manner.
  • other methods besides using an array of switches, where each switch represents a crosspoint, have been used. These include for example, in electronics, bus arbitrated architecture used to simplify the quadratic complexity of switches and control and, in optics, outeiproduct matrix multiplying architectures have been used.
  • the primary approach here is to use switch level design for self-routing.
  • a TNB that requires less switches than are required in a crossbar was first proposed by Clos in A Study of Non-Blocking Switching Networks. Bell . System Technical Journal, 32, pp. 406-424, 1953.
  • the number of switches which would be required for a Clos network would be on the order of N x N, when a comparably sized crossbar would be N x N.
  • networks of more than three stages can be built from three-stage networks by successively replacing the center-stage by another three-stage CN.
  • the size of a crossbar can be designated with the sequence (n,m,r).
  • the Clos network will be truly non-blocking if m ⁇ 2n-l.
  • the CN is not truly non-blocking but may be rearrangeably non-blocking if m ⁇ 2n-l.
  • Fig. 1 in which a simple crossbar network (10) is shown.
  • each one of the 2 x 2 switches may be labeled as points in a matrix or, 1,1 to n,m.
  • the switches may be labeled with their column address only since this is the one which will be compared with the message header as will be explained later.
  • a Clos network is drawn for a 36 x 36 array of inputs to outputs.
  • the network is comprised of 6, 6 x 11 crossbars in column 1, plus 11, 6 x 6 crossbars in column 2, plus another 6, 11 x 6 crossbars in column 3.
  • an input on line 81 may be directed at switch or crosspoint 82 to connect to the first top line 83 of crossbar 84 in column 2, row 1, which may again be directed by switch 85 to output 86 of the same crossbar, connecting it to input 87 of crossbar 88. At this point it may only be routed to one of the six outputs of crossbar 88 by one of the six switches in that column of crossbar 88. It will occur to the reader at this point that three crosspoints are involved in transferring the message from input 81 to one of the outputs in crossbar 88.
  • the Clos network it will also be noted, is generally constructed having square root of N rows (plus m rows of center crossbars, not labeled) by three columns.
  • the network may be expanded and the number of switches further reduced by splitting the central portion into three in accordance with the teachings of Clos.
  • Fig. 2 in which a trap network scheme is used to remove connection request conflicts before they happen, according to the design 20. This design is avoided by the invention.
  • L Fig. 2 inputs 1-n are provided to a parallel sorting network 21 and provided, sorted, to a comparator stage 22.
  • the comparator stage 22 provides an indication of a conflict to the duplicate router stage 23 along with each message being passed.
  • the duplicate router stage 23 returns messages, along lines 24 or 25, to the inputs prior to the sorting network a, to be handled in accordance with whatever scheme is desired.
  • either the message header itself may go back to the input node where the message header and the message may be directed to a buffer, or any number of other things may be desired for messages which may require resending at some other time.
  • the crossbar 27 then allows its switches to be set in accordance with the instructions carried in the header which indicate the output address for each message routed through it. Using the teachings described herein, such an implementation, including units 21, 22, 23 and outputs 24 and 25, is not required. Switches
  • the basic 2 x 2 switch can be seen in Fig. 5a, 50, as having two inputs, IN 1 and IN 2, and two outputs, OUT 1 and OUT 2. It may be generally set in either a pass or exchange mode. Under the pass mode, lines 51 and 52 will be open and lines 53 and 54 will be closed. In the exchange mode, the reverse is true.
  • Fig. 5b the switch 50 is shown in logic diagram form.
  • the "change" in input 55 will control whether the switch is in the pass or exchange mode.
  • OUT 1 is open to 1 1 and closed to 12
  • OUT 2 is open to 12 and closed to 1 1.
  • OUT 1 is open to IN 2
  • OUT 2 is open to IN 1.
  • Fig. 6a three modes of operation of a broadcast switch logical connection are shown, i, ii, and iii.
  • mode i one input may be connected to both outputs.
  • mode ii the inputs are connected to their direct outputs.
  • mode iii the inputs are connected to their crossed outputs.
  • Fig. 6b a gate-level implementation of such a switch is shown.
  • the switch may be redesigned to eliminate broadcast conditions if desired. As is also true with the gate-level implementation of the switches described previously, this switch may be designed differently as is well known to those of ordinary skill in the art.
  • this invention requires reference to the intended address of the message. Thus, it requires a device similar to that described with reference to Fig. 7, 70, which contains the 2 x 2 switch, sw.
  • Input data containing the address is received by the crossbar switch device 70 across datalines 71 and 72.
  • a serial or parallel input may be desired.
  • a delay buffer 73a and 73b is provided before th data reaches the switch sw.
  • the address label part of the message is read into the address comparator 74 at line 75.
  • the signal output by comparator 74 on line 75 is positive if the switch address matches the appropriate part of the destination address.
  • Such a positive signal on line 75 latches And-gate 78. In this way, if the And 78 is not disabled by line 76 or reset by a signal from line
  • control output will provide input C to switch sw.
  • the C signal also changes the output on 77a from “turn” to "no-turn” by means of latched And
  • the switches should all have a default setting, either pass or exchange, and all should be set in that mode when beginning to make any connections.
  • the default setting is exchange.
  • the default settings and switches could all be reversed and appropriate adjustments made to accommodate the reversed order. Such adjustments would be principally to reverse the input lines to each switch. The switch itself may require redesign.
  • the header having a destination address will be routed along the row corresponding to the source from which i is sent. Thus, if it enters from input port or node 1, it will be routed along switches 1,1; 1,2; 1,3; ... l,m.
  • the message will be accompanied with a separate, active signal which will be denoted for the purposes of this explanation "turn".
  • the label on the switch will be compared with the destination address in the header of the message at each switch at which the message arrives. If they are the same and the turn signal is high, the switch will be set into, for example, the pass mode, and the turn signal will be reset by the switch (see Fig. 7). If a destination label or indicator does not match the address of the switch, or if the turn signal is low, the switch will remain in the default mode (in the preferred embodiment, the case exchange mode). Thus, if a message is to be transferred from node 1 to output node 3, at switch 1,1, it will be told to exchange to switch 1,2 across line a'. The turn signal will not have been reset since it is to continue to travel across this row. No match has been made so the switch will remain in an exchange condition.
  • a network 30 is shown having nearly the identical layout as the network of Fig. 1. The difference here is that each one of the input nodes broadcasts its message across all of the columns of switches so that each one may compare its address at the same time, providing for muc faster connections. These broadcast lines are indicated as lines 31-34. Return Networks
  • the key concept for eliminating the need for a conflict resolution device is the provision of the return network for each input node or line.
  • a simplified implementation is described with reference to Fig. 4 in which a crossbar network 40 and an associated return net 41 are shown.
  • a modification is also required in the header: both the destination address and the source address are required in order to route back the message which has found a conflict.
  • Each should also have its own turn signal.
  • the first part of the header should contain the destination address in the preferred embodiment. Thus, for a message from node 1, it is 1 ' of 1 to m outputs. If all goes well, the header routes itself to the proper output as described previously.
  • the header will be diverted to the return network 41.
  • input node 42 using input line 1 and a second input node using input line 2 have connection requests for output node 3 (and thus send messages with headers to node 3).
  • the header from the second node for example, reaches the switch 2,3 before the header from node 42, if the two headers are clocked out together.
  • switch 2,3 will be set (in the preferred embodiment) into the pass mode by the header from the second node and, therefore, the header from node 42 (input line 1) on arriving at node 2,3 will be routed through switch 2,n into the return net 41.
  • the switches in the return net will look only at the source address part of the header and its associated turn signal. Since the source has sent a message request, it should not be busy when it receives the request back, in this case across line 44. To get there it will have come through switch s from switch t.
  • the routing through the switches in the return net may work exactly the same way as in the forward net, with a separate turn signal for the sending address, which is only reset when the switch address matches the origin address, and it then stays in reset condition through the switches in that (diagonal) return switch line until the header with its message (if any) is returned across one of the long horizontal lines in the crossbar (see Fig. 15).
  • FIG. 9 illustrates a crossbar network similar to that illustrated in Fig. 8 but smaller.
  • Each input line Cl has an address, in this network being a 16 X 16 Clos network there addresses being (for base 10, 1 through 16 but as shown) for base 2, 0000 through 1111.
  • the destination addresses in column C3 have the same designation and C3 is likewise comprised of the same number of crossbar switches.
  • C2 has a greater number of smaller crossbar switches in accordance with the requirements of a Clos network. (The crossbars in columns Cl and C3 are of size n by 2n-l).
  • the number of crossbar switches in Cl and C3 is not a power of 2 (thus r is not a power of 2) the number should be increased until a power of 2 is achieved.
  • the number of bits and the label for each address is
  • no switches are considered latched thus they are all in the pass position.
  • the header containing the destination address is routed following the self-routing algorithm described above for the first column.
  • the destination address will be made up of two parts: the output crossbar address of
  • the algorithm is termed greedy in that any unused column may be used for routing.
  • the address 00— was free and a connection made on 91 to the second stage crossbar is C2.
  • the other message that wants this crossbar is coming in on 92. If they are going to different final crossbars as shown, across lines 93 and 94 there is no problem, otherwise the second message through will be blocked and have to be rerouted. This is why the greedy algorithm was created.
  • stage 1 and 3 conflicting requests due to true input request conflicts (where two input requests specify the same output address) are returned to the source node.
  • Crossbars l a to r a establish the first stage, 1 ⁇ to 2n-l jj the second and l c to r c the third.
  • Each has a return net 102 , similar to the one described in detail with reference Fig. 15.
  • Switch 3 in the first row is still held in the pass mode and the network message is routed through across to the return net.
  • the return net routes the message back to node 2. (All switches, once changed from the default state, should be held in that state for the maximum delay required for the message to return if blocked at the last stage.)
  • M2 then would be routed through the second row, 4th column switch, into the return net which would forward the header back to node 2, using the return address.
  • destination address can then be incremental and the header resent.
  • Figs. 11a, b, and c detail the number of lines used in each of the crossbars of Fig. 10.
  • the header's destination address is simply incremented and the header reset to try the next second stage switch.
  • the incrementing is only done to the first part of the destination address, however. So, for example, (with reference to Fig. 9 assuming it has return nets), if output node at receiving address 0000 wants to send a message to a receiving node at 0000, the incrementing can be done to the first two digits of the 0000 address, making the next attempt through line 01-of crossbar 95.
  • Clos networks having an intermediary stage or set of stages of crossbars, they can use busses for routing messages. This embodiment further reduces the number of switches but adds complexity in that it requires bus controllers.
  • Fig. 12 in which a network 120 is shown having again 1 through r crossbars with return nets for both the first and third stages. Again, as in Fig. 10, the return lines are shown attached near the sending line even though they are located as described with reference to Fig. 15 on the side opposite the output.
  • the difference in structure between this Fig. and Fig. 10 is that there are 2n-l controllers 121 j - 12l2 n _i- Each controller itself has a set of r data bus controllers 121 ⁇ - 121 ⁇ .
  • a signal or message sent from node 1 of crossbar 123 passes on output line 1 to Data and Bus Controller 121 Id * If the message is supposed to go to crossbar 124, for one of its 1 to
  • a bus controller 140 is shown. This is essentially an exploded view of the box labeled 121 j in Fig. 12. However, the diagram also illustrates a variation in embodiments in which a demultiplexer is used rather than an input and output line from and to each one of the first stage crossbars and their return networks.
  • the input data coming in on line 141 is demultiplexed by looking at the destination address, first part, and the request is sent to one of the bus control units, say bus control 1.
  • Bus control 1 then needs to seize databus 1 in order to send the message as data across the data line to databus 1. To do so it first checks the status of bus status 1 to determine if databus 1 is busy.
  • Inputs Parallel Input Data/Header bus, the Bus Status (0/1), and the Next_Stage Response, a single packet of data equal to the bus width, usually a header bit, and the source and destination addresses sent back from the next stage.
  • the input Data/Header always contains 2 reserved bits for routing.
  • bus lines are tri-state (low, high, high impedance) to distinguish between no signal and O-valued signal.
  • Input Data/Header bit 00 /"'leading bit denotes header or message packet*/
  • this implementation in routing through the second stage assumes that the source node operates in two modes: first it sends out the requesting header packet. If the response received from the routers is negative, it retransmits the headers until it obtains a connection. Second, if a positive response is received, the source node transmits its message in packets until it frees all the latches switches in the crossbars or the busses it had used. Routing Delays
  • the worst case delay in setting up the routing path in this implementation is 0(2n+nlog2r).
  • the switching delay is smaller since no conflicts have to be further resolved.

Abstract

Described is a set of network organizations for allowing truly nonblocking messaging through very large switching systems. Crossbars and Clos organizations are employed. Return pathways are used to indicate blocked messages (due to busy output nodes or receiving nodes). On busy, a message may be rerouted through alternate paths through the network. A design for using busses for part of the network is also described.

Description

SCALABLE SELF-ROUTING NON-BLOCKING MESS AGE
SWITCHING AND ROUTING NETWORK
This invention relates to a self-routing, non-blocking switch system for routing messages from any one of a number of inputs to any one of a number of outputs in an efficient and cost effective manner.
BACKGROUND OF THE INVENTION As data transfer becomes more and more complex, the numbers of the messages which need to be transferred across a network from one location to another, or one area of a computing device to another, has grown remarkably.
Accordingly, any system or subcomponent thereof which allows for ease of routing messages from any of a large number of inputs to any particular one of a large number of outputs is important. The invention described herein allows for the routing of such messages with very little overhead, at high speed. The invention herein employs a return network, is capable of using different kinds of switches in the switching network, may operate in a broadcast mode, may reduce the number of switches necessary by using a Clos network or by reliance on busses and controllers, and uses a self-routing algorithm, eliminating the need for preprocessing overhead. Self-routing algorithms use information contained in the message itself to route the message at the switch level to the desired location.
Prior self-routing algorithms such as the one described in the 1990 IEEE (Vol. 78, No. 1), entitled "Fast Packet Switch Architectures for Broad-Band Integrated Services Digital Networks", by Tobagi, do not indicate how these algorithms can actually operate, and do not describe how their operation can be accomplished when using a Clos-type network.
Clos networks are preferred for larger networks because they reduce the number of switches necessary to accomplish a N x N message switching system to a much smaller number than N^. See, for example, U.S. Patent No. 4,696,000, issued to Payne, which refers to the Clos network and rearrangeable networks.
The invention described herein simplifies all these ideas.
The switches may be arranged in a standard crossbar network for small scale implementations, a Clos network or a variation of a Clos network using busses, and the types of switches and routings, dependent on the switch type used, may also vary depending on the implementation. As described in the detailed description, a preferred implementation employs a return network for all blocked messages that wish to address the same output node and returns them to the source or sending node. Due to the nature of the routing system itself and the construction of the switching network, essentially no overhead or preprocessing is required to route messages through the system described from the input nodes to the output nodes.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a two-dimensional layout of the switches in an N x N crossbar.
Fig. 2 is a block diagram of a trap network for removing connection request conflicts. Fig. 3 is a layout of a second implementation of an n by m crossbar.
Fig. 4 is a layout of a n by m crossbar with an associated return network.
Fig. 5a is a schematic diagram of 2 x 2 crossbar switch. Fig. 5b is a diagram of a gate-level implementation of the switch described in Fig. 5a.
Fig. 6a (i), (ii), and (iii) are three possible modes of operation of the 2 x 2 broadcast switch.
Fig. 6a is a diagram of a gate-level implementation of a broadcast switch. Fig. 7 is an electronic-level diagram of an implementation of a crossbar switch.
Fig. 8 is a 36 x 36 Clos network diagram.
Fig. 9 is a 16 x 16 Clos network diagram. Fig. 10 is a layout of a non-blocking nr x nr self-routing Clos network with return networks.
Figs. 11a, b and c are diagrams of the inputs and outputs of the crossbars used in Fig. 10.
Fig. 12 is a schematic diagram of the bus implementation of a truly non-blocking network in accord with the invention.
Fig. 13 is a block diagram of a finite-state machine model of the bus controller which may be used with the invention as described in Fig. 12.
Fig. 14 is a detailed schematic diagram of the databus controller blocks employed in one of the 2n-l databus controller blocks used with the invention as described with reference to Fig. 12.
Fig. 15 is a drawing of a complete 4 x 4 crossbar with associated return net.
SUMMARY OF THE INVENTION Network
This invention teaches a truly non-blocking switching network for use as a messaging system which has a plurality of stages, at least the first and last stages comprising a multiplicity of crossbars, which may be called "units". Each crossbar in the first stage can switch messages having headers with source and destination addresses and an associated turn signal indicator means for each. The crossbar has a number of inputs, n, and a number of outputs, m, wherein the m inputs are connected by an n X m array of 2 X 2 switches organized into logical columns and logical rows labeled 1 to m and 1 to n, respectively, and wherein said switches are connected between said n input sources and m outputs- The switches may be set into a pass or exchange mode based on the coincidence of two events, first that the message header source address matches the column address of the switch, and second, that the turn signal has not already been reset by a previous switch having a matching column address.
Each last stage crossbar or "unit" is of similar size and organization to each said first stage crossbar, except that the last stage crossbars have m inputs and n outputs. Each stage has some number r which is the number of crossbars in the first and last stages. The m outputs of each of the first stage crossbars are routed through the intermediary stage or stages to the m inputs of the last stage crossbars. For each output of the first stage there is a return path for the messages sent via said first stage outputs to subsequent stages.
Various forms of return paths are described, some for light-based and some for electronic-based systems. There is also described a bus-based intermediary stage which eliminates a large number of the switches.
DETAILED DESCRIPTION OF THE INVENTION Several concepts are included here which are all centered on the idea that self-routing algorithms may be employed to provide local control of 2 x 2 switches which comprise various networks. In doing this, the need for a global controller monitoring all of the inputs and all of the outputs, and their connections, is not required. Where plain crossbars are used, a total of NM of the 2 x 2 switches wo ld be required for an N input and M output network. Because the switches are relatively simple in functionality and layout of the design would be extremely regular, such networks can be fabricated easily in electronic as well as optical implementations. Using a Clos variety of truly non-blocking switching network, even the number of switches may be substantially reduced. Even further reductions in the number of switches required is possible using the bus design. This kind of savings improves the scalability of the inventive concepts described herein.
For example, in a 36 x 36 network using a straight crossbar design, 1,296 2 x 2 switches would be required. Using a Clos network design, only 1,188 switches would be required. When moving to a network of approximately 128 x 128 design, 16,384 switches would be required, whereas with a Clos design, only 9,936 switches would be required.
An additional and important part of this set of inventions is the use of a return network. The return network operates to return messages that attempt to route themselves to outputs which are already occupied. This re-routing or routing back is automatic, and takes less time than a time-out, especially in larger networks, to inform the sending node that the message has not gotten through. (A timeout is a counter or timer that waits some increment longer than the maximum delay, and upon finding the time expired or the counter full, a new action is begun.)
In providing this description, some terminology ought first be defined. The first definition would be "Truly Non-Blocking Networks" (TNBs). A truly non-blocking network is one in which rearrangement of existing conditions is not required in order that every message get through. It requires a set of features, including the following:
(i) setting the appropriate network switches for all requested connections,
(ii) ensuring that no conflict occurs (a conflict occurs when multiple inputs are connected to the same output), (iii) in the event of a conflict, only one connection may be satisfied while the other requests must be kept waiting or informed,
(iv) in case of broadcasting networks, the control must also allow a single source to be connected to multiple outputs. The inventive concepts described in this application show how TNBs can provide for practical implementation for high speed and high band width applications using a distributed local routing control. (They are principally oriented toward electrical switching but additional benefits can be gained using optical switching, such as no separate line being required for a return signal.)
The crossbar switch in general can be thought of as a square or rectangular two-dimensional array connecting a set of inputs to an equal or, if rectangular, unequal, number of outputs. Thus, for N inputs and M outputs, a crossbar has NM crosspoints and can simultaneously provide any combination of input and output one-to-one connections in a non-blocking manner. In practice, other methods besides using an array of switches, where each switch represents a crosspoint, have been used. These include for example, in electronics, bus arbitrated architecture used to simplify the quadratic complexity of switches and control and, in optics, outeiproduct matrix multiplying architectures have been used. The primary approach here is to use switch level design for self-routing.
A TNB that requires less switches than are required in a crossbar was first proposed by Clos in A Study of Non-Blocking Switching Networks. Bell . System Technical Journal, 32, pp. 406-424, 1953. A general Clos network (CN) has an odd number, say, 2k+ 1 (where k=l,2,....), of stages and is built in a modular design employing a multiplicity of crossbars which are substantially smaller than the total N x N size desired. Thus, the number of switches which would be required for a Clos network would be on the order of N x N, when a comparably sized crossbar would be N x N. (See above paragraph.) Also, networks of more than three stages can be built from three-stage networks by successively replacing the center-stage by another three-stage CN.
It is known that in a N x N Clos network, if the input stage is built of r crossbars of dimension n x m where rn=N, the second stage is made up of m, r x r crossbars, and the third stage is made up of r, m x n crossbars. The size of a crossbar can be designated with the sequence (n,m,r). The Clos network will be truly non-blocking if m ≥ 2n-l. The CN is not truly non-blocking but may be rearrangeably non-blocking if m < 2n-l. It is not particularly desirable to have a rearrangeably non-blocking network in that messages may be required to be interrupted to rearrange the network when such rearrangement becomes necessary. For applications, therefore, requiring long duration or large messages, interruptions are extremely undesirable. Refer first to Fig. 1 in which a simple crossbar network (10) is shown.
It has inputs 1 through n and outputs 1 through m as shown, and each one of the 2 x 2 switches may be labeled as points in a matrix or, 1,1 to n,m. The switches may be labeled with their column address only since this is the one which will be compared with the message header as will be explained later. In Fig. 8, a Clos network is drawn for a 36 x 36 array of inputs to outputs. The network is comprised of 6, 6 x 11 crossbars in column 1, plus 11, 6 x 6 crossbars in column 2, plus another 6, 11 x 6 crossbars in column 3. Thus, an input on line 81 may be directed at switch or crosspoint 82 to connect to the first top line 83 of crossbar 84 in column 2, row 1, which may again be directed by switch 85 to output 86 of the same crossbar, connecting it to input 87 of crossbar 88. At this point it may only be routed to one of the six outputs of crossbar 88 by one of the six switches in that column of crossbar 88. It will occur to the reader at this point that three crosspoints are involved in transferring the message from input 81 to one of the outputs in crossbar 88. The Clos network, it will also be noted, is generally constructed having square root of N rows (plus m rows of center crossbars, not labeled) by three columns. The network may be expanded and the number of switches further reduced by splitting the central portion into three in accordance with the teachings of Clos. Refer now to Fig. 2 in which a trap network scheme is used to remove connection request conflicts before they happen, according to the design 20. This design is avoided by the invention. L Fig. 2, inputs 1-n are provided to a parallel sorting network 21 and provided, sorted, to a comparator stage 22. The comparator stage 22 provides an indication of a conflict to the duplicate router stage 23 along with each message being passed. The duplicate router stage 23 returns messages, along lines 24 or 25, to the inputs prior to the sorting network a, to be handled in accordance with whatever scheme is desired. For instance, either the message header itself may go back to the input node where the message header and the message may be directed to a buffer, or any number of other things may be desired for messages which may require resending at some other time. The crossbar 27 then allows its switches to be set in accordance with the instructions carried in the header which indicate the output address for each message routed through it. Using the teachings described herein, such an implementation, including units 21, 22, 23 and outputs 24 and 25, is not required. Switches
Before moving on to a description of the algorithm for self-routing, and because it would create some complications to describe them later, a description of the switches which may be used with this device, or at least their functionality, is now described.
While this invention may be well suited to light switches, only the electronic versions are shown, however, it should be clear to one of ordinary skill in the art that light switches will provide enhanced flexibility allowing for return signals over any path once established. Higher speed communications, higher bandwidth and so forth become available when using light switching devices. Also, by not requiring messages to get a return routing, the return network described herein may even be avoided when using light switches. The basic 2 x 2 switch can be seen in Fig. 5a, 50, as having two inputs, IN 1 and IN 2, and two outputs, OUT 1 and OUT 2. It may be generally set in either a pass or exchange mode. Under the pass mode, lines 51 and 52 will be open and lines 53 and 54 will be closed. In the exchange mode, the reverse is true.
In Fig. 5b, the switch 50 is shown in logic diagram form. The "change" in input 55 will control whether the switch is in the pass or exchange mode. Thus, when the input to 55 is zero, OUT 1 is open to 1 1 and closed to 12, and OUT 2 is open to 12 and closed to 1 1. When 55 is one, the reverse is true. OUT 1 is open to IN 2 and OUT 2 is open to IN 1.
Referring now to Fig. 6a, three modes of operation of a broadcast switch logical connection are shown, i, ii, and iii. In mode i, one input may be connected to both outputs. In mode ii, the inputs are connected to their direct outputs. In mode iii, the inputs are connected to their crossed outputs. In Fig. 6b, a gate-level implementation of such a switch is shown.
Thus, where input 66 is zero and input 65 is hi (or "one"), the output of a' will be b and not a, and the output of b' will be a and not b, thus, a switch condition. If input 66 is zero, or low, and output 65 is also low, there will be a broadcast pass condition in which both the outputs will be b. Likewise, if the input 66 is hi and the output 65 is low, the input b will be passed on a' , and the input b will also be passed on the output b' . In the case where both 66 and 65 are hi, the output of a' will be b, and the output of b1 will be b, another broadcast condition.
The switch may be redesigned to eliminate broadcast conditions if desired. As is also true with the gate-level implementation of the switches described previously, this switch may be designed differently as is well known to those of ordinary skill in the art.
In setting the switches, or not setting them, or setting them into a broadcast state, this invention requires reference to the intended address of the message. Thus, it requires a device similar to that described with reference to Fig. 7, 70, which contains the 2 x 2 switch, sw. Input data containing the address is received by the crossbar switch device 70 across datalines 71 and 72. Depending on the overall implementation, either a serial or parallel input may be desired. Either way, a delay buffer 73a and 73b is provided before th data reaches the switch sw. The address label part of the message is read into the address comparator 74 at line 75. The signal output by comparator 74 on line 75 is positive if the switch address matches the appropriate part of the destination address. Such a positive signal on line 75 latches And-gate 78. In this way, if the And 78 is not disabled by line 76 or reset by a signal from line
77, the control output will provide input C to switch sw. The C signal also changes the output on 77a from "turn" to "no-turn" by means of latched And
78. Thus, all succeeding switches would receive the changed turn signal, allowing the message to continue down the column. Different schema for this switch which perform similar functions may be developed easily within the skill of those of ordinary skill in the art. This particular design is preferred for the networks and algorithms described, but variations will be apparent to those of ordinary skill in the art. For example, line 76 could be removed without any loss of function described herein. There may be parallel input paths if desired. May other variations may be constructed without circumventing this invention as claimed. The Basic Self-routing Algorithm
Please refer to Fig. 1. The switches should all have a default setting, either pass or exchange, and all should be set in that mode when beginning to make any connections. For the preferred embodiment, the default setting is exchange. (If desired, the default settings and switches could all be reversed and appropriate adjustments made to accommodate the reversed order. Such adjustments would be principally to reverse the input lines to each switch. The switch itself may require redesign.) When a message arrives at any switch, the header having a destination address will be routed along the row corresponding to the source from which i is sent. Thus, if it enters from input port or node 1, it will be routed along switches 1,1; 1,2; 1,3; ... l,m. The message will be accompanied with a separate, active signal which will be denoted for the purposes of this explanation "turn". The label on the switch will be compared with the destination address in the header of the message at each switch at which the message arrives. If they are the same and the turn signal is high, the switch will be set into, for example, the pass mode, and the turn signal will be reset by the switch (see Fig. 7). If a destination label or indicator does not match the address of the switch, or if the turn signal is low, the switch will remain in the default mode (in the preferred embodiment, the case exchange mode). Thus, if a message is to be transferred from node 1 to output node 3, at switch 1,1, it will be told to exchange to switch 1,2 across line a'. The turn signal will not have been reset since it is to continue to travel across this row. No match has been made so the switch will remain in an exchange condition.
On arriving at switch 1,2, the message header will again be compared with the address of the switch. On again finding no match, and the turn signal again not reset, the message again will be exchanged through to switch 1,3 on line a'2. When the message reaches switch 1,3, the turn signal will be reset.
When the control signal is provided to switch sw (see Fig. 7) because the address matches, the switch position will be changed to pass, and the message header and ensuing message will be transferred from input line a'2 to output line b'3 of switch 1,3. The same control signal C, will reset the turn signal to its opposite.
There are several ways to reset turn signals when the message is complete, such as resetting the entire network together at controlled intervals determined based on predetermined message sizing. However, the inventive concepts herein will work with numerous global or pass message reset techniques so these are not explained further here.
Refer now to Fig. 3 in which a network 30 is shown having nearly the identical layout as the network of Fig. 1. The difference here is that each one of the input nodes broadcasts its message across all of the columns of switches so that each one may compare its address at the same time, providing for muc faster connections. These broadcast lines are indicated as lines 31-34. Return Networks
The key concept for eliminating the need for a conflict resolution device is the provision of the return network for each input node or line. A simplified implementation is described with reference to Fig. 4 in which a crossbar network 40 and an associated return net 41 are shown. For each node 42 or input line 1, an output as well as the input line is provided. A modification is also required in the header: both the destination address and the source address are required in order to route back the message which has found a conflict. Each should also have its own turn signal. The first part of the header should contain the destination address in the preferred embodiment. Thus, for a message from node 1, it is 1 ' of 1 to m outputs. If all goes well, the header routes itself to the proper output as described previously. However, if the destination output line is already busy, the header will be diverted to the return network 41. Consider, for example, a case where input node 42 using input line 1, and a second input node using input line 2, have connection requests for output node 3 (and thus send messages with headers to node 3). The header from the second node, for example, reaches the switch 2,3 before the header from node 42, if the two headers are clocked out together. Thus, switch 2,3 will be set (in the preferred embodiment) into the pass mode by the header from the second node and, therefore, the header from node 42 (input line 1) on arriving at node 2,3 will be routed through switch 2,n into the return net 41. The switches in the return net will look only at the source address part of the header and its associated turn signal. Since the source has sent a message request, it should not be busy when it receives the request back, in this case across line 44. To get there it will have come through switch s from switch t. The routing through the switches in the return net may work exactly the same way as in the forward net, with a separate turn signal for the sending address, which is only reset when the switch address matches the origin address, and it then stays in reset condition through the switches in that (diagonal) return switch line until the header with its message (if any) is returned across one of the long horizontal lines in the crossbar (see Fig. 15).
(In cases where it is being routed back to a previous stage crossbar, the turn signal must change again before it is processed in the lower stage.)
See Fig. 15 (or Fig. 4) for a complete 4 x 4 array of switches 150 and an associated return network. If Node 1 sends a message Mj at the same time Node 2 sends a message M2 the routing through both the crossbars and the return net 151 can be traced on this diagram as labelled by following the Mj and M2 labels. (Note that to the right of dotted line 153, the switches may be shorted permanently if a different turn signal scheme is used than that described in the preferred embodiment.) (You can also eliminate the connection from the topmost row switch in the crossbar to the return net and the lower right corner switch in the return net because no output from row 1 will ever need to be returned unless the crossbar receives messages back from a later stage crossbar by bus.) The Clos Implementation This self-routing algorithm may be applied to any N x N Clos network with r n x m crossbars in the first and third stages, and where rn=N, m rXr crossbars in the second stage. Also, because we are interested in TNBs, m≥2n-l. Previously, solving a self-routing crossbar network problem in a distributed manner, permutation of the rn requests such that the requests (to route a message) routed to addresses in different crossbars in the third stage are routed from different crossbars in the input stages. See, for example, Lin et al, Two-Dimensional Optical Clos Interconnection Network and Its Uses.
Applied Optics, Volume 27, No. 9, May 1, 1988, pp. 1734-1741. This solution, however, requires permutating or rearranging all nr requests simultaneously and cannot be implemented in the self-routing distributed manner used by this invention. The self-routing approach described previously works fine for crossbars but not for Clos networks. The important inventive aspect for Clos networks for achieving non-blocking self-routing is in selecting the crossbars to be used in routing the requests through the second stage of the network. Therefore, when conflicts are detected in the first n shared crossbars of the second stage crossbars, these messages are re-routed using the remaining n-1 crossbars in the second stage. "Greedy" Self-Routing Algorithm For Multistage or Clos Networks
Refer now to Fig. 9 which illustrates a crossbar network similar to that illustrated in Fig. 8 but smaller. Each input line Cl has an address, in this network being a 16 X 16 Clos network there addresses being (for base 10, 1 through 16 but as shown) for base 2, 0000 through 1111. The destination addresses in column C3 have the same designation and C3 is likewise comprised of the same number of crossbar switches. C2 has a greater number of smaller crossbar switches in accordance with the requirements of a Clos network. (The crossbars in columns Cl and C3 are of size n by 2n-l). For the purposes of the preferred embodiment if the number of crossbar switches in Cl and C3 is not a power of 2 (thus r is not a power of 2) the number should be increased until a power of 2 is achieved. Thus it can be seen that, for the preferred embodiment, with four digits the number of bits and the label for each address is |log2r | .
In the preferred embodiment again, no switches are considered latched thus they are all in the pass position. When any message arrives at any switch, the header containing the destination address is routed following the self-routing algorithm described above for the first column. The destination address will be made up of two parts: the output crossbar address of |log2r| bits and a local destination address | log2*tι | bits within the individual crossbar. This matching is performed only on the first |log2r| bits.
If the message header finds that the desired address (|log2r| , say 00--) output is busy in the first of the intermediary switches in column C2, it will be routed to the next intermediary switch and so on until a path to the crossbar switch in column C3 desired is achieved. Thus the algorithm is termed greedy in that any unused column may be used for routing. In the illustrated example of Fig. 9 the address 00— was free and a connection made on 91 to the second stage crossbar is C2. The other message that wants this crossbar is coming in on 92. If they are going to different final crossbars as shown, across lines 93 and 94 there is no problem, otherwise the second message through will be blocked and have to be rerouted. This is why the greedy algorithm was created.
The key to resolving conflicts in each stage is the use of the crossbar with the return network for conflict detection and resolution. Stage 1 and 3 conflicting requests, due to true input request conflicts (where two input requests specify the same output address) are returned to the source node.
However, conflicts in the second stage or second column that arise due to sharing of the crossbars in the second stage are resolved by detecting conflicts in those labelled crossbars and then, on receiving the return header, scanning available connections in the remaining second column crossbars. In the worst case then, a request may scan through n outputs in its stage one or Cl crossbar before its request is met by the (2n-l)th output.
Another way to say this is there are two forms of 10 conflict that can occur, the first occurs in the first or third stages due to direct input conflicts. This can be where two or more inputs request the same output address in the third stage or subaddresses in the first stage. These are handled by the conflict resolution as described in the case of the single crossbar. The second kind of conflict arises due to the sharing of crossbars in the middle stages and requires scanning the last n-1 free crossbars to resolve the conflicts. Because of these multiple stage conflicts, crossbars 20 in the second or middle and third stages must provide return paths so that when the header part of the message is sent, the response can be sent back to the source to confirm that a routing path is available. Thus, the multi-stage network acts more like a router than a pure interconnection network. Please refer to Fig. 10 which describes such a system, including the return nets and extra return paths.
Again, if light switches are used, return paths may be obviated because the light can travel back over the same open switch that it traveled to get to the destination node. It should also be noted here that electronic switches with return paths opened at the same time and location as sending paths would also obviate the need for a return net or separate return path system.
Crossbars la to ra establish the first stage, 1^ to 2n-ljj the second and lc to rc the third. Each has a return net 102 , similar to the one described in detail with reference Fig. 15. For every connection between crossbars like line 103 there is a return line like 103r. Please refer briefly back to Fig. 15 and line 103r thereon.
For ease of explanation, assume the network is populated with 4 x 4 crossbars in the first stage like 150 of Fig. 15. Output 1 goes to the first crossbar in the second stage, output 2 to the second, etc. The line returning from the first crossbar of the second stage would be line 103r, from the secon crossbar of the second stage would be line 104r, from the third crossbar of the second stage would be return line 105r, and from the fourth crossbar of the second stage, the return line would be 106r. Reviewing the two message state of crossbar 150 described earlier, two messages, Mj and M2, have tried to reach output 3, only message M2 made it. Mj was returned. Let us assume M2 did not make it through the third crossbar of the second stage. It would then be returned on line 105r. Switch 3 in the first row is still held in the pass mode and the network message is routed through across to the return net. The return net routes the message back to node 2. (All switches, once changed from the default state, should be held in that state for the maximum delay required for the message to return if blocked at the last stage.) M2 then would be routed through the second row, 4th column switch, into the return net which would forward the header back to node 2, using the return address. As in the explanation which follows, destination address can then be incremental and the header resent.
Figs. 11a, b, and c detail the number of lines used in each of the crossbars of Fig. 10.
The remaining detail is how to scan to the next or subsequent second stage crossbar when the original choice is blocked. When the message is returned to its originating node, the header's destination address is simply incremented and the header reset to try the next second stage switch. The incrementing is only done to the first part of the destination address, however. So, for example, (with reference to Fig. 9 assuming it has return nets), if output node at receiving address 0000 wants to send a message to a receiving node at 0000, the incrementing can be done to the first two digits of the 0000 address, making the next attempt through line 01-of crossbar 95. This would send the header to the second crossbar in C2 (not shown) whose switches would look at the second part of the destination address, thus routing the message to the correct stage 3 or C3 crossbar, in this case crossbar 96. If a third crossbar in stage 3 is tried, because the second part of the destination address remains unchanged, the message will still be routed to crossbar 96 if the output address is 0000. Use of Busses
Instead of Clos networks having an intermediary stage or set of stages of crossbars, they can use busses for routing messages. This embodiment further reduces the number of switches but adds complexity in that it requires bus controllers. Refer to Fig. 12 in which a network 120 is shown having again 1 through r crossbars with return nets for both the first and third stages. Again, as in Fig. 10, the return lines are shown attached near the sending line even though they are located as described with reference to Fig. 15 on the side opposite the output. The difference in structure between this Fig. and Fig. 10 is that there are 2n-l controllers 121j - 12l2n_i- Each controller itself has a set of r data bus controllers 121^ - 121^.
The algorithm, however, works about the same way on this structure as it does on the kind illustrated in Fig. 10. Thus, a signal or message sent from node 1 of crossbar 123, passes on output line 1 to Data and Bus Controller 121 Id* If the message is supposed to go to crossbar 124, for one of its 1 to
2n-l outputs it will try line 1 of its set of busses. If there is no "busy" signal set by some v output on line a of bus 1, the message will pass onto the bus line b on output w. These v and w outputs are bi-directional (or may be paired inputs and outputs, thus 2 v's and 2 w's if desired). If this bus line is busy, the message will be returned to node 1, incremented and resent on line 2 (not shown) to the next controller. The next controller 1212 (not shown but next in series) would try line 1 of its first Data/Bus Controller, which also connects to crossbar 124. This would go on until line 2n-l of crossbar 123 ^ is tried, and a link attempted through Data and Bus Controller 12l2n(j is tried and its line 1. If still unsuccessful the next increment can reset the header to try line 1 and start the cycle over. Enhancements to this concept, such as sending a signal backward beyond the input node to cancel a message or resend later, should be apparent to one of ordinary skill in the art, without going outside the scope of this invention.
(Note also that there are single input/output lines between busses and crossbar 124r and paired input and output lines between busses and crossbar 124, in the drawing. This is done to show that it could be either way, although it would be preferred if all the crossbars in a stage are connected to busses in the same manner).
The data/bus controller is next described for completeness. Refer first to Fig. 14 in which a bus controller 140 is shown. This is essentially an exploded view of the box labeled 121 j in Fig. 12. However, the diagram also illustrates a variation in embodiments in which a demultiplexer is used rather than an input and output line from and to each one of the first stage crossbars and their return networks. The input data coming in on line 141 is demultiplexed by looking at the destination address, first part, and the request is sent to one of the bus control units, say bus control 1. Bus control 1 then needs to seize databus 1 in order to send the message as data across the data line to databus 1. To do so it first checks the status of bus status 1 to determine if databus 1 is busy. This is done on line status 1. If it is not busy, it places a high signal or other appropriate signal on bus 1 status line across the set/reset 1 line of bus control 1. If the status is busy, bus control 1 will rerout the request out the response line which is wire-ored to line 142. A turn signal will be tagged onto this returned message so that it may find its way back through the return net. As was described in the earlier implementation, such a turn signal for return coming from a intermediary level or second stage crossbar was not required because each line went to the appropriate address in the return net. FSM of the Bus Control
1. Inputs: Parallel Input Data/Header bus, the Bus Status (0/1), and the Next_Stage Response, a single packet of data equal to the bus width, usually a header bit, and the source and destination addresses sent back from the next stage. The input Data/Header always contains 2 reserved bits for routing.
2. Outputs: Data Bus, parallel output header/data to the next stage, Set/Reset control for the bus status line, Response_Back lines that contains a bit (available or note available) arid source/destination information which may also be obtained form the Next_Stage_Response.
Note the bus lines are tri-state (low, high, high impedance) to distinguish between no signal and O-valued signal.
3. State Transitions:
IF Input Data/Header bit = 00 /"'leading bit denotes header or message packet*/
AND BusJStatus =1 /*Bus occupied or busy*/ THENResponse_Back = Ollnput Data/Header; /send a not_available signal in response*/ ELSE Response_Back = lllnput Data/Header;
/send available signal in response*/ BusJStatus = 1 /*set bus status busy reserve the bus lines*/ ELSE IF Input Data/Header bit = 10 /*message packet received after the source received an available signal*/
Data Bus = Input Data/Header /*output bus = input bus - transmit data*/ IF Input Data/Header bit = 11 /*special code for end of message*/ THEN Bus Status = 0 /*free the bus*/ Data_Bus = XXX /*set bus back to high impedance state*/ We note that this implementation in routing through the second stage assumes that the source node operates in two modes: first it sends out the requesting header packet. If the response received from the routers is negative, it retransmits the headers until it obtains a connection. Second, if a positive response is received, the source node transmits its message in packets until it frees all the latches switches in the crossbars or the busses it had used. Routing Delays
We will only consider the average delays in setting up the routing path in the bus-based self-routing Clos network. The average delays in the first and third stage as before is (n+1) for the XI implementation and n/2 for the X2 implementation. The delay in the second stage is not as simple. Consider the delay in determining a conflict using the Data/Bus Controller architecture. From Figure 14, it will be clear that determining the availability of a connection is t^ (access time) = (log2r+k) where ^r is delay in the demultiplexer while k is a constant due the latching delay in the FSM of the bus control and response logic. This delay t^ is repeated at most n times if in the worst call all free n crossbars in the second stage are scanned. Thus, the worst case delay in setting up the routing path in this implementation is 0(2n+nlog2r). Once the routing path is set up, the switching delay is smaller since no conflicts have to be further resolved. Since the optimal value of r=n=VN, the worst case delay in setting up the routing path is O(VrN(2+log2vN)) (only O(-/N) for normal routing delay). We believe an optical implementation may have lower delays than the electronic one when implementing the switch version.

Claims

What is claimed is:
1. A truly non-blocking switching network for use as a messaging system having a plurality of stages, at least the first and last stages comprising a multiplicity of crossbars, which may be called "units", each said crossbar in said first stage being for switching messages having headers with source and destination addresses and an associated turn signal indicator means for each, wherein the crossbar has a number of inputs, n, and a number of outputs, m, wherein the m inputs are connected by an n X m array of 2 X 2 switches organized into logical columns and logical rows labeled 1 to m and 1 to n, respectively, and wherein said switches are connected between said n input sources and m outputs, and said switches may be set into a pass or exchange mode based on the coincidence of two events, first that the message header source address matches the column address of the switch, and second, that the turn signal has not already been reset by a previous switch having a matching column address, and wherein each said last stage crossbar or "unit" being of similar size and organization to each said first stage crossbar, except that the last stage crossbars have m inputs and n outputs, and wherein each said stage being arranged such that there is some number r which is the number of crossbars in the first and last stages and wherein the m outputs of each of the first stage crossbars are routed through the intermediary stage or stages to the m inputs of the last stage crossbars, and wherein for each output of the first stage there is a return path for the messages sent via said first stage outputs to subsequent stages.
2. A messaging system as set forth in claim 1 wherein output from outputs
1 to m go to units 1 to m in the next subsequent stage and wherein the return path from a subsequent stage unit returns to the same crossbar from which it originated, and into the same column number 1 to m of the inputs to the switches in row 1 of the crossbar of the earlier stage unit.
3. A messaging system as set forth in claim 1 wherein the route a message takes through the system may be called a message pathway, and said 2 x 2 switches are configured so that when they are set for a message pathway, they allow a return pathway through them in the opposite direction.
4. A messaging system as set forth in claim 3 wherein the return pathway is constructed via a duplicate set of electronic pathways set by the switches set by the switches at the same time the message pathway is set.
5. A messaging system as set forth in claim 3 wherein the 2 x 2 switches are light switches, which may be called light valves, and allow messaging over the same light pathway.
6. A messaging system as set forth in claim 1 wherein an intermediary stage is comprised of units connected to a single output from each unit or crossbar of a preceding stage and wherein each unit in said intermediary stage is comprised of Data Bus controllers 1 to r, each said controller being in data and control communication to a bus wherein each controller in a stage is connected to the same 1 to r busses, and wherein each bus is connected to one crossbar of the subsequent stage.
7. A messaging system as set forth in claim 1 wherein an intermediary stage is comprised of crossbars connected to a single output from each crossbar of a preceding stage and wherein there are a number of crossbars, labelable as 1 to 2n-l, each said crossbar r inputs and r outputs connecting it to each of the preceding stage crossbars and to each of the subsequent stage crossbars.
8. A messaging system as set forth in claim 1 wherein output from outputs 1 to m go to units 1 to m in the next subsequent stage and wherein the return path from a subsequent stage unit returns to the same crossbar from which it originated, and into the same row number 1 to m of the inputs to the switches in row 1 of the crossbar of the earlier stage unit.
9. A messaging system as set forth in claim 7 wherein output from outputs 1 to m go to units 1 to m in the next subsequent stage and wherein the return path from a subsequent stage unit returns to the same crossbar from which it originated, and into the same row number 1 to m of the inputs to the switches in row 1 of the crossbar of the earlier stage unit.
10. A messaging system as set forth in claim 6 wherein the route a message takes through the network may be called a message pathway, and said 2 x 2 switches are configured so that when they are set for a message pathway, they allow a return pathway through them in the opposite direction.
11. A messaging system as set forth in claim 7 wherein the route a message takes through the system may be called a message pathway, and said 2 2 switches are configured so that when they are set for a message pathway, they allow a return pathway through them in the opposite direction.
12. A messaging system as set forth in claim 3 wherein the return pathway utilizes the same switch settings that were established by any messages sent from originating nodes.
13. A messaging system as set forth in claim 11 wherein the return pathway is constructed via a duplicate set of electronic pathways set by the switches set by the switches at the same time the message pathway is set.
14. A messaging system as set forth in claim 10 wherein the 2 x 2 switches are light switches, which may be called light valves, and allow messaging over the same light pathway.
15. A messaging system as set forth in claim 11 wherein the 2 x 2 switches are light switches, which may be called light valves, and allow messaging over the same light pathway.
16. A messaging system as set forth in claim 6 wherein there are three stages.
17. A messaging system as set forth in claim 7 wherein there are three stages.
18. A messaging system as set forth in claim 6 wherein the inputs from each Data/Bus controller to each bus comprises a control communication path for setting a signal determining the busy/not busy status of the bus, and wherein the data communication path to said bus is for communicating the message therewith.
19. A messaging system as set forth in claim 18 wherein the control communication path is maintained in busy status for the duration of time ≥ the maximum delay required for the message header to return to the bus.
20. A messaging system as set forth in claim 1 wherein said 2 x 2 switches are broadcast switches.
21. A messaging system as set forth in claim 1 wherein any switch that is set by a message header from the default condition maintains said non-default setting for the duration of time > the maximum delay required for the message header to return to that switch.
PCT/US1992/006651 1991-08-05 1992-08-05 Scalable self-routing nonblocking message switching and routing network WO1993003580A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP92917766A EP0598030A1 (en) 1991-08-05 1992-08-05 Scalable self-routing nonblocking message switching and routing network
JP5503879A JPH06509917A (en) 1991-08-05 1992-08-05 Standardizable self-route assignment non-blocking message exchange and route assignment network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US74026391A 1991-08-05 1991-08-05
US740,263 1991-08-05

Publications (1)

Publication Number Publication Date
WO1993003580A1 true WO1993003580A1 (en) 1993-02-18

Family

ID=24975747

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1992/006651 WO1993003580A1 (en) 1991-08-05 1992-08-05 Scalable self-routing nonblocking message switching and routing network

Country Status (4)

Country Link
EP (1) EP0598030A1 (en)
JP (1) JPH06509917A (en)
CA (1) CA2113725A1 (en)
WO (1) WO1993003580A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104243354A (en) * 2014-10-09 2014-12-24 福建星网锐捷网络有限公司 CLOS network link distribution method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0097360A2 (en) * 1982-06-22 1984-01-04 CSELT Centro Studi e Laboratori Telecomunicazioni S.p.A. Modular self-routing PCM switching network with virtual-channel routing control for distributed-control telephone exchanges
EP0195589A2 (en) * 1985-03-18 1986-09-24 International Business Machines Corporation Switching system for transmission of data
EP0224244A2 (en) * 1985-11-29 1987-06-03 AT&T Corp. Switching system having multiple parallel switching networks
US4696000A (en) * 1985-12-12 1987-09-22 American Telephone And Telegraph Company, At&T Bell Laboratories Nonblocking self-routing packet and circuit switching network
US4725835A (en) * 1985-09-13 1988-02-16 T-Bar Incorporated Time multiplexed bus matrix switching system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0097360A2 (en) * 1982-06-22 1984-01-04 CSELT Centro Studi e Laboratori Telecomunicazioni S.p.A. Modular self-routing PCM switching network with virtual-channel routing control for distributed-control telephone exchanges
EP0195589A2 (en) * 1985-03-18 1986-09-24 International Business Machines Corporation Switching system for transmission of data
US4725835A (en) * 1985-09-13 1988-02-16 T-Bar Incorporated Time multiplexed bus matrix switching system
EP0224244A2 (en) * 1985-11-29 1987-06-03 AT&T Corp. Switching system having multiple parallel switching networks
US4696000A (en) * 1985-12-12 1987-09-22 American Telephone And Telegraph Company, At&T Bell Laboratories Nonblocking self-routing packet and circuit switching network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CONFERENCE RECORD, IEEE/IEICE GLOBAL TELECOMMUNICATIONS CONFERENCE, 15-18 NOV. 1987, VOL.3 PAGES 1856-1860, TOKYO JP D.M. DIAS ET AL 'Design and Analysis of a Multistage Voice-Data Switch' *
PROCEEDINGS OF THE IEEE vol. 78, no. 1, January 1990, NEW YORK US pages 133 - 167 F. TOBAGI 'Fast Packet Swich Architectures For Broadband Integrated Services Digital Networks' cited in the application *
TA Q., MEDITCH J. S.: "A HIGH SPEED INTEGRATED SERVICES SWITCH BASED ON 4 X 4 SWITCHING ELEMENTS.", MULTIPLE FACETS OF INTEGRATION. SAN FRANCISCO, JUNE 3 - 7, 1990., WASHINGTON, IEEE COMP. SOC. PRESS., US, vol. CONF. 9, 3 June 1990 (1990-06-03), US, pages 1164 - 1171., XP000164324, ISBN: 978-0-8186-2049-2, DOI: 10.1109/INFCOM.1990.91370 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104243354A (en) * 2014-10-09 2014-12-24 福建星网锐捷网络有限公司 CLOS network link distribution method and device

Also Published As

Publication number Publication date
EP0598030A1 (en) 1994-05-25
JPH06509917A (en) 1994-11-02
CA2113725A1 (en) 1993-02-18

Similar Documents

Publication Publication Date Title
US5319639A (en) Crossbar with return net for scalable self-routing non-blocking message switching and routing system
EP0256701B1 (en) Crosspoint circuitry for data packet space division switches
KR100211123B1 (en) Multi-stage interconnecting network for high speed packet switching
US5140583A (en) Message routing
US7684389B2 (en) Multi-dimensional lattice network
EP0405208B1 (en) Multistage network with distributed pipelined control
US5654695A (en) Multi-function network
EP0590877B1 (en) Multistage optical packet distribution network with bypass
US20030112831A1 (en) Mesh architecture for synchronous cross-connects
US6992984B1 (en) Credit-based adaptive flow control for multi-stage multi-dimensional switching architecture
US4891802A (en) Method of and circuit arrangement for controlling a switching network in a switching system
JPH06268684A (en) Packet switching arrangement
JPH0750688A (en) Packet switching arrangement
US5132965A (en) Nonblocking parallel banyan network
Hui Switching integrated broadband services by sort-banyan networks
JP2004511992A (en) Scalable apparatus and method for increasing throughput in a multiplex minimal logic network using multiple control lines
WO1993003580A1 (en) Scalable self-routing nonblocking message switching and routing network
KR0170493B1 (en) Non-blocking fault tolerance gamma network for multi-processor system
WO1993003581A1 (en) Message structure for scalable self-routing non-blocking message switching and routing system
CA2006392C (en) Modular expandable digital single-stage switching network in atm (asynchronous transfer mode) technology for a fast packet-switched transmission of information
WO1993006676A1 (en) Nonblocking point-to-point fast packet/circuit switching networks
JP3079068B2 (en) ATM switch
Chen et al. RC-BB switch: a high performance switching network for B-ISDN
JP2756604B2 (en) Self-routing switch network
JP3186686B2 (en) ATM switch

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2113725

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 1992917766

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1992917766

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1992917766

Country of ref document: EP