US6055555A - Interface for performing parallel arithmetic and round operations - Google Patents

Interface for performing parallel arithmetic and round operations Download PDF

Info

Publication number
US6055555A
US6055555A US08/999,243 US99924397A US6055555A US 6055555 A US6055555 A US 6055555A US 99924397 A US99924397 A US 99924397A US 6055555 A US6055555 A US 6055555A
Authority
US
United States
Prior art keywords
bit
true result
adder
circuit
format
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US08/999,243
Inventor
Brent Boswell
Karol Menezes
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Priority to US08/999,243 priority Critical patent/US6055555A/en
Assigned to INTEL CORPORATION reassignment INTEL CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BOSWELL, BRENT, MENEZES, KAROL
Application granted granted Critical
Publication of US6055555A publication Critical patent/US6055555A/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/483Computations with numbers represented by a non-linear combination of denominational numbers, e.g. rational numbers, logarithmic number system or floating-point numbers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/499Denomination or exception handling, e.g. rounding or overflow
    • G06F7/49942Significance control
    • G06F7/49947Rounding
    • G06F7/49957Implementation of IEEE-754 Standard
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/52Multiplying; Dividing
    • G06F7/523Multiplying only

Definitions

  • the present invention relates to logic circuits, and more particularly, to an interface circuit for performing parallel arithmetic and round operations.
  • Arithmetic operation is a multiplication operation.
  • One way to multiply two numbers is to use Booth encoding.
  • Two numbers can be multiplied by generating partial products based on the Booth encoding.
  • a Wallace Tree which is a tree-like network of carry-save-adders (CSAs), can be used to add the partial products.
  • the use of a CSA is much faster than adding the partial products linearly.
  • the output of the Wallace tree is a true result that includes a sum word and a carry word. To obtain a final sum, however, the sum word and carry word must be added by a conventional adder with carry propagation, such as a carry-propagate-adder (CPA).
  • CPA carry-propagate-adder
  • a variety of arithmetic operations can use a CSA followed a CPA to add two or more numbers together.
  • the result may then be rounded.
  • Rounding must be performed in arithmetic and store operations when the format of the destination cannot represent the precise true result. For example, a real number may be rounded if it is stored in a shorter real format, or in an integer format.
  • the interface circuit overcomes the disadvantages of the prior art by providing a first adder circuit that receives as an input a true result of an arithmetic operation in an intermediate format.
  • the first adder circuit outputs both the true result in a final format and a first representable number approximating the true result.
  • a second adder circuit is connected in parallel to the first adder circuit.
  • the second adder circuit receives the true result in the intermediate format and a 1 as inputs.
  • the second adder circuit outputs a second representable number approximating the true result.
  • the interface circuit also includes a selection circuit connected to the outputs of the first and second adder circuits. The selection circuit outputs either the first or second representable numbers as a rounded result of the arithmetic operation.
  • FIG. 1 illustrates a block diagram of an interface circuit according to an embodiment of the present invention.
  • FIG. 2A illustrates the operation of the rounding modes for positive numbers according to an embodiment of the present invention.
  • FIG. 2B illustrates the operation of the rounding modes for negative numbers according to an embodiment of the present invention.
  • FIG. 3 is a flow chart illustrating the operation of the interface circuit according to an embodiment of the present invention.
  • FIG. 4 illustrates an example according to an embodiment of the present invention.
  • FIG. 1 illustrates a block diagram of an interface circuit 10 according to an embodiment of the present invention.
  • a multiplier circuit 8 is connected to interface circuit 10.
  • Multiplier circuit 8 multiplies two numbers (source1 and source2) together to output an L-bit true result of the multiplication operation in an intermediate format.
  • a destination of the multiplication operation can only store an N-bit result, wherein L>N. Therefore, the L-bit true result of the multiplication operation is unrepresentable and must be rounded to N bits.
  • Interface circuit 10 completes the multiplication operation by calculating the true result in a final format and rounds the true result to N-bits in parallel. By completing the multiplication operation and rounding the multiplication result in parallel, processor speed can be improved.
  • multiplier circuit 8 can include conventional multiplication circuitry, such as a Booth encoder 14, a partial product creation block 12 and a Wallace Tree 18.
  • Two numbers, source1 and source2 are input into multiplier circuit 10 that will be multiplied together.
  • Source 1 is input to booth encoder 14, and source2 is input into partial product creation block 12.
  • Source1 and source 2 may be, for example, mantissas that will be multiplied together for a floating point multiplication operation.
  • Booth encoder 14 Booth encodes source1 and outputs Booth encoding signals to partial product creation block 12.
  • Partial product creation block 12 generates one or more partial products based on source2 and the Booth encoding signals received from Booth encoder 14. The partial products are output to Wallace Tree 18.
  • Wallace Tree 18 is a tree like network of carry-save-adders (CSAs) that are used to add the partial products.
  • CSAs carry-save-adders
  • the use of a network of CSAs, such as a Wallace Tree, is much faster than adding the partial products in a linear fashion.
  • carry propagation is not done except in the final step. Therefore, multiplier circuit 8 outputs a true result B of the multiplication operation in an intermediate (or non-final) format.
  • true result B is output from Wallace Tree 18 in a sum and carry format that includes a sum word 20 and a carry word 22.
  • Other types of intermediate formats can be used.
  • Both sum word 20 and carry word 22 are L-bits wide.
  • a final format e.g., two's complement format
  • sum word 20 and carry word 22 must be added together by a conventional adder with carry propagation, such as a carry-propagate-adder (CPA).
  • CPA carry-propagate-adder
  • Interface circuit 10 uses carry propagate adders to obtain a multiplication result that is in a final format while, in parallel, rounding the result to N bits.
  • Interface circuit 10 according to an embodiment of the present invention illustrated in FIG. 1 includes two adder circuits connected in parallel, a multiplexer (MUX) 44 connected to the outputs of the adder circuits, and a rounding control circuit 32.
  • MUX multiplexer
  • a first adder circuit comprises a CPA 28.
  • CPA 28 is connected to the output of multiplier circuit 10 and receives sum word 20 and carry word 22 as inputs.
  • CPA 28 adds L-bit sum word 20 and L-bit carry word 22 to output a L-bit true result that is in a final format.
  • CPA 28 outputs the L-bit true result in a two's complement format.
  • CPA 28 converts the L-bit output from multiplier circuit 8 provided in sum and carry format to a final format (e.g., two's complement format).
  • the N most significant bits of the L-bit true result output in final format from CPA 28 are output to MUX 44 as an N-bit representable number A.
  • the remaining (less significant) bits of the L-bit true result output from CPA 28 include the guard bit (G), the round bit (R) and the sticky bit (S).
  • the guard, round and sticky bits are output to a rounding control circuit 32 via line 30.
  • the least significant bit (LSB) of the N-bit representable number A is also output to rounding control circuit 32 via line 30.
  • a second adder circuit includes a 3:2 carry save adder (CSA) 24 and a carry propagate adder (CPA) 26.
  • CSA 24 is connected to the output of multiplier circuit 8 and receives sum word 20 and carry word 22 as inputs. CSA 24 also receives a plus 1 input as well. CSA 24 adds together sum word 20, carry word 22 and the plus 1. According to an embodiment of the present invention, CSA 24 outputs an L-bit output as a sum word 25 and a carry word 27. Other types of intermediate formats can be used.
  • CSA 24 adds the plus 1 input to sum word 20 and carry word 22 at a bit position corresponding to the least significant bit of the N most significant bits of the L-bit true result B.
  • CPA 26 is connected to the output of CSA 24 and receives sum word 25 and carry word 27 as inputs.
  • CPA 26 adds L-bit sum word 25 and L-bit carry word 27 to output an L-bit value that is in a final format.
  • CPA 26 outputs the L-bit value in a two's complement format.
  • other types of final formats can be used.
  • the N most significant bits of the L-bit value output in final format from CPA 26 are output to MUX 44 as an N-bit representable number C.
  • the remaining (less significant) bits of the L-bit value output from CPA 26 are discarded (not used). Therefore, CPA 26 converts the L-bit output from CSA 24 provided in sum and carry format to an N-bit representable number C in a final format (e.g., two's complement format), wherein L>N.
  • Rounding control circuit 32 receives as inputs the LSB, guard (G), round (R) and sticky (S) bits from CPA 28 over line 30, a rounding mode input 34, and a sign bit of the true result provided over line 35.
  • Rounding mode input 34 can designate one of several rounding modes.
  • the LSB, guard, round and sticky bits are provided for allowing rounding control circuit 32 to select the desired rounding result of the multiplication operation, either A or C, based on the selected rounding mode.
  • Rounding control circuit 32 outputs a select signal on line 36 for controlling MUX 44.
  • MUX 44 selects either the N-bit representable number A or the N-bit representable number C as the rounded result of the multiplication operation based on the select signal provided on line 36.
  • Rounding control is also performed based on the sign bit of the true result (TR) provided over line 35.
  • FIG. 2A illustrates the operation of the rounding modes for a positive true result according to an embodiment of the present invention.
  • FIG. 2B illustrates the operation of the rounding modes for a negative true result according to an embodiment of the present invention.
  • a true result B is output from multiplier circuit 8 and is L-bits wide. However, because the destination cannot represent the true result B, true result B must be rounded to N bits, wherein L>N.
  • the L-bit unrepresentable true result B can be rounded to N-bits by selecting either the N-bit representable number A or the N-bit representable number C as the rounded result of the multiplication operation. As illustrated in FIGS. 2A and 2B, the representable numbers A and C are successive representable numbers, wherein A is less then or equal to B which is less then or equal to C. In other words the representable numbers A and C bracket the unre
  • Interface circuit 10 selects either representable number A or C as the N-bit rounded result based upon rounding mode input 34 and the least significant bit (LSB), guard bit (G), round bit (R) and/or the sticky bit (S) provided on line 30, and the sign bit of the true result provided on line 35.
  • LSB least significant bit
  • G guard bit
  • R round bit
  • S sticky bit
  • FIGS. 2A and 2B when the round to nearest mode is selected, the representable number that is closest to the unrepresentable number B is selected. If both A and C are equally close to the number B, the representable number is selected that has a zero in the least significant bit.
  • the round down mode is selected, the unrepresentable number is rounded down toward minus infinity.
  • the round up mode the unrepresentable number is rounded up toward positive infinity.
  • the chop mode the representable number A or C is selected that is smaller in magnitude.
  • FIG. 3 is a flow chart illustrating the operation of interface circuit 10 according to an embodiment of the present invention.
  • the L-bit unrepresentable true result B of the multiplication operation is output from multiplier circuit 8.
  • the unrepresentable B is provided in an intermediate format.
  • the intermediate format is a sum and carry format, including a sum word 20 and a carry word 22.
  • interface circuit 10 of the present invention receives an unrepresentable true result B from a multiplier circuit 8
  • interface circuit 10 according to an embodiment of the present invention can receive an unrepresentable true result as an input from a wide variety of arithmetic circuits such as adder circuits, subtracting circuits and divider circuits as well.
  • Interface circuit 10 can be used to both complete an arithmetic operation and, in parallel, round the arithmetic result for a variety of arithmetic operations.
  • step 320 three numbers are calculated in parallel: 1) The true result B of the multiplication operation is calculated in a final format based on the intermediate format of the true result B. 2) An N-bit representable number A that approximates the unrepresentable true result B is calculated. And, 3) An N-bit representable number C is calculated that also approximates the unrepresentable true result B.
  • CPA 28 To calculate the L-bit true result B in a final format, CPA 28 adds sum word 20 and carry word 22 and outputs an L-bit true result B in a final format. According to an embodiment of the present invention CPA 28 outputs the true result B in a two's complement format. By adding sum word 20 and carry word 22, CPA 28 performs the last step of the multiplication operation.
  • the N most significant bits of the L-bit true result B in the final format output from CPA 28 comprise the N-bit representable number A. Therefore, it can be seen that by calculating the L-bit true result B in the final format, CPA 28 also simultaneously calculates the N-bit representable number A. Also, the least significant bit of the N-bit representable number A, and the guard bit, round bit, and sticky bit of the L-bit true result B in the final format are output over line 30 to rounding control circuit 32.
  • the N-bit representable number C is calculated by CSA 24 and CPA 26 based on sum word 20, carry word 22 and a 1 that are input to CSA 24.
  • Sum word 20 and carry word 22 are both L-bits wide.
  • the 1 is added to sum word 20 and carry word 22 at a bit position corresponding to the least significant bit of the N most significant bits of the L-bit true result B.
  • CSA 24 can comprise an array of carry save adders.
  • CSA 24 outputs an L-bit value in an intermediate format. According to an embodiment of the present invention, CSA 24 outputs the L-bit value in a sum and carry format that includes sum word 25 and carry word 27.
  • CPA 26 completes the multiplication operation (e.g., converts the L-bit output from CSA 24 from the intermediate format to a final format). According to an embodiment of the present invention, CPA 26 converts the L-bit output of CSA 24 from the sum and carry format to a two's complement format. The N most significant bits of the L-bit output from CPA 26 comprise the N-bit representable number C. The remaining bits of the L-bit output from the CPA 26 are disregarded.
  • the unrepresentable true result B is rounded by selecting either the first representable number A or the second representable number C as the rounded result of the multiplication operation.
  • Rounding control circuit 32 outputs either a low or high as a select signal on line 36 based upon rounding mode input 34 and one or more of the least significant bit, the guard bit, the round bit, and the sticky bit provided on line 30.
  • MUX 44 selects either representable number A or representable number C based upon the select signal on line 36.
  • FIG. 4 illustrates an example according to an embodiment of the present invention.
  • the unrepresentable true result B is a 13 bit positive number in two's complement that equals 22.65625 10 .
  • a line 60 separates the 9 most significant bits of the unrepresentable true result B from the less significant 4 bits that cannot be stored in the 9 bit destination.
  • the least significant bit (LSB) of the 9 most significant bits of the true result B, the guard bit (G), the round bit (R), the sticky bits and the sign bit are illustrated in FIG. 4. The bits after the round bit are ORed together to obtain the sticky bit.
  • the 9 bit representable number A comprises the 9 most significant bits of the unrepresentable true result B.
  • the 9-bit representable number A is illustrated in FIG. 4 and is equal to 22.625 10 .
  • the 9-bit representable number C is obtained by adding 1 to the least significant bit of the 9 most significant bits of the unrepresentable true result B. This operation is illustrated in FIG. 4.
  • the nine-bit representable number C is equal to 22.75 10 .
  • MUX 44 and rounding control circuit 32 operate to select one of the 9-bit representable numbers A or C to be the rounded result of the multiplication operation. Because the true result B is positive, the rounding rules of FIG. 2A are applied. If the rounding mode is round to nearest, then MUX 44 and rounding control circuit 32 operate to select the number A because it is closest to the unrepresentable true result B. If the mode is round down, the number A will be selected. If the mode is round up, the number C will be selected. If the rounding mode is chop, the number A will be selected.
  • the interface circuit 10 includes a first adder circuit and a second adder circuit connected in parallel, a MUX 44 and a rounding control circuit 32.
  • Interface circuit 10 operates to perform a last step in an arithmetic operation and a round operation in parallel.
  • a true result B of an arithmetic operation (such as an add, subtract, multiply or divide operation) in an intermediate format is output to interface circuit 10 from an arithmetic circuit (such as multiplier 8).
  • the true result B can be provided, for example, in a sum and carry format.
  • a sum word 20 and a carry word 22 of the true result B are input to the first and second adder circuits.
  • the first adder circuit includes a CPA 28.
  • CPA 28 calculates the true result B in a final format and an N-bit representable number A in parallel.
  • the N-bit representable number A comprises the N most significant bits of the L-bit true result B in final format.
  • the second adder circuit includes a CSA 24 and a CPA 26.
  • the second adder circuit receives sum word 20, carry word 22 and a 1 as inputs, and calculates an N-bit representable number C.
  • the 1 is added at a bit position corresponding to the LSB of the N most significant bits of the true result B.
  • representable numbers A and C are successive numbers that bracket the true result B.
  • Rounding control circuit 32 and MUX 44 select either number A or number B as the rounded result of the arithmetic operation.

Abstract

An interface circuit performs a last step of an arithmetic operation and a round operation in parallel. The interface circuit includes a first adder circuit that receives as an input a true result of an arithmetic operation in an intermediate format. The first adder circuit outputs both the true result in a final format and a first representable number approximating the true result. A second adder circuit is connected in parallel to the first adder circuit. The second adder circuit receives the true result in the intermediate format and a 1 as inputs. The second adder circuit outputs a second representable number approximating the true result. The interface circuit also includes a selection circuit connected to the outputs of the first and second adder circuits. The selection circuit outputs either the first or second representable numbers as a rounded result of the arithmetic operation.

Description

TECHNICAL FIELD
The present invention relates to logic circuits, and more particularly, to an interface circuit for performing parallel arithmetic and round operations.
BACKGROUND OF THE INVENTION
One type of arithmetic operation is a multiplication operation. One way to multiply two numbers is to use Booth encoding. Two numbers can be multiplied by generating partial products based on the Booth encoding. A Wallace Tree, which is a tree-like network of carry-save-adders (CSAs), can be used to add the partial products. The use of a CSA is much faster than adding the partial products linearly. The output of the Wallace tree is a true result that includes a sum word and a carry word. To obtain a final sum, however, the sum word and carry word must be added by a conventional adder with carry propagation, such as a carry-propagate-adder (CPA). A variety of arithmetic operations can use a CSA followed a CPA to add two or more numbers together.
Once the result of the arithmetic operation has been generated, the result may then be rounded. Rounding must be performed in arithmetic and store operations when the format of the destination cannot represent the precise true result. For example, a real number may be rounded if it is stored in a shorter real format, or in an integer format. There are four IEEE rounding modes: round to nearest, round down, round up and chop. Given a true result of the arithmetic operation provided as sum and carry words that cannot be represented in the format of the destination, prior art circuits first calculated the true result in a final format (such as two's complement format) and then rounded the true result to the desired number of bits. However, the prior art circuits employed separate steps to generate the true result in a final format and then to round the result to a desired number of bits. This process was slow. Therefore, there is a need for a faster technique to calculate a true result in a final format and round the result.
SUMMARY OF THE INVENTION
The interface circuit according to an embodiment of the present invention overcomes the disadvantages of the prior art by providing a first adder circuit that receives as an input a true result of an arithmetic operation in an intermediate format. The first adder circuit outputs both the true result in a final format and a first representable number approximating the true result. A second adder circuit is connected in parallel to the first adder circuit. The second adder circuit receives the true result in the intermediate format and a 1 as inputs. The second adder circuit outputs a second representable number approximating the true result. The interface circuit also includes a selection circuit connected to the outputs of the first and second adder circuits. The selection circuit outputs either the first or second representable numbers as a rounded result of the arithmetic operation.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 illustrates a block diagram of an interface circuit according to an embodiment of the present invention.
FIG. 2A illustrates the operation of the rounding modes for positive numbers according to an embodiment of the present invention.
FIG. 2B illustrates the operation of the rounding modes for negative numbers according to an embodiment of the present invention.
FIG. 3 is a flow chart illustrating the operation of the interface circuit according to an embodiment of the present invention.
FIG. 4 illustrates an example according to an embodiment of the present invention.
DETAILED DESCRIPTION
Referring to the figures in which like numerals indicate like elements, FIG. 1 illustrates a block diagram of an interface circuit 10 according to an embodiment of the present invention. A multiplier circuit 8 is connected to interface circuit 10. Multiplier circuit 8 multiplies two numbers (source1 and source2) together to output an L-bit true result of the multiplication operation in an intermediate format. A destination of the multiplication operation can only store an N-bit result, wherein L>N. Therefore, the L-bit true result of the multiplication operation is unrepresentable and must be rounded to N bits. Interface circuit 10 completes the multiplication operation by calculating the true result in a final format and rounds the true result to N-bits in parallel. By completing the multiplication operation and rounding the multiplication result in parallel, processor speed can be improved.
According to an embodiment of the present invention, multiplier circuit 8 can include conventional multiplication circuitry, such as a Booth encoder 14, a partial product creation block 12 and a Wallace Tree 18. Two numbers, source1 and source2 are input into multiplier circuit 10 that will be multiplied together. Source 1 is input to booth encoder 14, and source2 is input into partial product creation block 12. Source1 and source 2 may be, for example, mantissas that will be multiplied together for a floating point multiplication operation. Booth encoder 14 Booth encodes source1 and outputs Booth encoding signals to partial product creation block 12. Partial product creation block 12 generates one or more partial products based on source2 and the Booth encoding signals received from Booth encoder 14. The partial products are output to Wallace Tree 18.
Wallace Tree 18 is a tree like network of carry-save-adders (CSAs) that are used to add the partial products. The use of a network of CSAs, such as a Wallace Tree, is much faster than adding the partial products in a linear fashion. However, carry propagation is not done except in the final step. Therefore, multiplier circuit 8 outputs a true result B of the multiplication operation in an intermediate (or non-final) format. In the embodiment illustrated in FIG. 1, true result B is output from Wallace Tree 18 in a sum and carry format that includes a sum word 20 and a carry word 22. Other types of intermediate formats can be used.
Both sum word 20 and carry word 22 are L-bits wide. To obtain a multiplication result that is in a final format (e.g., two's complement format), sum word 20 and carry word 22 must be added together by a conventional adder with carry propagation, such as a carry-propagate-adder (CPA). Interface circuit 10 uses carry propagate adders to obtain a multiplication result that is in a final format while, in parallel, rounding the result to N bits.
Interface circuit 10 according to an embodiment of the present invention illustrated in FIG. 1 includes two adder circuits connected in parallel, a multiplexer (MUX) 44 connected to the outputs of the adder circuits, and a rounding control circuit 32.
A first adder circuit comprises a CPA 28. CPA 28 is connected to the output of multiplier circuit 10 and receives sum word 20 and carry word 22 as inputs. CPA 28 adds L-bit sum word 20 and L-bit carry word 22 to output a L-bit true result that is in a final format. According to an embodiment of the present invention, CPA 28 outputs the L-bit true result in a two's complement format. However, other types of final formats can be used. Therefore, CPA 28 converts the L-bit output from multiplier circuit 8 provided in sum and carry format to a final format (e.g., two's complement format).
The N most significant bits of the L-bit true result output in final format from CPA 28 are output to MUX 44 as an N-bit representable number A. The remaining (less significant) bits of the L-bit true result output from CPA 28 include the guard bit (G), the round bit (R) and the sticky bit (S). The guard, round and sticky bits are output to a rounding control circuit 32 via line 30. The least significant bit (LSB) of the N-bit representable number A is also output to rounding control circuit 32 via line 30.
A second adder circuit includes a 3:2 carry save adder (CSA) 24 and a carry propagate adder (CPA) 26. CSA 24 is connected to the output of multiplier circuit 8 and receives sum word 20 and carry word 22 as inputs. CSA 24 also receives a plus 1 input as well. CSA 24 adds together sum word 20, carry word 22 and the plus 1. According to an embodiment of the present invention, CSA 24 outputs an L-bit output as a sum word 25 and a carry word 27. Other types of intermediate formats can be used. CSA 24 adds the plus 1 input to sum word 20 and carry word 22 at a bit position corresponding to the least significant bit of the N most significant bits of the L-bit true result B.
CPA 26 is connected to the output of CSA 24 and receives sum word 25 and carry word 27 as inputs. CPA 26 adds L-bit sum word 25 and L-bit carry word 27 to output an L-bit value that is in a final format. According to an embodiment of the present invention, CPA 26 outputs the L-bit value in a two's complement format. However, other types of final formats can be used. The N most significant bits of the L-bit value output in final format from CPA 26 are output to MUX 44 as an N-bit representable number C. The remaining (less significant) bits of the L-bit value output from CPA 26 are discarded (not used). Therefore, CPA 26 converts the L-bit output from CSA 24 provided in sum and carry format to an N-bit representable number C in a final format (e.g., two's complement format), wherein L>N.
Rounding control circuit 32 receives as inputs the LSB, guard (G), round (R) and sticky (S) bits from CPA 28 over line 30, a rounding mode input 34, and a sign bit of the true result provided over line 35. Rounding mode input 34 can designate one of several rounding modes. The LSB, guard, round and sticky bits are provided for allowing rounding control circuit 32 to select the desired rounding result of the multiplication operation, either A or C, based on the selected rounding mode. Rounding control circuit 32 outputs a select signal on line 36 for controlling MUX 44. MUX 44 selects either the N-bit representable number A or the N-bit representable number C as the rounded result of the multiplication operation based on the select signal provided on line 36. Rounding control is also performed based on the sign bit of the true result (TR) provided over line 35.
FIG. 2A illustrates the operation of the rounding modes for a positive true result according to an embodiment of the present invention. FIG. 2B illustrates the operation of the rounding modes for a negative true result according to an embodiment of the present invention. A true result B is output from multiplier circuit 8 and is L-bits wide. However, because the destination cannot represent the true result B, true result B must be rounded to N bits, wherein L>N. There are four IEEE rounding modes according to the IEEE 754 Standard Floating Point Number Format: round to nearest, round down, round up, and chop. The L-bit unrepresentable true result B can be rounded to N-bits by selecting either the N-bit representable number A or the N-bit representable number C as the rounded result of the multiplication operation. As illustrated in FIGS. 2A and 2B, the representable numbers A and C are successive representable numbers, wherein A is less then or equal to B which is less then or equal to C. In other words the representable numbers A and C bracket the unrepresentable number B.
Interface circuit 10 selects either representable number A or C as the N-bit rounded result based upon rounding mode input 34 and the least significant bit (LSB), guard bit (G), round bit (R) and/or the sticky bit (S) provided on line 30, and the sign bit of the true result provided on line 35. As illustrated in FIGS. 2A and 2B, when the round to nearest mode is selected, the representable number that is closest to the unrepresentable number B is selected. If both A and C are equally close to the number B, the representable number is selected that has a zero in the least significant bit. When the round down mode is selected, the unrepresentable number is rounded down toward minus infinity. When the round up mode is selected, the unrepresentable number is rounded up toward positive infinity. When the chop mode is selected, the representable number A or C is selected that is smaller in magnitude.
FIG. 3 is a flow chart illustrating the operation of interface circuit 10 according to an embodiment of the present invention. At step 310, the L-bit unrepresentable true result B of the multiplication operation is output from multiplier circuit 8. The unrepresentable B is provided in an intermediate format. According to an embodiment of the present invention, the intermediate format is a sum and carry format, including a sum word 20 and a carry word 22. However other intermediate formats can be used. While interface circuit 10 of the present invention receives an unrepresentable true result B from a multiplier circuit 8, interface circuit 10 according to an embodiment of the present invention can receive an unrepresentable true result as an input from a wide variety of arithmetic circuits such as adder circuits, subtracting circuits and divider circuits as well. Interface circuit 10 can be used to both complete an arithmetic operation and, in parallel, round the arithmetic result for a variety of arithmetic operations.
At step 320, three numbers are calculated in parallel: 1) The true result B of the multiplication operation is calculated in a final format based on the intermediate format of the true result B. 2) An N-bit representable number A that approximates the unrepresentable true result B is calculated. And, 3) An N-bit representable number C is calculated that also approximates the unrepresentable true result B.
To calculate the L-bit true result B in a final format, CPA 28 adds sum word 20 and carry word 22 and outputs an L-bit true result B in a final format. According to an embodiment of the present invention CPA 28 outputs the true result B in a two's complement format. By adding sum word 20 and carry word 22, CPA 28 performs the last step of the multiplication operation. The N most significant bits of the L-bit true result B in the final format output from CPA 28 comprise the N-bit representable number A. Therefore, it can be seen that by calculating the L-bit true result B in the final format, CPA 28 also simultaneously calculates the N-bit representable number A. Also, the least significant bit of the N-bit representable number A, and the guard bit, round bit, and sticky bit of the L-bit true result B in the final format are output over line 30 to rounding control circuit 32.
The N-bit representable number C is calculated by CSA 24 and CPA 26 based on sum word 20, carry word 22 and a 1 that are input to CSA 24. Sum word 20 and carry word 22 are both L-bits wide. The 1 is added to sum word 20 and carry word 22 at a bit position corresponding to the least significant bit of the N most significant bits of the L-bit true result B. Because sum word 20 and carry word 22 are L-bits wide, CSA 24 can comprise an array of carry save adders. CSA 24 outputs an L-bit value in an intermediate format. According to an embodiment of the present invention, CSA 24 outputs the L-bit value in a sum and carry format that includes sum word 25 and carry word 27. CPA 26 completes the multiplication operation (e.g., converts the L-bit output from CSA 24 from the intermediate format to a final format). According to an embodiment of the present invention, CPA 26 converts the L-bit output of CSA 24 from the sum and carry format to a two's complement format. The N most significant bits of the L-bit output from CPA 26 comprise the N-bit representable number C. The remaining bits of the L-bit output from the CPA 26 are disregarded.
At step 330, the unrepresentable true result B is rounded by selecting either the first representable number A or the second representable number C as the rounded result of the multiplication operation. Rounding control circuit 32 outputs either a low or high as a select signal on line 36 based upon rounding mode input 34 and one or more of the least significant bit, the guard bit, the round bit, and the sticky bit provided on line 30. MUX 44 selects either representable number A or representable number C based upon the select signal on line 36.
FIG. 4 illustrates an example according to an embodiment of the present invention. According to the example in FIG. 4, the unrepresentable true result B is a 13 bit positive number in two's complement that equals 22.6562510. In this example the 13 bit floating point number must be rounded to 9 bits to fit a destination register. Therefore, in this example, L=13, and N=9. A line 60 separates the 9 most significant bits of the unrepresentable true result B from the less significant 4 bits that cannot be stored in the 9 bit destination. The least significant bit (LSB) of the 9 most significant bits of the true result B, the guard bit (G), the round bit (R), the sticky bits and the sign bit are illustrated in FIG. 4. The bits after the round bit are ORed together to obtain the sticky bit. Therefore, it can be seen that the 9 bit representable number A comprises the 9 most significant bits of the unrepresentable true result B. The 9-bit representable number A is illustrated in FIG. 4 and is equal to 22.62510. The 9-bit representable number C is obtained by adding 1 to the least significant bit of the 9 most significant bits of the unrepresentable true result B. This operation is illustrated in FIG. 4. The nine-bit representable number C is equal to 22.7510.
MUX 44 and rounding control circuit 32 operate to select one of the 9-bit representable numbers A or C to be the rounded result of the multiplication operation. Because the true result B is positive, the rounding rules of FIG. 2A are applied. If the rounding mode is round to nearest, then MUX 44 and rounding control circuit 32 operate to select the number A because it is closest to the unrepresentable true result B. If the mode is round down, the number A will be selected. If the mode is round up, the number C will be selected. If the rounding mode is chop, the number A will be selected.
The interface circuit 10 according to an embodiment of the present invention includes a first adder circuit and a second adder circuit connected in parallel, a MUX 44 and a rounding control circuit 32. Interface circuit 10 operates to perform a last step in an arithmetic operation and a round operation in parallel. A true result B of an arithmetic operation (such as an add, subtract, multiply or divide operation) in an intermediate format is output to interface circuit 10 from an arithmetic circuit (such as multiplier 8). The true result B can be provided, for example, in a sum and carry format. A sum word 20 and a carry word 22 of the true result B are input to the first and second adder circuits.
According to an embodiment of the present invention, the first adder circuit includes a CPA 28. CPA 28 calculates the true result B in a final format and an N-bit representable number A in parallel. The N-bit representable number A comprises the N most significant bits of the L-bit true result B in final format.
According to an embodiment of the present invention, the second adder circuit includes a CSA 24 and a CPA 26. The second adder circuit receives sum word 20, carry word 22 and a 1 as inputs, and calculates an N-bit representable number C. The 1is added at a bit position corresponding to the LSB of the N most significant bits of the true result B. According to one embodiment, representable numbers A and C are successive numbers that bracket the true result B. Rounding control circuit 32 and MUX 44 select either number A or number B as the rounded result of the arithmetic operation.
Several embodiments of the present invention are specifically illustrated and/or described herein. However, it will be appreciated that modifications and variations of the present invention are covered by the above teachings and within the purview of the appended claims without departing from the spirit and intended scope of the invention.

Claims (10)

What is claimed is:
1. An interface circuit for performing parallel arithmetic and round operations comprising:
a first adder circuit receiving a true result of an arithmetic operation in an intermediate format as an input, and outputting both the true result in a final format and a first representable number approximating the true result;
a second adder circuit, connected in parallel to the first adder circuit, the second adder circuit receiving the true result in the intermediate format and a 1 as inputs, and outputting a second representable number approximating the true result said second adder circuit comprising:
a carry save adder receiving as inputs the L-bit true result in the intermediate format and a 1, the carry save adder outputting an L-bit output in the intermediate format; and
a carry propagate adder connected to the output of the carry save adder, said carry propagate adder converting the L-bit output from the carry save adder in the intermediate format to an L-bit output in the final format, the N most significant bits of the L-bit output of the carry propagate adder comprising the second N-bit representable number; and
a selection circuit connected to the outputs of the first and second adder circuits, the selection circuit outputting either the first or second representable number as a rounded result of the arithmetic operation.
2. The interface circuit of claim 1 wherein said first adder circuit comprises a first adder circuit receiving an L-bit true result of an arithmetic operation in an intermediate format as an input, and outputting both the L-bit true result in a final format and a first N-bit representable number approximating the true result, wherein L is greater than or equal to N; and
wherein the second adder circuit comprises a second adder circuit, connected in parallel to the first adder circuit, the second adder circuit receiving the L-bit true result in the intermediate format and a 1 as inputs, and outputting a second N-bit representable number approximating the true result, the 1 being added to the intermediate format true result at a bit position corresponding to the LSB of the N most significant bits of the intermediate format true result.
3. The interface circuit of claim 2 wherein the first N-bit representable number output from the first adder circuit comprises the N most significant bits of the true result in the final format output from the first adder circuit.
4. The interface circuit of claim 2 wherein said first adder circuit comprises a first adder circuit receiving an L-bit true result of an arithmetic operation in a sum and carry format as an input, and outputting both the L-bit true result in a two's complement format and a first N-bit representable number approximating the true result, wherein L is greater than or equal to N, the first N-bit representable number comprising the N most significant bits of the L-bit true result.
5. The interface circuit of claim 2 wherein the selection circuit comprises:
a rounding control circuit connected to the output of the first adder circuit and outputting a select signal based on one or more bits of the L-bit true result in the final format;
a multiplexer receiving the first and second N-bit representable numbers as inputs and outputting one of the N-bit representable numbers as a rounded result of the arithmetic operation based on the select signal.
6. The interface circuit of claim 5 wherein said rounding control circuit further includes a rounding mode input designating a rounding mode, wherein said rounding control circuit outputs a select signal based on one or more of the guard, round and sticky bits of the L-bit true result in the final format and the rounding mode input.
7. The interface circuit of claim 2 wherein said rounding control circuit comprises a rounding control circuit connected to the first adder circuit and receiving as inputs at least the guard, round and sticky bits of the L-bit true result output in the final format, the rounding control circuit outputting a select signal based on one or more of the guard, round and sticky bits of the L-bit true result in the final format.
8. The interface circuit of claim 2 wherein the first and second representable numbers are successive N-bit numbers bracketing the true result.
9. The interface circuit of claim 1 wherein said first adder circuit comprises a carry propagate adder.
10. The interface circuit of claim 9 wherein said true result in the intermediate format comprises a true result in a sum and carry format.
US08/999,243 1997-12-29 1997-12-29 Interface for performing parallel arithmetic and round operations Expired - Lifetime US6055555A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US08/999,243 US6055555A (en) 1997-12-29 1997-12-29 Interface for performing parallel arithmetic and round operations

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/999,243 US6055555A (en) 1997-12-29 1997-12-29 Interface for performing parallel arithmetic and round operations

Publications (1)

Publication Number Publication Date
US6055555A true US6055555A (en) 2000-04-25

Family

ID=25546075

Family Applications (1)

Application Number Title Priority Date Filing Date
US08/999,243 Expired - Lifetime US6055555A (en) 1997-12-29 1997-12-29 Interface for performing parallel arithmetic and round operations

Country Status (1)

Country Link
US (1) US6055555A (en)

Cited By (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6366943B1 (en) * 1999-03-31 2002-04-02 Brian Martin Clinton Adder circuit with the ability to detect zero when rounding
US6397238B2 (en) * 1998-08-14 2002-05-28 Advanced Micro Devices, Inc. Method and apparatus for rounding in a multiplier
US20020174156A1 (en) * 2001-03-29 2002-11-21 Koninklijke Philips Electronics N.V. Round-off algorithm without bias for 2's complemetnt data
US20030158880A1 (en) * 2002-02-13 2003-08-21 Ng Kenneth Y. Booth encoder and partial products circuit
US20050125478A1 (en) * 2003-12-05 2005-06-09 Intel Corporation Smaller and lower power static mux circuitry in generating multiplier partial product signals
US20070185952A1 (en) * 2006-02-09 2007-08-09 Altera Corporation Specialized processing block for programmable logic device
US20070185951A1 (en) * 2006-02-09 2007-08-09 Altera Corporation Specialized processing block for programmable logic device
US20080028014A1 (en) * 2006-07-26 2008-01-31 Hilt Jason W N-BIT 2's COMPLEMENT SYMMETRIC ROUNDING METHOD AND LOGIC FOR IMPLEMENTING THE SAME
US20080077779A1 (en) * 2006-09-22 2008-03-27 Ronen Zohar Performing rounding operations responsive to an instruction
US7814137B1 (en) 2007-01-09 2010-10-12 Altera Corporation Combined interpolation and decimation filter for programmable logic device
US7822799B1 (en) * 2006-06-26 2010-10-26 Altera Corporation Adder-rounder circuitry for specialized processing block in programmable logic device
US7836117B1 (en) 2006-04-07 2010-11-16 Altera Corporation Specialized processing block for programmable logic device
US7865541B1 (en) 2007-01-22 2011-01-04 Altera Corporation Configuring floating point operations in a programmable logic device
US7930336B2 (en) 2006-12-05 2011-04-19 Altera Corporation Large multiplier for programmable logic device
US7948267B1 (en) * 2010-02-09 2011-05-24 Altera Corporation Efficient rounding circuits and methods in configurable integrated circuit devices
US7949699B1 (en) 2007-08-30 2011-05-24 Altera Corporation Implementation of decimation filter in integrated circuit device using ram-based data storage
US20110219052A1 (en) * 2010-03-02 2011-09-08 Altera Corporation Discrete fourier transform in an integrated circuit device
US20110238720A1 (en) * 2010-03-25 2011-09-29 Altera Corporation Solving linear matrices in an integrated circuit device
US8041759B1 (en) 2006-02-09 2011-10-18 Altera Corporation Specialized processing block for programmable logic device
US8301681B1 (en) 2006-02-09 2012-10-30 Altera Corporation Specialized processing block for programmable logic device
US8307023B1 (en) 2008-10-10 2012-11-06 Altera Corporation DSP block for implementing large multiplier on a programmable integrated circuit device
US8386550B1 (en) 2006-09-20 2013-02-26 Altera Corporation Method for configuring a finite impulse response filter in a programmable logic device
US8396914B1 (en) 2009-09-11 2013-03-12 Altera Corporation Matrix decomposition in an integrated circuit device
US8412756B1 (en) 2009-09-11 2013-04-02 Altera Corporation Multi-operand floating point operations in a programmable integrated circuit device
US8468192B1 (en) 2009-03-03 2013-06-18 Altera Corporation Implementing multipliers in a programmable integrated circuit device
US8484265B1 (en) 2010-03-04 2013-07-09 Altera Corporation Angular range reduction in an integrated circuit device
US8510354B1 (en) 2010-03-12 2013-08-13 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8539016B1 (en) 2010-02-09 2013-09-17 Altera Corporation QR decomposition in an integrated circuit device
US8543634B1 (en) 2012-03-30 2013-09-24 Altera Corporation Specialized processing block for programmable integrated circuit device
US8577951B1 (en) 2010-08-19 2013-11-05 Altera Corporation Matrix operations in an integrated circuit device
US8589463B2 (en) 2010-06-25 2013-11-19 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8620980B1 (en) 2005-09-27 2013-12-31 Altera Corporation Programmable device with specialized multiplier blocks
US8645451B2 (en) 2011-03-10 2014-02-04 Altera Corporation Double-clocked specialized processing block in an integrated circuit device
US8645449B1 (en) 2009-03-03 2014-02-04 Altera Corporation Combined floating point adder and subtractor
US8645450B1 (en) 2007-03-02 2014-02-04 Altera Corporation Multiplier-accumulator circuitry and methods
US8650236B1 (en) 2009-08-04 2014-02-11 Altera Corporation High-rate interpolation or decimation filter in integrated circuit device
US8650231B1 (en) 2007-01-22 2014-02-11 Altera Corporation Configuring floating point operations in a programmable device
US8706790B1 (en) 2009-03-03 2014-04-22 Altera Corporation Implementing mixed-precision floating-point operations in a programmable integrated circuit device
US8762443B1 (en) 2011-11-15 2014-06-24 Altera Corporation Matrix operations in an integrated circuit device
US8812576B1 (en) 2011-09-12 2014-08-19 Altera Corporation QR decomposition in an integrated circuit device
US8862650B2 (en) 2010-06-25 2014-10-14 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8949298B1 (en) 2011-09-16 2015-02-03 Altera Corporation Computing floating-point polynomials in an integrated circuit device
US8959137B1 (en) 2008-02-20 2015-02-17 Altera Corporation Implementing large multipliers in a programmable integrated circuit device
US8996600B1 (en) 2012-08-03 2015-03-31 Altera Corporation Specialized processing block for implementing floating-point multiplier with subnormal operation support
US9053045B1 (en) 2011-09-16 2015-06-09 Altera Corporation Computing floating-point polynomials in an integrated circuit device
US9098332B1 (en) 2012-06-01 2015-08-04 Altera Corporation Specialized processing block with fixed- and floating-point structures
US9189200B1 (en) 2013-03-14 2015-11-17 Altera Corporation Multiple-precision processing block in a programmable integrated circuit device
US9207909B1 (en) 2012-11-26 2015-12-08 Altera Corporation Polynomial calculations optimized for programmable integrated circuit device structures
US9348795B1 (en) 2013-07-03 2016-05-24 Altera Corporation Programmable device using fixed and configurable logic to implement floating-point rounding
US9600278B1 (en) 2011-05-09 2017-03-21 Altera Corporation Programmable device using fixed and configurable logic to implement recursive trees
US9684488B2 (en) 2015-03-26 2017-06-20 Altera Corporation Combined adder and pre-adder for high-radix multiplier circuit
US10942706B2 (en) 2017-05-05 2021-03-09 Intel Corporation Implementation of floating-point trigonometric functions in an integrated circuit device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5122981A (en) * 1988-03-23 1992-06-16 Matsushita Electric Industrial Co., Ltd. Floating point processor with high speed rounding circuit
US5390134A (en) * 1993-01-29 1995-02-14 Hewlett-Packard Company System and method for reducing latency in a floating point processor
US5638313A (en) * 1995-01-30 1997-06-10 Cirrus Logic, Inc. Booth multiplier with high speed output circuitry
US5671171A (en) * 1995-07-05 1997-09-23 Sun Microsystems, Inc. Shared rounding hardware for multiplier and divider/square root unit using conditional sum adder
US5943250A (en) * 1996-10-21 1999-08-24 Samsung Electronics Co., Ltd. Parallel multiplier that supports multiple numbers with different bit lengths

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5122981A (en) * 1988-03-23 1992-06-16 Matsushita Electric Industrial Co., Ltd. Floating point processor with high speed rounding circuit
US5390134A (en) * 1993-01-29 1995-02-14 Hewlett-Packard Company System and method for reducing latency in a floating point processor
US5638313A (en) * 1995-01-30 1997-06-10 Cirrus Logic, Inc. Booth multiplier with high speed output circuitry
US5671171A (en) * 1995-07-05 1997-09-23 Sun Microsystems, Inc. Shared rounding hardware for multiplier and divider/square root unit using conditional sum adder
US5943250A (en) * 1996-10-21 1999-08-24 Samsung Electronics Co., Ltd. Parallel multiplier that supports multiple numbers with different bit lengths

Cited By (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6397238B2 (en) * 1998-08-14 2002-05-28 Advanced Micro Devices, Inc. Method and apparatus for rounding in a multiplier
US6366943B1 (en) * 1999-03-31 2002-04-02 Brian Martin Clinton Adder circuit with the ability to detect zero when rounding
US20020174156A1 (en) * 2001-03-29 2002-11-21 Koninklijke Philips Electronics N.V. Round-off algorithm without bias for 2's complemetnt data
US6898614B2 (en) * 2001-03-29 2005-05-24 Koninklijke Philips Electronics N.V. Round-off algorithm without bias for 2's complement data
US20030158880A1 (en) * 2002-02-13 2003-08-21 Ng Kenneth Y. Booth encoder and partial products circuit
US7308470B2 (en) 2003-12-05 2007-12-11 Intel Corporation Smaller and lower power static mux circuitry in generating multiplier partial product signals
US20050125478A1 (en) * 2003-12-05 2005-06-09 Intel Corporation Smaller and lower power static mux circuitry in generating multiplier partial product signals
US8620980B1 (en) 2005-09-27 2013-12-31 Altera Corporation Programmable device with specialized multiplier blocks
US20070185952A1 (en) * 2006-02-09 2007-08-09 Altera Corporation Specialized processing block for programmable logic device
US20070185951A1 (en) * 2006-02-09 2007-08-09 Altera Corporation Specialized processing block for programmable logic device
US8041759B1 (en) 2006-02-09 2011-10-18 Altera Corporation Specialized processing block for programmable logic device
US8301681B1 (en) 2006-02-09 2012-10-30 Altera Corporation Specialized processing block for programmable logic device
US8266198B2 (en) 2006-02-09 2012-09-11 Altera Corporation Specialized processing block for programmable logic device
US8266199B2 (en) 2006-02-09 2012-09-11 Altera Corporation Specialized processing block for programmable logic device
US7836117B1 (en) 2006-04-07 2010-11-16 Altera Corporation Specialized processing block for programmable logic device
US7822799B1 (en) * 2006-06-26 2010-10-26 Altera Corporation Adder-rounder circuitry for specialized processing block in programmable logic device
US20080028014A1 (en) * 2006-07-26 2008-01-31 Hilt Jason W N-BIT 2's COMPLEMENT SYMMETRIC ROUNDING METHOD AND LOGIC FOR IMPLEMENTING THE SAME
US8386550B1 (en) 2006-09-20 2013-02-26 Altera Corporation Method for configuring a finite impulse response filter in a programmable logic device
US10114640B2 (en) 2006-09-22 2018-10-30 Intel Corporation Performing rounding operations responsive to an instruction
US10114641B2 (en) 2006-09-22 2018-10-30 Intel Corporation Performing rounding operations responsive to an instruction
US9286267B2 (en) 2006-09-22 2016-03-15 Intel Corporation Performing rounding operations responsive to an instruction
US10067761B2 (en) 2006-09-22 2018-09-04 Intel Corporation Performing rounding operations responsive to an instruction
US10108416B2 (en) 2006-09-22 2018-10-23 Intel Corporation Performing rounding operations responsive to an instruction
US9223751B2 (en) 2006-09-22 2015-12-29 Intel Corporation Performing rounding operations responsive to an instruction
US20080077779A1 (en) * 2006-09-22 2008-03-27 Ronen Zohar Performing rounding operations responsive to an instruction
US20110161389A1 (en) * 2006-12-05 2011-06-30 Altera Corporation Large multiplier for programmable logic device
US9395953B2 (en) 2006-12-05 2016-07-19 Altera Corporation Large multiplier for programmable logic device
US7930336B2 (en) 2006-12-05 2011-04-19 Altera Corporation Large multiplier for programmable logic device
US8788562B2 (en) 2006-12-05 2014-07-22 Altera Corporation Large multiplier for programmable logic device
US7814137B1 (en) 2007-01-09 2010-10-12 Altera Corporation Combined interpolation and decimation filter for programmable logic device
US8650231B1 (en) 2007-01-22 2014-02-11 Altera Corporation Configuring floating point operations in a programmable device
US7865541B1 (en) 2007-01-22 2011-01-04 Altera Corporation Configuring floating point operations in a programmable logic device
US8645450B1 (en) 2007-03-02 2014-02-04 Altera Corporation Multiplier-accumulator circuitry and methods
US7949699B1 (en) 2007-08-30 2011-05-24 Altera Corporation Implementation of decimation filter in integrated circuit device using ram-based data storage
US8959137B1 (en) 2008-02-20 2015-02-17 Altera Corporation Implementing large multipliers in a programmable integrated circuit device
US8307023B1 (en) 2008-10-10 2012-11-06 Altera Corporation DSP block for implementing large multiplier on a programmable integrated circuit device
US8706790B1 (en) 2009-03-03 2014-04-22 Altera Corporation Implementing mixed-precision floating-point operations in a programmable integrated circuit device
US8645449B1 (en) 2009-03-03 2014-02-04 Altera Corporation Combined floating point adder and subtractor
US8468192B1 (en) 2009-03-03 2013-06-18 Altera Corporation Implementing multipliers in a programmable integrated circuit device
US8650236B1 (en) 2009-08-04 2014-02-11 Altera Corporation High-rate interpolation or decimation filter in integrated circuit device
US8412756B1 (en) 2009-09-11 2013-04-02 Altera Corporation Multi-operand floating point operations in a programmable integrated circuit device
US8396914B1 (en) 2009-09-11 2013-03-12 Altera Corporation Matrix decomposition in an integrated circuit device
US7948267B1 (en) * 2010-02-09 2011-05-24 Altera Corporation Efficient rounding circuits and methods in configurable integrated circuit devices
US8539016B1 (en) 2010-02-09 2013-09-17 Altera Corporation QR decomposition in an integrated circuit device
US20110219052A1 (en) * 2010-03-02 2011-09-08 Altera Corporation Discrete fourier transform in an integrated circuit device
US8601044B2 (en) 2010-03-02 2013-12-03 Altera Corporation Discrete Fourier Transform in an integrated circuit device
US8484265B1 (en) 2010-03-04 2013-07-09 Altera Corporation Angular range reduction in an integrated circuit device
US8510354B1 (en) 2010-03-12 2013-08-13 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8539014B2 (en) 2010-03-25 2013-09-17 Altera Corporation Solving linear matrices in an integrated circuit device
US20110238720A1 (en) * 2010-03-25 2011-09-29 Altera Corporation Solving linear matrices in an integrated circuit device
US8589463B2 (en) 2010-06-25 2013-11-19 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8812573B2 (en) 2010-06-25 2014-08-19 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8862650B2 (en) 2010-06-25 2014-10-14 Altera Corporation Calculation of trigonometric functions in an integrated circuit device
US8577951B1 (en) 2010-08-19 2013-11-05 Altera Corporation Matrix operations in an integrated circuit device
US8645451B2 (en) 2011-03-10 2014-02-04 Altera Corporation Double-clocked specialized processing block in an integrated circuit device
US9600278B1 (en) 2011-05-09 2017-03-21 Altera Corporation Programmable device using fixed and configurable logic to implement recursive trees
US8812576B1 (en) 2011-09-12 2014-08-19 Altera Corporation QR decomposition in an integrated circuit device
US9053045B1 (en) 2011-09-16 2015-06-09 Altera Corporation Computing floating-point polynomials in an integrated circuit device
US8949298B1 (en) 2011-09-16 2015-02-03 Altera Corporation Computing floating-point polynomials in an integrated circuit device
US8762443B1 (en) 2011-11-15 2014-06-24 Altera Corporation Matrix operations in an integrated circuit device
US8543634B1 (en) 2012-03-30 2013-09-24 Altera Corporation Specialized processing block for programmable integrated circuit device
US9098332B1 (en) 2012-06-01 2015-08-04 Altera Corporation Specialized processing block with fixed- and floating-point structures
US8996600B1 (en) 2012-08-03 2015-03-31 Altera Corporation Specialized processing block for implementing floating-point multiplier with subnormal operation support
US9207909B1 (en) 2012-11-26 2015-12-08 Altera Corporation Polynomial calculations optimized for programmable integrated circuit device structures
US9189200B1 (en) 2013-03-14 2015-11-17 Altera Corporation Multiple-precision processing block in a programmable integrated circuit device
US9348795B1 (en) 2013-07-03 2016-05-24 Altera Corporation Programmable device using fixed and configurable logic to implement floating-point rounding
US9684488B2 (en) 2015-03-26 2017-06-20 Altera Corporation Combined adder and pre-adder for high-radix multiplier circuit
US10942706B2 (en) 2017-05-05 2021-03-09 Intel Corporation Implementation of floating-point trigonometric functions in an integrated circuit device

Similar Documents

Publication Publication Date Title
US6055555A (en) Interface for performing parallel arithmetic and round operations
US7395304B2 (en) Method and apparatus for performing single-cycle addition or subtraction and comparison in redundant form arithmetic
US5553012A (en) Exponentiation circuit utilizing shift means and method of using same
CA2530015C (en) Division and square root arithmetic unit
US5993051A (en) Combined leading one and leading zero anticipator
JPH02112023A (en) Cardinal number 16 divider
GB2223111A (en) Sticky bit predictor for floating-point multiplication
JP3345894B2 (en) Floating point multiplier
JPH03171324A (en) Circuit and method for calculating square root of operand
JPH05134851A (en) Multiplying circuit output system
KR20020063058A (en) apparatus and method for design of the floating point ALU performing addition and round operations in parallel
US5862068A (en) Arithmetic circuit for calculating a square-root of a sum of squares
JP2001222410A (en) Divider
US4728927A (en) Apparatus and method for performing comparison of two signals
JPH04205026A (en) Divider circuit
Ulman et al. Highly parallel, fast scaling of numbers in nonredundant residue arithmetic
KR100290906B1 (en) method and appratus for performing simultaneously addition and rounding in a floating point multiplier
US4737925A (en) Method and apparatus for minimizing a memory table for use with nonlinear monotonic arithmetic functions
JP2000330764A (en) Adder and integrated circuit
US6317772B1 (en) Split remainder divider
JPH086766A (en) Sine and cosine arithmetic device
KR20010067226A (en) Interpolation method and apparatus
US6516333B1 (en) Sticky bit value predicting circuit
JP3077880B2 (en) Sticky bit detection circuit
JP2699658B2 (en) Square root arithmetic unit

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTEL CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BOSWELL, BRENT;MENEZES, KAROL;REEL/FRAME:009113/0120

Effective date: 19980316

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12