## Unit-2

## DataPath Design

## Introduction

- An instruction set processor consist of two important units:
- Data Processing Unit (DataPath)
- Program Control Unit


## Addition

## Fixed Point Arithmetic

- Add \& subtract instructions for fixed binary numbers are found in the instruction set of every computer.
- To implement the add ,microoperation with hardware we need registers that hold the data and the digital component that performs the arithmetic addition.
- The digital circuit that forms the arithmetic sum of 2 bits and a previous carry is called a full adder.

The digital circuit that generates the arithmetic sum of two binary numbers of any length is called a binary adder.

- The binary adder is constructed with full adder circuits connected in cascade.
- For Eg: The below diag shows the interconnections of 4 full adders to provide a 4 bit binary adder.


## Binary Adder



## How We Can Make 1 Bit Full Adder

- By using only AND \& OR gates
- By using XOR, AND \& OR gates

By Using AND \& OR Gates


1
Car

## Output From Each Gate

- First AND $=\mathrm{Ci}-1, \mathrm{Yi}, \mathrm{Xi}$
- Second AND $=X \mathrm{Xi}^{\prime}, \mathrm{Ci}^{\prime}-1, \mathrm{Yi}$
- Third AND $=Y \mathrm{Yi}^{\prime}, \mathrm{Xi}{ }^{\prime}, \mathrm{Ci}-1$
- Fourth AND $=C \mathrm{Ci}^{\prime}-1, \mathrm{Xi}, \mathrm{Yi}^{\prime}$
- Fifth AND $=\mathrm{Xi}, \mathrm{Ci}-1$
- Sixth AND = Yi, Xi
- Seventh AND $=\mathrm{Yi}, \mathrm{Ci}-1$


## By Using XOR, AND \& OR Gate



## A Binary Adder Can Also Be Shown

 As:

## Binary Incrementer

The increment micro operation adds 1 to a number in a register.

- This micro operation is easily implemented with a binary counter \& this can be accomplished by means of Half Adders.
- The output carry of one half adder is connected to one of the inputs of the next higher order half adder.

Binary Incrementer


## Binary Adder Subtractor

- The addition $\&$ subtraction operations can be combined into one common circuits by including an XOR gate with each full adder. This combinational circuit is known as Binary Adder Subtractor.
- Here, a mode bit is used to find addition and subtraction is done by the circuit. If the value of mode bit is 0 it means addition is performed and if it's value is 1 it means subtraction is performed.


## Binary Adder Subtractor


$Z=Y \pm X$ can be diagramatically represented as:


## Overflow

- Overflow is indicated by a flag bit $v$ in operations involving signed numbers, this flag is found in CPU status register.
- There V is indicated as:


## High Speed adder



- $g i=x i y i$
- $p i=x i+y i$
$\mathrm{ci}=x i y i+x i c i-1+y i c i-1$
$\mathrm{ci}=\mathrm{gi}+\mathrm{pi}(\mathrm{ci}-1)$
$\mathrm{co}=\mathrm{g} 0+\mathrm{p} 0 \mathrm{cin}$
$\mathrm{c} 1=\mathrm{g} 1+\mathrm{plg} 0+\mathrm{p} 1 \mathrm{p} 0 \mathrm{cin}$
$\mathrm{c} 2=\mathrm{g} 2+\mathrm{p} 2 \mathrm{~g} 1+\mathrm{p} 2 \mathrm{p} 1 \mathrm{go}+\mathrm{p} 2 \mathrm{p} 1 \mathrm{p} 0 \mathrm{cin}$
$c 3=g 3+p 3 g 2+p 3 p 2 g 1+p 3 p 2 p 1 g 0+p 3 p 2 p 1 p 0 c i n$


## Adder Fxnansinn



## 16 Bit Adder That consist of 4, 4 Bit Binary Adder



## Above Diag With Carry-Look Ahead Generator



## 4 Bit Adder Module with Overflow



## An 8 bit Adder Subtractor



## Multiplication

## Introduction

- Fixed point multiplication requires substantially more hardware than the fixed point addition.

While first bit is multiplied then we have to shift 0 bit to the left, when second bit is multiplied then there is a shift of 1 bit. Similarly for third, fourth bit and so on.

The value of Partial Product is $P_{i+1}=P_{i}+x_{j} 2^{i} Y$ 3

- So final product is $P=\sum_{j=0} x_{j} 2^{i} Y$


## Hardware For Multiply Operation

$B_{s}$


## Flow Chart For Multiply Operation

Figure 10-6 Flowchart for multiply operation.


## For Eg: $23 \times 19=437$

Multiplicand $B=10111$
E $\quad A$
Q
SC
Multiplier in $Q$
$Q_{n}=1$; add $B$
First partial product
Shift right $E A Q$
$Q_{n}=1$; add $B$
Second partial product
Shift right EAQ
$Q_{n}=0$; shift right $E A Q$
$Q_{n}=0$; shift right $E A Q$
$Q_{n}=1 ;$ add $B$
Fifth partial product
Shift right EAQ

| 0 | 00000 | 10011 | 101 |
| :--- | :--- | :--- | :--- |
| 0 | $\underline{10111}$ |  |  |
| 0 | 01111 |  |  |
|  | $\underline{01011}$ | 11001 | 100 |
| 1 | $\underline{00010}$ |  |  |
| 0 | 10001 | 01100 | 011 |
| 0 | 01000 | 10110 | 010 |
| 0 | 00100 | 01011 | 001 |
|  | $\underline{10111}$ |  |  |
| 0 | 11011 |  |  |
| 0 | 01101 | 10101 | 000 |

Final product in $A Q=0110110101$

# Multiplication of Twos Complement Multiplier Can be Done by: 

- James E Robertson Algorithm
- Booth's Algorithm


## Booth's Algorithm



$$
-9 x-13=117
$$



## Diagram of Multiplication



## Array Multiplier

Checking the bits of the multiplier one at a time and forming partial products is a sequential operation that requires a sequence of add and shift micro operations.

- The multiplication of two binary numbers can be done with one micro operation by means of a combinational circuit that forms the product bits all at once. This is done by making use of an array multiplier.


# AND <br> Array <br> for <br> $4 \times 4$ <br> bit Multiplication: 



## Full Adder Array For $4 \times 4$ Bit



## One Cell Consist of



## Division

## Introduction

- Division is done by 2 methods
- Restoring Method
- Non-Restoring Method


## Restoring Method (Algorithm)

- Step-1 Initialize the register with Dividend
- ASHL
- ADD B' 1
- Step-2 Check If $\mathrm{E}=1$ or $\mathrm{E}=0$
- $\mathrm{E}=1$
- Reset Qn SC=SC-1 goto step-4
- ASHL EAQ
- Add B' +1
- $\mathrm{E}=0$
- Leave $\mathrm{Q}_{\mathrm{n}}=0$ \& Add B
- Restore the Remainder SC=SC-1 goto step-4
- ASHL
- ADD B' 1

Step-4 For Termination When $\mathrm{SC}=0$ Neglect E , Remaincerin $A$, Quotient in Q

## Problems

- Dividend=01110 Divisor= 10001
- Quotient=11010 Remainder=00110
- Dividend=30 Divisor=3 size=4
- Quotient=1010 Remainder=0000
- Dividend $=15$ Divisor=2 size=4
- Quotient=0111 Remainder=0001
- Dividend=38 Divisor=5 size=3 bit
- Quotient=111 Remainder=011


## Datapath of a Sequential n -bit binary Divider:



## Non-Restoring Method

- Step-1 Initialize the register with Dividend - ASHL
- ADD M'+1

If $S=0$

- Reset Q[0]
- If Count=n-1 then goto step-4
- Else
- Count=count+1
- SHL SAQ
- ADD M' +1 then goto either step 2 or step-3
- If $S=1$
- Set Q[0]

If Count $=\mathrm{n}-1$ then goto step-4

## Non-Restoring Method Contd..

- Else
- Count=count+1
- SHL SAQ
- ADD M then goto either step 2 or step-3
- Step-4 if $\mathrm{s}=1$ then Add M
- Quotient in Q
- Remainder in R


## Problems Based On Non-Restoring Method:

- Dividend=1100001 Divisor=1010 Size=4 bits
- Dividend=30
- Dividend $=40$
- Dividend $=25$
- Dividend $=15$

Divisor=3
Size $=4$
Divisor=2
Size $=5$
Divisor=4
Size $=4$
Divisor $=2 \quad$ Size $=4$

## Diag of Non-Restoring Method



ALU

The various circuit that are used to execute data processing instructions are usually combined in a single circuit called ALU.

- There are 2 types of ALU are:
- Combinational ALU
- Sequential ALU


## Combinational ALU (A Basic n bit

 ALU)

## A n-bit Logic Unit



## One Stage of The Logic Circuit



A Register Level View of The 74181 4-bit ALU

- Here the output of the whole circuit comes form:

$$
F_{i}=I P_{i} \oplus I G_{i} \oplus\left(I C_{i-1}+M\right)
$$

## A Register Level View of The

## 74181 4-bit ALU Contd...



- In the above circuit
- When $M=0$ then arithmetic operation performs.
- When $M=1$ then logic operation is performed.


## 16 Bit Combinational ALU



## Sequential ALU

- A sequential ALU uses flip-flops. Sequential circuits compute their output based on the input and the state and the updation of state is based on clock.
- Whereas combinational logic circuits implement Boolean functions it means their functions is only based on their input and they are not based on clocks.

Sequential Circuit


## Some Points Related To Sequential Circuits Are:

- A sequential circuit has inputs and outputs.
- A sequential logic circuit uses a clock.
- There is a box inside the circuit called state and this box contains flip-flops and this flipflop basically store a $k$ bit number for representing the current state.
- Here, the output is computed based on the input and state.


# Some Points Related To Sequential Circuits Are Contd....: 

- The state may be updated at each positive clock edge. When there is not a positive clock edge, the state remains unchanged.
- The information needed to update to the state comes from the current state and the input which is fed through combinational logic and fed back into the state box, telling the state box how to update itself.


## Structure of a Basic Sequential ALU:



## A Register File With 3 Access Ports



A Symbol

## A Register File With 3 Access Ports



## A Logic Diagram

## A Generic Datapath Unit With an ALU \& Register File:



## 16-Bit ALU composed of four 4-bit

 Slices:

Control

## Organization of The 29014 bit ALU Slice:



A 16-Bit 4 Slice Array of 2901 Employing CarryLookAhead Generator:


## Organization of GEC Plessey 1601 ALU \& Barrel Shifter:



## Floating Point Arithmetic

- Let $\left(X_{m}, X_{E}\right)$ be the floating point representation of a number $X$ which has numerical value $X_{M} \times \mathrm{B}^{\mathrm{XE}}$.
Where $X_{m}$ is the mantissa and the exponent $X_{E}$ are fixed point numbers.

The four basic arithmetic operations for floating point numbers are:

- $X+Y=\left(X_{m} 2 X_{E}-Y_{E}+Y_{M}\right) \times 2 Y_{E}$
- $X-Y=\left(X_{m} 2_{E} X_{E}-Y_{E}-Y_{M}\right) \times 2_{E}{ }_{E}$
- $X \times Y=\left(X_{m} \times Y_{M}\right) \times 2 X_{E}+Y_{E}$
$X \times Y=\left(X_{m} / Y_{M}\right) \times 2_{E}{ }^{-Y} E_{E}$

The floating point operation has 4 main steps:

- Compare The Exponent
- Align The Mantissas
- Add The Mantissas
- Normalize The Result
- The Floating point addition and subtraction has 3 main steps:
- Compute $\mathrm{Y}_{\mathrm{E}}-\mathrm{X}_{\mathrm{E}}$
- Shift $X_{M}$ by $Y_{E}-X_{E}$ places to the right to obtain $X_{M}$ $2 X_{E}-Y_{E}$.
Compute $\left(X_{m} 2_{E} X_{E}-Y_{E}+-Y_{M}\right) \times 2_{E}$


## Algorithm For Addition

- The first step of the algorithm is equalization of the exponent which is done by subtracting them and aligning the mantissa by shifting one of them until the difference between the exponent has been reduced to 0 .
- The aligned mantissas are added.
- Finally the result is normalized, if it is necessary by again shifting the mantissa and making a compensating change in the exponent.


## Algorithm For Addition Contd...

- Note: The left most bit position of E indicates the sign of the exponent if it is 0 then the exponent is negative if it is 1 then it is positive.
- Load E1, E2, AC \& DR
- Compare E=E1-E2
- If $\mathrm{E}<0$ Then $\mathrm{Ac}:=$ right-shift(Ac), $\mathrm{E}:=\mathrm{E}+1$
- If $\mathrm{E}>0$ Then $\mathrm{DR}:=$ right - shift $(\mathrm{DR}), \mathrm{E}:=\mathrm{E}-1$
- If $\mathrm{E}=0$ Then $\mathrm{AC}:=\mathrm{AC}+\mathrm{DR}, \mathrm{E}:=\max (\mathrm{E} 1, \mathrm{E} 2)$


## For Eg:



- In the above example the number has 32 bit floating point format of IEEE standard 754. In this format each number N has 23 bit fractional part with a hidden bit and an 8 bit exponent E in excess-127 code.

Exponent registers

| E1 | E2 | E | $\overline{\mathrm{A}}$ |  |
| :---: | :---: | :---: | :---: | :---: |
| 01111111 | 10000111 | (00000000) | 110000 | DR |
| $=X_{\text {E }}$ | $=Y_{\text {E }}$ |  | $=1 x_{n}$ | 10010101101000 .... $=1 . Y_{u}$ |

01110111
$=X_{k}-Y_{\mathrm{E}}$
01111000 0H00000000000 ... 00
01111001 00110000000000 . . . 00
01111010 00011000000000 ... 00
0111011 00001100000000 ... 00
$01111100 \quad 00000110000000 \ldots 00$
0111110100000011000000 .. 00
$01111110 \quad 00000001100000 \ldots 00$
01111111 000000001 10000 ... 00
10000000
10000111 10010110011000 ... 00
$=\gamma_{\mathrm{E}} \quad=\mathrm{AC}+\mathrm{DR}$
$\begin{array}{ll}10000111 & 10010110011000 \ldots . \ldots 0 \\ =\left(x+n_{\mathrm{L}}\right. & =1.0 x+n_{m}\end{array}$

## Pipeline Processing



## Four Stage Floating Point Adder Pipeline:



## Operation of the 4 stage floating-point adder pipeline:





## Summation of An Eight Element

$s$

$t=6$
$t=7$
$t=8$


Pipelined Multiplier

## Multiplier Pipline



## A Two Stage Carry-Save Adder



## A Carry-Save Multiplier (Wallac-

 Tree)

## A Pipelined Carry-Save Multiplier:



