ECE232: Hardware Organization and Design
ECE232: Hardware Organization and Design
Part 11: Pipelining Chapter 4/6
Adapted from Computer Organization and Design, Patterson & Hennessy, UCB
CPI Calculation
CPI stands for average number of Cycles Per Instruction
Assume an instruction mix of 24% loads, 12% stores, 44% Rformat, 18% branches, and 2% jumps
CPI = 0.24 * 5 + 0.12 * 4 + 0.44 * 4 + 0.18 * 3 + 0.02 * 3 = 4.04
Speedup?
Question: Can we achieve a CPI of 1???
ECE232: Pipelining I 2
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Speeding up through pipelining
Ann, Brian, Cathy, Dave each have one load
of clothes to wash, dry, and fold ? Washer takes 30 minutes
ABCD
? Dryer takes 30 minutes
? "Folder" takes 30 minutes
? "Stasher" takes 30 minutes to put clothes into drawers
ECE232: Pipelining I 3
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Sequential Laundry
6 PM 7
8
9 10 11 12
1 2 AM
T
30 30 30 30 30 30 30 30 30 30 30 30 30 30 30 30
aA
Time
s
kB
C O rD d e r
Sequential laundry takes 8 hours for 4 loads If they learned pipelining, how long would laundry take?
ECE232: Pipelining I 4
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Pipelined Laundry: Start work ASAP
6 PM 7 8 9 10 11 12
T
30 30 30 30 30 30 30
aA
s kB
Time
C O rD d e r
Pipelined laundry takes 3.5 hours for 4 loads!
1 2 AM
ECE232: Pipelining I 5
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Pipelining Lessons
6 PM 7
8
9
T
Time
a
30 30 30 30 30 30 30
sA k
B
OC r dD
e
r
Pipelining doesn't help latency of single task, it helps throughput of entire workload Multiple tasks operating simultaneously using different resources Potential speedup = Number pipe stages
Pipeline rate limited by slowest pipeline stage Unbalanced lengths of pipe stages reduces speedup Time to "fill" pipeline and time to "drain" it reduces speedup
ECE232: Pipelining I 6
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Instruction
Pipelining Instructions
Time (in cycles)
F
D
EX M
W
F
D EX M
W
F
D
EX M
W
Fetch = 10 ns Decode = 6 ns Execute = 8 ns Memory = 10 ns Write back = 6 ns
F
D EX M
W
F
D EX M
W
F
D
EX
M
W
ECE232: Pipelining I 7
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Single Cycle, Multiple Cycle, vs. Pipeline
Cycle 1 Clk
Single Cycle Implementation: Load
Cycle 2
Store
Waste
Cycle 1 Cycle 2 Cycle 3 Cycle 4 Cycle 5 Cycle 6 Cycle 7 Cycle 8 Cycle 9 Cycle 10
Clk
Multiple Cycle Implementation: Load Ifetch Reg Exec Mem
Store Wr Ifetch Reg
Exec
R-type Mem Ifetch
Pipeline Implementation: Load Ifetch Reg Exec Mem Wr
Store Ifetch Reg Exec Mem Wr
R-type Ifetch Reg Exec Mem Wr
ECE232: Pipelining I 8
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
Why Pipeline?
Suppose we execute 100 instructions Single Cycle Machine ? 45 ns/cycle x 1 CPI x 100 inst = 4500 ns Multicycle Machine ? 10 ns/cycle x 4.04 CPI (for the given inst mix) x 100 inst
= 4040 ns ? Instruction mix of 24% loads, 12% stores, 44% R-format, 18%
branches, and 2% jumps Ideal pipelined machine (with 5 stages) ? 10 ns/cycle x (1 CPI x 100 inst + 4 cycle drain) = 1040 ns Speedup=4.33 vs. single-cycle
3.88 vs. multi-cycle (for the given inst mix)
ECE232: Pipelining I 9
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
ALU ALU ALU ALU ALU
Why Pipeline? Because the resources are there!
Time (clock cycles)
I n
Inst 1 Im
s
t Inst 2
r.
O Inst 3
r
d e
Inst 4
r
Inst 5
Reg
Dm Reg
Im Reg
Dm Reg
Im Reg
Dm Reg
Im Reg
Dm Reg
Im Reg
Dm Reg
ECE232: Pipelining I 10
Adapted from Computer Organization and Design, Patterson&Hennessy,UCB, Kundu,UMass
Koren
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- memory storage calculations rutgers university
- ece232 hardware organization and design
- how to check the operating system os on your ti nspire cas
- how to calculate subnets northampton community college
- chapter 10 exam a rhinebeck central school district
- how to connect the ti 84 plus calculator and pc
- e6 b flight computer instructions
- the flight computer true airspeed tas
- numerical mathematical analysis university of pittsburgh
- how to find your computer name in windows 7 8 10 and
Related searches
- information system organization and strategy
- business plan organization and management
- team organization and leadership
- business organization and management pdf
- computer organization and architecture stallings
- computer organization and architecture 10th
- organization structure and design ppt
- personal organization and time management
- organization and time management skills
- organization theory and design 12th edition
- importance of organization and methods
- cerebral cortex organization and function