• No results found

ELE 356 Computer Engineering II. Section 1 Foundations Class 6 Architecture

N/A
N/A
Protected

Academic year: 2021

Share "ELE 356 Computer Engineering II. Section 1 Foundations Class 6 Architecture"

Copied!
38
0
0

Loading.... (view fulltext now)

Full text

(1)

ELE 356

Computer Engineering II

Section 1 – Foundations

Class 6 – Architecture

(2)

2 © tj ELE 356 – Fall 2015

Architecture

• ENIAC Video

(3)

3 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

History

(4)

4 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• The Antikythera Mechanism

• Oldest known scientific calculator • From the Greek period

• Over 30 gears

• Used to find the position of the sun, moon and planets

• Predict eclipses

(5)

5 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Sectors

• Developed in the late 1500s

• Adapted to a broad range of approximate computations

• Basic arithmetic, calculating areas and volumes, converting currencies

(6)

6 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Slide Rule

• Developed in the 1600s • Widely used until 1970s

• Adapted to a broad range of approximate computations • Basic arithmetic, trig, logs, …

• Specialized applications developed (nuclear, space, military)

(7)

7 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Leibniz Stepped Reckoner calculator

• Developed in 1673

• First four-function calculator • Based on drum-shaped gears

• Lead to almost 300 years of derivative devices

(8)

8 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Babbage Analytical Engine

• Started in 1834 • Never completed

• Processing unit, memory unit, punch card input • First programmable computer

(9)

9 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Advanced Mechanical Calculators

(10)

10 © tj ELE 356 – Fall 2015

Architecture

• Mechanical Devices

• Electric motors added to speed things up

(11)

11 © tj ELE 356 – Fall 2015

Architecture

• Early Computers

• Harvard MK1

• 1937-1944

• Electromechanical – 50ft shaft • Hand Programmed

(12)

12 © tj ELE 356 – Fall 2015

Architecture

• Early Computers

• ENIAC

• 1943-1945

• All electrical – vacuum tubes • Hand programmed

(13)

13 © tj ELE 356 – Fall 2015

Architecture

• Early Computers

• Manchester Small Scale Experimental Machine (baby)

• 1948

• First stored program computer • Program memory was

tube based

(14)

14 © tj ELE 356 – Fall 2015

Architecture

• Early Computers

• EDVAC

• Modified the ENIAC to use the stored program concept • First to use paper tape to input programs

(15)

15 © tj ELE 356 – Fall 2015

Architecture

• Mass Produced Computers

• UNIVAC

• 1951

• Console for control

• Used Mag tape for storage

(16)

16 © tj ELE 356 – Fall 2015

Architecture

• Mass Produced Computers

• IBM

• 1958

• First mass produced transistor based computer

(17)

17 © tj ELE 356 – Fall 2015

Architecture

• Mini-Computers

• PDP8, LINC, PDP11, Vax11/780, Wang2200

(18)

18 © tj ELE 356 – Fall 2015

Architecture

• Personal Computers

• Intel 8008 – home made computers

(19)

19 © tj ELE 356 – Fall 2015

Architecture

• Personal Computers

• TRS80, Apple II, Commodore PET

• Publically available – 1977

• IBM PC – 1981

• Huge adoption

• Software explodes

• Laptops – 1989

• Mobility

(20)

20 © tj ELE 356 – Fall 2015

Architecture

• TRS 80 video

(21)

21 © tj ELE 356 – Fall 2015

Architecture

• General Purpose Processor

• User Programmable

• Intended to run end user selected programs

• Application Independent

• PowerPoint, Chrome, Twitter, Angry birds, …

• Embedded Processor

• Not User Programmable

• Programmed by manufacturer

• Application Driven

• Non-smart phone, appliances, missiles, automobiles, … • Very wide and very deep applications profile

(22)

22 © tj ELE 356 – Fall 2015

Architecture

• General Purpose Processor

• Key Characteristics

• 32/64 bit operations

• Support non-real-time/time-sharing operating systems • Support complex memory systems

• Multi-level cache • dRAM

• Virtual memory

• Support DMA-driven I/O • Complex CPU structures

• Pipelining

• Superscalar execution

• Out-of-order execution (OOO) • Floating Point HW

(23)

23 © tj ELE 356 – Fall 2015

Architecture

• General Purpose Processor

• Examples

• ARM 7, 9, Cortex A8, A9,A15 • Intel Pentiums, Ix, …

• AMD Phenom, Athleron, Opteron • Apple A4, A5

• TI OMAPs

(24)

24 © tj ELE 356 – Fall 2015

Architecture

• Embedded Processor

• Key Characteristics

• 4/8/16/32 bit operations

• Support real-time operating systems • Relatively simple memory systems • Memory mapped I/O

• Simple CPU structures • Few registers

• Limited Instructions

• Support for multiple I/O schemes • Wide range of peripheral support

• A/D – D/A • Sensors

• Extensive interrupt support

(25)

25 © tj ELE 356 – Fall 2015

Architecture

• Embedded Processor

• Examples

• Motorola/Freescale 68K, HC11, HCS12 • ARM Cortex Rx, Mx

• Atmel AVR

(26)

26 © tj ELE 356 – Fall 2015

Architecture

• CISC – Complex Instruction Set Computer

• Name didn’t even exist until RISC was defined • Used in most processors until about 1980

• One instruction holds multiple actions

• Load data from location, add, write data to new location

• Many times the instructions were designed to emulate high level language constructs

• RISC – Reduced Instruction Set Computer

• Developed in the ‘80s

• Most prevalent architecture today

• Sometimes called a load/store architecture • Instructions are simple

• Load data from location • Add

• Store data to location

• RISC dominates today

• Much easier to take advantage of advanced structures like Pipelining, Superscalar, OOO

(27)

27 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

SISD – Single instruction – Single Data INSTRUCTIONS DA T A P SISD

SIMD – Single Instruction – Multiple Data

P INSTRUCTIONS DA T A P P SIMD

MISD – Multiple Instruction – Single Data

INSTRUCTIONS

DA

T

A

P P P

MISD

MIMD – Multiple Instruction – Multiple Data

P INSTRUCTIONS DA T A P P P P P P P P MIMD

(28)

28 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

ALU CONTROL STATUS UNIFIED MEMORY CONTROL ADDRESS ALU CONTROL STATUS INSTRUCTION MEMORY DATA MEMORY ADDRESS ADDRESS CONTROL

von Neumann Harvard

(29)

29 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Memory Bus Structure

Modified Harvard

ALU CONTROL

STATUS

INSTRUCTION MEMORY

DATA MEMORY

ADDRESS ADDRESS

CONTROL

UNIFIED MEMORY

(30)

30 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Cache Memory

Modified Harvard ALU CONTROL STATUS INSTRUCTION MEMORY DATA MEMORY ADDRESS ADDRESS CONTROL UNIFIED MEMORY

These memories are often augmented by cache

memories or are caches themselves

(31)

31 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Cache Memory

• Cache memory is used to store relatively small amounts of data or program for a relatively short amount of time

• Sit between the processor and the main memory

• Fast – keep them small to make them fast – allow the processor to run faster than main memory would allow

• Leverage the concept of temporal locality

• If you have recently used a piece of data you are more likely to use it again

• Leverage the concept of spatial locality

(32)

32 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Cache Memory

• Basic Operation

• Processor requests a byte of program or data

• The system first checks to see if the byte is already in the cache • if Yes – read the byte and continue (called a cache hit)

• if No

• stall or allow the processor to do something else (called a cache miss) • read the byte from main memory into the cache

• read the byte from the cache and continue

• If the cache is full and a new byte needs to be loaded – several methods can be used to remove an existing byte

• LRU – least recently used byte is removed • FIF0 – oldest byte loaded is removed

(33)

33 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Pipelining

0 1 2 3 4 5

D

C D

B C D

A B C D

CPU Execute A B C D

A B C D

A B C

A B

A

4us 4us 4us 4us 4us

Execute = fetch instruction, decode, execute, write back

Clock Cycle Waiting Instructions Retired Instructions No Pipeline

(34)

34 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Pipelining

• Break complex tasks into smaller chunks

• Start the next instruction as soon as each subtask is complete

0 1 2 3 4 5 6 7 8

D

C D

B C D

A B C D

Fetch A B C D

Decode A B C D

Execute A B C D

Write back A B C D

A B C D

A B C

A B

A 1us 1us 1us 1us 1us 1us 1us 1us Waiting Instructions Retired Instructions Clock Cycle Pipeline

(35)

35 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

• Superscalar

(36)

36 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

(37)

37 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

(38)

38 © tj ELE 356 – Fall 2015

Architecture

Architectural Configurations

References

Related documents

This is the first study to describe the occurrence of atypical EPEC and STEC in saffron finches seized from the illegal wildlife trade, and one of the few studies to evaluate the

cars Eco-cars Light cars Older model cars General Consumers Luxury foreign cars Appraisal Purchase(Stock) 50,000 Cars × gross profit (retail + wholesale). Directly managed

Real estate limited partnerships (RELPs) trade in the secondary market at discounted prices compared to their NAVs. Business appraisers consider these discounts a proxy for

Outline: The installation of new safety surfacing to improve the play area at Waterside Mede in market Bosworth:

Gummilist 15 x 8 mm (Ange längd vid beställning) Rubber moulding 15 x 8 mm (State length when ordering)

Knowing how other resellers are representing your brand and what people are saying about your product or brand through social media platforms (user generated content) is a

H4: There are significant differences between the segments in reference to the consumers' satisfaction with quality of supply and service at city markets: product quality,

Solely by increasing the total level of NIT1 in the plant, the primary root length was reduced by approximately 40% compared to wild-type Arabidopsis and nitrilase impaired