An expert system to optimize combinational logic

(1)

Rochester Institute of Technology

RIT Scholar Works

Theses

Thesis/Dissertation Collections

2-5-1987

An expert system to optimize combinational logic

Gu-Ching Lin

Follow this and additional works at:

http://scholarworks.rit.edu/theses

This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion

in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact

[email protected]

.

Recommended Citation

(2)

Rochester Institute of Technology

School of Computer Science and Technology

An Expert System To Optimize Combinational Logic

by

Gu-Ching Lin

A thesis, submitted to

The Faculty of the School of Computer Science and Technology;

in partial fulfillment of the requirements for the degree of

Master of Science in Computer Science

Approved by:

John L. Ellis

Professor John L. Ellis (Chairman)

John A. Biles

Professor John A. Biles

George A. Brown

(3)

Title of Thesis:

An

Expert System To Optimize Canbinational Logic

---~--~~---~---~--I

Gu~g

Lin

°-

prefer to

be

contacted each time a request for

reproduction is made.

I can

be

reached at the following address:

(4)

ABSTRACT

Twenty

to

fifty

percent of the active area of most semicustom

integrated

circuits

is

devoted to

combinational logic.

Automating

the synthesis and optimization of combinational _circuitry can

result

in

significant

improvements

in

both

the design cycle time

and the overall area of the

implementation.

This thesis presents

a rule-based system

that

optimizes combinational

logic

for

a

given

technology.

By

performing

Boolean

function

minimization,

decomposition,

logic

synthesis and a series of local

transformations4,

the system achieves area reductions and saves

(5)

1.

Introduction

and

Background

1.1 Problem Statement

1.2 Previous Work

1.3 System Approach

2. Knowledge-Based

Expert

Systems

2.1 Productions Systems

2.2

Applicability

of Knowledge-based System to Logic

Optimization

2.3 Advantages and Disadvantages of Knowledge-Based Expert

Systems

3. System Implementation

3.1 Minimization

3.2 Decomposition

3.3 Synthesis

3.4 Optimization

3.4.1 Knowledge Base

3.4.2 Control Structure

3.5 Program Ogranization

3.6 Input and Output

4. Conclusions and Future Work

Appendix A. User's Manual

Appendix B. Example

Appendix C. Rule Base

Bibliography

(6)

1. Introduction and

Background

Optimization of combinatorial

logic

is

a

lengthy

and

difficult task

for

circuit

designers.

Since a significant

percentage of most chips consists of combinatorial

logic,

this

optimization can take much

effort,

increasing

the overall

turnaround

time

of

the

design.

In _addition, when an

existing

design

is

converted

from

one

technology

to

another, circuit

designers

have

to

reoptimize the

existing

implementation to

take

full advantage of the

target

technology. Often this optimization

is

not even _performed,

resulting

in unnecessarily

large and slow

chips.

This thesis presents a rule-based expert system that

optimizes combinational

logic

for a specific target technology.

The system consists of _{four modules,} _{the minimization,} decompos

ition,

and synthesis module are used to generate a minimized,

multilevel netlist

describing

the target

technology

for

optimization module, which performs subsitiutions of equivalent

gate configurations,

thereby

reducing

the overall area of the

implementation and

improving

the speed of the design.

1.1 Problem Statement

One concept of automated logic

design

is

the conversion

of a functional description

to

a

logic

implementation.

While such

(7)

(data

flow

design,

control

logic

design,

physical

layout,

etc.),

the

thesis

focus

here

is

on methods

primarily

applicable to

logic

optimization.

Current

techniques

for

automatic

design

of control

logic

fall

into

two

broad

categories. The

first,

which we refer

to

as

two-level

design,

is

characterized

by

the use of a two-level

disjunctive

form

(DF)

as an

intermediate

stage

in

synthesis.

Roughly

speaking,

these

techniques

produce a

two-level

representation

for

a

function

which

is

to be

implemented

and

minimize or optimize

it

further.

The

resulting

representation

is

then factored

into

a network of

gates,

transistors,

PLA's or

other functional units. The

key

observation

concerning

this style

of

design

is

that there

is

an

intermediate

stage at which all

information

about possible algorithms which might have been

contained

in

the original specification of the function

is

discarded.

Only

information

about the function to be computed

is

retained.

This approach to automatic

design

has several

advantages.

First,

minimization of the DF corresponds to an

optimization

in

the space of all algorithms for the given

function. This method therefore has the potential to uncover

extremely clever ways to

implement

the

function

which

may

have

been

overlooked

by

the designer.

Second,

there

is

a firm

theoretical foundation underlying this methodology. There has

(8)

of this

theory

relates to a

two-level

representation. For

example,

there are well understood techniques

for

taking

advantage of "don't care"

information

during

minimization.

There are also problems with

the

use of

two-level

minimization

in

the automatic generation of

logic.

True

two-level

minimization algorithms require exponential

time;

however,

recently

a technique

has

been

discovered,

ESPRESSO-IIC1,

which

in

actual _examples, comes close

to

the

true minimum and

has

an

acceptable

running

time. A more serious

difficulty

is

that the

result of two-level minimization

is

a network of gates with

unlimited fan-in. Current technologies have fan-in

limitations,

so the result of Boolean minimization cannot be

directly

realized

in

any

actual technology. The original

design

may

have had

implicit

information

about

the

sharing

of

intermediate

results which would

be

useful

in altering

the two-level design to

meet

technology

requirements,

but,

in

the process of

putting

the

design

into

DF and _minimizing the

DF,

this

information

has been

lost.

Rediscovering

this

information

in

order to construct a good

multilevel design

is

the

factoring

problem, and currently there

is

a technique known as weak

division6,

which takes a two-level

function and creates a multilevel

function

based on small

subexpressions.

Thds second

information

concerning

the

implied

algorithm

is

retained whenever possible.

This class of

design

methodologies has advantages and

disadvantages,

which are almost

complementary to

those of

the

two-level approach. Their

primary

advantage

is

that

information

implicit

in

the specification

is

retained throughout the design

process. This permits

the

process to use

insights

which the

designer

may

have

included

in

the

description

while also

permitting computational

efficiency

since the

factoring

problem

is

largely

avoided.

There are several disadvantages to this type of

approach.

First,

since no global optimization (in the sense of

Boolean minimization)

is

performed, the

efficiency

of the

eventual implementation

is

limited

by

the form of the

specification. For example,

it

is

unlikely

that a compiler-like

method can accept a specification

for

a carry-chain adder and

produce the implementation of a carry-lookahead adder.

Second,

the available techniques for

dealing

with

redundancy

and "don't

care" _are

in

the

early

stages of development.

Finally,

there

is

no firm mathematical foundation underlying these approaches

in

(10)

1.2

The goal of

logic

synthesis

is

to

accept

functional

specifications

for

a

hardware

unit and

to

generate

automatically

a

detailed,

technology-specific

implementation

comparable

in

quality

to

that

of an experienced engineer. There has

been

much

work on

automating

logic

design

and

many

effective tools

have

been developed to aid the

designer.

Early

work centered on

developing

algorithms

for

translating

a

boolean

function

into

a

minimum two-level network of

boolean

primitives14. Latter efforts

attempted to raise the

level

of specification15. The results

were

usually

more expensive

than

manual

implementations

and did

not take advantage of the target technology. For _example, the

ALERT16

system was validated on an

existing

design,

the IBM

1800,

and the

implementation

produced required 160% more circuits than

the manual design.

In attempts to generate more efficient logic and to give

the user more control over the

implementation,

other strategies

were

tried;

computer design language simulation and boolean

translation. These constrain the specification language so that

there

is

nearly a one-to-one correspondence

between

the

specification and the

implementation.

Of course, this constraint

(11)

Recently,

interest

has

grown

significantly

in

AI

applications of

digital

system

design.

DAS/Logic9

(Design

Assistant

Series)

is

a

tool

being

developed

at Carnegie

Group

Inc. to aid

in

the

design

of

integrated

circuits. DAS/Logic

is

a

rule-based system written

in

OPS52

which refines a

textual

behavioral

description

to

a circuit schematic. The system's

input

is

a

high

level

language

description

of

the

target

IC's

behavior;

the output consists of a set of standard cells and an

interconnection

list.

The system

is

separated

into

four

levels.

The first

level

is

the Behavioral

level

,which describes the

input/

output behavior of a

digital

system. The next

level

is

the

Generic Logic level.

Here,

the Behavioral description

is

translated

into

a

logic

representation. In the third

level,

the

Committed Logic

level,

the Generic Logic representation

is

cast

into

the appropriate primitive gates

for

the

implementation

technology. For example, the Generic Logic structure

is

composed

of AND and OR gates that correspond to

the

logical

form of the

Behavioral description. At the Committed Logic

level,

the AND and

OR gate structures are changed

into

NAND or NOR gates. The final

level

is

the Standard Cell

level,

where the transistor circuits

required to

implement

a particular

logic

function are specified.

A number of research groups are

currently

exploring

knowledge based approaches to various aspects of VLSI. Here we

describe a knowledge based system called

REDESIGN10,

which

(12)

altered

functional

specifications. Given

the

redesign

goal,

the

system generates plausible

local

changes

to

make within

the

circuit,

ranks the changes

based

on

implementation

difficulty

and goal

satisfaction,

and checks

for

undesirable side effects

associated with

the

changes. The system provides

design

assistance

by

combining

casual

reasoning,

analyzing

the

cause-effect relations of the circuits _operation, with

functional

reasoning, and

analyzing

the purposes or roles of circuit

components. Circuit

knowledge

in

REDESIGN

is

represented as a

network of modules and

data

paths. The system was

developed

at

Rutgers

University

and reached the stage of a research prototype.

1.3 System Approach

Automating

the synthesis and optimization of

combinational _circuitry can result

in

significant

improvements

in

both the design cycle time and the overall

quality

of the

implementation. Standard

techniques,

such as

two-level

minimiza

tion

tools,

for performing

logic

level

reduction are a major

step

in

this

direction,

but

they

fail

to address the actual

circuit-level implementation. Such minimizers will

find

an

optimal implementa-tion

using

AND/OR gate,

for

example, but

(13)

This

thesis

takes a

four

step

approach to the synthesis

and optimization problem

for

combinational

logic.

These steps are

(Figure

1.1)

:

.

minimizing

the

Boolean

equations,

.

factoring

the

two-level

functions

into

multilevel

functions,

.

synthesizing

an

initial

network,

and

.

optimizing the

network

for

a given

technology.

During

the minimization

phase,

the set of Boolean

equations

describing

the

desired

functions

is

reduced

using

mathematical methods that

take

advantage of the "don't care"

set.

The ESPRESSO-IIC program performs the reductions. In the second

phase, the equations are factored

using

a technique known as weak

division,

which takes a two-level function and creates a multi

level function based on small subexpressions that occur often

in

the original function. This

tool

detects and eliminates multiple

occurrences of the same _{subexpression,} otherwise

it

would result

in

duplicate logic

in

the synthesized circuit.

In the synthesis _phase, an

initial

network

is

created

by

using

a NAND or NOR

implementation

for

target technology.

Finally,

this network

is

optimized for area

by

performing

a

series of local

transformations4'5

on the circuit. These

transformations are formulated as rules

to

be

applied to the

(14)

|

BOOLEAN

FUNCTION

j

I

|

MINIMIZATION

f^

I

'1

I

ESPRESSO-IIC

|

DECOMPOSITION

|

I

j

SYNTHESIS

I

|

. AND/OR

|

.NAND/NOR

^

I

j

OPTIMIZATION

1

I

|

RULE LIBRARY

|

I

LOCAL TRANSFORMATION

NAND/NOR GATE

(15)

2.

Knowledge-Based

Expert

Systems

In

recent

years,

research

in

the

field

of artificial

intelligence

has

had

many

important

successes.

Among

the most

significant of

these has

been the

development

of powerful new

computer systems

known

as "expert" or "knowledge-based" systems.

These programs are

designed

to represent and

apply

factual

knowledge

in

specific areas of expertise to solve problems. For

example, collaborative efforts

by

human

experts and system

developers

have

resulted

in

systems

that

diagnose

diseases,

configure computer

systems,

and prospect

for

minerals at

performance levels equal to or

surpassing

human expertise. The

potential power of systems

that

can replicate expensive or rare

human knowledge has led to a worldwide effort to extend and

apply

this technology.

An expert system

essentially

consists of a knowledge

base

and an

inference

engine. The knowledge

base

contains facts

and rules that use those facts as the basis

for

decision making.

The inference engine contains an

interpreter

that decides how to

apply

the rules to

infer

new knowledge and a scheduler that

decides the order

in

which the rules should be applied. This

organization

is

shown

in

Figure 2.1.

(16)

FACTS

1"

I

RULES

KNOWLEDGE BASE

(Domain

Knowledge)

I

j

INTERPRETER

|

I

1

I

SCHEDULER

I

INFERENCE ENGINE

(General

Problem-sovling

Knowledge)

Figure 2. 1 The structure of an expert system

In expert _system, knowledge

is

used to slove problem and

determine new facts based upon what

is

already

known. The

knowledge should be efficiently usable and

easily

expandable.

Knowledge,

therefore,

has to be represented for quick and _easy

retrieval, for ease

in

further expansion and modification, for

use

in

reasoning or solving a specific problem. In order to

satisfy the various requirements

for

the knowledge representat

ion,

different techniques have to be used

for

different types of

knowledge. There are three most

widely

used

in

current expert

systems are rules (the most popular), semantic nets and frames.24

Each technique provides the program with certain

benefits,

such

as

making

it

more efficient, more

easily

understood,

or

more

(17)

The

inference

engine uses

knowledge

in

the

knowledge

base

to

solve a specific problem

by

emulating

the

reasoning

process of a

humnan

expert. The approach

to

solving

a problem

consists of

searching

a solution

from

a search space. In AI

terminology,

the

set of all possible solutions

is

known

as

the

search space. The

inference

engine contains

problem-solving

strategies

that

use

knowledge

in

_the

knowledge

base

to serch

for

a solution.

Recently,

the use of

knowledge-based

expert systems

in

digital

system

design has

grown. One of

the

most successful

applications of such systems

is

the Rl7

system used at DEC to

configure

large

computer systems. The rest of this chapter

focuses on rule

base

systems and describes the

applicability

of

knowledge-based expert systems to

logic

design.

2 . 1 Productions systems

A production system consists of a rule base and a

control structure. The rule base

is

composed of a list of

production rules which are checked

repeatedly

until a condition

is

achieved or rejected. The controls structure determines which

rules should

be

executed next and executes the actions specified

by

the rules.

(18)

A production rule

is

a statement cast

in

the

form

"If

this

condition

holds,

then

this

action

is

appropriate."

The

Figure 2.2 and 2.3 show a

transformation

rule

is

encoded as a

production rule

in

this system.

B

"X-

_r .

J

Y

Figure 2.2 A

transformation

rule

IF

a NOR

inverter

is

connected

to

the output of a NOR

gate X and

input

to another NOR gate Y.

THEN

remove the NOR

inverter

and NOR gate X from netlist

and connect

input

A,B

to

NOR gate Y.

Figure 2.3 A production rule

In the Figure

2-3,

the IF part of

the

productions, called the

condition part, states the conditions that must

be

present for

the

production

to

be

applicable, and

the

THEN

part,

called the

(19)

The control structure uses a rule

interpreter,

sometimes

encoded

in

terms

of "metarules" to

find the

enabled rules and to

decide

which rule to apply- One

basic

control

strategy

used

is

data

driven

or event

driven

and starts

from

the available

information

as

it

comes

in,

trying

to

draw

conclusions

that

are

appropriate

to

the goals. This

is

how

the system

in

this

thesis

works. In production systems this

is

called a forward

chaining

method of

inference.

We sometimes work

the

other

way,

however,

starting

from

a goal or expectation of what

is

to happen and

working

backwards,

looking

for

evidence that supports or

contradicts our expectation. This

is

called goal

driven

or

expectation driven and

in

production systems

it

is

referred to as

backward _chaining, since

it

requires

looking

at the action parts

of rules to

find

ones that would conclude the current _goal, then

looking

at the condition sides of those rules to

find

out what

conditions would make them _execute, then

finding

other rules

whose action parts conclude these conditions, and so on.

Data-driven approaches sometimes

have

the disadvantage of

generating

many

hypotheses not

directly

Applicability

of

Knowledge-Based

System to Logic Optimization

Knowledge-based expert systems are

very

costly

to

implement

at the present time. This

is

mainly because:

the

lack

of

knowledge

engineers and adequate sophisticated support

tools,

unfamiliarity

of

knowledge

engineers with the application _area,

and

unfamiliarity

of the experts

in

the

application area with

knowledge-based expert system. These characteristics makes

it

necessary to evaluate the candidate application areas

very

carefully

in

terms of the

applicability

of the knowledge-based

expert system approach. Two

key

ingredients

for

successful

application of knowledge-based expert systems has

been

suggested

by

the Stanford AI group8: attack problems amenable to the

techniques of applied

AI,

and consider

only

important,

difficult,

and high-value problems. We

look

at the above two requirements

in

the logic design area.

1. Logic optimization

is

amenable to the techniques of applied of

AI.

.

By

accumulating design experience,

knowledge-based

expert systems can

imitate

human

problem

solving

(21)

. Since optimization

techniques depend

heavily

on

the

target

technology,

by

using

a rule-based

system,

optimization

for

different

technologies

involves

only

changing

the rule

library.

2. Logic optimization

is

an

important,

difficult

and

high-value

problem.

. The

large

number of conferences organized

in

computer aided

design

of

digital

system,

and

the

enormous

number of papers published

in

logic

circuit

design

are

testimony

to the

importance

and

difficulty

of the

problem.

. Logic optimization

is

a

high-value

problem because

auotmatic optimization of

logic

circuits can

improve

both the logic area and

design

time.

2.3 Advantages and Disadvantages of Knowledge-Based Expert System

In summary, the advantages of

knowledge-based

expert

systems include: the ease with which

human

knowledge can be

encoded, the modularity and

incremental

development

of knowledge-based expert systems, the ease of

modification,

and the capabil

ity

of knowledge-based expert systems

to

explain their decisions.

Disadvantages

include:

their cost of

development,

the slow

execution speed with present

technology,

the

difficulty

of

extracting knowledge

from

human

designers,

and the inefficiencies

(22)

Despite their

disadvantages,

knowledge-based expert

systems

have

proven

to

be

a valuable approach and their

capabilities

increase

as more applications are attempted, as more

people understand the nature of

these

systems, and as more

suitable

hardware

and software tools are developed.

(23)

3. System

Implementation

Numerous tools are available

that

optimize and

implement

combinational

logic.

Most of

these

tools

apply

at the Boolean

level,

are

technology-independent,

and

generally

assume an AND/OR

implementation.

Such

tools

fail

to take advantage of

the

various

types of gates available

in

a semicustom

library.

The need

for

more

flexible

and

technology-oriented

tools was recognized

by

Darringer,

et _al5., who

implemented

a

design

system to perform

local transformations at various

levels

of abstraction. In the

system built

for

this

thesis,

the

local

transformations were

formulated as rules to optimize gate-level circuits for area

in

a given

technology,

and Prolog3 was selected as a formalism to

represent these rules for a rule-based system.

The system

is

divided

into

four main parts: a

minimization, mathematical reduction module; a

decomposition,

multilevel function creation module; a _synthesis, gate-level

implementation module; and an _{optimization,}

local

transformation

module. This chapter discusses these modules

in

detail and

describes the rules that the system uses

to

perform optimization.

The control structure that applies

local

transformations to the

(24)

3.1 Minimization

The goal of

building

an

optimizing digital

circuit will

require

the

efficient manipulation of Boolean

logic

functions,

i

The minimization module reduces

the

set of Boolean equations

describing

the

logic

by

using heuristics that

find

a minimal set

of prime

implicants.

In

finding

this

minimal

set,

the

module

takes advantage of the

"don't

care"

set of

the

function. The

ESPRESSO-IIC program cteated

by

Brayton et al.

in

1982,

performs

the reductions. The goals

in

the

design

of ESPRESSO-II were

to build a logic minimization tool such that

in

most cases

. the problem submitted

by

a

logic

designer

could be

solved with the use of

limited

computing

resources;

. the final results would

be

close to a global optimum.

Although ESPRESSO-II follows the basic techniques used

in

most minimization

tools,

generation of all prime

implicants

and extraction of a minimum prime _cover, the algorithms employed

in

ESPRESSO-II are new and quite different. Efficient Boolean

manipulation

is

achieved through the "unate recursive

paradigm"27, which

is

employed

in

complementation,

tautology

and

other algorithms. All of these algorithms make ESPRESSO-II as an

efficient minimization tool

for

logic

functions with more than

(25)

3.2

Decomposition

One of the problems with

using

two-level minimization

is

that the result of

two-level

minimization

is

a network of gates

with unlimited

fan-in.

This part describes a technique

that

re

constructs a multilevel

function

for

logic

design

by

identifying

subexpressions common

to

two

or more functions.

By

creating

a new

variable

to

represent such

subexpressions,

we also reduce the

complexity

of the original

function

at the cost of

adding

a new

intermediate

function.

In

general,

this reduces the number of

logical components required to

implement

the set of functions.

The decomposition approach

is

algebraic as opposed to Boolean.

The result

is

an algorithm, which,

by

successive substitution

of new variable

for

common _{subexpression,} simplifies a set of

functions until

they

are "relative prime".

Given a set of Boolean _expressions, our aim

is

to pull

out common subexpressions,

consisting

of two or more cubes (a set

of variables) , until the expression

becomes

relatively

kernal

(subexpression)

free. It

is

then

easy

to locate single cubes

dividing

two or more functions.

By

pulling

these out as well, we

can reduce our expression to a set whose

only

common divisors are

single variables. At this point the expression can

be

implemented

independently

with no loss of efficiency; all global _commonality

has

been identified. Let

f

and

g

be expressions, and let v

be

a variable appearing

in

neither. Let

r(f,g)

denote the set

f

-(f/9)9/

the remainder resulting from the

division

of

f

by

g.

(26)

Then

the

substitution

s(v,g,f)

of v

for g

in

f

is

the expression

(f/g)v

+ r(f,g). If

g

is

A +

B,

f

is

C(A +

B)

+

D,

then

s(x,g,f)

= _Cx ₊ _D. _The

kernal

(common subexpression)

A

+ B

is

pulled out

from

f

and _g, and replaced

by

a new variable x. The algorithm

for

computing

f/g

and

kernal

are shown

below.

To compute f/g:

1. Let

g

=

{a^}

_and

for

_each

i

_set

h^

=

{bj

|

ajbj

^ f).

2. Set

f/g

=

h^.

To compute

kernal

(f):

We number the

literals

appearing

in

f

as

li#l2

in-Let

kernal

(0,f)

=

kernal

_(f). _Set

kernal

(f)

= _p.

|

o

|

= _number _of _cubes

in

_f.

kernal

(k,

f

)

:

For

i

=

k

+ 1 to n

Let c =

f/li

if

|c|

> 2 and

i

= _n _then

kernal(lj_,f)

= _c

else

if

|c|

> 2 then

kernal

(lj.,f)

=

kernal

(i,c)

if

kernal

(f)

(1

kernal

(liff)

=

p

then

kernal

(f)

= kernal

(f)

f\

kernal

(li,f)

else

kernal

(f)

= _kernal

(f)

U kernal

(li,f)

(27)

We now

describe

the

first

step

of

decomposition,

which

identifies

expressions

consisting

of

two

or more cubes that

occur

in

several

functions.

This process

is

called

"distillation"6.

The

Distill

Algorithm:

1. Generate all

kernals

for

each

function.

2. Select a pair

kernals

(k,k'),

where

ke

f^t

k'

e

f

j

,

i

?

j,

. such

that

|k

fi

k'|

> 2.

if

no such pair

exists,

stop.

3. Record

(v,K

A

K')

for

some new variable v.

4. Set

f^

= _s(v,k

fl k',fj.)

for

each

function.

5. Go to 1.

The total number of variables

in

the

function

decreases

with each _pass,

hence

the algorithm

terminates.

The particular

pair k

A

k'

selected

in

step

2 can

influence

the

quality

of,

the

resulting decomposition. In the program, a useful heuristic

is

to

select the pair whose substitution most reduces the number of

variables appearing

in

the functions. This

heuristic

was

imple

mented

by

recording each subexpression produced

from

the

interse

ction of

kernals,

the subexpression

appearing

most often

in

the

record

is

chosen

for

the

step

2.

To complete

the

decomposition process, the next _step

is

to

pull out those cubes _consisting of more

than

two

variables

that

exist

in

several functions. This process

is

referred to as

"condensation"6 .

(28)

The Condense Algorithm:

1. Select _cubes, c6

f^

and c'6

fj,

i

f

j#

such that

|c

fi

c'

|

> 2.

if

no such pair exists, stop.

2. Record

(v,c

A

C)

for

3. Set

f-[

= _s(v,c

A

c',^)

for

each function.

4. Go

to

1.

The total

decomposition

process consists of distillation

followed

immediately

by

condensation. The pairs (v,c

fi

C)

generated

by

condensation are added to

the

list

of

(v,k

fi

k')

produced

by

distillation. As

before,

a selection heuristic can

be applied

in

step

1.

Example: Let

f

= _AB(C(D ₊

E)

+ F +

G)

+ H and

g

= _AI(C(D

+E)

+ F +

J)

+ K

Distillation:

Pass 1:

kernal

(f)

= _D +

E;

kernal

(g)

= _D ₊ _E

|k

fi

k'

|

= _D + E > 2

set L = _D ₊ _E

then

f

= _AB(CL ₊ _F ₊

G)

+ H

g

= _AI(CL ₊ _F +

J)

+ K

Pass 2: kernal

(f)

= _CL + F + G

kernal

(g)

= _CL ₊ _F ₊ _J

set M =

|kAk'|

= _CL + F > 2

then

f

= _AB(M +

G)

+ H

g

= _AI(M +

J)

+ K

(29)

Condensation:

Pass 1: set N = _ABM

C\

_AIM = _AM > 2

then

f

= _B(N ₊

AG)

+ H

then g

= _I(N ₊

AJ)

+ K

the substitution

list

is:

L = _D ₊ _E

M = _CL ₊ _F

N = _AM

The

decomposition

takes

about 14 minutes

for

a combinational

logic

with 12 _outputs, 37

inputs

and 14 0 product

terms,

but

the

running

time

is

mostly

spent

in

doing

the

intersection

for each

kernal.

Since we

try

to select a common

subexpression that appears most often

in

the

functions,

we need to generate all common subexpressions and compare each of them

in

reducing the number of variables

in

the

functions.

If we organize

these common subexpressions

according

to their appearance times and

let

the process go

back to

step

2 to select the second order of subexpression after

substituting

a new variable

for

the

functions,

then the _running time

is

largely

reduced

by

_skipping a lot of time spent

in

doing

the

intersection.

When this algorithm

is

applied to a sample with 29

functions

having

23

input

variables,

it

takes about 5 hours compared with the first

algorithm which takes 13 hours to

decompose

this

sample. It saves

up

to 70% of the CPU time

in

the Pyramid 90/X. The savings of

time depends on the size of the

functions.

Of

course,

since this

(30)

results

in many

more

logical

components

than the

first

algorithm.

The algorithm

listed

below

is

called

"fast

decomposition"

and

the

first

algorithm

is

called

"optimal

decomposition".

The

Distill

Algorithm:

1. Generate all

kernals

for

each

function.

2. Select a

pair,

_where

ke

fi,

k'efj#

such

that

|k

fi

k'|>2.

This process proceeds

repeatedly

until

every

kernal

has

been

selected,

then these subexpres

sions

(|kfi

k'|)

are ordered

according

to their

appearance

times.

If

no subexpression _exists, stop.

3. Select

first

common

ubexpression,

if

no such common

subexpression

exists,

go to 1.

4. Record

(v,

k

fi

k')

for

5. Set

f^

=

s(v,kfi

k',fi)

for each function.

6. go to 2.

The first

step

of Condense algorithm

is

also modified.

1. Select _cubes, c

f^,

c'

e

f

j

, such

that

|c

fi

c'

|>2.

This process proceeds

repeatedly

until

every

cube has

been

selected,

then

these

subexpressions (|c

(\

c'|)

are ordered

according to

their appearance times,

if

no subexpression

exists,

stop.

(31)

Now we

look

another example

that has

more structure

than

the

first

one. The

incoming

data

and

resulting design

for

part of

a 16-bit

bus

structure are shown

below.

The varl through var8 are

new variables and

the

common subexpressions associated with

them.

the

incoming

data:

f3

=

h'i'j'k'r

₊ _e'f'ar ₊ _c'ap'r ₊

b'ap'r

₊

d'ar

₊

aps +

h'i'jkr

+

h'ikl'r

+

d'hjkl'r

+

b'hjk'l'p'r

+ e'hjkl'f'r + c'hjkl'p'r +

hjkl'ps

f4

=

h'i'j'k't

₊ _c'ap't ₊

b'ap't

₊ _e'f'at ₊

d'at

₊ _apu ₊

h'i'jkt

+

h'ikl't

+ e'hjkl'f't +

d'hjkl't

+ c'hjkl'p't +

b'hjkl'p't

+

hjkl'pu

f5

=

h'i'j'k'v

₊

d'av

₊

b'ap'v

₊ _c'a _p'v ₊

e'f'av + apw + h'i'jkv

+

h'ikl'v

+ c'hjkl'p'v + e'hjkl'f'v +

b'hjkl'p'v

+ d'hjkl'v

+

jhkl'pw

f6

= _h'i'j'k'x ₊

d'ax

₊ _c'ap'x ₊

b'ap'x

₊ _e'f'ax ₊

apy

+ h'i'jkx

+

h'ikl'x

+ c'hjkl'p'x +

b'hjkl'p'x

+

d'hjkl'x

+ e'hjkl'f'x +

hjkl'py

f7 = _h'i'j'kz + d'az +

b'ap'z

+ e'f'az + c'ap'z + apal +

h'i'jkz

+ h'ikl'z + c'hjkl'p'z +

b'hjkl'p'z

+ e'hjkl'f'z +

d'hjkl'z

+ hjkl'pal

f8

= h'i'j'k'bl + e'f'abl +

d'abl

+

ikl'bl + e'hjkl'f'bl

hjkl'p'bl +

hjkl'pcl

f9

= _h'i'j'k'xl + ap'xl + apdl +

h'i'jkxl

+

h'ikl'xl

+ hjkl'p'xl hjkl'pdl

flO

= h'i'j'k'el + ap'el + apyl +

h'i'jkel

+

hikl'el

+ hjkl'p'el

+ hjkl'pyl

fll

= h'i'j'k'gl + ap'gl + apfl + h'i'jkgl +

hikl'gl

+ hjkl'p'gl

+ hjkl'pfl

fl2

= h'i'j'k'il + ap'il + aphl + h'i'jjil +

h'ikl'il

+ hjkl'p'il hjkl'phl

h'i'j'k'bl + e'f'abl +

d'abl

+ c'ap'bl + apcl +

h'i'jkbl

+ h'ikl'bl + e'hjkl'f'bl +

d'hjkl'bl

+

b'hjkl'p'bl

+ c'hjkl'p'bl +

hjkl'pcl

(32)

The results of

decomposition

are shown

below.

Each of

the

lines

numbered 1

throgh

8 gives a subexpression and the new variable

associated with

it.

For

example,

the

line

numbered

1

associates

the new variable varl with the expression

j'k'+jk.

The variable

varl

is

used

in

place of

the

expression

j'k'+jk

for

the functions

that contain

the

expression

j'k+jk.

1. varl =

j

'k' ₊

jk

2. var2 = _a ₊ hjkl'

3. var3 = ikl' ₊ varli'

4. var4 = b' + c'

5. var5 = _e'f ₊ d' ₊ var4p'

6. var6 = var3h' ₊

var5var2

7. var7 = var2p' ₊ var3h'

8. var8 =

var2p

f3

= _var6r + var8s

f4

= _var6t + var8u

f5

= _var6v + var8w

f6 = _var6x +

var8y

f7 = _var6z + var8al

f8 = _var6bl + var8cl

f9

= _var7xl + var8dl

flO = _var7el + var8yl

fll

= var7gl + var8fl

(33)

3.3

Synthesis

The synthesis module

translates

a Boolean function

into

a gate-level

implementation.

Two synthesis modules were

built,

one

that

generates an

AND/OR

implementation

of the

function

derived

from

the

previous module and one that generates a NAND

or NOR

implementation

for

particular

target

technology

from the

AND/OR

implementation.

The

first

synthesis module

is

relatively

straight

forward and

implements

the

function

as an

interconnected

netlist.

The second synthesis module converts the netlist to a network

composed of NAND gates or NOR gates. This conversion

is

carried

out

by

using

F = (F')' _and _then

applying

DeMorgan's laws:

(Xx

+

X2

+ +

Xn)

' = Xx* _x2'... Xn'

(Xx X2

...

Xn)

' = Xx' + X2' ₊

... + Xn'

The Figure 3.1 illustrates conversion of two-level forms.

A.

B

A+B

A4-B

AB

A-B"

tXj

B

AB

MD-'rO-O

A+B

AB

(34)

The

above conversion produces a

lot

of cascaded

inverters

in

a

multilevel _netlist,

but

since

the

double

inversion

does not alter

a

logic

function,

it

should not appear

in

network. In

this

module, the conversion

is

performed

by

using

rules and

by

passing

an output signal

to

the

level,

so that

the

cascaded

inverters

are not

found

in

the

NAND/NOR

implementation.

The

output signal

tells

a gate

that

its

output

is

logic 0 or logic 1

when

it

is

converted

from

AND/OR

logic to

NAND/NOR logic. The

number of gates cascaded

in

series

between

a network

input

and

the output

is

referred to as the number of 'levels' of gates. The

highest

level

is

the network output. As shown

in

Figure

3.2,

the

network has 4

levels,

the first level

is

gates, the fouth level

is

gatel. The logic conversion

is

started from the gates that are

in

the highest level. The rules

for

_obtaining the NOR network

from a AND/OR netlist are as follow:

1. If the output signal

is

1 and the gate

is

an AND gate,

then change the AND gate to NOR gate and pass a 0

signal to the gates that are

in

the next lower level.

is

0 and the gate

is

an AND gate,

then change the AND gate to NOR gate followed

by

a NOR

inverter,

and pass a 0 signal to the gates that are

in

the next lower level.

is

1 and the gate

is

an OR gate,

then change the OR gate to NOR gate

followed

by

a

NOR,

inverter,

and pass a

1

signal

to

the

gates that are

in

(35)

4.

If

the

output signal

is

0 and

the

gate

is

an OR

gate,

then

change the OR gate

to

a NOR gate and pass a 1

signal to the gates

that

are

in

the

level.

The rules

for

obtaining

the

NAND

network

from

AND/OR

implementation

are

exactly

the

same as

for

NOR network except the

signal

is

as opposed as

NOR

logic.

A

fan-in

constraint

has

been

added

for

the

NAND

implementation

so

that

a NAND gate can

have

no

more

than

four

inputs

in

a

NAND

gate. An example to

illustrate

these rules

is

provided

below.

Figure 3.2

is

traced

from

gate 1 which

is

an OR gate

with an output signal 1. After

applying

rule

3,

gate 1

is

changed

to

a NOR gate followed

by

a NOR

inverter,

and a signal 1

is

passed to

level

3 which

includes

gate 2 and gate 3. Since gate 2

is

an AND gate with an output signal

1,

rule 1

is

selected and

applied

in

netlist

resulting

in

a NOR gate and a signal 0 for

level

(level

2)

. Gate 4

is

also an AND gate but with

an output signal

0,

by

applying

rule

2,

the AND gate

is

replaced

by

a NOR gate with an

inverter

and a signal 0

for

level

1. Rule 4

translates gate 5

from

an OR gate to a NOR gate and passes signal

1 to

input

variable A. The process backtracks to

input

variable B

and finishes conversion

in

level

1. Since D and F are input

variables, there

is

no gate connected to gate 2 and gate

4,

the

process goes back

to

gate 1. Before

going

to

gate 3

it

should be

mentioned that the AND/OR implementation still exists

in

the data

(36)

signal

1

for

gate 5 that we mentioned

before

still exists. Gate 5

is

translated

to a NOR gate and an

inverter

but

the NOR gate with

input

A,

B can

be

found

in

the

network,

so

only the

inverter

is

added

to

the

netlist. The process

is

stopped until the NOR

(37)

A

B

o

$*te5

=o

F

J

6*fceA

rt

Gutel

Qatt^

Figure 3.2 before synthesis

A

B

o

butt*

M>xOlOiH

tot

&

_.

fateJ

k>H>

XVy'/

D^^O^-E^

Gate I

&ate$

(38)

3.4

Optimization

The optimization module performs a succession of

substitutions on an

existing

netlist,

similar to the

way

an

experienced

designer

manipulates a

design

to achieve greater

efficiency. The module consists of a

knowledge

base

and a control

structure.

The system optimizes a circuit

by

performing

a series of

local

transformations

to that circuit. In

performing

each

transformation,

the program replaces a given configuration of

gates

by

another

functionally

equivalent configuration of gates.

These transformations are always applied

in

such a

way

as to

reduce circuit areas and produce more optimal circuits. The

example of such a transformation rule was shown

in

chapter 2.

The control structure of a rule-based system directs the

application of the rules.

During

the application, a meta-rule

determines what rules or sequences of rules are applicable to the

circuit.

3.4.1 Knowledge-Base

In the system, a rule

is

a mechanism to replace a

portion of a circuit

by

a

functionally

equivalent

but

more

desirable circuit portion. Rules are sorted as a netlist

describing

a target configuration

to

be

recognized

in

the circuit

and an associated action

detailing

how to

build

the

(39)

configuration.

Substituting

the

replacement configuration

for the

target configuration

is

to reduce

the

overall area of

the

circuit.

During

the

building

of

the

rule

library,

it

became

apparent

that

a

large

numbers of rules

differing

from each other

by

the number of

input

variables existed. To reduce the number of

these equivalent _rules, we

incorporated

these

rules

to

a general

rule. A rule represented

in

Prolog

is

shown

in

Figure 3.4. The

"adjust_netlist"

clause removes the NOR

inverter

G2 and NOR gate

G3 from netlist and connects the

input

variable A and B to NOR

gate Gl.

A

B

S3

A

c

o

nor_rule_l(X,Y)

:-node_(nor,Y,Z) ,

inverter

(nor, Y,Z)

,

gate_(nor,Z,Vars) ,

adjust netlist(nor,X,Y,Vars) .

(40)

The rules within rule

library

are ordered

by

their

desirability.

A meta-rule

in

the system

determines the

appropriate order of

rules

based

on how

many

times a rule

has

been

used

in

the

circuit. There are sixteen rules

in

the system now (see

Appendix

C)

,

half

for

NOR

implementation

half

for

NAND. The control

structure will use rules

from

the

first

rule until

there

is

no

applicable rule

in

the

knowledge base.

Although these rules are

specific

to

a given

technology,

optimization

for

different techn

ologies

involves

only

changing

the

library

containing

the rules.

3.4.2 Control Structure

A _{problem-solving} that uses forward

reasoning

and whose

operators each work

by

producting

a single new object a new

state

in

the database

is

said to represent problems

in

a state

space representation.24

The problem of

producing

a state that

satisfies a goal condition can now be formulated as the problem

of searching a graph to

find

a node whose associated state

description satisfies the goal. The graph, which grows as the

search proceeds, will be referred to as a search graph or search

tree.

Optimization of combinational logic through successive

transformations can be translated to the problem of optimally

traversing

a state space. The nodes of this graph are the

implementations of the circuit, and the arcs represent rule

applications. The root of the tree corresponds

to

the

current

(41)

implementation.

Optimization

is

equivalent

to

finding

a path

from

the

initial

circuit configuration to an optimal configuration.

The process of

finding

this path

is

refered to as a state space

search. The state space search

strategy

used

in

this system

is

presented

below.

The

strategy

uses

heuristic

information

to

decide which node to expand

next,

the

information

is

provided

by

a meta-rule.

select rule

for each rule R

in

some class C

for

each gate G

in

the

circuit

if

the target

for

R matches at gate G

then

apply

rule go

back

to select rule

The value of C controls what a given

technology

is

optimized. For

instance,

when _optimzing for NOR gate, the class of NOR rules are

used. The first

step

of the system selectes rules based on the

ordering of the knowledge

base,

always

applying

the first

applicable rule

in

the knowledge. The above search strategy

is

called a best-first search.

We use an example to explain how a state changes to

another, and how the state

describing

a target

configuration

matches the condition part of a

selected rule. The forward

chaining inference method will be

used to

implement

the matching

operation;

it

starts from the

available

information

and

try

to

infer

the conclusions that are appropriate

to

the goal. When the

(42)

search

function

is

called,

it is

_given _two _arguments: _a

rule,

and

a gate at which the search

function

should

begin.

Figure

3.5

shows a sample

netlist,

a portion of a

circuit,

and a rule to

be

applied

in

that

circuit.

In

searching,

the

controller

first

checks

if

gate Gl

is

a NOR gate. If

so,

then

the

controller

selects one

input

from the

input

variables of gate Gl. If P2

is

an

input

variable

for

gate Gl and connects to another NOR gate

G2,

then the controller checks

to

see

if

gate G2

is

an

inverter.

Since the G2

is

not an

inverter,

the

searching

activity

fails.

At

this point a condition cannot

be

met,

so the controller

back

tracks

by

returning

to the

last

selection

it

made and

making

a

different choice.

Backtracking

continues until the rule's

condition part has

been

satisfied,

or until all possible choices

have

been

rejected. As this example

illustrates,

the controller

goes

back

to Gl and takes another choice. P3

is

an another

input

variable

for

gate Gl and connects to a NOR gate G3. Since the

gate G3

is

an

inverter

between gates G4 and

Gl,

the rule

is

executed and a replacement

function

is

called to perform a

transformation

for

the circuit. The replacement function

is

contained

in

each transformation rule.

In order to make the control structure more

flexible,

a

meta-rule was implemented

in

the system. As noted above, the

meta-rule determines appropriate order of the transformation

rules.

By

adjusting these

rules,

control structure can be used

perhaps with a better result.

(43)

A sample netlist:

(nor,pl,p2,p3)

(nor,p2,p4,p5)

(nor,p3,p6)

(nor,p6,p7,p8)

A rule _selected

by

control structure:

nor_rule_l(X,Y)

:-node_(nor,Y,Z)

,

inverter(nor,Y,Z)

,

gate_(nor,Z,Vars)

adjust_netlist(nor,X,Y,Vars)

PI

P2

PL

PS

>

Gt-GZ

PC

Gl

P2

P3

Gl

PI

Nt'

PL.

P

O-i

PI

<*

PS'

P2

Gl

Pi

Figure 3.5 Logic optimization represented

in

state space

search, each circuit configuration

is

a state.

(44)

3.5 Program Organization

The minimization module

is

written

in

C,

and the other

modules are written

in

Prolog.

The reason

for

this

division

is

that

the

minimization module uses ESPRESSO-IIC to perform

reduction, whcih manipulates matrices

during

_{minimization,} and

are much easier

to

implement

in

a conventional

language.

The

other modules are

implemented

in

Prolog

based on

following

observations: In

Prolog

it

is

easy

to

represent the functional

behavior of _gates, all the transformations needed for the system

can be entered as rules and

implemented

in

Prolog,

and

Prolog

provides an efficient pattern-directed

inference

tool.

3. 6 Input and Output

The

input

of the system

is

in

sum of products function

set or

in

a truth table

including

input

and output variables. The

system output

is

in

form of a netlist which describes target

logic

type,

output variables and

input

variables. The

following

netlist

is

a sample output (Figure 3.6).

(45)

(nor

,

f

12,norl,nor2

)

(nor,norl,gate22,il)

(nor,gate22,/p,var2,/hl)

(nor, /hi, hi)

(nor,/p,p)

(nor,var2 ,gate39,

a)

(nor,gate39,l,/k,/j,/h)

(nor,/k,k)

(nor,/j,j)

(nor,/h,h)

(nor,nor2,nor21,gate47,

gate46)

(nor,gate47,h,var3)

(nor,var3,gate41,gate40)

(

nor,

gate41,i, varl)

(nor,varl,gate38,gate37

)

(nor,gate38,/k,/j)

(nor,gate37,k,

j)

(nor,gate4

0,l,/k,/i)

(nor,/i,i)

(nor,gate4

6,p,var2)

(nor,nor21,var2,

/hi

)

Figure 3.6 A sample output

(46)

4.

Conclusion

_and _{Future Work}

This thesis

has

described

the

development

of a system

that

is

capable of

automatically synthesizing

and

transforming

functional

specifications

into

gate

level

implementations.

The

system

is

implemented

as a rule-based system

because

it

works

without an established algorithm and

is

easy

to

modify

according

to

the

target

technology. The system progresses through

four

distinct

modules

during

the

design

process:

Minimization,

Decomposition,

Synthesis and

Optimization.

_For

larger

examples

above 100

gates,

the

system achieved area reductions

ranging

from

20% to 30%

from

unoptimized circuits, these results are

comparable to the result of manual optimization. The

flexibility

of the system

is

largely

due

to

its

separation

into

independently

useful modules. It

may be

use

for

translating

two-level

functions to a multilevel

implementation,

generating

a

circuit, or

optimizing

an

existing

circuit. In

any

of these

applications, the system saves valuable time and space.

Future work can

be

classified

into

two categories:

enhancements to the system to

improve

its

present performance;

and new approaches to logic

design.

1. An Additional feature to

improve

system performance

includes

building

a rule

entry

module to

help

users

easily

extend the

knowledge base. The rule

entry

module would _{automatically}

(47)

action

describing

how

to

replace

it

with

the

replacement

configuration.

2. New approaches

include

extension of

the

current

NAND/NOR

implementation

to other gates available

in

gate-array

or

standard cell

libraries,

and consideration of the fan-in and

(48)

Appendix

A. User's Manual

NAME

preopt

-generate

Prolog

accepted

form

from

truth table

SYNOPSIS

preopt

[file]

DESCRIPTION

preopt generates a

form

suitable

for

Prolog

from

a truth

table which

defines

a set of Boolean

functions.

If no output

file

is

_specified,

the

default

output file

"_opt

"

is

generated

(e.g.

test_opt)

. Since

Prolog

does

not accept a

period within

file

name,

all

the

file

name

including

a

period will

be

changed to

"_".

Input

to

preopt

is

in

the

form

of

truth

table,

that

is

the

output

from

Espresso-IIC program. Comments are allowed within

the

input

by

placing

a pound sign

(#)

as the

first

character

on a

line.

Comments and unrecognized

keywords

are passed

directly

from

the

input

file

to output

file.

Any

white-space

(blanks,

tab,

etc.)

is

ignored.

Output

is

used as

input

for loqopt (see Figure A.l). The example

in

appendix B will show

how the program progresses.

The

following

keywords are reserved

for

the system

use,

they

should not appear

in

the

input

file

for

input

variable or

any

other variable.

key

words: gate(n), var(n),

inv,

in,

out,

nor(n), nand(n), norgate(n), nandgate(n).

the

(n)

means

1,

2,

3 n. (e.g. varl,

var2,

var3 varlO)

logopt

-combinational

logic

optimization

SYNOPSIS

logopt

[option]

[type]

[file]

DESCRIPTION

logopt takes as

input

a

two-level

Boolean functions

optimized

functional

equivalent netlist. The system consists

of

four

distinct

modules:

minimization,

decomposition,

synthesis and optimization. The optimization module

is

implemented

by

a rule-based system.

loqopt reads the file

provided,

performs the optimization for

logic

area, and writes an optimized netlist to a default

output file "_out" _(e.g.

test_out)

if

no output file

is

specified. The system generates a command file "_com"

(e.g.

test_com)

for

Prolog

_programming,

it

can be deleted

after running.

"type"

specifies the target

technology

for the system. The

allowed types are -nand for NAND gate, and -nor for NOR gate.

The output netlist

only

includes

NOR gate

if

-nor type

is

specified.

"option" _specifies _boolean _function decomposition which

creates multilevel function

by

identifying

common sub expression

from

a two-level function. The allowed options

are -f

for

" fast

decomposition",

and -o for "optimal

decomposition". Although "fast decomposition" saves a lot of

(50)

1

Boolean

Function

j

1

|

Eqntott

sir

1

Truth Table

|

1

|

Espresso-IIC (Minimiz

|

Truth Table

|

After Minimization

|

1

|

Preopt

|

Prolog

|

Accepted Form

j

1

|

Logopt

(Optimization)

1

|

Netlist

|

(51)

Appendix

B. Example

Example 1

/* input data (sample, bol) */

r0 = _b&c&d&e&h& _{! i} _&

j

& k & ! 1 ₅ f 0 = _b&c&d&t&h& _{! i} _&

j

& k & ! 1 ;

1-0 = _d&e&h&i&.j& _k _& _{! 1} _& _o

; f 0 = _d _& _f _& _h _& _i _&

j

& k & ! 1 & o _;

f0 =!d&e&h&i&j& k & ! 1 & r.i & n ; 1 W = _{! d} _& _f & _h _& _i _&

j

& k & ! 1 & m & n _;

|-0 = _a _& _b _& _c _&