ESTIMATION OF THE MATING SYSTEM IN OPEN-POLLINATED MAIZE POPULATIONS USING ISOZYME POLYMORPHISMS

(1)

ESTIMATION OF

THE

MATING SYSTEM I N OPEN-POLLINATED MAIZE POPULATIONS USING ISOZYME POLYMORPHISMS1

A. H. D. BROWN2 AND R. W. ALLARD

Department of Agronomy and Range Science and Department of Genetics, University of California, Dauis, Califorma

Received January 12, 1970

mating system by which a given population reproduces plays a n im- T g t a n t role in determining its genetic structure.

In

maize, random mating has usually been assumed. However, GUTTIEREZ and SPRAGUE (1959), in a shtdy of polycross plantings of ten stocks which were homozygous for recessive markers, found marked differences in the effectiveness of the stocks as male and female parents. They showed that these differences related to attributes of the mating system such as time of silking and time of pollen shedding. The investigation reported here was undertaken to determine whether such marked deviations from random mating occur in advanced generation open-pollinated populations of maize.

Isozyme variants were adopted as markers in this investigation because of their many advantages in studying such populational processes as the mating system. However, isozymes are frequently expressed only at some specific stage in the life cycle and then only in materials grown under carefully controlled conditions. Further, sampling an individual for enzymatic assay may cause damage that upsets the growth pattern and ultimately affects mating behavior; this applies particularly when isozymes are expressed only in seedlings, as is often the case. These difficulties can be circumvented by collecting seeds from mature plants in the field, and inferring the genotype of the maternal parent from progeny tests of seedlings grown under controlled conditions. In this situation an appropriate statistical method for estimating the relevant mating system parameters is required. Such a n estimation procedure is described and discussed in the first part of this paper. Following this the results obtained when the method was applied to isozyme data from two open-pollinated populations of

maize are presented.

ESTIMATION PROCEDURE

Description of the Diallelic Case

Consider a population polymorphic for a codominant diallelic locus (alleles A, and A,) in which maternal genotypic frequencies of A,A1, AIA,, and A,A, at mating are D,

W ,

and R. Let p equal the gene frequency of the A, allele in the pollen pool. Let s denote the proportion of ovules which are self-fertilized and

This work was supported in part by grants from the National Science Foundatmn (GB-6866) and the Nabonal Insbtutes of Health (GM-10476)

2 Present address Department of Biology, Univeraty of York, Heslington, York, England. YOIBDD.

(2)

134 A. H. D. BROWN A N D R. W. ALLARD

TABLE 1

Classification of half-sib fam'ly arrays, their possible maternal parents and the expected proportion of such families in the diallelic case

I 0 k 0 A,A,,AlA20rA2A2

1I:l 1 k-I 0 A,A, or A,A,

: 1 i k-i 0

: k k O O

1II:l 0 k-1 1 A, A, or A, A,

: i 0 k-i i

:k 0 0 k

IV: 1 or 1 or A A

more more

K i k ! / i ! ( k - l ) !

t (=l-s) denote the proportion of ovules which are fertilized as a result of random outcrossing. Random samples, each of size

( k

>

1 )

,

of progeny from a

random sample of 12 female parents are assayed and the total ( n k ) of genotypes

is recorded.

The assumptions made are: ( 1 ) that excess of self-fertilization is the only factor which may cause the actual mating system to deviate from random mating; (2) that t is uniform for maternal genotypes and pollen alleles (i.e., there is no incompatibility and mating events are independent) and ( 3 ) that pollen frequency is uniform over maternal genotypes.

A sufficient partition of the data, with respect to the estimable parameters,

D,

H , p , and t, is obtained when the progenies of all families are grouped accord- ing to genotypes present within each family. Table 1 lists four groups. Families classified in Group I are those in which all

k

individuals are genotypically A,A2; the genotype of their maternal parent thus could have been A,A,, AlA2 or A2A2. Group I1 families are those with at least one A,A, and no A2A2 offspring; their possible maternal parents were therefore either A,Al or A,A,. Group I11 is symmetrically opposite to Group 11. Group IV contains families which include at least one offspring of both AIA, and A,A2. Hence Group IV is the only one in which the female parent (A1A2) is definitely known from the progeny array. Within Groups I1 and 111, there are

k

possible subgroups depending on the number of homozygotes observed. For Group I, 11, and I11 families, the family subgroup classification also specifies the total number of observed progeny of each genotypz For Group IV families however, the total observed number of A,A,, A,A2 progeny must be recorded. There are 1 (from Group 1 )

+

k

(Group 11)

+

k

(Group 111)

+

2 (Group IV) = 2k -t 3 ( k > 2 ) total degrees of freedom.

(3)

M A T I N G SYSTEM A N D ISOZYMES I N MAIZE 135 type within each subgroup, the expected proportions of families in the 2k

+

2 subgroups is first calculated. Table 1 gives these expectations in terms of the

~ i i ' s which are the single progeny conditional probabilities, i.e., the probability that an offspring is of genotype

i,

given that the female parent is of genotype i ;

i,i

= 1,2,3 for AIAl, A,A2, and A2A2. The values of ~ iappear in Table j 2. For

Groups I, I1 and 111, the expected number of individuals of a given genotype within a subgroup equals the product of the expected number of families in the subgroup (i.e. the values in the last column of Table 1 multiplied by n ) times the frequency with which the particular genotype occurs in the subgroup. For Group IV, the expected values are obtained by multiplying n times the conditional frequencies, given that at least one of each of the homozygotes occurs. The numbers of progeny of type A,Al, A,A,, and A,A,, respectively are:

EIA,AII = nkHrzl [ 1 - ( ~ ~ ~ + ~ ~ ~ ) ~ ~ l

E [AIA,] = nkHTz2 [ 1 - ( x ~ ~ + T ~ ~ )

E [A,A,] = nkHrZ3 [ 1

-

( ~ 2 2 + i ~ ~ 3 ) L1].

- ( ~ 2 2 + ~ 2 3 ) L1

4-

7rZzb1]

The model and its expectations are now completely specified, and in theory, the likelihood equation for a given set of data can be formulated and the maximum likelihood estimators for the four parameters, i.e.,

D,

A,

$ and 2, can be derived. However, since general literal solutions for these estimators do not exist, numerical solutions are required. The situation is even more complex when there are more than two alleles per marker locus because female parent genotypes are less easily delineated from the progeny arrays.

Two-step Likelihood Scoring Method

While it is possible to formulate a single-step maximum likelihood estimation process based on a probability model of the type displayed in Table 1, this becomes very cumbersome with three or more alleles and moderate

k.

This diffi- culty can be largely overcome by breaking the estimation process into two steps, linked by induction; consequently such a procedure was adopted. Since no more than three alleles can be distinguished at any of the loci studied here, the description of the two-step method will be limited to the three allele case. It can, however, easily be expanded to encompass more alleles. Considering the results for each marker locus as a separate experiment, let Z,, z=l,

...

n, denote the n discrete

TABLE 2

Single progeny conditional probabilities for the diallelic case

4 ' 4 2

3

' 4 1 4 e Female ' 4 1 4

(4)

136 A . H. D. B R O W N A N D R. W. ALLARD

random variable vectors, with elements zxj, where zzj is the observed number of

progeny of the jth single-locus genotype (j=1,2 .... 6) for the zth parent. The population parameters of the statistical model can be denoted as: t = outcrossing rate.

P

= an n-dimensional vector whose elements are discrete entries (the actual

q = a 3-dimensional vector whose elements ( p , q, and r) are the pollen gene The conditional probabilities ( n i i ) of obtaining a single progeny with specific genotype, given the material genotype, are shown in Table 3.

The joint likelihood function, when the individual family densities are independent, is

n.

The function f is of the form

unknown genotypes of the n random female parents). frequencies.

uzx;

t, P, q ) = f(Z,

I

t, P, 9 . ) : x=l,n

where ( ~ i j ) ~ = ~ = 1 for all j when aij = z,i = 0, and where Gi is the frequency of

ith genotype in the maternal population. Thus, for each family this function can take six possible discrete values, one for each value of

i.

The likelihood function L cannot be maximized by the usual differentiation procedure because f is nondifferentiable due to the discrete nature of

P.

Conse- quently a computer program was written which calculates, from the data 2, of the xth family and for given values of t and q, the values assumed by f for each of the six possible female parents

(i).

To do this, it is necessary to assume that the Gi can be evaluated in terms of t and q (i.e., the population is near equilibrium)

,

so that

G , = p 2

4-

pqF

4-

prF G , = q2

+

pqF

+

qrF G, = 1-2

4-

prF

+

qrF

G, =2pq (1-F) G , = 2pr (1-F) G5=.2qr (I-F),

where F = (1-t)/(l+t). Only when

k

is small (k<6) and t large, does this evaluation seriously affect the ranking of the likelihood scores for a few of the

TABLE 3

Single progeny conditional probabilities for the triallelic case

Progeny

Female parent AIA, Ai4 4'42 44 _{A A} _'43'4,

A A 1-t(1-p) qt rt 0 0 0

A,-% (l-t+2pt)/4 ( I - r t ) / 2 rt/2 ( 1 -t+2qt) / 4 rt/2 0

AZA, 0 Pt 0 I-t(1-q) rt 0

A I 4 ( 1 -t+2pt) / 4 qt/2 (I--qt)/2 0 qt/2 (I-t+2rt)/4

A A 0 Pt/2 Pt/2 (I-t+2gt)/4 ( I - p t ) / 2 (1-t+2rt)/4

(5)

M A T I N G S Y S T E M A N D ISOZYMES IN M A I Z E 137

family types. The function g is then set equal to the maximum (over

i )

of these joint family probabilities which, omitting the combinatorial coefficient, are:

g ( Z ; t, 4 ) = ~ y p [GiY (xij)'z~I.

This value of

i is stored as the maximum likelihood estimate of the maternal

genotype of the sth family, given t and q. Next an overall likelihood score for the values of t and q assumed is obtained as,

n

x=1

l O g L = x l O g g ( z z ; t , q ) .

Repetition of this process using various values of t and q generates a surface and the estimates of t and q ( 2 and $, respectively) are those values at which a maximum occurs in the surface. For each point there is an appropriate estimate of P ;

the maximum likelihood estimate of P is the vector stored when t=t^ and q=$.

In practice, if a large number ( n k ) of seeds from a large number of families ( n )

are assayed, then

4

can be taken as the gene frequencies among the seeds, and the calculation can first be made in one dimension (i.e., by varying t )

.

This simplification assumes that the gene frequencies are the same for ovules and pollen. The maximum point can be estimated graphically; consequently the final estimate need not be one of the specific iteration values. A subsequent check of the effect of variation in q values on 2 can be made. The total value of log L still remains the overall criterion for the joint estimation of t and q.

Further simplification is necessary to obtain an estimate of the variance of 2. The approach adopted here was to derive a minimum bound for this variance by reducing the model to one of single parameter estimation. This assumes that P and q are known. If both n and

k

are large this may be a reasonable approxi- mation. Thus, following MATHER (1957), the information per plant is given by

1

Ge(1-r) + (G2+G3) ( 1 - - 2 ~ ) '

4 l-t+2pt

+--

_t_[_1-t( _1-r)_]

-

- G ( 1 - p ) + G,(l--q)

t [ 1-t ( 1 -p) ]

+--

(GZ+G5) ₄ (1-2q)Z t [ 1 -t (1-q) ]

2t 1-qt 22 1-pt

[

l-t+2qt

G3q 2-qt G5p 2-pt

+-

[-]

+

-1.

[--I

where the Gi are obtained from the estimated P vector. A minimum bound for the variance of the estimator is var ( t ) = (l/nki,)

.

Discussion of the Mixed Mating System Model and the Estimation Procedure

Many assumptions are involved in the mixed mating model. For example, factors which promote either positive and/or negative assortative mating may be involved. Also mating can vary from genotype to genotype, from environment to environment ( JAIN and ALLARD 1960; ALLARD and WORKMAN 1963; HARDING and TUCKER 1964), and with genotypic frequencies ( RICCELLI-MATTEI 1968).

Further, the pollen pool itself may not be constant (HARDING and TUCKER

(6)

138 A. H. D. BROWN A N D R. W. ALLARD

in practice the simplest tenable operating procedure is to estimate a single parameter t=l-s, which summarizes a possibly complex set of biological phe- nomena in terms of an “effective” amount of random outcrossing us. self-fertilization. At equilibrium the effect of any mating system can be summarized by WRIGHT’S equilibrium law with one parameter, the fixation index (WRIGHT 1965) F (l-H)/2pq, because F = (1-t)/( l + t ) . I t should be noted that estimates of t provided by codominant isozyme markers may be less biased, due to cancelling effects associated with the larger number of genotypes sampled, than estimates based on recessive homozygotes.

FYFE and BAILEY (1951) give the theory of estimation of the mixed mating system using recessive markers, with a careful discussion of assumptions. They use the transformation f = ( 1 - t ) / ( I S t ) and formulate the joint maximum likelihood estimators and their variances for f and p (the gene frequencies). VASEK

(1965,1967) has developed a method which does not make use of the maximum likelihood principle. He estimates the two parameters by the six possible painvise solutions of four equations, and a final estimate is derived by averaging the solutions. Standard errors for these estimators are not formulated, and the method is open to criticism on the grounds of statistical inefficiency.

The point of departure of the method formulated above from that of FYFE and BAILEY relates to practical experimental efficiency in the joint estimation of t and P.

FYFE

and BAILEY’S method assumes “that the progenies grown are large enough to avoid misclassification of heterozygous parents”, whereas in the present case all the parental genotypes P are unknown and must be determined by progeny test. The question therefore arises as to how large progeny sizes

( k )

should be.

“Avoidance of misclassification” needs further definition. The only type of progeny array which uniquely defines the maternal genotype is the one which contains two different homozygotes (Group IV). Thus only a proportion of heterozygous parents can be identified unambiguously. Further, homozygous parentage cannot be “proved” with finite

k.

The parentage of a n array with only one or no homozygous types thus can be stated only in probabilistic terms. ( A sequential decision making process is impractical with as many as nine genotypic classifications per individual because fully heterozygous mothers will be very rare.)

The first step in maternal classification is essentially a likelihood ratio test. The question arises: what is the effect of family size on the accuracy of this test under various conditions? To answer this question it is necessary to calculate the total probability of correct identification, X , in a given population for various

k.

This total probability is the weighted average of the probabilities of correct identification of each genotype. In the diallelic case

X = P(AiA1) P(AiAi/AiAi)

4-

P(AiAz) P(AiAz(A1Az)

+

P(AzA2) . P(AzAzIAzAz),

(7)

M A T I N G S Y S T E M A N D ISOZYMES IN M A I Z E 139

probability of identifying the parent of a n array as A,A,, given that the parent is in fact A,A,. These conditional probabilities cannot be formulated algebraically. However, numerical results for various values of outcrossing ( t ) and gene frequency ( p ) of A, in the pollen were obtained on the computer. This was done by calculating the rij’s (Table 2) and Gi values for each case. Then, for each sub-

group (I to IV) of Table 1, the largest value of Gi II T z j was taken as the correct

identification. For example, considering Group 11, either

G 1 ~ l l ~ l z b - l

>

Gzrzlr22k-1

or G1~11r12k-1

<

G2~21r2216-1. In the former case, all families falling into this subgroup are considered to have maternal parent A,A,. Therefore klrllrlzkl of A,A, parents would be correctly identified but k1TZ1r2Zk-1 of A,A, parents would be incorrectly identified (as A,A,)

.

Summing the correct identifications over all subgroups gives the conditional probability for that genotype.

A sample of these results is shown in Table 4. Two values of t are considered, one representing virtually complete outbreeding and the other heavy inbreeding, each at three gene frequencies. In the six cases, the first three columns give the conditional probabilities of correct identification of each of the three genotypes. The fourth column gives the value of

X

where, for the outbreeders,

D,

H ,

and R

are assumed to be the Hardy-Weinberg proportions and for the inbreeders,

D,

H , and

R

are assumed to be in

WRIGHT’S

equilibrium under inbreeding ( F =

( 1 - t ) / ( l + t ) = .818) for the value of p specified. The expected overall trends are apparent, in that: ( 1 ) the probability of correct identification increases with increasing k ; and ( 2 ) the probability of correct identification increases with decreasing t.

Table 4 shows that progeny sizes of 9-10 give “reasonably good” probabilities of correct identification and that for k

>

10,

X

improves slowly. Misclassification of heterozygotes would tend to inflate the estimate of t , whereas misclassification of homozygotes would tend to depress the estimate. With k 2 9 these biases are low. Progeny sizes of 9-10 were therefore chosen for the present experiment. This was also convenient experimentally because each gel accommodates 18-20

samples, i.e., the progeny of two individuals.

For a fixed number of electrophoretic assays ( n k ) a conflict arises in the allocation of resources because t and the parental genotypic proportions must be estimated cojointly. The efficiency of estimation of t , measured as information per plant, increases as

k

increases because larger families give more accurate female parent identification. However under fixed resources n must fall as

k

increases, and the efficiency of estimation of the parental genotypic proportions will decline because fewer parents ( n ) are sampled. Table 4 indicates that 4 5

k

5 6, allowing larger n, may provide for optimum allocation of resources if there is reason to believe that t is small.

I

A P P L I C A T I O N TO TWO O P E N - P O L L I N A T E D P O P U L A T I O N S O F M A I Z E

Materials and Methods

(8)

TABLE 4 Conditional and total probabilities of correct parental classification likelihood ratio test in some diallelic cases for various progeny sizes (k)

p

-

0.10 AIAl A& A2% Total k

-

(9

2 0000 0351 .990 .865 4

.ooo

.780 .949 .go9 6 1.000 ,758 ,985 .944 9 lo000 0870 0992 -970 12 1.000 .923 .996 .977 .OOO .529 .912 0769 e600 e919 1.000 ,745 .932 1.000 0854 e976 1.000 .go9 .955 1,000 .911 ,991

Total (XI ,669

772 860 _.927 ₀₉₄₀

.958

AIAl 0755 0695 .895 .982 0990 0997

p

-

0.50 A2A2 -375 .750 .742 .695 .769 .895 -871 .982 0899 0990 0940 0997

Total

(5)

.

566

.

718

.832 0927 0945 0969

INBREEDERS ($=O .lo) 2 -992 0374 1.000 .979 .995 .375 .999 .950 -998 .375 .998 .941

4

0997 ,849 0999 -994 0999 0843 e995 0984 e986 0930 0986

0981 0995

6 0988 0963 1.000 e998 ,994 0964 -999 ,995 e998 0964 0998 9 0999 0992 1.000 1.000 1.000 .987 1.000 0999 0999 0994 .999 0999 10 .999 .996 1.000 1.000 1.000 .995 1.000 0999 1.000 .991 1.000 0999 12 1.000 .999 1.000 1.000 1.000 -998 1.000 1.000 1.000 .999 1.000 1.000

(9)

MATING SYSTEM A N D ISOZYMES I N MAIZE 141

(RRS)

program. We are indebted to Dr. D.

G.

SMELTZER for providing us with stocks of these populations. In 1968 plantings were made of the generations designated B.2.1 and A.2.2 which represent, respectively, the first and second generations of open-pollinated propagation (relaxed selection) after two cycles of

RRS.

Two generations of open pollination followed the synthesis of the two pools and preceded the first cycle of RRS. At maturity random ears were har- vested from these populations and the kernels from these ears were germinated at 25°C in a growth chamber to produce seedlings representing generations B.2.2 and A.2.3. Electrophoretic assays were based on crude extract of a composite sample of tissue from five-day old seedlings.

Formal genetic studies (BROWN and ALLARD 1969a,b) of the six inbred parents of each pool, and certain F,, F,, and testcross progenies, have established that these twelve inbreds differ with respect to nine codominant isozyme loci. On the basis of parentage Pool A is expected to be polymorphic for all nine loci with two alleles per locus, whereas Pool

B

is expected to be triallelic for three loci (E4,

Px

and Phos,), diallelic for five loci and monomorphic for one

( E I S P ) .

HOW- ever, following Round I of RRS, Pool B was also polymorphic for the E,, locus, indicating chance migration of one (or more) gametes carrying the E,,’ allele into that pool.

For Pool B nine seedlings from each of 60 open-pollinated ears were assayed for seven loci (El, E,, E4, E,,, E,,, Adh, and

Px)

using the starch-gel electrophoretic methods described previously

(BROWN

and ALLARD 1969a,b). Hori- zontal slicing enabled assays for butyrate esterase, acetate esterase, and alcohol dehydrogenase in three separate anodal slices. No phosphatase assay was per- formed on these samples. For Pool A, ten seedlings from 50 Open-pollinated ears were assayed for a11 nine loci using a method of lateral slicing described by

BROWN

and ALLARD (1969b).

Experimental Results and Discussion

For each of the 16 data sets (9 loci in Population A.2.3 and 7 loci in P o p - lation B.2.2), the values of [- log L ] were calculated with q equal to the gene frequencies in the total ( n k ) progeny, and t = 0.75, 0.80, 0.85

.

. .

1.10. Values for a sample of four of the data sets in Table 5 show that the maximized likeli- hoodoccurredwhen.90<2< .95,.95 < 2 < 1.00,1.00<2< 1.05and.90<2< .95, respectively. In theory the variance of this estimator of q should be biased slightly upwards because no account was taken of the multiple sampling of a fmite number ( n ) of female parents. Further, this estimator is not fully efficient since it does not use information about q obtained from the maternal genotypes. However these objections appear to be minor because there was little improvement in the values of L when the q vectors were modified. The likelihood scores for each locus were plotted and a final estimate 2 was derived as the point at which the curve showed a minimum.

(10)

142 . . A. H. D. BROWN A N D R . W. ALLARD TABLE 5

Values of [- log L] in four cases

A.2.3. B.2.2

Value of t E , Adh

0.85 0.90 0.95 1 .oo 1.05 1.10

209.27 184.73

209.07 184.13

209.14 183.81

209.35 183.82

209.52 184.10

209.99 184.92

358.92 40.59

357.27 40.5 1

356.20 40.51

355.72 40.65

355.87 42.59

356.69 00

not differ from t = 1. The relative magnitude of the approximate standard errors for different loci parallels the rate of change in the values of [- log L ] in Table 5 .

The two triallelic systems ( E 4 , P,) gave the lowest standard errors.

The next column shows the 3’ values computed from data from the homozygous parents only, ignoring information from heterozygous parents. These values, simulating those which would be obtained if the markers had been recessive for each allele in turn, are more erratic but parallel the total likelihood estimate. The codominance of isozyme markers thus considerably improves the efficiency of the estimation over that for recessive markers because all families, not just those derived from recessive homozygotes, contribute information about t .

Thk first three terms of the information formula (ip) above, express the information about t arising from all the homozygous females. Thus the sum of the first three terms as a percentage of the total ip (given in Tables 6 and 7) is a

TABLE 6

Mating system estimates for generation A.2.3.

Minimum ?Estimated Efficiency Allele’ t* Calculated

for S.E. Homozyqotes Homozygous I n Progeny I n Progeny Bound from of Frequencies

%2ia

-

from

2

*

Estimation 5 2

s

(a

_-_

__

_ _ _ _ I

_-

-.-- ---

--

---.- I

-.90 .06 .84 .94 97 .34 .66

-

.01 .97

E .98 .09

-

.97 34 .10

-

.90 .03 .95

.92 .07 .81 .96 99 .45

-

- 5 5 .01 .99

-81 1.20 88 .26 .74

-

-.04 1.09

E1

-!

-“4_

%

*07

,El6 1.01 .07 .89 1.11 100 .48 .52

-

-.01 1.02

p”

-

&h .97 -06 1 - 0 1 .96 97 .33 .67

-

-.02 1.03

1.08 .07 1.08 1.00 67 .17 - 8 3 - -.09 1.19

P&s4 .97 .07 .99 .91 90 . 7 1

-

.29 .01 .98

Phos8

-

1.12 .07 1.30 1.00 75 .80 .20

-

-.03 1.06

-

Average estimate: t = 1.00 Average t * : 1.03

(11)

M A T I N G SYSTEM A N D ISOZYMES I N MAIZE

TABLE 7

Mating system estimates for generation B.2.2.

143

Minimm ;)'Estimated Efficiency Allele' Flxatio t* c Bound fropl of Frequencies Index

6 -

from

-

ES timat ion

x

t for S.E. Homozygotes Homozygous S N ,P In Progeny In Progeny

.93 .07 .95

-

.91 95 -64

-

-36 8 7 87

.94 .os .96

-

.90 75 .19

-

.81 -06 .89

1.02 .04 1.07 .78 1.17 65 .51 .26 .24 .oo 1.00

-

1.20

-

27 .02 .98

-

-05 .90

E++ e96 -06

-

.94 .96 95 -34 .66

-

.10 -82 1.10 .06 1.12 1.09

-

99 .43 .57

-

.02 -96

3

53

Jg

212 O g 3 .14

2 .93 .05 -92 .92 .98 53 .15 .64 .21 .05 .91

Average estimate: = 0.97 Average t*: 0.91

S, N , and F refer to slow, normal, and fast migrating alleles, respectively.

measure of the efficiency when only the data from all the homozygous parents encountered in the experiment are included in estimating t. In all the diallelic cases, there is a close relation between this value and the gene frequency. With Q = .02 ( E l s in B.2.2) homozygous estimation is only 27% efficient, but with Q =

.35

( E , ) , it is virtually fully efficient. However, the efficiency of homozygous estimation is markedly lower in both triallelic cases ( E 4 , P,)

.

Thus the cases in which heterozygous females contribute substantial information about t are those involving multiallelism, or disparate gene frequencies, with moderate to high values of t.

For comparison, fixation indices calculated as

E

= 1 - A/2p^Q are also given, as well as the t* statistic of ALLARD and WORKMAN (1963) which is calculated here using the transformation which assumes equilibrium under inbreeding. In Population A.2.3,

E

is slightly larger than 0 for

4

loci and slightly smaller than 0 for 5 loci. Similarly t* is slightly larger than 2 for 5 loci and slightly smaller or equal to

2

for 4 loci. Hence in Population A.2.3 the relationship between gene and genotypic frequencies appears to follow the Hardy-Weinberg rule. However, in Population B.2.2,

P

>

0 and t*

<

2

for 6 of the 7 loci, suggesting that there are fewer heterozygotes in the population than expected on the basis of the mating system (2 = .97, not significantly different from 1 )

.

If this deficiency of heterozygotes is real, the present data are not adequate to ascertain whether it is due to heterozygote disadvantage, inbreeding due to restricted population size, o r other causes.

CONCLUSIONS

(12)

144 A. H. D. BROWN A N D R. W. ALLARD

earlier generations. The 12 inbred parents of the two populations differ in various attributes of mating (e.g., onset of flowering), and in early generations, before gene blocks were broken, association between these attributes and the isozyme markers might have led to deviations from random mating. The difference between the present results and those of GUTIERREZ and SPRAGUE (1959) may result from the breakdown of such associations in the absence of selection. This view is consistent with the results of PATERNIANI (1969) which indicate that the degree of assortative mating in maize can be increased by selection and that this may be associated with changes in particular attributes of the reproductive system. Thus when PATERNIANI selected against intercrossing between two open- pollinated populations of maize he observed both a rapid increase in intrapopu- lational isolation and a correlated divergence in the onset of flowering of the two populations.

We wish to thank Drs. B. S. WEIR and JULIAN P. ADAMS for significant contributions made during the preparation of the manuscript, and the reviewers for their helpful suggestions.

SUMMARY

A two-step likelihood method was developed for estimating outcrossing rates from the genotypic arrays of half-sib families whose maternal parent is of unknown genotype. Application of this method to isozyme data in two open-pollinated populations of maize yielded estimates of outcrossing equivalent to random mating. The difference between this result and earlier results indicating non- random mating in maize is discussed.

LITERATURE CITED

ALLARD, R.

w.

and P. L. WORKMAN, 1963 Population studies in predominantly self-pollinated species. IV. Seasonal fluctuations in estimated values of genetic parameters in lima bean populations. Evolution 17: 4 7 M 8 0 .

Inheritance of isozyme differences among the inbred parents of a reciprocal recurrent selection population maize. Crop Sci. 9: 72-74.

-,

1969b Further isozyme differences 'among the inbred parents of a reciprocal recurrent selection population of maize. Crop. Sci. 9: 643-644.

FYFE, J. L. and N. T. J. BAILEY, 1951 Plant breeding studies in leguminous forage crops. I.

Natural cross-breeding in winter beans. J. Agric. Sci. 41 : 371-378.

GUTTIEREZ, M. G. and G. F. SPRAGUE. 1959 Randomness of mating in isolated polycross plantings of maize. Genetics 44: 1075-1082.

HARDING, J. and C. L. TUCKER, 1964 Quantitative studies on mating systems. I. Evidence for the nonrandomness of outcrossing in P h e o l u s lunatus. Heredity 19: 369-381. -,

1969 Quantitative studies on mating systems. 111. Methods for the estimation of male gametophytic selective values and differential outcrossing rates. Evolution 23 : 85-95.

Population studies in predominantly self-pollinated species. I. Evidence for heterozygote advantage in a closed population of barley. Proc. Natl. Acad. Sci.

U. S. 46: 1371-1377.

The Measurement of Linkage in Heredity. Methuen, London. BROWN, A. H. D. and R. W. ALLARD, 1969a

JAIN, S. K. and R. W. ALLARD, 1960

(13)

M A T I N G S Y S T E M A N D ISOZYMES IN M A I Z E 145

Selection for reproductive isolation between t w o populations of maize, Zea mays L. Evolution 23: 534447.

Estimation of genetic parameters related to the mating system in populations of sorghum, Sorghum bicolor (L.) Moench. Ph.D. Thesis, Library of the Uni-

versity of California, Davis.

Outcrossing in natural populations. 11. Clarkia Unguiculata. Evolution 19: 152-156.

-,

1967 Outcrossing in natural populations. 111. The Deer Creek population of Clarkia exilis. Evolution 21: 241-248.

The interpretation of population structure by F-statistics with special regard

PATERNIANI, E., 1969

RICCELLI-MATTEI, M., 1968

VASEK, F. C., 1965

WRIGHT, S., 1965