• No results found

A High-Density Genetic Recombination Map of Sequence-Tagged Sites for Sorghum, as a Framework for Comparative Structural and Evolutionary Genomics of Tropical Grains and Grasses

N/A
N/A
Protected

Academic year: 2020

Share "A High-Density Genetic Recombination Map of Sequence-Tagged Sites for Sorghum, as a Framework for Comparative Structural and Evolutionary Genomics of Tropical Grains and Grasses"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

Copyright2003 by the Genetics Society of America

A High-Density Genetic Recombination Map of Sequence-Tagged Sites for

Sorghum, as a Framework for Comparative Structural and Evolutionary

Genomics of Tropical Grains and Grasses

John E. Bowers,* Colette Abbey,

Sharon Anderson,

Charlene Chang,

Xavier Draye,

Alison H. Hoppe,

Russell Jessup,

Cornelia Lemke,* Jennifer Lennington,

Zhikang Li,

Yann-rong Lin,

Sin-chieh Liu,

Lijun Luo,

Barry S. Marler,*

Reiguang Ming,

Sharon E. Mitchell,

Dou Qiang,

Kim Reischmann,

Stefan R. Schulze,* D. Neil Skinner,* Yue-wen Wang,

Stephen Kresovich,

Keith F. Schertz

and Andrew H. Paterson*

,†,1

*Plant Genome Mapping Laboratory, University of Georgia, Athens, Georgia 30602,Plant Genome Mapping Laboratory,

Department of Soil and Crop Science, Texas A&M University, College Station, Texas 77843 and

Institute for Genomic Diversity, Cornell University, Ithaca, New York 14850

Manuscript received February 5, 2003 Accepted for publication May 5, 2003

ABSTRACT

We report a genetic recombination map for Sorghum of 2512 loci spaced at average 0.4 cM (ⵑ300 kb) intervals based on 2050 RFLP probes, including 865 heterologous probes that foster comparative genomics of Saccharum (sugarcane), Zea (maize), Oryza (rice), Pennisetum (millet, buffelgrass), the Triticeae (wheat, barley, oat, rye), and Arabidopsis. Mapped loci identify 61.5% of the recombination events in this progeny set and reveal strong positive crossover interference acting across intervals ofⱕ50 cM. Significant variations in DNA marker density are related to possible centromeric regions and to probable chromosome structural rearrangements between Sorghum bicolor and S. propinquum, but not to variation in levels of intraspecific allelic richness. While cDNA and genomic clones are similarly distributed across the genome, SSR-containing clones show different abundance patterns. Rapidly evolving hypomethylated DNA may contribute to intraspecific genomic differentiation. Nonrandom distribution patterns of multiple loci detected by 357 probes suggest ancient chromosomal duplication followed by extensive rearrangement and gene loss. Exemplifying the value of these data for comparative genomics, we support and extend prior findings regarding maize-sorghum synteny—in particular, 45% of comparative loci fall outside the inferred colinear/syntenic regions, suggesting that many small rearrangements have occurred since maize-sorghum divergence. These genetically anchored sequence-tagged sites will foster many structural, func-tional and evolutionary genomic studies in major food, feed, and biomass crops.

A

S a model for the large genomes of many tropical cally important crops, may have shared a common an-cestor as recently as 5 million years ago (Sobral et al. grasses, sorghum [Sorghum bicolor L. Moench.; 748–

1994), retain similar gene order (Ming et al. 1998), 772 million base pairs (Mbp); Arumuganathan and

and even produce viable progeny in some intergeneric

Earle 1991] is a logical complement to Oryza (rice;

crosses (Dewet et al.1976; P. L. Morrell and A. H.

ⵑ420 Mbp;ArumuganathanandEarle1991), a

dis-Paterson, personal communication). By contrast, rice tant relative (tribe Oryzeae) that will be the first grass

and the maize/sorghum lineage may have divergedⵑ50 genome to be completely sequenced (Goffet al.2002;

million years ago (Linder1987) and show much more

Yu et al. 2002). Sorghum is an especially important

chromosomal rearrangement (Patersonet al.1995a). bridge to several economically important large-genome

Analysis of the levels and patterns of genomic diversity crops in its own tribe (Andropogoneae) such as maize

within and between sorghum, sugarcane, rice, and (ⵑ2292–2716 Mbp) with which it may have shared

com-maize (and others) promises to advance understanding mon ancestry between 11 (Gautand Doebley 1997)

of the biology and evolution of Poaceae grain and bio-and 24 (Thomasson1987) million years ago. Sorghum

mass crops and reveal new opportunities for their im-and sugarcane, a large-genome (ⵑ2547–3605 Mbp)

provement. polyploid that ranks among the world’s most

economi-Worldwide, sorghum is the fifth most important grain crop grown based on tonnage, after maize, wheat, rice, and barley (http://www.fao.org). Sorghum is unusually

1Corresponding author:Plant Genome Mapping Laboratory, 111

River-tolerant of low input levels, an essential trait for areas

bend Rd., Rm. 228, University of Georgia, Athens, GA 30602.

E-mail: [email protected] such as northeast Africa and the U.S. Southern Plains

(2)

368 J. E. Bowerset al.

that receive too little rainfall for most other grains. In tant quantitative trait loci (QTL) but lack the high marker density needed for use in complex endeavors the more arid countries of northeast Africa, such as

Sudan, sorghum contributes 39% of the calories in the such as positional cloning of genes, genetic anchoring of bacterial artificial chromosome (BAC)-based physical human diet (http://www.fao.org; 1999 statistics).

In-creased demand for limited fresh water supplies, cou- maps, or assembly of genomic shotgun sequence. The only other high-density sorghum map (Menzet al.2002) pled with global climatic trends and expanding

popula-tions, suggests that dryland crops such as sorghum will is composed largely of amplified fragment length poly-morphisms; the difficulties associated with inferring or-be of growing importance.

Despite the likely growing importance of sorghum, thology of these arbitrary-sequence markers across taxa constrain its value for comparative and evolutionary ge-its improvement has lagged behind that of maize, wheat,

and rice, each of which have more than doubled in nomics. Our map is currently being used to anchor BAC-based physical maps of both S. bicolorand S. propinquum average yield on a worldwide basis in the last 38 years

while sorghum yields have gained only 51% (average (Linet al. 1999; Draye et al. 2001), to facilitate rapid gene isolation by map-based cloning and provide land-1961–1963 compared to 1999–2001; http://www.fao.

org). In sub-Saharan Africa, already home to many of marks for eventual genomic sequence assembly. The genetically anchored probes used in this map are also the world’s hungry and with a population projected to

double over the next 40 years (U.S. Census Bureau being hybridized to BAC libraries from rice, sugarcane, and maize, fostering comparative genomics across the estimates 2002; http://www.census.gov), sorghum yields

have gained only 6% over the last 38 years compared Poaceae. to 50% gains in wheat and maize (http://www.fao.org).

In the U.S., sorghum was introduced over 200 years

MATERIALS AND METHODS ago, possibly by Benjamin Franklin (Smithand

Freder-iksen 2000) and is now grown on 9–13 million acres. Laboratory procedures:The genetic population and molec-U.S. sorghum is principally used as an animal feed and ular methods are as previously described (Chittendenet al. therefore escapes direct notice by the general public, 1994), except that the mapping population was expanded to 65 individuals from 56, drawing additional F2progeny at

but is the 13th most valuable crop in the U.S. with a

random from residual seeds of the original cross. Briefly, DNA farm-gate value ranging from $0.8 to 2.0 billion/year

was extracted from young leaves by a published protocol (USDA 1992–2001 statistics).

(Chittendenet al.1994),ⵑ5␮g DNA per lane digested with

S. bicoloris native to Africa. One other euploid species 15 units ofEcoRI,HindIII, orXbaI (Promega, Madison, WI), exists within the genus,S. propinquum, which is native electrophoresed and blotted onto Hybond N⫹ (Amersham, Arlington Heights, IL), rinsed in 2⫻SSC, and stored at 4⬚ to Asia and contains many “weediness” traits such as

rhi-until use. About 20–50 ng of PCR-amplified fragment was zomes, small seeds, and shattering. The genus also

in-labeled with [32P]dCTP, hybridized to blots, washed, and

ex-cludes S. halepense, a tetraploid (2n ⫽ 40) thought to

posed to X-ray film as described (Chittendenet al.1994). be derived from naturally occurring crosses between DNA markers and sequences:Prefixes of DNA markers used S. bicolorandS. propinquum(both 2n⫽20).S. halepenseis and their sources are as follows. Arabidopsis cDNA: AEST (R. Scholl, Arabidopsis Biological Resources Center, Ohio State among the world’s most noxious weeds, with widespread

University), AHD and HMG (T. Thomas, Texas A&M); Barley distribution. In the U.S., many local epithets forS.

hale-cDNA: BCD (M. Sorrells and S. Tanksley, Cornell);

Johnson-pensehave largely been supplanted by the term “Johnson

grass rhizome cDNA: pHER, pSHR (Y.Siand A. H.Paterson, grass,” first documented in an 1874 letter, referring to unpublished results); Maize PstI genomic clones: BNL, UMC Colonel William Johnson, an Alabaman who sowed it (E. Coe and M. McMullen, University of Missouri); Maize on his farm (McWhorter1971). The first U.S. federal cDNA: CSU (Coe, McMullen); Millet Pst1 genomic clones: M (M. Gale, John Innes Center); Oat cDNA: CDO (Sorrells, appropriation for weed research targeted Johnsongrass

Tanksley); Sorghum cDNA: HHU (Wyrich et al. 1998), (House Bill 121, 56th Congress, 1900).

HHUK (Annen et al. 1998); Sorghum phytochrome genes: Cross-fertility between S. bicolor and S. propinquum PHY (L. H. Pratt and M.-M. C.-Pratt, University of Georgia); has permitted us not only to benefit from high levels of Sorghum PstI genomic DNA: pSB, SHO (A. H. Paterson); DNA polymorphism between them to build the detailed Sugarcane cDNA: CDSB, CDSR (P. Moore, Hawaiian Agricul-tural Research Center); Sugarcane genomic clones: SG (Sor-molecular map described herein, but also to conduct

rells); Rice genomic clones: RG and cDNA: RZ (S. McCouch genetic analysis of many traits associated with grass

do-and S. Tanksley, Cornell), C, G, do-and R (T. Sasaki, RGP, Japan). mestication (e.g.,Patersonet al.1995a,b). The genetic

Sequences were obtained from the National Center for Bio-map presented herein builds on and integrates much technology Information (NCBI) or developed in house by end earlier work (Chittenden et al. 1994;Lin et al. 1995; sequencing of probes using standard methods. In house se-quencing used a software pipeline in which sequence data in

Annenet al.1998;Wyrichet al.1998;Drayeet al.2001).

ABI trace file format were input into the programs PHRED Several other sorghum maps (Whitkus et al. 1992;

(version 0.000925.c) and CROSS_MATCH (version 0.990329

Ragab et al. 1994;Xu et al. 1994;Dufour et al. 1997;

with minmatch⫽12 and minscore⫽20) to trim poor quality

(3)

impor-369 Sorghum STS Map for Comparative Genomics

TABLE 1

Summary statistics for the SBSP linkage groups

SP SB

dominant dominant

Linkage group (LG) Size (cM) Loci no. Probe no. No. % No. % Largest gap (cM)

A 130.1 333 328 32 0.10 38 0.11 4.6

B 120.8 331 323 42 0.13 36 0.11 3.9

C 118.5 499 476 34 0.07 122 0.24 3.1

D 81.6 187 184 14 0.07 20 0.11 7.0

E 84.7 146 139 10 0.07 23 0.16 7.8

F 127.8 275 266 38 0.14 32 0.12 5.4

G 107.0 196 190 24 0.12 26 0.13 7.0

H 85.4 191 188 35 0.18 40 0.21 3.9

I 107.0 216 209 25 0.12 32 0.15 4.6

J 96.3 138 135 17 0.12 26 0.19 7.0

Total 1059.2 2512 2438 271 395

accession numbers is available as supplementary documenta- RESULTS

tion at http://www.genetics.org/supplemental/.

Genetic map:The SB ⫻ SP map (Table 1; Figure 1

Map construction:A framework map ofⵑ600 codominant

and available at http://www.plantgenome.uga.edu/ markers was constructed using the program MAPMAKER v2.0

on the PC, with error detection on (Lander et al.1987). A sorghummap) is composed of 2512 loci on 10 linkage new program written in Microsoft Visual Basic ( J. E.Bowers groups that collectively span 1059.2 cM (Kosambi1944).

and A. H.Paterson, unpublished data) was then used to

This is a 2236-locus (about sevenfold) increase com-insert additional markers into the framework. The algorithm

pared to our previously published map of 276 loci used by this program was to determine the genotypes of each

(Chittenden et al. 1994), yet the recombinational individual for each interval between markers already placed

on the map, with multiple genotypes possible for individuals length has actually been reduced from 1445 cM to the in which crossovers were observed between the framework current 1059.2 cM largely by virtue of a sufficiently high loci. In cases where genotypes were uncertain due to dominant density of markers to distinguish errors from true dou-markers or missing data, the genotypes of the intervals were

ble recombinants. The map is based on a total of 1376 inferred from flanking loci, assuming that the minimum

num-detected crossovers (see materials and methods), ber of recombinations had occurred. An unmapped locus was

which would correspond to 1386 potentially distinct tested against the possible genotypes for all intervals already

on the map, in search of a perfect match. If such a match was map locations; we have identified markers at 853 found the marker was assigned to the appropriate interval, (61.5%) of these possible locations. The largest gap and then the framework was recomputed. If no perfect match between two loci corresponds to only 7.8 cM or 10 cross-was found, a second pass cross-was made looking for matches to all

overs and only seven intervals in the map were⬎5 cM. but one individual, followed by subsequent passes with higher

On the basis of the 65 F2plants used, a single

recombi-numbers of nonmatching individuals. Loci from these

subse-quent passes were rechecked for scoring errors in the individ- nation event yields an estimate of 0.77 cM between con-ual that did not fit the expected pattern. If the data were secutive loci, which defines the resolution limit of the determined to be correct the locus was then added to the map. Consequently, loci are plotted to intervals of this framework map with a new recombination event not observed

size (in the figure rounded to one decimal place). in the previous map. Individuals mapping to the ends of the

All of the restriction fragment length polymorphism chromosomes could not be placed with this approach and had

(RFLP) markers tested could be placed on the map to be added to the framework manually or with Mapmaker.

After the map had been constructed, it was manually edited although a small number (⬍20) that had initially been to reduce the number of recombinations by exporting the scored were determined in retrospect to be too faint locations of crossovers observed in the map into a spreadsheet. for accurate scoring and were discarded. A similar num-Instances with multiple recombinations for an individual

prog-ber of markers with two segregating bands of nearly the eny plant were reordered if possible to reduce the total

num-same migration rates, which could not be reliably distin-ber of recombination events observed. This step involved

ex-guished from one another, were also discarded. Another tensive checking of the raw data (films) for errors, with the

plants apparently responsible for double recombinations be- group of⬍20 markers that could not be mapped showed ing rechecked. Ostensibly codominant markers that could not segregation ratios approaching 15:1 and were assumed be placed on the map were split into two dominant markers to be caused by two loci with indistinguishable band to attempt their mapping separately. Final map distances were

sizes and were therefore discarded. computed using Kosambi (1944) centimorgans (cM), and

Duplicate probes were removed from the map by in-maps were drawn by another Visual Basic program written for

(4)

coseg-370 J. E. Bowerset al.

(5)

371 Sorghum STS Map for Comparative Genomics

(6)

372 J. E. Bowerset al.

(7)

373 Sorghum STS Map for Comparative Genomics

Figure1.—Continued.

regating loci and also by sequence comparisons of most found in the population vs. 262 adjacent crossovers, a highly significant difference (Figure 2). The numbers probes. Some probes used in past studies were shown

to be identical to newly mapped sorghum probes and of observed genotypes in the two categories differ sig-nificantly (P ⬍ 0.05) from the expected (equal num-in a few cases cDNAs from other species corresponded

closely to sorghum probes or to one another. In cases bers) for cases in which the two recombination events were separated by 0–10 cM, 10–20 cM, and 40–50 cM where RFLP markers had similar or identical sequences

and mapped to similar or identical loci, one of the and narrowly missed significance for the 20–30 cM (0.07) and 30–40 cM (0.06) spacings. Over intervals of duplicates was removed from the map. In total, this

resulted in the removal of 336 markers at 386 loci (which ⬎50 cM, no significant differences were found in the frequency of doublevs.adjacent crossovers.

are not included in the 2050 probes and 2512 loci that

compose the map). The genetic locations and correspond- Segregation distortion: Five regions on the genetic map showed segregation distortion significant at the 5% ing information for these loci remain available at our web

site (http://www.plantgenome.uga.edu/sorghummap). level. The apices of distortion in the five regions were on LG B near cM 50.0, LG C near cM 46.2, LG D near

Recombinational interference: Recombinational

in-terference was assessed by comparing the frequency of cM 66.2, LG G near cM 26.2, and LG I near cM 0.0. Curiously, all five regions showed segregation distortion occurrence of “double crossover” genotypes (i.e., aa–

ab–aa; bb–ab–bb) to “adjacent crossover” genotypes favoring the S. bicolor alleles. By far the most striking case was on LG C—the apex of the distortion was near (i.e., aa–ab–bb; bb–ab–aa) as a function of the size of

the interval that contains the two crossovers required to the locus CSU507 and comprised a segregation ratio of 41:17:2 (homozygous S. bicolor :heterozygote:homozy-produce each genotype. In the absence of interference,

these two different classes of genotypes would be equally gousS. propinquum), significantly (P⬍3⫻10⫺12)

differ-ent from the expected 1:2:1 ratio. In a larger set of F2

(8)

374 J. E. Bowerset al.

Figure1.—Continued.

progeny from the same cross (Linet al.1995;Paterson dw2gene regulating plant stature (Linet al. 1995; Pat-ersonet al. 1995a), andpApo1gene regulating apomixis et al. 1995a), we found similarly distorted segregation

(236:94:8) in this region. (R.Jessup, G.Burow, M.Husseyand A. H.Paterson,

personal communication)]. The average number of loci

Patterns of DNA marker distribution: We evaluated

the distribution of DNA markers across the sorghum in the deliberately enriched intervals plus the short ter-minal bins was 23, virtually identical to the average map by comparing intervals of exactly 10.0 cM in length,

starting from the top of each chromosome as drawn across the remainder of the genome (23.28); therefore elimination of these anomalous regions has no effect (Figure 1), except that the last interval in each group was

eitherⱕ15 cM orⱖ5 cM to accommodate the varying on the analyses.

Virtually every linkage group has at least one interval lengths of the linkage groups. On the basis of the total

number of loci per linkage group, the Poisson probabil- containing more loci than would be expected to occur by chance in 1% or fewer cases (A06-7; B01 and B08; ity distribution function was applied to identify intervals

that contained significant excesses or deficiencies of C02, -04, and -08; D02 and D04-07; E05; F06-8; G04 and G06-7; H04-05; I06; and J04 and -06).

various classes of probes. We note that two regions (C04

and D04-07) were preferentially enriched for markers Significant marker deficiencies were associated with 13 intervals, including A01*, A02; B10 and B12*; C06, because they contain genes that we seek to clone [C04,

the sorghumSh1gene regulating shattering of the ma- C07, and C12*; F01*, F05, F11, and F13*; G09; and H02. These included a disproportionately large number, 5 ture inflorescence (Paterson et al.1995b); and

(9)

375 Sorghum STS Map for Comparative Genomics

Figure1.—Continued.

length of three terminal intervals (B12⫽5.8 cM; C12⫽ clone-derived loci over the intervals were closely corre-lated (r⫽ 0.79).

7.5 cM; F13⫽8.7 cM) contributed partly to their marker

(10)

376 J. E. Bowerset al.

Figure1.—Continued.

loci showed dominant inheritance, segregating as pres- dominant loci (nominally below the average of 2.7). By far the largest concentration of SP-derived dominant ence of an allele from one parent and absence from

the other parent. A total of 395 (15.7%) of the dominant loci (23, 8.6%) was in interval H05—while this interval is generally marker rich, the number of SP-derived dom-alleles were from SB and 269 (10.7%) from SP, a highly

significant difference (␹2⫽23.9, 1 d.f.,P1.110⫺6). inants in this interval is50% higher than the number

of SB-derived dominants (15), the opposite of their Distribution of dominant loci is shown in Figures 1

and 3. 50% lower abundance elsewhere in the genome. Several

other intervals are also preferentially enriched for domi-Among the 395 SB-derived dominant loci, 74 (18.7%)

are in the single 10-cM interval C05, far beyond the nant loci from one parent or the other: F07 contains an abundance of 9 (Pⱕ 0.0009) SP-derived dominant random expectation (⬎8 loci would have been expected

(11)

377 Sorghum STS Map for Comparative Genomics

Figure1.—Continued.

0.0013)vs.only 1 SP-derived dominant (Pⱕ0.25) and numbers) we were able to identify 130 simple sequence repeat (SSR)-containing sequences (defining an SSR as I09 contains an abundance of 7 (Pⱕ0.0009) SP-derived

dominants,vs.1 SB-derived dominant (P ⱕ0.29). 6 or more repeats of a dinucleotide or repeats stretching 15 or more base pairs of longer repeat units). Although Even after removing the dominant probes mapping

to interval C04 of LG C, a significant excess ofS. bicolor- the distributions of genomic and cDNA clone-derived loci were closely correlated (r ⫽ 0.79), the genomic derived dominant markers (321,vs. 267S. propinquum

dominants, significant at the 5% level) still remains. distribution of the SSRs was only loosely related to that of the entire population of mapped DNA probes (r⫽ Curiously, this excess is explained almost completely by

one marker class, the pSB probes, which were derived 0.33), suggesting that SSRs may locate in different ge-nomic domains more frequently than low-copy probes. fromS. bicolorhypomethylated (PstI-digested) genomic

DNA. The pSB clones detected 152S. bicolordominants The relationship between SSR distribution and probe distribution was somewhat closer (r⫽0.43) after remov-and 102S. propinquumdominants (exclusive of probes

mapping to LG C05). After removing the pSB clones, ing the strong biases in distribution of dominant loci, partly attributable to possible genomic rearrangements there remains a nonsignificant difference of 169S.

bi-color dominants vs. 165 S. propinquum dominants for (see below). The map location of SSR-containing clones is shown in Figure 1. Further characterization of a subset non-pSB probes outside of interval C05.

Simple sequence repeat-containing loci:On the basis of the SSRs has been described (Schlosset al.2002).

Distribution of duplicate loci:Among the 2050 probes

of the sequences of 1933 probes (see http://www.plant

(12)

378 J. E. Bowerset al.

Figure1.—Continued.

multiple loci that could be mapped, with 279 detecting 2 act loci involved. On the basis of a chi-square contin-gency test, the distribution of duplicate loci over pairs loci, 58 detecting 3 loci, 13 detecting 4 loci, 6 detecting

5 loci, and 1 detecting 6 loci. The distribution of dupli- of linkage groups was not random (␹2 ⫽ 224.06, with

81 d.f.). Several pairs of linkage groups showed striking cated loci across the genome is illustrated in Figure 4,

composed of 606 data points (keeping in mind that excesses of duplicated loci (A and G, C and G, C and H, E and H, E and I). Associations of individual linkage 2-locus probes generate 1 point of intersection, 3-locus

probes generate 3 points, 4-locus probes generate 6 groups with multiple partners (for example G with A and C), together with our prior observations (

Chitten-points, 5-locus probes generate 10 Chitten-points, and 6-locus

probes generate 15 points). A clickable web-based ver- denet al.1994) and other work (Whitkuset al.1992;

GautandDoebley 1997;Gaut2001), suggest that if sion of this figure is available at http://www.genetics.

(13)

379 Sorghum STS Map for Comparative Genomics

Figure1.—Continued.

the polyploidization event must be very ancient, surely spondence with 0, 1, 2, or 3 other intervals in the ge-nome. A total of 74 duplicate loci are intrachromoso-predating the Sorghum-Zea divergence. Therefore we

reevaluated the data on the basis of smaller intervals, mal, not significantly different from the random expectation.

breaking each sorghum chromosome into four

“inter-vals” of equal length in centimorgans. This yielded an Correspondence to gene arrangements in other taxa:

Table 2 summarizes the sources of clones and loci that overall contingency chi-square of 2287.53 (with 1521

d.f., P ⬍ 7 ⫻ 10⫺33), further supporting the notion have been mapped to date, illustrating the

opportuni-ties to use this map as a basis for comparisons of many that duplicated loci are not randomly distributed across

chromosome pairs. Among the 820 possible comparisons Poaceae taxa.

As an especially important example of the utilization (including intrainterval comparison), a total of 22 pairs

(2.7%) of intervals shown in Table 2 showed positive of these data, Figure 5 illustrates comparative alignments of the sorghum and maize genomes based on 952 loci deviations from the random expectation that were

sig-nificant at 0.005 (as measured by 1 d.f. chi-square), from the maize “bins” map (Gardineret al.1993;Davis

et al.1999). A clickable version of Figure 5 is available about 5.4 times higher than the random expectation.

(14)

380 J. E. Bowerset al.

Figure2.—Summary of recombinational in-terference. The frequencies of adjacent crossovers (two crossovers occurring on different chromo-somes within the same individual) and double crossovers (occurring on the same chromosome in the same individual) are plottedvs.the dis-tance between the crossovers.

the specific probes and loci involved. This comparison 136 different functional molecular groups. The most common molecular functional groups were “ATP bind-represents a considerable increase over previously

pub-lished data (Whitkuset al. 1992; Pereira et al.1994; ing” (120 hits) and “Protein Kinase” (56 hits). Results of sequence similarity analyses are available at http://

Ragabet al. 1994;Patersonet al.1995a;Dufouret al.

1996). The distributions of loci over the 100 possible www.genetics.org/supplemental/. combinations of maize chromosomes and sorghum

link-age groups were clearly not random (contingency␹2⫽

DISCUSSION 790.04, with 81 d.f., P ⬍ 9 ⫻ 10⫺117). A total of 19

(19%) cells with the largest excesses (from 7.5 to 36.6) This genetically anchored set of sequence-tagged sites provides transferable DNA markers suitable for a wide of observed data over random expectations account for

520 (55%) of the corresponding points and 74% (582.84) range of investigations in structural, functional, and evo-lutionary genomics in several major grain and biomass of the chi-square deviation from randomness,

sug-gesting the correspondences illustrated in Figure 5 and crops. Although the map was created using the RFLP method and has been applied to several goals by this listed in Table 3.

Marker sequence annotation: Multiple local align- technology (e.g.,Linet al.1995;Patersonet al.1995a,b;

Katsaret al.2002;Minget al.2002), genetically mapped ment searches using the programs blastn and tblastx

were used for sequence annotation against publicly sequence tagged sites such as these can be used to dis-cover single-nucleotide or small insertion/deletion poly-available databases of the NCBI as of November 21,

2002. The default matrix BLOSUM 62 and a cutoff of morphisms that can then be genotyped by many alterna-tive technologies. This possibility increases the value of 1⫻ 10⫺6 were used in all BLAST searches. The NCBI

database was subdivided into several taxon-specific groups these loci and reduces the costs associated with their wider utilization. A total of 130 loci that contain simple-to allow for the efficient determination of not only the

best overall hit, but also the best hit among closely sequence repeats have the further advantage of being relatively allele rich (Schlosset al.2002), a benefit in related species, excluding unannotated expressed

se-quence tag and genomic survey sese-quence database en- studies that require differentiation between closely re-lated genotypes.

tries. Additional analyses included the use of hidden

Markov models to classify sequence data by protein se- This framework of genetically anchored sequence-tagged sites will also provide a foundation for physical quence signature. The program InterProScan (

Zdob-novandApweiler2001) was used to search to compare mapping and ultimately assembling a robust finished se-quence of the sorghum genome. The present map per-the translated sorghum sequences against several protein

databases (Pfam, SMART, and ProDom) and Genome mits us to assign loci to bins of ⵑ0.77 cM; on average, this representsⵑ300 kb of genomic DNA based on a Ontology (Ashburneret al.2000) numbers for each of

these classifications were obtained. The results of these consensus genome size estimate of 750 Mbp (although we have recently estimated the genome to be somewhat analyses revealed that the 578 sorghum query sequences

(15)

differ-381 Sorghum STS Map for Comparative Genomics

Figure 3.—Distribution of codominant and dominant markers along the sorghum map. For 10-cM intervals along each linkage group, the num-bers of codominant (solid), S. bicolor dominant (open), and S. propinquum dominant (shaded) loci are plotted.

ent loci within 0.77-cM bins, we are presently hybridizing less evenly distributed through the genome. By simply hybridizing the 2050 mapped probes to the 10⫻ -cover-the genetically mapped probes to BAC libraries for both

S. propinquumandS. bicolor. Since the two BAC libraries age BAC libraries, we expect to identifyⵑ20,000 BACs in each library, comprisingⵑ50% of the genome. Fur-each provideⵑ10⫻ coverage of the genome and are

composed of individual BACs that averageⵑ120 kb in ther, both libraries have been fingerprinted (http://www. genome.arizona.edu/fpc/sorghum/), permitting the re-length, this will permit us to resolve the order of closely

(16)

382 J. E. Bowerset al.

Figure4.—Patterns of duplication within the sorghum genome. In this Oxford grid, each dot represents a genetically mapped locus detected by probe that segregated at two or more polymorphic loci in sorghum, with thex- andy-axis representing chromo-somal locations. Red circles along the axes represent the approximate locations of the centromeres. The total number of probes mapping to each pair of sorghum linkage groups is shown in each cell. Areas high-lighted in yellow represent regions of sig-nificant marker abundance between a pair of linkage groups (determined as described in text). Note that some dots represent mul-tiple probes with the same genetic loca-tions; for a detailed list of exact information for each cell, see http://www.genetics.org/ supplemental/.

proaches made possible by the alignment of our geneti- mapping of the locations of the sorghum centromeres is in progress by probing synaptonemal complex spreads cally mapped sequences to the nearly completed rice

sequence, a robust genetically anchored physical map with genetically mapped probes or their correspond-ing BACs by fluorescence in situ hybridization (D. G. is expected to coalesce.

Nonrandom patterns of DNA marker distribution Petersonand A. H.Paterson, unpublished data). Clearly, more information will be needed to explain provide clues to the locations of interesting and

impor-tant features of sorghum genome organization. On most the multiple, dispersed marker-dense regions found on several linkage groups. For example, linkage group B chromosomes, at least one significant concentration of

loci appears to correspond to the centromeric region. has one terminal concentration of markers and another interstitial concentration. We have recently shown that We have recently applied overgo (Caiet al.1998) probes

for sorghum centromeric repetitive sequences homolo- some sorghum chromosomes have cytologically distin-guishable knobs (D. G.Petersonand A. H.Paterson, gous to pHind22 and Cen38 (Milleret al.1998;Zwick

et al. 2000) to the SP and SB BAC libraries. Co- unpublished observations), and future studies will inves-hybridization of these probes with genetically mapped

RFLPs has associated concentrations of centromeric

re-TABLE 2 peats with marker-dense regions of 8 of the 10 linkage

groups (LG A: DM064, cM57.7; B: DM007, cM57.7; C: Summary of probe sources

pSB1406, cM47.7; D: pSB580, cM47.7 and pRC162, cM60;

Genomic cDNA Total

E: CSU0462 and pRC182, cM47; G: R2447, cM64.6; I: 5C04A07, cM69.3; and J: pSB0019, cM50.8). Due to the

Arabidopsis 0 52 52

repetitive nature of these probes and the possibility that Zea 62 189 251

not all copies are centromeric, these data can be taken Triticeaea 0 65 65

only as tentative indications of the possible locations of Pennisetum 23 171 194

Oryza 10 175 185

the sorghum centromeres. For example, on two linkage

Sorghum 725 464 1189

groups we found associations with mapped probes in

Saccharum 11 103 114

regions of normal marker density (LG A: pSB1075,

TOTAL 831 1219 2050

cM98.5; LG H: HHU49, cM68.5). Further, we found no

aTriticum, Hordeum, or Avena.

(17)

383 Sorghum STS Map for Comparative Genomics

Figure 5.—Patterns of colinearity be-tween sorghum and maize. In this Oxford grid, each dot represents a locus detected by a probe that was genetically mapped in both sorghum (left) and maize (top), with thex- andy-axis representing chromosomal locations in each taxon. The total number of probes mapping to each pair of maize and sorghum chromosomes is shown in each cell. Lines highlight the regions for which we have inferred synteny between maize and sorghum, as summarized in Ta-ble 3. Note that some dots represent mul-tiple probes with the same genetic loca-tions; for a detailed list of exact information for each cell, see http://www.genetics.org/ supplemental/.

tigate whether these could account for some marker near the interval (H05) that contained by far the largest concentration of SP-derived dominant loci (23, 8.6%). excesses or deficiencies. Linkage groups C, D, G, and J

also show multimodal distributions of marker density This suggests that the ribosomal DNA and a large flank-ing area may have moved in one of the two sorghums that warrant further study.

Differences in the abundance of dominant genetic since their divergence from a common ancestor, a hy-pothesis that we are further investigating (D. G.

Peter-marker loci appear to suggest that a chromosome struc-tural rearrangement has occurred since the divergence ofS. bicolorandS. propinquumfrom a common ancestor.

TABLE 3 The single 10-cM interval C05 contains 74 (18.7%) of

Correspondence of maize chromosomes to sorghum the 395 SB-derived dominant loci found, far beyond the

linkage groups random expectation (see above), and 71 of these

coseg-regate at the single location cM 46.2 (along with 6

co-Maize chromosome Sorghum linkage group(s)a

dominants, and one locus dominant for the SP allele).

Curiously, this interval is also the apex of the most pro- 1 C, J, C

nounced segregation distortion found (41:17:2, fa- 2 B, D

3 E, A

voring bicolor homozygotes as described above). The

4 J, F, H

DNA sequences of some of the probes that detect

5 C, F

S. bicolor-dominant markers at LG C, cM 46.2 correspond

6 I, G

to various portions of the ribosomal DNA [specifically

7 B

AEST602 matches 18s rRNA (GenBank accession no. 8 A, G, A

X16077) at e ⬍ 10⫺200 and C152 and pRC017 match

9 I, G

the 25S ribosomal RNA gene (GenBank accession nos. 10 E, D

M11585 and AY108843) ate⫽10⫺170ande410⫺71,

aProceeding from left to right along the maize

chromo-respectively]. some as drawn in Figure 5. For identities of specific loci, see

These three probes also mapped as dominant markers supplementary documentation at http://www.genetics.org/ supplemental/.

(18)

384 J. E. Bowerset al.

son, J. E. Bowers and A. H.Paterson, unpublished relationship with any of the factors studied herein. Cor-relations of allelic diversity with total mapped locus data) and that is consistent with recent findings in rice

(Shishidoet al.2000) and legumes (Singhet al.2001). abundance per interval (0.0006), codominant locus abundance (⫺0.038), and SSR abundance (⫺0.16) were On the basis of genotypes inferred from segregation at

nearby codominant loci, all 435 plants that have been remarkable in the lack of information they yielded. An important future investigation will be to study how studied to date from this cross (including those from a

larger population used for mapping QTL; e.g., Lin et marker density and/or allelic diversity correlate with the distributions of phenotypically significant variants al. 1995; Paterson et al. 1995a) possess at least one

copy of the ribosomal DNA on either LG C or LG H. such as QTL.

By virtue of a very high level of DNA polymorphism These results are consistent with a requirement for a

copy of ribosomal DNA for survival of gametes. We have (Chittendenet al.1994), the SB⫻SP cross has proven especially facile for “comparative mapping” of DNA also noticed some degree of enrichment of the two

affected genomic intervals for QTL that differentiate clones that have been previously mapped in other taxa. To foster opportunities to use the relatively small ge-between SB and SP [specifically for the number of

seed-ling tillers and regrowth on H05 (Patersonet al.1995b) nome of sorghum to help advance genomics in the larger genomes of many other tropical Poaceae, we have and for the number of seedling tillers, three measures

of rhizomatousness, and seed weight on C04 (Paterson mapped 865 heterologous DNA clones from eight other taxa (Table 2). In one example, we show herein the et al. 1995a,b)].

The finding that manyS. bicolorhypomethylated (PstI- alignment of the sorghum genome to the four times physically larger genome of maize. Most maize chromo-digested) genomic probes lacked a homolog inS.

propin-quum suggests that there has been considerable and somes correspond to nonoverlapping regions of only one sorghum chromosome but most sorghum chromo-rapid divergence or deletion of low-copy DNA in these

taxa. In contrast to cDNAs and excepting the probes somes correspond to nonoverlapping regions of two maize chromosomes, reiterating the recent duplication mapping near the ribosomal DNA discussed above, a

total of 728 pSB probes detect 152 dominant loci that in maize. Sorghum is an especially valuable guide for genomic analysis of Saccharum [sugarcane, one of the lack anS. propinquum allelevs. only 102 loci that lack

anS. bicolorallele, a highly significant difference. This world’s most important crops, with the 2001/2002 world cane sugar crop forecast at a near record 126.8 million suggests that a portion of the sorghum genome may be

composed of rapidly evolving low-copy DNA, such as metric tons (FAS 2001)], which may have shared a com-mon ancestor as recently as 5 million years ago (Sobral

has been reported for tomato (Zamir and Tanksley

1988). However, this portion of the sorghum genome et al.1994). The present data supplement and comple-ment our prior efforts in this regard (Minget al.1998, is likely to be relatively smaller in sorghum than in

tomato, as Cot analysis shows that sorghum has a much 2002). Other work in progress uses probes described herein (Table 3) together with species-specific recombi-smaller low-copy DNA fraction (Petersonet al.2002).

Finally, we note that the lack of an RFLP allele at a nation data to address the comparative organization of sorghum and other tropical grasses including Penni-dominant locus does not necessarily reflect deletion of

the locus, but could be attributable to comigration with setum (R.Jessup, M.Husseyand A. H.Paterson, per-sonal communication), Cynodon (C.Bethel, E.Sciara

monomorphic loci, gain/loss of restriction sites creating

short or long fragments that are not captured on South- and A. H.Paterson, personal communication), Echi-nochloa (T.Fukao, A. H.Patersonand M.Rumpho, ern blots, or other artifactual reasons, which presumably

account for many of the 102S. bicolorloci that are null unpublished data), and Panicum (A.Missaoui, A. H.

Patersonand J.Bouton, personal communication). for theseS. bicolor-derived probes.

The genomic distribution of mapped (i.e., polymor- Despite the clear value of the comparative approach for fostering progress in study of gene arrangement in phic in SB⫻SP) loci shows little relationship to

differ-ences in levels of intraspecific allelic diversity in differ- complex genomes (e.g., Saccharum) or underexplored taxa (e.g., Pennisetum, Cynodon, Echinochloa, and Pan-ent chromosomal regions (Dvoraket al.1998;Hamblin

andAquadro1999). In a separate study (P.Morrell, icum), it is equally important to note that a remarkable 45% of comparative data fell in regions other than those J. E.Bowersand A. H.Paterson, personal

communica-tion), we have shown that allelic diversity is not randomly we infer to correspond between sorghum and maize. Many of these incongruities are likely to reflect nonchro-distributed across the sorghum chromosomes but is

highly structured. For 183 loci representing most of the mosomal rearrangement mechanisms that are becom-ing clear from microsynteny studies (Tikhonov et al. 10-cM bins in this study, we have estimated allelic

rich-ness (not shown) from a worldwide sample of 55 land- 2000) and studies of ancient duplication (Bennetzen

2000; Paterson et al. 2000) or, possibly, rapid diver-race and wild accessions representing the breadth of

diversity in the Sorghum genus (P. J.Morrelland A. H. gence or deletion of hypomethylated DNA as we report above. A few tantalizing hints of the possibility of more

Paterson, personal communication)—curiously, these

(19)

385 Sorghum STS Map for Comparative Genomics

An anchored framework BAC map of mouse chromosome 11

locus arrangements in several regions (e.g., maize chr.

assembled using multiplex oligonucleotide hybridization.

Geno-1/sorghum LG A), but await more data to test with mics54:387–397.

confidence. One sorghum linkage group (H) that is Boivin, K., M. Deu, J. F. Rami, G. TroucheandP. L. Hamon, 1999 Towards a saturated sorghum map using RFLP and AFLP

mark-well populated with DNA markers (Table 1) shows

re-ers. Theor. Appl. Genet.98:320–328.

markably little correspondence to any maize chromo- Chittenden, L. M., K. F. Schertz, Y. R. Lin, R. A. WingandA. H. some ( just a small portion of maize chromosome 4). Paterson, 1994 A detailed RFLP map of Sorghum bicolorS. propinquum, suitable for high-density mapping, suggests

ances-This hints at the possibility that large segments of

chro-tral duplication of sorghum chromosomes or chromosomal

seg-matin may have been lost during the maize-sorghum ments. Theor. Appl. Genet.87:925–933.

divergence; however, a conclusive test awaits more data. Davis, G. L., M. D. McMullen, C. Baysdorfer, T. Musket, D. Grant

et al., 1999 A maize map standard with sequenced core markers,

While our results clearly reinforce the evidence in

grass genome reference points and 932 expressed sequence

support of the duplication of most regions of the maize tagged sites (ESTs) in a 1736-locus map. Genetics152:1137–1172. genome, many questions remain about the levels, pat- Dewet, J. M. J., S. C. Gupta, J. R. HarlanandC. O. Grassl, 1976 Cytogenetics of introgression from Saccharum into Sorghum.

terns, and antiquity of chromatin duplication within

Crop Science16:568–572.

sorghum itself. The patterns of distribution of duplicate

Draye, X., Y.-R.Lin, X. Y. Qian, J. E.Bowers, G. B.Burowet al.,

loci in sorghum are clearly not random, with many small 2001 Toward integration of comparative genetic, physical, di-versity, and cytomolecular maps for grasses and grains, using the

islands of colinearity evident, and adjacent intervals

of-Sorghumgenome as a foundation. Plant Physiol.125:1325–1341.

ten showing correspondence to syntenic intervals (A2,

Dufour, P., L. Grivet, A. Dhont, M. Deu, G. Troucheet al., 1996

-3, and -4 to G3, -1, and -4; E3-4 to H2-1; F3-4 to I3, I1; Comparative genetic mapping between duplicated segments on

maize chromosomes 3 and 8 and homoeologous regions in

sor-J2, -4 to I2-1). However, forⵑ30% of the genome we

ghum and sugarcane. Theor. Appl. Genet.92:1024–1030.

can discern no corresponding duplicated region, and

Dufour, P., M. Deu, L. Grivet, A. Dhont, F. Pauletet al., 1997

another 30% shows correspondence to two or more un- Construction of a composite sorghum genome map and

compari-son with sugarcane, a related complex polyploid. Theor. Appl.

linked regions. Duplication of sorghum chromatin

ap-Genet.94:409–418.

pears to more closely resemble the pattern observed for

Dvorak, J., M. C. LuoandZ. L. Yang, 1998 Restriction fragment

rice, in which the completed sequence (Goffet al.2002; length polymorphism and divergence in the genomic regions of

Yuet al.2002) has largely borne out early hints (Kishi- high and low recombination in self-fertilizing and cross-fertilizing

Aegilopsspecies. Genetics148:423–434.

moto et al. 1994; Nagamura et al. 1995) of ancient

FAS, 2001 World sugar situation. U.S. Department of Agriculture,

For-segmental duplication in some regions. The correspon- eign Agricultural Service (http://www.fas.usda.gov/htp/sugar/ dence of some sorghum genomic intervals to two or 2001/nov/sugsit.htm).

Gardiner, J. M., E. H. Coe, S. Melia-Hancock, D. A. Hosington

more unlinked intervals may reflect either very localized

andS. Chao, 1993 Development of a core RFLP map in maize

colinearity or, possibly, recent duplications superim- using an immortalized-F

2population. Genetics134:917–930.

posed on ancient ones, which may be present in maize Gaut, B. S., 2001 Patterns of chromosomal duplication in maize and their implications for comparative maps of the grasses.

Ge-as we speculate above. Much more data will be needed

nome Res.11:55–66.

to unravel the details of the relationship(s) between Gaut, B. S., andJ. F. Doebley, 1997 DNA sequence evidence for individual duplicated segments in sorghum, as well as the segmental allotetraploid origin of Maize. Proc. Natl. Acad.

Sci. USA94:6809–6814.

their relationships (if any) to those in close relatives

Goff, S. A., D. Ricke, T. H. Lan, G. Presting, R. L. Wanget al.,

such as sugarcane and maize or distant relatives such 2002 A draft sequence of the rice genome (Oryza sativaL. ssp

as rice or even Arabidopsis. japonica). Science296:92–100.

Hamblin, M. T., andC. F. Aquadro, 1999 DNA sequence variation We honor the memory of coauthor Keith F. Schertz, who made

and the recombinational landscape inDrosophila pseudoobscura: many of these discoveries possible while teaching several of us about a study of the second chromosome. Genetics153:859–869. sorghum and about much more. Science is richer for his efforts, and Haussmann, B. I. G., D. E. Hess, N. Seetharama, H. G. Welzand we are poorer for his passing. We thank the USDA-National Research H. H. Geiger, 2002 Construction of a combined sorghum link-Initiative, National Science Foundation Plant Genome Research Pro- age map from two recombinant inbred populations using AFLP, SSR, RFLP, and RAPD markers, and comparison with other sor-gram, International Consortium for Sugarcane Biotechnology, and U.S.

ghum maps. Theor. Appl. Genet.105:629–637. Golf Association for financial support of various aspects of this work.

Katsar, C. S., A. H. Paterson, G. L. TeetesandG. C. Peterson, 2002 Molecular analysis of Sorghum resistance to the greenbug (Homoptera: Aphididae). J. Econ. Entomol.95:448–457.

Kishimoto, N., H. Higo, K. Abe, S. Arai, A. Saitoet al., 1994 Identi-LITERATURE CITED fication of the duplicated segments in rice chromosomes 1 and

5 by linkage analysis of cDNA markers of known functions. Theor.

Annen, F., J. L. Chang, A. H. PatersonandJ. L. Stockhaus, 1998

Appl. Genet.88:722–726. Characterization of 14 different putative protein kinase cDNA

Kosambi, D., 1944 The estimation of map distance from recombina-clones of the C-4 plant Sorghum bicolor.Mol. Gen. Genet.259:

tion values. Ann. Eugen.12:172–175. 115–122.

Lander, E., J. Abrahamson, A. Barlow, M. Daly, S. Lincolnet al.,

Arumuganathan, K., andE. Earle, 1991 Nuclear DNA content of

1987 Mapmaker: a computer package for constructing genetic-some important plant species. Plant Mol. Biol. Rep.9:208–218.

linkage maps. Genomics1:174–181.

Ashburner, M., C. A. Ball, J. A. Blake, D. Botstein, H. Butleret

Lin, Y., K. SchertzandA. Paterson, 1995 Comparative analysis

al., 2000 Gene ontology: tool for the unification of biology. Nat.

of QTLs affecting plant height and maturity across the Poaceae, Genet.25:25–29.

in reference to an interspecific sorghum population. Genetics

Bennetzen, J. L., 2000 Comparative sequence analysis of plant

nu-141:391–411. clear genomes: microcolinearity and its many exceptions. Plant

Lin, Y., L. Zhu, S. Ren, J. Yang, K. Schertzet al., 1999 ASorghum

Cell12:1021–1029.

(20)

386 J. E. Bowerset al.

with loss-of-function mutations during crop domestication. Mol. Bowerset al., 2002 Characterization of RFLP probe sequences for gene discovery and SSR development inSorghum bicolor(L.) Breed.5:511–520.

Moench. Theor. Appl. Genet.105:912–920.

Linder, H. P., 1987 The evolutionary history of the

Poales/Res-Shishido, R., Y. Sano andK. Fukui, 2000 Ribosomal DNAs: an tionales—a hypothesis. Kew Bull.42:297–318.

exception to the conservation of gene order in rice genomes.

McWhorter, C. G., 1971 Introduction and spread of Johnsongrass

Mol. Gen. Genet.263:586–591. in United States. Weed Sci.19:496.

Singh, R. J., H. H. Kim andT. Hymowitz, 2001 Distribution of

Menz, M. A., R. R. Klein, J. E. Mullet, J. A. Obert, N. C. Unruh

rDNA loci in the genus Glycine Willd. Theor. Appl. Genet.103: et al., 2002 A high-density genetic map ofSorghum bicolor(L.)

212–218. Moench based on 2926 AFLP (R), RFLP and SSR markers. Plant

Smith, C. W., andR. A. Frederiksen, 2000 Sorghum: Origin, History,

Mol. Biol.48:483–499.

Technology and Production.John Wiley & Sons, New York.

Miller, J. T., F. G. Dong, S. A. Jackson, J. SongandJ. M. Jiang,

Sobral, B. W. S., D. P. V. Braga, E. S. LahoodandP. Keim, 1994 1998 Retrotransposon-related DNA sequences in the

centro-Phylogenetic analysis of chloroplast restriction enzyme site muta-meres of grass chromosomes. Genetics150:1615–1623.

tions in the Saccharinae Griseb subtribe of the Andropogoneae

Ming, R., S. Liu, Y. Lin, J. Dasilva, W. Wilsonet al., 1998 Alignment

Dumort tribe. Theor. Appl. Genet.87:843–853. of the Sorghum andSaccharumchromosomes: comparative

ge-Subudhi, P. K., andH. T. Nguyen, 2000 Linkage group alignment nome organization and evolution of a polysomic polyploid genus

of sorghum RFLP maps using a RIL mapping population. Ge-and its diploid cousin. Genetics150:1663–1682. nome43:240–249.

Ming, R., S. C. Liu, J. E. Bowers, P. H. Moore, J. E. Irvineet al., Thomasson, J. R., 1987 Fossil grasses, 1820–1986 and beyond, pp. 2002 Construction of aSaccharumconsensus genetic map from 159–167 inGrass Systematics and Evolution, edited by T. R. Soders-two interspecific crosses. Crop Sci.42:570–583. trom, K. W.Hilu, C. S.Campbelland M. E.Barkworth.

Smith-Nagamura, Y., T. Inoue, B. Antonio, T. Shimano, H. Kajiyaet al., sonian Institution Press, Washington, DC.

1995 Conservation of duplicated segments between rice chro- Tikhonov, A. P., J. L. BennetzenandZ. V. Avramova, 2000 Struc-mosomes 11 and 12. Breed. Sci.45:373–376. tural domains and matrix attachment regions along colinear

chro-Paterson, A., Y. Lin, Z. Li, K. Schertz, J. Doebley et al., 1995a mosomal segments of maize and sorghum. Plant Cell12:249–264. Convergent domestication of cereal crops by independent muta- Whitkus, R., J. DoebleyandM. Lee, 1992 Comparative genome tions at corresponding genetic loci. Science269:1714–1718. mapping of sorghum and maize. Genetics132:1119–1130.

Paterson, A., K. Schertz, Y. Lin, S. LiuandY. Chang, 1995b The Wyrich, R., U. Dressen, S. Brockmann, M. Streubel, C. Changet

weediness of wild plants: molecular analysis of genes influencing al., 1998 The molecular basis of C-4 photosynthesis in sorghum: dispersal and persistence of johnsongrass,Sorghum halepense(L.) isolation, characterization and RFLP mapping of mesophyll- and Pers. Proc. Natl. Acad. Sci. USA92:6127–6131. bundle-sheath-specific cDNAs obtained by differential screening.

Plant Mol. Biol.37:319–335.

Paterson, A., J. Bowers, M. Burow, X. Draye, C. Elsiket al., 2000

Xu, G. W., C. W. Magill, K. F. SchertzandG. E. Hart, 1994 A Comparative genomics of plant chromosomes. Plant Cell12:

RFLP linkage map ofSorghum bicolor(L) Moench. Theor. Appl. 1523–1539.

Genet.89:139–145.

Peng, Y., K. F. Schertz, S. CartinhourandG. E. Hart, 1999

Com-Yu, J., S. N. Hu, J. Wang, G. K. S. Wong, S. G. Liet al., 2002 A draft parative genome mapping of Sorghum bicolor (L.) Moench using

sequence of the rice genome (Oryza sativaL. ssp indica). Science an RFLP map constructed in a population of recombinant inbred

296:79–92. lines. Plant Breed.118:225–235.

Zamir, D., andS. Tanksley, 1988 Tomato genome is comprised

Pereira, M. G., M. Lee, P. Bramel-Cox, W. Woodman, J. Doebley

largely of fast-evolving, low copy-number sequences. Mol. Gen.

et al., 1994 Construction of an RFLP map in sorghum and

com-Genet.213:254–261. parative mapping in maize. Genome37:236–243.

Zdobnov, E. M., andR. Apweiler, 2001 InterProScan—an

integra-Peterson, D. G., S. R. Schulze, E. B. Sciara, S. A. Lee, J. E. Bowers tion platform for the signature-recognition methods in InterPro.

et al., 2002 Integration of Cot analysis, DNA cloning, and

high-Bioinformatics17:847–848.

throughput sequencing facilitates genome characterization and Zwick, M. S., M. N. Islam-Faridi, H. B. Zhang, G. L. Hodnett, M. I. gene discovery. Genome Res.12:795–807. Gomezet al., 2000 Distribution and sequence analysis of the

Ragab, R. A., S. Dronavalli, M. A. S. MaroofandY. G. L. Yu, 1994 centromere-associated repetitive element CEN38 ofSorghum bi-Construction of a sorghum RFLP linkage map using sorghum color(Poaceae). Am. J. Bot.87:1757–1764.

and maize DNA probes. Genome37:590–594.

Figure

TABLE 1
Figure 1.—Continued.
TABLE 2
TABLE 3

References

Related documents