Abstract: A novel set of informative microsatellite markers for pepper (Capsicum annuum L.) is provided. Screening of
approximately 168 000 genomic clones and 23 174 public database entries resulted in a total of 411 microsatellite-containing
sequences that could be used for primer design and functional testing. A set of 154 microsatellite markers originated
from short-insert genomic libraries and 257 markers originated from database sequences. Of those markers, 147 (61
from genomic libraries and 86 from database sequences) showed specific and scoreable amplification products and detected polymorphisms between at least 2 of the 33 lines of a test panel consisting of cultivated and wild Capsicum
genotypes. These informative markers were subsequently surveyed for allelic variation and information content. The
usefulness of the new markers for diversity and taxonomic studies was demonstrated by the construction of consistent
phylogenetic trees based on the microsatellite polymorphisms. Conservation of a subset of microsatellite loci in pepper, tomato, and potato was proven by cross-species amplification and sequence comparisons. For several informative
pepper microsatellite markers, homologous expressed sequence tag (EST) counterparts could be identified in these related species that also carry microsatellite motifs. Such orthologs can potentially be used as reference markers and
common anchoring points on the genetic maps of different solanaceous species.
Key words: pepper, Capsicum spp., microsatellite (SSR) markers, diversity studies, polymorphism, cross-species transferability.
Resume : Une nouvelle collection de microsatellites informatifs chez le poivron (Capsicum annuum L.) est decrite.
Le criblage denviron 168 000 clones genomiques et 23 174 entrees dans les banques de donnees publiques a permis
didentifier 411 sequences contenant des microsatellites, de concevoir des amorces et de tester leur fonctionnement.
Un jeu de 154 marqueurs microsatellites a ete derive de banques genomiques a` inserts de petite taille alors que 257
ont ete obtenus de la banque de donnees. De ce nombre, 147 marqueurs (61 des banques genomiques et 86 des banques de donnees) ont produit des amplicons specifiques et lisibles et ont permis de detecter du polymorphisme entre au
moins deux dun panel de 33 lignees comprenant des genotypes cultives et sauvages du genre Capsicum. Lutilite de
ces nouveaux marqueurs pour des etudes de diversite et de taxonomie a ete demontree en produisant des arbres phylogenetiques concordants bases sur le polymorphisme des microsatellites. La conservation dun sous-ensemble des locus
microsatellite chez le poivron, la tomate et la pomme de terre a ete montree par amplification interspecifique et la
comparaison des sequences. Pour plusieurs microsatellites informatifs du poivron, des etiquettes de sequence exprimee
( EST ) homologues comprenant des microsatellites ont ete identifiees chez ces espe`ces apparentees. De tels orthologues peuvent potentiellement servir de marqueurs de reference et de points dancrage sur les cartes genetiques des differentes espe`ces de solanacees.
Mots-cles : poivron, Capsicum spp., marqueurs microsatellites ( SSR ), etudes de la diversite, polymorphisme, transportabilite interspecifique.
[Traduit par la Redaction]
Introduction
Microsatellites or simple sequence repeats (SSRs) are
DNA sequences consisting of tandemly repeated arrays of
short (16 nucleotides) motifs that frequently exhibit poly-
morphism between closely related genotypes. This polymorphism is probably due to slipped-strand mispairing, and
simple tandem repeats may be predisposed to further length
changes by unequal crossing over and (or) subsequent replication, repair, or recombination errors (Levinson and Gut-
Received 4 December 2006. Accepted 12 June 2007. Published on the NRC Research Press Web site at genome.nrc.ca on 3 August
2007.
Corresponding Editor: P. Gustafson.
I. Nagy,1 A. Stagel, and Z. Sasvari. Agricultural Biotechnology Center, Szent-Gyorgyi Albert u. 4, H-2100 Godollo, Hungary.
M. Roder and M. Ganal.2 Institut fur Pflanzengenetik und Kulturpflanzenforschung, Corrensstrae 3, D-06466 Gatersleben, Germany.
1Corresponding
2Present
doi:10.1139/G07-047
Nagy et al.
669
670
No.
Genotype
Inbred lines
Capsicum annuum
2
3
4
5
6
7
8
9
10
11
12
13
14
Wild Capsicum genotypes
C. frutescens with
C. chinense introgression
C. chinense
C. eximium
C. pubescens
15
Tabasco
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
chi1
chi7
chi21
chi25
exi1-1043
exi1-1052
pub2
pub3
pub5
pen1
pen6
pra1
cha6
cha9
bac10
bac20
fru39
fru48
34
35
36
WaWa700
4889
Desiree
Database
(23 174 entries)
Genomic libraries
(55 296 clones)
A
C
AC
AG
AT
AAC
AAG
AAT
ACC
ACG
ACT
AGC
AGG
ATC
CCG
AAAC
AAAG
AAAT
AACT
AAGG
ACAT
ACAG
ATAG
AATT
AGGG
AAAAG
ACTAT
Total
181
8
17
104
29
21
60
20
37
2
7
24
12
27
15
0
2
3
0.4
0.4
0
0
0
0.4
0.4
0.4
0.4
571.4
n.a.
n.a.
12
11
10{
0.5
0.5
7
2
0
0.5
0.3
0.3
0.3
n.d.
0.3
n.d.
n.d.
0.6
n.d.
0.3
0.3
0.3
n.d.
n.d.
n.d.
n.d.
46.2
k
X
Pi 2
i1
Nagy et al.
671
Table 3. Functionality of the newly developed microsatellite markers on a test panel consisting
of 33 Capsicum genotypes.
(PICa) for the information content in C. annuum (14 genotypes) and one (PICc) for all Capsicum genotypes from the
secondary test panel (33 genotypes belonging to 8 species,
including the 14 lines of C. annuum).
Cluster analysis
The presence or absence of each SSR allele was coded as
1 or 0, respectively. The resulting binary matrix was directly
used to construct unrooted phylogenetic trees with the DOLLOP program. Phenetic analysis was performed after calculating pairwise genetic similarity indices (Nei and Li 1979)
from the binary matrix, using the UPGMA method option
of the NEIGHBOR program. Consensus trees were reconstructed by the CONSENSE program; graphical rendering
of the dendrograms was processed by the DRAWTREE program (PHYLIP package, version 3.6, Felsenstein 2005).
Genomic libraries
Database sequences
No.
154
29
40
24
61
No.
257
42
109
20
86
%
18.8
26.0
15.6
39.6
%
16.3
42.4
7.8
33.5
672
Table 4. Allelic variation and information content of the informative GPMS markers.
Repeat
(AC)18
GPMS3
(CA)17(TA)21
GPMS4
(GT)8GCG(TG)5(TA)7
GPMS6
(AC)20imp(AT)5
GPMS8
(AT)9(GT)22
GPMS29
(GT)15GGT7(GTT)2
GPMS37
(TC)18(TA)12
GPMS93
(TA)14imp(GA)27imp
GPMS100
T5(GT)12
10
GPMS101
(TC)16imp
11
GPMS103
(CT)11
12
GPMS104
(AT)6(GT)11
13
GPMS107
(AT)7(GT)12
14
GPMS109
(AC)14(AT)7
15
GPMS111
(AT)7(GT)12
16
GPMS112
(AT)8(GT)19
17
GPMS113
(AT)20(AG)18
18
GPMS117
(TA)25(GA)14
19
GPMS119
(GT)11G13
20
GPMS140
(TA)7imp(CA)7(TA)4
No. of alleles
4
PICa
0.49
No. of alleles
11
262282
0.73
196216
0.71
122172
0.68
12
0.89
159229
0.77
14
0.87
238255
0.60
0.83
176182
0.60
0.67
202268
0.44
12
0.86
141169
0.57
0.83
176211
0.46
0.67
134149
0.70
96123
0.13
0.72
177182
0.13
0.62
165183
0.13
0.48
120132
0.17
203280
0.76
10
0.86
91172
0.86
15
0.90
111177
0.76
11
0.86
204224
0.54
0.81
169199
0.63
PICc
0.83
Code
GPMS1
No.
1
Repeat
(AC)7(CA)2(TA)5
22
GPMS142
(CA)6(TA)4
23
GPMS147
(CT)19
24
GPMS150
(AT)4(AC)9. . .(AT)33
25
GPMS151
(TC)13
26
GPMS153
(AC)10imp(AT)9
27
GPMS154
(CA)4A(CA)9
28
GPMS155
(TG)12impCA(TG)4
29
GPMS156
(GA)6(CA)6
30
GPMS157
(TG)7
31
GPMS159
(TAA)20
32
GPMS161
(AAT)25
33
GPMS162
(TA)2(TATG)4(TG)5
34
GPMS163
(AAC)5
35
GPMS164
(ACC)5
36
GPMS165
(TAA)35imp. . .(TA)26
37
GPMS166
(ACC)7
38
GPMS169
(ATT)5T(TTA)7
39
GPMS171
40
GPMS176
(TAT)13
41
GPMS178
(ATT)19
Allele size
range (bp)
100149
No. of alleles
1
PICa
0
No. of alleles
8
103125
0.56
184240
0.69
291299
0.58
183191
0.58
200225
0.26
165178
0.50
0.67
171173
0.13
0.06
228232
0.13
0.64
278305
0.68
281317
0.61
10
0.82
184259
0.8
12
0.89
305325
0.13
0.64
280291
0.57
230259
0.47
242317
0.61
13
0.87
193247
0.36
10
0.82
176220
0.52
0.78
288346
0.13
0.72
327345
0.13
0.70
230261
0.78
14
0.89
PICc
0.76
673
Code
GPMS141
No.
21
Nagy et al.
Table 4 (continued).
674
Table 4 (concluded).
Code
Repeat
42
GPMS181
(ATT)5. . .(ATT)11
43
GPMS183
(TC)9(TA)7
44
GPMS185
T12(GT)7
45
GPMS186
(TTA)14
46
GPMS187
(CA)6C4A36
47
GPMS188
(GGT)5(TGG)2
48
GPMS189
(AT)7imp(AC)7(AT)5
49
GPMS191
(GA)8
50
GPMS193
(CTT)8 . . .(CTT)10
51
GPMS194
(TA)17(GA)12
52
GPMS195
C11(CA)6
53
GPMS196
(TG)7(TA)6
54
GPMS197
(GA)3(TAT)16
55
GPMS198
T17(GT)9
56
GPMS199
(TA)16(GT)10(GA)16
57
GPMS200
58
GPMS201
(AC)4AT(AC)3
59
GPMS202
(AT)10. . .(AT)4(TG)9
60
GPMS203
(CAC)6
61
GPMS205
(AC)11(AT)5
Mean
Allele size
range (bp)
No. of alleles
PICa
No. of alleles
166214
0.13
0.64
195227
0.70
238242
0.25
0.71
161180
0.52
219246
0.71
121201
0.70
290321
0.68
261286
0.54
313335
0.65
216270
0.62
10
0.81
231321
10
0.77
159208
0.13
0.72
272344
0.81
10
0.82
252279
0.13
0.77
279371
12
0.74
233243
0.44
247250
0.17
252305
0.56
10
0.87
195210
0.68
313404
0.13
10
0.78
2.41
0.27
7.28
PICc
0.68
No.
Nagy et al.
Table 5. Allelic variation and information content of the informative EPMS markers.
Code
EPMS303
Repeat
(TA)25
EPMS305
(CTT)3(CAT)9
EPMS309
(CTT)6
EPMS310
(CAT)13
EPMS316
(AAT)8
EPMS327
(CGT)7
EPMS330
(AT)10
EPMS331
(CA)10
EPMS335
(ACAT)3(AT)17
10
EPMS340
(AT)13
11
EPMS342
(CTT)7
12
EPMS343
(CAT)6
13
EPMS345
(AAG)7
14
EPMS349
(GAT)6
15
EPMS350
(CAT)2(CAA)7
16
EPMS353
(CTG)6
17
EPMS366
(TA)8
18
EPMS369
(CAT)2. . .(CAT)6
19
EPMS372
(TA)8
20
EPMS373
(CAT)6
Capsicum spp.
(33 genotypes)
No. of alleles
8
PICa
0.85
No. of alleles
9
94115
0.57
0.83
224250
0.07
0.70
140172
0.44
0.81
155204
0.13
0.31
98114
0.36
0.76
257268
0.13
0.67
92107
0.70
0.75
286330
0.66
10
0.84
261288
0.24
0.52
323343
0.50
0.78
129173
0.24
0.77
154163
0.50
223235
0.07
0.62
96106
0.3
0.75
299317
0.13
0.70
171196
0.73
347356
0.50
0.71
321325
0.24
0.60
177190
0.60
Allele size
range (bp)
291330
PICc
0.84
675
No.
1
Capsicum annuum
(14 genotypes)
676
Table 5 (continued).
Repeat
21
EPMS374
(CAA)6CAG(CAA)3
22
EPMS376
(CAA)6
23
EPMS377
(AG)11
24
EPMS378
(CGG)7(AGG)2
25
EPMS382
(ACA)9
26
EPMS386
(CA)15
27
EPMS387
(AAT)2AGT(AAT)6
28
EPMS390
(ATT)8
29
EPMS391
(AT)9
30
EPMS395
(CCG)6
31
EPMS396
(CAT)6
32
EPMS397
(CA)20
33
EPMS399
(AAT)8
34
EPMS402
(CCT)3(CTT)6(CCT)
35
EPMS404
(CTT)12
36
EPMS409
(CAT)7
37
EPMS410
(CT)14(CA)16 imp
38
EPMS411
(TA)9(GATA)3
39
EPMS412
(AT)9
40
EPMS413
(CCA)9
Capsicum spp.
(33 genotypes)
Allele size
range (bp)
No. of alleles
PICa
No. of alleles
206215
0.69
235259
0.67
0.85
134161
0.50
0.72
101110
0.58
154166
0.70
122170
0.62
11
0.86
162199
0.5
10
0.80
113131
0.41
0.69
177187
0.5
0.79
116131
0.24
0.46
223231
0.57
0.67
102117
0.54
0.76
144175
0.37
0.81
197216
0.47
0.74
215245
0.47
0.80
162180
0.26
0.72
149187
0.54
10
0.85
333373
0.66
205213
0.49
0.77
252268
0.56
PICc
Code
No.
Capsicum annuum
(14 genotypes)
Nagy et al.
Table 5 (continued).
Repeat
41
EPMS414
(AT)10
42
EPMS415
(CCA)CA(CCA)7
43
EPMS416
(CTT)10T(CTT)2
44
EPMS417
(TC)9
45
EPMS418
(CA)10
46
EPMS419
(AAT)6
47
EPMS420
(CCG)9
48
EPMS421
(CCG)6
49
EPMS424
(CCT)6
50
EPMS426
(AT)15
51
EPMS427
(AAT)6(AAG)
52
EPMS428
(GAT)6
53
EPMS429
(ATC)8
54
EPMS430
(GAA)5ACA(GAA)
55
EPMS438
(ATT)4(TTA)2
56
EPMS439
(TC)6. . .(TC)7
57
EPMS440
(CT)7
58
EPMS441
(AG)11
59
EPMS443
(TG)13imp(TA)4
60
EPMS446
(CAA)AG(CAA)6
Capsicum spp.
(33 genotypes)
Allele size
range (bp)
No. of alleles
PICa
No. of alleles
140186
0.07
0.77
212215
0.50
131154
0.36
0.79
110126
0.72
0.78
178210
0.84
12
0.88
224248
0.46
0.78
110116
0.55
236258
0.29
0.77
148160
0.53
108118
0.60
0.62
113131
0.41
0.75
318327
0.53
136180
0.26
0.66
255258
0.49
320332
0.46
246250
0.41
0.35
178192
0.49
116124
0.74
0.82
190196
0.34
0.58
328338
0.50
0.56
PICc
677
Code
No.
Capsicum annuum
(14 genotypes)
678
Table 5 (continued).
Repeat
61
EPMS448
(TAA)7
62
EPMS449
(TTA)7imp
63
EPMS451
(AT)9
64
EPMS472
T16
65
EPMS480
C12A7
66
EPMS490
T17
67
EPMS492
A7CA16
68
EPMS497
G14
69
EPMS501
T20
70
EPMS507
A30
71
EPMS514
A18
72
EPMS538
(CAT)4imp(CAC)2(CAT)3
73
EPMS539
(TC)11
74
EPMS540
(TGG)GG(TGG)5
75
EPMS542
(TC)10
76
EPMS543
(TC)4(CT)13
77
EPMS546
(ACC)7imp
78
EPMS547
(AG)7
79
EPMS549
(ACC)4TC(ACC)4
80
EPMS554
(TC)6T8
Capsicum spp.
(33 genotypes)
Allele size
range (bp)
No. of alleles
PICa
No. of alleles
127161
0.76
134147
0.17
166184
0.36
0.80
289320
0.61
0.80
241255
0.69
0.69
216222
0.49
0.64
187202
0.46
236248
0.50
0.78
166180
0.69
10
0.88
196254
0.07
0.56
251267
0.13
0.73
118119
0.13
0.50
241254
0.46
0.80
188206
0.13
0.69
168228
0.68
10
0.89
101111
0.72
219230
0.46
0.68
95108
0.55
150160
0.58
209199
0.60
PICc
Code
No.
Capsicum annuum
(14 genotypes)
0.67
5.86
0.30
2.40
0.44
4
0
1
190208
0.75
4
0.49
2
167177
0.77
5
0.46
2
110117
0.33
2
0.13
2
118120
0.63
3
0
1
241243
3
0
1
183186
No. of alleles
PICa
EPMS563
86
Mean
(GA)9
EPMS562
85
(TC)11
EPMS561
84
(CT)4CC(CT)6
EPMS558
83
(CTT)5imp(CT)4
EPMS557
82
(CT)2T(CT)6
EPMS556
81
(CCT)5(CTT)2
Code
No.
Table 5 (concluded).
Repeat
Allele size
range (bp)
No. of alleles
Capsicum spp.
(33 genotypes)
Capsicum annuum
(14 genotypes)
0.17
679
PICc
Nagy et al.
680
Fig. 1. Polymorphism of microsatellite markers EPMS386 (A) and EPMS382 (B). Pseudogel images were produced from data obtained with
an ABI PRISM1 3100 capillary sequencer. Genotypes and their order are as shown in Table 1.
Nagy et al.
681
Fig. 2. A phylogenetic tree of 33 Capsicum genotypes based on polymorphism of 147 informative microsatellite markers (see explanations
in the text).
682
No.
1
Code
EPMS303
Accession
No.
BD076366
EPMS305
EST homologs
Unigene
No.
Tomato
Potato
BI926274
CK278805
EPMS309
BM061028
U205213
EPMS310
BM061162
U199235
EPMS316
BM062268
U199490
EPMS327
BM063302
EPMS330
BM063625
U197030
U199197
U197581
EPMS331
BM063830
U199259
U196576
EPMS335
BM064021
U203480
10
11
EPMS340
EPMS342
BM064640
BM064842
U202270
U202848
12
EPMS343
BM064867
U199025
13
14
15
EPMS345
EPMS349
EPMS350
BM065407
BM066130
BM066247
U197842
U201952
U197448
16
EPMS353
BM067271
U196304
17
EPMS366
CA514758
U200913
18
19
20
21
EPMS369
EPMS372
EPMS373
EPMS374
CA515055
CA515633
CA515649
CA516096
U203361
U205108
U203547
22
EPMS376
CA516334
U198266
23
EPMS377
CA516439
U197562
24
EPMS378
CA516454
U199131
BLAST annotation
CAN131456 Capsicum annuum pap gene,
exons 13
Tetratricopeptide repeat (TPR)-containing protein (Arabidopsis thaliana)
Expressed protein T45782 hypothetical protein
(Arabidopsis thaliana)
Lactoylglutathione lyase family protein F8468
(Arabidopsis thaliana)
Wun1 wound-induced protein JQ0398
(Solanum tuberosum)
Fiber protein Fb19 (Gossypium barbadense)
AW040246
BQ508385*
BG599565*
AW441453
BQ506266
AI486006
BI435779
BW692801
BQ512574*
BQ506610*
BG642614
BM109649*
BG596055*
AW443210
BM404104
U204013
BM059622
Nagy et al.
Table 6 (continued).
Code
Accession
No.
25
26
27
28
EPMS382
EPMS386
EPMS387
EPMS390
CA517071
CA517699
CA518417
CA519104
29
EPMS391
CA519548
30
31
EPMS395
EPMS396
CA520931
CA521318
32
33
34
EPMS397
EPMS399
EPMS402
CA521689
CA522816
CA523427
35
EPMS404
CA524065
36
EPMS409
CA525246
37
38
39
EPMS410
EPMS411
EPMS412
CA525390
CA525873
CA526181
40
EPMS413
CA526196
41
42
EPMS414
EPMS415
CA526211
CA514621
43
EPMS416
CA515656
44
45
EPMS417
EPMS418
CA514714
CA516044
46
EPMS419
CA523208
47
48
EPMS420
EPMS421
CA513842
CA514272
EST homologs
Tomato
Potato{
CK246344
AW622335
AI484478
BQ508386
AW647879
BQ517359
BE923711
AW617461
CK270861
AY423549{
BG096749{
BI203623
BM113158
AW039723*
AI775382*
BQ516501*
AI896317{
BI176058*
BQ512883{
683
No.
U205221
U202660
Glyoxalase/bleomycin resistance protein (Oryza
sativa)
U200675
Expressed protein At1g26948 (Arabidopsis
thaliana)
U198176
Imidazoleglycerolphosphate dehydratase
(Arabidopsis thaliana)
U196920
GTP-binding protein Rab6 (Nicotiana tabacum)
U203723
U204981
PAP2 AUX/IAA family phytochrome-associated
protein (Arabidopsis thaliana)
U204859
LRR-containing F-box protein AtFBL2
(Arabidopsis thaliana)
U198459
Thioredoxin family protein At2g35010
(Arabidopsis thaliana)
U201498
U198173
U198626
Isomerase-like protein AT4g15940/dl4011w
(Arabidopsis thaliana)
U196306
Proline-rich glycoprotein T01365 related
(Arabidopsis thaliana)
U198796
MADS-box protein 9 (Petunia hybrida)
U196307
U199066
U196306
Proline-rich glycoprotein T01365 related
(Arabidopsis thaliana)
U199271
Aux/IAA protein (Solanum tuberosum)
U199272
U196308
Fructokinase (Solanum lycopersicon)
U196436
Ethylene-responsive protein1 S60047 (Hevea
brasiliensis)
U198464
Expressed protein At3g13227.1 68410.m01487
(Arabidopsis thaliana)
U196452
Translation-inhibitor protein (Gentiana triflora)
U198010
T11855 DnaJ protein homolog (Phaseolus
vulgaris)
684
Table 6 (continued).
Code
EPMS424
Accession
No.
CA514316
50
51
EPMS426
EPMS427
CA517376
CA519128
52
EPMS428
CA523880
53
EPMS429
CA525246
54
EPMS430
BM063052
55
EPMS438
CA847350
56
EPMS439
CA847439
57
EPMS440
CA847460
58
EPMS441
CA847465
59
EPMS443
CA847580
60
EPMS446
CA847612
61
62
63
EPMS448
EPMS449
EPMS451
CB164897
CB165039
CF270070
64
EPMS472
BM061910
65
EPMS480
BM062655
U204870
U198888
U196545
EST homologs
Tomato
Potato{
BI926607*
BM406842*
BI203623*
BM113158*
AW616965*
BI175800*
AW039917
BE923988
CO907720
U197913
U205114
DN980489
AI491087
BE432717*
BI178058
No.
49
U196493
Glyceraldehyde-3-phosphate dehydrogenase,
cytosolic (Arabidopsis thaliana)
U200239
RNA recognition motif (RRM)-containing protein (Arabidopsis thaliana)
U198459
Thioredoxin family protein T00482
(Arabidopsis thaliana)
U202943
Expressed protein At3g13940.1 68410.m01579
(Arabidopsis thaliana)
Nagy et al.
Table 6 (concluded).
Related pepper unigenes
Unigene
No.
BLAST annotation
U198650
Glutamyl-tRNA amidotransferase (Arabidopsis
thaliana)
U198040
No.
66
Code
EPMS490
Accession
No.
BM066956
67
68
69
70
71
72
73
EPMS492
EPMS497
EPMS501
EPMS507
EPMS514
EPMS538
EPMS539
CA517063
CA522759
CA523558
CB164833
CB185070
BM068084
BM068544
74
75
76
77
EPMS540
EPMS542
EPMS543
EPMS546
CA514054
CA514614
CA515275
CA516696
U197649
U198029
U201138
U196717
U203478
78
EPMS547
CA516837
U197406
79
EPMS549
CA517430
U196997
80
81
82
83
EPMS554
EPMS556
EPMS557
EPMS558
CA523715
CA525274
CA525515
CA525592
U202664
U198622
U201040
U201943
84
85
EPMS561
EPMS562
BM068460
CA524381
U204797
86
EPMS563
BM061461
U196664
U196969
U196336
U198885
U199268
EST homologs
Tomato
AW218125
Potato{
CK248403
BG590247*
CK263920
AW995353
AI775385
BM407743
U199127
Note: Identifiers of pepper unigene builds and their BLAST annotations were determined using the Web-based utility of the SOL Genomics Network (http://www.sgn.cornell.edu).
*Microsatellite motif in conserved position.
{
Shorter or imperfect microsatellite motif in conserved position.
685
686
Fig. 3. Alignments of orthologous microsatellite loci. (A) EPMS375: CA516199 (Capsicum annuum), CK275337, CK259504, CK259503,
CK260251, CK260250 (Solanum tuberosum), AI772610, AI896145, BG134388, BI422142 (Solanum lycopersicum). (B) EPMS565:
BM065457 (Capsicum annuum), BQ513455, BM111568 (Solanum tuberosum), BE450047, AW650048, BE450054, AW625915,
AW647883, AW647884, BE450060 (Solanum lycopersicum).
query sequences were blasted against 9 554 clustered pepper unigenes available at the SOL Genomics Network
(http://www.sgn.cornell.edu). The results of the homology
searches are summarized in Table 6, which shows EMBL
accession numbers, descriptions of the pepper source sequences as well as the tomato and potato orthologs, and
annotations of the corresponding pepper unigenes.
Out of 86 polymorphic EPMS markers, 35 (40.6%) highly
conserved (at least 150 bit score, Expect value > e50) tomato and (or) potato counterparts were found. In the cases
of 12 potato and 12 tomato orthologs, the sequences contained homologous microsatellite motifs in conserved positions as well. It is interesting to note that in almost every
case the pepper ortholog contained the longest and the tomato ortholog the shortest microsatellite motif. This might
be an effect of the large genome size of Capsicum spp. in
relation to the other solanaceous species (see Fig. 3 for 2 examples). Cross-species transferability of microsatellite
markers has been demonstrated within several plant families
including Leguminosae (Peakall et al. 1998; Kolliker et al.
2001), Cucurbitaceae (Katzir et al. 1996), Triticaceae (Saghai Maroof et al. 1994; Roder et al. 1995; Thiel et al.
2003), and Solanaceae (Provan et al. 1996; Smulders et al.
1997).
From the highly saturated genetic maps of tomato and potato, important information can be derived for the positional
mapping of pepper genes as well. Earlier comparative mapping studies within the Solanaceae (Tanksley et al. 1988,
1992) indicated high synteny throughout the whole genome
of both tomato and potato, but more radical rearrangements
for the pepper chromosomes. For a precise understanding of
this phenomenon, a high number of transferable polymorphic markers and the improvement and further saturation of
Acknowledgements
The authors are indebted to Tatyana Areshchenkova for
her contribution to the development of genomic microsatellite markers and to Alain Palloix (INRA Montfavet, France),
Gabor Csillery (Budakert Ltd., Budapest, Hungary), and
Zsolt Sagi and Pal Salamon (Vegetable Crops Research Institute Station, ZKI, Budapest Hungary) for providing pepper inbred lines and wild species. This work was supported
by the Hungarian Ministry of Education NKFP program
#
Nagy et al.
References
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z.,
Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSIBLAST: a new generation of protein database search programs.
Nucleic Acids Res. 25: 33893402. doi:10.1093/nar/25.17.3389.
PMID:9254694.
Anderson, J.A., Churcill, G.A., Autrique, J.E., Tanksley, S.D., and
Sorrels, M.E. 1992. Optimizing parental selection for genetic
linkage maps. Genome, 36: 181186.
Areshchenkova, T., and Ganal, M.W. 1999. Long tomato microsatellites are predominantly associated with centromeric regions. Genome, 42: 536544. doi:10.1139/gen-42-3-536. PMID:10382301.
Areshchenkova, T., and Ganal, M.W. 2002. Comparative analysis of
polymorphism and chromosomal location of tomato microsatellite markers isolated from different sources. Theor. Appl. Genet.
104: 229235. doi:10.1007/s00122-001-0775-2. PMID:12582691.
Ben Chaim, A. 2005. SGN data overview: Capsicum annuum [online].
SOL Genomics Network. Available from http://www.sgn.cornell.
edu/content/sgn_data.pl#Capsicumannuum [cited 25 March 2005].
Benham, J.J. 2001. Genographer. Version 1.6 [computer program].
Available from http://hordeum.msu.montana.edu/genographer
Bennett, M.D., and Leitch, I.J.B. 2005. Nuclear DNA amounts in
angiosperms: progress, problems and prospects. Ann. Bot.
(Lond.), 95: 4590. doi:10.1093/aob/mci003. PMID:15596457.
Broun, P., and Tanksley, S.D. 1996. Characterization and genetic
mapping of simple repeat sequences in the tomato genome.
Mol. Gen. Genet. 250: 3949. PMID:8569686.
Chen, C., Zhou, P., Choi, Y.A., Huang, S., and Gmitter, F.G., Jr.
2006. Mining and characterizing microsatellites from citrus
ESTs. Theor. Appl. Genet. 112: 12481257. doi:10.1007/
s00122-006-0226-1. PMID:16474971.
Chin, E.C.L., Senior, M.L., Shu, H., and Smith, J.S.C. 1996. Maize
simple repetitive DNA sequences: abundance and allele variation. Genome, 39: 866873. PMID:8890517.
Cho, Y.G., Ishii, T., Temnykh, S., Chen, X., Lipovich, L., McCouch,
S.R., 2000. Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza sativa L.). Theor.
Appl. Genet. 100: 713722. doi:10.1007/s001220051343.
Danin-Poleg, Y., Reis, N., Tzuri, G., and Katzir, N. 2001. Development and characterization of microsatellite markers in Cucumis.
Theor. Appl. Genet. 102: 6172. doi:10.1007/s001220051618.
Dellaporta, S.L., Wood, J., and Hicks, J.B. 1983. Isolation of DNA
from higher plants. Plant Mol. Biol. Rep. 4: 1921.
Felsenstein, J. 2005. PHYLIP (Phylogeny Inference Package).
Version 3.6 [computer program]. Distributed by the author.
Department of Genome Sciences, University of Washington,
Seattle.
Frary, A., Xu, Y., Liu, J., Mitchell, S., Tedeshi, E., and Tanksley,
S. 2005. Development of a set of PCR-based anchor markers encompassing the tomato genome and evaluation of their usefulness
for genetics and breeding experiments. Theor. Appl. Genet. 111:
291312. doi:10.1007/s00122-005-2023-7. PMID:15926074.
Fulton, T.M., Van der Hoeven, R., Eannetta, N.T., and Tanksley,
S.D. 2002. Identification, analysis, and utilization of conserved
ortholog set markers for comparative genomics in higher plants.
Plant Cell, 14: 14571467. doi:10.1105/tpc.010479. PMID:
12119367.
Gonzalo, M.J., Oliver, M., Garcia-Mas, J., Monfort, A., DolcetSanjuan, R., Katzir, N., Arus, P., and Monforte, A.J. 2005.
687
Simple-sequence repeat markers used in merging linkage
maps of melon (Cucumis melo L.). Theor. Appl. Genet. 110:
802811. doi:10.1007/s00122-004-1814-6. PMID:15700148.
Greenleaf, F.J., Martin, J.A., Lease, J.G., Sims, E.T., and Van
Blaircom, L.O. 1970. Greenleaf Tabasco, a new tobacco etch
virus resistant Tabasco pepper variety (Capsicum frutescens
L.). Alabama Agricultural Experiment Station, Auburn University, Ala.
Gupta, P.K., Balyan, H.S., Edwards, K.J., Isaac, P., Korzun, V.,
Roder, M., et al. 2002. Genetic mapping of 66 new microsatellite
(SSR) loci in bread wheat. Theor. Appl. Genet. 105: 413422.
PMID:12582546.
He, C., Poysa, V., and Yu, K. 2003. Development and characterization of simple sequence repeat (SSR) markers and their use in
determining relationships among Lycopersicon esculentum cultivars. Theor. Appl. Genet. 106: 363373. PMID:12582864.
Hedrick, P.W. 1985. Genetics of populations. Jones and Bartlett
Publishers Inc., Boston, Mass.
Huang, S., Zhang, B., Milbourne, D., Cardle, L., Yang, G., and
Guo, J. 2000. Development of pepper SSR markers from sequence databases. Euphytica, 117: 163167.
Katti, M.V., Prabhakar, K.R., and Gupta, V.S. 2001. Differential
distribution of simple sequence repeats in eukaryotic genome sequences. Mol. Biol. Evol. 18: 11611167. PMID:11420357.
Katzir, N., Danin-Poleg, Y., Tzuri, G., Karchi, Z., Lavi, U., and
Cregan, P.B. 1996. Length polymorphism and homologies of
microsatellites in several Cucurbitaceae species. Theor. Appl.
Genet. 93: 12821290. doi:10.1007/s001220050367.
Jones, E.S., Dupal, M.P., Kolliker, R., and Drayton, M.C. 2001.
Development and characterization of simple sequence repeat
(SSR) markers for perennial ryegrass (Lolium perenne L.).
Theor. Appl. Genet. 102: 405415. doi:10.1007/s001220051661.
Kolliker, R., Jones, E.S., Drayton, M.C., Dupal, M.P., and Forster,
J.W. 2001. Development and characterization of simple sequence repeat (SSR) markers for white clover (Trifolium repens
L.). Theor. Appl. Genet. 102: 416424.
Lee, J.M., Nahm, S.H., Kim, Y.M., and Kim, B.D. 2004. Characterization and molecular genetic mapping of microsatellite loci
in pepper. Theor. Appl. Genet. 108: 619627. doi:10.1007/
s00122-003-1467-x. PMID:14647898.
Nei, M., and Li, W.H. 1979. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl.
Acad. Sci. U.S.A. 76: 52695273. doi:10.1073/pnas.76.10.5269.
PMID:291943.
Levinson, G., and Gutman, G.A. 1987. Slipped-strand mispairing: a
major mechanism for DNA sequence evolution. Mol. Biol. Evol.
17: 12101219.
Li, J.Z., Sjaktste, T.G., Roder, M.S., and Ganal, M.W. 2003. Development and genetic mapping of 127 new microsatellite markers
in barley. Theor. Appl. Genet. 107: 10211027. doi:10.1007/
s00122-003-1345-6. PMID:12879255.
Ogundiwin, E.A., Berke, T.F., Massoudi, M., Black, L.L., Huestis,
G., Choi, D., Lee, S., and Prince, J.P. 2005. Construction of 2
intraspecific linkage maps and identification of resistance QTLs
for Phytophthora capsici root-rot and foliar-blight diseases of
pepper (Capsicum annuum L.). Genome, 48: 698711. doi:10.
1139/g05-028. PMID:16094437.
Panaud, O., Chen, X., and McCouch, S.R. 1995. Frequency of
microsatellite sequences in rice (Oryza sativa L.). Genome,
38: 11701176. PMID:8654913.
Paran, I., Rouppe Van Der Voort, J., Lefebvre, V., Jahn, M., Landry,
L., Van Schriek, M., et al. 2004. An integrated genetic linkage
map of pepper (Capsicum spp.). Mol. Breed. 13: 251261.
doi:10.1023/B:MOLB.0000022526.30914.31.
#
688
Peakall, R., Gilmore, S., Keys, W., Morgante, M., and Rafalski, A.
1998. Cross-species amplification of soybean (Glycine max)
simple sequence repeats (SSRs) within the genus and other
legume genera: implications for the transferability of SSRs in
plants. Mol. Biol. Evol. 15: 12751287. PMID:9787434.
Pestsova, E., Ganal, M.W., and Roder, M.S. 2000. Isolation and
mapping of microsatellite markers specific for the D genome of
bread wheat. Genome, 43: 689697. doi:10.1139/gen-43-4-689.
PMID:10984182.
Pickersgill, B. 1988. The genus Capsicum: a multidisciplinary approach to the taxonomy of cultivated and wild plants. Biol. Zentbl.
107: 381389.
Pillen, K., Binder, A., Kreuzkam, B., Ramsay, L., Waugh, R.,
Forster, J., and Leon, J. 2000. Mapping new EMBL-derived
barley microsatellites and their use in differentiating German
barley cultivars. Theor. Appl. Genet. 101: 652660. doi:10.
1007/s001220051527.
Provan, J., Powell, W., and Waugh, R. 1996. Microsatellite analysis
of relationships within cultivated tomato (Solanum tuberosum).
Theor. Appl. Genet. 92: 10781084. doi:10.1007/s001220050233.
Ramsay, L., Macaulay, M., Ivanissevich, S.D., Maclean, K., Cardle, L., Fuller, J., et al. 2000. A simple sequence repeat-based
linkage map of barley. Genetics, 156: 19972005. PMID:
11102390.
Rafalski, J.A., Vogel, J.M., Morgante, M., Powell, W., Andre, C.,
and Tingey, S.V. 1996. Generating and using DNA markers in
plants. In Non-mammalian genomic analysis: a practical guide.
Edited by B. Birren and E. Lai. Academic Press Inc., San Diego.
pp. 75135.
Rice, P., Longden, I., and Bleasby, A. 2000. EMBOSS: The European
Molecular Biology Open Software Suite. Trends Genet. 16: 276
277. doi:10.1016/S0168-9525(00)02024-2. PMID:10827456.
Roder, M.S., Plaschke, J., Konig, S.U., Borner, A., Sorrels, M.E.,
Tanksley, S.D., and Ganal, M.W. 1995. Abundance, variability
and chromosomal location of microsatellites in wheat. Mol. Gen.
Genet. 246: 327333. doi:10.1007/BF00288605. PMID:7854317.
Roder, M.S., Korzun, V., Wendehake, K., Plaschke, J., Tixier, M.-H.,
Leroy, P., and Ganal, M.W. 1998. A microsatellite map of
wheat. Genetics, 149: 20072023. PMID:9691054.
Rozen, S., and Skaletsky, H.J. 2000. Primer3 on the WWW for
general users and for biologist programmers. In Bioinformatics
methods and protocols: methods in molecular biology. Edited
by S. Krawetz and S. Misener. Humana Press, Totowa, N.J.
pp. 365386. Source code available from http://primer3.
sourceforge.net/.
Saghai Maroof, M.A., Biyashev, R.M., Yang, G.P., Zhang, Q., and
Allard, R.W. 1994. Extraordinarly polymorphic microsatellite
DNA in barley species diversity, chromosomal locations and
population dynamics. Proc. Natl. Acad. Sci. U.S.A. 91: 5466
5470. doi:10.1073/pnas.91.12.5466. PMID:8202509.