Outline
1. Informatics in MAS 2. Rice genome sequence information for mapping, tagging and map-based cloning of genes 3. Mining novel alleles of cloned genes in the germplasm 4. Synteny and colinearity- transferring rice genome information to wheat 5. Databases and web-based tools to assist breeders 6. High throughput genotyping to save time and cost 7. Proposed activities for the Indo-Aus wheat MAS network
1. Informatics in MAS
Phase 4
Rice
Phase 3
Tomato Soybean Sorghum Medicago Brassica
High throughput gene/ marker discovery Large scale genome sequencing Pilot genome sequencing Gene based markers Map based cloning Identify DNA markers linked with traits
(Simple/ QTL)
Functional genomics
Genotype Transcriptome Proteome Metabolome Phenome
Phase 2
Maize Wheat Cotton Sugarcane
Phase1
Chickpea Pigeonpea Mango Banana
Basic Resources : Germplasm, mutant lines, knock outs, mapping populations, GSTs, ESTs, BAC libraries, BAC-end sequences, Bioinformatics
Arrow of Time
Plants
Source: NCBI
Barley 6000 Mb
Sorghum 1000 Mb
Wheat 16000 Mb
Human 3000 Mb
Microbes 5 Mb
What is needed?
(for Molecular Breeding in Orphan Crops)
2. Rice genome sequence information for mapping, tagging and map-based cloning of genes
Size = 388.8 Mb
Fine mapping of QTLs, and expression profiling (micro array/ proteomics) of genes for complex agronomic traits
Effect of 100 mM NaCl on salt susceptible (MI 48) and salt tolerant (CSR 27) varieties of rice
Frequency
10
15
20
25
30
0 P2
P1 P2 RILs
Grain length
5. 51 -5 .7 1 5. 91 -6 .1 1 6. 31 -6 .5 1 6. 71 -6 .9 1 7. 11 -7 .3 1 7. 51 -7 .7 1 7. 91 -8 .1 1 8. 31 -8 .5 1 8. 71 -8 .9 1 9. 11 -9 .3 1
Salt tolerance
Effect of 100 mM NaCl on salt susceptible (MI 48) and salt tolerant (CSR 27) varieties of rice
(7400 genes)
(18 genes)
28 MB
~1 MB
142kb
TAC
ATT
S 990
Poly A
P2
-64 -47 T1
TATA GT1 BOX
AAcAAA
Motiff
990 bp ORF
TAC
HP2216
P1 -343 WUN-1 -269 - 221 -101 MeJA resp CAAT BOX Element
ATT
S 990
Poly A
P2
-64 -47 G1
TATA GT1 BOX
AAcAAA
Motiff
Gene structure
Table: Growing number of genes for important agronomic traits cloned recently making use of the rice genome sequence information
S. no. 1 Trait Bacterial leaf blight resistance Plant height Amylose content Grain number Salt tolerance Grain aroma Blast resistance Submergence tolerance Lodging tolerance Seed shattering Gene Xa 21 (NBS-LRR type receptor kinase) Sd 1 (gibberellin-20-oxidase) Sbe 3 (starch branching enzymes) OsCKX2 (cytokinin oxydase) SKC1 (a HKT type transporter) BAD2 (betaine aldehyde dehydrogenase 2) Pikh (NBS-LRR type protein) Sub1 (Ethylene response factorlike) Lsi 1 (Silicon transporter) qSH 1 (BEL1-type homeobox) Reference Song et al. (1995)
2 3 4 5 6 7 8 9 10
Sasaki et al. (2002) Liu et al. (2004) Ashikari et al. (2005) Ren et al. (2005) Bradburry et al. (2005) Sharma et al. (2005) Xu et al. (2006) Ma et. (2006) Konishi et al. (2006)
Resistant:
O. punctata, O. latifolia, O. officinalis, O. rhizomatis, Coloro, Jatto, K-60, Fukunishiki and Tetap
Lesions
3 4 5
6 7
8 9 10 11 12 13 14
PCR amplification of Pi-Kh allele from rice lines and wild Oryza species
M UC 1 2 3 4 5 6 7 8 UC M bp
4361 2322 2027
M pUC 1 2 3 4 5 6
(B)
of
Tetep HP2216 Coloro K-60 Jatto Co-39 Bhrigudhan Fukunishiki Nipponbare O. nivara O .rufipogon O. latifolia O.officinalis O. rhizomatis O. punctata O. minuta O.gradiglumis 1bp 1765 bp
ORFs and number of exons predicted in Pi-kh gene isolated from different O. sativa lines and wild species
80
Number of SNPs
40
20
Bh Jat rig to ud ha Fu n ku n N i sh ip k po i nb ar O .n e iv ar a
O .m in
.r O
Indica
Japonica
Lines/Species
Wild species
O .p
o 0 9 16 -3 olor -6 22 o K C C P H
on g o fi p
ut
un
ct a
ta
Phylogenetic relationship among Pi-kh genes amplified from different wild species of rice.
An overview of the sequence contigs of the badh1 gene of 16 rice varieties based on sequence reads obtained using 16 pair of primers, assembled using Phred/Phrap/Consed software)
Screen shot of Consed window showing location of one of the 20 SNPs discovered by sequencing of the badh1 gene fragments from 16 rice varieties.
Location of PCR primers (reverse primer underlined) and 20 SNPs (highlighted) in the badh1 gene of rice. The gene has 15 exons (in bold) and 14 introns
Summary of the SNP alleles in 16 rice varieties and the reference variety Nipponbare at 20 different positions in the badh1 gene, starting from the ATG codon of Nipponbare..
SNP: BADH1- S1
S2
S3
S4
S5
S6
S7
S8
S9
S10
S11
S12
S13
S14
S15
S16
S17
S18
S19 S20
45 40 35
No. of G enes
30 25 20 15 10 5 0 1 2 3 4 5 6 7
Genome wide analysis of homology between 56298 predicted rice gene CDS (from the IRGSP sequence) and 39,813 wheat EST contigs (from wheat SNP consortium, build 3), plus 3792 binmapped wheat EST contigs (USDA-NSF wheat genome project, version Aug. 03)
sorghum
maize
Distribution of Copy Number among the 4659 Rice Gene Homologs of Wheat
A. Rice chr 1/ wheat chr 3 B. Rice chr 2/ wheat chr 6 C. Rice chr 6/ wheat chr 8
Wheat Chromosomes
1.
10 11 12
Rice Chromosomes
Predicting wheat bin location of 6178 unmapped single copy rice gene homologs
Validation of the predicted wheat bin location of unmapped single copy rice genes
35% of the 213 single copy rice gene homologs, representing all 12 rice and all 7 wheat chromosomes, mapped to their predicted bin location
Total genes 7378 5770 6544 5831 4977 5071 4858 4536 3561 3933 4436 4355 61250
R-genes (%) 1.56 1.46 0.7 0.65 1.08 1.24 1.28 1.37 1.24 0.99 2.59 1.22 15.38
Def. Response (%) 0.38 0.17 0.43 0.07 0.4 0.32 0.23 0.29 0.22 0.2 0.25 0.23 3.19
Chr 1
Chr 2
Chr 3
Chr 4
Chr 5
Chr 6
Chr 7
Chr 8
Chr 9
Chr 10
Chr11
Chr 12
A total of 176 R- and DR-genes clusters identified
To sum up
VansanuDhan-A Rice Genome Database created at NRCPB contains 562898 genes info. It is being used in functional and comparative genome analysis in rice. We found 942 R-gene and Defense Response gene like sequences in the rice genome. The physical location and orientation of each gene delineated. Comparative analysis of indica- japonica sequences helped us in mapping and cloning of a new Rice blast gene Pi-kh. Analysis of Pi-kh locus in indica- japonica provided an insight in the presence of SSR elements in this region which may play an important role in shuffling of genes in the genome. Extensive sequence variation was observed between the Pi-kh alleles amplified from wild species and land races of rice. These alleles are being used in functional validation experiments.
Multipex SNP assays using Sequenom MassARRAY system Sequence of flanking pre-amplification primers (PCRP) and single nucleotide extension primers (UEP) for genotyping of 20 SNPs of badh1 gene
C = 273 Da
100
Difference in peak size reveals the SNP allele for that rice variety
80
60
40
20
0 4300
4400
4500
4600
4700
4800
4900
5000
Genotyping
Pusa NPT11
Pusa 1342
Proposed Activities
Database of markers and cloned genes
ESTs, GSS, HTGS, SSRs,, SNPs, Traits