编辑 | 新锐恒丰研究院

37:05

(植物数量性状CRISPR育种方法)

字幕:right good morning can you hear me well

in the back all right well thank you so

much to the organizers for having me

here this morning it is a pleasure to be

at k-state I guess right away because my

favorite sister lives and works here so

it's been great to see her and also you

might not have known this but my mom was

a k-state graduate student back in the

s and she brought our family along

when I was a six month old infant until

I was three years old so it's nice to be

back in town although I don't really

remember much friend that from that time

I'd like to talk this morning about the

use possible use of CRISPR technology in

breeding for quantitative traits in

plants as dr. Fritz mentioned I've been

thinking a lot about what you might call

us bandwagons in plant breeding a

bandwagon is an idea activity or cause

that becomes increasingly popular or

fashionable as more and more people

adopt it bandwagons may have a negative

connotation but there have been some

wildly successful bandwagons over the

years

hybrid maize for example is an example

of a wildly successful bandwagon when it

was introduced back in the s and so

forth people weren't quite sure whether

this is something that would be feasible

but then as people begins researchers

began to study the bandwagon of hybrid

maize it became adopted more and more

and then the useful aspects of hybrid

mase became incorporated in our body of

knowledge to the extent that none of us

who are living here and now think of

hybrid maize as a bandwagon I've been

fortunate to have to spend my career in

plant breeding and genetics during the

time when many new technologies were

coming to the fore for example dr. Fritz

mentioned RFLPs I remember those days

actually about few months ago I was

looking back into isozymes think about

it I so science because we were doing

work in it from the work it show that we

only need markers in maizefor

genomic selection and I said hey we can

do that with isozymes I soon found you

can't order those chemicals for isozymes

anymore so we're back to the more

- DNA markers if you think of a

bandwagon who this is for QTL mapping

for example there in the there usually

is an initial phase of slow development

followed by a phase of rapid excitement

and funding so in the s for example

QTL mapping was very popular and then it

goes down a little bit because we

realized what the technology can do and

what it cannot do and so we have this

bit of disillusionment until we come to

an equilibrium we know this is a mature

technology we know how to use it and as

we might become discouraged while that

is going on the fortunate thing is that

a new bandwagon comes along and it holds

our attention for a while and so we're

in this phase in our work in which new

technologies come along and we need to

critically evaluate what are the pluses

and minuses of this new technology what

are the things it can deliver and what

are the things that we thought it can

deliver but in hindsight it cannot

deliver I've driven some bandwagons

myself and one of which is genomic

selection or genome-wide selection as we

know as plant breeders and geneticists

you have this basic equation in which

your phenotypic value is C equal to your

genotypic value which we cannot see but

we infer and some random noise or

non-genetic effects what we basically do

in genome-wide selection is that we use

a whole bunch of markers spread

throughout the genome and we estimate an

effect associated with each of those

markers and when we take the sum of

those markers that become that then

becomes a surrogate for our genotypic

value so we're simply saying instead of

inferring genotypic values from

phenotypic data we're trying to infer it

from these joint marker effects and so I

work in maize breeding and genetics

we've been at this whole idea of

genome-wide selection in maize for about

a decade now and in in essence this is

how maize breeding looks like these days

you cross an inbred inbred a with inbred

bee breeders who then typically induce

doubled haploid

from f or f plants because it's a

hybrid crop we need to evaluate the

performance of lines in hybrid

combination rather than the performance

of lines by itself and then those lines

that perform best in hybrid combinatio

for example J and K then become released

as new inbred lines and so we asked the

question during which stages of the

breeding program should we apply a

genome-wide selection and a quick answer

is it will be most efficient if you can

use genome wide selection in those

stages of the breeding program in which

phenotypic selection is ineffective I'll

say that again because that is an

important point for any breeding program

you need to look at your breeding

program and see what are those faces in

the breeding program in which phenotypic

selection is ineffective because that's

where you would get the most bang for

your dollar and so selection for hybrid

yield among f plants is totally

ineffective in maize because of hybrid

vigor the performance of an individual

plant is not indicative of its hybrid

performance plus you're looking at a

single individual f plant so the

question is well if we can do phenotypic

selection among f plants during which I

mean genome-wide selection among f

plants how much gain can we get and so

the system we've been working at is if I

have I'm trying to predict the

performance of progeny in the cross a by

B what we can do is go back to our

existing data and find all previous

populations that have a as a parent or B

as a parent and then we pull all of

these prior populations for which data

are already existing with a as one

parent and BS one parent as our training

population to predict the performance of

a by B the nice thing about the system

is you have in it you immediately have a

strong relatedness between the training

population and your test population and

so if you have another cross C by D it

means you find all crosses prior

populations with C as a parent and D as

a parent and you make your

prediction equation again and we found

that when we use this system we

basically get about eighty-five percent

of the gains that we would eventually

get from phenotypic selection to put

that in terms of cost genotyping is

about or percent of the Cross of

the cost of multi location phenotyping

so in this system were able to say that

we can get about % of the gains from

phenotypic selection at about to

percent of the cost and so that along

with other studies in other species has

led us through the conclusion basically

that like phenotypic selection geno

genome-wide selection works it doesn't

work all the time in the same way that

phenotypic selection doesn't work all

the time

genome-wide selection doesn't work all

the time either but if we use it on a

routine basis then it works quite well a

poster child or a poster bull in this

case is this number one bull in the land

from a few years ago the bull's name is

badger fluff Fannie Freddie Badger fluff

funny

Fannie Freddie was predicted to be the

number one bull in the land even before

he got one progeny it was based on

genome-wide predictions and sure enough

when his progeny records came it turned

out that he was indeed the top producing

bull in the land so so now that we can

say that about that that bandwagon there

are new technologies that have come

along such as genome editing

specifically and among others the use of

CRISPR technology that has allowed us

that has allowed scientists to make

precise changes in the DNA of known

genes and so as I thought about this how

could we use CRISPR technology in

breeding for different types of traits

I'm wondering whether there is a strong

parallel between the use of CRISPR

technology and QTL mapping in terms of

application for example in QTL mapping

we have found that QTL mapping is most

effective for those types of traits that

are controlled by few relative

if you QTL and for which these QTL have

a major effect things like disease

resistance plant morphology flowering

date and those sorts of traits in this

case the breeding approach becomes one

of introgression either by pedigree

breeding or by backrests crossbreeding

in other words you could think of these

major qtls like lego building blocks and

if i have five or six or ten lego

building blocks I can put these QTL my

LEGO building blocks into a designed

product and that would be my variety

however if I have a very complex trait

such as grain yield which I am most

interested in by definition these traits

we think are controlled by a lot more

QTL and because they are controlled by a

lot more QTL naturally the effects

associated with each quantitative trait

locus would be smaller and because they

are smaller they're harder to identify

and the effects are different are more

difficult to estimate and even if we

were successful in doing so putting all

of these or QTL together into one

genotype quickly it's a sinner

impossibility and that's why we have

resorted to more of a prediction

approach such as genome-wide selection

as opposed to a design approach with

major QTL and so I'm wondering can we

draw that same parallel then with CRISPR

technology if we knew the genes that we

need to edit and we knew the edits that

we need to make then it becomes more of

a design approach but how do we use

CRISPR technology if we have a trait

such as grain yield or lodging or

maturity for which there are many

unknown genes and so therefore even

identifying the genes to edit would be

would be quite quite challenging so

along these lines I've been thinking of

chromosome of genome-wide marker effects

not for the purpose of selection but

first of all for the purpose of getting

to know our germ plasm better and one

way of doing it is to think of

chromosome wide marker effects it's to

think of a chromosome carried by a line

as a package of inherited units not an

individual

package but a package nevertheless

because it is quite often that a whole

chromosome especially with doubled

haploids would be passed intact from a

parent to an offspring because of the

limited recombination so for example if

I have lined one and I have violet

allelic snips and I have lined one with

eight molecular markers and the marker

effects are indicated there blue would

be positive effects and red would be

negative so one would be the negative of

the other so we look at line two it has

eight chromosome eight markers we add up

the effects of the marker alleles it

carries we say the value of line one on

that chromosome is point two and then

the value of line two on that chromosome

is higher at two and so a student and I

a postdoc and I Andy Thompson have done

this to examine what we call as germ

plasm architecture germ plasm

architecture just means how the alleles

known are unknown for a different

different trader organized or or in the

germ plasm and so in this case here what

we have on on the on the on the x on the

y axis are four different traits and

thesis day down to northern corn leaf

blight and you see the chromosomes

numbered one to ten and on the x axis

are different maze lines about two

hundred seventy one of them and then the

subgroupings population subgroupings b

group misery seventeen group are found

on the top and so this is basically a

heat map it's a heat map if you would of

chromosome white marker effects so if

blue is good and we want blue we could

go down the heat map and look at the

line of interest is there a line for

example that is mostly blue but it had

would have one or two chromosomes in red

and therefore by a chromosome

substitution approach we can come up

with a better line just by substituting

that entire chromosome in that deficient

line where the chromosome from a donor

so again we see this as a way of

organizing

our germ plasm but when I begin to look

at this I go well yes these are

chromosomes but as I said chromosomes

are not indivisible what if we can begin

to divide our chromosomes the way we

want and so therefore this heart can be

back to a slide that I gave at a seminar

about years ago when I wasthinking

of how a car engineer thus does his or

her work and how that differs from our

approach in reading so if you're a car

engineer you do more of a design

approach you take all your knowledge

about cars mechanical engineering

drivers and so forth and you use that

knowledge to come up piece by piece with

a better car however if a breeder were

to design a car the way he or she breeds

plants here's how the breeder would work

a breeder would take two

high-performance cars a Ferrari and a

Porsche would create thousands of cars

that exhibit different Ferrari Porsche

combinations or lamborghini combinations

select the best shuffled car and do that

process again why is that so it's we

could choose the parents to cross the

Ferrari in the poorest Porsche but once

we make that cross between two corn

lines or to wheat lines we cannot really

control what happens there and so

therefore we rely on that natural

shuffling process and we keep on doing

the same thing and selecting the best

shuffle car I think we might be at an

age when we might not have to be limited

to this sort of of

work of going about things because I

think we might be able to start thinking

of what we would call as targeted

recombination targeted recombination

it's the possibility of saying I want a

recombination here and here and here and

here and so how would this work well for

example if I take these two lines and I

have line one it's the same one I had

before with a performance of a point two

and I have line and line and line

one is slightly better than line but

not much better

now we can go and walk through with a

chromosome and say what would be the

value of the progeny on that chromosome

if we can have a recombination at a

certain certain point so if we do that

if we can have a targeted recombination

between markers to and marker three and

recover we recover this particular

doubled haploid we're saying we could

increase performance dramatically in

this example now it's . whatever units

those are compared to the parental

performance of . okay so the question

is well can can we do that

there might be a way of doing that there

was a recent fairly not so recent

anymore but in the May issue of

science there was an article that used

CRISPR technology not for the purpose of

genome editing but for the purpose of

targeted recombination in yeast so this

is yeast it's not corn it's not higher

plants at seized yet they were able to

use targeted recombination successfully

in the sense that you could use the

CRISPR that technology to make

double-stranded breaks and a specific

part of the genome and this the double

double-stranded break would then trigger

a natural mechanism called homologous

response and after a mystery homologous

recombination and after this repair by

homologous recombination the this group

were able to they actually design you

used CRISPR castes to create a mapping

panel as shown on the right side and

this mapping panel had different

recombination points along the different

chromosomes and with this they were able

to map sensitivity to manganese on a

single to a single polymorphic marker on

chromosome so they use thisfor gene

mapping but I'm wondering whether we

could use that same concept not for gene

mapping but actually for targeted

recombination so if you see if you try

to translate how this might work in

crops basically you have CRISPR caste

to induce your double-stranded break you

see you

right to rely on homologous repair hope

it happens there's recombination this is

nearing mitosis by the way not during

meiosis and then you basically need to

recover and regenerate plants that have

the desired recombinations you have are

we here yet absolutely no

the tools the basic tools might be it

might be there but it would require a

complicated workflow to bring this whole

process to fruition a question that we

need to ask though is if if we are to do

this are the gains large enough to

justify investment in this technology

meaning that if this whole process would

lead to only a % or % gain now

there's no point we can get those gains

by using larger population sizes but if

the gains are a lot more substantial

than that say percent or more

then perhaps that's a big carrot it's a

big stick saying yes the games could be

large and therefore if you're a Monsanto

or a pioneer then you might want to

start thinking about developing this

type of technology and so what I've

tried to do and this has been published

in the plant genome journal is try to

determine are the gains for targeted

recombination large enough in maize to

justify the potential development of

this technology and so I looked at two

previous experiments in May so we had at

the University of Minnesota one is a

four by four both by former grad

students and the first one this is the

informated b misery population we

used lines use the markersfrom

patch novels group thanks so much it was

very useful and then the second study

was a mapping panel by Chris Shaffer a

former PhD student we looked at the maze

panel with lines and aboutnearly

nearly , Illumina markers so

basically we used the phenotypic and

marker data estimated genome-wide marker

effects tried to determine where the

combinate where the ideal recombination

point should be and then determine how

much gain would we get if we indeed get

a line at those recombination points and

this is what we found this work here the

wrong button here you go all right I'd

like to focus first on just the first

roll grain yield in kilograms per

hectare this is in the intimated b

misery population for grainyield the

mean grain yield was . or actually

nine eight nearly nine eight point nine

two eight metric tons per hectare if the

breeder were to select the best out of

lines this best out of linesresulting

from random recombination the

the yield of that best out of line

lines is nine point six twometric

tons per hectare compared to the

population mean of eighty nine eighty

two that's about a seven percent gain

. percent gain

now if a breeder were able to induce one

recombination per chromosome at the

right spot on each chromosome one

recombination per chromosome at the

right spot on each chromosome you add up

the gains across the chromosomes because

the chromosomes are independent the

predicted gain is ten point three three

metric tons per hectare . compared

to eight point ninety eight it's about a

percent gain so seven percentgain

with non targeted recombination

percent gain with targeted recombination

that translates to a relative efficiency

using targeted recombination of

percent in other words if we can have

targeted recombination in this

population we could potentially double

the genetic gains for yield I say that

almost nonchalantly I'll say it again we

can double potentially double gains for

yield if we can have targeted

recombination if we can have two

targeted recombination if

two on each chromosome then the games

are even more and you see that fourdifferent

different traits in this

population the relative efficiencies are

also quite high now we need to be

careful in interpreting not only the

relative efficiency but also the amount

of gain because often times the relative

efficiency could be high when the gains

were small take plant height for example

if the gain from non-targeted

recombination is two centimeters and the

gain with targeted recombination is

eight centimeters and that's a four

hundred percent response but eight

centimeters whereas if the gain is you

know ten centimeters versus twenty it's

a percent twice but theamount of

gain is actually larger so we need to

look at both the ratio and the actual

gain now this then is the plot of

targeted recombination for each of the

ten chromosomes for three different

traits so what we're saying is for yield

this triangle here represents the ideal

recombination breakpoint that would

maximize the gains on each chromosome so

it's coded that's chromosome up

to chromosome and so we seehere that

each chromosome has unequal

contributions to the amount of gain that

can be attained the amount of gain was

highest for chromosomes and

it was lowest for chromosome if I if I

recall the numbers right alright so

there is variation among chromosomes

there also is variation among traits so

for example if it's yield here and these

are the targeted recombination

breakpoints for moisture and these are

the targeted recombination breakpoints

and the amount of gain for an

index thatincludes yield moistureand different

lodging traits so a few things to note

we see here that the curve for yield is

a lot smoother than

we have four moisture that shows that

suggests that in this population the the

the favorable alleles for yield are

arranged in larger blocks larger

contiguous blocks for yield than they

are for moisture which is a lot more

jagged second is that if we are to use

recombination targeted recombination we

need to determine beforehand how we

would weight each trait because right

now with with non-targeted recombination

a breeder can just not have any index or

not have any preconceived notion of how

much weight to give to each trade and

just select what the best ones are at

the end of the process if we are to do

targeted recombination we need to

specify beforehand the weights to each

trade because that would drive where the

targeted recombination points would be

next is that this the graph here shows

the targeted recombination break points

for plant height when one line nd is

crossed to several different lines and

we see here that the targeted

recombination points on the same

chromosome chromosome do varyeven if

we use the same parent so again so the a

point here is that this needs to be made

on a case-by-case or a cross by cross

basis what are some considerations that

we would need to have as we as we think

about using this technology well first

of all are there caveats the predictions

need to be effective the prediction

accuracy needs to be high meaning so

that you would have confidence in the

estimates of marker effects and and you

would have confidence in the targeted

recombination breakpoints second is that

the targeted recombination break

points would be only in thosegenomic regions

where recombination already takes place

because the marker effects depend on the

recombination that takes place so if

there are spots in the genome called

spots where this there is no

recombination that would not be a target

that recombination

break point from this process at the

same time if you have that tool for

targeted recombination it's one possible

way to to study what the effect of

recombination is on these existing cold

spots you might need to prioritize

chromosomes but particularly if you have

a lot of chromosomes and the technology

is ineffective as we showed some

chromosomes have a larger contribution

than others so it might be in this maze

example for yield I might want to target

recombination of chromosomes and

maybe a fourth one and then in the

remaining chromosomes just to rely on

random recombination and segregation and

see and see what we get

and lastly although I started out with

CRISPR and I'm now beginning to think

that we actually don't need CRISPR to do

targeted recombination for some species

for that for some species in which

markers are available in doubled

haploids are available just by the

aggressive use of molecular markers and

doubled haploids that we might be able

to create pretty much a the line that

carries a targeted recombination on

several if not nearly all of the

chromosomes just to illustrate this is

from a study published by a student Josh

sleeper we found that in doubled

haploids

then this is maize again with

chromosomes the mean number of

chromosomes of recombinations per

chromosome it's about . it does vary

from on chromosome which which

would be the largest and chromosome

has actually a lot fewer recombinations

at least in this population and so a

possible approach would be first step

one is we develop a library of we

develop a library of lines that would

have a target and recombination on

chromosome another strainwith a

target and recombination on up to

and then the next step wouldbe just

by breeding and back Rossing to assemble

these into one line and that is possible

because the there is a fairly good freak

see of a chromosome passed intact from

parent to offspring so for example

chromosome two nearly fifty percent of

the time it's passed intact from parent

off spring so building that library by

introgression would be a possibility and

so for this perhaps some even some

operate operations research type of

approach would be needed to see what's

the fastest way towards this product

given given were where we stand now if

you're working on sugarcane with

chromosomes or so then it's probably not

gonna work so you you gotta do something

else

so with that just to summarize what I've

shared with you this morning

I think a next bandwagon could be the

use of CRISPR not for gene genome

editing but for targeted recombination

the point being that with genome wide

marker effects we can identify what the

recombination points would be for each

trait we could potentially with this

technology potentially double the rates

of yield gains in case you didn't get

that the previous point is a big deal

especially as we're looking at feeding

an increasingly large population by

and I think I'm beginning to think that

we we probably might be able to

accomplish this at least in

maize orspecies with fewer chromosomes like

barley with seven not with CRISPR but

with a straight breeding methodology

however much work is needed and so I'm I

hope to be working in this field in the

next several years with that thank you

so much for your time and attention if

there is time for questions I'd be

pleased to try to answer them this

morning

okay any questions Rex that was a

absolutely fabulous idea I really like

that that's could be a game-changer as

you say now of course that's the

reaction I should have at the beginning

of the bandwagon I guess but I really

that that's very exciting I liked what

you're doing in terms of looking at

these historical data sets what we've

also seen though in corn breeding is

that there have been one key in bread

comes along and it changes the whole

thing do those contain rare

recombination events consistent with

this hypothesis yeah that's a great

question I've been thinking about that

what makes what makes a line or a hybrid

a superstar if you would it's this

what's special and it might be that if

we're talking about it could potentially

double gains maybe that line just

happened to have some of those are close

to those that I would contribute to that

being just head and shoulders above the

rest of its sister lines in the same

cross it's a great point and then if I

could ask for your feedback on a comment

so at one point I was talking to a

breeding company and I suggested that as

they start genotyping all their lines

that they save those lines with rare

recombination events in anticipation

that someday they might be useful any

thoughts about that

I think I I think I think that's a good

point

save it saving those who especially with

the rail recombination events of events

I think that's an excellent point it's

it's linkage is a it's an interesting

thing because linkage of course as we

know it can release hidden genetic

variation but it can also preserve good

good blocks of alleles and so there's

been a long debate in in corn breeding

do we want to induce do we want more

recombination or do we want less

recombination I think an answer is well

we want recombination where it's helpful

and we don't want recombination where

it's not helpful and I think we can

begin to start thinking of that question

and then save whatever recombinant

survey recombinants we

that we might come across more questions

please okay excuse me thank you as we

know their big difference between mass

and weight we works on weight it's more

complicated so what small concentrations

for weight so what wheat has pairs is

all right so yes let go with it's

more difficult than Lego and if you

think of chromosomes as lego building

blocks so it might be the case where you

would need to in a species like wheat or

sugarcane or large numbers of

chromosomes you might need to prioritize

your chromosomes and and focus on those

a nice thing though is if you're trying

to build an ideal genotype if you would

with the recombinations and if you're

doing that on a piece by piece basis

your progress is not all in the end that

as you're building accumulating those

recombinants you're already making you

hope progress and so you could track

that and see if are you indeed making

progress during that time so again it's

you think of it that the way the way we

work now is we use molecular markers and

and phenotypic data to select which ones

are the best out of what happens

randomly and now we're shifting it to

say well we're using this information to

actually try to select the best

recombinations and again yeah there are

challenges in in in species with large

numbers of chromosomes if you're in

mosquito with three chromosomes it

should be a piece of cake okay more

questions we have enough time

yeah my question is how this new

technology is applicable to the orphan

crops which they didn't have the luxury

of genome sequence information available

data so yes so obviously for this

technology to work you need to have

cheap and abundant molecular markers and

you need to have good phenotypic data if

you have if you don't have both of those

then you cannot use this technology and

so I don't have a quick answer for how

it would work in an orphan species but

the fact of the matter is and the

reality is you need both phenotypic data

and marker data to do this great talk a

question about your modeling and the

gain made so have you thought about

modeling out then if you were able to

select that directory combination that

you wanted what would be the effect on

antagonistic traits such as maturity

moisture plant height loss of test

weight things like that yes and so so

that that's when you could do that so so

you could say if I'm going to select for

just yield you also have the marker

effects for the associate to trade so I

say okay if I have a recombinant

recombination event on this between

markers and for yield you could

easily ask the question how would that

affect moisture and how would that

affect test weight and as I showed in

that graph that the recombination points

differ from trade to trait and therefore

you need to a priori have some idea of

how you're going to weight your traits

because that's going to affect where youwould like to target so it's so you

can't sit back anymore and say I'm going

to wait for the results to come before I

give weights to each trade it's more you

need to do that earlier on the process

before the data come in any more

questions

if not let's thank dr. Bernardo one more

time so much so this is a time for our

morning break but before we go we have a

few announcements there are restrooms in

the east and west side of the building

and at this point we also start

(根据软件自动整理的,难免有误)

(以下为广告)

查看原文 >>
相关文章