HAL Id: hal-00893898
https://hal.archives-ouvertes.fr/hal-00893898
Submitted on 1 Jan 1991
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Gene mapping today: applications to farm animals
M Gillois
To cite this version:
M Gillois. Gene mapping today: applications to farm animals. Genetics Selection Evolution, BioMed
Central, 1991, 23 (Suppl1), pp.19s-48s. �hal-00893898�
Gene mapping today:
applications to farm animals
M Gillois
Institut National de la Recherche
Agronorreique,
Laboratoire de
Génétique Cellulaire,
BP 27, 31326Castanet-Tolosan,
France(Proceedings
of the 9thEuropean Colloquium
onCytogenetics
of DomesticAnimals;
Toulouse-Auzeville,
10-13July 1990)
chromosomes / gene
mapping technologiesINTRODUCTION
The
genetic
information contained in mammalian cells isspread
over a set ofnuclear chromosomes and the mitochondrial DNA.
Converging
evidence has made itpossible
to estimate that the mammalian genome may containfifty
thousandstructural genes, which may be
grouped
in some three thousand to fifteen thousand clusters.Mapping
the genome involvespartitioning
this number of genes into maps of individuallinkage
groups anddetermining
their linear order on each of the chromosomes.Dissection of the nuclear genome for
mapping
can be achievedby
meioticsegregation
infamilies, segregation
in somatic cellhybrids,
naturaltagging
ofparticular
chromosomes in individuals with chromosomeaberrations, by sorting
of individual
metaphase chromosomes, by analysis
of subchromosomalfragments using
somatic cellgenetics and/or genomic
or genesequencing by
molecularbiology techniques.
During
this lecture I wish to discuss the fourfollowing points: firstly,
the identi- fication of the relations which appear between the gene map and the chromosomestructure; secondly,
the evaluation ofpossibilities, complementarities
and limita- tions of genemapping techniques; thirdly,
an overview of the status of the gene maps of cow,sheep, pig
andrabbit; fourthly,
the mostimportant applications
ofgene
mapping.
RELATIONS BETWEEN THE
CHROMOSOME
STRUCTUREAND GENE MAP
It is classical to consider a
genetic
map obtainedby
Mendeliansegregation analysis
and a
physical
map obtainedby manipulation
andcytological
observation of chromosomes(fig 1).
But the term chromosome is
slightly ambiguous
because sometimes we think about the double-strandedDNA
and sometimes we think about themetaphase
chromosome. The
cytological
structure of ametaphase
or apre-metaphase
chromo-some is very
compact.
The ratio of thelength
of themetaphase
chromosome to thelength
of its DNA is about 1:8000.A beautiful electron
microscope preparation
of a baremetaphase chromosome, published by
Paulson and Laemmli(1977),
shows a backbone of non-histoneproteins
that isresponsible
for the basicshape
ofmetaphase
chromosomes and the central scaffoldorganizing
the DNA intoloops throughout
itslength.
Theseloops
of DNA are anchored in thescaffolding
and both ends of aloop
appear tobe anchored at the same
place
because the DNA strand emanatesradially
fromthe scaffold and returns to an
adjacent point.
Eachloop
contains from 30000 to 100 000 basepairs (bp)
of DNA(figs
2 and3).
Using
thetechnique
ofviscoelasticity,
Kavenoff et al(1973)
were able to demon-strate that the DNA of a chromosome is a
single
linear molecule.Consequently,
we have to consider that
along
this DNA molecule there arespecific
sequence do- mains which areresponsible
for theanchorage
of the DNA to the scaffold and for visible structures ofchromosomes,
such as the centromere or telomere.Curiously, repeated
D1VA sequences are situated in theregion
of contact between DNA andproteinaceous
scaffold.In a very
interesting
essay, Goldman(1988)
considers thateukaryotic
chromatinis
organized
into a series ofloops approximately
20-330 kb inlength,
anchored to aprotein
matrix or chromosome scaffold at twopoints (the
SAR or scaffold-attachedregions).
The SAR istypically
a DNA sequence that isrecognized by topoisomerase
II. Each
loop represents
a unit ofreplication, having
asingle replication origin (RO)
and a unit ofsupercoiling,
in that its torsional state isindependent
of thoseof the
surrounding loops.
Eachloop
alsorepresents
a unit offunction,
in that aloop
as a whole may be in apotentially
active orrepressed transcriptional
state.There may be one or several
transcription
units in aloop.
Theorganizer (ORG)
is a DNA sequence that binds a
transcription
factor whose presence isrequired
forassembly
of active chromatin. Thehypothesis
has been advancedthat,
inmammals, transcriptionally
competent domainsreplicate during
the first half of the Sphase,
while
transcriptionally incompetent
domainsreplicate during
the second half of the Sphase.
Thisexplains
the use of BrdU(5-bromo-2’-deoxyuridine) incorporation
into DNA
during
chromosomedynamic banding.
Vogt (1990)
in a recent paperpoints
out that tandemrepeated
DNA sequence blocks which appear at manyplaces
in the DNA strand areresponsible
for thechromosome
folding
and share the chromatin codes.The first code of the DNA sequence, detected
by Lengyel
et al in1961,
is ofcourse the
triplet
code.The second code of the DNA sequence was detected in 1980
by
Trifonovand Sussman.
They
realized thatpairs
of adenine nucleotides occur with a 10.5periodicity along
any DNA sequence that interacts with the histone octamer unit and forms a nucleosome. Thisperiodicity
correlates well with that of the helical turn in the/ 3-helical
DNAsecondary
structure. Thispattern
of adenine nucleotides codes for a unidirectionalcurving
of the DNA helix around the histone octamercomplex.
This code is the chromatin code.Vogt enlarges
on thisidea, forming
theconcept
of a’chromatin-folding
code’.He
presents
evidence thattandemly repetitive
DNA sequences have thepotential
of
expressing
alocus-specific organization
of theirfolding
structures in the nucleus in which three characteristics are notable:1)
theserepetitive
sequences are able to attractspecific
nuclearproteins; 2)
theserepetitive
sequences are able todevelop
alocus-specific, repetitive, higher
order sequence structure because of their inherentpossibilities
ofquick
sequencealteration; 3)
selection pressure createdby
thefunctional interaction between nuclear or
chromosomal proteins
and folded DNA stabilizes at each locus the molecularpattern
of thesespecific repetitive
sequences.Salser et al
(1976)
defineexperimentally
alocus-specific, repetitive
DNA se-quence structure whose sequence units have lost the
ability
tohybridize
to relatedsequence units located at other chromosome
positions.
Vogt
introduces the chromatin code at the nucleosomelevel,
the chromatin-folding
code at the level of the centromereregion,
telomereregion
and scaffoldregion
and he adds the gene
expression code,
thereplication
code and the recombination code.In each
species
exists a so-called’library code’,
which is a list of code sequences.The
composition
of the’higher
order’ structure isspecies-specific
and dictatedby
the combination of amounts of common basic sequence elements.
Two
major types
of tandemrepeated
DNA sequences are distributed in the genome(table I).
For the firsttype, they
areorganized
inlarge
tandem block struc- tures characterizedby
arepetitive
head-to-tailarrangement
of various sequenceunits,
whoselength
can range from one to several thousand nucleotides. Their copy number ishighly
variable. For the secondtype,
small variable tandem sequenceblocks,
with arepetitive
head-to-tailarrangement
of the same basic sequence unit at onelocus,
can be clustered orinterspersed
in the genome and may contain related sequence elements.Now we are able to
design
a schematicrepresentation
of relations between thephase
of DNA sequences and thehistone-depleted
chromosome(fig 3).
The current
problem
is how andwhy
these variousspecific
tandemrepeat
DNAsequences
play
a role in chromatinfolding
and geneexpression.
Firstly,
satellite DNAtype
I sequences aremostly
located in the constitutive heterochromatin of the chromosome centromereregion. Only
thepolymorphic
heterochromatin of the
long
arm of the Y chromosome(region
Yq12-Y qter,
for the humanchromosome)
contains satellite III DNA fractions. Thepattern
of theserepeats
and the different sequence of units wereextensively
studiedmainly by
Prosser et
al (1986).
Some of these unit sequences of cores arechromosome-specific.
Secondly,
thealphoid
DNA sequence families arerepetitive
DNA sequences in the genome with a mean monomerlength
of 171bp
and clustered in a tandemrepeat
sequenceorganization.
We know that each chromosome at its centromereregion
contains at least onespecific alphoid super-repeat
structure, oftenspecified by
achromosome-specific tandemly repeated
restriction site. Thea-protein
in thenucleus is
specifically
attached to thealphoid
DNA sequence structuresupporting
the
specific phasing
of nucleosomes in the centromeric chromatin.Recently,
Wevrick and Willard(1989)
studiedlong-range organization
of tandemarrays of a-satellite DNA at the centromeres of human chromosomes and their mei- otic transmission
through
threegenerations. They report
that restriction site andarray-length polymorphisms
exist between individuals and appear to bemeiotically
stable on a
megabase
range. Similar results werereported by
Jabs et al(1990).
Thirdly, Moyzis
et al(1987)
cloned arepeated
sequence from human DNA thathybridized
to humantelomeres,
and had therepeating
units(TTAGGG).
Sincethen Hastie and Allshire
(1989)
were able togive
a schematicrepresentation
of theorganization
of the human telomere.Finally,
sequences from the telomere are able to formguanine
tetrads within ananti-parallel quadruplex,
which mayexplain
chro-mosome
pairing (Sundquist
andKlug, 1989).
Therecognition
betweenhomologous
chromosomes could be ensured
by specific flanking
sequences of telomeres.Interestingly,
telomere-likerepeat
stretches may also occur in interstitialposi-
tions and may make the chromosome prone to
breakage, fusion, fragile
sites andrecombination.
In
conclusion,
we have toidentify
DNA sequences which areresponsible
forthe
chromatin-folding
code and locate them. Forexample,
Collick andJeffreys (1989)
identified a 40 kDa mouseprotein (Msbp-1)
that bound tomultiple copies
of minisatellite core sequences. Similar
proteins
were found in a number of otherspecies. Among
theseproteins,
we canexpect
toidentify couples
of DNA sequences andproteins
that areresponsible
for chromatinfolding. Furthermore,
we canexpect
to find reasons for the
specific
differences between chromosomes.EVALUATION OF GENE-MAPPING
TECHNIQUES
Use of somatic cell
genetics
in genemapping
Gene
mapping by
somatic cellhybridization
is based on a series ofparasexual
eventsinvolving
fusion of somatic cells from differentspecies,
fusion of their nuclei and mitoticsegregation
ofpart
of thegenetic
material of one of theparents during
theproliferation
of the fusionproducts.
In clones of cell
hybrids retaining
thecomplete
genome of oneparent (recip-
ient
genome)
and one or several chromosomes of thesegregating
genome(donor genome),
theresulting
concurrentsegregation
of biochemicalphenotypes (markers)
indicates that their
encoding
genes are located on the same chromosome. It is theoperational
definitionof synteny (fig 4).
Formation and isolation of somatic cell
hybrids by
selection andcloning,
andtechniques
for molecular andcytogenetic analysis
of thehybrid cells,
are now routinemethodologies.
A
pragmatic approach
has beenadopted
in order to create amultiple
chromo-some
mapping panel.
A number ofindependent hybrid
clones aregenerated and,
after a
period
ofgrowth, analyzed
for the presence of donor chromosomesby
meansof
cytogenetic
and moleculartechniques.
From thiscollection,
a series of clones arechosen as a
mapping panel
based on variouscriteria,
such as their donor chromo-some
content,
their relativestability,
the numericalrelationship
between their donor andrecipient
genomes and theirgrowth
characteristics. Clones which continue tosegregate
donor chromosomes at ahigh
rate are discarded.Some difficulties in this
approach
cannot be avoided. Forexample, specific
combinations of donor chromosomes tend to
co-segregate
inhybrid cells,
whereasother combinations are
particularly
rare. Inaddition,
both donor chromosomes andrecipient
chromosomes can rearrange.Consequently,
statistical rulesconcerning synteny
orindependence
betweenmarkers are needed.
If two
markers,
X andY,
are borneby
the samechromosome,
a set of nhybrid
clones is
expected
to be made up of adoubly positive
clones(those carrying
thecorresponding chromosome)
and of n — adoubly negative
clones(those having
lost the
carrying chromosome). Conversely,
if both markers are carriedby
twoindependent chromosomes,
it isexpected
that all four classes of responses will be found in a set ofhybrids.
The numbers of each class areusually
denotedby
a,b,
cand d
(fig 5).
Expected
values are such that b = c = 0 forsyntenic
genes or ad - bc = 0 forindependent segregation
ofcarrying
chromosomes.In
fact,
these alternativebiological hypotheses
cannot be turned into such clear- cut mathematical statements. Discordant clones may occur at lowfrequencies
forsyntenic pairs
and astatistically significant departure
fromindependence
may occur for genes carriedby
distinct chromosomes. This is the reasonwhy
Chevalet andCorpet (1986)
introduced a coefficient of correlation into the statistical frameworkaccounting
for association between chromosomes in their statement of decision rules.Today,
scientists are interested in the construction of somatic cellhybrids containing
differentoverlapping
deletionsinvolving
aparticular
chromosome. Forexample,
Cowell and Nlitchell(1989)
have constructed a somatichybrid
cellmapping panel
forregional assignment
of human chromosome 13 DNA sequences.The
assignment
of chromosomal breakpoints
fromkaryotipic
data alone is oftensubjective
and alternativeinterpretations
of G-banded appearances arepossible.
However, karyotypic analysis
can exclude the presence of certain chromosomes in thehybrids (fig 6).
To characterize therearrangements
moreprecisely, they
used aset of chromosome
13-specific
DNAprobes.
This
type
ofapproach,
with natural or induced chromosomedeletions,
is usefulto locate and furthermore to clone a gene
responsible
for a disease or amajor
inheritance trait.
The new trend in somatic
cell genetics
is totry
to obtainsingle
chromosomemapping panels. Hybrid
clonescontaining single
donor chromosomes can be gener- atedby
rare chanceimmediately
after cell fusion orduring
thepermanent
process of chromosome loss from established cell linehybrids
at a lowfrequency.
An alternative fusion
procedure
shouldproduce hybrid
cells with few orsingle
donor chromosomes: microcell fusion.
Metaphase
nuclei aresplit
into microcellsor
minisegregant
cells which contain few orsingle
chromosomespacked
into cellmembrane-derived
envelopes.
After fusion with arecipient cell,
theproduction
ofhybrids
withsingle
donor chromosomes is obtained faster than with cell to cell fusion(Ege, 1980;
Fournier andRuddle, 1977;
Tourian etal, 1978).
Unfortunately,
it has turned outthat,
at some timeduring
thisprocedure,
donorchromosomes tend to be
partially
deleted orrearranged. Thus,
the retention ofintact chromosomes has to be
carefully
controlled. Ifhybrids
with asingle
donorchromosome are obtained and maintained under selective pressure to retain this
chromosome, fragmentation
of the donor chromosome can occur. The consequence of thesepitfalls
is that it is difficult to obtain acomplete single
chromosomemapping panel.
For the near
future, single
chromosomemapping panels,
withintact,
deleted orrearranged
chromosomes will be very useful forregional mapping using pulsed-field gel electrophoresis.
Use of in situ
hybridization
in genemapping
Molecular in situ
hybridization
involvespreparing
aprobe
of labeled DNAfrag-
ments
corresponding
to a gene or apart
of a gene,previously
isolatedby
a methodof
genetic engineering,
andhybridizing
thisprobe
to thehomologous
DNA of themetaphase
chromosomes onmicroscope
slides. Thistechnique
is now well estab-lished,
see the review of Gellin and Yerle(1989)
and thefollowing
illustrations.Figure
7 shows the in situlocalization, performed by
Yerle et al(1986)
andYerle and Gellin
(1989),
of theleukocyte-interferon
gene in theq25 region
ofpig
chromosome 1. Two
spots
are visible on the twohomologues
of chromosomes 1.A remarkable concentration of silver
grains
was located on the chromosomeregion lq25
as seen onfigure
8. Acomputer-workstation
was used for in situhybridization
automatic
quantitative analysis.
For
non-isotopic
in situhybridization,
twomajor applications
have received considerable attentionlately:
genemapping
andinterphase cytogenetics (fig 9).
Several groups have
reported
the visualization ofprobes greater
than one kilobase(kb) long using
either conventionalmicroscopy
ordigital-imaging microscopy.
Probes of
considerably greater genetic complexity
have ahigh probability
ofcontaining interspersed repetitive
sequences,usually widely
distributedthroughout
the genome, which will result in a
non-specific hybridization signal. However, protocols designed
to suppress these unwantedsignals
withappropriate competitor
DNA have beendefined,
and thus one can obtainhighly specific
delineation of the genomeregions
from which theprobes
have been derived.With cosmid
probes, nearly 90%
of thetarget
sequences in cellpopulations
canbe delineated
by
fluorescenceand,
onmetaphase
chromosomespreads,
more than80%
exhibitspecific signals
on both chromatids of bothhomologous
chromosomes.Chromosome
assignment
is notproblematic,
as several conventional chromosomebanding
methods arefully compatible
withprobe
detectionby
fluorescence.Hybridization signals
from twoprobes
can bespatially
resolved onmetaphase
chromosomes when the
probes
areonly
several hundred kilobasesapart,
but a minimumseparation
of one or two million basepairs
isrequired
to enable the establishment of theirphysical
order.To further
improve
thespatial
resolution of genemapping, co-hybridization
studies with
closely spaced probes
from asingle genomic region
have beenperformed
on
interphase
nuclei. Measurements of the distances betweenmultiple pair probes,
after
hybridization
to methanol-acetic acid-fixed flatnuclei,
demonstrate afairly
linear
relationship
betweenphysical
distance and genome order over the range fromthirty
kilobasepairs
to about one million basepairs.
The combination of
metaphase
andinterphase
nuclearmapping, particularly using multiple probes simultaneously,
offers theopportunity
tophysically
ordergenomic
DNAsegments
with a resolutionpresently only
achievedby gel electropho-
resis methods and
provides
a newbridge
to interrelatephysical
andgenetic linkage
information.
Use of PFGE and YAC
chromosome-cloning
in genemapping
Pulsed-field
gel electrophoresis (PFGE)
can resolve DNA molecules aslarge
asseveral million base
pairs
in size. This is in contrast to conventionalelectrophoresis
for which the
practical
upper limit ranges from 50 000 to 100 000bp.
This increasedresolving
power hasespecially important
ramifications for thestudy
ofcomplex
genomes of
large
size. PFGE has made thecloning
oflarge
genes or groupsof
genespossible
via theyeast
artificial chromosome(YAC)
method.The essential technical innovation in PFGE is the use of electric fields that are not constant
throughout
thegel
run, but thatregularly
alternate in direction. Thefrequency
with which thechange
occurs, called thepulse time,
dictates the size class offragments
to be resolved.Because of the
large
size of molecules to beexamined,
DNAfragments
for PFGEmust be
carefully prepared
toprevent shearing.
Thisproblem
has beenneatly
andquickly
resolvedby
thepractice
ofpreparing
DNA in agarose blocks orplugs.
The
analysis
of genomes ofhigher organisms requires
restriction enzyme di-gestion.
Aspecial
class of restriction enzymes is needed which must cut rela-tively infrequently, generating fragments
incomplete digests
of between 100 and 1000 kb in average size. Forexample,
enzymes such as NruI(TCGCGA)
or NotI(CCGGCCGG) give
an excellent range offragment
sizes forpulsed-field
work.With an upper size limit of ten million base
pairs,
PFGE isobviously incapable
of
separating
intact mammalian chromosomes. Forexample,
the smallest human chromosome is about 50 million basepairs
in size.A
good example
of anapplication
of thistechnique
is thephysical mapping
ofhuman chromosome
21, accomplished by
Gardiner et al(1990).
Human chromosome21 at low resolution can be divided into the
centromere-proximal
half that ispredominantly
a Giemsa-dark band(q21)
and thetelomere-proximal
half that is aGiemsa-light
band(q22) (fig 10).
DNA
fragments
from theq22 region digested by
several enzymes(NotI, BssHII, NruI)
averageconsiderably
less than 1000kb,
whereas those from theq21 region
average
greater
than 1 500 kb.Furthermore, in
more detailedanalysis, it
can be shown that the upperfragment’s
1 500 kb from the
q22 region
mapexclusively
to the small Giemsa-dark bandq22.2.
This observation
is, in part,
a reflection of fundamental differences betweenGiemsa-light
and -darkbands. Dyes
used incytogenetic staining
arebase-specific
and indicate that
light
bands arehigher
in G + C content than dark bands.In
addition, light
bands are enriched for genes(or
at least for thosemapped
so
far) and, therefore,
for thegene-associated CpG
islands. It islogical
thatlight bands,
because of their different basecomposition,
will contain more sites for rarecutting
restriction enzymes.Pulsed-field
analysis
can alsoprovide
information on both the sizes of mammalian genes and theirorganization.
The entirelong
arm of human chromosome21,
40million base
pairs,
has been accounted for on a collection of 33 NotI restrictionfragments, using
some 50unique
sequenceprobes. Using complete digests, 70%
ofthese
probes
have beenphysically
linked in 13 separate groups.Figure
10 shows data from Gardiner(1990) concerning
the 6physical linkage
groups
mapping
to the distal third of thelong
arm.Together,
these groups contain 14 of the 24 genes and 19 of 54unique
sequences used to construct the map.Consideration of the smallest restriction
fragment
that defines each group indicates that these genes arelocally clustered, being
contained withinonly
3400 kb orapproximately
8% of thelong
arm.Certainly,
chromosome 21 contains many more than these two dozen genes,probably
500 or1000,
but thispresent
evidencestrongly suggests
that genes are notuniformly
distributed.In
conclusion, pulsed-field gel electrophoresis
has increasedby
over 100-fold the size of DNA molecules that can bequickly
andeasily
resolved.Physical mapping using
thistechnique yields
new information on the relativeproximity
of individual genes, localclustering
of groups of genes, and the molecular basis ofcytogenetic banding
patterns.Use of
chromosome-crawling
in genemapping
Once the
large-scale physical
mappers have finished theirhopping, jumping
andwalking
on thechromosome,
there is left the difficult task ofestablishing
the contentof
large segments
ofgenomic
DNA.Although
there has been much discussion about new methods forlarge-scale mapping, approaches designed
toidentify
newexpressed
sequences haveonly recently
been considered.Two groups have now mounted detailed researches
along large segments
of the MHC(major histocompatibility complex),
on the short arm of human chromosome6, looking
forexpressed
unknown sequences.Spies
etal (1989) probed
Northern blotsdirectly
withcosmids, screening
for thepresence of
transcripts
in Bcells,
Tcells, monocytes, epithelial
cells and fibroblasts.Using
thisstrategy, they
identified five new transcribed loci andproceeded
to obtaincDNA clones for each. All
transcripts
wereconstitutively expressed
in all the tissues.Sargent
et al(1989)
used a morethorough approach
foridentifying
sequences.They
first identified HTF(HpaII tiny fragment)
islands within the cloned DNA.These
CpG-rich regions
were identifiedby
the presence of clusters of restriction sites for enzymesBssII, Ea,gI
andSacII,
and the lack ofmethylation
of thesesites in
genomic
DNA. Such sequences arecommonly
foundadjacent
toexpressed
housekeeping
genes.Sequences adjacent
to these HTF islands were then used toprobe
Northern blots toidentify
the transcribed sequences. Thisapproach
revealed12 new genes and established a
density
of such genes of one every 23 kb of DNA between the C4 and the tumor necrosis factor(TNF)
loci.Even if
they
have not identified the fullcomplement
ofexpressed
loci in thisregion,
these two studies areimportant
becausethey provide
animpression
of thehigh density
ofexpressed
sequences inparticular genomic regions.
The MHC isfound in a
Giemsa-light
or G + C-richregion
of the genome, consistent with this veryhigh frequency
ofhousekeeping
genes. Similarhigh
genedensity
can beexpected
inother
genomic regions
associated with aGiemsa-light
band or G + C-rich isochores.Use of DNA
probes
andpedigree analysis
in genemapping
DNA
probes
becomegenetic
markers whenthey
arecapable
ofdetecting genetic
variation between individuals at the level of their chromosome DNA sequences.
The
genetic
variation is revealed as heritable DNA restrictionfragment length patterns
that differ between individuals when their DNA isdigested
with arestriction endonuclease and
hybridized
to the samespecific
DNAprobe
via theSouthern blot
technique (fig 11).
The inheritance of these restrictionfragment length polymorphisms (RFLPs)
can be used to track the inheritance of smallspecific
chromosome
regions,
once theRFLP-detecting
DNAprobe
has beenmapped
to achromosome. The
RFLP-detecting
D1VAprobe
has to beassigned
to aspecific chromosome,
and located in aunique locus, by
somatic cellgenetics and/or
molecular in situ
hybridization
on themetaphasic
chromosome.Usually
the band size differences on theelectrophoretic gel
are a direct result of variation in the DNA sequencecomprising
the enzymerecognition
site. Point muta-tions,
or deletion or insertion of tracts of DNA sequence within the locusrecognized by
the DNAprobe generate
this Mendelian variation between individuals.It is
important
topoint
out that two RFLPs ofequal lengths provided by
thesame enzyme
digestion
can show internalgenetic
differences in each DNA sequence ifthey
aredigested again
with another restriction enzyme. This internalmapping technique, developed by Jeffreys
et al(1990)
for DNAfingerprints,
revealedgenetic
variation on an
extraordinarily
fine scale within asingle
locus and it is well suited to track thegenealogy
of a gene.Figure
11 illustrates asimple type
of RFLP associated with the gene for the/3-subunit
of thefollicle-stimulating
hormone(FSHB)
located on the short arm ofhuman chromosome 11
(llp).
This is the common form of RFLP. It results from asingle
site mutation that alters the restriction endonucleasecleavage site,
in this case for the enzyme HindIII. The FSHB DNAprobe
detects two different restrictionfragment lengths,
which represent the alleles. An individual who possesses both alleles has a different-sized FSHB restrictionfragment
on each chromosome of his chromosome 11pair.
This
approach
isespecially
well suited forphysically approaching
orexamining
genes that are candidates for
specific
Mendelian inheritedphenotypes,
such ashalothane
sensitivity
inpig
or Booroolaprolificacy
gene insheep.
Butfirst,
it isnecessary to find some RLFPs which are
strongly
correlated with the Mendelianphenotype.
In order to attain this
aim,
the realization of alow-density
map of the genome isneeded,
such as is the case for the Booroola gene.Sometimes, comparative
genemapping gives
us indications which reduce the neededmapping
to aregion
of aunique chromosome,
which is theputative
carrier.This is the case with the halothane gene, which may be located to the
cen->q21 region
of thepig
chromosome6,
in thevicinity
of fivemapped
clustered genes, as Yerle et al demonstrate in theseproceedings.
The
goal
ofusing
RFLPs tostudy
the inheritance of agenetic
disease or a clearMendelian
phenotype
is toidentify
thespecific
chromosomalregion
that containsa
particular
gene.Dozens or
possibly
hundreds of RFLPfragments
may need to be tested infamily pedigrees
beforelinkage
can be detected with theputative
carrier chromosomalregion.
Figure
12 shows asimple
scheme of inheritance of two linked RFLPs. The mother’schromosomes,
identified with filled circles forcentromeres,
are indistin-guishable
because each chromosome has the same allele for both the(A1,A1)
or(B1,B1)
locus. The father’schromosomes,
centromeres shown as filled squares, canbe
distinguished
because he isheterozygous (A1,A2), (B1,B2)
at the two loci. Inaddition,
the allele associationpattern (or phase)
of the father’s chromosomes canbe determined from his
parents’s
chromosomes: thegrandmother
contributes anA1/B1 chromosome,
thegrandfather
contributes anA2/B2
chromosome. A mat-ing
of thistype,
where oneparent
isdoubly homozygous
and the other isdoubly heterozygous,
is termed a double backcross. Recombinant(R)
and non-recombinant(NR)
chromosomes can be counteddirectly
from double backcross matin-,-, as shown in thefigure,
whenphases
can be determined. Thispoints
out the 11. of thethree
generation pedigrees.
Individuals
1,
2 and 3 have inherited NR chromosomes from their father. Indi- vidual 4 has inherited an R chromosome(A1/B2)
from his father. The number of R chromosomes dividedby
the number of chromosomesyields
the recombination frac- tion0,
theprobability
with which a recombination occurs, and the measure of thegenetic
distance between markers: for thissimple family pedigree, 0 = 1/4
= 0.25.The LOD score method is
conventionally
used toquantify
the datasupport
forlinkage;
in thisexample
the LOD score reflects thesupport
forlinkage
betweenA and B loci. The LOD score is the
logarithm (to
the base10)
ofthe
ratio oflikelihoods
(L)
calculated from thefamily pedigree
data in two models: one modelassumes
linkage
at different values ofe [L(e)],
the other model assumes nolinkage,
free
recombination,
or 0 =1/2, L(1/2).
The LOD score is calculated at incremental values of 0. The recombination fraction
(0)
associated with the maximum LOD score isaccepted
as the bestestimate of the recombination between markers.
A lot of factors
complicate
thelinkage analysis:
numerouspedigrees, complexity
of these
pedigrees
which includes numerousoffspring,
intricateconsanguinity loops,
and a number of individuals for whom the marker
phenotype
is unknown.Fortunately, good computer
programs areavailable,
such as the program LINKAGE ofLathrop
and Lalouel(1988)
or the programMAPMAKER-QTL
ofLander and Green
(1987). Pairwise,
ortwo-point, linkage analysis
of sets of RFLPsthat map to the same chromosome is first used to
identify
clusters ofclosely
linkedmarkers. The order of clusters of the chromosome or of
closely
linked RFLPs withina cluster is resolved with multilocus
linkage analysis.
Multilocusanalysis,
a methodthat examines the
linkage relationships
ofmultiple
locisimultaneously,
is a power-ful
technique
that is used to resolve the order of markers on the mapby assigning
different
probabilities
to different marker orders.There is another
approach
toincreasing
the informativeness of markerloci,
themultiallelic multiloci variable number tandem
repeat system (VNTR).
Variations in the number of
short, tandemly repeated
sequences found at a number of loci constitute an alternative class of DNA sequencepolymorphisms
in mammalian DNA. The number of
copies
of the tandemrepeats
can vary fromone to
several,
to hundreds. The restrictionfragment carrying
the set of tandemrepeats
willprobably
appear in a number of differentlengths, defining multiple distinguishable
alleles at asingle
locus in apopulation sample. Heterozygosities
over
90%
have been identified so far.A
large
number of VNTR loci have now been identified as a consequence of theirpartial homology
withsynthetic oligonucleotides,
whose construction was based onDNA sequences from known VNTR loci or on a similar sequence.
Resolving
the alleles of thesystem requires
care,however,
because the differences infragment length
among alleles can be as small as asingle repeating
unit. Unitsare known to range from 14 to 70
bp
at different loci.If at all
possible,
it is desirable to define the alleles at a VNTR locus in terms of the number ofrepeats carried,
which makes it mucheasier,
forexample,
to comparethe alleles found in different branches of the same
pedigree.
Scientists
expect,
with aunique
VNTR DNAprobe,
to be able tostudy
15 to 30loci
simultaneously
and to realize alow-density
map of the genome. It is a necessaryprerequisite
to track a chromosomalregion bearing
an unlocated gene such as the Booroola gene. On the otherhand, they expect
also to track thegenealogy
of agene in a
single
locus with the internalmapping technique developed by Jeffreys
etal (1990).
Use of microdissection of
metaphase
chromosomes and sequence-independent amplification
of DNAMany
Mendelian genes, which areresponsible
fordiseases, major
traits orquantita-
tive
inheritance,
are difficult to isolateby
currentcloning strategies,
such as chro-mosome
walking,
eventhough
thephysical
location of otherclosely
linked genes has been determined. This islargely
due to thedifficulty
of chromosomewalking
inthe
complex
genomes ofhigher organisms
which contain abundantrepetitive
DNAsequences. This same
problem complicates
the construction ofcomplete physical
maps and
sequencing
efforts.Physical
dissection ofmetaphase
chromosomes is the mostpromising approach
to this
problem. Scalenghe
et al(1981)
were the first to dissect and clone DNAfrom
Drosophila polytene
chromosomes.Later,
microdissection andmicrocloning
were
improved (Pirrotta
etal, 1983),
andcompleted by
universalenzymatic
DNAamplification (Ludecke
etal, 1989).
Senger
et al(1990) published
a very well-documented review of thepresent
state of the art.They point
out that the conventional microdissectionprocedure
has beenimproved mainly by
the use of thepipette
method for chromosomalharvesting,
which avoids DNA contamination from broken
cells;
the dissection of GTG- bandedchromosomes,
which allowsprecise
excision ofsingle bands;
and a modifieddissection
technique,
which does not use an oil chamber. Theseimprovements
haveseveral
advantages: straight
needles andpipettes
extended with thehelp
of asimple pipette puller
aresufficient;
it ispossible
to dissect under the best directional andoptical
conditions and chromosomes can beplaced perpendicular
to thedissecting needle; banding pattern
and thin needlesrequired
for dissection ofsingle
bands aremuch easier to visualize in air than in oil.
Ludecke et al
(1990)
show that to constructband-specific
DNAlibraries,
20 to40 dissections per
region
are sufficient.We can
expect
that thisprocedure
will beperformed
soon forcloning
bands inthe
cen-q12-q23 region
ofpig
chromosome6,
in order to isolate theHAL-gene.
This
technique
seems morepromising
than thesorting
ofmetaphase
chromo-somes, which
provides
very small amounts of DNA foranalysis
and needsexpensive
chromosome-cell sorter
apparatus.
AN OVERVIEW OF GENE MAPS OF
COW, SHEEP,
PIG AND RABBIT
Apart
from the human gene map,only
the gene map of the mouse is well advanced.For other
species investigated, knowledge
is stilllimited, particularly
for farmanimals,
such ascattle, sheep, pig
and rabbit. As shown on tablesII-V, (Genetic Maps, 1990),
a fewlinkage
groups, 1-20syntenies
and a few lociassigned
orregionally mapped
are known for each of thesespecies.
SOME REMARKS ON APPLICATIONS OF GENE MAPPING
Eight applications
are of some interest:1) comparative
genemapping; 2)
under-standing
relations betweencytogenetics
and molecular structure ofchromosomes;
3) mapping
a chromosome to find and to isolate a gene;4) drawing
realpedigrees
of genes from
pedigrees
ofzygotes; 5) genetic
dissection ofquantitative
or com-plex
inheritablecharacters; 6) genetic improvement
based on selection of marker genes;7) describing
at the DNA level the molecular response to a selection pressure;8) considering
the maternal effect as agenetic imprint.
Comparative
genemapping
A beautiful illustration of
comparative
genemapping
has beenpublished by
Xu andHardison
(1989).
Dutrillaux et al(1980)
found considerablehomology
in the overallchromosome-banding patterns
of rabbits andhumans,
which reflects asimilarity
in gene content in several
regions,
as shownby
Soulie and deGrouchy (1982).
Inparticular,
Dutrillaux et al detected acorrespondence
between rabbitlq
and human11
chromosomes,
and these data confirm the presence ofhomologous genes
in rabbitlq
and humanllp
chromosomeregions.
Another excellent
example
ispresented by
Yerle et al in theseproceedings:
thegroup of five genes:
GPI, APOE, TGF,8,
EN01 andPGD,
which are located in thecen-q25 region
ofpig
chromosome6, corresponds
to two groupsGPI, APOE, TGF/3
located on the human 19 and murine 7 chromosomes and
ENO,
PGD located onthe human 1 and murine 4 chromosomes.