• Aucun résultat trouvé

Coupling of Single Molecule, Long Read Sequencing with IMGT/HighV-QUEST Analysis Expedites Identification of SIV gp140-Specific Antibodies from scFv Phage Display Libraries

N/A
N/A
Protected

Academic year: 2021

Partager "Coupling of Single Molecule, Long Read Sequencing with IMGT/HighV-QUEST Analysis Expedites Identification of SIV gp140-Specific Antibodies from scFv Phage Display Libraries"

Copied!
16
0
0

Texte intégral

(1)

HAL Id: hal-01738025

https://hal.archives-ouvertes.fr/hal-01738025

Submitted on 31 May 2021

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

with IMGT/HighV-QUEST Analysis Expedites

Identification of SIV gp140-Specific Antibodies from

scFv Phage Display Libraries

Seung Yub Han, Alesia Antoine, David Howard, Bryant Chang, Woo Sung

Chang, Matthew Slein, Gintaras Deikus, Sofia Kossida, Patrice Duroux,

Marie-Paule Lefranc, et al.

To cite this version:

(2)

Edited by: Prabakaran Ponraj, Intrexon, United States Reviewed by: Paolo Casali, The University of Texas Health Science Center San Antonio, United States Jiang Zhu, The Scripps Research Institute, United States *Correspondence: Melissa L. Smith melissa.smith@mssm.edu; Ismael Ben F. Fofana ismael.fofana@bc.edu

Specialty section: This article was submitted to B Cell Biology, a section of the journal Frontiers in Immunology Received: 22 September 2017 Accepted: 06 February 2018 Published: 01 March 2018 Citation: Han SY, Antoine A, Howard D, Chang B, Chang WS, Slein M, Deikus G, Kossida S, Duroux P, Lefranc M-P, Sebra RP, Smith ML and Fofana IBF (2018) Coupling of Single Molecule, Long Read Sequencing with IMGT/HighV-QUEST Analysis Expedites Identification of SIV gp140-Specific Antibodies from scFv Phage Display Libraries. Front. Immunol. 9:329. doi: 10.3389/fimmu.2018.00329

coupling of single Molecule,

long read sequencing with

iMgT/highV-QUesT analysis

expedites identification of siV

gp140-specific antibodies from

scFv Phage Display libraries

Seung Yub Han1, Alesia Antoine2, David Howard1, Bryant Chang1, Woo Sung Chang1,

Matthew Slein1, Gintaras Deikus2, Sofia Kossida3, Patrice Duroux3, Marie-Paule Lefranc3,

Robert P. Sebra2, Melissa L. Smith2* and Ismael Ben F. Fofana1*

1 Biology Department, Boston College, Chestnut Hill, MA, United States, 2 Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, Icahn Institute of Genomics and Multiscale Biology, New York, NY, United States, 3 The international ImMunoGeneTics information system® (IMGT®), Laboratoire d'ImmunoGénétique Moléculaire (LIGM), Institut de Génétique Humaine (IGH), UMR CNRS, Montpellier University, Montpellier, France

The simian immunodeficiency virus (SIV)/macaque model of human immunodeficiency virus (HIV)/acquired immunodeficiency syndrome pathogenesis is critical for furthering our understanding of the role of antibody responses in the prevention of HIV infection, and will only increase in importance as macaque immunoglobulin (IG) gene databases are expanded. We have previously reported the construction of a phage display library from a SIV-infected rhesus macaque (Macaca mulatta) using oligonucleotide primers based on human IG gene sequences. Our previous screening relied on Sanger sequenc-ing, which was inefficient and generated only a few dozen sequences. Here, we re-an-alyzed this library using single molecule, real-time (SMRT) sequencing on the Pacific Biosciences (PacBio) platform to generate thousands of highly accurate circular consen-sus sequencing (CCS) reads corresponding to full length single chain fragment variable. CCS data were then analyzed through the international ImMunoGeneTics information system® (IMGT®)/HighV-QUEST (www.imgt.org) to identify variable genes and perform

(3)

inTrODUcTiOn

Nonhuman primates are an important animal model for numerous human diseases, as there is great similarity between the human and macaque genomes (1–7). In addition, macaque immunoglobulin (IG) genes are likely those most closely related to human IG genes among available human immunodeficiency virus (HIV)/acquired immunodeficiency syndrome (AIDS) animal models (8–11). As a result, the variable heavy (VH) and variable light (VL) domains of macaque antibody heavy (H) and light (L) chains can be generated using polymerase chain reaction (PCR) conditions and oligonu-cleotide primers based on human IG nuoligonu-cleotide sequences. This has been shown thus far for VH rather than VL genes, although the use of rhesus specific primers for amplification may function for both variable domains (9, 11, 12).

The simian immunodeficiency virus (SIV)/macaque model of AIDS has been extensively studied and provides the most accurate reflection of HIV pathogenesis across all available animal models (5, 13–16). Virus-specific antibodies are abundantly produced during the course of HIV/SIV infection in humans or macaques (5, 17–20). These antibodies, which primarily target the envelope glycoprotein (Env) on the surface of HIV/SIV virions, gener-ally do not provide protection due in part to their inability to neutralize the virus. In fact, the Env surface glycoprotein employs multiple strategies to shield its neutralization-sensitive epitopes, such as the CD4 and the coreceptor-binding sites, as well as the fusion peptide on the Membrane Proximal External Region (MPER). Env trimer oligomerization, the presence of hyper-variable loops and extensive glycosylation are all components of a complex escape mechanism that limits the potential potency of antibody-mediated neutralization (5, 21).

Despite this, many potent HIV neutralizing antibodies have been isolated and characterized (22–26). Unique structural fea-tures of these antibodies, including extensive somatic mutations and unusually long VH domain complementarity determining region three (VH CDR3), have been associated with the develop-ment of potent neutralizing activity (19). However, roles for the VL domains and other complementarity determining regions in conferring potent HIV neutralizing activity have not been excluded (27–29).

Despite the extensive knowledge gained from studying this panel of naturally occurring neutralizing antibodies, stimulating the development of broadly neutralizing antibodies (bnAbs) by vaccination has remained a critical roadblock in the

development of an efficacious HIV vaccine. The failures of past vaccine immunogens to do so have been, in part, attributed to the inability of the various vaccine antigens to engage B cells that express the proper germline receptors (30–33). However, progress in this area is being made. For example, very recently anti-HIV bnAbs were elicited in the cow (Bos taurus), an unusual experimental animal for HIV-related research (34). The authors were motivated to take this approach due to the inherently long VH CDR3, which characterize cow antibodies. The rarity of such long VH CDR3 in the human IG repertoire raises the question of whether induction of such long VH CDR3 bnAbs will ever be achieved in humans via vaccination. In light of these questions and challenges, many have also turned their focus to the possibility of eliciting protective non-neutralizing antibodies (nnAbs), which may also be capable of preventing HIV infection. Interestingly, the most successful clinical HIV vaccine trial to date, RV144, found a correlation with limited protection and non-neutralizing Env-binding antibodies that target the V1-V2 loop (35–38). Similarly, the most successful pre-clinical HIV vaccine, which uses live-attenuated strains of SIVmac239 found only low titers of neutralizing antibodies in protected macaques; suggesting instead a role for nnAbs in this model (39). Unlike nAb activity, which appears to be conferred by a single or few dominant, protective monoclonal antibodies (mAbs) in a given individual; the characterization of protec-tive nnAbs will likely require large-scale analysis of polyclonal antibodies.

We have previously reported the construction of a phage display library from a SIV-infected rhesus macaque (11). This library was generated by PCR amplification of the VH and VL chains using primers corresponding to the human IG gene sequences. Our prior screening of mAbs from this library relied on handpicking bacterial clones after biopanning with SIV Env gp140, followed by Sanger sequencing. This approach was inef-ficient and generated only a few dozen sequences for analysis, likely severely underrepresenting the repertoire present. The screening of this library would be greatly improved with the use of next generation sequencing (NGS) technologies, such as Illumina, however often NGS platforms are limited in the length of the reads (≤400 bp), covering at most one VH or VL domain per read. This limitation has been addressed through the use of alternate high throughput NGS platforms such as the Ion Torrent Personal Genome Machine (PGM) S5 and the Pacific Bioscience (PacBio) RSII and Sequel systems (40, 41). Using the PGM-S5 system, He et al. generated 900 bp sequencing reads to identify precursors and lineage intermediates of HIV-1 bnAbs from a phage display library. Although single chain fragment variables (scFvs) from this combinatorial library did not nec-essarily represent authentic VH–VL pairing, the authors were

Keywords: antibody, simian immunodeficiency virus, rhesus macaque, PacBio sequencing, single chain fragment variable library, phage display,

SMRT sequencing with the IMGT/HighV-QUEST querying tool will facilitate and expedite our understanding of polyclonal antibody responses during SIV infection and may serve to rapidly expand the known scope of macaque V genes utilized during these responses.

international imMunogeneTics information system/highV-QUesT

Abbreviations: SIV, simian immunodeficiency virus; Env, envelope; scFv, single

(4)

able to validate their data using biopanning onto a native-like gp140 trimer and comparison with previously character-ized bnAb lineages (42–49). Using the PacBio RSII system, Hemadou et al. generated long reads (>800 bp) covering full length scFvs following in  vivo panning in an animal model of atherosclerosis. Subsequently, they analyzed the sequenc-ing data ussequenc-ing International ImMunoGeneTics information system (IMGT)/HighV-QUEST combined with a novel scFv functionality tool for simultaneous characterization of VH and VL chains from individual scFvs. Here, using our previously characterized phage display library (11), we tested the valid-ity of PacBio sequencing and IMGT/HighV-QUEST analysis (www.imgt.org) combined with scFv functionality for the identification and characterization of SIV-specific antibodies.

MaTerials anD MeThODs

Phage Display library Preparation

Construction of a scFv phage display library using archived spleen biopsies from a SIV-infected rhesus macaque (Mm333-95) has been previously reported (11, 50). The animal was housed at the New England Primate Research Center of Harvard Medical School, and given care in accordance with standards of the Association for Assessment and Accreditation of Laboratory Animal Care and the Harvard Medical School Animal Care and Use Committee. The study was approved by the Harvard Medical Area Standing Committee on Animals, within the Office for Research Subject Protection at Harvard Medical School, and conducted according to the principles described in the Guide for

the Care and Use of Laboratory Animals (51).

In the current study, we aimed to evaluate our sequencing and analysis pipelines for the characterization of SIV-specific antibodies. For that reason, the previously generated library was used (11) as it allowed for the most direct comparison of the new pipelines with the prior gold standard method of handpicking colonies representing selected SIV-specific scFvs and screening by Sanger sequencing.

In the first round, antibody variable domains, VH and VL, were amplified by PCR using oligonucleotide primers corre-sponding to the human IG sequences. In the second round, VH and VL products were linked together using external primers corresponding to the 5′ (RSC-F: gaggaggaggaggaggaggcggggc-ccaggcggccgagctc) and 3′ (RSC-B: gaggaggaggaggaggagcctggc-cggcctggccactagtg) regions of the VL and VH PCR products, respectively. This fusion was facilitated by the addition of a linker sequence to the internal PCR primers corresponding to 3′ and 5′ regions in VL and VH PCR products, respectively. The resultant scFv (VL-linker-VH) products were cloned into the phagemid vector pComb3xSS. XL1 Blue Escherichia coli were transformed with recombinant pComb3xSS-scFv DNA by electroporation using a Gene Pulser Xcell (Bio-Rad, Hercules, CA, USA). The phage library preparation was obtained after amplification by culturing in the presence of VCSM13 helper phage (Agilent, Santa Clara, CA, USA). Biopanning of the library using immobilized SIV Env gp140 was also previously described (11).

scFv Dna Preparation from library

and sub-library

Total DNA was extracted from the bacterial pellets obtained dur-ing preparation of the unpanned library and the fourth round SIV Env gp140-panned library using the PureLink DNA maxi-preparation kit (Invitrogen, Carlsbad, CA, USA). scFv DNA was then PCR amplified from the extracted DNA using the same external primers (RSC-F and RSC-B) that had been selected for initial library construction. Samples were amplified in triplicate, with three different concentrations (25, 50, and 75 ng) of DNA as initial template input. High-fidelity Platinum SuperFi polymerase (Life Technologies, Carlsbad, CA, USA) was used to prepare the reaction mixture as follows: 1 µl template DNA (25, 50, or 75 ng), 2 µl (200 pmol) 5′ Primer (RSC-F), 2 µl (200 pmol) 3′ Primer (RSC-B), and 45 µl Platinum SuperFi Master mix. PCR reactions were performed under the following conditions: heated to 94°C for 5 min, subjected to 15 cycles of: 94°C for 15 s, 56°C for 15 s, 72°C for 2 min, followed by a 10-min extension at 72°C. Five microliters of each reaction were evaluated for successful amplification on a 1% agarose gel. The triplicate reactions were then pooled together before performing a PCR cleanup using the QIAquick PCR purification Kit (Qiagen, Valencia, CA, USA).

sMrTbell library Preparation

and sequencing

(5)

iMgT/highV-QUesT analysis

International ImMunoGeneTics information system/HighV-QUEST analysis was performed via the IMGT web portal (53–57). The CCS FASTA files were analyzed using IMGT/HighV-QUEST program version 1.5.5 with the advanced scFb functionality (57). Resultant data files obtained from IMGT/HighV-QUEST were further analyzed using the statistical and clonotype analysis tool that uses the IMGT/V-QUEST version 3.4.7 with advanced scFv functionality (57). An IMGT clonotype (AA) (55) is defined as a unique V-(D)-J rearrangement (with the IMGT gene and allele names determined by IMGT/HighV-QUEST at the nucleotide level) and a unique CDR3-IMGT AA in-frame junction sequence (C104, W118 for IGHV and C104, F118 for IGKV and IGLV) (see IMGT/HighV-QUEST documentation). Data filtering was applied with the following criteria to be fulfilled for each of the two V domains: (i) >85% of identity of the V-REGION of the V domain with the V-REGION of the closest germline IMGT gene and allele (58) and (ii) in-frame V-(D)-J junction. Filtered sequences were then analyzed to identify the closest V, D (for VH) and J IMGT genes and alleles, in order to characterize the amino acid (AA) junction and to give a complete descrip-tion of the scFv with IMGT labels, using the IMGT/V-QUEST algorithm for scFv, implemented in IMGT/HighV-QUEST. Here, we report the statistical and IMGT clonotype analysis for each domain for the unpanned library (Pan0) and the fourth round SIV Env gp140 panned library (Pan4). In order to ensure that the breadth and depth of the data generated by a single SMRT cell per sample was not limiting the characterization of the complex scFv pools, a comparative analysis was done pooling data gener-ated from three SMRT cells run for a single sample.

scFv recovery from the sub-library

Two clones were selected from the fourth round of gp140-panning based on extent of VH CDR3 representation among the sequences generated by IMGT analysis. In addition, these clones also contained a new VH CDR3 AA sequence compared to previously identified clones (11). Recovery of scFv from libraries has previously been described (59). Briefly, overlapping primers were design within the VH CDR3. For clone selection one (S1), the 5′ primer (S1-F: GCG AGA GGC TCC AAA CAA TTT TGT AGT) and 3′ primer (S1-R: ACT ACA AAA TTG TTT GGA GCC TCT CGC) were used while for clone selec-tion two (S2), the 5′ primer (S2-F: CCT CTC CCC GAC TGG GCT GAT TAT AAG) and 3′ primer (S2-R: CTT ATA ATC AGC CCA GTC GGG GAG AGG) were selected. PCR was con-ducted as described above, using the Platinum HiFi Mastermix (Life Technologies, Carlsbad, CA, USA). 1 µl of DpnI digested product was used to transform TOP10 F′ E. coli (Thermo Fisher Scientific, Waltham, MA, USA) and the subsequently transformed bacteria were plated on LB/Agar medium supplemented with 50  µg/ml of Carbenicillin (AmericanBio, Natick, MA, USA). Positive clones were verified by Sanger sequencing. The recovered sequence was also analyzed using IMGT/V-QUEST(www.imgt. org) with scFv functionality, as described above.

Following this selective cloning, binding specificity to SIV Env gp140 of these clones were confirmed after induction of the bacterial culture as previously described (8, 11, 20). Briefly,

bacterial colonies were cultured with SB medium supplemented with 50 µg/ml of Carbenicillin (AmericanBio, Natick, MA, USA). After 5–8 h of culture at 37°C and 250 rpm, scFv expression was induced by addition of Isopropyl β-D-1-thiogalactopyranoside (IPTG) at 2 mM. The induction was performed at 37°C over-night. Culture supernatant was clarified by centrifugation at 12,000 rpm in a microcentrifuge for 5 min. 50 µl of supernatant was tested by enzyme-linked immunosorbent assay (ELISA) for binding to SIV Env gp120 and/or gp140. ELISA plates were also coated with BSA as a negative control antigen.

resUlTs

sMrT sequencing

We have previously described the construction of a phage display library derived from a rhesus macaque infected with SIV. This work generated 32 unique scFv sequences after four rounds of panning onto SIV Env gp140, all of which were characterized to target the gp41 region of Env (11). Here, we aimed to perform a large-scale deep sequencing-driven analysis of the same scFv library and compare the diversity of the unpanned library (Pan0) to the library after round 4 of gp140 selection (Pan4). DNA extracted from bacterial pellets obtained after Pan0 and Pan4 culturing were used as templates to amplify the collection of scFv sequences under examination (Figure 1).

(6)

FigUre 1 | PCR amplification of scFv libraries. scFv DNA was amplified using human IG-specific PCR primers. (a) Triplicate amplified samples were generated

using 25, 50, and 75 ng of input DNA template for Pan0_1, Pan0_2, or Pan0_3 and Pan4_1, Pan4_2, or Pan4_3, respectively. Pan0 correspond to the unpanned library while Pan4 represents fourth round selection onto gp140. (B) DNA traces generated using the DNA12000 kit on the Agilent Bioanalyzer and represent the

distribution of scFv amplicons based on size. (c) Key steps in SMRT sequencing, IMGT/HighV-QUEST and clonotype analysis. (D) PCR amplification of selected

scFv clones chosen for functional validation, S1 and S2, followed by Dpn1 digestion. T, control non-amplified and non-digested DNA; L, 1 Kb plus DNA ladder; bp, base pair; FU, florescence units; scFv, single chain fragment variable; SMRT, single molecule, real-time.

scFv recovery

A manual inspection of the VH CDR3 AA sequences in the newly assigned clonotypes allowed for the verification of the presence of clonotypes previously identified by handpicking and Sanger screen-ing of clones obtained from Pan4 (11). As more than 100 additional clonotypes were observed, the validity of the PacBio-generated sequences was verified by testing the SIV-specificity of two newly identified clones belonging to two different IMGT IGHV clono-types. Of the selected (S1 and S2) clonotypes, S1 IGHV was not abundantly represented in the pooled data, with only 3 sequences observed, while S2 IGHV was represented at a higher proportion, with 102 sequences. Two S1 and S2 clones were recovered by PCR using DNA Pan4 as the template (Figure 1D). The identities of the recovered clones were confirmed by Sanger sequencing and the clonotype characteristics were analyzed using IMGT/V-QUEST with scFv functionality (Table 2). Somatic hypermutation (SHM) frequencies were determined using nucleotide identity with the

closest germline as obtained with IMGT/V-QUEST. To confirm functional activity, the recovered clones were evaluated by ELISA to characterize binding to SIV Env (Table 2).

V-D-J assignments

Data files derived from the IMGT/HighV-QUEST analysis of the PacBio CCS2 reads were submitted to the statistical and IMGT clonotype analysis using the new scFv functionality tool on the IMGT/HighV-QUEST web portal (45) (www.imgt.org). The sequence analysis pipeline is provided in Figure 1C.

IGHV

(7)

TaBle 2 | Characteristic of two scFv clones recovered from Pan4.

scFv s1 s2

IGHV gene and allele IGHV1-1*01 F IGHV3-5*01 F

IGHV SHM (%) 20.7 7.14

VH CDR3 AA sequence ARGSKQFCSSSYCSVGFDY AAEPLPDDWADWADYKKYGLDY

IGLV gene and allele IGLV2S1*01 F IGLV1-10*01 F

IGLV SHM (%) 5.65 6.07

V-LAMBDA CDR3 AA sequence SSYAGSNTFLF AAWDDSLSGWIF

Target gp41 gp41

V genes and alleles, and complementarity determining regions 3 (VH CDR3 and V-LAMBDA CDR3) were determined using IMGT/V-QUEST with scFv functionality (57). Somatic hypermutation frequencies [SHM (%)] represent the nucleotide sequence divergence from the closest germline in IMGT/V-QUEST (54). scFv binding specificity was determined by enzyme-linked immunosorbent assay (ELISA).

F, functional; AA, amino acid.

TaBle 1 | IGHV, IGKV, and IGLV genes, sequence number and IMGT clonotype (AA) assignment.

Pan0_1 Pan0_2 Pan0_3 Batch Pan0

Pan4_1 Pan4_2 Pan4_3 Batch Pan4

PacBio Output seq Nb 8,706 7,335 11,599 27,640 15,036 14,986 13,231 43,253

Nb of seq assigned to an IMGT clonotype (AA) for IGHV 3,837 3,159 5,238 12,269 4,594 4,593 3,884 13,087

Nb of different IMGT clonotypes (AA) for IGHV 1,887 1,685 2,569 5,313 135 141 127 247

Nb of out-of-frame seq for IGHV 117 90 115 322 106 115 80 301

Nb of seq of other categories for IGHV 112 76 103 292 1 0 0 1

Nb of seq assigned to an IMGT clonotype (AA) for IGKV 3,511 2,818 4,640 11,205 6,957 7,062 5,974 19,995

Nb of different IMGT clonotypes (AA) for IGKV 1,280 1,099 1,651 3,263 147 168 150 280

Nb of out-of-frame seq for IGKV 126 96 172 409 216 248 214 678

Nb of seq of other categories for IGKV 54 29 50 137 2 6 3 11

Nb of seq assigned to an IMGT clonotype (AA) for IGLV 3,110 2,531 4,068 9,910 7,484 7,252 6,504 21,502

Nb of different IMGT clonotypes (AA) for IGLV 1,268 1,077 1,573 3,205 588 548 535 1,078

Nb of out-of-frame seq for IGLV 209 169 235 641 581 570 488 1,639

Nb of seq of other categories for IGLV 67 60 111 240 18 22 27 67

Filtered CCS reads (≥99.9% accuracy and 700–1,200 bp in length) generated from SMRT sequencing data were analyzed using IMGT/HighV-QUEST in order to determine V, D,

J gene sequence, allele assignment and functionality (53, 54). IMGT/HighV-QUEST output files were processed through IMGT statistical and clonotype analysis to determine the number of assigned sequences and clonotypes. The number of sequences assigned to an IMGT clonotype corresponds to in-frame sequences [C104, W118 for VH and C104, F118 for VL (V-KAPPA or V-LAMBDA)] from “1 copy” + “More than 1,” “Single allele” and “Several alleles” (or genes) (41). Number of in-frame “other categories” sequences were not assigned to a “single gene.” An IMGT clonotype (AA) was defined by a unique V-(D)-J rearrangement (with IMGT gene and allele names determined by IMGT/HighV-QUEST at the nucleotide level) and a unique CDR3-IMGT AA in-frame junction sequence (55, 56). Pan0 correspond to the unpanned library while Pan4 represent fourth round of selection onto SIV Env gp140. Batch0 and Batch4 represent the analysis of pooled data from three independent SMRT cells for Pan0 and Pan4, respectively. Nb, number; seq, sequence. PCR template concentration of 25, 50, and 75 ng are represented by Pan0_1, Pan0_2, and Pan0_3 or Pan4_1, Pan4_2, and Pan4_3, respectively.

IGHV3-9, IGHV3-10, IGHV3-11, IGHV3-12, IGHV3-14, IGHV3-21, and IGHV3-22) for IGHV3 (Figure 2A). Individually, IGHV4-2 was the most represented gene in the Pan0 dataset, with 1,734 (33.14%) assigned sequences. From a subgroup perspective, IGHV3 was the most represented with a total of 2,165 (41.31%) assigned sequences. Within IGHV3, three genes, IGHV3-6, IGHV3-9, and IGHV3-7, were the most abundant with 562, 498, and 462 assigned sequences, respectively. Overall, the number of assigned IMGT clonotypes was proportional to the number of sequences, as observed for IGHV4-2 (36.97%), IGHV3-6 (9.71%), IGHV3-9 (10.37%), and IGHV3-7 (8.54%).

For Pan4, a total of 9 IGHV genes were observed, compared to 14 in Pan0. IGHV3 remained the most represented subgroup with 6 genes (IGHV3-5, IGHV3-6, IGHV3-7, IGH3-9, and IGHV3-14), followed by IGHV2 with 2 genes (IGHV2-1 and IGHV2-2) and IGHV1 and IGHV4 with a single gene each (IGHV1-1, IGHV4-2) identified. The ranking in allelic representation was generally similar, with the striking exception of a switch between the 2 most represented genes, including 3,814 (83.05%) sequences for IGHV3-9 and 671 (14.61%) sequences for IGHV4-2. IGHV3-6

and IGHV3-7, which were well represented in Pan0, were reduced to only 2 (0.04%) and 35 (0.76%) sequences, respectively, in Pan4. IGHV3-5 was the third most represented in Pan4 with 49 (1.07%) sequences. The number of different IMGT clonotypes was again observed to be relatively proportional to the number of assigned sequences. However, IGHV3-9 showed less diversity with 42 (31.57%) distinct IMGT clonotypes compared to IGHV4-2 with 58 (43.60%) different clonotypes.

(8)

FigUre 2 | Number of IGH sequences and clonotypes, per IGHV, IGHD, and IGHJ gene, in Pan0 and Pan4 single chain fragment variable libraries. A International

ImMunoGeneTics information system (IMGT) clonotype [amino acid (AA)] was defined as a unique V-(D)-J rearrangement using IMGT gene and allele names as determined by IMGT/HighV-QUEST at the nucleotide level, and a unique, in frame variable heavy CDR3-IMGT AA junction (C104, W118) (56). (a) IGHV; (B) IGHD; (c) IGHJ. Nb: number.

dominated by IGHD2-2, which was comprised of 3,948 (85.93%) sequences. IGHD6-1 and IGHD4-2 were the next most abundant genes present with only 2.72 and 2.31% of the assigned sequences, respectively. Despite the enriched representation in regards to sequence number, the IGHD2-2 clonotypes seen were not par-ticularly diverse, encompassing only 22 (16.29%) of the detected differential IMGT clonotypes, while IGHD6-1 and IGHD4-2 were represented by 11 (8.15%) and 8 (5.92%) clonotypes, respec-tively. Surprisingly, IGHD3-3, which had only 67 (1.45%) assigned sequences, included the most variable selection of IMGT clonotypes at 23 (17%).

Similarly, a diverse collection of IGHJ genes was observed in Pan0 (Figure 2C). IGHJ4 was the most abundantly represented with 2,491 (47.57%) sequences. The next most abundant were IGJ5-1 and IGHJ6 with 1,108 (21.16%) and 891 (17%) sequences, respectively. As was observed for prior Pan0 analysis, the num-ber of different IMGT clonotypes was proportional to the most represented genes. However, in regard to Pan4, IGHJ genes were

again highly enriched for the IGHJ5-1 gene, represented by 3,978 (86.59%) of the assigned sequences while IGHJ4 was only a minor group with 433 (9.42%) assigned sequences. Overall, the variety of clonotypes seen was generally low, but the most represented genes included the majority of different clonotypes.

(9)

FigUre 3 | Number of IGH sequences and clonotypes distributed by VH CDR3 length. VH CDR3-IMGT length was determined by IMGT/HighV-QUEST in

productive (those with in-frame junctions, C104, W118) scFv sequences. The numbers of clonotypes per VH CDR3 length are plotted in Pan0 and Pan4 libraries. Nb: number.

different clonotypes contained within these VH CDR3 size bins were usually low. For example, the 20 AA-long VH CDR3, which represented 82.04% of the total assigned sequences, showed relatively poor diversity with only 33 (24%) of the different IMGT clonotypes represented. This was followed by the next most clonotypically diverse VH CDR3 length bins, including 15 AA CDR3 (11.11%) and 18 AA CDR3 (13.33%). Consequently, we observed a more diversified range of VH CDR3 lengths when considering clonotype distribution rather than the simple num-ber of sequences (Figure 3) per VH CDR3 AA length.

IGKV

A diversified representation of IGKV genes was observed within Pan0, which identified 69 unique IGKV genes (Figure  4A). The most abundantly represented were 8, 9, IGKV3-3, and IGKV2S4 with 732 (16.24%), 408 (9.05%), 383 (8.50%), and 316 (7.01%) sequences assigned, respectively. As previously observed with IGHV, the number of different IMGT clonotypes was directly proportional to the gene abundance described above. A less diverse pool of 23 IGKV genes was obtained from the Pan4 data. These genes were dominated by the IGKV3-8 and IGKV3-9 genes with 4,525 (65.04%) and 2,043 (29.37%) of assigned sequences, respectively. The next most abundant gene characterized was IGKV7-1 with 163 (2.34%) assigned sequences. The number of different clonotypes was proportional and dominated by IGKV7-1 and IGKV1-14 with 82 (55.78%) and 16 (10.88%), respectively; IGKV7-1 contained 13 (8.84%) different clonotypes.

Five different IGKJ genes (1–5) were detected in Pan0 (Figure 4B), with the most abundant being IGKJ2, IGKJ4, and IGKJ1 with 1,993 (43.03%), 1,263 (27.26%), and 1,227 (26.49%),

respectively. The number of different clonotypes observed was also highest for IGKJ2, IGKJ4, and IGKJ1 genes with 629 (38.21%), 549 (33.35%), and 390 (23.69%), respectively. For Pan4, 4 IGKJ genes were detected (Figure 4B), with IGKJ2 being the most abundant with 6,596 (94.83%) assigned sequences. A simi-lar abundance was observed for the number of different IMGT clonotypes within IGKJ2, which was shown to have 102 (69.86%) assigned clonotypes.

For Pan0, V-KAPPA CDR3 lengths ranged from 5 AA to 29 AA long (Figure 5), with the most represented lengths being 9 and 8 AA, comprising 3,690 (79.52%) and 794 (17.11%) assigned sequences, respectively. A similar relationship was observed for the number of different IMGT clonotypes within these V-KAPPA CDR3 length bins, with 1,390 (29.95%) and 185 (11.2%) for 9 and 8 AA-long CDR3, respectively. In Pan4, the observed V-KAPPA CDR3 were between 8 and 10 with 9 and 8 AA-long CDR3 lengths dominating the pool as both the most represented in sequence space, as well as the most diverse, containing 122 (83.00%) and 23 (15.64) different clonotypes, respectively.

IGLV

(10)

FigUre 4 | Number of IGK sequences and clonotypes, per IGKV and IGKJ gene, in Pan0 and Pan4 scFv libraries. An IMGT IGKV clonotype (AA) was defined as a

unique V-J rearrangement using IMGT gene and allele names as determined by IMGT/HighV-QUEST at the nucleotide level, and a unique, in frame V-KAPPA CDR3-IMGT AA junction (C104, F118) (56). (a), IGKV; (B), IGKJ. Nb: number.

in Pan4 with 2,350 (31.44%) assigned sequences. IGLV6-5, which represented only 0.71% of the scFv from the Pan0 library, was the most abundant sequence observed in Pan4 with 2,939 (39.33%) of the assigned sequences. As seen within the other IG genes, the number of clonotypes was proportional to the relative representa-tion of each gene in the pool.

Six different IGLJ genes were identified within the Pan0 library (Figure 6B). IGLJ6 and IGLJ7 were the most dominant, with 1,708 (42.49%) and 793 (19.73%) sequences assigned, respectively. In the Pan4 library, 5 different IGLJ genes were observed, missing only the presence of the IGLJ4 gene as compared to the Pan0 library. Despite this similar level of diversity, the majority of sequences were IGLJ3 and IGLJ6 with 3,511 (47.09%) and 1,636 (21.94%), respectively. Once again, the number of clonotypes was again proportional to the most represented genes.

Single chain fragment variable from the Pan0 library contai- ned V-LAMBDA CDR3 lengths ranging from 8 to 13 AA (Figure 7), with the majority of CDR3 lengths in the pool com-prised of 11 (2,308 sequences, 56.73%), 10 (1,013 sequences, 24.90%), and 9 (608 sequences, 14.94%) AA. V-LAMBDA CDR3

lengths within the Pan4 library were observed to vary from 9 to 28 AA, but were dominated by CDR3 lengths of 11 and 10 AA, representing 4,443 (59.36%) and 2,988 (39.92%) of the assigned sequences, respectively. In both Pan0 and Pan4, the overall number of clonotypes was proportional to the assigned genes sequences; specifically in Pan4 the 11 AA V-LAMBDA CDR3 group contained the highest number of diverse clonotypes, followed by those with 10 AA CDR3, with 473 (80.44%) and 93 (15.81%) respectively.

VH-VL Combinations

(11)

FigUre 5 | Number of IGK sequences and clonotypes, distributed by

V-KAPPA CDR3 length. V-KAPPA CDR3-IMGT length was determined by IMGT/HighV-QUEST in productive (those with in-frame junctions, C104, F118) single chain fragment variable sequences. The numbers of clonotypes per V-KAPPA CDR3 length are plotted in Pan0 and Pan4 libraries. Nb: number.

including 76 IGLV2S1, 8 IGLV6-5, 5 IGKV3-8, 3 IGLV1-15, 2 IGLV1S6, and 1 sequence assigned to other genes (IGKV3-9, IGLV1-10, IGLV1-15, IGLV1-7, IGLV1S3, IGLV2-3, IGLV2-7, and IGLV6-1).

Batch Analysis of Triplicate SMRT Cell Outputs

When screening a highly variable library, a limitation of the PacBio RSII system is the relatively low read depth per SMRT  cell, as compared to other NGS technologies. We observed this when running single SMRT cells per sample in our experiments. However, this can be countered by running multiple SMRT cells per sample to increase read depth and the ability to detect low level variants within a population. As the IMGT statistical and clonotype analysis tool functions with batched data, including multiple SMRTcells (up to one mil-lion sequences), we performed a batch analysis for triplicate, pooled samples from Pan0 and Pan4 (Table 1). In this batched dataset, for IGHV, the number of sequences assigned to an IMGT clonotype was increased (two- to fourfold) for Pan0 and almost threefold in Pan4 as compared to those sequences gen-erated with a single SMRT cell per sample (Pan0_1, Pan0_2, and Pan0_3 or Pan4_1, Pan4_2, and Pan4_3). The number of IMGT clonotypes was also approximately twofold of that called using single SMRT cell data for both Pan0 and Pan4. Similar increases in the number of sequences and clonotypes were also observed for IGKV and IGLV when combining outputs from triplicate SMRT  cells for both Pan0 and Pan4 (Table 1). These observations highlight the need to consider increasing depth of coverage when screening highly variable libraries using single molecule sequencing approaches, either through batching multiple cells of data or generating sequence data using a higher throughput instrument (i.e., PacBio Sequel system).

DiscUssiOn

Understanding the role of polyclonal nnAbs in the inhibition of HIV-1 infection will require large-scale analyses of virus-specific antibodies. These analyses should evaluate the functionality of the antibodies, as well as characterize their structure, including V-(D)-J gene or allele usage and CDR3 lengths. Techniques that rely on antigen-based selection of single B cells or screening of single colonies following phage display generally result in a low number of sequences and are not always suited to examine the expected level of complexity in such a system. Broadly used NGS technologies, such as Illumina, can generate very high numbers of sequence reads, but are limited in read length and can cover no more than one variable domain (VH or VL) per read, limit-ing data interpretation and the ability to evaluate the impact of specific VH–VL pairing.

Here, we evaluated the ability of SMRT sequencing technology to generate full length scFv sequencing reads for the analysis of phage display libraries. We obtained thousands of highly accurate (≥99.9%) sequence reads, each containing productive VH-VL combinations. As previously reported for this rhesus macaque (11), SIV Env gp140-specific scFv were dominated by the usage of IGHV4-2 and IGHV3-9. However, we observed far fewer IGHV1-1-containing scFvs in the Pan4 library than expected. We also observed more IGHV3-9-contaning scFv sequences than those with IGHV4-2, despite the latter being more abundant in the unpanned library. Together, these data suggest that the SMRT sequencing approach will be useful for the characterization of scFv libraries as more comprehensive rhesus V gene databases are being developed (60, 61).

The libraries examined were very diverse and included a great number of distinct clonotypes that were, in general, proportional to the number of sequences obtained for each gene. Interestingly, SIV Env gp140-specific scFvs were characterized by a more lim-ited number of clonotypes, as has been previously reported for both HIV-1 and SIV antibodies (12, 22, 23, 32, 62). Specifically, the most dramatic examples were observed with IGHD and IGHJ genes, in which more than 85% of sequences contained IGHD2-2 and IGHJ5-1. It will be necessary and informative to determine whether these patterns are similar in other experimentally infected animals, particularly in those immunized with SIVmac239Δnef, which show protection from SIVmac239 challenge infection. Though less pronounced in terms of clonotype distribution, SIV Env gp140-specific scFvs presented an average of VH CDR3 size of 20 AA (representing around 80% of total sequences), which was far greater than that seen in the Pan0 library, containing the same proportion of sequences for VH CDR3 ranging between 5 and 17 AA. It is difficult to interpret the relevance of these results to nnAbs in particular, as the plasma of this rhesus macaque also contained a high titer of neutralizing antibodies. Understanding the impact of VH CDR3 length on antibody functionality and disease outcome will definitely require a similar analysis across a greater number of animals.

(12)

FigUre 6 | Number of IGL sequences and clonotypes, per IGLV and IGLJ gene, in Pan0 and Pan4 single chain fragment variable libraries. An International

ImMunoGeneTics information system (IMGT) IGLV clonotype (AA) was defined as a unique V-J rearrangement using IMGT gene and allele names as determined by IMGT/HighV-QUEST at the nucleotide level, and a unique, in frame V-LAMBDA CDR3-IMGT AA junction (C104, F118) (56). (a), IGLV; (B), IGLJ. Nb: number.

represented genes in the Pan0 library, with the exception of IGLV6-5, which showed a ~40-fold increase in representation in the SIV Env gp140-specific scFv library, compared to that from the total, unpanned IG repertoire. Due to the combinatory nature of VH-VL association of our phage display libraries, further analyses will certainly be required to clarify and understand the potential con-tribution of diverse IGKV and IGLV genes to the overall antibody functionality.

It has also been observed that the IGHV1 (34.4%) and IGHV4 (26.15%) subgroups were the most dominant among naïve IgM

(13)

FigUre 7 | Number of IGL sequences and clonotypes, distributed by

V-LAMBDA CDR3 length. V-LAMBDA CDR3-IMGT length was determined by IMGT/HighV-QUEST in productive (those with in-frame junctions, C104, F118) single chain fragment variable sequences. The numbers of clonotypes per V-LAMBDA CDR3 length are plotted in Pan0 and Pan4 libraries. Nb: number.

27.89% and BM 28.32%) were the second most dominant (63). During the more advanced (8 month) stage of infection, IGHV3 (57.10% in PBMC, 59.31% in BM IgM libraries) and IGHV1 (53.03% in PBMC and 46.29% in BM IgG libraries) were the most dominant while IGHV4 and IGHV1 (respectively, 17.18% in PBMC and 15.98% in BM IgM libraries), and IGHV3 (22.96% in PBMC, 29.15% in BM IgG libraries) were the second most dominant (63). The complexity of these data suggest that analyses of multiple animals and infection time points are needed, as well as a comparative analyses of these data compared to those gener-ated using scFv libraries constructed with rhesus specific-primers to ensure that any rhesus-specific IG populations are included (9, 12). These additions, as well as leveraging the growing rhesus databases (60, 61) will be necessary moving forward to ensure the most comprehensive analysis of polyclonal antibody responses during SIV infection in rhesus macaques.

There are several caveats that are worth noting regarding the approaches used to generate the V-(D)-J composition and CDR3 lengths of SIV-specific antibodies identified and characterized in this study.

The first caveat surrounds the relatively low read depth gene- rated by the SMRT sequencing approach. As mentioned ear-lier in the text, He et al. generated over one million raw reads (750–950  bp) using the PGM-S5 system in a similar study investigating HIV-specific scFv antibodies. Interestingly, these data demonstrated similar sequence output size from the IMGT statistical and clonotype analysis. Hemadou et al. analyzed more than 450,000 reads using a SMRT sequencing strategy, but by combining the data from 15 SMRT cells to increase depth. In the current work, we did observe an increase (two- to fourfold) of individual sequences or clonotypes called when 3 SMRT cells of data were combined prior to clonotype analysis. While the current data are already hypothesis generating, the increased information

gained by increasing read depth underscores the need for future, similar studies to batch data from multiple RSII SMRT cells or to move this type of study to the higher throughput Sequel system, which generates 400,000–500,000 single molecule reads per chip, drastically increasing the single molecule read depth available.

The next caveat relates to our use of human-based primers for the construction of phage display library under examination. Overall, the rhesus macaque genome is highly homologous to that of the human (~93%) (64). In particular, Sundling et  al. found a high level of homology (~92%) between the human and macaque functional VH regions, with genes clustered according to family distribution rather than species when examined by phylogenetic analysis (65). However, it is noted that these authors also identified unique macaque-specific antibody gene families. Consequently, the phage display library used in the current study is likely biased toward gene families with high homology to the corresponding human genes. It is the focus of our immediate future work to recapitulate these data and expand upon them with the use of macaque-specific primers (9, 12, 62, 65–67). For example, Dai et al. observed a diverse repertoire of VH CDR3 length by tracking lineages of CD4-binding site-directed mAbs in macaques immunized with an HIV-1 trimer vaccine (62). Our VH CDR3 results differed from these observations, which may be due to this type of primer bias, or due to the fact that our antibodies are targeting the MPER region of Env, rather than the receptor-binding site. In fact, our variable light chain results were comparable to those of Dai et al., where it was observed that the 1 or 2 CDR3 AA length was most abundant for V-KAPPA CDR3 and V-LAMBDA CDR3. A related caveat pertains generally to the characterization of all combinatorial phage display approaches, mainly that our libraries do not represent authentic VH–VL pairing. Methods to clone paired VH-VL regions from single macaque B cells have been developed (9, 66). These studies relied on Sanger sequencing while other macaque-based studies did not apply NGS to characterize authentic VH–VL pairs (62, 65). Moving forward, the use of combinatorial phage display tech- nology should be complemented with single cell-based method-ologies, including droplet emulsion-based approaches, to resolve relevant VH–VL pairing (68–72).

(14)

gp120-specific antibodies from the same animal examined in this current study (11). Most importantly, we are currently working with IMGT to expand their macaque IG germline database, as to eliminate this caveat in future studies. To note, Dai et al. also observed simi-lar patterns of VH CDR3 lengths and SHM levels between IMGT and their own “CS germline-gene database” results. Here, we also observed a wide range of VH CDR3 lengths in the context of differential clonotype identities, which is probably more relevant for functional antibody classification.

Lastly, engineering of native-like SIV Env scaffolds may be necessary for the identification of highly functional anti- bodies, particularly those with potent neutralization activity. For instance, we have previously demonstrated the relative stability of non-engineered soluble SIVmac239 Env gp140 trimer and its abil-ity to deplete SIV-infected macaque plasma of antibodies with neu-tralization activity against the neuneu-tralization sensitive SIVmac316 isolate (20). The depletion was less effective against neutralization resistant SIVmac239, although this was expected as the plasma did not display much neutralization activity against SIVmac239 in the first place (20). Subsequent follow up proved challenging, as we were unable to isolate potent SIV neutralizing antibodies although the phage display library had been constructed from an SIV-infected macaque with unusually high neutralization titers (11). This failure may have be due to the use of soluble SIVmac239 Env gp140 trimer, which likely would be improved through specific engineering as has been described for HIV-1 (22, 23, 30, 31, 33, 73).

aUThOr cOnTriBUTiOns

IF, RS, MLS, and M-PL designed the study and wrote the manuscript. IF, SH, DH, BC, WC, and MS performed scFv DNA preparation. AA and GD performed PacBio the SMRT sequencing. IF performed scFv recovery from sublibrary. SK, PD, and M-PL developed and assisted with the IMGT analysis tools and web portal (www.imgt.org). All other authors have read and approved the final manuscript.

acKnOWleDgMenTs

The authors would like to thank Arthur Lavoie and Karthik Kalyan for maintaining the IMGT/HighV-QUEST web portal and the entire IMGT team in developing databases and tools.

FUnDing

IF was supported by a Boston college fund for undergraduate research. This work was granted access to the HPC@LR and to the High-Performance Computing (HPC) resources of the Centre Informatique National de l’Enseignement Supérieur (CINES) and to Très Grand Centre de Calcul (TGCC) of the Commissariat à l’Energie Atomique et aux Energies Alternatives (CEA) under the allocation [036029] (2010-2017) made by GENCI (Grand Equipement National de Calcul Intensif).

reFerences

1. Burbacher TM, Grant KS. Nonhuman primates as animal models for toxi-cology research. Curr Protoc Toxicol (2001) Chapter 1:Unit1.1. doi:10.1002/ 0471140856.tx0101s00

2. Lynch RM, Yamamoto T, Mcdermott AB. HIV vaccine research and discovery in the nonhuman primates model: a unified theory in acquisition prevention and control of SIV infection. Curr Opin HIV AIDS (2013) 8(4):288–94. doi:10.1097/COH.0b013e328361cfe8

3. Geisbert TW, Pushko P, Anderson K, Smith J, Davis KJ, Jahrling PB. Evaluation in nonhuman primates of vaccines against Ebola virus. Emerg Infect Dis (2002) 8(5):503–7. doi:10.3201/eid0805.010284

4. Scanga CA, Flynn JL. Modeling tuberculosis in nonhuman primates. Cold

Spring Harb Perspect Med (2014) 4(12):a018564. doi:10.1101/cshperspect.

a018564

5. Sato S, Johnson W. Antibody-mediated neutralization and simian immuno-deficiency virus models of HIV/AIDS. Curr HIV Res (2007) 5(6):594–607. doi:10.2174/157016207782418515

6. Davis AS, Taubenberger JK, Bray M. The use of nonhuman primates in research on seasonal, pandemic and avian influenza, 1893-2014. Antiviral Res (2015) 117:75–98. doi:10.1016/j.antiviral.2015.02.011

7. Keitel WA, Treanor JJ, El Sahly HM, Evans TG, Kopper S, Whitlow V, et  al. Evaluation of a plasmid DNA-based anthrax vaccine in rabbits, nonhuman pri-mates and healthy adults. Hum Vaccin (2009) 5(8):536–44. doi:10.4161/hv.5.8.8725 8. Barbas CF, Burton DR, Scott JK, Silverman GJ III. Phage Dispaly: A Laboratory

Manual. New York: Cold Spring Harbor Laboratory Press (2001).

9. Sundling C, Phad G, Douagi I, Navis M, Karlsson Hedestam GB. Isolation of antibody V(D)J sequences from single cell sorted rhesus macaque B cells.

J Immunol Methods (2012) 386(1–2):85–93. doi:10.1016/j.jim.2012.09.003

10. Andris-Widhopf J, Steinberger P, Fuller R, Rader C, Barbas CF III. Generation of human scFv antibody libraries: PCR amplification and assembly of light- and heavy-chain coding sequences. Cold Spring Harb Protoc (2011) 2011(9):1139–50. doi:10.1101/pdb.prot065573pdb.prot065573

11. Ita S, Agostinho MR, Sullivan K, Yub Han S, Akleh R, Johnson WE, et al. Analysis of SIVmac envelope-specific antibodies selected through phage

display. AIDS Res Hum Retroviruses (2017) 33(8):869–79. doi:10.1089/AID. 2016.0247

12. Sundling C, Zhang Z, Phad GE, Sheng Z, Wang Y, Mascola JR, et al. Single-cell and deep sequencing of IgG-switched macaque B Single-cells reveal a diverse Ig repertoire following immunization. J Immunol (2014) 192(8):3637–44. doi:10.4049/jimmunol.1303334

13. Johnson WE, Lifson JD, Lang SM, Johnson RP, Desrosiers RC. Importance of B-cell responses for immunological control of variant strains of simian immunodeficiency virus. J Virol (2003) 77(1):375–81. doi:10.1128/JVI. 77.1.375-381.2003

14. Laird ME, Igarashi T, Martin MA, Desrosiers RC. Importance of the V1/V2 loop region of simian-human immunodeficiency virus envelope glycoprotein gp120 in determining the strain specificity of the neutralizing antibody response. J Virol (2008) 82(22):11054–65. doi:10.1128/JVI.01341-08 15. Laird ME, Desrosiers RC. Infectivity and neutralization of simian

immu-nodeficiency virus with FLAG epitope insertion in gp120 variable loops.

J Virol (2007) 81(20):10838–48. doi:10.1128/JVI.00831-07

16. Adnan S, Reeves RK, Gillis J, Wong FE, Yu Y, Camp JV, et  al. Persistent low-level replication of SIVΔnef drives maturation of antibody and CD8 T  cell responses to induce protective immunity against vaginal SIV infection.PLoS Pathog (2016) 12(12):e1006104. doi:10.1371/journal.ppat. 1006104

17. Pantophlet R, Burton DR. GP120: target for neutralizing HIV-1 antibodies.

Annu Rev Immunol (2006) 24(1):739–69. doi:10.1146/annurev.immunol.

24.021605.090557

18. Kwong PD, Mascola JR, Nabel GJ. Broadly neutralizing antibodies and the search for an HIV-1 vaccine: the end of the beginning. Nat Rev Immunol (2013) 13(9):693–701. doi:10.1038/nri3516

19. Kwong PD, Mascola JR. Human antibodies that neutralize HIV-1: identifi-cation, structures, and B Cell ontogenies. Immunity (2012) 37(3):412–25. doi:10.1016/j.immuni.2012.08.012

(15)

21. Mascola JR, Montefiori DC. The role of antibodies in HIV vaccines. Annu Rev

Immunol (2010) 28(1):413–44. doi:10.1146/annurev-immunol-030409-101256

22. Scheid JF, Mouquet H, Ueberheide B, Diskin R, Klein F, Oliveira TY, et al. Sequence and structural convergence of broad and potent HIV antibodies that mimic CD4 binding. Science (2011) 333(6049):1633–7. doi:10.1126/ science.1207227

23. Scheid JF, Mouquet H, Feldhahn N, Seaman MS, Velinzon K, Pietzsch J, et al. Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals. Nature (2009) 458(7238):636–40. doi:10.1038/ nature07930

24. Doria-Rose NA, Schramm CA, Gorman J, Moore PL, Bhiman JN, DeKosky BJ, et al. Developmental pathway for potent V1V2-directed HIV-neutralizing anti-bodies. Nature (2014) 509(7498):55–62. doi:10.1038/nature13036nature13036 25. Wu X, Yang ZY, Li Y, Hogerkorp CM, Schief WR, Seaman MS, et  al.

Rational design of envelope identifies broadly neutralizing human mono-clonal antibodies to HIV-1. Science (2010) 329(5993):856–61. doi:10.1126/ science.1187659science.1187659

26. Walker LM, Phogat SK, Chan-Hui PY, Wagner D, Phung P, Goss JL, et al. Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science (2009) 326(5950):285–9. doi:10.1126/ science.1178746

27. Gallerano D, Cabauatan CR, Sibanda EN, Valenta R. HIV-specific antibody responses in HIV-infected patients: from a monoclonal to a polyclonal view.

Int Arch Allergy Immunol (2015) 167(4):223–41. doi:10.1159/000438484

28. Pejchal R, Doores KJ, Walker LM, Khayat R, Huang PS, Wang SK, et al. A potent and broad neutralizing antibody recognizes and penetrates the HIV glycan shield. Science (2011) 334(6059):1097–103. doi:10.1126/science.1213256 29. West AP, Diskin R, Nussenzweig MC, Bjorkman PJ. Structural basis for

germ-line gene usage of a potent class of antibodies targeting the CD4-binding site of HIV-1 gp120. Proc Natl Acad Sci U S A (2012) 109(30):E2083–90. doi:10.1073/pnas.1208984109

30. McGuire AT, Glenn JA, Lippy A, Stamatatos L. Diverse recombinant HIV-1 Envs fail to activate B cells expressing the germline B cell receptors of the broadly neutralizing anti-HIV-1 antibodies PG9 and 447-52D. J Virol (2014) 88(5):2645–57. doi:10.1128/JVI.03228-13JVI.03228-13

31. Jardine J, Julien JP, Menis S, Ota T, Kalyuzhniy O, McGuire A, et al. Rational HIV immunogen design to target specific germline B cell receptors. Science (2013) 340(6133):711–6. doi:10.1126/science.1234150science.1234150 32. Hoot S, McGuire AT, Cohen KW, Strong RK, Hangartner L, Klein F, et al.

Recombinant HIV envelope proteins fail to engage germline versions of anti-CD4bs bNAbs. PLoS Pathog (2013) 9(1):e1003106. doi:10.1371/journal. ppat.1003106PPATHOGENS-D-12-02139

33. McGuire AT, Hoot S, Dreyer AM, Lippy A, Stuart A, Cohen KW, et  al. Engineering HIV envelope protein to activate germline B  cell receptors of broadly neutralizing anti-CD4 binding site antibodies. J Exp Med (2013) 210(4):655–63. doi:10.1084/jem.20122824jem.20122824

34. Sok D, Le KM, Vadnais M, Saye-Francisco KL, Jardine JG, Torres JL, et al. Rapid elicitation of broadly neutralizing antibodies to HIV by immunization in cows. Nature (2017) 548(7665):108–11. doi:10.1038/nature23301 35. Pollara J, Bonsignori M, Moody MA, Liu P, Alam SM, Hwang KK, et  al.

HIV-1 vaccine-induced C1 and V2 Env-specific antibodies synergize for increased antiviral activities. J Virol (2014) 88(14):7715–26. doi:10.1128/JVI. 00156-14

36. Zolla-Pazner S, deCamp AC, Cardozo T, Karasavvas N, Gottardo R, Williams C, et al. Analysis of V2 antibody responses induced in vaccinees in the ALVAC/ AIDSVAX HIV-1 vaccine efficacy trial. PLoS One (2013) 8(1):e53629. doi:10.1371/journal.pone.0053629

37. Haynes BF, Gilbert PB, McElrath MJ, Zolla-Pazner S, Tomaras GD, Alam SM, et al. Immune-correlates analysis of an HIV-1 vaccine efficacy trial. N Engl

J Med (2012) 366(14):1275–86. doi:10.1056/NEJMoa1113425

38. Karasavvas N, Billings E, Rao M, Williams C, Zolla-Pazner S, Bailer RT, et al. The Thai Phase III HIV Type 1 Vaccine trial (RV144) regimen induces antibodies that target conserved regions within the V2 loop of gp120.

AIDS Res Hum Retroviruses (2012) 28(11):1444–57. doi:10.1089/aid.

2012.0103

39. Alpert MD, Harvey JD, Lauer WA, Reeves RK, Piatak M Jr, Carville A, et al. ADCC develops over time during persistent infection with live-attenuated SIV and is associated with complete protection against SIVmac251 challenge.

PLoS Pathog (2012) 8(8):e1002890. doi:10.1371/journal.ppat.1002890

40. He L, Lin X, de Val N, Saye-Francisco KL, Mann CJ, Augst R, et al. Hidden lineage complexity of glycan-dependent HIV-1 broadly neutralizing antibod-ies uncovered by digital panning and native-like gp140 trimer. Front Immunol (2017) 8:1025. doi:10.3389/fimmu.2017.01025

41. Hemadou A, Giudicelli V, Smith ML, Lefranc MP, Duroux P, Kossida S, et al. Pacific biosciences sequencing and IMGT/HighV-QUEST analysis of full-length single chain fragment variable from an in  vivo selected phage- display combinatorial library. Front Immunol (2017) 8:1796. doi:10.3389/ fimmu.2017.01796

42. Doores KJ. The HIV glycan shield as a target for broadly neutralizing anti- bodies. FEBS J (2015) 282(24):4679–91. doi:10.1111/febs.13530

43. Crispin M, Doores KJ. Targeting host-derived glycans on enveloped viruses for antibody-based vaccine design. Curr Opin Virol (2015) 11:63–9. doi:10.1016/j. coviro.2015.02.002

44. Doores KJ, Kong L, Krumm SA, Le KM, Sok D, Laserson U, et  al. Two classes of broadly neutralizing antibodies within a single lineage directed to the high-mannose patch of HIV envelope. J Virol (2015) 89(2):1105–18. doi:10.1128/JVI.02905-14

45. Walker LM, Simek MD, Priddy F, Gach JS, Wagner D, Zwick MB, et  al. A limited number of antibody specificities mediate broad and potent serum neutralization in selected HIV-1 infected individuals. PLoS Pathog (2010) 6(8):11–2. doi:10.1371/journal.ppat.1001028

46. Mouquet H, Scharf L, Euler Z, Liu Y, Eden C, Scheid JF, et al. Complex-type N-glycan recognition by potent broadly neutralizing HIV antibodies. Proc

Natl Acad Sci U S A (2012) 109(47):E3268–77. doi:10.1073/pnas.1217207109

47. Sok D, Doores KJ, Briney B, Le KM, Saye-Francisco KL, Ramos A, et  al. Promiscuous glycan site recognition by antibodies to the high-mannose patch of gp120 broadens neutralization of HIV. Sci Transl Med (2014) 6:236. doi:10.1126/scitranslmed.3008104

48. Garces F, Lee JH, de Val N, de la Pena AT, Kong L, Puchades C, et al. Affinity maturation of a potent family of HIV antibodies is primarily focused on accommodating or avoiding glycans. Immunity (2015) 43(6):1053–63. doi:10.1016/j.immuni.2015.11.007

49. Kong L, Lee JH, Doores KJ, Murin CD, Julien JP, McBride R, et al. Supersite of immune vulnerability on the glycosylated face of HIV-1 envelope gly-coprotein gp120. Nat Struct Mol Biol (2013) 20(7):796–803. doi:10.1038/ nsmb.2594

50. Sato S, Yuste E, Lauer WA, Chang EH, Morgan JS, Bixby JG, et al. Potent antibody-mediated neutralization and evolution of antigenic escape variants of simian immunodeficiency virus strain SIVmac239 in vivo. J Virol (2008) 82(19):9739–52. doi:10.1128/JVI.00871-08JVI.00871-08

51. National Research Council (US) Committee for the Update of the Guide for the Care and Use of Laboratory Animals. Guide for the Care and Use of

Laboratory Animals. Washington, DC: National Academies Press (2011).

52. Smith ML, Murrell B, Eren K, Ignacio C, Landais E, Weaver S, et al. Rapid sequencing of complete env genes from primary HIV-1 samples. Virus Evol (2016) 2(2):8. doi:10.1093/ve/vew018

53. Alamyar E, Giudicelli V, Li S, Duroux P, Lefranc MP. IMGT/HighV-QUEST: the IMGT® web portal for immunoglobulin (IG) or antibody and T  cell

receptor (TR) analysis from NGS high throughput and deep sequencing.

Immunome Res (2012) 7(1):339. doi:10.1038/nmat3328

54. Alamyar E, Duroux P, Lefranc MP, Giudicelli V. IMGT® tools for the

nucleotide analysis of immunoglobulin (IG) and T  cell receptor (TR) V-(D)-J repertoires, polymorphisms, and IG mutations: IMGT/V-QUEST and IMGT/HighV-QUEST for NGS. Methods Mol Biol (2012) 882:569–604. doi:10.1007/978-1-61779-842-9_32

55. Li S, Lefranc MP, Miles JJ, Alamyar E, Giudicelli V, Duroux P, et al. IMGT/ HighV QUEST paradigm for T cell receptor IMGT clonotype diversity and next generation repertoire immunoprofiling. Nat Commun (2013) 4:2333. doi:10.1038/ncomms3333

56. Lefranc M-P. Immunoglobulin and T  cell receptor genes: IMGT® and the

birth and rise of immunoinformatics. Front Immunol (2014) 5:22. doi:10.3389/ fimmu.2014.00022

57. Giudicelli V, Duroux P, Kossida S, Lefranc MP. IG and TR single chain fragment variable (scFv) sequence analysis: a new advanced functionality of IMGT/V-QUEST and IMGT/HighV-QUEST. BMC Immunol (2017) 18(1):35. doi:10.1186/s12865-017-0218-8

(16)

and T cell receptor nucleotide sequences. Nucleic Acids Res (2006) 34:D781–4. doi:10.1093/nar/gkj088

59. Sasso E, Paciello R, D’Auria F, Riccio G, Froechlich G, Cortese R, et  al. One- step recovery of scFv clones from high-throughput sequencing-based screen-ing of phage display libraries challenged to cells expressscreen-ing native claudin-1.

Biomed Res Int (2015) 2015:703213. doi:10.1155/2015/703213

60. Francica JR, Sheng Z, Zhang Z, Nishimura Y, Shingai M, Ramesh A, et al. Analysis of immunoglobulin transcripts and hypermutation following SHIVAD8 infection and protein-plus-adjuvant immunization. Nat Commun (2015) 6:6565. doi:10.1038/ncomms7565

61. Corcoran MM, Phad GE, Vázquez Bernat N, Stahl-Hennig C, Sumida N, Persson MA, et  al. Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity. Nat Commun (2016) 7:13642. doi:10.1038/ncomms13642

62. Dai K, He L, Khan SN, O’Dell S, McKee K, Tran K, et  al. Rhesus macaque B-cell responses to an HIV-1 trimer vaccine revealed by unbiased longi-tudinal repertoire analysis. MBio (2015) 6(6):e1375–1315. doi:10.1128/ mBio.01375-15

63. Xiao M, Prabakaran P, Chen W, Kessing B, Dimitrov DS. Deep sequencing and Circos analyses of antibody libraries reveal antigen-driven selection of Ig VH genes during HIV-1 infection. Exp Mol Pathol (2013) 95(3):357–63. doi:10.1016/j.yexmp.2013.10.004

64. Rhesus Macaque Genome Sequencing and Analysis Consortium, Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, et  al. Evolutionary and biomedical insights from the rhesus macaque genome. Science (2007) 316(5822):222–34. doi:10.1126/science.1139247

65. Sundling C, Li Y, Huynh N, Poulsen C, Wilson R, O’Dell S, et  al. High-resolution definition of vaccine-elicited B  cell responses against the HIV primary receptor binding site. Sci Transl Med (2012) 4:142. doi:10.1126/ scitranslmed.3003752

66. Meng W, Li L, Xiong W, Fan X, Deng H, Bett AJ, et al. Efficient generation of monoclonal antibodies from single rhesus macaque antibody secreting cells.

MAbs (2015) 7(4):707–18. doi:10.1080/19420862.2015.1051440

67. Margolin DH, Saunders EH, Bronfin B, de Rosa N, Axthelm MK, Goloubeva OG, et al. Germinal center function in the spleen during simian HIV infection in

rhesus monkeys. J Immunol (2006) 177(2):1108–19. doi:10.4049/jimmunol. 177.2.1108

68. Georgiou G, Ippolito GC, Beausang J, Busse CE, Wardemann H, Quake SR. The promise and challenge of high-throughput sequencing of the antibody repertoire. Nat Biotechnol (2014) 32(2):158–68. doi:10.1038/nbt.2782 69. DeKosky BJ, Kojima T, Rodin A, Charab W, Ippolito GC, Ellington AD,

et al. In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire. Nat Med (2015) 21(1):86–91. doi:10.1038/ nm.3743

70. DeKosky BJ, Ippolito GC, Deschner RP, Lavinder JJ, Wine Y, Rawlings BM, et  al. High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire. Nat Biotechnol (2013) 31(2):166–9. doi:10.1038/nbt.2492

71. DeKosky BJ, Lungu OI, Park D, Johnson EL, Charab W, Chrysostomou C, et al. Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires. Proc Natl Acad Sci U S A (2016) 113(19):E2636–45. doi:10.1073/pnas.1525510113

72. Mcdaniel JR, DeKosky BJ, Tanno H, Ellington AD, Georgiou G. Ultra-high-throughput sequencing of the immune receptor repertoire from millions of lymphocytes. Nat Protoc (2016) 11(3):429–42. doi:10.1038/nprot.2016.024 73. Mouquet H, Warncke M, Scheid JF, Seaman MS, Nussenzweig MC. Enhan-

ced HIV-1 neutralization by antibody heteroligation. Proc Natl Acad Sci U S A (2012) 109(3):875–80. doi:10.1073/pnas.1120059109

Conflict of Interest Statement: The other authors declare that the research was

conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

Documents relatifs

To address this visibility problem, we propose here to exploit optical speckle patterns naturally present in coherent illumination as a source of structured

Afin d’avoir une exposition plus large de la région cervicale, une section transversale des muscles sous- hyoïdiens peut parfois être nécessaire.. 4 Préparation de

study is to investigate the implications of the deep mixing for the nitrogen budget, in particular for the nitrate deficit in the water column, by a comparative analysis of the

At first, we were worried that there was something wrong with the cavity mirrors, but soon came to realize that using different cavity heaters that resulted in a change of the

Single chain variable fragment (scFv) specific to cassiicolin could bind and deactivate the toxin in genetically modified rubber trees that harbour the functional

• Le premier quartile Q1 d’une série statistique est la plus petite valeur des termes de la série pour laquelle au moins un quart des données sont inférieures ou égales à Q1. •

“Multiple testing procedures plots” (Figure 3) displays multiple testing procedures visualization plots as line graphs (left figures) and scatter plots (right figures), for genes

Histograms for the TRBV genes (a), TRBD genes (b) and TRBJ genes (c) display results from the ‘1 copy’ ‘single allele’ (for V and J) (58,958 sequences, average sequence length of