Oct 28, 2020

Public workspaceMassive Parallel Reporter Assay (MPRA)

This protocol is a draft, published without a DOI.
  • 1UCSF;
  • 2University of California, San Francisco
  • AhituvLab
Icon indicating open access to content
QR code linking to this content
Protocol CitationGracie Gordon, Nadav Ahituv 2020. Massive Parallel Reporter Assay (MPRA). protocols.io https://protocols.io/view/massive-parallel-reporter-assay-mpra-bmbgk2jw
Manuscript citation:
Gordon MG, Inoue F, Martin B, Schubach M, Agarwal V, Whalen S, Feng S, Zhao J, Ashuach T, Ziffra R, Kreimer A. lentiMPRA and MPRAflow for high-throughput functional characterization of gene regulatory elements. Nature Protocols. 2020 Aug;15(8):2387-412.
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and many adaptations of it regularly and it works for us
Created: September 12, 2020
Last Modified: October 28, 2020
Protocol Integer ID: 42056
Disclaimer
DISCLAIMER – FOR INFORMATIONAL PURPOSES ONLY; USE AT YOUR OWN RISK

The protocol content here is for informational purposes only and does not constitute legal, medical, clinical, or safety advice, or otherwise; content added to protocols.io is not peer reviewed and may not have undergone a formal approval of any kind. Information presented in this protocol should not substitute for independent professional judgment, advice, diagnosis, or treatment. Any action you take or refrain from taking using or relying upon the information presented here is strictly at your own risk. You agree that neither the Company nor any of the authors, contributors, administrators, or anyone else associated with protocols.io, can be held responsible for your use of the information contained in or linked to this protocol or any of our Sites/Apps and Services.
Abstract
Massively parallel reporter assays (MPRAs) can simultaneously measure the function of thousands of candidate regulatory sequences (CRSs) in a quantitative manner. In this method, CRSs are cloned upstream of a minimal promoter and reporter gene, alongside a unique barcode, and introduced into cells. If the CRS is a functional regulatory element, it will lead to the transcription of the barcode sequence, which is measured via RNA sequencing and normalized for cellular integration via DNA sequencing of the barcode. This technology has been used to test thousands of sequences and their variants for regulatory activity, to decipher the regulatory code and its evolution, and to develop genetic switches. Lentivirus-based MPRA (lentiMPRA) produces ‘in-genome’ readouts and enables the use of this technique in hard-to-transfect cells. Here, we provide a detailed protocol for lentiMPRA, along with a user-friendly Nextflow-based computational pipeline—MPRAflow—for quantifying CRS activity from different MPRA designs. The lentiMPRA protocol takes ~2 months, which includes sequencing turnaround time and data processing with MPRAflow.
Attachments
Materials
MATERIALS
ReagentNEB 10-beta Electrocompetent E.coli - 6x0.1 mlNew England BiolabsCatalog #C3020K
ReagentNEBNext High-Fidelity 2X PCR Master Mix - 250 rxnsNew England BiolabsCatalog #M0541L
ReagentI-SceI - 500 unitsNew England BiolabsCatalog #R0694S
ReagentAgeI-HF - 300 unitsNew England BiolabsCatalog #R3552S
ReagentNEBuilder HiFi DNA Assembly Master Mix - 250 rxnsNew England BiolabsCatalog #E2621X
ReagentQIAprep Spin Miniprep KitQiagenCatalog #27104
ReagentTURBO DNA-free™ KitThermo ScientificCatalog #AM1907
ReagentQiaquick gel extraction kitQiagenCatalog #28704
ReagentWizard(R) SV Genomic DNA Purification System, 250 prepsPromegaCatalog #A2361
ReagentDMEMThermo Fisher ScientificCatalog #41966
ReagentpMD2.GaddgeneCatalog #12259
ReagentpsPAX2addgeneCatalog #12260
ReagentSbfI-HFNew England BiolabsCatalog #RS3642S
ReagentSuperscript II Reverse TranscriptaseThermo Fisher ScientificCatalog #18064071
ReagentHEK293ATCCCatalog #CRL-1573
ReagentHighPrep™ PCRCatalog #AC-60050
Reagent2X SsoFast EvaGreen Supermix with Low ROXBIO-RADCatalog #1725211
ReagentTrypsin-EDTA (0.05%), phenol redThermo FisherCatalog #25300120
ReagentSYBR™ Safe DNA Gel StainThermo FisherCatalog #S33102
ReagentPenicillin-StreptomycinSigma AldrichCatalog #P4333
ReagentFBS
ReagentSurePrint 244K Oligonucleotide LibrariesAgilent TechnologiesCatalog #G7223A
ReagentQiagen Plasmid Plus Midi KitQiagenCatalog #12945
ReagentpLS-SV40-mP-EGFP addgeneCatalog #137724
ReagentpLS-SceI addgeneCatalog #137725
ReagentLenti-X ConcentratorTakaraCatalog #631232
ReagentAllPrep DNA/RNA Mini KitQiagenCatalog #80204
Library Amplification
Library Amplification
3h
3h
Dissolve the Agilent oligonucleotide (10 pmol) (Fig. 1b and Box 1) in 100 μL TE buffer to obtain a
100 nM solution.
Set up the first-round PCR reaction. This reaction adds a vector overhang sequence upstream and
minimal promoter and adaptor sequences downstream of the CRSs

ReagentVolume (μL)Final conc
Agilent oligonucleotide (100 nM)21nM
NEBNext High-Fidelity 2× PCR Master Mix1001x
5BC-AG-f01 (100 μM)10.5uM
5BC-AG-r01 (100 μM)10.5uM
Ultrapure distilled H2O96
Total volume200

Step case

Schematics of lentiMPRA
166 steps

b, CRS oligonucleotide. A 200-base CRS (gray) is flanked by PCR adaptor sequences (light green). c, First-round PCR. PCR primers add sequences that are complementary to the vector (black) to the upstream side, as well as minimal promoter (mP, blue) and spacer sequences (yellow) downstream of the CRS oligonucleotide.


Split the premixture into five PCR tubes (40 μL per tube)

CRITICAL STEP Splitting the PCR reaction into multiple tubes is important to reduce the risk of
PCR ‘jackpotting’ (errors that occur during the early PCR cycles and get amplified exponentially) or
amplification bias accidentally occurring during PCR
Run the PCR reaction as follows:

Cycle No.DenatureAnnealExtend
198oC, 2min
2-6 (5cycles)98oC, 15sec60oC, 20sec720C, 30sec
772oC, 5min

Combine the PCR products in a DNA LoBind tube.

Bring the HighPrep PCR reagent to RT for at least 30 min before use. Shake thoroughly to fully resuspend the magnetic beads. Add 1 volume (200 μL) of HighPrep PCR reagent and mix thoroughly by pipetting up and down 6–8 times
Incubate the mixture for 5 min at RT.
Place the tube on the magnet for 2–3 min until the solution clears and beads pull to the side of
the wall
Carefully remove the supernatant by pipetting without disturbing the beads. A small amount
(10–20 μL) of supernatant can be left in the tube
With the tube on the magnet, add 500 μL of 80% (vol/vol) ethanol and incubate for 30 s.
Remove the ethanol by pipetting while the tube is still on the magnet.
Repeat the 80% (vol/vol) ethanol washing (Steps 10 and 11)
Flash-spin (200–1,000g, 22–25 °C, 3 s) the tube, immediately place it back on the magnet and
remove the supernatant
Dry the bead pellet for 2–3 min. Do not overdry the beads.

Add 50 μL of Buffer EB to the beads and mix by pipetting and vortexing.

Place the tube on the magnet for 1–2 min until the solution is clear.

Transfer 45–50 μL of the eluate to a DNA LoBind tube.

Measure the DNA concentration using a NanoDrop spectrophotometer. The expected concentration
is 5–20 ng/μL.

Set up the second-round PCR reaction. This reaction adds a 15-bp barcode and vector overhang
sequence downstream of the first-round PCR fragment


ReagentVolume (μL)Final conc
First-round PCR productVariable (100ng)1nM
NEBNext High-Fidelity 2× PCR Master Mix2001x
5BC-AG-f01 (100 μM)20.5uM
5BC-AG-r01 (100 μM)20.5uM
Ultrapure distilled H2OUp to 400uL
Total volume400

PCR
Step case

Schematics of lentiMPRA
149 steps

d, Second-round PCR. Reverse primer adds the barcodes (red-striped section) and GFP complementary sequences (green).
e, Plasmid construct

Split the premixture into 10 PCR tubes (40 μL per tube).

Run the PCR reaction as follows:
Cycle No.DenatureAnnealExtend
198oC, 2min
2-13 (12 cycles)98oC, 15sec60oC, 20sec720C, 30sec
1472oC, 5min

Combine the PCR products into a DNA LoBind tube.

Add 200 μL of 6× gel loading dye (final conc. 2×) and mix the solution by vortexing.
Run the sample on two 1.5% (wt/vol) TAE–agarose gels (30 mL of 5 × 6-cm mini gels with
3-cm-width wells) and visualize the DNA using SYBR Safe DNA gel stain.

Cut the DNA band (419 bp) using a blue-light Safe Imager.

CRITICAL STEP We highly recommend using a blue-light Safe Imager, because the UV
transilluminator markedly decreases the recombination efficiency.

Purify the DNA from the gel slice, using QIAquick Gel Extraction Kit according to the
manufacturer’s protocol.

Elute the DNA in 50 μL of Buffer EB per column. If multiple columns are used, combine
the eluate.

Purify the DNA using 1.2 volumes of HighPrep PCR reagent and following Steps 6–17.

Measure the DNA concentration using a NanoDrop spectrophotometer. The expected concentration
is ~25 ng/μL.

PAUSE POINT Purified DNA can be stored at −20 °C for months.
Vector Linearization
Vector Linearization
Set up the vector digestion reaction as follows:
ReagentVolume (uL)Final Conc.
pLS-SceIVariable (10ug)
CutSmart (10x)201x
AgeI-HF (20U/uL)50.5U/uL
SbfI-HF (20U/uL)50.5U/uL
Ultrapure distilled H2Omake up to 200uL
Total volume200

Digestion
Incubate the reaction at 37 °C for 3 h to overnight.

To complete the plasmid digestion, add 5 μL of AgeI-HF (20 U/μL) and 5 μL of SbfI-HF (20 U/μL)
to the reaction.

33 Incubate the reaction at 37 °C for 3 h to overnight.

Vortex for 30 s and incubate at 80 °C for 20 min.

Purify the DNA using 0.65 volume (136.5 μL) of HighPrep PCR reagent and following Steps 6–17.

Measure the DNA concentration using a NanoDrop spectrophotometer.

To check the DNA size and quality, run 100–200 ng of the linearized vector and purified insert
DNA (from Step 29) on a 1% (wt/vol) gel along with a 1-kb DNA ladder. Make sure that specific
single bands (7.8-kb linearized vector and 419-bp insert DNA) appear, but that no other bands
appear on the gel.
Recombination and electroporation
Recombination and electroporation
3d
3d
Set up the recombination reaction as follows:


ReagentVolume (uL)Final Conc.
Linearized pLS-SceI (from step 36)Variable (1 μg)
Purified insert DNA (from step 29)Variable (250 ng)
NEBuilder HiFi DNA Assembly Master Mix1001X
Ultrapure distilled H2OMake up to 200 μL
Total volume200

PCR
Incubate the reaction at 37 °C for 1 h.


Place the tube on ice
Purify the DNA using 0.65 volume (136.5 μL) of HighPrep PCR reagent
Set up the digestion reaction to get rid of undigested vectors as follows:

ReagentVolume (uL)Final Conc.
Recombination product44
CutSmart buffer (10×)51X
I-SceI (20 U/μL)1 (20U)0.4 U/uL
Total volume50

Incubate the reaction at 37 °C for 1 h.
Purify the DNA using 1.8 volume (90 μL) of HighPrep PCR reagent and following Steps 6–14.
To elute the DNA, add 20 μL of Buffer EB to the beads and mix by pipetting and vortexing

Place the tube on the magnet for 1–2 min and transfer 18 μL of the eluate to a DNA LoBind tube.

Measure the DNA concentration using a NanoDrop spectrophotometer.
Make sure the DNA the concentration of the recombination product (the eluate) is >25 ng/μL, so as not to need to add
>4 μL to 100 μL of competent cells in Step 50.

PAUSE POINT The recombinant product can be stored at −20 °C for at least a month.

Pause
Prewarm 4–5 mL of Stable Outgrowth Medium (from the NEB 10-beta electrocompetent cells) at
37 °C for at least 30 min.

Thaw NEB 10-beta electrocompetent cells on ice. We usually use 100 μL (one tube) of the
competent cells for low-complexity libraries (0.5–2 million total barcodes) and 400 μL (four tubes)
for high-complexity libraries (8–12 million total barcodes).

CRITICAL STEP Competent cells and cuvettes should be kept on ice during the following procedure.

Critical
Add 100 ng of the recombination product per 100 μL competent cells. The volume of DNA should
be <4 μL per 100 μL competent cells.

Gently transfer 50 μL of the competent cells to a 1-mm-gap cuvette without creating bubbles. Two
cuvettes are prepared for 100 μL of competent cells.

Gently tap the cuvettes on the counter to move the cells to the bottom.

Place the cuvettes in a Gemini X2 electroporator and shock the cells with the following settings:
voltage, 2.0 kV; resistance, 200 ohms; capacitance, 25 μF, number of pulses, 1; gap width, 1 mm.

Immediately add 450 μL of prewarmed Stable Outgrowth Medium to the cuvettes, thoroughly mix
by pipetting up and down, and transfer to a 14-mL conical tube.

Repeat the electroporation for all cuvettes, combining the electroporated bacteria in a single tube
(total 1 mL culture per 100 μL competent cells).

Add fresh Stable Outgrowth Medium and scale up to 4 mL in total. If 400 μL competent cells were
used, there is no need to add more.

Dilute 2 μL of the bacteria in 400 μL of fresh LB medium in a 1.5-mL tube and plate the entire tube
of diluted bacteria (402 μL) in a prewarmed 15-cm LB agar plate along with 20 μL of 100 mg/mL
carbenicillin. This plate will be used for colony counting and plasmid mini prep.

Incubate the cells at 37 °C for 1 h with agitation (200 r.p.m.). Prewarm ten 15-cm LB agar plates at
37 °C.
CRITICAL STEP We recommend using 15-cm plates rather than larger plates because these
enable fine-tuning of the colony numbers collected.

Critical
Plate undiluted bacteria onto the other nine prewarmed 15-cm LB agar plates (400 μL/plate), along
with 100 μL/plate of 100 mg/mL carbenicillin. A higher amount of carbenicillin than usual is added
because the dense culture conditions increase the risk of non-transformed bacteria growth.

Incubate the plates at 37 °C overnight.

To check the plasmid sequence of individual colonies, pick 16 colonies from the diluted-bacteria
plate, purify the plasmids using a QIAprep Spin Miniprep Kit, and send them for Sanger
sequencing using n40.dn.F and EGFP.up.R primers (Supplementary Table 3). Confirm that the
sequence structure corresponds to the design (Fig. 1e, Extended Data Fig. 1c).


PAUSE POINT The plates can be stored at 4 °C for a month.
? TROUBLESHOOTING
Pause
Colony counting and plasmid library prep
Colony counting and plasmid library prep
Count the number of colonies on the diluted-bacteria plate. If there are too many colonies, count
colonies in a quarter of the area and multiply by four to estimate the total number of colonies on
the plate.
Estimate the total number of colonies per undiluted-bacteria plate by multiplying the colony count
in the diluted-bacteria plate by 200 (Supplementary Table 1).

Determine the number of undiluted-bacteria plates to be used for the following plasmid preps. The
total number of colonies needed can be determined by multiplying the number of designed CRSs by
the desired number of barcodes per CRS

CRITICAL STEP The ideal number of barcodes per CRS is between 50 and 200. Fewer barcodes
per CRS may reduce reproducibility. More barcodes per CRS requires more cells, more virus and
deeper sequencing reads, which increase costs. In addition, associating >200 barcodes per CRS does
not increase reproducibility.
Critical

Add 5–6 mL of LB medium to each bacterial plate and gently scrape the colonies, using a cell lifter
without disturbing solid agarose.

Collect the bacterial suspension and combine into a few 50-mL tubes.
Add 5–6 mL of fresh LB medium again to the plates and collect as much leftover bacteria as
possible into the tubes.

Purify the plasmids using a Qiagen Plasmid Plus Midi Kit, following the ‘standard protocol’
in the manufacturer’s protocol. The number of columns to be used depends on the amount of
bacteria. We usually use four columns of Qiagen Plasmid Plus Midi Kit per undilutedbacteria
plate.

Measure the plasmid concentration using a NanoDrop spectrophotometer. The expected
concentration is 0.5–2 μg/μL.

To check the DNA size and quality, run 100–200 ng of the plasmid on a 1% (wt/vol) agarose gel
along with a 1-kb DNA ladder.

PAUSE POINT The purified plasmid library can be stored at −20 °C for years.
Pause
Set up the PCR reaction. This reaction adds a P5 flowcell sequence and the sample index sequence
upstream and a P7 flowcell sequence downstream of the CRS–barcode fragment .

NOTICE: Use different sample index sequences for pLSmP-ass-i# if multiple libraries are
generated and multiplexed


ReagentVolume (uL)Final Conc.
Plasmid libraryVariable (40ng)
NEBNext High-Fidelity 2× PCR Master Mix1001x
pLSmP-ass-i# (100 μM)10.5uM
pLSmP-ass-gfp (100 μM)10.5uM
Ultrapure distilled H2OMake up to 200uL
Total volume200

Step case

Schematics of lentiMPRA
97 steps

e, Plasmid construct. f, Amplification for CRS–barcode association. Primers add P5 (purple) and sample index (gray-striped section) upstream and P7 (pink) downstream. g, Sequencing library structure. h, Sequencing reaction. Paired-end reads specify the CRS sequence, with index read 1 providing the barcode and index read 2 reading the sample index for multiplexing. i, Integrated DNA and expressed RNA in infected cells.

Split the premixture into five PCR tubes (40 μL per tube).
Run the PCR reaction as follows:

Cycle No.DenatureAnnealExtend
198oC, 1min
2-16 (15 cycles)98oC, 15sec60oC, 20sec720C, 3min
1772oC, 5min

CRITICAL STEP Incomplete DNA elongation may create chimeric DNA annealing in the next
cycle and can cause CRS–barcode swapping. A longer extension time (3 min) can help to reduce
this risk.
Critical

Combine the PCR products in a DNA LoBind tube.

Add 100 μL of 6× gel loading dye (final conc. 2×) and mix the solution by vortexing.

Run the sample on a 1.5% (wt/vol) agarose gel
(30 mL of 5 cm × 6-cm mini gel with 3-cm widthwell).

Cut the DNA band (470 bp) using a blue-light Safe Imager.
Purify the DNA from the gel slice using the QIAquick Gel Extraction Kit according to the
manufacturer’s protocol.


79 Elute the DNA in 50 μL Buffer EB per column. If multiple columns are used, combine the eluate.

Purify the DNA using 1.8 volumes of HighPrep PCR reagent, following Steps 6–17.

Measure the DNA concentration using Qubit dsDNA HS Assay Kit according to the manufacturer’s
protocol.

To check the DNA size and quality, run 50–100 ng of the DNA on a 1.5% (wt/vol) agarose gel along
with a 100-bp DNA ladder.

PAUSE POINT Purified DNA can be stored at −20 °C for months.

Pause
Send the purified DNA and custom primers for sequencing.

The sequencing should be done using paired-end reads covering the full CRS with some overlap (here 146 bp each), with 15 cycles for index read 1 and 10 cycles for index read 2. Index read 1 provides the barcode sequence, and index read 2 provides the sample index

Let the sequencing facility know that index read 2 should be used for demultiplexing and that short reads should not be masked, bcl2fastq parameters:
--minimum-trimmed-read-length 0 --mask-short-adapter-reads 0.

A minimum 10× coverage of sequencing reads based on the total number of barcodes is required. For example, we use an Illumina MiSeq v.2 run (15 million reads) for a 0.5-million-barcode library or an Illumina NextSeq mid-output run
(120 million reads) for an 8- to 12-million-barcode library


READCYCLESPRIMEROUTPUT
Read 1146pLSmP-ass-seq-R1CRS (upstream, forward)
Read 2146pLSmP-ass-seq-R2CRS (downstream, reverse)
Index read 115pLSmP-ass-seq-ind1Barcode (forward)
Index read 210pLSmP-rand-ind2Sample index

Step case

NGS primer schematics
85 steps


Lentivirus packaging
Lentivirus packaging
Culture 293T cells in DMEM (with 10% heat-inactivated FBS).
Seed 10–12 million 293T cells per T225 flask. The number of flasks depends on the library complexity and the infectability of the cells. For example, we use one flask for the 0.5-millionbarcode library or six T225 flasks for the 8- to 12-million-barcode library when carrying out lentiMPRA in HepG2 cells.
Incubate the cells for 2 d.
2d
Prepare premixtures A and B as follows:
● Premix A: 800 μL/flask OPTI-MEM and 60 μL/flask EndoFectin.
● Premix B: 800 μL/flask OPTI-MEM, 10 μg/flask plasmid library, 6.5 μg/flask psPAX2, and 3.5 μg/flask pMD2.G.
Add premix A to premix B by pipetting and mix the mixture by inverting the tube.
Incubate the tube at RT for 15 min.
Add 1.6 mL/flask of the mixture to 293T cells (from Step 86).
Incubate the cells for 8–14 h. ! CAUTION During the following procedure (Steps 92–103), the liquid and plastic waste should be discarded into 10% (vol/vol) bleach, because they are contaminated with lentivirus. Culture plates and tubes for storage should be clearly labeled as lentiviral contaminants.
Replace the media with 30 mL/flask DMEM (with 5% heat-inactivated FBS) supplemented with 1× ViralBoost reagent (60 μL reagent per 30 mL medium).
Incubate the cells for 36–48 h.
2d
Check GFP expression using a fluorescence microscope. The majority of cells are expected to express strong GFP, because viral RNA, including the GFP gene, is transcribed via the 5ʹ long terminal repeat (LTR).
Filter the supernatant through a 0.45-μm PES filter system. Use multiple filters (one filter for up to three T225 flasks), so as not to get clogged.
Transfer the flow-through into multiple 50-mL tubes (30 mL per tube).
Add 1/3 volume (10 mL per tube) of Lenti-X concentrator reagent, close the lid tightly and mix gently by inverting the tubes.
Seal the lid with Parafilm and place the tube in a refrigerator at 4 °C for at least 4 h.

PAUSE POINT The tubes can be stored at 4 °C up to 1 week.
Pause
Centrifuge the tubes at 1,500g for 45 min at 4 °C.
Discard the supernatant into 10% (vol/vol) bleach by gentle decanting.
Discard the remainder of the supernatant into 10% bleach by pipetting without disturbing the pellet.
Gently resuspend the pellet in cold DPBS. We usually use 600 μL DPBS per T225 flask.
Store the lentivirus at 4 °C. CRITICAL STEP We do not recommend freezing the virus, especially in the case of high complexity libraries, because freeze–thaw cycles substantially decrease the viral titer. We did not see amarked loss of the viral titer when stored at 4 °C for up to 3 weeks. The following infection experiments should be done within 3 weeks.

PAUSE POINT The virus can be stored at 4 °C up to 3 weeks
Critical
Pause
Lentivirus titration
Lentivirus titration
Infect the lentivirus library (0, 1, 2, 4, 8, 16, 32, 64 μL) into the cells to be used, extract genomic DNA from the cells, and perform qPCR as described in Box 2 (steps 1–14).
Plot the MOI for each condition and draw a linear approximation with the virus volume on the x axis and the MOI on the y axis (Supplementary Table 2).

Step case

Lenti titeration graph example
63 steps


On the basis of its slope and the number of cells seeded, calculate the virus titer (in transducing units per microliter), using Supplementary Table 1.


Lentivirus infection and DNA/RNA extraction
Lentivirus infection and DNA/RNA extraction
1w
1w
Determine the number of integrations per barcode (i.e., total number of a particular barcode
existing in an entire cell population)
Use the spreadsheet provided in Supplementary Table 1.

We recommend a range between 50 and 500 integrations per barcode. Fewer numbers increase the risk
of barcode loss during the downstream procedure. Higher numbers are better but increase the cost.
Computational step
Critical
Seed an appropriate number of cells in 10-cm or 15-cm dishes.

The number of cells required is determined as total barcode integrations (total number of any barcodes existing in the entire cell population) divided by the MOI of the cells (Supplementary Table 1).

Three independent biological replicates should be performed, and each replicate sample should be treated separately during the following procedures
Incubate the cells overnight.
Refresh the culture media (culture conditions depend on the cell type used) and add Polybrene at
the appropriate concentration (Box 2).

111 Add an appropriate amount of the lentivirus library. The volume of virus required is given as total
barcode integrations divided by virus titer (Supplementary Table 1).

Refresh the culture media (culture conditions depend on the cell type used) with no Polybrene the
following day.

After 2 d (3 d of culture in total), check GFP fluorescence to confirm proper lentiviral integration
and expression. GFP expression depends on library design. If the majority of the CRSs
in the library are active enhancers, the cells should have strong GFP expression.

Remove the culture media and wash the cells with DPBS three times. Remove the DPBS completely.

Add RLT Plus lysis buffer (from the AllPrep DNA/RNA Mini Kit) supplemented with 2-
mercaptoethanol (10 μL of 2-mercaptoethanol per 1 mL of RLT Plus).
We usually use 1,200 μL or 2,400 μL of lysis buffer per 10-cm dish or 15-cm dish, respectively.

Scrape the cells using a cell lifter and homogenize the cell lysis, using a 3-mL syringe and 20-gauge
needle.

PAUSE POINT The cell lysate can be frozen and stored at −80 °C for months.
Pause
Transfer the lysate to DNA columns. Use two or four columns per 10-cm dish or 15-cm dish,
respectively
Extract genomic DNA and total RNA simultaneously using the AllPrep DNA/RNA Mini Kit according
to the manufacturer’s protocol. For RNA samples, perform DNase treatment between two of the 350-μL
RW1 washes using Qiagen’s RNase-free DNase Set, according to the manufacturer’s protocol
Elute DNA in 30 μL/column of Buffer EB and combine the eluates of each replicate in a single tube.
Keep each of the three replicates separate.
Elute RNA in 30 μL/column of RNase-free H2O and combine the eluates of each replicate in a single
tube. Keep each of the three replicates separate
Measure the concentrations of the DNA and RNA samples using a NanoDrop spectrophotometer. At
least 12 μg DNA and 60 μg RNA per replicate are required.

PAUSE POINT DNA can be stored at −20 °C for years. RNA can be stored at −80 °C for months
Pause
Reverse transcription
Reverse transcription
4h
4h
Treat RNA samples with DNase, using the TURBO DNA-free Kit and following the manufacturer’s
protocol for ‘Rigorous DNase treatment’. Because the RNA sample has already been treated with
DNase during the AllPrep procedure (Step 93), TURBO DNase treatment can be done using a
high-concentration condition (without any dilution).
Measure the RNA concentration using the Qubit RNA HS Assay Kit. At least 60 μg or 240 μg total
RNA per replicate is required for a low- (0.5–2 million total barcodes) or high- (8–12 million total
barcodes) complexity library, respectively.

PAUSE POINT DNase-treated RNA can be stored at −80 °C for months
Pause
For RNA samples, perform a reverse-transcription reaction in 8-strip PCR tubes. This reaction adds
a 16-bp UMI and a P7 flowcell sequence downstream of the barcode.

For a low-complexity library (0.5–2 million total barcodes), use the amounts given in the table below. For
a high-complexity library (8–12 million total barcodes), we recommend multiplying all amounts in
the following table by 4


ReagentVolume (μL)Final conc.
RNAVariable (60 μg total RNA)
P7-pLSmP-ass16UMI-gfp (100 μM)0.250.25 μM
dNTP mix (10 mM, from SuperScript II Reverse Transcriptase)50.5 mM
UltraPure distilled H2OMake up to 65 μL
Total volume65uL

Step case

Schematics of lentiMPRA
44 steps

i, Integrated DNA and expressed RNA in infected cells. j, Amplification for barcode counting. Primers add P5 and sample index upstream and P7 and UMI, brown stripe) downstream

Incubate the reaction at 65 °C for 5 min using a thermal cycler.

Place the tubes on ice.

Add 20 μL 5× First Strand buffer (from SuperScript II Reverse Transcriptase) and 10 μL 0.1 M DTT
(from SuperScript II Reverse Transcriptase).

Incubate the tubes at 42 °C for 2 min using a thermal cycler.

Add 5 μL of Superscript II.

130 Incubate the tubes at 42 °C for 50 min, followed by incubation at 70 °C for 15 min using a thermal cycler.

PAUSE POINT cDNA can be stored at −20 °C for months.
Pause
Library prep and sequencing for RNA and DNA barcode counts
Library prep and sequencing for RNA and DNA barcode counts
Dilute DNA samples (from Step 121) to a final concentration of 120 ng/μL. For RNA samples
(from Step 96; we refer to RT products as RNA samples in the downstream steps to distinguish
them from samples derived from genomic DNA), use all 100 μL of RT products for the following
first-round PCR reaction.
Perform a first-round PCR reaction with all three replicates of both DNA and RNA samples.

This reaction adds the P5 flowcell sequence and sample index sequence upstream and a 16-bp UMI and P7
flowcell sequence downstream of the barcode .

Use different sample index sequences for each sample
For a low-complexity library (0.5–2 million total barcodes), use the amounts given in the table below. For a high-complexity library (8–12 million total barcodes), we recommend multiplying all amounts in the following table by 4.


ReagentVolume (μL)Final conc.
DNA or cDNA100 (12 μg DNA or entire RT product)
NEBNext High-Fidelity 2× PCR Master Mix2001x
P7-pLSmP-ass16UMI-gfp (100 μM)20.5 μM
P5-pLSmP-5bc-i# (100 μM)20.5 μM
UltraPure distilled H2O96
Total volume400
for Low-Comlexity

Split the premixture into 8 PCR tubes (50 μL per tube).

ReagentVolume (μL)Final conc.
DNA or cDNA400 (48 μg DNA or entire RT product)
NEBNext High-Fidelity 2× PCR Master Mix8001x
P7-pLSmP-ass16UMI-gfp (100 μM)80.5 μM
P5-pLSmP-5bc-i# (100 μM)80.5 μM
UltraPure distilled H2O384
Total volume1600
for High-Complexity

Split the premixture into 32 PCR tubes (50 μL per tube). 4x 8-strips
PCR
Step case

Schematics of lentiMPRA
36 steps

i, Integrated DNA and expressed RNA in infected cells. j, Amplification for barcode counting. Primers add P5 and sample index upstream and P7 and UMI, brown stripe) downstream. k, Sequencing library structure.

Run the PCR reaction as follows.

CycleDenatureAnnealExtend
198oC, 1min
2-4 (3 cycles)98oC, 10sec60oC, 30sec72oC, 1min
572oC, 5min



PCR
After the PCR reaction, combine all eight tubes for each sample in a DNA LoBind tube.

Purify the DNA, using 1.8 volumes (700 μL) of HighPrep PCR reagent and following Steps 6–14.

Add 60 μL of Buffer EB to the beads and elute the DNA by pipetting and vortexing.

Place the tube on the magnet for 1–2 min and transfer 55–58 μL of the eluate to a LoBind tube.
Store the tubes on ice.
Set up the preliminary PCR reaction for each sample in a 96-well qPCR plate. This run finds the
number of PCR cycles required for the following second-round PCR reaction.


ReagentVolume (μL)Final conc.
First-round PCR product (from step 112)5
NEBNext High-Fidelity 2× PCR Master Mix101x
P7 (100uM)0.10.5 μM
P5 (100uM)0.10.5 μM
SYBR green (100X)0.1
UltraPure distilled H2O4.7
Total volume20

Run the qPCR reaction using a qPCR instrument as follows


CycleDenatureAnnealExtend
198oC, 1min
2-31 (30 cycles)98oC, 10sec60oC, 30sec72oC, 1min

PCR
On the basis of the raw amplification curve of each sample, determine the number of cycles at
which the amplification nearly plateaus for each sample.
Set up the second-round PCR reaction for each sample as follows.

Because it is expected that th numbers of cycles required for DNA and RNA samples will be different, we recommend running the PCRs for them separately. For a low-complexity library (0.5–2 million total barcodes), use the
amounts given in the table below. For a high-complexity library (8–12 million total barcodes), we
recommend multiplying all amounts in the following table by 4.



Set up the second-round PCR reaction for each sample as follows. Because it is expected that the
numbers of cycles required for DNA and RNA samples will be different, we recommend running
the PCRs for them separately. For a low-complexity library (0.5–2 million total barcodes), use the
amounts given in the table below. For a high-complexity library (8–12 million total barcodes), we
recommend multiplying all amounts in the following table by 4.

ReagentVolume (μL)Final conc.
DNA or cDNA50
NEBNext High-Fidelity 2× PCR Master Mix1001x
P7 (100 μM)10.5 μM
P5 (100 μM)10.5 μM
UltraPure distilled H2O48
Total volume200
for Low-Comlexity

Split the premixture into 5 PCR tubes (40 μL per tube).

ReagentVolume (μL)Final conc.
DNA or cDNA50
NEBNext High-Fidelity 2× PCR Master Mix4001x
P7 (100 μM)40.5 μM
P5 (100 μM)40.5 μM
UltraPure distilled H2O192
Total volume800
for High-Complexity

Split the premixture into 20 PCR tubes (40 μL per tube). 4x 8-strips
PCR
Run the qPCR reaction using a qPCR instrument as follows

According to cycles numbers established in step 113-114
CycleDenatureAnnealExtend
198oC, 1min
2-31 (X cycles)98oC, 10sec60oC, 30sec72oC, 1min

Combine each sample into a DNA LoBind tube.

Purify the DNA using 1.8 volume (360 μL) of HighPrep PCR reagent following Steps 6–14.

Elute the DNA in 20 μL EB.

Measure the DNA concentration using a NanoDrop spectrophotometer.

Pool three replicates of DNA samples or RNA samples using an equal amount (1 μg) from each
replicate, using Supplementary Table 4. Keep DNA and RNA samples separated until these are
pooled at the later step (Step 160).

PAUSE POINT DNA can be stored at −20 °C for months.
Pause
Add equal volume of 6x gel loading dye (final conc. 3×) and mix the solution by vortexing.

Run the pooled sample on a 1.8% (wt/vol) agarose gel (30 mL of 5 cm ×6 cm mini gels with 1.3 cm-width well
Cut the DNA bands (162 bp) using a blue-light Safe Imager.

Purify the DNA from the gel slices using QIAquick Gel Extraction Kit according to the
manufacturer’s protocol.
Use one column for each of DNA or RNA samples.

Elute the DNA in 50 μL Buffer EB per column.

Purify the DNA using 1.8 volume (90 μL) of HighPrep PCR reagent

Bring the HighPrep PCR reagent to RT for at least 30 min before use. Shake thoroughly to fully
resuspend the magnetic beads. Add 1 volume (200 μL) of HighPrep PCR reagent and mix
thoroughly by pipetting up and down 6–8 times
Incubate the mixture for 5 min at RT
Place the tube on the magnet for 2–3 min until the solution clears and beads pull to the side of
the wall
Carefully remove the supernatant by pipetting without disturbing the beads. A small amount
(10–20 μL) of supernatant can be left in the tube.
With the tube on the magnet, add 500 μL of 80% (vol/vol) ethanol and incubate for 30 s.
Remove the ethanol by pipetting while the tube is still on the magnet.
Repeat the 80% (vol/vol) ethanol washing
Flash-spin (200–1,000g, 22–25 °C, 3 s) the tube, immediately place it back on the magnet and
remove the supernatant
Dry the bead pellet for 2–3 min. Do not overdry the beads
Add 20 μL of Buffer EB to the beads and elute the DNA by pipetting and vortexing.
.
Place the tube on the magnet for 1-2 min and transfer 18 μL of the eluate to a LoBind tube.

Measure the DNA concentration using a Qubit dsDNA HS Assay Kit according to manufacturer’s
protocol.

To check the DNA size and quality, run 20-30 ng of the DNA on a 1.8% (wt/vol) gel along with
100-bp DNA ladder.

Pool the DNA and RNA samples in a single LoBind tube with 1:3 ratio to obtain 100 μL mixture at
the final concentration of 10 nM (1 ng/μL), using Supplementary Table 4.
PAUSE POINT The pooled DNA can be stored at −20 °C for months.

Pause
Send the sequencing library and custom primers for sequencing

The sequencing should be done with paired-end 15 bp (barcode length), 16 cycles for index read 1 and 10 cycles for index read 2. Index read 1 provides the UMI sequence, and index read 2 provides the sample index.
let the sequencing facility know that the index read 2 should be used for demultiplexing and that short reads should not be masked, bcl2fastq parameters:

--minimum-trimmed-read- length 0 --mask-short-adapter-reads 0

An average of 10×(DNA) and 30×(RNA) coverage of the library (based on number of barcodes) via
sequencing reads is required. For example, we use an Illumina NextSeq high-output run (400M
reads over 3 replicates) for the 0.5M barcode library or three runs (1.2B reads over three replicates)
for the 8–12 million barcode library (Supplementary Table 1). Sequencing cycles, primers, and
expected output for each read is shown below

READCYCLESPRIMEROUTPUT
Read 115pLSmP-ass-seq-ind1Barcode (forward)
Read 215pLSmP-bc-seqBarcode (reverse)
Index read 116pLSmP-UMI-seqUMI
Index read 210pLSmP-5bc-seq-R2Sample index

Computational step