Protocol Citation: Christian Macdonald, David Nedrud, Patrick Rockefeller Grimes, Donovan Trinidad, Catherine Shin, James Fraser, Willow Coyote-Maestas 2024. DIMPLE library generation and assembly protocol . protocols.io https://dx.doi.org/10.17504/protocols.io.rm7vzy7k8lx1/v5Version created by Christian Macdonald
Manuscript citation:
Macdonald, C.B., Nedrud, D., Grimes, P.R. et al. DIMPLE: deep insertion, deletion, and missense mutation libraries for exploring protein variation in evolution, disease, and biology. Genome Biol24, 36 (2023). https://doi.org/10.1186/s13059-023-02880-6
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and it's working
Created: November 14, 2023
Last Modified: December 20, 2024
Protocol Integer ID: 116449
Disclaimer
DISCLAIMER – FOR INFORMATIONAL PURPOSES ONLY; USE AT YOUR OWN RISK
The protocol content here is for informational purposes only and does not constitute legal, medical, clinical, or safety advice, or otherwise; content added to protocols.io is not peer reviewed and may not have undergone a formal approval of any kind. Information presented in this protocol should not substitute for independent professional judgment, advice, diagnosis, or treatment. Any action you take or refrain from taking using or relying upon the information presented here is strictly at your own risk. You agree that neither the Company nor any of the authors, contributors, administrators, or anyone else associated with protocols.io, can be held responsible for your use of the information contained in or linked to this protocol or any of our Sites/Apps and Services.
Abstract
This is a protocol for generating and QCing mutagenic libraries using the DIMPLE protocol. This version is updated to include expanded descriptions of QC and to clarify certain steps.
Materials
Enzymes and cells
PrimeSTAR GXL DNA PolymeraseContributed by usersCatalog #R050A
NEB Golden Gate Assembly MixNew England BiolabsCatalog #
E1601S (BsaI)
Kits (NEB kits given only as examples - any comparable kit should work)
Monarch PCR and DNA Cleanup Kit - 50 prepsNew England BiolabsCatalog #T1030S
Monarch® DNA Gel Extraction KitNew England BiolabsCatalog #T1020
Monarch Plasmid Miniprep Kit - 250 prepsNew England BiolabsCatalog #T1010L
Media and chemicals
SOC Outgrowth Medium - 100 mlNew England BiolabsCatalog #B9020S
PCR Water (nuclease free)Contributed by usersCatalog #PCPW
Equipment and consumables
Thermocycler
Electroporator
Shaker
Horizontal electrophoresis system
Benchtop centrifuge
OD meter
Electroporation cuvettes (0.1 cm)
Cuvettes for OD measurement
Selection agar plates
Agarose
Optional: large BioAssay plates for plasmid purification
Preparation
Preparation
Use DIMPLE to generate mutagenic oligos and primers.
Snapshot of the default DIMPLE GUI
Important notes: DIMPLE breaks a gene up into sub-library fragments and generates mutagenic insert oligo pools, where each oligo contains barcodes, Type IIS restriction cutsites, and a sub-region of the gene. Be sure to review your library generation vector and gene sequences and look for pre-existing Type IIS restriction sites. Use site-directed mutagenesis to remove unwanted off-target sites. Also note that, by default, DIMPLE expects restriction enzymes with 6 bp recognition sites and 4 bp overhang lengths (as with BsaI and BsmBI).
Input your wildtype gene sequence.
Notes:
DIMPLE takes a reference sequence and provides a set of primers and oligos to mutagenize it according to the user's specifications. Specifying the mutagenic region properly is essential.
The reference gene sequence should contain the gene in the vector you are planning to construct the library in to allow primers for the fragments at either end to be properly created.
Example:
Snapgene view of target vector with region to be mutagenized (OCT1, red feature) selected. The sequence is displayed in a vector that will be used to assemble the library. Note that the displayed selection boundaries are 27 and 1688.
Input wildtype gene sequence: Provide a fasta file of your gene sequence, including backbone as above. If multiple genes will be mutagenized in the same oligo pool, combine them all into one fasta file.
Optionally (but recommended), you can define the starting and ending nucleotide positions of your ORFs in each fasta sequence. Note that this uses 1 indexing: for example, in the above sequence, we would provide the following fasta file with identical start and ending positions:
*If you do not provide start & end position, the software will analyze different ORFs and ask you to select which one contains your gene of interest.
It is important to include the plasmid backbone so that the software can avoid making primers that nonspecifically recognize a region outside of your gene.
Using the DIMPLE GUI
Working Directory: When you open the DIMPLE GUI, first designate your working directory. This is the folder you wish your mutagenic oligo & primer outputs to be saved in.
Target Gene File: Upload your gene fasta text file with the Target Gene File button.
Oligo length: Designate the length of the oligos for the final pool. This will include the barcodes, Type IIS restriction cut site, and a region of your target gene that will be mutagenized. So, roughly, a given oligo length will allow fragments containing (oligo length - 50) bp of the gene.
Fragment length: You can adjust the length of your fragments, we recommend leaving the fragment length set to auto.
DIMPLE will automatically break up your gene into roughly the same fragment sizes, and will determine which lengths work best to avoid matching overhangs.
Fragment overlap:The number of base pairs shared between sub-regions. We have seen that setting overlap to 0 base pairs leads to errors. The overlap is set to 4 base pairs by default.
Barcode start position: The software selects from a set pool of barcodes when designing oligos. You can define which number barcode in the list this starts from. Unless you have a very good reason to, this can safely be left at 0.
Type IIS restriction sequence: You can select which Type IIS restriction cutsite you'd like to append to your inserts and backbone. You can choose between the BsmBI (CGTCTC) or BsaI (GGTCTC) sequences.
Sequences to avoid: This will allow the code to design fragments without the cutsite you selected in the Type IIS restriction sequence section. If there is an error when running, this implies you have a cutsite in your gene or vector that needs to be removed.
Codon usage: Your oligos can be codon optimized to your organism of choice. The default options are E. coli or Human, but you can also upload a codon usage table for any organism by pressing "Custom codon usage."
Select mutations:
Deletions: If you'd like to generate a library with deletions across your gene, select "List of deletions". Enter how many base pairs you'd like to be deleted across your library.
For example, if you'd like deletions that are one codon long, enter "3" for 3 nucleotides.
For example, If you'd like to include both deletions that are one codon long AND two codons long in the same pool, enter "3,6" for both 3, and 6 nucleotide long deletions.
Insertions: If you'd like to generate a library with different sequences inserted across your gene, select "List of Insertions" and provide the nucleotides you wish to be inserted (NOT amino acids!).
For example, If you'd like to insert single glycine residues between every codon of your gene, include "GGG" (or "GGA", etc) in the box.
For example, If you'd like to insert glycine-serine pairs between every codon of your gene, include "GGGTGC" in the box.
For example, if you'd like to insert glycine AND glycine-serine residues between every codon of your gene, include "GGG,GGGTGC" in the box.
Substitutions: If you'd like to generate a library where an amino acid at each position is swapped with every other possible amino acid, select "Include Substitutions."
Run DIMPLE: Hit "Run DIMPLE" when you are ready to generate your pool.
Examples of what the running code and outputs look like:
The code will first iterate the ideal fragment length sizes across each gene of interest.
Next, barcodes are assigned to each fragment. The code will update how many barcodes it attempted to use to generate each fragment pool, and will remove these from the remaining barcode pool.
This will continue for each gene included in your fasta file.
There is a final QC check for each primer set generated.
All outputs are saved in your working directory.
If you are using DIMPLE to mutagenize several genes, there will be separate files for the primers and mutagenic inserts for each gene, as well as a master list of all oligos and primers.
For each gene, DIMPLE generates a list of:
EXAMPLE_DMS_Oligos. This is the list of sub-library oligo pools, where each oligo contains barcodes, Type IIS restriction cutsites, and a sub-region of the gene.
Screenshot of mutagenic inserts generated by DIMPLE
EXAMPLE_DMS_Gene_Primers. These are primers used to add cutsites to and amplify the backbone each mutagenic insert will be ligated into.
Screenshot of backbone primers generated by DIMPLE. Each primer name also lists its melting temperature.
Each primer pair linearizes the backbone
EXAMPLE_DMS_Oligo_Primers. These are primers used to amplify the mutagenic inserts from your oligo pool.
EXAMPLE_mutations. This is the list of mutations that will be present in your final library.
EXAMPLE_designed_variants. This is a more detailed list of the variants expected to be present in your final library. This file can be directly used to analyze a DMS experiment using Dumpling, a pipeline we created to perform QC and generate scores with DIMPLE libraries.
Library assembly
Library assembly
3d
3d
Prepare oligo pool stock
Follow any recommendations for oligo pool resuspension: typically, this results in a 10 ng/µL solution, which the following steps assume. Lower or higher concentrations may require alterations.
PCR amplification of oligos, backbone
Thaw all components beforehand and follow general directions provided by manufacturer.
Amplification of mutagenic inserts and backbones should ideally be performed in parallel, assuming one has two thermocyclers. Alternatively, the backbone amplification can be performed first, then the the insert amplification can be done during the gel purification of the backbone.
PCR amplification of backbone.
Prepare a master mix with PrimeSTAR GXL polymerase: