Oct 24, 2023

Public workspaceDesigning an EpiTYPER bisulfite sequencing assay for age estimation in Acinonyx jubatus based on human orthologues

  • 1University of the Free State;
  • 2South African National Biodiversity Institute;
  • 3Teesside University
Open access
Protocol CitationLouis-Stéphane Le Clercq, Desire Dalton, Antoinette Kotze, Paul Grobler 2023. Designing an EpiTYPER bisulfite sequencing assay for age estimation in Acinonyx jubatus based on human orthologues. protocols.io https://dx.doi.org/10.17504/protocols.io.j8nlk4yk1g5r/v1
Manuscript citation:
Le Clercq, L.S., 2023. Biological clock measures: Assessing the association between the circadian and epigenetic clock as predictors of migration phenology and biological aging in wildlife (Doctoral thesis, University of the Free State).
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and it's working
Created: June 15, 2021
Last Modified: October 24, 2023
Protocol Integer ID: 50801
Keywords: Methylation, Epigenetics, MALDI-TOF, Age determination, EpiTYPER, Human, Cheetah
Funders Acknowledgement:
National Research Foundation (RSA)
Grant ID: 112062
Abstract
Age is key factor in animal ecology as it can be used to assign animals to important age classes, ranging from immature young to reproductive adults and eventual old age and fragility. Different groups contribute to different aspects that need to be considered when modeling current and future population dynamics as part of continued conservation efforts. Due to the need of an accurate molecular method for assigning age, several studies have explore various aspects of epigenetic clocks. Epigenetics is a collective term for mechanisms that modify DNA and DNA packaging, independent of genetic sequence. One widely studied epigenetic feature is DNA methylation; a process that adds a methyl group to the 5’ cytosine of Cytosine-Guanine pairs (CpG’s). Studies have revealed that within genes, nearly a third of all CpG sites are influenced by age. Given its consistency, the epigenetic clock is a promising avenue of chronological age prediction which has been illustrated in many human studies. This protocol illustrates how CpG's with known age-correlations from human studies can be used to (1) identify orthologous regions in other species and (2) design primers to assay differential methylation using EpiTYPER mass array technology.
Guidelines
None
Materials
  • UCSC Genome Browser
  • NCBI BLAST
  • EpiDesigner website
  • R
  • RSeqMeth
Safety warnings
Attention
None
Ethics statement
Protocol approval for the present study was obtained from the protocol committee of the Department of Genetics, University of the Free State (approval number: Res18/2020). Ethics approvals were obtained from the University of the Free State (approval number: UFS-AED2020/0015/1709) as well as the South African National Biodiversity Institute (approval number: SANBI/RES/P2020/30). Appropriate research permits were also obtained from South African regulatory authorities including the Department of Agriculture, Land Reform, and Rural Development (Section 20 permit: 12/11/1/1/18(1824JD)) and the Department of Environmental Affairs (Threatened Or Protected Species (TOPS) permit: O-52903).
Before start
You need to know the CG values for the CpG's you would like to design the assay.


GeneCG valueReference
ASPAcg02228185Vidal-Bralo et al. (2017)
EDARADDcg09809672Bocklandt et al. (2011)
ELOVL2cg21572722Bekaert et al. (2015)
FHL2cg22454769Giuliani et al. (2015)
FUT3cg17471102Vidal-Bralo et al. (2017)
ITGA2Bcg25809905Weidner et al. (2014)
GRIA2cg25148589Polanovski et al. (2014)
PDE4Ccg17861230Weidner et al. (2014)
PENKcg16219603Giuliani et al. (2015)
TET2cg08924430Polanovski et al. (2014)
Table of genes and cg names used to design assays.

Retrieving CpG from human sequencing
Retrieving CpG from human sequencing
Most research papers on humans give a "CG" value which corresponds to a CpG site in the human genome based on Illumina sequencing. The following steps were used to retrieve the human sequence for reported CG values using the University of California Santa Cruz (UCSC) Genome Browser (https://genome.ucsc.edu/).
On the landing page for the website, select "Genome Browser".
Genome Browser is listed under "Tools" on the landing page.
From the drop down menu, select the "Feb. 2009 (GRCh37/hg19)" genome build, as this is the version with the mapped CG values.
In the "position/search term" box, enter the desired CG value to look up e.g., cg00123456 and click "Go" to perform search.
Example of search setup with the correct assembly and desired cg search term indicated.
The top of the results page will show the specific position e.g., Chr 19; 18.343,902.
The left panel of the scaffold should indicate a track for the CG value that was searched. Click on the panel (orange) for more details.

Expected result
Genome view of the CpG mapped on the human genome. The bottom of the genome view shows the specific cg (orange).


Click on "View DNA for this feature" and then select "Get DNA".

Expected result
New view that loads after clicking on the cg panel.


Specify to add 300-400 base pairs upstream and downstream of the CpG to ensure that the target CpG is in the middle and you have enough sequence to design primers.

Expected result
Menu that appears to select DNA sequence for export.


Save the sequence in the FASTA format.
Finding the animal orthologues
Finding the animal orthologues
The next steps are to find the orthologous gene sequence for the target species (e.g., Acinonyx jubatus) using NCBI Blast (https://blast.ncbi.nlm.nih.gov/Blast.cgi).
Select "Nucleotide Blast".


Copy and Paste the FASTA sequence into the sequence box or select the FASTA file to upload.


For "Database" select the "RefSeq Genome Database" and for "Organism" specify the target species e.g., Acinonyx jubatus (taxid:32536).


Download the FASTA for the complete aligned sequence.

Expected result



Note
Because the target CpG is located at the 300th base pair, be sure the resulting match covers that region.


Note
Tip: If the BLAST result does not cover the full length of the sequence used in the search (600 bp), you can view the result in the assembly viewer and use the sliders to select a region of around 600 bp that includes the BLAST match.
Once you selected a specific BLAST match to view, the result page will have the option to see the match in an aligned genomic context on the right-hand side of the page.



Designing Primers for the EpiTYPER assay
Designing Primers for the EpiTYPER assay
The next steps are used to design the EpiTYPER primers on their EpiDesigner website (https://www.epidesigner.com/).
On the landing page, click "Start" to begin a new experiment design.
Once the input page has loaded, input the target sequence in FASTA format by either copying and pasting the sequence into the box or selecting the file.

Note
Tip: EpiDesigner seems to work better if you attach a file rather than pasting the sequence into the box.

Input the desired primer parameters or alternatively use the recommended base settings as is.
Leave the Target, Excluded, and Transcription Region (Advanced) setting empty to design primers across the full sequence. Click on "Begin" to design the primers.

Example of input screen for EpiDesigner. The sequence file is chosen and primer design was done by selecting both the forward and reverse strand. The gene name was used as a "Note" to keep track of results for different genes.
The results will appear starting with an interactive diagram of the sequence, the detected CpG's, and the mapped product for each primer pair.

Expected result
Example of diagram generated for the target sequence indicating the 14 possible amplicons for the region based on different primer pairs.

Below the image, the primers and their details are listed for each product shown on the diagram.

Expected result
List of primers for each amplicon with positions, size, Tm, and sequences. Below each primer pair the product size and number of CpG's covered are indicated.

Potential primers that appear to be optimal can be selected by the left-hand tick boxes and exported in several format.


Testing CpG coverage for selected Primers
Testing CpG coverage for selected Primers
The final steps are performed in R (4.0.6) using RSeqMeth (https://github.com/cran/RSeqMeth) to determine the fragmentation patterns and elucidate which CpG's can be assayed.

Note
The desired amplicon from each gene, indicated in a column from the exported primer design results, needs to be saved as a plain text file e.g., "ASPA.txt"

Once you have downloaded RSeqMeth, open R and execute the analyses with the following code:
>source("Path to RSeqMeth\\R\\ampliconReport.R")

This is used to load the function is used to analyze the desired amplicon.

Then execute:

>ampliconReport("Path to amplicon text file.txt")

Six files are written to the same directory as the text file once completed, three for the analysis of "T spectra" and three for the analysis of "C spectra".

Expected result
Example of code run and files written for the analysis of fragments.


Note
Depending on which sequencing kit you will be using you only need to look at either the results for the "T spectra" OR the results for the "C spectra".

The following results can then be viewed and assessed.

Expected result
Cleavage reaction predicted fragmentation showing clustering of CpG's per fragment. Clusters indicated in red are covered and can be tested while those in grey cannot be assayed.
These CpG's and their mass are listed in the table (CSV) output and indicates reasons why selected sites cannot be assayed such as overlapping fragment sizes, size duplication, or low mass.


Protocol references
Bekaert, B., Kamalandua, A., Zapico, S.C., Van de Voorde, W. and Decorte, R., 2015. Improved age determination of blood and teeth samples using a selected set of DNA methylation markers. Epigenetics10(10), pp.922-930.

Bocklandt, S., Lin, W., Sehl, M.E., Sánchez, F.J., Sinsheimer, J.S., Horvath, S. and Vilain, E., 2011. Epigenetic predictor of age. PloS one6(6), p.e14821.

Giuliani, C., Cilli, E., Bacalini, M.G., Pirazzini, C., Sazzini, M., Gruppioni, G., Franceschi, C., Garagnani, P. and Luiselli, D., 2016. Inferring chronological age from DNA methylation patterns of human teeth. American Journal of Physical Anthropology159(4), pp.585-595.

Polanowski, A.M., Robbins, J., Chandler, D. and Jarman, S.N., 2014. Epigenetic estimation of age in humpback whales. Molecular ecology resources14(5), pp.976-987.

Vidal-Bralo, L., Lopez-Golan, Y. and Gonzalez, A., 2016. Simplified assay for epigenetic age estimation in whole blood of adults. Frontiers in genetics7, p.126.

Weidner, C.I., Lin, Q., Koch, C.M., Eisele, L., Beier, F., Ziegler, P., Bauerschlag, D.O., Jöckel, K.H., Erbel, R., Mühleisen, T.W. and Zenke, M., 2014. Aging of blood can be tracked by DNA methylation changes at just three CpG sites. Genome biology15, pp.1-12.