Association of a structural variant within the SQSTM1 gene with amyotrophic lateral sclerosis
Citation Manager Formats
Make Comment
See Comments

Abstract
Objective As structural variations may underpin susceptibility to complex neurodegenerative diseases such as amyotrophic lateral sclerosis (ALS), the objective of this study was to investigate a structural variant (SV) within sequestosome 1 (SQSTM1).
Methods A candidate insertion/deletion variant within intron 5 of the SQSTM1 gene was identified using a previously established SV evaluation algorithm and chosen according to its subsequent theoretical effect on gene expression. The variant was systematically assessed through PCR, polyacrylamide gel fractionation, Sanger sequencing, and reverse transcriptase PCR.
Results A reliable and robust assay confirmed the polymorphic nature of this variant and that the variant may influence SQSTM1 transcript levels. In a North American cohort of patients with familial ALS (fALS) and sporadic ALS (sALS) (n = 403) and age-matched healthy controls (n = 562), we subsequently showed that the SQSTM1 variant is associated with fALS (p = 0.0036), particularly in familial superoxide dismutase 1 mutation positive patients (p = 0.0005), but not with patients with sALS (p = 0.97).
Conclusions This disease association highlights the importance and implications of further investigation into SVs that may provide new targets for cohort stratification and therapeutic development.
Glossary
- ALS=
- amyotrophic lateral sclerosis;
- CI=
- confidence interval;
- fALS=
- familial ALS;
- GWAS=
- genome-wide association study;
- I/D=
- insertion/deletion;
- mRNA=
- messenger RNA;
- ONS=
- olfactory neurosphere derived;
- OR=
- odds ratio;
- sALS=
- sporadic ALS;
- SOD1=
- superoxide dismutase 1;
- SQSTM1=
- sequestosome 1;
- RT-PCR=
- reverse transcriptase PCR;
- SV=
- structural variant
A growing body of literature indicates that structural variants (SVs) are important in complex diseases because their role in gene expression and messenger RNA (mRNA) regulation is now emerging.1,–,4 Given the potential implication of SVs in neurodegenerative disorders,5,–,7 an investigation of SVs within associated genes may provide insight toward the pathogenic mechanisms involved and enable the identification of new therapeutic targets.
Mutations in sequestosome 1 (SQSTM1), and aggregation of the SQSTM1-encoded p62 protein, have been identified in patients with amyotrophic lateral sclerosis (ALS) and other neurodegenerative diseases.8,–,13 P62 is a multifunctional protein that binds ubiquitin and is involved in autophagy, proteasomal degradation of ubiquitinated proteins, mitophagy, and cellular signaling.14,–,16 Variants within SQSTM1 or surrounding SQSTM1 may contribute to the diverse presentation observed between the patients with ALS. How mutations in SQSTM1 influence ALS is not fully understood; however, it is believed that structural changes to the protein may affect adapter function of SQSTM1 to the LC3 protein in the nascent autophagosome and impair autophagy of proteins that are not recyclable by the proteasome.17 Other cellular systems may also be affected such as ubiquitin binding and regulation of cellular processes including DNA repair, endocytosis, and signal transduction.8,14
In the current study, a potential influential insertion/deletion (I/D) within intron 5 of SQSTM1 was identified using an in silico short SV evaluation algorithm.18 Reverse transcriptase PCR (RT-PCR) analysis revealed a link between the I/D and altered transcript levels. An exploratory association study in a sporadic and familial North American ALS cohort established an association with familial ALS (fALS), particularly superoxide dismutase 1 (SOD1) mutation-positive patients but not sporadic ALS (sALS). We further examined whether this SV was associated with age of disease onset and duration in SOD1 mutation-positive patients and found no statistically significant association.
Methods
Standard protocol approvals, registrations, and patient consents
This study was approved by the ethical standards of the relevant institutional review board, the Human Research Ethics Committee of the University of Western Australia (RA/4/20/5308). Participants were enrolled after informed consent was obtained. Clinical data were collected according to the Health Insurance Portability and Accountability Act Standards of Confidentiality and Disclosure and approved by the Northwestern University institutional review board (STU0012722/ CR5_STU00012722) and the Duke University institutional review board (PRO00040665/323682). All patients were diagnosed by board-certified neurologists and met the revised El Escorial World Federation of Neurology criteria for diagnosis with ALS.19
SV identification
An SV evaluation algorithm was used to locate the variant within the gene, SQSTM1.18 Detailed methods are described in the supplementary material (see e-methods, links.lww.com/NXG/A229).
PCR amplification and Sanger sequencing
Detailed methods are described in the supplementary material. Sequences of all primers used for this study are outlined in table e-1 (links.lww.com/NXG/A229).
Olfactory neurosphere-derived cells
Culturing information can be found in the supplementary data.
RT-PCR and densitometry
Detailed methods are described in the supplementary material. Sequences of all primers used for this study are detailed in table e-1 (links.lww.com/NXG/A229).
ALS and healthy control participants
A cohort of 196 fALS, 207 sALS, and 562 healthy control participants were recruited into the Neurologic Diseases Registry, Northwestern University, Chicago, USA, and Duke University, North Carolina, USA. The 196 fALS cases from Northwestern University belong to 74 different families with possible within-family genetic correlations. The mutation data for all fALS patients are detailed in table e-2 (links.lww.com/NXG/A229).
Statistics
Data are reported as mean ± SD where appropriate. Statistical differences in genotype proportions for independent cases were assessed using Pearson χ2 test with Yates correction. Proportions and ages of onset involving familial groups were analyzed using the mixed effects regression models to account for possible within-family correlations. Durations were analyzed using Cox proportional hazards models with clustering to account for the correlations. Analyses were carried out in IBM SPSS Statistics version 25.0 (IBM Co., Armonk, NY) and R version 3.4.3 (R Foundation for Statistical Computing, Vienna, Austria).
Data availability
Study data for the primary analyses presented in this report are available on reasonable request from the corresponding author.
Results
Identification of a polymorphic variant within the SQSTM1 gene
An SV evaluation algorithm was used to identify the genetic variant located within SQSTM1. The identified cytosine adenine adenine adenine (CAAA) I/D within intron 5 of SQSTM1 was evaluated on the NCBI database; although the region had been previously reported (NC_0000005.0), no disease associations were established. The recorded NCBI data demonstrated uncertainty to its polymorphic nature; currently, the region is recorded as a series of insertion/deletions of varying length, with no allelic frequency data. Conventional polyacrylamide gel electrophoresis revealed a biallelic variant, consisting of a heterozygous I/D genotype in lanes 2, 3, and 7, a homozygous D/D genotype in lanes 4 and 8, and a homozygous I/I genotype in lanes 5, 6, and 9 (figure 1A). Sanger sequencing confirmed that the I allele contained a CAAA insertion when compared with the D allele (figure 1B). To determine the effect of the CAAA SV on SQSTM1 expression, SQSTM1 was assessed in a panel (n = 3) of olfactory neurosphere-derived (ONS) cells obtained from the healthy controls. Semiquantitative RT-PCR revealed a stepwise increase in the level of SQSTM1 transcript, with the I/I genotype yielding the lowest levels and the I/D and D/D genotypes producing a 1.77-fold and 2.47-fold relative increase in SQSTM1 transcript, respectively (figure 1C).
(A) PCR and native polyacrylamide gel electrophoresis across a random selection of control DNA samples and compared against a 100bp ladder. (B) Sanger sequencing of the I allele and D allele. (C) SQSTM1 transcript levels of SQSTM1 exon 4–7 analyzed using RT-PCR on RNA from a panel of control ONS cells. Relative densitometry was calculated with SQSTM1 transcript signal standardized to each respective GAPDH signal. GAPDH = glyceraldehyde 3-phosphate dehydrogenase; I/D = insertion/deletion; ONS = olfactory neurosphere derived cells; RT-PCR = Reverse transcriptase PCR; SQSTM1 = sequestosome 1.
The SQSTM1 variant is associated with fALS
Following the identification of the CAAA I/D variant, we proceeded to determine the variant frequency in a cohort of 196 fALS patients, 207 sALS patients, and 562 healthy age-matched controls. The characteristics of the study participants are summarized in table 1, including sex, age, disease duration, and family history of each cohort. Of the ALS cohort, patients with familial mutations (48.6%) were further grouped by affected gene, specifically C9orf72 (2.5% of patients with ALS), SOD1 (41.4% of patients with ALS), or TDP-43 (4.7% of patients with ALS).
Characteristics of patients and healthy control participants
There was no difference in the frequency of the SQSTM1 variant observed between healthy controls and sALS cases (χ2 = 0.032, df = 2, p = 0.984, odds ratio [OR] = 0.974, 95% confidence interval [CI]: 0.660, 1.436). However, there was a difference in SQSTM1 variant frequency observed between healthy controls and fALS patients (χ2 = 12.791, df = 2, p = 0.002, OR = 1.79, 95% CI: 1.250, 2.562 assuming independence of cases), with the homozygous (I/I) genotype over-represented in the fALS cohort (33.2%) when compared with the controls (21.7%). When analyzed using mixed effects regression models to account for possible within-family correlations, the (I/I) association remained both genotypical and allelical (p = 0.013 and p = 0.0036, respectively, table 2). This SQSTM1 association was stronger when examining the subset of fALS patients carrying mutations in SOD1 (χ2 = 16.754, df = 2, p = 0.0002, OR = 1.869, 95% CI: 1.281, 2.726 assuming independent cases). When accounting for family structures, the SQSTM1 variant was strongly associated with patients carrying a SOD1 mutation both genotypically and allelically (p = 0.001 and p = 0.0005, respectively) (table 2). When analyzed with SOD1-A5V mutation-positive patients excluded, the strong SOD1 association remains (χ2 = 6.801, p < 0.009, OR = 1.830, 95% CI: 1.157, 2.896).
Association of SQSTM1 variant and ALS disease status
The SQSTM1 variant is not associated with age at onset of disease or survival in patients with SOD1.
To determine if the SQSTM1 variant is associated with SOD1 mutation-positive ALS patient outcomes, age at onset and survival were analyzed. There was no association observed between the age at onset in patients with the I/I (Mean Rank = 87.25), I/D (Mean Rank = 76.40), or D/D (Mean Rank = 84.43) genotypes (p = 0.2, mixed model regression, figure 2A). Survival curves were generated to compare durations of each genotypic group (figure 2B). No association was observed between patients carrying each genotype and their survival after taking into account of familial correlations via a clustered Cox regression (p = 0.55).
(A) The median and distribution of age at onset (years) of SOD1 mutation-positive patients with ALS (n = 167) grouped by each SQSTM1 variant genotype. (B) Kaplan-Meier survival curves of SOD1 mutation-positive patients, comparing the SQSTM1 genotypes assuming independent measurements. A robust log-rank test accounting for familial correlation was performed to assess any association between the groups. Survival was measured in months from ALS diagnosis until death. ALS = amyotrophic lateral sclerosis; SOD1 = superoxide dismutase 1; SQSTM1 = sequestosome 1.
Discussion
A growing body of literature has highlighted SVs, their abundance throughout the human genome, and their potential role in the pathogenesis of ALS and other neurodegenerative diseases.20,–,24 SVs are responsible for greater diversity at the nucleotide level between 2 human genomes than any other form of genetic variations and are three-fold more likely to associate with genome-wide association studies (GWASs) signals than single nucleotide polymorphisms (SNPs).20 SVs that remain cryptic to current sequencing algorithms are likely to account for disease-causing variation in unsolved Mendelian disorders and missing heritability in complex disorders.20,23 SVs may affect gene expression and therefore may play an important but understudied role in disease susceptibility.20,25 Recent discoveries of SVs as informative disease risk markers for rare genetic disorders provide compelling evidence for ongoing investigation into the association between SVs and rare genetic diseases.26,–,30 The novel bioinformatics SV evaluation algorithm tool prioritizes potential functional/causal SVs within candidate regions identified using GWAS.18 In ALS and all rare genetic diseases, these highly polymorphic markers are often overlooked, largely because of the limitations in the current gene sequencing platforms such as next-generation sequencing and GWAS, which are primarily designed to detect SNPs.
Two direct lines of evidence link SQSTM1 to ALS: the first through etiology on account of mutations associated with ALS and second through pathology, where p62 immune-reactive inclusions are found in virtually all forms of ALS and ALS-frontotemporal dementia.8 We now provide a third link of SQSTM1 variants to the fALS. SQSTM1-encoded p62 protein is a key scaffolder involved in cellular signaling and protein degradation through the autophagosome-lysosome system.17 SQSTM1/p62 mutations may confer a toxic gain of function through protein interactions, leading to dysregulation of cell signaling pathways, protein misfolding, and aggregation.8 Evidence for this in the literature suggests parallels between p62 and other proteins associated with neurodegeneration, including SOD1-linked ALS, C9orf72, ubiquilin 2, TDP-43, FUS, optineurin, beta-amyloid, α-synuclein, and tau.8,31 Owing to its role in protein degradation, overexpression of p62 has been shown to be protective in some neurodegenerative animal models, but overexpression in a SOD1 ALS model was found to accelerate disease onset.32,–,34 Taken together, these studies suggest that a fine balance in p62 levels is required for optimal signaling and protein clearance. Consequently, small changes in the level of p62 expression, which may be produced by variants such as the SQSTM1 intron 5 I/D, could tip the balance of p62 expression, contributing to the disease. Previously identified mutations within SQSTM1 have been associated with ALS.35,36 As such, it has been suggested that these mutations may have a direct role in ALS pathogenesis, presenting as an important target for future therapy.8
In this study, we hypothesized that structural variations in SQSTM1 may uncover novel susceptibility factors that underpin this disease. SQSTM1 mutations, and p62 aggregation, have been reported not only in ALS but also in other complex neurodegenerative diseases, including Parkinson disease, Paget disease of bone, Alzheimer disease and frontotemporal dementia.8,–,13 A semiquantitative assessment of the mRNA expression revealed a difference in SQSTM1 transcript levels in the ONS cells carrying each genotype. Neural tissue derived from olfactory mucosa, such as primary ONS cells, provide informative cellular models for neurodegenerative diseases. Primary ONS cells are neural progenitor cells and more accurately reflect motor neuron cellular function than fibroblasts and PBMCs, and unlike postmortem brain tissue, they can be isolated from living patients.37,38 As such, the altered SQSTM1 transcript levels observed in the ONS cells with the I/I genotype may be reflective of altered SQSTM1 expression in motor neurons. Although this result requires validation in a larger sample size, it does suggest some level of regulation by the SV on SQSTM1 transcript expression. This may be due to altered transcription efficiency, splicing, or transcript stability, potentially translating to altered protein expression that may contribute to disease pathologies.
An evaluation of the SQSTM1 variant within a cohort of 403 patients with ALS and 562 healthy age-matched controls revealed an association with fALS disease risk, but not age of disease onset or disease severity. Replications in additional fALS cohorts are necessary to truly elucidate the nature of this variant. Causality has been reported between ALS and SQSTM1 variants rs796051870, rs776749939, rs796052214, and as such, the ongoing investigation between ALS and SQSTM1 gene variants is necessary.8,39,40 P62 has previously been reported as overexpressed and accumulated in inclusions of sporadic inclusion body myositis, reinforcing its importance in neurodegenerative diseases.12 We anticipate that as additional SVs are identified, these will further stratify other relevant disease phenotypes such as age at onset, duration, and disease progression.
Study funding
The study was funded by research support from the Perron Institute for Neurologic and Translational Science and the Les Turner Foundation for ALS. The funders have no role in the design of the study and collection, analysis, decision to publish, interpretation of data, or preparation of the manuscript.
Dislosure
Disclosures available: Neurology.org/NG.
Acknowledgment
The authors are deeply indebted to the late Allen D. Roses, PhD, who provided mentorship to several of the authors and who originally conceptualized and initiated this research. The authors would also like to thank the contribution of Prof. Alan Mackay Sim and Prof. George Mellick (Griffith University, Institute for Drug Discovery) for the generous gift of the ONS cell cultures. The study was in part funded by NIH grants to TS (NS050641, NS046535), the Les Turner ALS Foundation/Herbert C. Wenske Foundation Professorship and the Les Turner ALS Research and Patient Care Center.
Appendix Authors


Footnotes
Go to Neurology.org/NG for full disclosures. Funding information is provided at the end of the article.
The Article Processing Charge was funded by the authors.
- Received July 19, 2019.
- Accepted in final form January 23, 2020.
- Copyright © 2020 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.
This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND), which permits downloading and sharing the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal.
References
- 1.↵
- Chakravarti A,
- Kapoor A
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- Lyall DM,
- Royle NA,
- Harris SE, et al
- 6.↵
- Lutz MW,
- Saul R,
- Linnertz C, et al
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- Lowe J,
- Blanchard A,
- Morrell K, et al
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- Pankiv S,
- Clausen TH,
- Lamark T, et al
- 18.↵
- Saul R,
- Lutz MW,
- Burns DK,
- Roses AD,
- Chiba-Falek O
- 19.↵
- Brooks BR,
- Miller RG,
- Swash M,
- Munsat TL
- 20.↵
- 21.↵
- 22.↵
- Roses AD
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- Sproviero W,
- Shatunov A,
- Stahl D, et al
- 30.↵
- 31.↵
- 32.↵
- Doi H,
- Adachi H,
- Katsuno M, et al
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- Reynolds B,
- Deleyrolle L
- Féron F,
- Perry C,
- Girard SD,
- Mackay-Sim A
- 38.↵
- Matigian N,
- Abrahamsen G,
- Sutharsan R, et al
- 39.↵
- 40.↵
- Le Ber I,
- Camuzat A,
- Guerreiro R, et al
Letters: Rapid online correspondence
REQUIREMENTS
If you are uploading a letter concerning an article:
You must have updated your disclosures within six months: http://submit.neurology.org
Your co-authors must send a completed Publishing Agreement Form to Neurology Staff (not necessary for the lead/corresponding author as the form below will suffice) before you upload your comment.
If you are responding to a comment that was written about an article you originally authored:
You (and co-authors) do not need to fill out forms or check disclosures as author forms are still valid
and apply to letter.
Submission specifications:
- Submissions must be < 200 words with < 5 references. Reference 1 must be the article on which you are commenting.
- Submissions should not have more than 5 authors. (Exception: original author replies can include all original authors of the article)
- Submit only on articles published within 6 months of issue date.
- Do not be redundant. Read any comments already posted on the article prior to submission.
- Submitted comments are subject to editing and editor review prior to posting.
You May Also be Interested in
Use of Whole-Genome Sequencing for Mitochondrial Disease Diagnosis
Dr. Robert Pitceathly and Dr. William Macken
► Watch
Related Articles
- No related articles found.
Topics Discussed
Alert Me
Recommended articles
-
Article
Unraveling ALS due to SOD1 mutation through the combination of brain and cervical cord MRIFederica Agosta, Edoardo Gioele Spinelli, Ivan V. Marjanovic et al.Neurology, January 24, 2018 -
Amyotrophic lateral sclerosis
Phenotype and genotype analysis in amyotrophic lateral sclerosis with TARDBP gene mutationsP. Corcia, P. Valdmanis, S. Millecamps et al.Neurology, April 25, 2012 -
Articles
Expanded ATXN2 CAG repeat size in ALS identifies genetic overlap between ALS and SCA2P. Van Damme, J.H. Veldink, M. van Blitterswijk et al.Neurology, May 11, 2011 -
Article
Disease-modifying effects of an SCAF4 structural variant in a predominantly SOD1 ALS cohortJulia Pytte, Loren L. Flynn, Ryan S. Anderton et al.Neurology: Genetics, July 01, 2020