Introduction

Recent advances in high-throughput sequencing technologies have revolutionized genetic studies of complex diseases1,2,3,4,5,6,7. The emergence of next-generation sequencing (NGS) platforms has enabled genomic analyses at an unprecedented scale and resolution. These technologies have facilitated whole-genome sequencing (WGS) and whole-exome sequencing (WES) of large cohorts, unveiling novel disease-associated loci and providing deeper insights into the genetic architecture of complex disorders1,2,3,4,5,6,7,8,9.

Detecting disease-causing variants from WES/WGS data is a complex task. Today, most clinical genetics labs that analyze WES/WGS data follow the American College of Medical Genetics and Genomics (ACMG) guidelines for interpreting sequence variants10. This mainly includes detecting high-quality variants with lower allele frequency and damaging effects on the protein function. Other factors usually considered are the segregation of the variant with the phenotype and existing evidence for the variant or gene association with disease. To assist clinicians in this laborious process, several automated tools such as Exomiser11, AMELIE12, LIRICAL13, AutoCaSc14, etc., have been devised to prioritize disease-specific variants (mainly single nucleotide variants [SNVs] and insertions/deletions [indels]) from WES/WGS data.

Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder (NDD) that has greatly benefited from the emergence of NGS technologies. Recent large-scale WES and WGS studies have identified thousands of ASD susceptibility genetic variants in hundreds of genes5,15,16,17,18,19,20. Nevertheless, despite these advances in ASD genetics, clinically meaningful genetic variants are identified only in 8–30% of affected probands5,21,22. Thus, there is a need for new approaches to facilitate the detection of ASD-specific variants from WES/WGS data.

Here, we present two scoring approaches called AutScore and AutScore.r that integrate variant and gene-level information such as pathogenicity, deleteriousness, clinical relevance, gene-disease association, and gene-variant inheritance pattern from a wide range of bioinformatics tools and databases to generate a single score for prioritizing clinically relevant ASD candidate variants from WES data for simplex and multiplex families. We applied both versions of AutScore to WES data from 581 Israeli ASD-affected probands and their parents. We assessed its performance by comparing the obtained results to a manual and blinded evaluation of the variants by clinicians and to AutoCaSc14, an existing variant prioritization tool for NDDs.

Materials and methods

Study sample

Our sample included 581 children diagnosed with ASD, registered with the Azrieli National Centre for Autism and Neurodevelopment Research (ANCAN)23,24. Based on clinical records, none of the parents had registered themselves with ASD, intellectual disability, or other NDDs. Genomic DNA was extracted from saliva samples from children and their parents using Oragene®•DNA (OG-500/575) collection kits (DNA Genotek, Canada).

Whole exome sequencing (WES)

Whole Exome Sequencing (WES) analysis was conducted in two labs: (1) the Broad Institute as a part of the Autism Sequencing Consortium (ASC) project17 and (2) the Clalit Health Services sequencing lab at Beilinson Hospital. WES was performed using Illumina HiSeq sequencers in both places, followed by the Illumina Nextera exome capture kit. The sequencing reads were aligned to human genome build 38 and aggregated into BAM/CRAM files. Then, the Genome Analysis Toolkit (GATK)25 (Broad) or Illumina’s DRAGEN pipeline26 (Beilinson) was used for variant discovery and the generation of joint variant calling format (vcf) files.

Variant filtering and annotations

The multi-sample vcf files generated by the Genome Analysis Toolkit (GATK) and the DRAGEN platform were undertaken with identical procedures for variant filtering and annotation, as previously detailed27. Subsequently, we identified pathogenic (P), likely pathogenic (LP), or likely gene-disrupting (LGD) variants using the InterVar28 tool in conjunction with our proprietary tool, Psi-Variant27. We kept only those LP/P/LGD variants that affected genes associated with ASD or other NDDs according to the SFARI gene29 or the DisGeNET30 databases for downstream analyses. Subsequently, 1161 candidate variants in 441 probands remained for further analysis (Fig. 1).

Fig. 1
figure 1

Analysis workflow for detecting ASD candidate variants from the WES data. The final list of ASD candidate variants (n = 116) consists of 65 ‘likely’ and 51 ‘possibly’ ASD candidates.

Prioritization of ASD candidate variants

We developed a metric called AutScore that integrates data from diverse bioinformatics tools to prioritize the detected list of ASD candidate variants as follows:

$$AutScore = {\rm{I}}+{\rm{P }}+{\rm{D }}+{\rm{ S}}+{\rm{G}}+ {\rm{C}}+{\rm{H}}$$

Where:

  • I – indicates the pathogenicity of a variant based on InterVar28 classification as follows: ‘benign’ = -3; ‘likely benign’ = -1; ‘variants of uncertain significance (VUS)’ = 0; ‘likely pathogenic’ = 3, and ‘pathogenic’ = 6.

  • P – cumulatively assess the deleteriousness of a variant based on the following six in-silico tools (SIFT31 (< 0.05), PolyPhen-232 (≥ 0.15), CADD33 (> 20), REVEL34 (> 0.50), M_CAP35 (> 0.025) and MPC36 (≥ 2)). For each of these tools, a variant gets a score of 1 (deleterious) or 0 (benign), and these scores are aggregated to generate a single score ranging from 1 to 6.

  • D – indicates the agreement of variant-phenotype segregation with the predicted segregation by the Domino tool37 where agreement with Domino’s ‘very likely dominant/recessive’ classes = 2; agreement with Domino’s ‘likely dominant/recessive’ classes = 1; disagreement with Domino’s ‘very likely dominant/recessive’ classes = -2; disagreement with Domino’s ‘likely dominant/recessive’ classes = -1; and 0 were assigned for variants with Domino’s ‘either dominant or recessive’ segregation.

  • S – indicated the strength of association of the affected gene with ASD according to the SFARI gene database29 where ‘high confidence’ = 3; ‘strong candidate’ = 2; ‘suggestive evidence’ = 1; and not in SFARI database = 0.

  • G – indicated the strength of association of the affected gene with ASD according to the DisGeNET database30 where weak/no association (GDA = 0 to 0.25) = 0: mild association (GDA = 0.25 to 0.50) = 1: moderate association (GDA = 0.50–0.75) = 2: strong association (GDA = 0.75 and above) = 3.

  • C – pathogenicity of a variant based on ClinVar38 where ‘benign’ = -3; ‘likely benign’ = -1; ‘VUS’ or not in ClinVar = 0; ‘Likely pathogenic’ = 1; ‘Pathogenic’ = 3.

  • H – segregation of variants in the family weighted as (n2)-1 where n = number of probands in a family that carries the detected variants.

Clinical genetics validation

Variants with AutScore ≥ 10 (top quartile of candidate variants scores) were visually validated using the IGV software39 and then manually examined by two clinical geneticists according to the standard ACMG/AMP guidelines10. The clinical genetic experts had ample experience in evaluating the clinical significance of different genetic tests, including WES of ASD triplets, in clinical genetics labs. They used their clinical experience to assess the clinical significance of the candidate variants regarding the ASD phenotype of the child and consequently assigned each variant one of the following rankings: ‘Likely,’ ‘Possibly,’ and ‘Unlikely’. Importantly, each of the clinical experts independently assigned the ranking to the candidate variants without knowing AutScore values and the criteria used to determine these scores. Then, they compared their ranking and together reached a consensus ranking for each variant.

Refinement of AutScore weighting

To reduce the subjectivity of weights assigned to the different tools used by AutScore, we fitted a generalized linear model to these data with the different AutScore modules (i.e. I, P, D, S, G, C, H) being used as predictors and the clinical genetic ranking as the outcome (likely = 1 and possibly or unlikely = 0). For further clarity, we call this revised version of AutScore as “AutScore.r” throughout the manuscript. Details about the probabilistic weights (beta coefficients) and other statistics (standard error, p-value) that were used to develop AutScore.r can be found in Supplementary Table S1.

Statistical analysis

We used a Receiver Operating Characteristic (ROC) analysis to assess the performance of both AutScore versions in detecting ASD candidate variants using the clinical experts’ rankings as the reference. We then used Yuden J’s statistics40 to identify the optimal cutoff in each AutScore version. Then, we used this cutoff to compute the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy for the two AutScore versions in comparison to the clinical experts’ ranking. The diagnostic yield (%) of each tool was also computed as the proportion of the number of ASD probands that have at least one ASD candidate variant out of the total affected ASD probands that completed their WES analysis. Finally, we compared the performance of the two AutScore versions in detecting ASD candidate variants as well as with the performance of AutoCaSc14, an existing variant prioritization tool for NDDs.

Software

Data storage, management, and analyses were conducted in a high-performing Linux cluster using Python version 3.5 and R version 1.1.456. All statistical analyses and data visualization were performed and incorporated into R.

Results

A total of 1161 variants distributed in 687 genes in 441 ASD probands were evaluated by the AutScore algorithm. Variant’s scores ranged from − 4 to 25, with a mean ± SD of 5.89 ± 4.18 (Fig. 2). The clinical experts examined 201 (17.31%) variants with an AutScore of ≥ 10. Among these, 24 (11.9%) were found to be false positive indels in the visual assessment using the IGV software and thus removed from subsequent analyses. Of the remaining 177 variants, 65 (36.7%) were ranked as ‘likely,’ 51 (28.8%) as ‘possibly,’ and 61 (34.5%) as ‘unlikely’ ASD candidate variants (Supplementary Table S2).

Fig. 2
figure 2

Histogram depicting the distribution of AutScore ranking of the 1161 ASD candidate genetic variants.

Determining an optimal cutoff for AutScore

The ROC analyses of the two AutScore versions are depicted in Fig. 3. The revised version, AutScore.r had a slightly better performance in detecting “likely” ASD variants compared to the original version (AUC = 0.90, 95% CI = 0.84–0.95 vs. AUC = 0.84, 95% CI = 0.78–0.91 respectively). Yuden J’s statistics suggested an optimal AutScore cutoff of ≥ 12 (Yuden J = 0.53) and a cutoff of ≥ 0.335 for AutScore.r (Yuden J = 0.69).

Fig. 3
figure 3

ROC curves of AutScore (Orange line) and AutScore.r (Blue line) in detecting ‘likely’ ASD variants. Both approaches demonstrated good performance in detecting ‘likely’ ASD variants with AutScore.r performing slightly better than AutScore (AUC = 0.90, 95% CI = 0.84–0.95 vs. AUC = 0.84, 95%CI = 0.78–0.91) respectively).

Comparing the performance between AutScore, AutScore.r and AutoCasC in detecting ASD candidate variants

Next, we used the suggested Yuden J’s statistics cutoff of the two AutScore versions to compare their performance in detecting ASD candidate variants vis-à-vis the existing NDD prioritization tool, AutoCaSc, using its recommended cut-off of > 614. The results of this comparison are presented in Fig. 4; Table 1. A moderate and statistically significant correlation was seen between the AutScore and AutoCaSc scores (r = 0.58 and r = 0.55 for AutoScore and AutoScore.r respectively, p < 0.01). All the three tools were highly sensitive in detecting ASD variants using their recommended cut-off (AutScore = 0.91, AutScore.r = 0.81 and AutoCasC = 0.92, respectively) and achieved a similar diagnostic yield of 9–10%. However, both AutScore and AutScore.r had better specificity, PPV and accuracy than AutoCasC with AutScore.r having a slightly better performance than AutScore in these parameters (Specificity: 0.87, 0.62 and 0.13; PPV: 0.79, 0.58 and 0.40; and Accuracy: 0.85, 0.72 and 0.43 for AutScore.r, AutScore and AutoCasC respectively, Table 1). The variant list (n = 177) with their respective scores from these three tools as well as their clinical assessment ranking is provided in Supplementary Table S2.

Fig. 4
figure 4

Scatter plots of the relationships between (A) AutScore and AutoCaSc and (B) AutScore.r and AutoCaSc in detecting ASD candidate variants. Dashed horizontal and vertical lines depicting the cutoff used by the different tools in determining “positive findings”. Colored dots depict the clinical genetic variant ranking (“likely” [green], “possibly” [yellow], and “unlikely” [red]).

Table 1 Comparing the performance between AutScore, AutScore.r and autocasc in detecting ASD candidate variants.

Characteristics of the variants detected by AutScore.r

Finally, we examined the characteristics of the 67 genetic variants detected by the best-performing tool – AutScore.r using a cutoff of  ≥0.335 (Table 2). Of these variants, 53, 10, and 4 variants were ranked as ‘likely’, ‘possibly’, and ‘unlikely’ ASD candidate variants, respectively, by the clinical experts. Most of the detected variants (92.5%) were distributed in genes associated with ASD according to the SFARI Gene database29 with most of them considered as high-confidence ASD genes (i.e., SFARI score of 1). Another five variants were distributed in 5 genes not listed in the SFARI database and thus could be considered as novel ASD candidate genes. Around 80% of the detected variants were classified as LP/P according to the ACMG/AMP variant interpretation criteria10, and more than 73% were denovo variants.

Table 2 Characteristics of the variants detected with AutScore.r ≥ 0.335 (N = 67).

Examination of AutScore performance in an ethnically diverse population

AutScore was developed based on WES data from an ASD cohort within the Israeli population. Thus, its performance parameters presented here could be specific to this population. To address this issue, we applied AutScore.r to WGS data from the BARAKA-Qatar study41 of 372 individuals from 100 local families. Of the 10 dominant SNVs reported in Table 2 of their paper, 7 variants (70%) had an AutScore.r score of ≥ 0.335 (Supplementary Table S3). In addition, only one of the eight reported missense variants and none of the 5 x-linked variants had an AutScore.r score of ≥ 0.335. Of note, all the recessive and x-linked variants reported in the Qatar study except the one detected by AutScore.r, were classified as VUS by the ACMG criteria (Supplementary Table S3).

Discussion

Discerning clinically relevant ASD candidate variants from many variants detected in WES analyses pose a formidable challenge for clinical experts, demanding considerable time and effort. Here, we present AutScore, a novel bioinformatics prioritization tool that integrates variant and gene-level information to prioritize ASD candidate variants derived from WES data. AutScore can be integrated into an existing bioinformatic pipeline for WES data analysis by pre-installing the ACMG/AMP10 variant interpretation tool InterVar14 and our in-house tool Psi-Variant27. Although AutScore was initially designed to assess the ASD clinical relevance of rare autosomal SNVs, it can be adapted for analyses of mitochondrial variants, and common heritable variants that are expected to enhance its applicability further. Finally, it can be continuously improved by adjusting the weights of its different modules using regression analyses as demonstrated in its revised AutScore.r version in this study.

Our results indicated that both the original and revised versions of AutScore are highly efficient in detecting clinically relevant ASD variants while achieving an overall diagnostic yield of ~ 10%, comparable to results from prior studies5,21,22. They also perform better than the existing NDD variant prioritization tool, AutoCaSc14, in detecting clinically relevant ASD candidate variants. The higher accuracy of AutScore compared to AutoCaSc is likely because it was explicitly designed to detect ASD candidate variants. At the same time, AutoCaSc focuses on prioritizing candidate variants related to a broader range of NDDs.

The following limitations should be considered when using AutScore. First, the original AutScore metric was established using a trial-and-error approach, assigning certain weights and penalties to its different modules and that may lead to inherent subjectivity biases. This was overcome later by refining the weights of the different modules in the revised version of AutScore.r. Second, the accuracy of both AutScore and AutScore.r are not 100%, thus having some false positive and false negative findings. To address this limitation, one can use different thresholds of these tools to reduce the number of one type of these variants while increasing the other one. Third, in this study AutScore was constrained to specific genes from the DisGeNET30 and SFARI Gene29 databases and hence might have missed some potential candidate variants in genes not cataloged in these databases. Since both of these databases are constantly updating with new genes. future implication of AutScore may have even better performance. Forth, AutScore.r performance was based on variants from the top quartile of the original AutScore. Therefore, it is possible that our analysis underestimated the false-negative rates and, consequently also, the accuracy of the tool. Lastly, in its current form, AutScore is not designed to assess the effect of other types of genetic variants that may contribute to ASD such as CNVs42 and compound heterozygotes43. Additionally, the metric may not function optimally in cases involving probands with incomplete pedigree information and unknown segregation patterns.

Conclusion

AutScore.r constitutes a highly effective automated ranking system designed to prioritize ASD candidate genetic variants in WES data. The utilization of AutScore.r holds the potential to significantly streamline the process of elucidating the specific genetic etiology of ASD within affected families. In doing so, it can contribute to expediting and enhancing the accuracy of clinical management and treatment strategies, ultimately leading to more effective interventions in the context of ASD.