Table 1 Computational tools used in tumor immunogenomics with high-throughput next-generation sequencing data

From: Technological advances in cancer immunity: from immunogenomics to single-cell analysis and artificial intelligence

Tool

Characteristics

URL

Year

Ref.

Quantification of immune cells in TIME

CIBERSORT

Based on linear support vector regression, deconvolution from microarray data, and known gene expression profiling set.

https://cibersort.stanford.edu/

2015

30

CIBERSORTx

Expanding data source to single-cell RNA-seq.

https://cibersortx.stanford.edu/

2019

34

DeconRNASeq

Constrained least square regression model validated with RNA-seq data from five human tissues

http://bioconductor.org/packages

2013

273

EPIC

Based on constrained least square, incorporates a non-negative condition into deconvolution.

https://gfellerlab.shinyapps.io/EPIC_1-1/

2017

274

ESTIMATE

Generating a stromal score and immune score to reflect the tumor purity based on ssGSEA

https://sourceforge.net/projects/estimateproject/

2013

27

FARDEEP

Based on adaptive least trimmed square, FARDEEP removes outliers and outputs the absolute quantification of cell types

https://github.com/YuningHao/FARDEEP.git

2019

32

MCP-counter

The score is the geometric mean of the expression level of cell-specific genes, implying the absolute abundance of immune cell types among samples

http://github.com/ebecht/MCPcounter

2016

29

MuSiC

Deconvolution of bulk sequencing data based on cell type-specific gene expression reference from single-cell RNA-seq

https://github.com/xuranw/ MuSiC

2019

33

NITUMD

A semi-supervised nonnegative matrix factorization framework with a trichotomous signature matrix

https://github.com/tdw1221/NITUMID

2020

275

PERT

A non-negative maximum likelihood-based method applied to fresh human umbilical cord blood samples

2012

276

quanTIseq

Quantification of ten different immune cell types and other uncharacterized cells based on RNA-seq data

http://icbi.at/quantiseq

2019

277

TIMER

Immune cells are estimated via transcriptomic data and the correlations among the immunological, genomic, and clinical features were established.

https://cistrome.shinyapps.io/timer/

2016

278

xCell

Spillover compensation is used to separate cell types with high correlation

http://xcell.ucsf.edu/

2017

28

Prediction of mutated proteins

CN-Learn

Machine-learning framework integrating calls from multiple CNV detection algorithms and learning to accurately identify true CNVs

https://github.com/girirajanlab/CN_Learn

2019

180

deepSNV

Detecting and quantifying sub-clonal SNVs in mixed populations even for low-frequency variants

http://www.bioconductor.org

2012

181

DeepVariant

SNP and small-indel variant caller using deep neural networks in aligned NGS read data

https://github.com/google/deepvariant/

2018

182

EBCall

Discriminating somatic mutations from sequencing errors with both moderate and low allele frequencies

https://github.com/friend1ws/EBCall

2013

61

GATK

Industry standard for identifying SNPs and indels via analyzing WES, WGS, and RNA-seq data

https://gatk.broadinstitute.org/hc/en-us

2010

51

LoFreq

Modeling sequencing run-specific error rates to accurately call variants occurring in <0.05% of a population

http://sourceforge.net/projects/lofreq/

2012

53

MuTect2

Sensitive detection of somatic point mutations especially in low-allelic-fraction events

https://software.broadinstitute.org/cancer/cga/mutect

2013

52

Platypus

Using local de novo assembly to generate candidate variants, including SNPs, indels, and complex polymorphisms

http://www.well.ox.ac.uk/platypus

2014

279

PyroHMMsnp

Realigning read sequences around homopolymers and inferring the underlying genotype by using a Bayesian approach

https://github.com/homopolymer/PyroTools/

2013

280

SAMtools

Variant caller utilizing the post-processing alignments in the SAM/BAM format

http://samtools.sourceforge.net

2009

62

SCcaller

Firm foundation for standardized somatic-mutation analysis in single-cell genomics based on single-cell multiple displacement amplification (SCMDA)

https://github.com/biosinodx/SCcaller/

2017

281

SomaticSeq

Somatic mutation detection pipeline used to produce highly accurate somatic mutation calls for both SNVs and small INDELs

http://bioinform.github.io/somaticseq/

2015

282

SomaticSniper

Calling of somatic SNPs and indels from matched tumor–normal NGS data

http://gmt.genome.wustl.edu/packages/somatic-sniper/

2011

56

Strelka2

Fast and accurate caller of germline and somatic variants based on Strelka

https://github.com/Illumina/strelka

2018

58

VarDict

Variant caller of SNV, MNV, INDELs, and SVs, enabling ultra-deep sequencing

https://github.com/AstraZeneca-NGS/VarDict

2016

60

VarScan2

Detection of somatic mutations and CNVs in exome data from tumor–normal pairs

http://varscan.sourceforge.net

2012

55

HLA typing

HISAT2

Graph-based genome alignment and genotyping, also applied for DNA fingerprinting

https://github.com/DaehwanKimLab/hisat2

2019

84

HLA-HD

Extraction of six-digit resolution HLA-I and HLA-II from NGS data

https://www.genome.med.kyoto-u.ac.jp/HLA-HD/

2017

77

HLA-miner

HLA-I and HLA-II typing directly from non-targeted RNA-seq, WGS and WES data

http://www.bcgsc.ca/platform/bioinfo/software/hlaminer

2012

76

HLAProfiler

K-mer profile-based method for HLA calling in RNA-seq data for both rare and common HLA alleles at two-field precision

https://github.com/ExpressionAnalysis/HLAProfiler

2017

283

HLAreporter

Extraction of HLA-I and HLA-II from NGS data at four-digit resolution

http://paed.hku.hk/genome/

2015

81

HLAscan

Determination of HLA type across the whole-genome, exome, and target sequences

http://www.genomekorea.com/display/tools/HLA_SCAN

2017

284

HLAssign

First highly automated open-source HLA-typing method for NGS data to three-field resolution

https://www.ikmb.uni-kiel.de/resources/download-tools/software/hlassign

2015

285

HLA-VBseq

Genotyping of HLA alleles at an 8-digit resolution from WGS data without the need of prior knowledge regarding the HLA loci

http://nagasakilab.csml.org/hla

2015

79

Kourami

Graph-guided assembly technique used to provide highly classical HLA typing

https://github.com/Kingsford-Group/kourami

2018

85

Optitype

Genotyping of major and minor HLA-I alleles from RNA-seq, WGS, and WES data not specifically enriched for the HLA cluster

http://github.com/FRED-2/OptiType

2014

78

PHLAT

High-accuracy genotyping of HLA-I and HLA-II alleles from RNA-seq, WGS, WES, and targeted sequencing at a four-digit resolution

https://sites.google.com/site/phlatfortype

2014

83

Polysolver

High-precision HLA typing of WES data even relatively low-coverage WES data, and subsequent mutation detection

http://www.broadinstitute.org/cancer/cga/polysolver

2015

82

seq2HLA

Using standard RNA-Seq reads as input to determine the HLA-I and HLA-II types and expression at a four-digit resolution

https://github.com/TRON-Bioinformatics/seq2HLA

2012

70

SNP2HLA

Imputing four-digit classical alleles and amino acid polymorphisms at class I and class II loci

http://faculty.washington.edu/browning/beagle/beagle.html

2013

80

Prediction of antigen-MHC binding affinity

ACME

Pan-specific peptide–MHC class I binding prediction through attention-based deep neural networks

https://github.com/HYsxe/ACME

2019

286

MHCAttnNet

MHC-peptide binding prediction of MHC alleles classes I and II using an attention-based deep neural model

https://github.com/gopuvenkat/MHCAttnNet

2020

287

MHCflurry

Open-source class I MHC binding affinity prediction, using mass spectrometry datasets for model selection and showing competitive accuracy

https://github.com/openvax/mhcflurry

2018

98

MHCSeqNet

Open-source deep neural network model for universal MHC binding prediction, accepting peptides of any length

https://github.com/cmbcu/MHCSeqNet

2019

288

NetMHC

High accuracy prediction of pMHC binding affinity to human and non-human MHC-I molecules based on ANN and PSSMs

http://www.cbs.dtu.dk/ services/NetMHC

2008

93

NetMHCII

High accuracy prediction of pMHC binding affinity to human and non-human MHC-II molecules based on ANN and PSSMs

http://www.cbs.dtu.dk/services/NetMHCII

2018

289

NetMHCIIpan

Pan-specific version of netMHCII

http://www.cbs.dtu.dk/services/NetMHCIIpan

2020

102

NetMHCpan

Pan-specific version of netMHC

http://www.cbs.dtu.dk/services/NetMHCpan

2020

102

PSSMHCpan

PSSM based software for predicting class I

peptide-HLA binding affinity

https://github.com/BGI2016/PSSMHCpan

2017

94

PUFFIN

Deep residual network-based computational approach that quantifies uncertainty in pMHC affinity prediction

http://github.com/gifford-lab/PUFFIN

2019

290

  1. ANN artificial neural network, CNVs copy number variations, HLA human leukocyte antigen, INDELs insertions and deletions, NGS next-generation sequencing, MNVs multiple-nucleotide variants, PSSM position-specific scoring matrix, RNA-seq RNA sequencing, SNP single-nucleotide polymorphism, SNVs single nucleotide variations, SVs structural variants, TIME tumor immune microenvironment, WES whole-exome sequencing, WGS whole-genome sequencing