In spatial transcriptomics, tissue samples are randomly positioned. Rotation-sensitive methods can lead to unreliable spatially variable gene (SVG) detection. We highlight their inherent technical pitfalls and discuss strategies for rotation-invariant methods, enhancing the robustness of SVG detection.
One of the most significant tasks in analyzing spatial transcriptomics data is to efficiently and accurately detect spatially variable genes (SVGs). These genes exhibit specific, non-random spatial patterns across tissue sections, reflecting their significant roles in understanding the spatial heterogeneity in gene expression. Detecting SVGs is essential to reveal molecular mechanisms underlying tissue organization and to better understand biological processes controlling tissue functionality and pathology1,2. To address this challenge, numerous computational approaches have been developed, employing advanced statistical, machine learning, and bioinformatics techniques3. These methods are intended to enhance SVG detection, interpretability, and performance in large-scale spatial transcriptomics data.
One major limitation of many existing SVG detection methods is that the results are not invariant under different rotations of spatial coordinates. This implies that the results of SVG detection can be largely dependent on the orientation of the tissue sample during analysis. We know that tissue sections are often positioned arbitrarily during sample preparation, resulting in spatial coordinates that differ based on how a sample was positioned. Many methods cannot cope with such variations, which leads to unreliable results, such as high false positives or negatives in SVG detection. This limitation is particularly worsened in the integrative analysis of multi-slice spatial transcriptomics data, where tissue slices are often misaligned (see Fig. 1), thus significantly hindering seamless integration and comparative evaluation across different tissue sections. Ensuring rotation invariance is therefore pivotal to achieving a robust and reliable analysis, especially when analyzing multiple tissue sections of varying orientations.
Reproduced and modified from Ståhl et al., Science, DOI: 10.1126/science.aaf2403, 2016, AAAS. The original images are from https://www.spatialresearch.org. a H& E images of mouse olfactory bulb tissue replicate samples 1, 2, 3, and 4 from ID 1000L2. b Hematoxylin and eosin-stained brightfield images of human breast cancer biopsy replicate samples 1, 2, 3, and 4 from ID 1000L3. The variability in tissue orientation and positioning during sample preparation highlights the need for rotation-invariant statistical methods in spatial transcriptomics.
In our comprehensive review of existing SVG detection methods, we identified three prevalent methodological misuses that contribute to the serious issue of rotation variance, compromising the validity of SVG detection (see Fig. 2 for an overview of the three misuses). These misuses are due to inappropriate handling of spatial coordinates in statistical models, leading to flawed assumptions and unreliable outcomes. By synthesizing these findings, we emphasize the need for methodological refinements to effectively address these limitations.
a Directly using coordinates in modeling; b inadequate kernel design; c improper data pre-processing. Here, \({{{\bf{S}}}}={({{{{\bf{s}}}}}_{1},{{{{\bf{s}}}}}_{2})}_{n\times 2}\) is the spatial coordinate matrix of n spots. After applying the spatial rotation, the new spatial coordinate matrix is given as S* = SRT, where R is an orthogonal rotation matrix satisfying RTR = RRT = I.
Directly using coordinates in modeling can be problematic
One of the most prominent misuses is when certain methods model spatial coordinates as fixed effects, often accompanied by various transformations or basis expansions in regression models. These models subsequently rely on statistical hypothesis testing of these fixed effects to identify SVGs, including cell-type-specific SVGs (ctSVGs). For instance, CTSV4 applies a zero-inflated negative binomial regression model, treating the two spatial coordinates (s1, s2) or their transformation (h(s1), h(s2)) as regression covariates, followed by the claim of ctSVG by testing the significance of their associated fixed effect regression coefficients β1 and β2. A gene is considered ctSVGs if at least one of these coefficients is statistically non-zero. Similarly, C-SIDE5 utilizes a Poisson regression model, incorporating basis expansions of spatial coordinates to model spatial gene patterns. It uses L smooth basis functions, whose linear combinations approximate the overall spatial expression pattern. Statistical significance is assessed via two-sided z-tests for each basis function, and the smallest adjusted p-value is used to assess the statistical significance. In contrast, spVC6 adopts a two-stage Poisson regression approach to identify ctSVGs. In the first stage, it models K ctSVGs non-spatial effects and tests their significance to claim SVGs, together with testing a residual spatial random effect. In their second stage, a full model incorporates K additional ctSVGs spatial effects, approximated via bivariate penalized spline techniques, to identify ctSVGs. The bivariate spline expansions are based on spatial coordinates and are treated as fixed effects in the estimation and testing of ctSVG, thus exhibiting the rotation invariance issue (see the supplemental file in Su et al.7 for a comprehensive evaluation of these methods violating the rotation-invariant property).
In summary, these methods share a fundamental limitation: their statistical tests rely on rotation-variant fixed-effect modeling of spatial coordinates. Consequently, the testing results change when the coordinate system is altered, including through rotation (see the supplemental file for more details).
Inadequate kernel design fails to capture spatial correlation consistently
Common approaches for SVG detection involve constructing similarity kernel matrices to capture spatial correlation, modeling spatial random effects and then performing hypothesis tests on the corresponding variance components.
However, we noticed that some methods use rotation-variant techniques to construct spatial similarity matrices, leading to potential inconsistencies in statistical inference. For instance, methods that use projection matrices of kernel-transformed spatial coordinates to measure spatial similarity define relationships based on specific coordinate axes, making their results sensitive to rotation (see the supplemental file for more details). Similarly, linear kernels, relying on direct inner products of spatial coordinates, introduce dependencies on the absolute orientation of the coordinate system. As a result, their testing results can change when the coordinate system is rotated, raising concerns about robustness and reproducibility in SVG detection.
SPARK-X8 exemplifies this issue. It constructs a similarity matrix for gene expression and another for spatial patterns using projection matrices, then tests whether these two similarity matrices are independent to determine whether a gene is an SVG. Specifically, the spatial similarity matrix is defined as \({{{\bf{K}}}}={{{\bf{S}}}}^{\prime} ({({{{\bf{S}}}}^{\prime} )}^{T}{{{\bf{S}}}}^{\prime} )^{-1}{{{{\bf{S}}}}}^{{\prime} T}\), where \({{{{\bf{S}}}}}^{{\prime} }\) is the spatial coordinate matrix of dimension n × 2 after some transformation such as exponential or cosine. We showed in the supplemental file that such a projection-based similarity matrix maintains the spatial rotation-invariant property if \({{{{\bf{S}}}}}^{{\prime} }\) it is based on the original coordinates without transformation. However, this property is violated when \({{{{\bf{S}}}}}^{{\prime} }\) it is obtained via exponential or cosine transformation of the original coordinates as proposed by SPARK-X. In this case, the testing outcomes of SPARK-X can change when the coordinate system is rotated, compromising the method’s validity and reliability. To illustrate the issue, we applied SPARK-X8 to the mouse olfactory bulb tissue replicate #89 under different rotations. Figure 3 shows the inconsistent SVG detection results under different rotations.
The Venn diagram shows the logical relationship between sets of genes identified by SPARK-X8 across the original spatial pattern and patterns rotated by 30∘, 45∘, and 60∘ of mouse olfactory bulb (mOB)9 tissue replicate #8. Under the original pattern, 2321 genes are identified as SVGs by SPARK-X. However, this number drops sharply to 548 when the tissue is rotated by 60∘. Even more strikingly, only seven genes are consistently identified across the 30∘ and 45∘ rotations. Source data are provided as a Source Data file.
Similarly, SMASH10 modifies the SPARK-X framework by incorporating two additional alternatives, distance-based Gaussian and cosine kernels, to construct the spatial similarity matrix K. These alternatives are rotation-invariant, since they rely solely on relative distances. However, the inclusion of the original projection matrix, which is derived from kernel-transformed spatial coordinates, retains the flaw, making SMASH susceptible to rotation-induced inconsistencies in statistical inference.
Improper data pre-processing can cause rotation variance
In addition to inappropriate modeling approaches, certain preprocessing procedures can also introduce rotation-variance issues, possibly propagating through the analysis pipeline and impacting the reliability of SVG detection.
One such issue arises in methods that discretize spatial coordinates by dividing the tissue slice into a fixed grid structure along both axes. This type of preprocessing is employed by methods such as singlecellHaystack11 and SOMDE12, where spatial locations are assigned to grid cells before performing downstream statistical analyses.
The main issue is that the coverage of each grid cell over the tissue is not preserved under rotation. When the coordinate system alters, the boundaries of these predefined grid cells shift relative to the spatial locations of the observed data points, causing inconsistencies in how cells or spots are assigned to different spatial bins. As a result, spatial patterns that were initially captured within specific grid structures may become distorted, potentially leading to different statistical conclusions when the same dataset is analyzed under different orientations.
This rotation-induced variation can significantly affect the modeling and hypothesis-testing steps that follow. For instance, in singlecellHaystack11, spatial enrichment is assessed by comparing the density of observed spatial locations within grid cells against a null distribution. If the grid structure changes with rotation, the spatial density estimates may vary accordingly, leading to inconsistent enrichment results. Similarly, in SOMDE12, which applies self-organizing maps to discretized spatial domains before conducting variance decomposition, the altered grid structure can impact how spatial dependencies are learned, ultimately affecting SVG detection.
While grid-based preprocessing improves scalability, its sensitivity to rotation can introduce biases, leading to inconsistent SVG detection across different coordinate representations. This highlights the importance of adopting rotation-invariant preprocessing techniques to ensure robustness and reproducibility in SVG detection.
Another source of rotation variance comes from coordinate normalization procedures used in some methods. In particular, BSP13, a granularity-based approach for SVG detection, performs normalization of spatial coordinates based on spot density. Specifically, the coordinates along each axis are scaled by an estimated density, which is computed as the total number of spots divided by the sample area. However, the sample area in BSP is defined as the product Δs1Δs2, where \(\Delta {{{{\bf{s}}}}}_{j}=\max ({{{{\bf{s}}}}}_{j})-\min ({{{{\bf{s}}}}}_{j})\) for j = 1, 2, corresponding to the range of spatial coordinates along each axis. This definition assumes a rectangular bounding box aligned with the coordinate axes, which is sensitive to rotation. If the tissue sample is not approximately circular, a rotation of the coordinate system can significantly alter the values of Δs1 and Δs2, and hence the computed area and normalization factors. As a result, the normalized coordinates and the downstream SVG detection may vary with rotation.
Strategies to ensure rotation-invariant SVG detection
To address the three methodological misuses resulting in rotation variance discussed above, we provide several suggestions for designing SVG detection methods that are robust to rotation.
First, when fitting fixed-effect models to capture spatial variation, it is advisable to avoid using spatial coordinates or their transformations as covariates. A more robust strategy is to construct predictors based on relative distances, which are inherently invariant to rotation. For example, Niche-DE14 encodes spatial context by computing a distance-based, kernel-smoothed density for each anchor cell, effectively summarizing locally spatial information. Since this method relies exclusively on pairwise distances, the resulting fixed-effect modeling is rotation-invariant by design.
Second, when modeling spatial correlation through similarity kernels, only distance-based kernel functions should be used. When a distance-based kernel function, such as a distance-based Gaussian kernel, a distance-based cosine kernel, or a Matérn-class kernel, is used to define spatial similarity, the resulting statistical tests remain invariant under spatial coordinate transformations, including rotation (see the supplemental file for more details). This invariance arises because distance-based kernels depend solely on pairwise distances between spatial locations, which are preserved regardless of rotation. Methods like SPARK15 and nnSVG16 exemplify this approach by adopting distance-based kernels to model spatial correlations.
Third, to avoid the rotation sensitivity introduced by grid-based preprocessing steps, strategies relying on relative positioning should be considered. For example, aggregation schemes based on k-nearest neighbors offer a rotation-invariant alternative, as the neighborhood structure defined by pairwise distances remains unchanged under rotation. Such methods can preserve spatial resolution while maintaining geometric robustness.
Conclusion
Accurate identification of SVGs is crucial for understanding the spatial organization of gene expression. Our evaluation highlights a critical limitation in many existing SVG detection methods: their sensitivity to tissue positioning during sample preparation. Through the analysis of common methodological misuses, we demonstrated how inappropriate modeling of spatial coordinates can lead to unreliable and inconsistent SVG results. We show that leveraging relative distances provides a more robust framework for capturing spatial patterns while maintaining invariance to tissue rotations. Additionally, improper preprocessing steps can exacerbate rotation-induced variability, complicating the detection of true spatial expression patterns. Addressing the methodological pitfalls is essential to improving the reliability and interpretability of SVG detection. Thus, by adopting methodological refinements and incorporating rotation-invariant strategies, researchers can improve the accuracy and reproducibility of SVG detection, ultimately advancing our understanding of spatial gene expression and its biological significance.
Data availability
The mouse olfactory bulb and human breast cancer image data9 shown in Fig. 1 can be found at https://www.spatialresearch.org. Source data are provided with this paper.
References
Satija, R., Farrell, J. A., Gennert, D., Schier, A. F. & Regev, A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 33, 495–502 (2015).
Svensson, V., Teichmann, S. A. & Stegle, O. SpatialDE: identification of spatially variable genes. Nat. Methods 15, 343–346 (2018).
Yan, G., Hua, S. H. & Li, J. J. Categorization of 34 computational methods to detect spatially variable genes from spatially resolved transcriptomics data.Nat. Commun. 16, 1141 (2025).
Yu, J. & Luo, X. Identification of cell-type-specific spatially variable genes accounting for excess zeros. Bioinformatics 38, 4135–4144 (2022).
Cable, D. M. et al. Cell type-specific inference of differential expression in spatial transcriptomics. Nat. Methods 19, 1076–1087 (2022).
Yu, S. & Li, WeiVivian spVc for the detection and interpretation of spatial gene expression variation. Genome Biol. 25, 103 (2024).
Su, H., Wu, Y., Chen, B. & Cui, Y. STANCE: a unified statistical model to detect cell-type-specific spatially variable genes in spatial transcriptomics. Nat. Commun. 16, 1793 (2025).
Zhu, J., Sun, S. & Zhou, X. SPARK-X: non-parametric modeling enables scalable and robust detection of spatial expression patterns for large spatial transcriptomic studies. Genome Biol. 22, 184 (2021).
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Seal, S., Bitler, B. G. & Ghosh, D. SMASH: scalable method for analyzing spatial heterogeneity of genes in spatial transcriptomics data. PLOS Genet. 19, e1010983 (2023).
Vandenbon, A. & Diez, D. A clustering-independent method for finding differentially expressed genes in single-cell transcriptome data. Nat. Commun. 11, 4318 (2020).
Hao, M., Hua, K. & Zhang, X. SOMDE: a scalable method for identifying spatially variable genes with self-organizing map. Bioinformatics 37, 4392–4398 (2021).
Wang, J. et al. Dimension-agnostic and granularity-based spatially variable gene identification using bsp. Nat. Commun. 14, 7367 (2023).
Mason, K. et al. Niche-DE: niche-differential gene expression analysis in spatial transcriptomics data identifies context-dependent cell-cell interactions. Genome Biol. 25, 14 (2024).
Sun, S., Zhu, J. & Zhou, X. Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies. Nat. Methods 17, 193–200 (2020).
Weber, L. M., Saha, A., Datta, A., Hansen, K. D. & Hicks, S. C. nnSVG for the scalable identification of spatially variable genes using nearest-neighbor gaussian processes. Nat. Commun. 14, 4059 (2023).
Acknowledgements
The authors thank the reviewers for their insightful comments, which greatly improved the clarity and presentation of the work.
Author information
Authors and Affiliations
Contributions
Y.C. conceived the idea. H.S. and Y.C. designed the experiments. H.S. performed the analysis. H.S. and Y.C. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Su, H., Cui, Y. Rotation-invariance is essential for accurate detection of spatially variable genes in spatial transcriptomics. Nat Commun 16, 7122 (2025). https://doi.org/10.1038/s41467-025-62574-4
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-62574-4


