Identification of functional features underlying heat stress response in Sprague–Dawley rats using mixed linear models

Kotlarz, Krzysztof; Mielczarek, Magda; Wang, Yachun; Dou, Jinhuan; Suchocki, Tomasz; Szyda, Joanna

doi:10.1038/s41598-022-11701-y

Download PDF

Article
Open access
Published: 10 May 2022

Identification of functional features underlying heat stress response in Sprague–Dawley rats using mixed linear models

Krzysztof Kotlarz¹,
Magda Mielczarek^1,2,
Yachun Wang³,
Jinhuan Dou⁴,
Tomasz Suchocki^1,2 &
…
Joanna Szyda^1,2

Scientific Reports volume 12, Article number: 7671 (2022) Cite this article

2279 Accesses
6 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Since global temperature is expected to rise by 2 °C in 2050 heat stress may become the most severe environmental factor. In the study, we illustrate the application of mixed linear models for the analysis of whole transcriptome expression in livers and adrenal tissues of Sprague–Dawley rats obtained by a heat stress experiment. By applying those models, we considered four sources of variation in transcript expression, comprising transcripts (1), genes (2), Gene Ontology terms (3), and Reactome pathways (4) and focussed on accounting for the similarity within each source, which was expressed as a covariance matrix. Models based on transcripts or genes levels explained a larger proportion of log₂ fold change than models fitting the functional components of Gene Ontology terms or Reactome pathways. In the liver, among the most significant genes were PNKD and TRIP12. In the adrenal tissue, one transcript of the SUCO gene was expressed more strongly in the control group than in the heat-stress group. PLEC had two transcripts, which were significantly overexpressed in the heat-stress group. PER3 was significant only on gene level. Moving to the functional scale, five Gene Ontologies and one Reactome pathway were significant in the liver. They can be grouped into ontologies related to DNA repair, histone ubiquitination, the regulation of embryonic development and cytoplasmic translation. Linear mixed models are valuable tools for the analysis of high-throughput biological data. Their main advantages are the possibility to incorporate information on covariance between observations and circumventing the problem of multiple testing.

Unraveling the molecular mechanisms of paclitaxel in high-grade serous ovarian cancer through network pharmacology

Article Open access 12 May 2025

Effects of acute heat stress on protein expression and histone modification in the adrenal gland of male layer-type country chickens

Article Open access 22 March 2021

Heat shock transcription factor-mediated thermal tolerance and cell size plasticity in marine diatoms

Article Open access 10 April 2025

Introduction

The most general formulation of heat stress is defined as an increase in ambient temperature above the threshold beyond which body temperature cannot be maintained on the physiologically optimal level^1,2. Heat stress is among the best characterized environmental stressors and has the most severe detrimental effects. Individuals cope with heat stress by increasing body temperature, reducing feed intake as well as changing of physiological state, which is referred to as physiological adaptive responses³. Therefore, heat stress can lead to economic losses due to lower productivity and reproductivity, as well as increased health burden⁴. Unfortunately, global temperature is expected to rise by 2 °C in 2050⁵ and furthermore, the average temperature in 2100 may rise by up to 4.8 °C⁶. As a consequence, the maximum daily temperature will be higher, so the intensity and duration of heat stress will be more severe and prolonged, compared to 2021. Therefore, heat stress may become the most severe environmental factor.

Studies have reported that heat stress significantly altered physiological, biochemical, metabolic, and cellular responses in mice and livestock models^3,7,8. However, animals have developed protective measures against the challenges of heat stress manifested by altered transcript and gene expression levels^9,10,11. With the development of next generation sequencing technologies, including RNA-sequencing (RNA-seq), hundreds or thousands of genes involved in heat stress response have been identified^12,13. Consequently, the main goal of the research was to illustrate how linear mixed models can be used to perform a biologically driven analysis of changes in transcript and gene expression levels between the control and the heat-stress groups. In contrast to the vast majority of approaches, when particular transcripts are considered as independent, we incorporated biological information on the correlation between them directly into the statistical model. Furthermore, we also considered higher-order units, expressed by Gene Ontology terms and Reactome pathways, since these are the actual functional components of the physiological response towards heat stress.

The incorporation of functional units into the analysis of RNA-seq data is a known idea. It has been applied either through enrichment analysis as e.g. by Dou et al.¹⁴ to the data set analysed in our study or through the Ingenuity Pathway Analysis as e.g. by Lan et al.¹⁵ to gene expression under heat stress in poultry. Nevertheless, the major difference between our approach and the aforementioned studies is that the former conducts a two-step analysis, by first identifying significantly differentially expressed transcripts and genes by considering them independent (step 1) and then further analysing the selected, significant transcripts and genes in terms of the correlation between them or the enrichment of ontologies or pathways in this significant gene set (step 2). The approach proposed in our study attempted to combine the process within one-step by including the correlation of the functional information directly into the model fitted to the whole (i.e. not only to significant) expression data and is statistically similar to the linear mixed model fitted to gene expression data by Wang et al.¹⁶, albeit with a different approach to the modelling of the covariance of random effects.

In the current study, we illustrate the application of the mixed models using the whole genome transcript expression data of Sprague–Dawley rats obtained from a heat stress experiment¹⁴.

Methods

Experimental animals

The data set underlying the analysis is a subset of the material used by Dou et al.¹⁴. In brief, the analysed individuals comprised eight weeks old female, specific-pathogen-free, Sprague–Dawley rats. Prior to the experiment, all rats were housed in a laboratory at 22 ± 1 °C, 50% relative humidity with 12 h reverse light/dark cycle with feed and water provided ad libitum. After one week, five rats from the heat-stressed group were exposed to 42 °C for 120 min, while five rats from the control group were housed at an initial temperature of 22 °C. After the completion of the experiment, the animals were euthanised and samples of liver and adrenal gland tissues were used as a source of total RNA.

Bioinformatic analysis

Illumina HiSeq2000 was used to sequence 150 long reads in the paired-end (PE) data mode. The total number of read pairs per sample was from 51,706,978 to 97,059,004. The detailed description of the RNA isolation and sequencing was also provided by Dou et al.¹⁴. The bioinformatic pipeline consisted of the following steps: quality control of raw reads, editing of raw reads based on their quality, and quantification of transcripts’ expression. In particular, the quality of raw reads was assessed by applying the FastQC software¹⁷. Then, reads were processed by the Trimmomatic software¹⁸, which removed adapter sequences, trimmed reads with an average sequencing quality of 4 consecutive reads below 20 (SLIDINGWINDOW:4:20), and removed reads shorter than 60 bp (MINLEN:60). Next, the Salmon software¹⁹ was used to quantify the abundances of transcripts. This software implements the pseudoalignment process—an approach allowing for rapid identification of the compatibility of reads with transcripts, without the need of a computationally intensive whole genome alignment. In the last step, log₂ fold changes in transcript expression levels between the control and heat stressed groups were calculated using DESeq2²⁰.

Statistical modelling of expression data

The log₂ fold changes (log₂FC) calculated based on the transcript expression levels pooled over the control and heat-stressed animals respectively, were analysed in four mixed linear models.

The transcript-based model (M1) is given by:

$${\varvec{y}}={\varvec{\mu}}+{{\varvec{Z}}}_{M1}{\varvec{t}}+{{\varvec{e}}}_{M1},$$

(1)

where y is the vector of log₂FC of transcript expression, ${\varvec{\mu}}$ represents the general mean, ${\varvec{t}}$ is the random transcript effect with a predisposed normal distribution defined by $N\left(0,{{{\varvec{V}}}_{M1}\sigma }_{t}^{2}\right)$, ${{\varvec{e}}}_{M1}$ is a vector of residuals distributed as $N\left(0,{{\varvec{I}}\sigma }_{{e}_{M1}}^{2}\right)$, ${{\varvec{Z}}}_{M1}$ is an incidence matrix for ${\varvec{t}}$. In this model, the similarity between transcripts i and j, was introduced into the model by incorporating a nondiagonal transcript covariance matrix ${{\varvec{V}}}_{M1}$. The covariance between transcripts was expressed by the Jaccard similarity coefficient: $J\left(A,B\right)=\frac{\left|A\cap B\right|}{\left|A\cup B\right|},$ which in the case of transcripts was calculated based on the similarity in their exon composition:

$$J\left(i,j\right)=\frac{a}{N},$$

where a represents the number of exons common between transcripts i and j, while N represents the total number of exons of a given gene. Independence was assumed between genes, so the resulting matrix had a block diagonal structure. Transcript information was obtained from the Ensembl database, release100²¹ and Jaccard coefficients were calculated using the ADE-4 package²².

The gene-based model (M2) was applied to the same dependent variable (${\varvec{y}}$) as M1 and is given by:

$${\varvec{y}}={\varvec{\mu}}+{{\varvec{Z}}}_{M2}{\varvec{g}}+{{\varvec{e}}}_{M2},$$

(2)

where ${\varvec{g}}$ is the random gene effect with a preimposed normal distribution defined by $N\left(0,{{{\varvec{V}}}_{M2}\sigma }_{g}^{2}\right)$, ${{\varvec{e}}}_{M2}$ is a vector of random residuals distributed as $N\left(0,{{\varvec{I}}\sigma }_{{e}_{M2}}^{2}\right)$, ${{\varvec{Z}}}_{M2}$ is an incidence matrix for ${\varvec{g}}$. No covariance between genes was assumed so ${{\varvec{V}}}_{M2}$ was diagonal.

Furthermore, the Gene Ontology-based model (M3):

$${\varvec{y}}={\varvec{\mu}}+{{\varvec{Z}}}_{M3}{\varvec{g}}{\varvec{o}}+{{\varvec{e}}}_{M3},$$

(3)

where ${\varvec{g}}{\varvec{o}}$ is the random effect of Gene Ontology (GO; geneontology.org) terms assigned to transcripts whose log₂FC of transcript expressions are contained in ${\varvec{y}}$. It was assumed that ${\varvec{g}}{\varvec{o}}$ follows the normal distribution $N\left(0,{{{\varvec{V}}}_{M3}\sigma }_{go}^{2}\right)$, ${{\varvec{e}}}_{M3}$ is a vector of residuals distributed as $N\left(0,{{\varvec{I}}\sigma }_{{e}_{M3}}^{2}\right)$, and ${{\varvec{Z}}}_{M3}$ is an incidence matrix for the ${\varvec{g}}{\varvec{o}}$ terms. ${{\varvec{V}}}_{M3}$ describes the covariance between GO terms expressed, as above, by the Jaccard coefficients quantifying genes overlapping between two given GO terms. Each transcript was assigned GO term(s) from the biological process ontology, considering the ontologies from the 2nd hierarchy level.

Finally, a model incorporating the effects of Reactome pathways (M4) was fitted to transcript log₂FC:

$${\varvec{y}}={\varvec{\mu}}+{{\varvec{Z}}}_{M4}{\varvec{r}}+{{\varvec{e}}}_{M4},$$

(4)

in this model, ${\varvec{r}}$ represents the random effect of Reactome pathways (reactome.org) corresponding to transcripts. The normal distribution pre-imposed on ${\varvec{r}}$ is given by $N\left(0,{{{\varvec{V}}}_{M4}\sigma }_{r}^{2}\right)$, ${{\varvec{e}}}_{M4}$ is a vector of residuals distributed as $N\left(0,{{\varvec{I}}\sigma }_{{e}_{M4}}^{2}\right)$, and ${{\varvec{Z}}}_{M4}$ is an incidence matrix assigning Reactome pathways to transcripts. The covariance matrix between Reactome pathways (${{\varvec{V}}}_{M4}$) was also expressed by the Jaccard coefficients quantifying genes overlapping between the two given pathways.

Estimation of variance components and significance testing

The expectation–maximization algorithm²³ was applied for the estimation of variance components underlying the four above models (i.e. ${\sigma }_{t}^{2},{\sigma }_{g}^{2},{\sigma }_{go}^{2},{\sigma }_{r}^{2},{\sigma }_{{e}_{M1}}^{2},{\sigma }_{{e}_{M2}}^{2},{\sigma }_{{e}_{M3}}^{2}$ and ${\sigma }_{{e}_{M4}}^{2}$) and the mixed model equations²⁴ were used to obtain solutions of ${\varvec{\mu}},{\varvec{t}},{\varvec{g}},{\varvec{g}}{\varvec{o}}$ and ${\varvec{r}}$:

$$\left[\begin{array}{c}\widehat{{\varvec{\mu}}}\\ \widehat{{\varvec{x}}}\end{array}\right]={\left[\begin{array}{cc}{\varvec 1}^{T}{{\varvec{R}}}^{ -1}\varvec1& {\varvec 1}^{T}{{\varvec{R}}}^{-1}{{\varvec{Z}}}_{x}\\ {{\varvec{Z}}}_{x}^{T}{{\varvec{R}}}^{-1}\varvec 1& {{\varvec{Z}}}_{x}^{T}{{\varvec{R}}}^{-1}{{\varvec{Z}}}_{x}+{{\varvec{G}}}^{-1}\end{array}\right]}^{-1}\left[\begin{array}{c}{\varvec 1}^{T}{{\varvec{R}}}^{-1}{\varvec{y}}\\ {{\varvec{Z}}}_{x}^{T}{{\varvec{R}}}^{-1}{\varvec{y}}\end{array}\right],\mathrm{ where }\,{\varvec{R}}={\varvec{I}}{\widehat{\sigma }}_{{e}_{Mi}}^{2}\mathrm{ and }{\varvec{G}}={{\varvec{V}}}_{{\varvec{x}}}{\widehat{\sigma }}_{x}^{2},$$

with ${\varvec{x}}$ representing ${\varvec{t}},{\varvec{g}},{\varvec{g}}{\varvec{o}}$ or ${\varvec{r}}$ depending on the model, and $i\in \left\{M1, M2, M3, M4\right\}$ being a model indicator. Each element of the solution vectors $\widehat{{\varvec{t}}},\widehat{{\varvec{g}}},\widehat{{\varvec{g}}{\varvec{o}}}$ and $\widehat{{\varvec{r}}}$ was transformed to the standard normal distribution and tested for significance by assessing the probability of obtaining a more extreme value based on the standard normal density function.

Enhancement of the computational efficiency of the estimation

In order to maximise the computational performance of the estimation of model parameters and its variance components, a custom-written Python program implementing the Numba library²⁵ was used. Numba compiles a subset of native Python and NumPy code into the machine code. Since all calculations were carried out on a multicore server, the Numba library was also used to parallelize the code, what further improved the computing time compared to a native Python application.

Ethics approval

Not applicable, only in silico data was processed in this study.

Results

Obviously models M1 and M2, fitting thousands of components, i.e. transcripts (M1) or genes (M2) explain a much larger proportion of the observed variability of log₂FC than the M3 and M4 fitting the functional components, i.e. GO terms (M3) or Reactome pathways (M4). Still, by taking into account the correlation between transcripts expressed by their exon composition, we explain 12.43% and 18.32% of the total variability of log₂FC observed in liver and adrenal tissues, respectively. A similar picture arises when the assumed source of variability of log₂FC is considered on a gene level, resulting in 10.12% and 16.45% variance components in liver and adrenal tissues respectively—somewhat lower than for correlated transcripts in M1. By further shrinking the functional units to GO terms and Reactome pathway units we explain “only” 1.48% (GO) and 1.45% (Reactome) of the total variance for liver, as well as 1.44% (GO) and 1.93% (Reactome) for the adrenal tissue (Table 1).

Table 1 Variance components estimated by the mixed models (M1–M4) expressed as the percentage of the total variance of y.

Full size table

Considering the transcript and gene levels, in the liver, M1 and M2 point at the significant effects of the PNKD and TRIP12 genes. PNKD exhibits a significant effect of a transcript ENSRNOT00000046229, which is approximately 2²³ times higher expressed in the control than in the heat-stress group. However, three of the five transcripts of the gene show higher expression in the heat-stress group, reaching 2⁵ higher expression of ENSRNOT00000089580. TRIP12 synthesises one particular transcript (ENSRNOT00000022822) which is 2²² times higher expressed in the control than in the heat-stress group. Different genes were significantly differentially expressed in the adrenal tissue. One transcript of the SUCO gene (ENSRNOG00000026542) is 2²³ times higher expressed in the control than in the heat-stress group, while the three other transcripts synthesised from this gene have an opposite effect, i.e. show higher expression in heat-stressed individuals. PLEC has two transcripts which are significantly overexpressed in the heat-stress group. PER3 was significant only on the gene level (M2) with all of its transcripts being under expressed during heat stress (Table 2).

Table 2 Top 5 significant transcripts from M1, top 2 significant genes from M2, significant Gene Ontology terms from M3, and significant Reactome pathways from M4.

Full size table

While moving to a functional level, we did not identify any significant GO terms differentially expressed in the adrenal tissue and a single Generic Transcription Reactome Pathway (R-RNO-212436). However, in the liver, five ontologies and one Reactome pathway were significant. Functionally, they can be grouped into ontologies related to DNA repair through the regulation of double-strand break repair (GO:2000780 and GO:2000779) and histone ubiquitination (GO:1901315), to the regulation of embryonic development (GO:0045995) and to cytoplasmic translation (GO:0002181, R-RNO-6791226).

Discussion

Various scopes of information considered in our study explain different amounts of the observed variation of transcript differential expressions across the genome (Table 1). The difference in the estimated variance components related to transcripts, genes, GO terms and Reactome pathways associates with the number of levels of those independent variables. Since we are not aware of the corresponding estimates reported by other studies, a discussion with the literature is not possible. Therefore, at this stage, we can only hypothesise that on the one hand, the differences in variance components can be due to the technical nature of the model, since many more transcript effects were estimated as Reactome pathway effects. On the other hand, those effects were modelled as random and thus their variation was constrained by the predefined co-variance structure so that the differences in variance components express the biological phenomenon—Reactome pathways aggregate effects of many transcript/genes.

By considering the significant transcripts, genes and pathways identified in our study, a consistent picture emerges that DNA stability and its repair mechanisms are affected by heat stress. This phenomenon has already been reported by other authors, e.g.²⁶ and was reviewed by Kantidze et al.²⁷. On a gene level, TRIP12 was reported as encoding a protein which is associated with chromatin and plays a role in the maintenance of genome integrity²⁸ and PER3 influences DNA damage repair by correlating with checkpoint kinase 2 gene²⁹. Mutations in the SUCO gene induce hypoglycaemia [www.ensembl.org] in humans, which is the cause of hypothermia in diabetic patients³⁰. PER3 was differentially expressed in broiler chicken maintained under control and heat stress conditions³¹. In general, it has been proven that PER3 is associated with behavioural differences towards stress and was influenced by stress and ethanol treatment in BXD strain mice³².

On the functional level, we observed a significantly under expression of genes with ontologies related to double-strand break repair (GO:2000780 and GO:2000779) and histone ubiquitination. Furthermore, outside of the nucleus, a significant effect of the cytoplasmic translation ontology (GO:0002181) and rRNA processing in the nucleolus and cytosol pathway (R-RNO-6791226) can be linked to the phenomenon of the aggregation of proteins in the cytoplasm of yeast cells subjected to heat stress, which consequence was impaired cytoplasmic translation³³.

Noteworthy, our list of significant transcripts, genes, GO terms, and Reactome pathways did not overlap with the results reported for the same material by Dou et al.¹². Only the PER3 gene was significantly associated with differential expression in adrenal tissue in both studies. The most emerging difference concerned the number of significant effects reported. While Dou et al.¹² estimated 3909 and 4953 significantly differentially expressed genes for liver and adrenal tissues respectively, our study pointed at only two significant genes for each tissue. Similarly, the number of significant GO terms was 193 for liver and 79 for adrenal tissue in Dou et al.¹², while only five and zero in our results. The observed differences are caused by the following factors: different raw reads editing criteria, different approaches to estimate transcript expression, and different statistical modelling of expression data. Dou et al.¹² used the Cufflinks software³⁴ for bioinformatic processing of the expression data, which implements Tophat2³⁵ for the alignment of reads to the reference genome and a single gene hypothesis test with multiple testing correction of P values via the FDR for the assessment of differential expression. In our analysis, an alignment-free approach and a random effect model incorporating all transcripts/genes simultaneously were applied. Already ³⁶ pointed at differences in the statistical inference based on fixed and random effect models, indicating that the former tend to underestimate, and the latter—overestimate a residual variance. As a consequence, a type I-error is often elevated in fixed-effect models, which was analytically demonstrated by ³⁷ in the context of meta-analysis. Moreover, significant differences in the quantification of expression between alignment-based and alignment-free approaches were recently demonstrated in our unpublished analysis of Sus scrofa RNA-seq data by Hoffman et al.

Conclusions

Mixed models, i.e. statistical models fitting random effects, are a valuable tool for the analysis of high-throughput biological data. Their major advantages comprise: (1) the possibility to incorporate information on covariance between observations, which is often neglected while applying simple, fixed effect models, and (2) circumventing the problem of multiple testing, by simultaneous fitting all effects. We see the major limitation of the proposed approach in the varying quality of functional annotation of genomes available for different species. While genomes of humans and experimental species, such as rats, are very well functionally annotated with the most of transcripts/genes assigned to GO terms and metabolic pathways, less well-studied species have less complete annotation, which would enforce the incorporation of phantom GO and phantom pathway effects (similarly to phantom parent groups in livestock genetic evaluation).

From the biological perspective, PER3 and SUCO genes as well as DNA repair and translation were indicated as factors playing a significant role in heat stress response.

Data availability

The datasets generated and/or analysed during the current study are not publicly available due to institutional constrains, but are available from the Yachun Wang on reasonable request.

Code availability

Python code is available in principle upon request from the corresponding author.

References

Berman, A. et al. Upper critical temperatures and forced ventilation effects for high-yielding dairy cows in a subtropical climate. J. Dairy Sci. 68(6), 1488–1495. https://doi.org/10.3168/jds.S0022-0302(85)80987-5 (1985).
Article CAS PubMed Google Scholar
Kadzere, C. T., Murphy, M. R., Silanikove, N. & Maltz, E. Heat stress in lactating dairy cows: A review. Livest Prod. Sci. 77(1), 59–91. https://doi.org/10.1016/S0301-6226(01)00330-X (2002).
Article Google Scholar
Gonzalez-Rivas, P. A. et al. Effects of heat stress on animal physiology, metabolism, and meat quality: A review. Meat Sci. https://doi.org/10.1016/j.meatsci.2019.108025 (2020).
Article PubMed Google Scholar
St-Pierre, N. R., Cobanov, B. & Schnitkey, G. Economic losses from heat stress by US livestock industries1. J. Dairy Sci. https://doi.org/10.3168/jds.S0022-0302(03)74040-5 (2003).
Article PubMed Google Scholar
Trnka, M., Olesen, J. E., Kersebaum, K. C., et al. Agroclimatic Conditions in Europe under Climate Change. Vol 17; 2011. https://doi.org/10.1111/j.1365-2486.2011.02396.x.
Barros, V. R., Field, C. B., Dokken, D. J., et al. Climate Change 2014 Impacts, Adaptation, and Vulnerability Part B: Regional Aspects: Working Group Ii Contribution to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change (2014).
Hu, Y. et al. Effects of chronic heat stress on immune responses of the foot-and-mouth disease DNA vaccination. DNA Cell Biol. 26(8), 619–626. https://doi.org/10.1089/dna.2007.0581 (2007).
Article CAS PubMed Google Scholar
Lu, Q., Wen, J. & Zhang, H. Effect of chronic heat exposure on fat deposition and meat quality in two genetic types of chicken. Poult. Sci. 86(6), 1059–1064. https://doi.org/10.1093/ps/86.6.1059 (2007).
Article CAS PubMed Google Scholar
Sonna, L. A., Fujita, J., Gaffin, S. L. & Lilly, C. M. Invited review: Effects of heat and cold stress on mammalian gene expression. J. Appl. Physiol. 92(4), 1725–1742. https://doi.org/10.1152/japplphysiol.01143.2001 (2002).
Article CAS PubMed Google Scholar
He, Y., Maltecca, C., Tiezzi, F., Soto, E. L. & Flowers, W. L. Transcriptome analysis identifies genes and co-expression networks underlying heat tolerance in pigs. BMC Genet. 21, 1. https://doi.org/10.1186/s12863-020-00852-4 (2020).
Article CAS Google Scholar
Stallings, J. D. et al. Patterns of gene expression associated with recovery and injury in heat-stressed rats. BMC Genom. 15(1), 1058. https://doi.org/10.1186/1471-2164-15-1058 (2014).
Article CAS Google Scholar
Dou, J. et al. Comprehensive RNA-Seq profiling reveals temporal and tissue-specific changes in gene expression in Sprague-Dawley rats as response to heat stress challenges. Front Genet. https://doi.org/10.3389/fgene.2021.651979 (2021).
Article PubMed PubMed Central Google Scholar
Wang, L. C. et al. Transcriptome profiling of the fifth-stage larvae of Angiostrongylus cantonensis by next-generation sequencing. Parasitol. Res. 112(9), 3193–3202. https://doi.org/10.1007/s00436-013-3495-z (2013).
Article PubMed PubMed Central Google Scholar
Dou, J. et al. Heat stress impairs the physiological responses and regulates genes coding for extracellular exosomal proteins in rat. Genes (Basel) 11(3), 306. https://doi.org/10.3390/genes11030306 (2020).
Article CAS Google Scholar
Lan, X., Hsieh, J. C. F., Schmidt, C. J., Zhu, Q. & Lamont, S. J. Liver transcriptome response to hyperthermic stress in three distinct chicken lines. BMC Genom. 17, 1. https://doi.org/10.1186/s12864-016-3291-0 (2016).
Article CAS Google Scholar
Wang, T. & Zeng, Z. Contribution of genetic effects to genetic variance components with epistasis and linkage disequilibrium. BMC Genet. https://doi.org/10.1186/1471-2156-10-52 (2009).
Article PubMed PubMed Central Google Scholar
Andrews S, others. FastQC: A quality control tool for high throughput sequence data. 2010. http://www.bioinformatics.babraham.ac.uk/projects/. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30(15), 2114–2120. https://doi.org/10.1093/bioinformatics/btu170 (2014).
Article CAS PubMed PubMed Central Google Scholar
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 4. https://doi.org/10.1038/nmeth.4197 (2017).
Article CAS Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 12. https://doi.org/10.1186/s13059-014-0550-8 (2014).
Article CAS Google Scholar
Yates, A. D. et al. Ensembl 2020. Nucleic Acids Res. 48(D1), D682–D688. https://doi.org/10.1093/nar/gkz966 (2020).
Article CAS PubMed Google Scholar
Thioulouse, J., Chessel, D., Dolédec, S. & Olivier, J. M. ADE-4: A multivariate analysis and graphical display software. Stat. Comput. 7(1), 75–83. https://doi.org/10.1023/A:1018513530268 (1997).
Article Google Scholar
Dempster, A. P., Laird, N. M. & Rubin, D. B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B 39(1), 1–22. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x (1977).
Article MathSciNet MATH Google Scholar
Henderson, C. Applications of Linear Models in Animal Breeding (CABI, 1984).
Google Scholar
Lam, S. K., Pitrou, A., & Seibert, S. Numba. In: Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC-LLVM ’15. ACM Press; 2015:1–6. https://doi.org/10.1145/2833157.2833162.
Takahashi, A. et al. Evidence for the involvement of double-strand breaks in heat-induced cell killing. Cancer Res. 64(24), 8839–8845. https://doi.org/10.1158/0008-5472.CAN-04-1876 (2004).
Article CAS PubMed Google Scholar
Kantidze, O. L., Velichko, A. K., Luzhin, A. V. & Razin, S. V. Heat stress-induced DNA damage. Acta Nat. 8(2), 75–78. https://doi.org/10.32607/20758251-2016-8-2-75-78 (2016).
Article CAS Google Scholar
Larrieu, D. et al. The E3 ubiquitin ligase TRIP12 participates in cell cycle progression and chromosome stability. Sci. Rep. 10, 1. https://doi.org/10.1038/s41598-020-57762-9 (2020).
Article CAS Google Scholar
Im, J. S., Jung, B. H., Kim, S. E., Lee, K. H. & Lee, J. K. Per3, a circadian gene, is required for Chk2 activation in human cells. FEBS Lett. 584(23), 4731–4734. https://doi.org/10.1016/j.febslet.2010.11.003 (2010).
Article CAS PubMed Google Scholar
Tran, C. et al. Hypothermia is a frequent sign of severe hypoglycaemia in patients with diabetes. Diabetes Metab. 38(4), 370–372. https://doi.org/10.1016/j.diabet.2012.03.005 (2012).
Article CAS PubMed Google Scholar
Jastrebski, S. F., Lamont, S. J. & Schmidt, C. J. Chicken hepatic response to chronic heat stress using integrated transcriptome and metabolome analysis. PLoS One 12(7), e0181900. https://doi.org/10.1371/journal.pone.0181900 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, X. et al. A promoter polymorphism in the Per3 gene is associated with alcohol and stress response. Transl. Psychiatry 2, 1. https://doi.org/10.1038/tp.2011.71 (2012).
Article Google Scholar
Gallardo, P., Real-Calderón, P., Flor-Parra, I., Salas-Pino, S. & Daga, R. R. Acute heat stress leads to reversible aggregation of nuclear proteins into nucleolar rings in fission yeast. Cell Rep. 33(6), 108377. https://doi.org/10.1016/j.celrep.2020.108377 (2020).
Article CAS PubMed Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28(5), 511–515. https://doi.org/10.1038/nbt.1621 (2010).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: Discovering splice junctions with RNA-Seq. Bioinformatics 25(9), 1105–1111. https://doi.org/10.1093/bioinformatics/btp120 (2009).
Article CAS PubMed PubMed Central Google Scholar
Overton, R. C. A comparison of fixed-effects and mixed (random-effects) models for meta-analysis tests of moderator variable effects. Psychol. Methods 3(3), 354–379. https://doi.org/10.1037/1082-989X.3.3.354 (1998).
Article MathSciNet Google Scholar
Hunter, J. E. & Schmidt, F. L. Fixed effects vs random effects meta-analysis models: Implications for cumulative research knowledge. Int. J. Sel. Assess. 8(4), 275–292. https://doi.org/10.1111/1468-2389.00156 (2000).
Article Google Scholar

Download references

Acknowledgements

Computations were carried at the Wroclaw Centre for Networking and Supercomputing (mixed models) and the Poznan Supercomputing and Networking Center (bioinformatic pipelines).

Funding

The research was funded by the Polish National Science Foundation (NCN) under grant number 2019/35/O/NZ9/00237 and supported by the tutoring programme of the Wroclaw University of Environmental and Life Sciences. The APC is financed by the Wroclaw University of Environmental and Life Sciences.

Author information

Authors and Affiliations

Biostatistics Group, Department of Genetics, Wroclaw University of Environmental and Life Sciences, Kozuchowska 7, 51-631, Wroclaw, Poland
Krzysztof Kotlarz, Magda Mielczarek, Tomasz Suchocki & Joanna Szyda
National Research Institute of Animal Production, Krakowska 1, 32-083, Balice, Poland
Magda Mielczarek, Tomasz Suchocki & Joanna Szyda
Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
Yachun Wang
College of Animal Science and Technology, Beijing University of Agriculture, Beijing, 102206, China
Jinhuan Dou

Authors

Krzysztof Kotlarz
View author publications
Search author on:PubMed Google Scholar
Magda Mielczarek
View author publications
Search author on:PubMed Google Scholar
Yachun Wang
View author publications
Search author on:PubMed Google Scholar
Jinhuan Dou
View author publications
Search author on:PubMed Google Scholar
Tomasz Suchocki
View author publications
Search author on:PubMed Google Scholar
Joanna Szyda
View author publications
Search author on:PubMed Google Scholar

Contributions

Y.W., and J.S., designed the research and interpreted the data. K.K., M.M., J.D., and T.S., and performed data analysis. J.S. drafted the manuscript. All authors contributed to the writing of the manuscript, reviewed and approved its final version.

Corresponding author

Correspondence to Joanna Szyda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kotlarz, K., Mielczarek, M., Wang, Y. et al. Identification of functional features underlying heat stress response in Sprague–Dawley rats using mixed linear models. Sci Rep 12, 7671 (2022). https://doi.org/10.1038/s41598-022-11701-y

Download citation

Received: 08 September 2021
Accepted: 25 April 2022
Published: 10 May 2022
Version of record: 10 May 2022
DOI: https://doi.org/10.1038/s41598-022-11701-y

This article is cited by

Multi-trait GWAS for growth under contrasting thermal rearing conditions in rainbow trout (Oncorhynchus mykiss)
- Jousepth Gallardo-Hidalgo
- David A. Tapia
- José M. Yáñez
Molecular Genetics and Genomics (2025)
Proteomic identification of potential biomarkers for heat tolerance in Caracu beef cattle using high and low thermotolerant groups
- Ana Claudia de Freitas
- Henrique G. Reolon
- Nedenia B. Stafuzza
BMC Genomics (2024)