Reference genome and population genomic analyses reveal insight into herbicide tolerance in Avena fatua L.

Liu, Jianan; Liu, Ningkun; Yan, Wei; Hu, Xiaoying; Wang, Meijia; Qin, Ruidong; Ma, Lin; Li, Wei; Wang, Yu; Sun, Qingbin; Xing, Longsheng; Yun, Yange; Li, Yingying; Yan, Jinjiang; Wang, Xuemin; Gong, Zhizhong; Wang, Hongru; Du, Huilong

doi:10.1038/s41467-025-64825-w

Download PDF

Article
Open access
Published: 07 November 2025

Reference genome and population genomic analyses reveal insight into herbicide tolerance in Avena fatua L.

Nature Communications volume 16, Article number: 9851 (2025) Cite this article

7246 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Avena fatua (weedy oat) is a globally pervasive weed, notorious for its adaptability to extreme environments and herbicide tolerance. Here, we present the 10.98-Gb hexaploid genome of A. fatua and a variation map from 768 wild and cultivated oats (A. fatua, A. sterilis and A. sativa), elucidating their genetic relationship and evolutionary history. Population genomic analyses reveal genetic connections between A. fatua and cultivated naked oats, and identify divergent regions between A. fatua and A. sativa enriched in genes associated with biotic and abiotic stresses. Among them, a herbicide-resistance locus is identified on chromosome 4D, with A. fatua accessions carrying a highly differentiated haplotype from cultivated oats. Multi-omics profiling and functional validation demonstrate that one expanded GST gene in this locus contributes significantly to oat herbicide resistance. Our study provides genomic resources for understanding A. fatua’s broad adaptability to diverse environmental conditions, facilitating the development of climate-resilient oat varieties.

The near-complete genome assembly of hexaploid wild oat reveals its genome evolution and divergence with cultivated oats

Article 03 December 2024

The mosaic oat genome gives insights into a uniquely healthy cereal crop

Article Open access 18 May 2022

Global genomic population structure of wild and cultivated oat reveals signatures of chromosome rearrangements

Article Open access 29 October 2025

Introduction

The genus Avena contains approximately 30 recognized oat species, including diploids, tetraploids and hexaploids^1,2,3. Among them, the common oat (Avena sativa L., 2n = 6x = 42, AACCDD), a globally important cereal crop, was domesticated from the wild hexaploid Avena sterilis L. (2n = 6x = 42, AACCDD) more than 3000 years ago⁴. Closely related to both is Avena fatua L. (2n = 6x = 42, AACCDD), a weedy hexaploid oat that frequently invades agricultural fields alongside A. sativa. Unlike typical wild oats, A. fatua lacks a distinct natural geographic range and is predominantly associated with human-disturbed habitats. Its taxonomic status remains contentious: Ladizhinski and Zohary (1971) argued that A. fatua should be classified as a morphological type of a single biological species encompassing A. sativa and A. sterilis, given the absence of reproductive isolation^5,6. However, this view is not universally accepted⁷, as many researchers maintain A. fatua as a separate species due to its unique ecological impact and invasive behavior.

Regardless of its taxonomic placement, A. fatua is well-known as one of the worst annual weeds worldwide, causing enormous problems in agriculture and its infection keeps rapidly increasing^8,9 (Fig. 1a, b). Depending on plant density and relative time of emergence, A. fatua can reduce annual crop yields by as much as 70%¹⁰. In western Canada, over $500 million are expended on herbicides to control A. fatua annually¹⁰, and A. fatua also infests over 11 million ha of cropland in the Great Plains and Pacific Northwest regions causing over $1 billion in annual crop losses¹⁰. Despite its significant damage as a weed, the weedy oat (A. fatua) harbors valuable alleles that can be utilized to enhance disease resistance and stresses resistance in cultivated oats, such as resistance to crown rust and powdery mildew^8,11,12. One successful example is the cultivar “Bayou18”¹³, which exhibits strong resistance to disease, drought, and barrenness, along with high yield and quality. This cultivar was developed through interspecific hybridization between A. fatua and the cultivated varieties “Jizhangyou4” and “Bayou9”. Therefore, genomic studies of A. fatua have the potential to further our understanding of its evolution and provide a valuable resource for the genetic improvement of cultivated oats.

Fig. 1: Phenotypes of Avena fatua and overview of the A. fatua accession W1004 assembly. — **Fig. 1: Phenotypes of *Avena fatua* and overview of the *A. fatua* accession W1004 assembly.**

As a cosmopolitan grass weed, A. fatua can be well adapted to changing environmental conditions, and grow not only in various climatic zones from tropics to polar circle, but also on nearly all types of soil⁸ (Fig. 1b; Supplementary Fig. 1a, b). A. fatua also exhibits strong phenotypic plasticity in morphological characteristics, including plant height, leaf size, tiller number and length, in response to the changes in photoperiod and temperature under controlled environment conditions¹⁴. In addition to the excellent adaptation to diverse extreme environments, the consistently high abundance of A. fatua in cropland may also be assisted by its seed shattering and relatively long persistence in the soil seed bank^15,16 (Supplementary Fig. 1b). A. fatua exhibits strong competitive ability, high resistance to various abiotic and biotic stresses as well as herbicide tolerance^10,17,18 (Supplementary Fig. 1c), making it become a good model for understanding plant adaptation to diverse environments and the genomic basis of A. fatua’s success.

Herbicide resistance in weeds is a global problem threatening crop production^19,20. The continuous and rapid increase in herbicide resistance among weeds, driven by intensive herbicide use, has caused substantial crop yield losses and increased production costs²¹. Many herbicide-resistant weed species are polyploids²², such as the hexaploid wild oat and Barnyard grass, but little is known about its molecular evolution in polyploids. Previous studies have demonstrated that metabolic tolerance often involves multiple detoxification enzyme families^23,24, such as glutathione S-transferases (GSTs), which can conjugate glutathione to herbicide molecule, rendering them nontoxic²⁵. Numerous studies have provided evidence for the important roles of enhanced GST activity or increased GST expression in herbicide resistance mechanisms^26,27,28,29. Although numerous enzymes and related mechanisms associated with herbicide-resistance have been identified and elucidated in weeds, the availability of genome sequences could accelerate research on these processes. Due to the relatively limited genetic information, few enzymes and related mechanisms associated with herbicide-resistance have been identified and elucidated in A. fauta^30,31,32. Exploring herbicide-tolerance mechanisms and identifying the key loci responsible for strong herbicide resistance in A. fatua could aid in the development of new herbicides and the breeding of cultivated oats with strong herbicide resistance.

Here, we report a near-complete genome assembly for A. fatua (W1004) through the integration of HiFi, Nanopore, and Hi-C sequencing techniques. Additionally, we construct a comprehensive genomic variation map consisting of 768 wild and cultivated oats worldwide (288 cultivated oats from A. sativa, 443 wild oats from A. fatua, and 37 previously released³³ wild oats data from A. sterilis). Population genetic analysis reveals the evolutionary history of A. sterilis, A. fatua and A. sativa. It also aids in the identification of highly divergent genomic regions between A. fatua and A. sativa, and the genes in these regions are largely associated with responses to diverse extreme environments. One highly divergent genomic locus on chromosome 4D with expanded GST genes in A. fatua is identified. By combining transcriptome and chromatin accessibility profiling with functional validation using knock-down and over-expression transgenic lines in oat, we demonstrate the crucial role of a recently expanded cluster of GST genes in conferring strong herbicide resistance in A. fatua. Our study provides resources for elucidating oat genome evolution and understanding the genetic basis of A. fatua’s adaptation to diverse and extreme environmental conditions.

Results

High-quality genome assembly, annotation and comparative genomics of A. fatua

To uncover the genome architecture of A. fatua, we obtained 306.43 Gb of Pacbio HiFi long reads with a N50 length of 19.61 kb, 317.73 Gb ONT ultra-long reads with N50 of 74.51 kb and 196.64 Gb of high-throughput chromosome conformation capture (Hi-C) data (Supplementary Table 1). The HiFi reads and ONT ultra-long reads were used to construct a preliminary assembly comprising 145 contigs with the contig N50 of 473.48 Mb. Utilizing Hi-C data, the assembled contigs were further anchored onto 21 pseudochromosomes (Supplementary Fig. 2; Supplementary Fig. 3a). As a result, we achieved a final assembly of 10.98 Gb with a contig N50 length of 473.48 Mb. More than 99.79% of sequences were anchored to the 21 chromosomes with only 22 gaps, and among them, 10 chromosomes were gap-free (Supplementary Fig. 2; Table 1; Supplementary Table 2-4). These chromosomes were further assigned to A, C, and D subgenomes based on synteny with previously published hexaploid oat genomes³, and subgenome-specific k-mers³⁴ (Supplementary Fig. 3b, c).

Table 1 Statistics of genome assembly and annotation for A. fauta accession W1004

Full size table

This highly contiguous assembly enabled us to identify all telomere regions at 42 chromosomal ends (Supplementary Table 5). The Benchmarking Universal Single-Copy Orthologs (BUSCO) evaluation with the embryophyta_odb10 database demonstrated high genome completeness, with 98.50% of conserved orthologs successfully detected under the genome mode (Table 1; Supplementary Fig. 3d). To further evaluate its quality, we remapped all primary sequencing data to the A. fatua (W1004) genome assembly, achieving mapping rates of 99.94% for HiFi reads, 99.97% for ONT reads and 99.94% for Hi-C reads. Moreover, the consensus Quality Value (QV) of k-mer-based estimates reached 72.42, exceeding all of the published oat genomes (Supplementary Fig. 3e). These metrics showed that we have obtained a high-quality and near-complete A. fatua (W1004) genome assembly (Supplementary Fig. 2).

For genome annotation, we first characterized repetitive sequences, which comprise 9.70 Gb (88.38%) of the assembled genome (Table 1), consistent with other oat genomes. The majority (87.26-89.64%) of these repeats were transposons distributed across all three subgenomes (A: 87.83%, 3.02 Gb; C: 89.64%, 3.78 Gb; D: 87.26%, 2.85 Gb; Supplementary Table 6). Retrotransposons dominated the transposable element landscape, with Gypsy-type LTR retrotransposons representing the most abundant class (35.17% of total transposon length; Supplementary Fig. 3f). For protein-coding gene annotation, we integrated evidence from five tissue-specific RNA-seq datasets with ab initio predictions and homology-based searches. This comprehensive approach identified 135,470 high-confidence protein-coding genes (Table 1; Supplementary Table 7), establishing A. fatua (W1004) as having one of the most extensively annotated oat genomes. We also applied the same annotation procedure to other related oats genomes, including Sang², SanfenSan³, and OT3098, identifying 135,528, 131,864, and 131,157 genes, respectively, to further facilitate their comparison (Supplementary Table 7).

To identify the closest extant relatives of each subgenome and to verify the allopolyploid origin of A. fatua, we conducted a phylogenetic analysis using 4709 single-copy orthologs from 19 (sub) genomes representing 10 Pooideae species (Fig. 2a; Supplementary Table 8). The result indicated that the divergence between A. fatua (W1004) and hulled oat (Sang) as well as naked oat (Sanfensan³) occurred more recently, at 0.79 ~ 1.13 Mya, and A. fatua was identified as the closest outgroup to cultivated oats (Fig. 2a). To better capture the genomic features of A. fatua (W1004) genome, we performed a comparative analysis of large-scale structural variations among A. fatua (W1004), cultivated oat genomes (Sanfensan) and A. sterilis (Fig. 1c; Supplementary Fig. 4). This analysis revealed numerous large-scale inversions specific to A. fatua (W1004), particularly on chromosomes 4 A and 4 C, which are absent in both A. sterilis and cultivated oats (Supplementary Fig 4d–f).

Fig. 2: Phylogeny of Avena genomes and gene family analysis in A. fatua accession W1004. — **Fig. 2: Phylogeny of *Avena* genomes and gene family analysis in *A. fatua* accession W1004.**

Comparative genomic analysis revealed 472 expanded gene families in A. fatua (W1004) versus cultivated oats, including stress-related ‘Gibberellin-insensitive, Repressor of gal-3 and Scarecrow (GRAS)’, ‘Glutathione-S transferase (GST)’, and ‘UDP-Glycosyltransferase (UGT)’ families (Fig. 2b, c; Supplementary Fig. 5a). GRAS, an important gene family in plant development and response to multiple stresses³⁵, showed notable expansion (187 vs. 162-179 in cultivars), particularly on the A subgenome (Supplementary Fig. 5a, b). And homologs of many cloned disease resistance genes co-localized with R gene-rich regions in A. fatua (W1004) (Supplementary Note 1; Supplementary Fig. 5c–e; Supplementary Table 9). We also found that GSTs exhibited dramatic expansion (266 vs. 236-248 in cultivars), with Tau-class genes specifically amplified on chromosome 4D (13 genes vs. 9–11 in cultivars) (Fig. 2d–f; Supplementary Fig. 6). RNA-seq analysis demonstrated tissue-specific expression of these GSTs, with preferential accumulation in root and leaf (Supplementary Fig. 6e), consistent with their roles in detoxification and herbicide tolerance^36,37. These expansions likely underlie A. fatua’s unique environmental adaptability.

Genetic relationship and evolutionary history of A. sterilis, A. fatua and A. sativa

To reveal the evolutionary relationships among wild, weedy and cultivated oats, we performed whole-genome resequencing for 443 A. fatua accessions collected from roadsides and fields. The sampled A. fatua accessions cover three representative ecological niches in northern China, including 267 accessions from the Tibet Plateau (AFT, average altitude ~3800 m), 71 accessions from Xinjiang (AFXJ, average altitude ~1006 m), and 105 accessions from Zhangjiakou (AFZJK, average altitude ~1416 m) (Fig. 3a; Supplementary Fig. 7a, b; Supplementary Data 1). We further sequenced a diverse collection of cultivated oats (A. sativa) consisting of 288 globally distributed accessions (Fig. 3b; Supplementary Table 10; Supplementary Data 1). In total, 58.16 Tb of resequencing data were generated, with an average depth of 7.29 (79.56 Gb per accession) (Supplementary Data 2). By combining the previously released genomic data of 37 wild (A. sterilis) accessions from different regions of Israel³³ and taking the A. fatua (W1004) genome as the reference, we obtained 76,205,385 high-quality single-nucleotide polymorphisms (SNPs) across the genome, with 84.65% in intergenic and 1.97% in coding regions, respectively (Supplementary Tables 11 and 12). Subgenome D exhibited elevated SNP density, particularly on chromosomes 1D and 4D (Supplementary Fig. 7c; Supplementary Table 12).

Fig. 3: Population genetic analysis of A. sterilis, A. fatua and A. sativa. — Fig. 3: Population genetic analysis of *A. sterilis*, *A. fatua* and *A. sativa.*

To reveal the population structure of these oat populations, we performed principal components analysis (PCA) and ADMIXTURE analysis. To mitigate sample size bias among the three oat species, we performed SNP ascertainment using a balanced panel consisting of 30 randomly selected samples per species and used this SNP panel for subsequent analyses (Supplementary Fig. 8; see Methods). PCA revealed a clear separation of A. sterilis (WST) accessions from other populations along the first principal component, with hulled oats (CSA) forming an intermediate cline and naked oats (CNU) clustering at the opposite end, which is consistent with the notion that A. sativa was domesticated from A. sterilis⁴ (Fig. 3d). Along the second principal component, A. fatua accessions were distinctly separated from cultivated oats. Within A. fatua, accessions from different regions formed overlapping clusters, with Xinjiang accessions positioned intermediately between those from Zhangjiakou and Tibet, reflecting their geographic distribution (Fig. 3d). The ADMIXTURE analysis closely mirrored PCA results (Fig. 3c; Supplementary Fig. 7d), identifying A. sterilis as harboring a distinct genetic component at K = 2. At K = 3, A. sativa and A. fatua were modeled with predominantly different ancestral components. The lowest cross-validation error occurred at K = 4 (Supplementary Fig. 8b), where cultivated oats split into two components corresponding to hulled and naked varieties. Phylogenetic analysis further supported these patterns (Supplementary Fig. 7e), with oat accessions clustering into three major clades representing the three species, and naked oats forming a distinct subclade within cultivated oats. These results demonstrate that A. fatua accessions from northern China are genetically distinct from both cultivated and wild oats, consistent with their unique ecological niche and morphological characteristics. In PCA, A. fatua accessions clustered closer to naked oats than to hulled oats, forming a cline that suggests closer genetic affinity with naked oats (Fig. 3d; Supplementary Fig. 8c). The pattern on PCA is also supported by D-statistics as A. fatua populations positioned closer to naked oat share more alleles with it than others (D (AFT/AFXJ, AFZJK; CNU, A. sterilis) <0, 7 <|Z | <31) (Supplementary Fig. 7f). Together, these results demonstrate a genetic connection between northern Chinese A. fatua and the widely cultivated naked oats, potentially reflecting either a direct evolutionary origin from naked oats or extensive historical gene flow between these groups. Moreover, ADMIXTURE analysis consistently detected a minor A. sterilis-related component in A. fatua populations that is absent in naked oats, indicating additional hybridization occurred between A. fatua and A. sterilis.

To investigate genetic diversity and divergence among the four groups, we calculated the nucleotide diversity (π) for each group and conducted a pairwise analysis of genetic distances (F_ST). The naked oat population (CNU) exhibited the lowest nucleotide diversity (π = 0.61 × 10⁻³), while the A. fatua population showed the highest genetic diversity (π = 1.06 × 10⁻³) among the four groups. Within A. fatua, genetic diversity decreased from high altitude (AFT) to low (AFXJ and AFZJK), with the highest π in Tibet province of China (Supplementary Table 13). The genetic differentiation between A. fatua and CNU (0.272) is lower than the comparisons between A. fatua and CSA/WST (0.337/0.345), further supporting their genetic connection (Fig. 3e). The lowest level of linkage disequilibrium (LD) decay distance was observed for WST and highest level of LD was observed for CNU (Fig. 3f), suggesting a substantial decrease in genetic diversity of CNU during its unique process of evolution and domestication history in China. Together, our results demonstrated the close relationship between A. fatua and A. sativa, and shed light on the genetic relationships and evolutionary history of A. sterilis, A. fatua from various geographical regions in China, and A. sativa worldwide.

Genomic signatures of selection underlying A. fauta’s environmental adaptability

A. fatua has long been noted for being adaptable to a wide range of extreme environments³⁸. To uncover the genetic basis, we compared the population genomes of A. fatua and its closely related, A. sativa, using six complementary methods to identify genomic regions with selection signatures in A. fatua population (Fig. 4a; Supplementary Fig. 9a). (I) Top 10% genomic regions with the greatest differences in the average reads coverage of the whole-genome resequencing data from A. fatua and A. sativa accessions; (II) Top 10% genomic regions with highest structural variations (SVs) frequencies between A. fatua and all cultivated oat genomes (Supplementary Note 2; Supplementary Tables 14 and 15); (III) Cross-population extended haplotype homozygosity (XP-EHH) for the top 10% haplotypes; (IV) Cross-population composite likelihood ratio (XP-CLR) for the top 10% genomic regions (Supplementary Note 3); (V) Fixation statistics (F_ST) for top 10% genomic regions; (VI) Nucleotide diversity PI (π _{A. sativa}/π _{A. fatua}) for top 10% genomic regions. Specifically, methods I and II searches for differentiated genomic regions between the two species based on the pattern of large sequence alterations, while methods III to VI focused on SNP patterns. Collectively, these approaches identified 1.11–1.71 Gb of regions (spanning 12,896–22,051 genes) (Figs. 3g, 4b, c; Supplementary Fig. 9b; Supplementary Note 3; Supplementary Data 3). Among them, we identified 2,417 candidate genes detected by at least five methods (Fig. 4d), suggesting strong divergence between A. sativa and A. fatua with potential selection in A. fatua. We focused on these genes to investigate the genomic basis of their differences in environmental adaptation and phenotypic traits. Functional enrichment analysis revealed strong associations with stress responses (e.g., oxidative stress, auxin signaling) and growth/development (Supplementary Fig. 10a). A total of 105 candidate genes are homologs of known stress-tolerance genes in rice and wheat (Fig. 4e; Supplementary Data 4), linked to heavy metal resistance (8), salt tolerance (22), heat resistance (4), cold resistance (13), drought resistance (14), disease resistant (41) and herbicide resistance (3). Another 51 genes were homologs of genes conferring for important agronomic traits (Fig. 4e; Supplementary Data 4), including yield and quality (16), growth and development (25), sterility (9) and shattering (1). Besides, we found stress-related genes exhibited lower sequence similarity between A. fatua and A. sativa than agronomic-related genes (Fig. 4e), suggesting significant divergence in environmental adaptation between A. fatua and A. sativa.

Fig. 4: Genome-wide scan for regions of genetic divergence between A. fatua and A. sativa. — Fig. 4: Genome-wide scan for regions of genetic divergence between *A. fatua* and *A. sativa.*

This candidate gene set provides important foundation for unveiling the genetic mechanism of the broad adaptation to diverse environments in A. fatua. To facilitate their utilization, we conducted detailed analysis of these genes, incorporating information on SNP variations, resulting amino acid changes, allele frequencies across species, functional annotations of their rice homologous, and tissue-specific expression patterns (Supplementary Data 5 and 6). For example, 2Cg0008813, the ortholog of OsTPS28, encodes a terpene synthase that confers blast and blight resistance in rice³⁹. In oat, this gene is highly expressed in stems and harbors nine non-synonymous SNPs distinguishing A. fatua from A. sativa (Supplementary Fig. 10b; Supplementary Data 5 and 6). Similarly, 5Dg0059047, an ortholog of HIS1 that confers resistance to benzobicyclon and other β-triketone herbicides in rice⁴⁰, exhibits strong floral expression and contains three non-synonymous SNPs defining divergent haplotypes between A. fatua and A. sativa (Supplementary Fig. 10b; Supplementary Data 5 and 6).

We observed significant phenotypic divergence between A. fatua and A. sativa, including stronger seed-shattering and longer awns in A. fatua (Fig. 1a; Supplementary Fig. 1b). Among our candidate genes, 3Cg0189326-a homolog of rice qSH1 that regulates seed shattering⁴¹-exhibited two major haplotypes. Haplotype1 was present in 97% of A. fatua accessions, where haplotype2 dominated cultivated oats (94%). Variant analysis revealed that two non-synonymous SNPs and a 21-bp deletion in the first exon of 3Cg0189326, prevalent in most A. fatua accessions (Supplementary Fig. 10c). Long awns are crucial for seed dispersal and propagation. We identified 2Cg0006814, encoded a cytokinin-activating enzyme to LONG AND BARBED AWN1 (LABA1), which was known to control awn elongation in rice⁴². Haplotype analysis revealed a non-synonymous mutation defining two haplotypes: haplotype1 was present in 93% of the A. fatua accessions, contrasting to only 12% of accessions in the A. sativa (Supplementary Fig. 10d). Furthermore, haplotype1 contained a 5-bp deletion in the upstream regulatory region, disrupting a predicted NAC transcription factor binding site. These mutations represent candidate functional variants underlying the observed interspecific variation in awn length. (Supplementary Fig. 10d). Nevertheless, the functional relevance of these candidate genes remains to be experimentally validated in oat.

The distal region of chromosome 4D, exhibiting GST gene expansions in A. fatua, emerged as a strongly differentiated region between A. fatua and A. sativa across all six detected methods (Fig. 4a, f). This locus harbored two distinct haplotypes segregating between A. fatua and A. sativa, and the density of SVs and divergent SNPs was high in this region. The high F_ST values between the two species associated with the markedly low nucleotide diversity (π) in the A. fatua population in this interval probably indicated that this genomic region was positively selected in A. fatua (Fig. 4f). This result further supports that the GST gene cluster in the locus have contributed to environmental adaptation in A. fatua.

Overall, we provided a comprehensive catalog of selective sweeps in A. fatua, which could be useful resources for future identifying important genes associated with A. fatua’s environmental adaptability.

Multi-omics analysis of herbicide response mechanisms in A. fatua

A. fatua is reported to exhibit remarkable herbicide resistance³⁸, and our herbicide screening experiments confirm that A. fatua exhibits a significantly higher survival rate following herbicide treatment compared to A. sativa (Supplementary Data 7). However, the genetic mechanisms underlying this trait remain unknown. To investigate this, we conducted RNA-seq analysis of leaf and root tissues at three time points (6 h, 24 h, and 7 d) following fenoxaprop-P-ethyl (85 g/hm²) treatment (Fig. 5a). Principal component analysis confirmed data reproducibility (Supplementary Fig. 11a; Supplementary Data 8). We identified 23,833 differentially expressed genes (DEGs) across treatments (Supplementary Fig. 11b–d; Supplementary Table 16), with 2685 and 559 showing consistent differential expression in leaf and root, respectively (Fig. 5b; Supplementary Fig. 11d). DEGs numbers increased with duration (Fig. 5b; Supplementary Table 17). Functional enrichment revealed defense-related response, including amino acid catabolism, auxin response, glutathione metabolism, DNA damage repair (Supplementary Fig. 11e). Among 23 genes whose homologs are related to herbicide resistance in other plants, 21 out of 23 and 20 out of 23 genes response to herbicide treatment in leaf and root of A. fatua, respectively (Supplementary Fig. 11f, g). A total of 142 out of 266 GST genes showed differential expression (Fig. 5c; Supplementary Fig. 11h), supporting GST’s role in herbicide resistance.

Fig. 5: Transcriptome and chromatin accessibility profiling of herbicide response in A. fatua assession W1004. — **Fig. 5: Transcriptome and chromatin accessibility profiling of herbicide response in *A. fatua* assession W1004.**

Dynamics of chromatin accessibility is an important factor regulating gene expression during various stresses⁴³. To explore the potential impacts of chromatin sequencing (ATAC-seq) libraries under 85 g/hm² fenoxaprop-P-ethyl treatment and control conditions at two time points for leaves: 24 hours and 7 days. A total of 8931 and 10,152 differentially accessible regions (DARs) were identified, respectively, with strong enrichment in the promoter regions (Supplementary Fig. 12a-c). Integration with RNA-seq data revealed that 1908 herbicide-responsive DEGs were associated with these chromatin accessibility changes, which were classified into four distinct expression clusters (G1-G4, Fig. 5d). Transcription factors (TFs) binding to the cis-regulatory elements in open chromatin regions (OCRs) always governs the gene expression⁴⁴. TF binding site enrichment patterns differed significantly among DEG-associated accessible regions in clusters G1-G4 (Fig. 5e; Supplementary Fig. 12d). SBP and bZIP were enriched within clusters G1 and G3, respectively; NAC and SBP were enriched at the cluster G2; and MYB and TCP stood out within cluster G4. Furthermore, GO enrichment analysis revealed temporal patterns of highly expressed genes (Fig. 5f): prior to herbicide treatment, genes associated with oxygen binding, carbohydrate metabolic, hormone biosynthetic, protein-DNA complex and protein heterodimerization activity (G2 and G4) were highly expressed. After herbicide treatment, genes related to calcium ion and carbohydrate derivative binding, protein phosphorylation and kinase activity (G3) as well as genes related to response to oxidative stress, endogenous stimulus, hormone and stress (G1) were highly expressed. The key herbicide-responsive module-comprising genes linked to glutathione metabolism, including members of the GST family was strongly induced post-treatment (Fig. 5f, g), suggesting a role of xenobiotic detoxification.

By integrating transcriptomic and chromatin accessibility analyses, we uncovered the dynamic genetic and epigenetic mechanisms underlying the response to herbicide treatment in A. fatua, providing a critical resource for understanding the herbicide tolerance in A. fatua.

qGST4D underlies herbicide tolerance in A. fatua

To further unravel the genomic basis of herbicide tolerance in A. fatua, we performed a genome-wide association study (GWAS) using survival rates following herbicide (fenoxaprop-P-ethyl) treatment as the phenotypic trait (Supplementary Fig. 13a). We identified a major association peak on the end of chromosome 4D (Fig. 6a; Supplementary Fig. 13b), overlapping with the locus we noted in the gene expansion and selective sweep analyses above, which we termed qGST4D (4D: 442-444 Mb) here. The lead SNP (P = 1.189251E-20) of the peak is located within the GST gene clusters on the A. fatua (W1004) genome (Fig. 6a). Moreover, combining evidence from multi-omics analyses and qRT-PCR validation, we found 11 GST genes in the qGST4D locus are responsive to herbicide treatment (Fig. 5c; Supplementary Fig. 13c), indicating the GST cluster might be the major contributor to the herbicide resistance variation in oat population, driving the strong genome-wide association signal.

To further test the role of qGST4D in herbicide resistance, we focused on 4Dg0135144, a GST gene expanded in the A. fatua (W1004) genome within the locus, for further validation. Read coverage (Fig. 6b; Supplementary Fig. 13d) and PCR (Fig. 6c) results confirmed that 4Dg0135144 is present in 96.74% A. fatua accessions compared to 53.41% accessions in A. sativa. Time course qRT-PCR result showed that the expression of 4Dg0135144 was significantly up-regulated herbicide treatment both in root and leaves (Fig. 6d). The ATAC-seq result also showed the chromatin accessibility in the upstream and downstream regions of 4Dg0135144 was increased after herbicide treatment (Fig. 6e). To validate its function, we developed a TRV (Tobacco rattle virus) -induced gene silencing system in A. fatua to obtain knockdown lines of 4Dg0135144, in which gene expression were successfully down-regulated to less than 50% (Supplementary Fig. 14a, b). After 7 days of herbicide treatment, survival rate of the two knockdown lines both decreased significantly compared to the control (P = 0.0004/0.0213, Two-tailed student’s t test; Fig. 6f–h). Previous studies have shown herbicide treatment can affect the burst of reactive oxygen species (ROS), plant cell permeability, and photosynthesis^{45,46,47,48,49,50}. We then measured CAT enzyme activity, the contents of soluble sugar and chlorophyll to evaluate the herbicide resistance of the knock-down line of 4Dg0135144. Our result showed that all the three physiological indices of the knock-down line were significantly lower than the control after herbicide treatment (CAT enzyme activity, P = 0.0003; The contents of soluble sugar, P < 0.0001; The contents of chlorophyll, P < 0.0001; Two-tailed student’s t test) (Fig. 6i). These results showed that knock-down of 4Dg0135144 reduced the herbicide resistance in A. fatua. Moreover, we constructed the pUBI::4Dg0135144-GFP and pUBI::4Dg0135144-flag vector, to generate the over-expression transgenic plants of 4Dg0135144 (4Dg0135144-OE) in oat, with the transgenic line pUBI::GFP or wide type (WT) as the negative control (Supplementary Fig. 14c–h). We found 4Dg0135144-OE plants exhibited greater resistance to herbicide treatment compared to control plants (Fig. 6j-l), concomitant with increased survival rates (P = 0.0042, Two-tailed student’s t test) (Supplementary Fig. 14i) and elevated catalase (CAT) enzyme activity (line 2, P = 0.0007; line 6, P = 0.0003; Two-tailed student’s t test) (Supplementary Fig. 14j). These results demonstrate that 4Dg0135144, an expanded gene within the qGST4D locus, is a key determinant of herbicide tolerance in A. fatua. This finding further corroborates the functional importance of the GST gene cluster in this locus and its contribution to the evolution of enhanced herbicide tolerance in A. fatua.

Discussion

A. fatua is well-known as one of the worst annual weeds worldwide and can adapt to diverse climatic and extreme conditions⁵¹. Its genome harbors numerous stress-resistant alleles, which-given the lack of reproductive barriers with cultivated oats (A. sativa)-could serve as a valuable genetic resource for enhancing abiotic stress tolerance in cultivated oats⁵². In this study, we present a near-complete reference genome assembly and comprehensive annotation for A. fatua. This resource will enable precise gene discovery and accelerate molecular breeding efforts-including marker-assisted selection and interspecific hybridization-in cultivated oats (A. sativa). Moreover, through population genomic sequencing and analyses of 443 A. fatua accessions collected from three distinct ecological zones in northern China, we identified multiple genomic regions showing strong differentiation from cultivated oats and signatures of selection. These loci represent promising targets for investigating the genetic basis of A. fatua’s adaptation to extreme environments.

One of the key traits underlying the remarkable invasiveness of A. fatua is its herbicide tolerance. In this study, we performed multi-omics analyses to characterize the response to herbicide treatment, revealing coordinated regulatory dynamics at the transcriptomic and chromatin accessibility levels. These findings provide a valuable resource for investigating the molecular basis of herbicide tolerance in this invasive weed. Through genome-wide association studies, we identified a major locus associated with herbicide resistance. This locus harbors a GST gene cluster, with most A. fatua accessions carrying a haplotype characterized by GST gene expansion (Fig. 4f). Using knockdown and overexpression transgenic experiments, we functionally validated the contribution of one expanded GST gene to oat herbicide resistance. Our results demonstrate that copy number variation of GST genes underlies variation in herbicide resistance. At this locus, A. fatua and cultivated oat (Avena sativa) populations exhibit strong genetic differentiation. The A. fatua population is dominated by a single haplotype and shows signatures of selection (Fig. 4f). This selected haplotype is also present in cultivated and wild oats (A. sterilis), suggesting that the adaptive haplotype in A. fatua represents standing genetic variation in oats that was selectively favored in weedy populations, facilitating their adaptation.

Although the weedy oat is classified as a distinct species (A. fatua), it lacks reproductive isolation from its close relatives, the wild oat (A. sterilis) and cultivated oat (A. sativa), leaving its evolutionary status unresolved. Here, we generated a population genomic dataset comprising multiple A. sterilis accessions from Israel and globally representative A. sativa accessions, enabling us to assess the evolutionary origins of A. fatua populations collected from northern China. Population genetic analyses revealed that these weedy oats are genetically more closely related to cultivated oats, particularly to naked oats (A. sativa ssp. nuda) varieties widely grown in China. This suggests two possible scenarios: (1) they originated via feralization of local cultivated oats, or (2) they experienced extensive historical gene flow with domesticated populations. However, more detailed demographic modeling is required to resolve their precise evolutionary origin. We acknowledge that our sampling was geographically limited. Given A. fatua’s cosmopolitan distribution, broader sampling is required to fully elucidate its genetic ancestry and establish a global understanding of its genomic basis, as A. fatua populations from different regions may have distinct origins, similar to patterns observed in other crop systems such as rice⁵³.

Methods

Plant materials and genome sequencing

The A. fatua accession W1004 was originally collected in Lhasa, Tibet Province, China (91.39 E, 229.84 N). For genome sequencing, high-quality genomic DNA was extracted from leaves using a DNA extraction kit. SMRTbell libraries were sequenced on a PacBio Sequel II system with HiFi mode. A total of 306.43 Gb circular consensus sequencing (CCS) data were obtained. Hi-C libraries were constructed from tender leaves using the Illumina NovaSeq platform. This allowed us to generate a total of 196.64 Gb of sequencing data. The ONT ultra-long sequencing library was prepared with the Oxford Nanopore SQK-LSK109 kit and subsequently sequenced using a Nanopore PromethION platform. To extract RNA for gene annotation and gene expression analysis, samples from five tissues, including root, stem, leaf, flower and seed, were harvested and frozen immediately in liquid nitrogen. Three independent experiments were used for RNA-seq. Sequencing was conducted by a commercial company (Annoroad Gene Technology, Beijing, China).

Genome assembly and quality assessment

The assembly of the A. fatua (W1004) genome was performed using hifiasm⁵⁴ (v0.19.9-r616) with default parameters (--hg-size 11 G –ul), integrating PacBio HiFi and ONT long reads for genome assembly. Subsequently, the Hi-C reads were processed and aligned to the contigs using Juicer⁵⁵ (v1.6), after which they were clustered into chromosomes by 3D-DNA⁵⁶ with default parameters. Finally, we used Juicebox⁵⁷ Assembly Tools (v2.20) to manually correct errors and visualize the assembled results. Additionally, the three subgenomes (A, C, and D) were distinguished using SubPhaser³⁴, relying on sequence similarity from published hexaploid oat genomes and subgenome-specific k-mers.

For genome assessment, the Hi-C reads were mapped to the final assembly using BWA⁵⁸ (v0.7.17), and ONT and HiFi reads were mapped with Minimap2⁵⁹ (v2.28), the mapping rate of reads on the genome was calculated using SAMtools⁶⁰ (v1.7). Then, BUSCO⁶¹ (v5.2.2) with the “embryophyta_odb10” database to evaluate the genome completeness. Finally, we assessed the genome assembly using Merqury⁶² (v1.3) based on the 21-mer hybrid Merqury k-mer database by HiFi reads.

Identification of telomere sequences

The telomeric sequence 5’ CCCTAAA-3’ and the reverse complement of these seven bases were directly searched in the genome sequence.

Repeats and gene annotation

The sequenced genomes of the Avena species are a central resource for comparative work supporting the understanding of genome evolution. However, full utility has been hampered by uneven genome annotation. To better utilize the genomic resources of these released cultivated oats, we have performed repeat and gene reannotations for all of these 4 genomes (A. fatua (W1004), Avena sativa cv. Sang, Sanfensan, and OT3098) using the same ab initio-based annotation strategy combining with our extensively collected and integrated evidence data. Repetitive sequences in the genome were identified based on ab initio prediction and homology searching. The reference genome was hard masked during gene annotation. For ab initio prediction, RepeatModeler⁶³ (v2.0.2), LTR_FINDER⁶⁴ (v1.07), LTRharvest⁶⁵ (v1.5.10) and LTR_retriever⁶⁶ (v2.9.0) were used to bulid a de novo-based repeat database. RepeatMasker⁶⁷ was used to mask the genome and annotate the TE elements using the library combined by RepeatModeler⁶³ (v4.1.1), LTR_retriever⁶⁶ (v2.9.0) and Repbase⁶⁸ (v15.02). Tandem Repeats Finder⁶⁹ (v.4.07b) was used to search tandem repeats with default parameters.

To obtain high-quality annotation of protein-coding genes, we employed multiple methods for gene prediction, including transcriptome-based predictions, de novo predictions, and homology-based predictions. We adopted Augustus⁷⁰ (v3.2.3) to perform de novo gene prediction. We have constructed a comprehensive protein database using the high-quality sequences from Brachypodium distachyon, Hordeum vulgare, Oryza sativa and Triticum aestivum and Avena species generated from third-generation sequencing. Genomethreader⁷¹ (v1.7.3) was used for homology-based gene prediction. For transcript-based predictions, Trinity⁷² (v2.12) was used to assemble all the RNA-seq data, followed by alignment to the genome with the PASA⁷³ (v2.3) software. In addition, the clean RNA-seq reads were mapped to the genome using HISAT2⁷⁴ (v2.2.1) to obtain transcriptome mapping data. Subsequently, StringTie⁷⁵ (v2.1.6) was used to assemble the mapped reads into transcripts. The assembled transcripts were then used for open reading frame (ORF) prediction with TransDecoder (v.5.1.0) (https://github.com/TransDecoder/TransDecoder). Finally, EVidenceModeler⁷⁶ (v1.1.1) was used to integrate the prediction results obtained by the above three methods to produce a non-redundant set of gene structures.

The completeness of the final gene set was evaluated by BUSCO⁶¹ (v5.2.2) with the “embryophyta_odb10” database. Functions of the protein-coding genes were annotated using Diamond⁷⁷ (v.0.9.29) blastp with an E-value 1e-5 to search against the databases of NCBI non-redundant protein database (NR) and SwissProt (http://web.expasy.org/docs/swiss-prot/guideline.html) database. Protein domain and gene ontology term annotations were performed using InterProScan⁷⁸ (v4.7), while KO terms were assigned through homology searches with KofamScan⁷⁹.

Gene families and phylogenetic analysis

We used OrthoFinder⁸⁰ (v2.3.14) to identify orthologous gene families between A. fatua and 9 other plant species, including two Triticeae species (Secale Cereale, Triticum Aestivum) and eight Aveneae species (Avena strigose, Avena atlantica, Avena longiglumio, Avena eriantha, Avena insularis, Avena sativa ssp. Sang, and Avena sativa ssp. Sanfensan) with default parameters. The phylogenetic relationships among these species were estimated using RAxML⁸¹ (v8.2.12) based on the protein sequences of single-copy ortholog families. Divergence times were estimated using the MCMCtree program embedded in PAML⁸² (v4.9). The expansion and contraction of orthologous gene families were assessed based on a maximum likelihood tree using CAFE⁸³ (v4.2). A P value threshold of 0.05 was applied to determine which families had significant size changes between species. GO enrichment analysis of specific expanded gene families was performed using the R package clusterProfiler⁸⁴. Collinear blocks between subgenomes were obtained by Jcvi⁸⁵ (v1.2.1) with default parameters.

Identification of gene families

Hidden Markov model profiles of gene domains were downloaded from Pfam (http://pfam.xfam.org/) for use in gene family identification. Hmmer⁸⁶ (v3.1) and Diamond⁷⁷ (v.0.9.29) blastp searches were used to identify putative genes in the A. fatua, A. sativa cv. Sang, A. sativa cv. Sanfensan, and OT3098 genome. Protein sequences of gene family members from Arabidopsis thaliana were used as queries with an E value of 1e-5.

GST family genes were further identified using InterProScan⁷⁸ (v5.22) based on the keywords “GST” and “Glutathione S -transferase”. To classify the GST gene family, we first aligned the protein sequences of all identified GSTs together with reference GSTs from Oryza sativa and Arabidopsis thaliana (Supplementary Data 9) using MAFFT⁸⁷ (v7.490). A maximum likelihood phylogenetic tree was then constructed with IQ-TREE⁸⁸ (v2.2.0), and GST genes were assigned to established classes based on sequence similarity and conserved domain architecture. The chromosomal distribution of GST genes was visualized using TBtools⁸⁹ (v2.121).

Identification of R genes

R genes in the four Avena genomes (A. fatua, A. sativa cv. Sang, A. sativa cv. Sanfensan, and OT3098) were identified using RGAugury⁹⁰ pipeline. The nucleotide-binding site (NB-ARC) domain-encoding genes were classified into different subgroups based on their domain and motif structures: TN (Toll/interleukin-1 receptor (TIR) and NB-ARC), CN (coiled-coil (CC) and NB-ARC), NL (NB-ARC and leucine rich repeat (LRR)), CNL (CC, NB-ARC and LRR), NB (NB-ARC), TNL (TIR, NB-ARC and LRR).

To understand whether the identified R genes were related to the map positions of known cloned loci for powdery mildew, rust, and smut, DNA markers that co-segregated or flanked known genes were mapped to the A. fatua and cultivated oat genomes by Diamond⁷⁷ (v.0.9.29) blastp analysis. The distribution of R genes and known quantitative trait loci are shown in Supplementary Fig. 5e.

Sampling and whole-genome sequencing

To collect A. fatua accessions in China, we surveyed three regions severely affected by common wild oats: Xinjiang, Tibet, and Hebei-Zhangjiakou. In various locations, we searched for weedy A. fatua along roadsides and in agricultural fields. Seeds from each plant were gathered, and their geographical coordinates were documented. All accessions were collected from farmlands such as wheat and barley fields, or from the margins of these agricultural areas^10,11, which are well documented as typical habitats for A. fatua (https://cals.cornell.edu/weed-science/weed-profiles/wild-oat). The identification of A. fatua was based on well-established morphological characteristics¹⁰. Specifically, the collected individuals exhibited key features of A. fatua, including spontaneous seed shattering at maturity, long dark brown to black awns, and seeds with distinctly darker seed coats. These characteristics were consistently observed across accessions and are consistent with earlier reports¹⁰. Finally, 443 A. fatua with different geographic origins from Xinjiang, Tibet and Zhangjiakou of Hebei province were collected. Additionally, 37 A. sterilis accessions from Israel³³ and 228 A. sativa accessions were selected from global collection were included for comparative analysis (Supplementary Table 10; Supplementary Data 1). Plants were grown in the greenhouses at the Hebei University (Baoding City, Hebei province, China). Fresh young leaves from individual seedings were gathered and immediately frozen in liquid nitrogen for the subsequent DNA extraction. The genomic DNA was extracted using the cetrimonium bromide method. The libraries were sequenced using the Illumina HiSeq X Ten platform. and a total of 58.16 TB sequencing data were obtained, with an average depth of 7.29x for each sample.

Variant detection and annotation

The clean reads were mapped to the refence genome (W1004) using the ‘mem’ algorithm in BWA⁵⁸ (0.7.17) with default parameters. The results were converted to BAM format using SAMtools⁶⁰ (v1.7), filtered for mapping quality with ‘-q 30’ parameter, and the BAM files were sorted by Sambamba⁹¹ (v0.8.2). The Genome Analysis Toolkit⁹² (GATK, v4.2.0.0) modules MarkDuplicates were used to marked PCR-duplicated read pairs. Then, genomic variants in GVCF format for each accession were identified by the HaplotypeCaller model of GATK. Subsequently, the modules CombineGVCFs, GenotypeGVCFs, SelectVariants and VariantFiltration were used for variants calling. Finally, the SNPs were filtered with the following parameters: “-filterExpression QD < 2.0 | | FS > 60.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0 | | SOR > 3.0 | | MQ < 40.0.” Potentially false SNPs were further filter using VCFtools⁹³ (v0.1.16) with the following parameters: (1) MAF ≥ 5% (2) Missing rate ≤ 0.5% (3) bi-allelic sites. High-quality SNPs were further annotated with SnpEff⁹⁴ (v5.1).

Population genetic analyses

To ensure the quality of genotype data, we first performed principal component analysis (PCA) on the balanced panel (768 individuals) and projected the remaining individuals. Based on the results of the first two principal components (PC1 and PC2), we excluded samples that exhibited significant outlier patterns, which may indicate contamination or mislabeling, in total 17 outliers were removed. To perform population genetic analysis, 30 randomly picked individuals from each of A. sterilis, A. fatua and A. sativa population to construct a sample size balanced panel. Specifically, within A. sativa, 15 individuals were picked from hulled and naked oats, respectively; within A. fatua, 10 individuals were picked from each of the three regions (Zhangjiakou, Xinjiang, Tibet) respectively. Within the panel, SNPs were further filtered by keeping sites where minor allele frequency is greater than 5% and missing rate lower than 10%, retaining 44,752,824 SNP sites. To reduce linkage disequilibrium and ensure even variant spacing, SNPs were further thinned by randomly keeping one site every 10 kb across the genome, retaining 838,840 SNPs for subsequent analyses. To reduce potential reference bias, we called pseudo haploid genotypes at each site for each individual by randomly picking a read mapped to the site and taking the call from the read. The neighbor-joining (NJ) tree was constructed to assess the genetic relationships among the oat accessions. Pseudo haploid genotype calls at each SNP for each individual were encoded as binary matrices, and pairwise Hamming distances were calculated using pdist from SciPy. Hierarchical clustering with the Ward method (linkage) was then used to generate the tree. Population structure was inferred using the ADMIXTURE⁹⁵ program (v1.3.0) in supervised mode. To provide reference information for supervised clustering, the balanced panel consisting of 90 individuals used as the reference panel. Genotype data in PLINK BED⁹⁶ format were analyzed across a range of K values, with 8 independent runs per K (--seed 1-10). The optimal number of ancestral populations was determined by identifying the run with the lowest cross-validation error (--cv). The supervised mode leveraged the predefined group labels of the reference individuals to estimate ancestry proportions for the remaining samples. The principal component analysis was performed on the balanced panel consisting of 90 individuals using smartpca (https://github.com/wenhedr/PCA_analysis_using_SmartPCA), with the rest individual projected. To perform D-statistics, we used the qpDstat program from the AdmixTools package⁹⁷ (v6.0). The input genotype dataset was first converted to EIGENSTRAT format using convertf. The D-statistics were computed for selected quartets of populations based on a user-defined test list, each consisting of four populations (W, X, Y, Z).

Structural variation analysis

We selected W1004 genome as the reference. The other cultivated genomes (A. sativa cv. Sanfensan) were aligned to the reference using mummer⁹⁸ (v4.0.0) with the parameter -c 1000. The raw alignment results were further filtered using delta-filter with parameters -m -i 90 -l 100. The resulting filtered delta files were used to detect structural variations using the SyRI⁹⁹ (v.1.4) pipeline with default parameters and visualized with plotsr¹⁰⁰ (v.1.4). To confirm the inversions identified within genomes, we performed reciprocal mapping of Hi-C data using the Juicer⁵⁵ (v.1.6) pipeline. The Hi-C contact maps were visualized using JuiceBox⁵⁷ (v.2.20).

Identification of differentiated regions

To identify the genomic regions of divergence, we scanned the genome for multiple methods: (I) We used BWA⁵⁸ (v.0.7.17) to align the genome resequencing reads of A. fatua and A. sativa L. accessions to the W1004 reference genome, comparing their sequence similarity. Then the average identity score was estimated for each 1 M non-overlapping window. (II) We compared the genomes of cultivated oats with the reference A. fatua to identify structural variants (SVs) using SyRI⁹⁹ (v.1.4). We calculated the SVs frequency around 500 kb sliding windows. The top 10% of regions with the highest breakpoint frequencies as SV hotspots, representing divergence regions between the two groups. (III) We used Selscan¹⁰¹ (v.2.0) to estimate XP-EHH, and then the average XP-EHH score was estimated for 500 kb sliding windows with 250 kb steps. (IV) The XP-CLR values calculated by the XP-CLR¹⁰² program (v1.1.2) using 500 kb sliding windows with 250 kb steps. (V) Popuation Fixation statistics (F_ST) were calculated using VCFtools⁹³ (v0.1.16). The F_ST value were measured for each 500-kb window, with a step size of 250 kb. VI) Genetic diversity (π) was calculated using VCFtools⁹³ (v0.1.16) with a window size of 500 kb and a window step of 250 kb. Pi-ratio was further calculated between two groups as PI (π _{A. sativa L.}/π _{A. fatua}). Outlier windows (top 10%) detected by at least five methods were considered as the candidate regions of highly divergent regions. The distribution of the divergent regions was visualized with RectChr v1.38 (https://github.com/BGI-shenzhen/RectChr). Genes located in selected sweeps were considered as selected candidate genes.

Identification of homologous genes in divergent regions

To identify orthologous genes between A. sativa and A. fatua, we first performed synteny analysis using the JCVI⁸⁵, based on a cloned gene from rice. Colinear gene pairs were extracted from the syntenic blocks between the rice gene and subgenomes of both cultivated and A. fatua oats. To further validate orthology, we constructed phylogenetic trees using iq-tree⁸⁸ based on the protein sequences of the colinear genes. Gene pairs that clustered together with high support were retained for downstream analysis. Finally, the gene structures of these candidate orthologs were examined; only those with highly similar exon-intron organization were designated as putative one-to-one orthologs.

Herbicide tolerance evaluation of the oat association panel

Survival rate (SR) tests were conducted in an association-mapping panel composed of 259 oat accessions (Supplementary Data 7). The herbicide tolerance of the panel was recorded and evaluated¹⁰³. To compare the herbicide resistance between A. fatua and A. sativa, 103 wild accessions and 156 cultivated accessions were selected. All of the accessions were randomly planted in a cultivation pool with a soil: vermiculite mixture at a 2:1 ratio, and 20 plants of each accession were grown per pot in each assay. The seedlings were grown in a greenhouse under 12-h light/12-h dark and 20 °C/15 °C with a relative humidity of 50% for assessment of herbicide stress response or SR. Seedlings at the three-leaf stage were used for herbicide (fenoxaprop-P-ethyl, Huaxing, GB/T22618-2008) treatment. When the seedlings reached the three-leaf stage, they were uniformly sprayed with a solution of 680 g/hm² fenoxaprop-P-ethyl. Survival rate was assessed 10 days post-treatment by counting the number of living plants per pot.

RNA-seq data analyses

We performed RNA-seq under different herbicide treatments. Seedlings were treated with fenoxaprop-P-ethyl at 0 g/hm² (Control) and 85 g/hm² (Treatment). Then the leaf and root samples were separately collected in triplicate as biological replicates at three different timepoints (6 hours, 24 hours and 7 days). and resulted in the generation of 36 RNA-seq libraries. The clean reads were mapped to the reference genome W1004 using HISAT2⁷⁴ (v2.2.1), and SAMtools⁶⁰ (v1.7) was used to sort the resulting aligned reads. The gene expression values were normalized in units of reads per kilobase per million (TPM). DESeq2¹⁰⁴ software (v1.28.1) was used to identify DEGs with thresholds of P_adj < 0.05 and |log2 (fold-change) | >1. Then, the genes with TPM ≥ 0.1 were defined as DEGs.

Of the total DEGs gene sets, 34 DEGs presented in all 6 groups. In the root, 572, 1013 and 1690 DEGs presented in 4 to 5 group (>90% of the collection), 1834, 2960 and 5572 DEGs presented in 2 to 3 group, and 419, 666 and 3074 DEGs presented in only one group (Fig. 5b; Supplementary Table 17). As well as, in the leaf, 1755, 1771 and 1883 DEGs presented in 4 to 5 group (>90% of the collection), 5595, 5655 and 6443 DEGs presented in 2 to 3 group, and 1611, 1550 and 2539 DEGs presented in only one group (Fig. 5b; Supplementary Table 17). Heatmap visualization was performed using the R package pheatmap (https://CRAN.R-project.org/package=pheatmap).

ATAC-seq data analysis

Fastp¹⁰⁵ (v0.23.2) was used process raw reads by removing adapters, trimming low-quality bases, and filtering bad reads. Clean reads were aligned to the W1004 reference genome using the BWA⁵⁸ mem algorithm (v0.7.17). The aligned reads were sorted and filtered using SAMtools⁶⁰ (v1.7).

Peak calling was done using bam files in macs2 (v2.2.9.1) (https://github.com/macs3-project/MACS). For ATAC-seq data, the parameter of peak calling using macs2 was ‘-q 0.05 -f BAMPE -nomodel -extsize 200 -shift -100 -g 10911912618’. Peaks located in promoter and genic regions were annotated to genes that overlapped with them, while peaks in distal regions were annotated to the nearest gene/TSS. For quantification of ATAC-seq data, read counts under the reference peak, along with normalized counts per million values (DBA_SCORE_TMM_READS_EFFECTIVE_CPM), were produced using the R package DiffBind¹⁰⁶ (v2.16.2). Raw peak counts were used as input for identifying differentially accessible and differentially marked peaks with the R package DESeq2¹⁰⁴ (v1.28.1). The thresholds were |log2 fold change | > 1, FDR < 0.01 and adjusted P < 0.05. The heatmaps centered on peaks were created using computeMatrix and plotHeatmap from deeptools¹⁰⁷ (v3.5.5).

For each cluster, the representation analysis in the open chromatin was performed using the enricher function in the R package clusterProfiler⁸⁴ (v3.19). Enrichment analysis of TF families for DEGs with open chromatin was achieved using homer (v24.05.1) (https://github.com/bastienwirtz/homer.git).

qRT-PCR assay

Total RNA was extracted with TRIzol reagent (Thermo Fisher Scientific,15596018CN) and reverse transcribed into cDNA using the PrimeScript-RT Reagent Kit (Takara, RR047A) according to the manufacturer’s instructions. The qRT-PCR assay was performed using TB Green premix Ex Taq^TM II (Takara, RR820A). All primers used in the qRT-PCR assays are listed in Supplementary Table 18.

Herbicides treatment

Pot culture method was used to cultivate wild oat seedlings. The indoor temperature was set at 20 °C during the day, 15 °C at night, the illumination was 12 hours, and the relative humidity was (50 ± 5) %. At the stage of 2 ~ 3 leaves after the emergence of wild oats, a small spray pot was used to spray fenoxaprop-P-ethyl (Huaxing, GB/T22618-2008) at a dose of 340 g/hm².

Virus-mediated gene silencing

To investigate the potential role of 4Dg0135144 in herbicide tolerance of oat, we employed the Tobacco Rattle Virus (TRV) -virus induced gene silencing (VIGS) system to knock down the expression of the gene. The vector was generated by subcloning these fragments into the TRV2 infection plasmid vector using the NC clone (NC Biotech, NC001). To initiate the virus-induced gene silencing assay, Agrobacterium tumefaciens strains GV3101 carrying pTRV1 and different pTRV2 derived vectors (TRV2, V-4Dg0135144-1/V-4Dg0135144-2) in 1:1 ratio, and supplemented with acetosyringone (AS, coolaber, SL95131) (19.62 mg. L⁻¹), cysteine (Cys, Amethyst, 52-90-4) (400 mg. L⁻¹), and Tween-20 (sigma, P2287) (5 ml. L⁻¹). Infect experiments were performed under vacuum suction¹⁰⁸, the Agrobacterium tumefaciens strains GV3101 concentration was OD = 2.0 and the vacuum condition is 40 kpa.

Plasmids and cloning procedures

To generate the constructs for virus-induced gene silencing (VIGS) assay, the target ORF fragment region of 4Dg0135144 was amplified from W1004 cDNA and the target ORF fragment regions of AsPDS was amplified from A. sativa cv. Marvellous cDNA. The target ORF fragment regions should be 200–400 bp in length, with 40–60% GC content, while avoiding repetitive sequences or secondary structures¹⁰⁹. These fragments were then cloned into TRV2 infection plasmid vector with NC clone (NC Biotech, NC001) to generate pTRV2::4Dg0135144-1/pTRV2::4Dg0135144-2/pTRV2::AsPDS. All primers are listed in Supplementary Table 18.

Determination of catalase activity and soluble sugar content

The materials for physiological indexes were obtained from the materials treated with herbicides for 7 days. The catalase activity and soluble sugar content was tested with the CAT activity detection kit (Solarbio, BC0200) and the Plant soluble sugar detection kit (Solarbio, BC0030).

Chlorophyll measurement

The leaves were incubated in 95% (v/v) ethanol for 5 d in the dark. The absorbances were measured at 665 and 649 nm. The chlorophyll contents were calculated according to the following ratio: (6.63A665 + 18.08A649) /g fresh weight¹¹⁰.

A. sativa transformation

Mature embryos of healthy plants of Avena sativa cultivar ‘Bayou18’ grown in a well-conditioned greenhouse were collected and cultured on L3-M medium [4.6 g. L⁻¹ L3 Base Salts with vitamins (Coolaber, PM1621), 30 g. L⁻¹ maltose (Coolaber, CM7181), 4 g. L⁻¹ phytagel (Coolaber, CP8581Z), 2 mg. L⁻¹ 2,4-D (Coolaber, PH105), 1 g. L⁻¹ Dicamba (Coolaber, PH113)] until embryonic callus was produced. Agrobacterium tumefaciens strain GV3101 with pUBI::4Dg0135144-GFP was cultured at 28 °C in YEP medium overnight. Centrifuge at 25 °C, 3438 × g for 10 minutes. Then, the precipitates were resuspended to OD = 0.5 with WLS solution [4.30209 g Linsmaise&Skoog Base Salts (Coolaber, PM1470), 100 μL 1000 ×MS vitamins (Coolaber, PML1780), 10 g Glucose (Sangon biotech, A501991-0500), 0.5 g MES (Sigma, M8250), add H₂O to 1 L. PH = 5.8]. Mixed Agrobacterium tumefaciens strain GV3101 with pUBI::4Dg0135144-GFP and Agrobacterium tumefaciens strain GV3101 with pUBI::TaWOX5 in equal proportion. Soak the selected embryonic callus in the mixed Agrobacterium tumefaciens strain for 30 minutes. The embryogenic callus was removed and the residual bacterial liquid was absorbed by filter paper. The embryogenic calli were cultured on filter paper containing 75 μ mol as in dark for 3 days. And then, embryogenic calli were cultured on wls-res medium for 5 days. WLS-P5 medium [the wls-res medium containing 0.5- 5 mg.L⁻¹ Basta (Sangon Biotech, A356357)] was used to gradiently screen the callus which survived successfully, and the callus was transferred to regeneration medium [4.6 g L3 Base Salts with vitamins (Coolaber, PM1621), 5 mg Zeatin (Coolaber, PH110), 20 g sucrose (Diamond, A100335-0250), 0.5 g MES (Sigma, M8250), 200 μL 12.5 g.L⁻¹ CuSO₄·5H₂O (Sangon biotech, A600063), 4 g phytagel (Coolaber, CP8581Z), add H₂O to 1 L. PH = 5.8] and grow to 3-5 cm. Finally, the roots were cultured on the rooting medium [4.6 g L3 Base Salts with vitamins (Coolaber, PM1621), 0.2 mg.ml^-1 IBA (Coolaber, PH1031), 15 g sucrose (Diamond, A100335-0250), 0.5 g MES (Sigma, M8250), 4 g phytagel (Coolaber, CP8581Z), add H₂O to 1 L. PH = 5.8]. Transgenic plants confirmed by Genetically modified PAT/bar colloidal gold test strips (BioRun, RIA03) or qRT-PCR, were used for phenotypic evaluation^111,112,113.

The pUBI::4Dg0135144-Flag transgenic plants were also generated. The vector was generated by homologous recombination into the pUBI::Flag plasmid vector using the ClonExpressII One Step Cloning Kits (Vazyme, C112). Hygromycin (5 mg. L⁻¹) was used as the selection pressure for screening positive seedlings. Transgenic plants of the T1 generation were confirmed by PCR, qRT-PCR and western blot. Positive transgenic plants were used for phenotypic evaluation.

Western blot assay

Total protein was extracted by the extraction buffer [0.2 M NaCl, 5 mM MgCl2, 5 mM DTT, 20 mM Tris-HCl (pH 7.5), 0.03% Tween-20 (Ameresco), and 0.5 tablets of protease inhibitor (Roche)]. The supernatant was collected by centrifuging at 12,000 rpm for 15 min. Total proteins were examined by western blot analysis using α-tubulin (1:5000; EASYBIO, BE0031) as a loading control. Proteins in the study were also probed with α-Flag (1:2000; Sigma, F1804). Secondary antibodies were goat anti-mouse IgG (1:5000; EASYBIO, BE0102). The protein Marker (product #26616) purchased from Thermo Scientific was used in all the western blot assays in this manuscript. The instrument (BIO-01, O1900) was used to obtain images.

Statistical analyses

Details of the statistics of Fig. 6d, h, I and Supplementary Fig. 13c, 14a, b, e, g, i and j are provided in the figure legends and Methods. All statistics were carried out in R using Student’s t-test (two tailed) (unless otherwise indicated). Each experiment involved at least three biological replicates.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Raw sequencing data and genome assembly have been deposited in the Genome Sequence Archive (GSA) at the National Genomics Data Center under the BioProject accession number PRJCA030330. Raw genome sequencing data and genome assembly are also available at National Center for Biotechnology Information (NCBI) under the Bioproject PRJNA1259166. The whole genome resequencing data have been deposited in NCBI under the Bioproject PRJNA1253394. The transcriptome data have been deposited in NCBI under the Bioproject PRJNA1265370. Source data are provided with this paper.

References

Liu, Q., Lin, L., Zhou, X., Peterson, P. M. & Wen, J. Unraveling the evolutionary dynamics of ancient and recent polyploidization events in Avena (Poaceae). Sci. Rep. 7, 41944 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kamal, N. et al. The mosaic oat genome gives insights into a uniquely healthy cereal crop. Nature 606, 113–119 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Peng, Y. et al. Reference genome assemblies reveal the origin and evolution of allohexaploid oat. Nat. Genet. 54, 1248–1258 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhou, X., Jellen, E. N. & Murphy, J. P. Progenitor germplasm of domisticated hexaploid Oat. Crop Sci. 39, 1208–1214 (1999).
Article Google Scholar
Harlan, J. R. & de Wet, J. M. Toward a rational classification of cultivated plants. Taxon 20, 509–517 (1971).
Article Google Scholar
Ladizinsky, G. & Zohary, D. Notes on species delimination, species relationships and polyploidy in Avena L. Euphytica 20, 380–395 (1971).
Article Google Scholar
Loskutov, I. G., Gnutikov, A. A., Blinova, E. V. & Rodionov, A. V. The origin and resource potential of wild and cultivated species of the genus of oats (Avena L.). Russian J. Genet. 57, 642–661 (2021).
Article CAS Google Scholar
Okoń, S., Ociepa, T., Nucia, A., Cieplak, M. & Kowalczyk, K. Is every wild species a rich source of disease resistance? Avena fatua L.—potential donor of resistance to powdery mildew. Plants 10, 560 (2021).
Holm, L. G., Plucknett, D. L., Pancho, J. V. & Herberger, J. P. The world’s worst weeds. Distribution and Biology, (1977).
Beckie, H. J., Francis, A. & Hall, L. M. The biology of Canadian weeds. 27. Avena fatua L.(updated). Can. J. Plant Sci. 92, 1329–1357 (2012).
Article CAS Google Scholar
Morikawa, T., Sumiya, M. & Kuriyama, S. Transfer of new dwarfing genes from the weed species Avena fatua into cultivated oat A. byzantina. Plant Breed. 126, 30–35 (2007).
Article CAS Google Scholar
Rines, H. et al. Collection and evaluation of Avena fatua for use in oat improvement 1. Crop Sci. 20, 63–68 (1980).
Article CAS Google Scholar
Zuo, W. et al. Breeding of a new grain-forage variety of naked oat’Bayou 18’ with high quality. Chin. J. Grassl. 43, 42–49 (2021).
Google Scholar
Somody, C. N., Nalewaja, J. D. & Miller, S. D. The response of wild oat (Avena fatua) and Avena sterilis accessions to photoperiod and temperature. Weed Sci. 32, 206–213 (1984).
Article Google Scholar
Gallagher, R. S., Granger, K. L., Snyder, A. M., Pittmann, D. & Fuerst, E. P. Implications of environmental stress during seed development on reproductive and seed bank persistence traits in wild oat (Avena fatua L.). Agronomy 3, 537–549 (2013).
Article Google Scholar
Alwarnaidu Vijayarajan, V. B. et al. First report on assessing the severity of herbicide resistance to ACCase inhibitors pinoxaden, propaquizafop and cycloxydim in six Avena fatua populations in Ireland. Agronomy 10, 1362 (2020).
Article Google Scholar
Keith, B., Lehnhoff, E., Burns, E., Menalled, F. & Dyer, W. Characterisation of Avena fatua populations with resistance to multiple herbicides. Weed Res. 55, 621–630 (2015).
Article CAS Google Scholar
Wang, H. et al. Mechanism of multiple resistance to fenoxaprop-P-ethyl, mesosulfuron-methyl, and isoproturon in Avena fatua L. from China. Pesticide Biochem. Physiol. 203, 105985 (2024).
Hussain, A. et al. Herbicide resistance: another hot agronomic trait for plant genome editing. Plants 10, 621 (2021).
Article CAS PubMed PubMed Central Google Scholar
MacLaren, C., Storkey, J., Menegat, A., Metcalfe, H. & Dehnen-Schmutz, K. An ecological future for weed science to sustain crop production and the environment. A review. Agron. Sustain. Dev. 40, 1–29 (2020).
Article Google Scholar
Heap, I. Herbicide Resistant Weeds (Springer, 2014).
Yu, Q., Ahmad-Hamdani, M., Han, H., Christoffers, M. & Powles, S. Herbicide resistance-endowing ACCase gene mutations in hexaploid wild oat (Avena fatua): insights into resistance evolution in a hexaploid species. Heredity 110, 220–231 (2013).
Article CAS PubMed Google Scholar
Agaras, B. C., Grossi, C. E. M. & Ulloa, R. M. Unveiling the secrets of calcium-dependent proteins in plant growth-promoting rhizobacteria: an abundance of siscoveries awaits. Plants 12, 3398 (2023).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. Managing herbicide resistance in China. Weed Sci. 69, 4–17 (2021).
Article Google Scholar
Lei, T. et al. Metabolic resistance mechanism to glufosinate in Eleusine indica. Pestic. Biochem. Physiol. 204, 106083 (2024).
Article CAS PubMed Google Scholar
Evans, A. F. Jr. et al. Biochemical characterization of metabolism-based atrazine resistance in Amaranthus tuberculatus and identification of an expressed GST associated with resistance. Plant Biotechnol. J. 15, 1238–1249 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ioannou, E., Papageorgiou, A. C. & Labrou, N. E. Directed evolution of Phi class glutathione transferases involved in multiple-herbicide resistance of grass weeds and crops. Int. J. Mol. Sci. 23, 7469 (2022).
Article CAS PubMed PubMed Central Google Scholar
Cummins, I. et al. Key role for a glutathione transferase in multiple-herbicide resistance in grass weeds. Proc. Natl. Acad. Sci. USA 110, 5812–5817 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Dücker, R. et al. Glutathione transferase plays a major role in flufenacet resistance of ryegrass (Lolium spp.) field populations. Pest Manag. Sci. 75, 3084–3092 (2019).
Article PubMed Google Scholar
Alebrahim, M. T., Zangoueinejad, R. & Tseng, T. M. Biochemical and molecular knowledge about developing herbicide-resistant weeds. Herbicide Resistance in Weeds and Crops, 101-132 (2017).
Gaines, T. A. et al. Mechanisms of evolved herbicide resistance. J. Biol. Chem. 295, 10307–10330 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bo, A. B., Won, O. J., Sin, H. T., Lee, J. J. & Park, K. W. Mechanisms of herbicide resistance in weeds. Korean J. Agric. Sci. 44, 1–15 (2017).
CAS Google Scholar
He, Q. et al. The near-complete genome assembly of hexaploid wild oat reveals its genome evolution and divergence with cultivated oats. Nat. Plants 10, 2062–2078 (2024).
Article CAS PubMed Google Scholar
Jia, K. H. et al. SubPhaser: a robust allopolyploid subgenome phasing method based on subgenome-specific k-mers. N. Phytol. 235, 801–809 (2022).
Article CAS Google Scholar
Waseem, M. et al. GRAS transcription factors emerging regulator in plants growth, development, and multiple stresses. Mol. Biol. Rep. 49, 9673–9685 (2022).
Article CAS PubMed Google Scholar
Monticolo, F., Colantuono, C. & Chiusano, M. L. Shaping the evolutionary tree of green plants: evidence from the GST family. Sci. Rep. 7, 14363 (2017).
Article ADS PubMed PubMed Central Google Scholar
Tiwari, V., Patel, M. K., Chaturvedi, A. K., Mishra, A. & Jha, B. Functional characterization of the Tau class Glutathione-S-Transferases gene (SbGSTU) promoter of Salicornia brachiata under salinity and osmotic stress. PLoS One 11, e0148494 (2016).
Article PubMed PubMed Central Google Scholar
Bajwa, A. A. et al. Biology and management of Avena fatua and Avena ludoviciana: two noxious weed species of agro-ecosystems. Environ. Sci. Pollut. Res. 24, 19465–19479 (2017).
Article CAS Google Scholar
Zhan, C. et al. Selection of a subspecies-specific diterpene gene cluster implicated in rice disease resistance. Nat. Plants 6, 1447–1454 (2020).
Article CAS PubMed Google Scholar
Maeda, H. et al. A rice gene that confers broad-spectrum resistance to β-triketone herbicides. Science 365, 393–396 (2019).
Article ADS CAS PubMed Google Scholar
Konishi, S. et al. An SNP caused loss of seed shattering during rice domestication. Science 312, 1392–1396 (2006).
Article ADS CAS PubMed Google Scholar
Hua, L. et al. LABA1, a domestication gene associated with long, barbed awns in wild rice. Plant Cell 27, 1875–1888 (2015).
Article CAS PubMed PubMed Central Google Scholar
Potter, K. C., Wang, J., Schaller, G. E. & Kieber, J. J. Cytokinin modulates context-dependent chromatin accessibility through the type-B response regulators. Nat. Plants 4, 1102–1111 (2018).
Article CAS PubMed Google Scholar
Inukai, S., Kock, K. H. & Bulyk, M. L. Transcription factor–DNA binding: beyond binding site motifs. Curr. Opin. Genet. Dev. 43, 110–119 (2017).
Article CAS PubMed PubMed Central Google Scholar
Castro, B. et al. Stress-induced reactive oxygen species compartmentalization, perception and signalling. Nat. Plants 7, 403–412 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mittler, R., Zandalinas, S. I., Fichman, Y. & Van Breusegem, F. Reactive oxygen species signalling in plant stress responses. Nat. Rev. Mol. Cell Biol. 23, 663–679 (2022).
Article CAS PubMed Google Scholar
Dat, J. et al. Dual action of the active oxygen species during plant stress responses. Cell. Mol. Life Sci. CMLS 57, 779–795 (2000).
Article CAS PubMed Google Scholar
Gill, S. S. & Tuteja, N. Reactive oxygen species and antioxidant machinery in abiotic stress tolerance in crop plants. Plant Physiol. Biochem. 48, 909–930 (2010).
Article CAS PubMed Google Scholar
Pozniak, C. J. et al. Physiological and molecular characterization of mutation-derived imidazolinone resistance in spring wheat. Crop Sci. 44, 1434–1443 (2004).
Article CAS Google Scholar
Zhang, Y., Chen, J. M., Miller, J. R. & Noland, T. L. Leaf chlorophyll content retrieval from airborne hyperspectral remote sensing imagery. Remote Sens. Environ. 112, 3234–3247 (2008).
Article ADS Google Scholar
Ņečajeva, J., Bleidere, M., Jansone, Z., Gailīte, A. & Ruņģis, D. Variability of seed germination and dormancy characteristics and genetic analysis of Latvian Avena fatua populations. Plants 10, 235 (2021).
Article PubMed PubMed Central Google Scholar
Paczos-Grzeda, E., Sowa, S., Koroluk, A. & Langdon, T. Characteristics of resistance to Puccinia coronata f. sp. avenae in Avena fatua. Plant Dis. 102, 2616–2624 (2018).
Article CAS PubMed Google Scholar
Qiu, J. et al. Diverse genetic mechanisms underlie worldwide convergent rice feralization. Genome Biol. 21, 70 (2020).
Article PubMed PubMed Central Google Scholar
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).
Article CAS PubMed PubMed Central Google Scholar
Durand, N. C. et al. Juicer provides a One-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Article CAS PubMed PubMed Central Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. https://doi.org/10.48550/arXiv.1303.3997 (2013).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
Article CAS PubMed PubMed Central Google Scholar
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 117, 9451–9457 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
Article PubMed PubMed Central Google Scholar
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinforma. 9, 18 (2008).
Article Google Scholar
Ou, S. & Jiang, N. LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2017).
Article PubMed PubMed Central Google Scholar
Tempel, S. Using and understanding repeatmasker. in Mobile Genetic Elements: Protocols and Genomic Applications (ed Bigot, Y.) 29–51 (Humana Press, Totowa, NJ, 2012).
Bao, W., Kojima, K. K. & Kohany, O. Repbase update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 6, 11 (2015).
Article PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M. & Waack, S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19, ii215–ii225 (2003).
Article PubMed Google Scholar
Gremme, G., Brendel, V., Sparks, M. & Kurtz, S. Engineering a software tool for gene structure prediction in higher organisms. Inf. Softw. Technol. 47, 965–978 (2005).
Article Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
Article CAS PubMed PubMed Central Google Scholar
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 9, R7 (2008).
Article PubMed PubMed Central Google Scholar
Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Article CAS PubMed PubMed Central Google Scholar
Aramaki, T. et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2019).
Article PubMed Central Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157 (2015).
Article PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article CAS PubMed Google Scholar
De Bie, T., Cristianini, N., Demuth, J. P. & Hahn, M. W. CAFE: a computational tool for the study of gene family evolution. Bioinformatics 22, 1269–1271 (2006).
Article PubMed Google Scholar
Xu, S. et al. Using clusterProfiler to characterize multiomics data. Nat. Protoc. 19, 3292–3320 (2024).
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49–e49 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Eddy, S. R. A new generation of homology search tools based on probabilistic inference. Genome Inf. 23, 205–211 (2009).
Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. et al. TBtools-II: a “one for all, all for one” bioinformatics platform for biological big-data mining. Mol. Plant 16, 1733–1742 (2023).
Article CAS PubMed Google Scholar
Li, P. et al. RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants. BMC Genom. 17, 852 (2016).
Article CAS Google Scholar
Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. (Austin) 6, 80–92 (2012).
Article CAS PubMed Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Article CAS PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065–1093 (2012).
Article PubMed PubMed Central Google Scholar
Marçais, G. et al. MUMmer4: a fast and versatile genome alignment system. PLOS Comput. Biol. 14, e1005944 (2018).
Article PubMed PubMed Central Google Scholar
Goel, M., Sun, H., Jiao, W.-B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).
Article PubMed PubMed Central Google Scholar
Goel, M. & Schneeberger, K. Plotsr: visualizing structural similarities and rearrangements between multiple genomes. Bioinformatics 38, 2922–2926 (2022).
Article CAS PubMed PubMed Central Google Scholar
Szpiech, Z. A. Selscan 2.0: scanning for sweeps in unphased data. Bioinformatics 40, btae006 (2024).
Chen, H., Patterson, N. & Reich, D. Population differentiation as a test for selective sweeps. Genome Res. 20, 393–402 (2010).
Article CAS PubMed PubMed Central Google Scholar
Adamczewski, K., Kierzek, R. & Matysiak, K. Wild oat (Avena fatua L.) biotypes resistant to acetolactate synthase and acetyl-CoA carboxylase inhibitors in Poland. Plant Soil Environ. 59, 432–437 (2013).
Article Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 1–21 (2014).
Article Google Scholar
Chen, S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta 2, e107 (2023).
Article CAS PubMed PubMed Central Google Scholar
Stark, R. & Brown, G. DiffBind: differential binding analysis of ChIP-Seq peak data. R package version 100, 2–21 (2011).
Ramírez, F. et al. DeepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160 (2016).
Article PubMed PubMed Central Google Scholar
Zhang, J. et al. Vacuum and co-cultivation agroinfiltration of (germinated) seeds results in tobacco rattle virus (TRV) mediated whole-plant virus-induced gene silencing (VIGS) in wheat and maize. Front. Plant Sci. 8, 393 (2017).
PubMed PubMed Central Google Scholar
Wei, W. et al. Mapping of powdery mildew resistance genes transferred to common wheat from wild emmer wheat revealed three functional Pm60 haplotypes. Crop J. 12, 540–548 (2024).
Article CAS Google Scholar
Tian, T. et al. Arabidopsis FAR-RED ELONGATED HYPOCOTYL3 integrates age and light signals to negatively regulate leaf senescence. Plant Cell 32, 1574–1588 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ishida, Y., Tsunashima, M., Hiei, Y. & Komari, T. Wheat (Triticum aestivum L.) transformation using immature embryos. Methods Mol. Biol. 1223, 189–198 (2015).
Article CAS PubMed Google Scholar
Liu, N. et al. Comprehensive co-expression network reveals the fine-tuning of AsHSFA2c in balancing drought tolerance and growth in oat. Commun. Biol. 8, 393 (2025).
Article CAS PubMed PubMed Central Google Scholar
Shi, K. et al. Efficient genetic transformation and genome editing via an Agrobacterium-mediated in commercial oat (Avena sativa L.) cultivars. J. Integrat. Plant Biol. https://doi.org/10.1111/jipb.13915 (2025).

Download references

Acknowledgements

We sincerely thank Dr. Wenzhao Xie from Huazhong Agricultural University for the help of revising our manuscript. This work was supported by the Natural Science Foundation of Hebei Province (grant no: C2023201074 and C2021201048 to H.D.), the Young Elite Scientists Sponsorship Program by CAST (grant YESS20210080 to H.D.), the National Natural Science Foundation of China (32100500 to H.D.), and the Interdisciplinary Research Program of Natural Science of Hebei University (grant no: 513201422004 to H.D.) and the National Natural Science Foundation of China (32401776 to N.L.).

Author information

These authors contributed equally: Jianan Liu, Ningkun Liu, Wei Yan, Xiaoying Hu, Meijia Wang.

Authors and Affiliations

College of Life Sciences, Institute of Life Science and Green Development, Hebei University, Baoding, Hebei, China
Jianan Liu, Ningkun Liu, Wei Yan, Xiaoying Hu, Meijia Wang, Wei Li, Yu Wang, Qingbin Sun, Longsheng Xing, Yange Yun, Yingying Li, Jinjiang Yan, Zhizhong Gong & Huilong Du
Hebei Basic Science Center for Biotic Interaction, Hebei University, Baoding, Hebei, China
Ningkun Liu, Xiaoying Hu, Wei Li, Qingbin Sun & Huilong Du
Hebei Innovation Center for Bioengineering and Biotechnology, Hebei University, Baoding, Hebei, China
Ningkun Liu, Xiaoying Hu, Wei Li & Qingbin Sun
Institute of Biodiversity Science, School of Life Sciences, Fudan University, Shanghai, China
Ruidong Qin
Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
Lin Ma & Xuemin Wang
State Key Laboratory of Plant Environmental Resilience, College of Biological Sciences, China Agricultural University, Beijing, China
Zhizhong Gong
Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Hongru Wang

Authors

Jianan Liu
View author publications
Search author on:PubMed Google Scholar
Ningkun Liu
View author publications
Search author on:PubMed Google Scholar
Wei Yan
View author publications
Search author on:PubMed Google Scholar
Xiaoying Hu
View author publications
Search author on:PubMed Google Scholar
Meijia Wang
View author publications
Search author on:PubMed Google Scholar
Ruidong Qin
View author publications
Search author on:PubMed Google Scholar
Lin Ma
View author publications
Search author on:PubMed Google Scholar
Wei Li
View author publications
Search author on:PubMed Google Scholar
Yu Wang
View author publications
Search author on:PubMed Google Scholar
Qingbin Sun
View author publications
Search author on:PubMed Google Scholar
Longsheng Xing
View author publications
Search author on:PubMed Google Scholar
Yange Yun
View author publications
Search author on:PubMed Google Scholar
Yingying Li
View author publications
Search author on:PubMed Google Scholar
Jinjiang Yan
View author publications
Search author on:PubMed Google Scholar
Xuemin Wang
View author publications
Search author on:PubMed Google Scholar
Zhizhong Gong
View author publications
Search author on:PubMed Google Scholar
Hongru Wang
View author publications
Search author on:PubMed Google Scholar
Huilong Du
View author publications
Search author on:PubMed Google Scholar

Contributions

H.D., H.W. and Z.G., conceived the project; H.D. supervised the project and designed the study. H.D., J.L., W.Y., Y.L., Q.S., and M.W. collected the A. fatua samples. J.L., X.H., W.Y., L.X., and M.W. sequenced and processed the raw data. X.H., J.L., Y.W., and M.W. assembled and annotated the genome. J.L., W.Y., W.L., and M.W. performed the phylogenetic and genome evolution analyses; J.L., W.Y., and M.W. conducted the transcriptome analysis; J.L., W.Y., and M.W. conducted ATAC-seq analysis. J.L., W.Y., R.Q. and W.L. performed the population genetics analysis. N.L., Y.Y., J.Y., L.M., and X.W. designed and performed the experiments. H.D. composed the outline of the manuscript. J.L., H.D., and X.H. wrote the manuscript.

Corresponding authors

Correspondence to Zhizhong Gong, Hongru Wang or Huilong Du.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Yuanying Peng and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Peer Review file (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Data 1 (download XLSX )

Supplementary Data 2 (download XLSX )

Supplementary Data 3 (download XLSX )

Supplementary Data 4 (download XLSX )

Supplementary Data 5 (download XLSX )

Supplementary Data 6 (download XLSX )

Supplementary Data 7 (download XLSX )

Supplementary Data 8 (download XLSX )

Supplementary Data 9 (download XLSX )

Reporting Summary (download PDF )

Source data

Source Data (download ZIP )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, J., Liu, N., Yan, W. et al. Reference genome and population genomic analyses reveal insight into herbicide tolerance in Avena fatua L.. Nat Commun 16, 9851 (2025). https://doi.org/10.1038/s41467-025-64825-w

Download citation

Received: 10 March 2025
Accepted: 26 September 2025
Published: 07 November 2025
Version of record: 07 November 2025
DOI: https://doi.org/10.1038/s41467-025-64825-w