Main

The telencephalon, the anterior part of the mammalian brain, encompasses key structures that function as command centres for hierarchically organized brain networks driving behaviour and cognition. It consists of the cerebral cortex (CTX; shell) and cerebral nuclei (CNU; core), derived from the developmental pallium and subpallium, respectively. The CTX consists of isocortex, hippocampal formation (HPF), olfactory areas (OLF) and cortical subplate (CTXsp), and the CNU consists of striatum (STR) and pallidum (PAL). Each structure contains functionally distinct subregions (together, over 100 regions; see Supplementary Table 1 for anatomical ontology from the Allen Mouse Brain Common Coordinate Framework version 3 (CCFv3; RRID: SCR_020999))4, each containing many cell types.

A general organizing principle of the telencephalon involves several parallel corticostriatopallidal circuit pathways5. In a highly simplified view, the dorsal pathway, involving the isocortex, caudate putamen (CP) and globus pallidus external and internal segments (GPe and GPi), mediates sensory and motor functions; the ventral pathways, from the prefrontal cortex to the nucleus accumbens (ACB) and ventral PAL (PALv), or from the amygdala to the bed nucleus of the stria terminalis (BST), regulate affective functions; and the hippocampal–septal pathway supports learning and cognition. These circuits rely on highly diverse glutamatergic and GABAergic neurons, of which the heterogeneity supports regional specialization. Understanding their molecular identity, spatial distribution and inter-relationships is key to understanding how brain functions emerge from this complexity.

Glutamatergic excitatory neurons are the dominant neuronal class of the CTX and arise from the ventricular and subventricular zones (SVZs) of the developing pallium. By contrast, most telencephalic GABAergic neurons originate from five subpallial progenitor domains: medial (MGE), caudal (CGE) and lateral (LGE) ganglionic eminences, septum and preoptic area (POA). These neurons are born embryonically and migrate along defined routes to disperse throughout the forebrain3,6,7. Many GABAergic neurons migrate to the pallium, where they intermingle with glutamatergic neurons. Cell fate specification in the progenitor domains is orchestrated by a combination of transcription factors (TFs) and morphogens6,7,8,9,10,11,12,13,14,15.

We conducted a systematic and in-depth analysis of the transcriptomic and spatial organization of GABAergic neuronal types in all regions of the mouse telencephalon and their developmental origins, using our high-resolution whole-brain atlas16 and an additional single-cell RNA-sequencing (scRNA-seq) dataset from prenatal and postnatal development. Our study identified a diverse set of distinct cell types and molecular gradients that collectively shape the cellular landscape that underlies the functions of various telencephalic regions and circuits. We defined a comprehensive set of TFs that delineate all major subclasses and supertypes of the adult-stage telencephalic GABAergic neurons, linking their expression to specific developmental regions. This enabled us to infer the developmental origins of these diverse neuronal types. The results reveal two prominent features of the telencephalic GABAergic neurons: (1) long-distance migration and dispersion as a common characteristic of nearly all classes of telencephalic GABAergic neurons; (2) differential extents of postnatal diversification between cortical and striatal, and septal, preoptic and most pallidal GABAergic neuronal types.

GABAergic neuronal types in the mouse telencephalon

In a previous study, we created a high-resolution transcriptomic and spatial cell-type atlas of the entire adult mouse brain, combining scRNA-seq and multiplexed error-robust fluorescence in situ hybridization (MERFISH) spatial transcriptomics16 (Supplementary Table 2). This atlas included four hierarchical levels of classification: 34 classes, 338 subclasses, 1,201 supertypes and 5,322 clusters. Neuronal cell types represented a large portion of the atlas, with 29 classes (85%), 315 subclasses (93%), 1,156 supertypes (96%) and 5,205 clusters (98%). However, that study provided a coarse, class/subclass level overview. Here we conduct a more in-depth analysis and introduce the most complete taxonomy to date of all of the GABAergic neuronal types in the entire telencephalon, at all levels of the hierarchy.

The telencephalic GABAergic taxonomy, defined as the subpallium GABA neighbourhood, includes subclasses 39–90 across seven classes: OB (olfactory bulb)–IMN (immature neurons) GABA, CTX–CGE GABA, CTX–MGE GABA, CNU–MGE GABA, CNU–LGE GABA, LSX (lateral septal complex) GABA and CNU–HYa (anterior hypothalamus) GABA (Fig. 1). These 52 subclasses encompass 284 supertypes and 1,051 clusters, containing 611,423 high-quality single-cell transcriptomes (Supplementary Table 3). We provide several visualizations, including dendrograms (Fig. 1a), uniform manifold approximation and projections (UMAPs; Fig. 1b–d) and constellation diagrams (Extended Data Fig. 1c,d) to depict the multidimensional relationships among different subclasses or supertypes.

Fig. 1: Transcriptomic taxonomy of telencephalic GABAergic neuronal types in mice.
Fig. 1: Transcriptomic taxonomy of telencephalic GABAergic neuronal types in mice.
Full size image

a, The transcriptomic taxonomy of 284 supertypes organized in a dendrogram (10xv2, n = 269,307 cells; 10xv3, n = 342,116 cells). At each class or subclass branch, the top marker genes are shown. From the top down, the bar plots represent major neurotransmitter (NT) type and region distribution of profiled cells. NA, not applicable. bd, UMAP representation of all cell types coloured by subclass (b), supertype (c) and the dissection region (d). e, Representative MERFISH sections of the adult mouse brain across forebrain structures coloured by cell class. Each class is labelled by its ID and shown in the same colour as in the dendrogram in a. The triangular schematic denotes the hierarchical level shown in the panel. C, class. All class, subclass and region abbreviations are listed in Supplementary Table 1.

Using MERFISH data registered to the Allen Mouse Brain CCFv3 (Fig. 1e), we determined the distribution of GABAergic supertypes across telencephalic regions (Extended Data Fig. 1a). Most supertypes span neighbouring regions, except those in the LSX and the main (MOB) and accessory (AOB) olfactory bulb, which are region specific. Gini and Shannon indices revealed strong spatial localization, with the exception of more widely distributed types in the isocortex (Extended Data Fig. 1a).

As reported previously16, a vast majority of the clusters are purely GABAergic (Fig. 1a and Supplementary Table 3). However, we also identified glutamate–GABA co-releasing types expressing Slc17a8 in 11 clusters across several subclasses. We identified 11 cholinergic neuronal clusters in subclass 58 (PAL–STR Gaba–Chol), with complex GABA and/or glutamate co-release patterns (see below), and 2 GABA–cholinergic clusters in subclass 69 (LSX Nkx2-1 Gaba) (Supplementary Table 3). We also identified four GABA–dopamine co-releasing clusters in subclass 44 (OB Dopa–Gaba).

We identified 26 neuropeptide genes that are differentially expressed across telencephalic GABAergic types (Supplementary Table 3), many of which have been previously used as cell-type markers. Most exhibit restricted expression, such as Edn1, which is largely confined to subclass 52 (Pvalb Gaba). Others, like Penk and Pnoc, are more broadly expressed across subclasses within each GABAergic class. These expression patterns further support the transcriptomic identity and potential functional roles of specific cell types.

GABAergic neuronal types in the olfactory bulb

Olfactory sensory neurons transmit odour information to the OB, where neurons with the same odorant receptor converge on specific glomeruli. These neurons synapse with interneurons and excitatory mitral and tufted cells, which then project to various brain regions for further processing. The OB has an exceptionally high ratio of inhibitory to excitatory neurons compared to, for example, the isocortex.

GABAergic neuronal types in the MOB and AOB mostly originate from the LGE and migrate to the OB during development11. These neurons continue to be replaced in the adult through neurogenesis in the SVZ and migration through the rostral migratory stream17. In the OB–IMN GABA class, we defined six OB GABAergic subclasses (39 to 45; Extended Data Fig. 2).

Subclass 45 (OB–STR–CTX Inh IMN), which includes seven supertypes, contains immature neurons from the SVZ that migrate to the OB (Extended Data Figs. 2a–c,k and 3a,b). Supertypes 166, 168, 171 and 172 are the most immature based on gene expression, spatial location and pseudotemporal ordering16,18. The remaining three supertypes appear transitional, resembling mature subclasses (Extended Data Fig. 2a–c). Supertype 170 forms a transition from immature neurons to subclass 39 (OB Meis2 Thsd7b Gaba), which is found mostly in the internal plexiform and mitral (Ipl/Mi), external plexiform (EPl) and glomerular (Gl) layers (Extended Data Fig. 2a,b,d,e). Within subclass 39, supertype 143 represents Calb1+ Blanes cells19 populating the Gl layer (Extended Data Fig. 2e). Supertype 167 forms a transition to subclass 41 (OB–in Frmd7 Gaba), representing granule cells in the granule layer (GrO), Ipl/Mi and partly EPl. Supertype 150 populates the EPl and includes Gl cells modulated by olfactory activity20 (Extended Data Figs. 2a,b,d,j and 3a). Lastly, immature supertype 169 forms a transition to subclass 42 (OB–out Frmd7 Gaba), which consists of Calr+ periglomerular cells (PGCs) populating the Gl layer (Extended Data Fig. 2a,b,d,g).

Subclass 40 (OB Trdn Gaba) contains the population of Rprm+ granule cells20 (Extended Data Fig. 2c,f) and subclass 43 (OB−mi Frmd7 Gaba) represents Pvalb+ OB neurons that are postnatally generated neurons populating the EPl19 (Extended Data Fig. 2c,h). The Gl layer, where sensory input is first processed, contains several interneuron populations that can be broadly divided into three categories, Calb1+ PGCs from subclass 39, Calr+ PGCs from subclass 42, and the Slc6a3+ and Th+ dopaminergic PGCs from subclass 44 (Extended Data Figs. 2c,e,g,i and 3a).

GABAergic neuronal types in the cerebral cortex

The CTX (pallium) comprises four major structures: the isocortex, HPF, OLF and CTXsp. GABAergic neurons across these regions fall into two main classes, CTX–CGE GABA (Extended Data Fig. 4) and CTX–MGE GABA (Fig. 2), named after their spatial location and predominant developmental origins. Notably, around 10% of neurons in these classes arise from the embryonic POA7,14,21. The current classification aligns with our previous transcriptomic taxonomy of the mouse isocortex and HPF22, as well as MET-types in Patch-seq studies of the mouse visual cortex23 (Extended Data Fig. 5), and extends the framework into the OLF and CTXsp areas.

Fig. 2: MGE-derived GABAergic neuronal types in the CTX and CNU.
Fig. 2: MGE-derived GABAergic neuronal types in the CTX and CNU.
Full size image

ac, UMAP representation of all MGE cells coloured by subclass (a), supertype (b) or the broad brain region (c). d, Constellation plot of MGE clusters using UMAP coordinates shown in b. The nodes are clusters coloured by supertype (ST) and grouped in bubbles by subclass. The lines around the subclass bubbles denote the two classes—CTX–MGE GABA and CNU–MGE GABA. em, Representative MERFISH sections showing the location of supertypes in MGE subclasses 50 (Lamp5 Lhx6 Gaba) (e), 51 (Pvalb chandelier Gaba) (f), 52 (Pvalb Gaba) (g), 53 (Sst Gaba) (b), 54 (STR Prox1 Lhx6 Gaba) (i), 55 (STRv Lhx8 Gaba) (j), 56 (Sst Chodl Gaba) (k), 57 (NDB–SI–MA–STRv Lhx8 Gaba) (l) and 58 PAL–STR Gaba–Chol (m). Cells are coloured and labelled by supertype. The triangular schematic denotes the most granular hierarchical level shown in the panels. All class, subclass and region abbreviations are listed in Supplementary Table 1.

In the CTX–CGE GABA class, we defined four subclasses: three known (46, Vip Gaba; 47, Sncg Gaba; and 49, Lamp5 Gaba)22, and one newly defined (48, retrohippocampus (RHP)–COA Ndnf Gaba) (Extended Data Fig. 4a–g). The Vip+, Sncg+ and Lamp5+ subclasses correspond to the well-known bipolar, multipolar, neurogliaform and other types of GABAergic interneurons found throughout isocortex and HPF, and also in OLF and CTXsp (Extended Data Fig. 4c–g). Vip clusters in supertypes 179, 181 and 182 are largely hippocampus (HIP) specific (Extended Data Fig. 4b–d). Most GABAergic neurons in the CTX are local interneurons, except for a few long-range projecting (LRP) types24,25. The LRP types in the HIP are among the best described cortical LRP neurons. On the basis of the expression of genes, including Chrna4, Pcp4, Nos1 and Htr3a, supertype 179, which is located in the HIP, was identified as a potential LRP population26.

The newly defined subclass 48 (RHP–COA Ndnf Gaba) expresses Ndnf and Ntng1, is enriched in the HPF, OLF and CTXsp, and shows strong regional specificity. Supertypes 195 and 198 localize to the entorhinal cortex (ENT) and RHP, respectively; supertype 194 localizes to the cortical amygdalar area (COA); and the remaining supertypes localize to the HIP (Extended Data Fig. 4c,f). This subclass includes the Meis2+ GABAergic population in supertype 198, previously described as originating from the embryonic pallial–subpallial boundary and residing in white matter22,27 (Extended Data Fig. 4a–c). Supertype 198 is the only cortical GABAergic cell type expressing Meis2 (Extended Data Fig. 4c). Subclass 48 (RHP–COA Ndnf Gaba) lacks Prox1, in contrast to other CGE-derived GABAergic neurons, but expresses the CGE markers Nr2f2 and Htr3a and the LGE marker Sp9, suggesting an origin at the posterior LGE–anterior CGE boundary.

In the CTX–MGE GABA class, we defined four subclasses: three previously defined ones22, subclass 53 (Sst Gaba), subclass 52 (Pvalb Gaba) and subclass 51 (Pvalb chandelier Gaba), and a newly reclassified type, subclass 50 (Lamp5 Lhx6 Gaba). The latter was previously part of the CGE-derived Lamp5+ subclass but is now assigned to the MGE class on the basis of expression of the MGE-specific TF Lhx6 (Fig. 2a–d). The Sst+, Pvalb+ and chandelier subclasses correspond to known interneuron types distributed across the isocortex and HPF, and we now show that they are also present in the OLF and CTXsp regions (Fig. 2f–h and Extended Data Fig. 6a).

Two Pvalb+ chandelier clusters were identified: one that is broadly distributed, and another that is mostly HIP specific. Among nine Pvalb+ supertypes, four are isocortex predominant, and two are enriched in the OLF, CTXsp and HPF (Extended Data Fig. 6a). The Sst+ subclass is highly diverse, with 19 supertypes and 71 clusters showing variable regional preferences. Seven supertypes are enriched in the OLF, CTXsp and HPF (Extended Data Fig. 6a (red dots)), while others are broadly present in the isocortex. Several Pvalb+ and Sst+ supertypes (such as, 209, 216, 219, 228 and 232) are HIP specific. (Fig. 2g,h and Extended Data Fig. 6a). The Lamp5+Lhx6+ subclass is similarly enriched in the HIP with a minor presence in the isocortex (Fig. 2e and Extended Data Fig. 6a).

Sst+ supertypes 215 and 216 probably correspond to LRP neurons expressing Sst, Npy and Nos1 (ref. 25), localized to the OLF, CTXsp and HPF, respectively (Fig. 2i). These match LRP3 and LRP4 types described previously28, which align with CTX–HPF type 87 Sst Etv1 (ref. 22) and show a mixed transcriptional profile characteristic of both interneurons and LRP neurons28 (Extended Data Fig. 5b).

Overall, CTX–CGE and CTX–MGE GABAergic neuron types in the CTX display clear spatial segregation between the isocortex and the other structures that are considered evolutionarily more ancient, including the HPF, OLF and CTXsp. Some types are situated in transition zones, for example, Vip+ supertype 177 is mostly located in the isocortex, is less dense in the retrosplenial cortex (RSP) and is sparse in the HIP (Extended Data Fig. 4d).

MGE-derived GABAergic types in the cerebral nuclei

MGE-derived GABAergic neurons are widely distributed and fall into two main classes: CTX–MGE, located in pallial areas, and CNU–MGE, mainly in the CNU. Both classes express Lhx6, a developmental pan-MGE marker. CNU–MGE GABAergic neurons form smaller, more heterogeneous clusters than those in the CTX–MGE class, highlighting a marked difference in complexity between the two (Fig. 2a–d).

STR and PAL in the CNU are key components of the basal ganglia, a group of interconnected subcortical nuclei involved in motor behaviour, learning and cognition29,30. These regions are predominantly composed of GABAergic neurons, which fall into four transcriptomic classes (Fig. 1). The CNU–MGE GABA and CNU–LGE GABA classes include striatal and pallidal neurons derived from the MGE and LGE, respectively. The LSX GABA class contains lateral septum neurons derived from the embryonic septum31. The CNU–HYa GABA class includes GABAergic neurons located in the STR-like amygdalar nuclei (sAMY), caudal PAL (PALc) and POA of the HYa (Extended Data Fig. 1a); these probably arise from multiple developmental origins, including the LGE, MGE and embryonic POA.

In the CNU–MGE class, we defined five subclasses: 54 (STR Prox1 Lhx6 Gaba), 55 (STR Lhx8 Gaba), 56 (Sst Chodl Gaba), 57 (NDB–SI–MA–STRv Lhx8 Gaba) and 58 (PAL–STR Gaba–Chol) (Fig. 2a,d and Extended Data Fig. 6a). These subclasses span multiple regions, including the dorsal and ventral STR (STRd and STRv) and dorsal, ventral and medial PAL (PALd, PALv, PALm), with no single subclass confined to a specific region (Fig. 2i–m). Although local interneurons make up less than 10% of the striatal neurons, they are molecularly diverse32. Subclasses 54 (Pvalb+) and 55 (Pvalb), located in the STRd and STRv, probably represent striatal GABAergic interneurons (Fig. 2i,j and Extended Data Figs. 3d and 6a). Subclass 56 (Sst Chodl Gaba) is notable for spanning both pallial and subpallial structures. It includes isocortical Sst+Chodl+ cells (supertype 241, the cortical Sst+Chodl+ cells22,33), CTXsp (supertype 242) and Sst+ interneurons in the STR and PAL (supertypes 238–240 and 243) (Fig. 2d,k and Extended Data Figs. 1a and 6a). These findings indicate that the previously identified cortical Sst+Chodl+ LRP neurons23,28,33 are transcriptionally related to striatal Sst+ interneurons, all classified within subclass 56.

Subclass 58 (PAL–STR Gaba–Chol) includes basal forebrain cholinergic neurons, divided into three supertypes (Fig. 2a,d,m and Extended Data Figs. 6a and 7). Supertypes 259 and 260 contain cholinergic neurons, whereas supertype 261 includes closely related GABAergic neurons primarily from the medial septum (MS) and diagonal band nucleus (NDB). Supertype 259 represents cholinergic projection neurons that co-release GABA and, in some cases, glutamate (Supplementary Table 3), and show spatially specific clustering: MS and NDB (clusters 923–925), substantia innominata (SI) and NDB (clusters 926 and 928) or GPe and SI (cluster 927) (Extended Data Fig. 7b). Supertype 260 includes striatal cholinergic interneurons located in the CP, ACB and olfactory tubercle (OT), which may co-release glutamate through Slc17a8 but do not co-release GABA (Extended Data Fig. 7c,e and Supplementary Table 3). Moreover, supertype 307 of subclass 69 (LSX Nkx2-1 Gaba) contains clusters 1081 and 1084, which are both GABAergic and cholinergic and are located in the lateral septum (Extended Data Fig. 7a,d,e).

LGE-derived GABAergic types in the cerebral nuclei

The CNU–LGE GABA class consists of seven LGE-derived subclasses (59–65) (Fig. 3 and Extended Data Fig. 6b). These subclasses resemble the well-known D1- and D2-type medium spiny neurons (MSNs; also known as spiny projection neurons) in the STR, with subclasses 61 (STR D1 Gaba) and 62 (STR D2 Gaba) being the prototypic striatal D1 and D2 MSNs and the other subclasses being newly defined homologous cell types. Among them, subclasses 60, 63 and 64 also express the dopamine receptor gene Drd1, and subclass 60 additionally expresses Drd3 strongly.

Fig. 3: LGE-derived GABAergic neuronal types of the CNU.
Fig. 3: LGE-derived GABAergic neuronal types of the CNU.
Full size image

a, Constellation plot of LGE clusters using UMAP coordinates shown in b. Nodes are labelled by cluster ID, coloured by supertype and grouped in bubbles by subclass. b, UMAP representation of all LGE cells coloured by supertype. cg, Representative MERFISH sections showing the location of LGE subclasses 59 (GPe Sox6 Cyp26b1 Gaba) (c), 63 (STR D1 Sema5a Gaba) (d), 64 (STR–PAL Chst9 Gaba) (e), 60 (OT D3 Folh1 Gaba) (f) and 65 (IA Mgp Gaba) (g). Cells are coloured and labelled by supertype. h,i, Representative MERFISH sections showing the location of CNU–LGE subclasses 61 (STR D1 Gaba) (h) and 62 (STR D1 Gaba) (i). Each box contains one supertype and cells are labelled and coloured by cluster to highlight the diversity. The triangular schematics denote the most granular hierarchical level shown in cg (supertype) and h and i (cluster (CL)). BLA, basolateral amygdalar nucleus; BMA, basomedial amygdalar nucleus; CLA, claustrum. All class, subclass and region abbreviations are listed in Supplementary Table 1.

Subclass 59 (GPe–SI Sox6 Cyp26b1 Gaba) is predominantly localized to the GPe, with some cells observed in the SI (Fig. 3c). The GPe—a part of the basal ganglia involved in action control, decision-making and reward—contains two main types of GABAergic neurons: prototypical and arkypallidal34. The prototypical neurons mostly express the TF gene Nkx2‐1, exhibit a fast firing rate and project to the subthalamic nucleus. Supertype 247 of the MGE-derived subclass 57 (NDB–SI–MA–STRv Lhx8 Gaba) is located in the GPe and corresponds to these Nkx2-1+ and Lhx6+ neurons (Fig. 2l and Extended Data Fig. 6a). By contrast, arkypallidal neurons, characterized by the expression of Penk and Foxp2, have a low firing rate and project back to the STR34,35. Supertype 257 of the LGE-derived subclass 59 (GPe–SI Sox6 Cyp26b1 Gaba) probably represents these Penk+ and Foxp2+ GPe arkypallidal neurons (Fig. 3c and Extended Data Fig. 6b).

Subclass 60 (OT D3 Folh1 Gaba), which strongly expresses Drd3 alongside Drd1, is specifically localized in the Islands of Calleja in the OT (Fig. 3f and Extended Data Fig. 3c,d). Subclasses 63 (STR D1 Sema5a Gaba), 64 (STR–PAL Chst9 Gaba) and 65 (IA Mgp Gaba) collectively form a new group of GABAergic neuronal types that resemble MSNs based on their gene expression profiles but are distinct based on their spatial locations (Fig. 3d,e,g). The spatial distribution of many supertypes and clusters in these subclasses is unique, scattered along the borders of different striatal and pallidal regions, forming distinct patches or streaks.

Supertype 281 within subclass 63 (STR D1 Sema5a Gaba) is found in the STRd and contains hybrid MSNs that show robust expression of Drd1 and modest expression of Drd2 (Fig. 3d and Extended Data Fig. 6b). Within cluster 991 of supertype 281, 32% of cells express both Drd1 and Drd2 above a threshold of log2[counts per million (CPM)] > 3. However, the gene expression signature of these Drd1 and Drd2 co-expressing cells is insufficient to define a separate cluster. These neurons may be those described before as co-expressing Drd1a and a shortened variant of the D2 receptor36. These neurons resemble the exo-patch MSNs, which are located in the striatal matrix but physiologically resemble the MSNs from patches32,36,37 (Extended Data Fig. 3c,d). We also identified a small proportion of Drd1/Drd2-co-expressing neurons in subclasses 61 (8.3%) and 62 (8.8%).

Spatial patterns suggest that subclasses 64 (STR–PAL Chst9 Gaba) and 65 (IA Mgp Gaba) reside in the interstitial nucleus of the posterior limb of the anterior commissure (described in the Paxinos’s atlas38) or intercalated amygdalar nucleus (IA) (Fig. 3e,g). Subclass 65 includes five supertypes, all expressing Foxp2 and Tshz1, markers of Sp8+ dLGE-derived intercalated cells (ITCs)39,40 (Extended Data Fig. 6b). Subclass 65 (IA Mgp Gaba) shows similarity to subclass 39 (OB Meis2 Thsd7b Gaba) (Fig. 1b,c,e,f), hinting at shared developmental origins. Within subclass 65, supertype 293 stands out. On the basis of the dissection information, these cells were isolated from the OLF (48%) and CTXsp (36%) (Extended Data Fig. 6b). MERFISH data show that cells in supertype 293 are located in the MOB, anterior olfactory nucleus and, sparsely, in the IA (Fig. 3g). A previous STICR lineage-tracing study showed that the ITCs of the amygdala are clonally related to OB interneurons and MSNs in the STR41.

We defined 9 and 7 supertypes in subclasses 61 (STR D1 Gaba) and 62 (STR D2 Gaba), respectively. These supertypes and their clusters show diverse spatial patterns across the STRd and STRv (Fig. 3h,i). Some span broad areas while others are confined to specific medial–lateral, anterior–posterior or dorsal–ventral subdomains. Paired D1 and D2 clusters often co-localize. Clusters D1 951 and D2 982 are located in the lateral CP, while clusters D1 950 and D2 981 are in the medial CP. Clusters D1 960 and D2 970 are located in the posterior tail of the CP, while clusters D1 957 and D2 983 are in the posterior ventral tip of the CP. Within subclass 61 (STR D1 Gaba), supertype 268 is enriched in the striatal patches and striosomes and supertype 267 is enriched in the matrix (Fig. 3h). Similarly, in subclass 62, supertype 279 is striosome enriched and supertype 277 is matrix enriched (Fig. 3i).

We observed complex subregional enrichment of cell types in the ACB. D1 supertypes 265, 266, 270, 272 and 273, and D2 supertypes 275 and 278 are mostly restricted to the ACB and OT (Fig. 3h,i). The ACB can be divided into core and shell subregions. D1 supertypes 272 and 273 and D2 supertype 278 are predominantly located in the core. D1 supertype 266 has two clusters: 948 in the core and 949 in the shell (Fig. 3h). D2 supertype 275 includes clusters 973 and 974 in the core and the others in the shell or OT (Fig. 3i). The ACB core and shell can be further subdivided along mediolateral and anteroposterior axes. In the D1 272 and D2 278 supertypes, clusters D1 963 and D2 985 are located in the medial–anterior subdomain of the ACB core, while clusters D1 961 and D2 983 are in a more lateral–posterior position (Fig. 3h,i). In the ACB shell, D1 supertypes 265 and 270 occupy medial–anterior and lateral–posterior subdomains, respectively, and, within the D2 275 supertype, clusters 977 and 975 show a similar pattern. Despite these enrichments, many cell types overlap across ACB subregions, as previously reported32.

GABAergic neuronal types in the lateral septum

The LSX GABA class is uniquely localized to the LSX and is distinct from other CNU GABAergic types. It contains six complex and intertwined subclasses (67–72) with partially overlapping spatial patterns (Extended Data Fig. 8a–j). For example, subclasses 67 and 68 are found in both the rostral and ventral LSX (LSr and LSv), with subclass 68 also extending dorsally (Extended Data Fig. 8d–f). Similarly, subclasses 69 and 70 overlap in the LSr and LSv, with subclass 70 extending further into the anterior and posterior LSX (Extended Data Fig. 8d, g, h).

GABAergic neuronal types in the sAMY and POA

The CNU–HYa GABA class is highly complex, with 19 subclasses that are predominantly localized in the CNU but also extending into the POA of the HYa (subclasses 66 and 73–90; Extended Data Fig. 9). Neurons in this class are located in specific sAMY and PAL regions, including the central amygdalar nucleus (CEA), medial amygdalar nucleus (MEA), anterior amygdalar area (AAA), BST, SI and magnocellular nucleus (MA) (Fig. 4a and Extended Data Fig. 9d). Each subclass within the CNU–HYa GABA class is not specific to a single region but contains neurons from multiple regions, for example, subclass 80 contains neurons from the CEA, AAA and BST (Fig. 4a,c and Extended Data Fig. 9d). Conversely, each region contains multiple subclasses, for example, subclasses 74, 79, 80 and 82 are co-localized in the CEA, AAA and BST, while subclasses 78 and 84 are co-localized in the SI, NDB and MA.

Fig. 4: Organization of GABAergic neuronal types across the sAMY and BST.
Fig. 4: Organization of GABAergic neuronal types across the sAMY and BST.
Full size image

a, Representative MERFISH sections showing the locations of subclasses belonging to the CNU–HYa GABA class. Cells are coloured and labelled by subclass (SC). A, anterior; P, posterior. b,c, Representative MERFISH sections showing the locations of cells belonging to the MEA–BST subclasses (b) and CEA–BST subclasses (c). Each row shows one subclass, and cells are coloured and labelled by supertype. Within each subclass, each supertype’s locations in both the MEA and BST (b) or both the CEA and BST (c) are shown, indicating the existence of the same cell type in these locations. df, Spatial domain clustering within BST neurons using BANKSY. The alluvial plot (d) shows the relationship between subclasses present in the BST and the spatial domains. Spatial domains corresponding to subclasses that are shared with MEA or CEA are labelled as MEA domains or CEA domains, respectively. Representative MERFISH images show the location of the spatial domains, with cells coloured by spatial domain identity (e) and the area covered by each domain (f). Triangular schematics denote the most granular hierarchical level shown in the panels. All class, subclass and region abbreviations are listed in Supplementary Table 1.

BST functions as a hub for processing limbic information and monitoring emotional valence and is at the centre of a vast connectivity network. It regulates mood and arousal through sAMY, dorsal raphe and VTA connections, and monitors feeding, drinking and reproductive behaviours through brainstem and hypothalamic inputs, as well as LSX and MEA projections. These diverse functions correspond to distinct cell types. As part of the extended amygdala, BST is a key output region for CEA and MEA neurons42,43. Transcriptomic analysis reveals strong similarity between the BST, CEA and MEA, with each region marked by distinct gene signatures (Extended Data Fig. 9d).

Types within subclasses 73, 74, 75, 76 and 88 are mostly located in the MEA and/or BST (Figs. 1a,e and 4b and Extended Data Fig. 9d), probably sharing developmental origins based on conserved TF expression (see below). These MEA–BST types are transcriptomically similar to subclasses 85–87, 89 and 90 in the POA, which are linked by shared circuits and functions44,45,46 (Fig. 1e and Extended Data Fig. 9d). The MEA integrates olfactory and vomeronasal inputs and has a central role in regulating social and reproductive behaviours47. For example, subclasses 74, 75 and 76 express the posterodorsal MEA marker Lhx6 (ref. 48,49) and neurons in this region are involved in reproductive behaviours47,49. Supertypes 349 and 351 (subclass 74) express Crhr2 and Ucn3 (Extended Data Fig. 9d), markers of stress-responsive neurons50. Supertypes 357–359 (subclass 76) include the BNSTprTac1/Esr1 type, which is known to regulate male social behaviour46 (Extended Data Fig. 3e).

Subclasses 77–84 are primarily located in the CEA, BST, AAA and SI (Extended Data Fig. 9d), show similarity to striatal CNU–LGE subclasses (Figs. 1e and 4a,c) and may share developmental origins based on conserved TF expression. The CEA is a STR-like GABAergic structure containing both interneurons and long-range projection neurons and drives fear responses through projections to the hypothalamus and brainstem51. Most supertypes in subclass 79 (CEA–AAA–BST Six3 Cyp26b1 Gaba) are located in the lateral and capsular parts of CEA (CEAl and CEAc), express the known MSN markers Penk, Pax6, Gpr88 and Ppp1r1b36,37,48, and are related to subclass 62 (STR D2 Gaba) (Figs. 1e,f and 4c and Extended Data Fig. 9d). Supertype 371 in subclass 79, located mostly in the CEAc, is transcriptomically and spatially similar to D2 supertypes 274 and 280 (Figs. 1e,f, 3i and 4c). Supertype 368 corresponds to the previously described CeA Prkcd-Ezr type that is highly responsive to cued fear conditioning48 (Extended Data Fig. 3f). Subclasses 77 (CEA–BST Gal Avp Gaba), 82 (CEA–BST Ebf1 Pdyn Gaba) and 83 (CEA–BST Rai14 Pdyn Crh Gaba) are mainly located in the medial part of CEA (CEAm; Fig. 4c).

We extracted all BST neurons from the MERFISH data and performed spatial clustering based on gene expression (500-gene panel) and spatial proximity (Methods). This revealed distinct domains, several of which align with known BST subdivisions and with MEA–BST and CEA–BST subclasses (Fig. 4d). BST subdivisions have been proposed along mediolateral, anteroposterior and dorsoventral axes based on cytoarchitecture, developmental origin and monoaminergic innervation42,52. MEA primarily projects to posteromedial BST42, where MEA–BST subclasses (73–76) are enriched, while CEA projects to the anterolateral BST53, where CEA–BST subclasses (77 and 79–83) are located (Fig. 4e,f). Thus, neurons of the same supertype may migrate to distant yet interconnected regions such as the CEA and anterolateral BST or MEA and posteromedial BST.

Gene expression gradients in telencephalic regions

MGE cortical gradient

During adulthood, CGE-derived GABAergic neurons in subclasses 46 (Vip Gaba) and 47 (Sncg Gaba) are evenly distributed in deep and superficial cortical layers and show minimal spatial gene expression gradients (Extended Data Figs. 1b and 4d,e). By contrast, MGE-derived Pvalb+ and Sst+ neurons exhibit distinct laminar distributions. The birthdate of neurons has been linked to laminar patterning whereby MGE-derived GABAergic neurons are generated in an inside-out pattern similar to their excitatory counterparts54,55. However, this is only part of a more complex inside-outside-in pattern formation resulting in the distinct laminar distribution seen in Pvalb+ and Sst+ neurons56. Sst+ GABAergic neurons are among the first types to diversify and mature and are more abundant in infragranular than in supragranular layers of the isocortex, while Pvalb+ GABAergic neurons can be found throughout all layers except for layer 1 (Fig. 2g,h and Extended Data Fig. 10a–c).

We performed independent component analysis (ICA) separately on subclasses 52 (Pvalb Gaba) and 53 (Sst Gaba), projecting scRNA-seq data onto MERFISH data and identifying top loading genes driving the strongest spatial gradients (Methods). We found 20 genes driving a spatial gradient along the cortical depth in the Pvalb+ subclass and 45 such genes in the Sst+ subclass (Extended Data Fig. 10d–g). Among these genes, only six genes are shared between the subclasses (Gm32647, Il1rapl2, Col25a1, Cnr1, Nkain3 and Parm1), indicating that the observed gradient is unique for each subclass. And only three genes (Rbp4, Pdyn and Ndst3) are subclass or supertype markers, suggesting that the gradients are largely independent of supertype diversity within each subclass.

Homology and gradients in MSN populations

GABAergic neuron subclasses 61 (STR D1) and 62 (STR D2) exhibit highly similar spatial patterns (Fig. 3h,i). The constellation plot shows pairs of D1 and D2 clusters with the highest level of transcriptome similarity (Methods and Extended Data Fig. 11a). MERFISH analysis validated co-localized spatial distributions for five closely related D1–D2 cluster pairs (Extended Data Fig. 11b–f). ICA on D1 and D2 subclasses, combined with scRNA-seq projection onto MERFISH data, identified two gene modules forming a dorsolateral-to-ventromedial gradient (Methods and Extended Data Fig. 11g,h), with gene module scores shown on the scRNA-seq UMAP and representative MERFISH sections (Extended Data Fig. 11i–p). Dorsolateral D1/D2 types are marked by expression of genes encoding various neuroactive receptors, such as Grm8, Htr2a and Cnr, while ventromedial types express genes encoding proteins involved in cGMP–PKG signalling, including Csgalnact1, Prkg1 and Slc8a1 (Extended Data Fig. 11g,h).

Gradients in lateral septum

The LSX—a basal forebrain structure—integrates cortical and subcortical inputs and relays signals to hypothalamic and midbrain nuclei, playing a key part in social behaviours such as anxiety and aggression. Most neurons in the LSX are GABAergic and contain receptors for a variety of neuromodulators and neuropeptides (Supplementary Table 3). LSX subclasses exhibit overlapping spatial distributions, and transcriptomic subtypes are not confined to single LSX segments. Using scRNA-seq imputed into MERFISH space, we identified five major spatial gradients (Methods and Extended Data Fig. 12). Gene modules driving these gradients often cross subclass and supertype boundaries, with only two subclass markers, Zeb2 and Six3, and just one supertype marker, Foxp2 (Extended Data Fig. 12a). The strongest spatial gradients in the LSX represent dorsoventral or mediolateral gradients but no strong anteroposterior gradient exists (Extended Data Fig. 12c). Although LSX input domains poorly match its molecular structure, molecular patterns align with projection targets57,58. For example, module 3 includes Foxp2 and Ndst4, genes that are enriched in a subregion projecting to the medial and lateral POA (MPO and LPO)57,58. These findings suggest that LSX cell types are arranged along multidimensional gradients that may reflect both input and output pathways.

Persistent developmental signatures

The telencephalic GABAergic neurons arise mostly from the five principal progenitor domains of the subpallium and from there migrate to populate various regions of the telencephalon. We identified a comprehensive set of TF marker genes defining the adult neuronal types at the class and subclass levels (Fig. 5a).

Fig. 5: TFs and developmental trajectories of telencephalic GABAergic neurons.
Fig. 5: TFs and developmental trajectories of telencephalic GABAergic neurons.
Full size image

a, Expression of key TFs in each supertype in the adult subpallium GABAergic neuron taxonomy tree. Genes labelled in red have exemplary developmental in situ hybridization (ISH) images shown in Extended Data Fig. 15b. be, UMAP representation of all cell types across the developmental time course from E7 to P56, coloured by age (b), class (c), adult subclass (d) and major developmental lineage gene markers (e). In this global UMAP, only the 10xv3 cells are included for the P56 timepoint. All class, subclass and region abbreviations are listed in Supplementary Table 1.

To trace the developmental origins of telencephalic GABAergic cell types, we generated scRNA-seq data across nine stages (embryonic day 11.5 (E11.5) to postnatal day 14 (P14)) and incorporated eight external datasets spanning E7–P2 (refs. 31,59,60,61,62,63) (Supplementary Tables 4 and 5). After stringent quality control and selection of subpallium GABA neurons, we compiled a developmental dataset of 614,569 cells, 387,148 in house (Allen Institute for Brain Science, AIBS) and 227,421 external sources (Supplementary Tables 4 and 5 and Extended Data Fig. 13). This dataset was integrated with adult P56 telencephalic GABAergic neurons (10xv3 only) to visualize gene expression dynamics across time (Fig. 5b). We inferred progenitor origins for adult classes and subclasses (Fig. 5c,d) by label transfer to developmental cells and subclass-specific de novo clustering (Methods). The label accuracy was validated against the original dataset annotations (Extended Data Fig. 14).

We established a developmental telencephalic GABAergic neuronal type taxonomy (E7–P14) with 1,688 clusters within 268 supertypes, 62 subclasses and 13 classes (Supplementary Table 6). Of these, 1,525 clusters, containing all postnatal developmental cells, were assigned with the adult cell-type identities through label transfer, recovering 242 out of the 284 adult supertypes and all adult subclasses and classes. The remaining 163 clusters were assigned developmental identities: 6 classes (51, radial glial cell (RGC); 52, neuroblast; 53, CGE immature; 54, MGE immature; 55, LGE immature; 56, CNU–HYa immature), 10 subclasses and 50 supertypes. The RGC class, probably representing SVZ radial glial cells, is closely related to OB–IMN GABA and comprises mainly early postnatal cells. The neuroblast class includes early embryonic progenitors across domains and may give rise to multiple neuronal lineages (Fig. 5b,c).

GABAergic neurons from the five progenitor domains are identifiable early by distinct gene expression patterns (Fig. 5e and Extended Data Figs. 15a and 16), with spatial TF specificity verified using the developing mouse ISH atlas64 and the P56 MERFISH dataset (Extended Data Fig. 15b).

There is considerable gene expression heterogeneity in the newly defined immature classes. Both classes 54 (MGE immature) and 55 (LGE immature) contain clusters that are actively going through cell cycle as shown by expression of Top2a and Cdk1 (Extended Data Fig. 16a,b,d). Within each of these classes, genes marking specific progenitors are expressed as well as genes that are expressed in RGCs and/or intermediate progenitor cells (IPCs). Nkx2-1, Dlx1 and Ascl1 are expressed in MGE RGCs and IPCs regardless of the cell cycle phase (Extended Data Fig. 16a,d). Within the MGE immature class, several clusters express Zic1, Zic4 and low levels of Pax6, potentially marking progenitors from rostral MGE or septum7,65. Immature LGE cells express Dbi, Fabp7, Gsx2 and Ascl1, which are known to be expressed in LGE RGCs and IPCs66 (Extended Data Fig. 16b,d). As the cells start differentiating into MSN D1 or D2, Isl1 and Ebf1 are upregulated in D1, while Six3 and Tshz1 mark D2. In the 56 CNU–HYa immature class, we could not resolve developmental stages; instead, this class, like its adult counterpart, includes clusters from diverse developmental domains, marked by Nkx2-1 and Zic1 (MGE/septum), Meis2 and Pax6 (LGE), and Otp (hypothalamic anlage)67,68 (Extended Data Fig. 16c).

Developmental CGE and MGE

The cortical CGE- and MGE-derived GABAergic classes are predominantly located in all regions of the isocortex, OLF, HPF and CTXsp (Extended Data Fig. 1a), and are marked by expression of the developmental TF Maf (Fig. 5a and Extended Data Fig. 15a). The CTX–CGE GABA class specifically expresses the developmental TFs Nr2f2 and Prox1. MGE gives rise to GABAergic neurons that populate both CTX (CTX–MGE GABA class) and STR and PAL (CNU–MGE GABA class). Both classes are marked by expression of Lhx6 and the CNU–MGE GABA class also specifically expresses Lhx8 (Fig. 5a,e and Extended Data Fig. 15b).

For both CTX–CGE and CTX–MGE neurons, expansion of distinct types occurs between P0 and P14, whereas CNU–MGE types begin diversifying earlier, during late embryonic stages (Fig. 6a–c and Extended Data Fig. 17a–d). Before P0, we identified 6 out of 8 subclasses and 8 out of 60 supertypes in the CTX–CGE GABA and CTX–MGE GABA classes, increasing to all 8 subclasses and 53 supertypes by P14. In the CNU–MGE class, all 5 subclasses and 14 out of 27 supertypes were detected before P0, rising to 23 by P14. Among cortical types, we could verify that Sst+ GABAergic neurons emerge and diversify earlier than Pvalb+ neurons13. Pvalb+ chandelier cell signatures69 appear around E14 but do not form distinct clusters until P0 (Extended Data Fig. 17e). Early postnatal circuit activity probably influences subtype specification70,71.

Fig. 6: Developmental trajectories of CGE-, MGE- and LGE-derived neurons.
Fig. 6: Developmental trajectories of CGE-, MGE- and LGE-derived neurons.
Full size image

a, UMAP representation of all neurons from embryonic ages to P14 that will form the CTX–CGE GABA, CTX–MGE GABA and CNU–MGE GABA classes. Cells are coloured and labelled by subclass. b,c, Constellation plots showing all clusters using UMAP coordinates from a. Nodes are clusters that are coloured and labelled by subclass (b) or the proportion of age bins (c). The bubbles represent classes (b) and subclasses (c). d, UMAP representation of all neurons from embryonic ages to P14 that will form the OB–IMN GABA and CNU–LGE GABA classes. Cells are coloured and labelled by subclass. e, Constellation plot showing all clusters using UMAP coordinates from d. Nodes are coloured and labelled by subclass, and bubbles behind constellation are coloured by class. f, UMAP representation of all neurons from embryonic ages to P14 that will form the CNU–LGE GABA class. Cells are coloured and labelled by subclass. g, Constellation plot showing all clusters using UMAP coordinates from f. Nodes are clusters that are coloured by the proportion of age bins. The bubbles represent subclasses. The triangular schematic denotes the most granular hierarchical level shown in the panels (supertype). All class, subclass and region abbreviations are listed in Supplementary Table 1.

In corroboration with previous results, subclass 50 (Lamp5 Lhx6 Gaba) expresses both Lhx6 and Adarb2, markers of the MGE and CGE (Extended Data Fig. 17f). As stated above, in the current whole-mouse brain (WMB) taxonomy, this subclass is assigned to the CTX–MGE class due to expression of the MGE marker Lhx6. Lhx6 and Adarb2, marking MGE- and CGE-originated inhibitory neurons, respectively, are co-enriched in the POA-derived neurogliaform cells, indicating that the Lamp5+Lhx6+ subclass might be POA derived.

In the adult data, we described the existence of a laminar gradient among the Pvalb+ and Sst+ types (Extended Data Fig. 10). This laminar gradient can also be seen in the Pvalb+ and Sst+ types during postnatal development starting as early as P0 (Extended Data Fig. 18a,b). This might indicate that the gradient is established very early on as the new neurons enter the isocortex or it is preprogrammed into newborn neurons.

Developmental LGE

The OB–IMN GABA and CNU–LGE GABA classes both arise from the LGE domain (Figs. 5b–e and 6d,e and Extended Data Fig. 19a,b). OB–IMN GABA types are found in the MOB, AOB and the SVZ lining the lateral ventricle, and express Sp8, Sp9 and Meis2 (Fig. 5a,e and Extended Data Fig. 15). The types within the CNU–LGE GABA class are predominantly located in the STRd and STRv and are marked by expression of Rarb and Foxp2 (Fig. 5a and Extended Data Fig. 15). Before P0, we identified 6 out of 14 subclasses and 12 out of 65 supertypes in the OB–IMN GABA and CNU–LGE GABA classes. At P14, we identified all 14 subclasses and 64 out of 65 supertypes within these classes.

In the adult GABAergic cell-type taxonomy, we observed a relatedness between subclass 39 (OB Meis2 Thsd7b Gaba) and subclass 65 (IA Mgp Gaba) from CNU that would suggest that these subclasses share their developmental origin. Subclass 39, containing the neurons that populate the Ipl/Mi, EPl and Gl layers of the OB, is sequestered away from most other OB cell types and more closely related to subclass 65 containing immature cells populating IA (Fig. 6d,e).

In the developmental taxonomy, we observed that supertypes 265 and 270 of subclass 61 (STR D1 Gaba) are closely related to subclass 60 (OT D3 Folh1 Gaba) (Fig. 6d,e). Notably, these supertypes share a physical location with the OT neurons in subclass 60 in the adult (Fig. 3f,h). This might indicate that the neurons in these locations share a specific developmental origin.

Although subclasses 61 (STR D1 Gaba) and 62 (STR D2 Gaba) are transcriptomically similar in adulthood, their distinction is established early. A shared immature subclass (507 STR immature) emerges at E12.5, followed by distinct D1 (508) and D2 (509) subclasses at E13.5 and E14.5, respectively (Fig. 6d–g and Extended Data Fig. 19a–e). D1 and D2 populations originate from different progenitor populations based on expression of distinct gene sets, for example, Isl1 and Ebf1 for D1, and Sp9 and Sox2 for D2 (Extended Data Fig. 19e–k). Initially distinct, their gene expression profiles converge postnatally, with formerly specific markers shared by P14 and P56 (Extended Data Fig. 19e,l–q). Transcriptomic similarity increases over time, as shown by reduced distance and emergence of a shared gene expression gradient in adulthood (Extended Data Figs. 11g,h, 18c,d and 19r). This suggests that D1 and D2 types follow separate developmental trajectories and later converge as they integrate into shared circuits.

Developmental CNU–HYa and LSX

The LSX GABA class, restricted to LSX, expresses TFs Zic1, Zic4, Zic5 and Prdm16 (Fig. 5a,e and Extended Data Figs. 15 and 20f). The CNU–HYa GABA class includes types found in both the CNU (notably sAMY and PALc) and POA. Both classes are highly diverse with multiple embryonic origins. LSX contains a population of Nkx2-1+ neurons from the septal eminence31. CNU–HYa types in the CEA, AAA and BST express Meis2 and Six3, types in the MEA and certain BST cell types express Lhx6 and Nr2f2, and adult POA types (in the medial preoptic nucleus (MPN), MPO and LPO) express Lhx6, Lhx8, Nkx2-1 and Zic1, suggesting origins in the ventral MGE, embryonic POA or septal eminence7 (Fig. 5a and Extended Data Fig. 20f). Developmental mapping could not precisely resolve origins for all adult subclasses (Extended Data Fig. 20a–e). Both LSX and CNU–HYa classes share similar progenitor populations from E11.5 to E14.5 that shift substantially by E16.5. In contrast to cortical and striatal GABAergic types, LSX and CNU–HYa cell-type diversification largely occurs before birth: by E18, 23 out of 25 subclasses and 87 out of 132 supertypes were identified, with near-complete adult profiles (all 25 subclasses and 107 out of 132 supertypes) by P14. This early maturation parallels POA-derived hypothalamic neurons63. The 56 CNU–HYa immature class features heterogeneous signatures from multiple regional domains, in contrast to 54 MGE and 55 LGE immature classes, which reflect temporal stages of differentiation (Extended Data Fig. 16).

Discussion

Here we present a comprehensive cell-type taxonomy and spatial atlas of GABAergic neurons in the mouse telencephalon, integrating high-resolution scRNA-seq and adult whole-brain MERFISH data. This enables a unified view of the molecular and anatomical organization of GABAergic neurons, revealing a hierarchical structure in which cell-type relationships reflect both spatial location and developmental origin. Our taxonomy aligns with previous studies across telencephalic regions and identifies many novel types. This resource provides a foundation for future molecular, anatomical and functional investigations of GABAergic circuits. Moreover, we identify key organizational principles, including spatial gene expression gradients, long-range migration and dispersion of related cell types, and marked differences in the timing of diversification between cortical and striatal neurons and between septal, pallidal and preoptic neurons.

GABAergic neurons of the telencephalon originate from five embryonic subpallial domains: MGE, CGE, LGE, POA and septum3,7,72. From here, the immature neurons migrate to distant locations and differentiate to form mature brain structures. These processes are regulated by morphogen-regulated TF modules. Many TFs regulating neuronal differentiation during development are also expressed in mature neurons, enabling us to infer developmental origins based on TF expression patterns. Recent studies show that some of these TFs not only specify but also maintain neuronal identity. In invertebrates, the term ‘terminal selector genes’ describes such TFs, while, in mice, the concept of master regulators has been used73,74. Many of these TFs have been identified by the fact that genetic removal during development results in the failure of specific neuronal classes to develop properly. Our study reveals a large set of persistent TFs in telencephalic GABAergic neurons (Fig. 5), with several candidates potentially acting as terminal selectors, to be explored in future genetic perturbation experiments.

A near-universal feature of telencephalic GABAergic neurons is that neurons from distant brain regions can belong to the same cell type. This suggests that neurons sharing developmental origins migrate extensively, populating disparate areas. Building on previous studies, we show that most CTX–CGE and CTX–MGE supertypes and clusters are broadly distributed across the isocortex, HPF, OLF and CTXsp, while some are selectively restricted to the HPF, OLF or CTXsp (Fig. 2 and Extended Data Fig. 4). The CNU–MGE class, containing all striatal interneurons and many pallidal neurons, shares a common MGE origin75,76 with cortical Pvalb+ and Sst+ interneurons (Fig. 2). The OB–IMN class, containing olfactory bulb GABAergic neurons, developmentally originates in the LGE and is transcriptomically related to the CNU–LGE class containing the striatal D1 and D2 MSNs and related cell types (Fig. 3 and Extended Data Fig. 2). The OB–IMN class also includes immature neurons generated by adult neurogenesis in the SVZ. Finally, the CNU–HYa class encompasses a highly heterogeneous group of cell types that are widely distributed across the amygdala, PALm/PALv/PALc and hypothalamic POA. This class derives from multiple origins, including MGE, LGE and POA, and may even contain some diencephalic-origin neurons.

At finer resolution, subclass 56 (Sst–Chodl Gaba) in the CNU–MGE class contains both cortical Sst+Chodl+ long-range projection neurons33,77,78 and Sst+ striatal interneurons, which share closer transcriptomic similarity to each other than to cortical Sst+ interneurons (Fig. 2). Similarly, the cholinergic neurons in subclass 58 (PAL–STR Gaba–Chol) include basal forebrain cholinergic projection neurons and striatal cholinergic interneurons, with spatial specificity at the supertype or cluster level (Fig. 2 and Extended Data Fig. 7). Lastly, within the CNU–HYa class, certain supertypes span functionally connected regions, such as the CEA and BST, or the MEA and BST, indicating that a single-cell type can occupy separate but connected nodes of the same neural circuit (Fig. 4 and Extended Data Fig. 9).

As a consequence of widespread migration and dispersion, most telencephalic regions (except for the LSX and OB) contain a heterogeneous mixture of GABAergic neuronal types (Extended Data Fig. 1) with distinct developmental origins and probably distinct connectivity and circuit functions. Within each major progenitor domain, several functional subdomains have been identified that generate distinct GABAergic neurons. For example, the LGE’s SVZ is divided dorsoventrally into four progenitor zones (pLGE1–4). The dorsal LGE (dLGE; pLGE1,2) gives rise to OB interneurons and amygdala ITCs, while the ventral LGE (vLGE; pLGE3,4) predominantly produces MSNs39,79. Recent data suggest that pLGE3 and pLGE4 preferentially generate D2 and D1 MSNs, respectively80. Likewise, MGE is divided into five subdomains (pMGE1–5) generating diverse GABAergic neurons7,81. Ventral MGE (pMGE4,5) produces many striatal and pallidal neurons, while dorsal MGE (pMGE1–3) generates cortical interneurons7,81. Future studies will be crucial for systematically mapping the spatiotemporal emergence of GABAergic diversity within each progenitor domain.

After reaching their final destinations, GABAergic neurons further refine their identities under the influence of the local environment, evidenced by gene expression gradients and spatial variation. MGE-derived cortical Sst+ and Pvalb+ interneurons show gradual transcriptomic changes from deep to superficial layers (Extended Data Fig. 10). Similarly, LSX subclasses exhibit multidimensional spatial gradients (Extended Data Fig. 12) that might align with their input/output connections57,82. The most pronounced spatial gradients were observed among D1 and D2 MSNs in the STR, correlating with the dorsolateral-to-ventromedial axis (Extended Data Fig. 11). This transcriptional gradient is partly driven by gene modules related to cGMP–PKG signalling, which is crucial for regulating long-term changes in striatal synaptic efficacy83,84. Meanwhile, the ventromedial striatal MSNs express neuroactive receptor genes such as Cnr1 (ref. 36), and are aligned with topographically organized excitatory afferent projections85.

We identified distinct cholinergic neuronal types in STR, PAL and lateral septum. Most telencephalic cholinergic neurons originate from the MGE, embryonic POA and embryonic septum86. The cholinergic precursors originate from the Nkx2-1+ domain and are further specified by combinatorial expression of additional TFs7. Lhx6 is essential for specification and migration of MGE-derived GABAergic interneurons in both the CTX and STR8,87, and Lhx8 has been associated with the specification of a cholinergic phenotype by actively inducing cholinergic properties8. These TFs remain expressed in adulthood. Cholinergic neurons born between E12 and E16 acquire distinct identities, with early- and late-born neurons populating different regions. The late-born population, absent in Gbx2 knockouts88, probably corresponds to striatal supertype 260, which still expresses Gbx2 (Extended Data Fig. 7). A subset of the striatal cholinergic interneurons in this supertype expresses the type-3 vesicular glutamate transporter (Slc17a8) and can mediate glutamatergic transmission, which is required for cholinergic signalling onto fast spiking interneurons (subclass 54, STR Prox1 Lhx6 Gaba) as well as acetylcholine-dependent inhibition of MSNs89. While striatal cholinergic neurons act primarily as interneurons, pallidal cholinergic neurons mostly project to the isocortex, HIP and amygdala. Their function is linked to topographic organization: dorsal prefrontal cortex receives input from medial SI and NDB cholinergic neurons, whereas ventral regions are innervated by more lateral basal forebrain nuclei. The HIP and ENT mainly receive cholinergic input from MS and NDB neurons90.

Developmental scRNA-seq data reveal a key difference between GABAergic neurons derived from the MGE, CGE and LGE (CTX–CGE, CTX–MGE, CNU–MGE, CNU–LGE and OB–IMN classes) and those from the embryonic septum and POA (LSX and CNU–HYa classes). The former exhibit extensive postnatal diversification, with clear transcriptomic shifts from late embryonic to adult stages (Fig. 6 and Extended Data Figs. 17, 19 and 20). New types, particularly in CTX–CGE and CTX–MGE, emerge as late as P21 (ref. 91), potentially including postnatally migrating Htr3a+ CGE-derived neurons92. By contrast, septal, preoptic and most pallidal GABAergic neurons transition primarily between E12 and E18, with minimal postnatal diversification, indicating that their adult identities are largely established prenatally63. This distinction reflects a broader dorsal–ventral dichotomy. Dorsal cell types (for example, cortical and hippocampal) tend to be larger and more transcriptomically distinct, while ventral types (for example, pallidal and hypothalamic) form numerous small, similar clusters16. Evolutionarily, ventral structures are more conserved and support homeostatic functions such as feeding, sleep and reproduction, whereas dorsal structures have evolved to support adaptive behaviours. Dorsal region maturation depends more on sensory inputs, in contrast to ventral regions, which develop largely independent of experience6,10,63,71,93.

In conclusion, our study provides a detailed transcriptomic characterization of GABAergic neurons in the telencephalon, their spatial locations and their potential developmental origins. Although we have captured several rare cell types, it is possible that, with more cells profiled, a more comprehensive catalogue of cell types could be obtained. However, even this level of diversity poses challenges for linking cell types to morphology, connectivity, physiology and function. Although the spatial organization of transcriptomic types aligns with known circuit organization, further research is needed to link the transcriptomic types to specific projection and connectivity patterns. Also, whereas the current developmental data link adult cell types to their origins, further molecular studies are needed to fully understand the diversification process leading to the large repertoire of telencephalic GABAergic neuronal types.

Methods

Sample collection and data analysis for the P56 dataset

Most of the methods that apply to the adult P56 10x scRNA and MERFISH datasets used for this paper were described previously16 and the following methods are either newly introduced or a modified version was used for this Article. The P56 dataset used in this study is from the subpallium GABA neighbourhood in the WMB cell atlas16, with a total of 611,423 high-quality single-cell transcriptomes (10xv2: 269,307 cells, 3,567 ± 1,264 (mean ± s.d.) genes per cell, 9,328 ± 5,502 unique molecular identifiers (UMIs) per cell; 10xv3: 342,116 cells, 5,949 ± 1,625 genes per cell, 26,476 ± 14,943 UMIs per cell) (Supplementary Table 3).

UMAP projection

We performed principal component analysis (R package stats, v.4.4.1, RRID: SCR_025968) based on the imputed gene expression matrix of 4,895 marker genes using the 10xv3 reference. We selected the top 100 principal components, then removed one principal component with more than 0.7 correlation with the technical bias vector, defined as log2[gene count] for each cell. We used the remaining principal components as an input to create 2D and 3D UMAPs (v.0.5.6, RRID: SCR_018217)94 using the parameters nn.neighbors = 25 and md = 0.4.

Constellation plot

To generate the constellation plot, each transcriptomic supertype was represented by a node (circle), of which the surface area reflected the number of cells within the supertype in log scale. The position of nodes was based on the centroid positions of the corresponding supertypes in UMAP coordinates. The relationships between nodes were indicated by edges that were calculated as follows. For each cell, 15 nearest neighbours in reduced-dimension space were determined and summarized by supertypes. For each supertype, we then calculated the fraction of nearest neighbours that were assigned to other supertypes. The edges connected two nodes in which at least one of the nodes had >5% of nearest neighbours in the connecting node. The width of the edge at the node reflected the fraction of nearest neighbours that were assigned to the connecting node and was scaled to node size. For all nodes in the plot, we then determined the maximum fraction of outside neighbours and set this as edge width = 100% of node width. The function for creating these plots, plot_constellation, is included in scrattch.bigcat (v.0.0.5; https://github.com/AllenInstitute/scrattch.bigcat)16.

Imputation of scRNA-seq data into the MERFISH space

The MERFISH dataset was collected using only 500 genes. To obtain the spatial distribution of all of the genes, we projected gene expression of the MERFISH dataset to the 10xv3 scRNA-seq dataset using the impute_knn_global function in the scrattch.bigcat package16. We used the self-imputed 10xv3 dataset as a reference, meaning that the expression of each 10xv3 cell was first imputed based on its nearest 15 neighbours in the reduced principal component space. This decision was made to ensure that, in the following hierarchical imputation step, the transitions between major cell types were preserved. The imputation was conducted in the order specified by the hierarchy defined by class and subclass. At the root, we imputed the expression for all of the genes for each MERFISH cell based on the average expression of their nearest neighbours from the reference 10xv3 dataset, defined by the cosine similarity using all 500 MERFISH genes. In each of the following iterations, we selected the node to which each MERFISH cell was assigned and imputed only the expression of the differentially expressed genes (DEGs) based on pairwise comparison for all of the clusters under this node. The nearest neighbours for imputation were selected from the clusters under this node in the reference dataset, using only the subset of DEGs that were present on the MERFISH gene panel. We repeated this procedure until reaching the leaf node. This strategy enabled us to preserve the cell-type resolution during imputation, making it less susceptible to the global platform differences between MERFISH and scRNA-seq.

Analysis of spatial gene expression gradients

We performed ICA using fastICA (v.1.2-5.1; RRID: SCR_013110)95 to decompose the gene expression matrix into independent components. These components were then projected onto the imputed MERFISH data to determine whether the component represents a spatial gene expression program. For components that represent a spatial gene expression program, the top loading genes were selected and visualized on both the UMAP in scRNA-seq space and on sections in the imputed MERFISH space. We evaluated the gene modules in the identified individual components and applied UCell (v.2.8.0; https://doi.org/10.18129/B9.bioc.UCell)96 to assign a gene module score based on both positive and negative genes to each cell.

Spatial domain clustering

We used BANKSY (v.0.99.12; https://github.com/prabhakarlab/Banksy)97 to perform spatial domain clustering within BST neurons. This algorithm implements a feature-augmentation approach to map domains by integrating the transcriptional profiles of individual cells with their physical distances and tissue neighbourhood context. As the MERFISH data were registered to the CCFv3, it enabled us to subset BST neurons from the MERFISH data. We used the 8,988 BST neurons, their spatial location and their 500-gene expression profile as input for BANKSY.

Assessing the concordance of cell-type taxonomy between the adult subpallium GABAergic cell-type atlas and adult external datasets

We performed mapping of cells from each adult external dataset18,20,32,36,46,48 to the 10xv3 whole-brain dataset using treeMap function from scrattch.mapping package (v.0.55; https://github.com/AllenInstitute/scrattch-mapping)98. The reference cell-type taxonomy was organized in a hierarchy defined by class and subclass. At each node, the top markers were selected that best discriminate the clusters belonging to different child nodes. Starting at the root, cells were assigned to the closest cluster centroid from all of the clusters under the given node based on the selected node markers using the cosine similarity metric. This mapping procedure was repeated until reaching the leaf nodes. To assess the mapping confidence, we subsampled 80% of the markers at each node and repeated the mapping process 100 times. In each bootstrapping step, we also computed the cosine similarity of the cell to the mapped cluster based on the markers for all of the nodes along the mapping path and calculated the average similarity across all 100 bootstrapping iterations. This score was used to assess the quality of the mapping. Cells with a score above 0.5 were used to generate a confusion matrix showing the proportion of cells jointly found between two types and their Jaccard similarity score.

Measuring the similarity between MSN D1 and D2 clusters

We computed the nearest neighbours from STR D1 MSNs for STR D2 MSNs and vice versa using the cosine similarity metric based on the same marker list used to define the edge weights for the constellation plots. To select the most similar pairs between D1 and D2 MSN types, we selected all of the pairs with a Pearson correlation of greater than 0.93 and with at least 30% of the k-nearest neighbours from one cluster belonging to the other cluster in the pair.

Developmental scRNA-seq data collection

Mouse breeding and husbandry

All experimental procedures related to the use of mice were approved by the Institutional Animal Care and Use Committee of the AIBS, in accordance with NIH guidelines. Mice were housed in a room with temperature (21–22 °C) and humidity (40–51%) control within the vivarium of the AIBS. Mice were provided food and water ad libitum and were maintained on a regular 14 h–10 h light–dark cycle. Mice were maintained on the C57BL/6J (JAX, 000664) background. We excluded any mice with anophthalmia or microphthalmia.

The mothers of all experimental mice were placed into a fresh cage when the embryos were aged about E8. We used 6 embryos to collect 74,550 cells from ages E11.5 (n = 1), E12.5 (n = 1), E13.5 (n = 2) and E14.5 (n = 2). From ages E11.5 and E12.5, we collected whole-brain tissue and, from ages E13.5 and E14.5, we collected cerebrum and brain stem (CH–BS). From P0 pups (n = 7), we collected 171,041 cells; from P4 pups (n = 8) we collected 259,172 cells; from P8 pups (n = 7) we collected 326,245 cells; from P12 pups (n = 7) we collected 260,336 cells; and from P14 pups (n = 8) we collected 331,137 cells. Cells from postnatal mice were collected from both male and female mice across six dissection regions of interest (ROIs): OLF, CTXsp, isocortex, HPF, CNU and HY. No statistical methods were used to predetermine sample size. All donor animals used for the developmental scRNA-seq data generation are listed in Supplementary Table 4. Brain dissections for all groups took place in the morning. Randomization and blinding were not performed in this study. scRNA-seq data were generated and analysed using standardized, automated computational pipelines. As such, data collection and analysis were not influenced by operator bias. Potential batch effects were accounted for using established normalization and integration methods during downstream analysis.

Single-cell isolation

Single cells were isolated according to a cell-isolation protocol developed at AIBS99. The brain was dissected, submerged in artificial cerebrospinal fluid (ACSF), embedded in 2% agarose and sliced into 350 μm coronal sections on a compresstome (Precisionary Instruments). Block-face images were captured during slicing. ROIs were then microdissected from the slices and dissociated into single cells.

Dissected tissue pieces were digested with 30 U ml−1 papain (Worthington PAP2) in ACSF for 30 min at 30 °C. Owing to the short incubation period in a dry oven, we set the oven temperature to 35 °C to compensate for the indirect heat exchange, with a target solution temperature of 30 °C. Enzymatic digestion was quenched by exchanging the papain solution three times with quenching buffer (ACSF with 1% FBS and 0.2% BSA). The samples were incubated on ice for 5 min before trituration. The tissue pieces in the quenching buffer were triturated through a fire-polished pipette with 600 µm diameter opening approximately 20 times. The tissue pieces were allowed to settle and the supernatant, which now contained suspended single cells, was transferred to a new tube. Fresh quenching buffer was added to the settled tissue pieces, and trituration and supernatant transfer were repeated using 300 µm and 150 µm fire-polished pipettes. The single-cell suspension was passed through a 70 µm filter into a 15 ml conical tube with 500 µl of high-BSA buffer (ACSF with 1% FBS and 1% BSA) at the bottom to help cushion the cells during centrifugation at 100g in a swinging-bucket centrifuge for 10 min. The supernatant was discarded, and the cell pellet was resuspended in the quenching buffer. The concentration of the resuspended cells was quantified, and cells were immediately loaded onto the 10x Genomics Chromium controller.

cDNA amplification and library construction

The E11.5 to E14.5 cell suspensions were processed using the Chromium Single Cell 3′ Reagent Kit v3 (1000075, 10x Genomics). We followed the manufacturer’s instructions for cell capture, barcoding, reverse transcription, cDNA amplification and library construction100. We loaded 8,283 ± 703 (mean ± s.d.) cells per port. We targeted a sequencing depth of 120,000 reads per cell; the actual average achieved was 70,324 ± 62,149 (mean ± s.d.) reads per cell across 9 libraries.

The P0 cell suspensions were processed using the Chromium Single Cell 3′ Reagent Kit v3.1 (1000268, 10x Genomics). We followed the manufacturer’s instructions for cell capture, barcoding, reverse transcription, cDNA amplification and library construction101. We loaded 11,468 ± 1,735 cells per port. We targeted sequencing depth of 120,000 reads per cell; the actual average achieved was 114,853 ± 30,439 reads per cell across 13 libraries.

The P4 to P14 cell suspensions were processed using the Chromium Next GEM-X Single Cell 3′ Reagent Kit v4 (1000691, 10x Genomics). We followed the manufacturer’s instructions for cell capture, barcoding, reverse transcription, cDNA amplification and library construction. We loaded 38,740 ± 8,097 cells per port. We targeted sequencing depth of 120,000 reads per cell; the actual average achieved was 67,175 ± 24,394 reads per cell across 58 libraries.

Sequencing data processing and QC

Processing of 10x Genomics scRNA-seq libraries was performed as described previously22. In brief, libraries were sequenced on the Illumina NovaSeq 6000 system, and sequencing reads were aligned to the mouse reference transcriptome (M21, GRCm38.p6) using the 10x Genomics CellRanger pipeline (v.6.1.1) with the default parameters. To remove low-quality cells, we applied similar quality-control analysis and thresholding based on gene detection, quality control score and doublet score as described previously16. Doublets were identified using a modified version of the DoubletFinder algorithm, which is available in scrattch.hicat (v.0.1.0, RRID: SCR_01809)102. We removed cells using a gene count cut-off of 2,000, quality-control score cut-off of 200 and doublet score cut-off of 0.3. Cells were collected from six distinct dissection ROIs: OLF, CTXsp, isocortex, HPF, CNU and HY. The subpallium GABAergic cells were selected based on marker gene expression. In brief, Seurat (v.5.1.0, RRID: SCR_016341) was used for initial clustering (default parameters). A gene module score was calculated for the following genes: Dlx1, Dlx2, Dlx5, Dlx6 and Arx. Clusters with a score of <0 were removed from further analysis. In total, 387,148 newly generated cells were included in the dataset (Supplementary Table 4).

Published datasets

We combined our developmental dataset with eight published datasets from other laboratories covering the embryonic to early postnatal period (E7 to P2)31,59,60,61,62,63 (Supplementary Table 5). Raw counts and, if available, cell-type and metadata annotations for each of the datasets were downloaded from their respective sources. We applied minimal quality-control thresholds to the external datasets (Supplementary Table 5). Cells with fewer than 1,000 genes per cell were removed from the dataset, resulting in a total of 227,421 remaining cells for integration with our own datasets. The combined datasets vary greatly in number of genes and UMIs detected (Extended Data Fig. 13a,b).

Integration of scRNA-seq data

We used all developmental datasets and the 10xv3 portion of the adult P56 dataset, totalling 956,213 cells for integration. Raw counts were log-normalized and subsequently used to select 4,000 highly variable genes. For dimensionality reduction and batch correction, the raw counts of the highly variable genes were used to train the scvi-tools (v.0.17.1, RRID: SCR_026673) scVI variational autoencoder (parameters: batch_key = ‘dataset’, n_latent = 32, gene_likelihood = ‘nb’, n_layers = 3, continuous_covariate_keys = log2ngene)103. The latent dimensions were used to generate a k-nearest neighbours graph for UMAP generation and subclustering.

Clustering scRNA-seq data

To assign cell-type identity to cells at P14, each cell was mapped onto the WMB taxonomy using hierarchical approximate nearest neighbour (HANN) mapping available in the scrattch-mapping package98. For P12 cells, we assigned each cell’s identity by mapping it to the nearest supertype centroid in the adjacent older age group, P14, using scrattch.mapping. P8 cells were mapped to P12 and P4 cells were mapped to P8. For cells from E7 to P2, ages were binned together based on the similarity between temporal age and predicted age (Extended Data Fig. 13d). Predicted age was calculated as follows; within the integrated scVI latent space, the ten nearest neighbours for each single cell were determined. Each cell was assigned a predicted age based on the common age among the ten neighbours. This process was repeated ten times, using the predicted age from the previous iteration as starting point. Cells from P0 and P2 were binned and mapped to P4, cells from E15 to E18.5 were binned and mapped to P0–P2, cells from E11 to E14.5 were binned and mapped to E15–E18.5, and cells from E7 to E10 were binned and mapped to E11–E14.5.

After assigning the broad cell types, iterative clustering was performed within assigned subclasses in the scVI latent space using the scrattch.bigcat package as described previously16. Clustering was performed using the following DEG criteria: q1.th = 0.4, q.diff.th = 0.7, de.score.th = 120, min.cells = 10. Cell-type annotation of the developmental scRNA-seq dataset is shown in Supplementary Table 6.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.