Abstract
Despite growing attention to gender disparities in innovation, little is known about how gender shapes the characteristics and outcomes of patented inventions. This study analyzes 3.7 million U.S. utility patents, covering 1.8 million distinct inventors and over 200,000 organizations, to investigate the gendered patterns of inventorship. While women’s participation in patenting has increased over time, they remain significantly underrepresented, and patents involving female inventors consistently receive fewer citations than those by all-male teams. However, women-participated patents are more likely to exhibit novelty, originality, and technological generality, particularly when produced by mixed-gender teams, which tend to generate the most disruptive inventions. Female inventors also draw more heavily on scientific literature and public support, especially in green technology and academic settings. Organizational and domain-level differences are pronounced: universities involve women at higher rates than corporations, and fields such as biotechnology and civil engineering demonstrate distinct gendered patterns in patent quality and disruption. These results suggest that women make important yet often overlooked contributions to innovation and that structural barriers may suppress their full inventive potential. Addressing these disparities can enhance innovation diversity, expand the societal relevance of patented technologies, and better support the next generation of inventors.
Similar content being viewed by others
Introduction
Inventions drive technological advances and economic development. Society benefits from an inventor population with a variety of backgrounds that can create diverse and valuable inventions. Despite this, there is a lack of diversity in inventors worldwide. In particular, women are significantly underrepresented. Female inventors and patents filed by women are substantially fewer than men. Compared to men, women are less likely to engage in inventive activities, file for patents, receive patent grants, appeal, or maintain granted patent rights (Huang et al., 2022; Jensen et al., 2018; Schuster et al., 2020).
While gender disparity in the quantity of patents is well-documented, whether such disparity extends to the characteristics of patents remains under-explored. Understanding gender differences in the characteristics of inventions can shed light on the underlying nature of gender gaps and indicate potential approaches to encouraging gender diversity in innovation. As the world’s largest economy (World Bank, 2025) and a major contributor to global technological innovation, the United States plays a central role in the global patent landscape. According to data from the World Intellectual Property Organization (WIPO), the U.S. ranks among the top patent-filing countries globally (WIPO, 2024).Footnote 1 However, while the U.S. leads in the number and influence of patents in high-value sectors such as pharmaceuticals, software, and artificial intelligence, it also reflects many of the gendered patterns seen worldwide. Therefore, studying gender disparities in U.S. patenting not only offers insight into the national innovation system but also informs broader global conversations about equity and participation in knowledge economies.
This study compares women- and men-invented patents’ characteristics using several patent indicators, including impact, novelty, scope, reliance on public-supported research, and reliance on scientific knowledge. To conduct this analysis, we compiled a dataset comprising 3.7 million U.S. utility patents and 1.8 million distinct inventors with inferred gender information. This dataset is further enriched with three specialized sources that provide indicators of a patent’s disruption to existing technologies, its reliance on publicly funded research, and its references to scientific literature.
This study expands the current literature by examining gender disparities across a wide range of patent characteristics and institutional contexts. While we find that the number of female inventors has steadily increased over time, significant disparities persist. Women-invented patents tend to be cited less often, suggesting lower conventional impact. Yet, we also uncover evidence that female inventors contribute meaningfully to innovation through greater novelty, broader technological scope, and more disruptive inventions in certain domains. Despite societal and institutional barriers, women continue to advance the frontier of invention, often in distinct and underappreciated ways.
Literature review
Historical records show a large gender disparity in patenting. In the U.S., about 375,000 patents were granted to men in the 50 years from 1836 to 1888 (Schuster et al. 2020). By comparison, only about 3000 women inventors are identified during the first hundred years of the U.S. patent system (Pursell, 1981).
Since the 20th century, gender disparity in patenting has improved but still exists. Data from the WIPO shows that from 2000 to 2022, the percentage of female inventors worldwide who filed through the Patent Cooperation Treaty (PCT) increased from 9.9% to 17.1%. Over the same period, the percentage of international patent applications that have at least one women inventor increased from 18.6% to 34.7% (WIPO, 2023). The United States Patent and Trademark Office (USPTO) also reported that the percentage of women inventors in the U.S. increased from 3% in 1976 to 12.8% in 2019 (Toole et al. 2020a). In particular, women’s participation in emerging scientific fields such as biotechnology and artificial intelligence (AI) remains notably low, despite evidence that their involvement adds significant value (Giczy et al., 2024).
Gender disparity exists not only in the number of inventors and patents but also in patents’ content, quality, and legal processes. For example, female applicants are more likely to be rejected, and upon rejection, they are less likely to appeal (Huang et al., 2022; Schuster and Goodman, 2025). Women are allowed to have a smaller fraction of their claims, and more texts are added to their claims during prosecution, which typically reduces patents’ value. Patents filed by women are less likely to be maintained (Jensen et al. 2018). Patents created by women tend to have fewer claims and lower technological impact compared to those filed by men (Sugimoto et al., 2015), and women are more likely to appear as co-inventors rather than lead inventors on patent documents (Whittington and Smith-Doerr, 2005). In addition, inventors’ genders impact the motive and purposes of the inventions and who benefits from the inventions. For instance, women engage in less commercial patenting and are more likely to discover female-focused ideas, such as biomedical devices focusing on women’s health (Koning et al. 2020).
Despite this disparity, women have created great inventions and have contributed significantly to science and technology (Comedy and Dougherty, 2018). For example, Anna Connelly invented the first outdoor fire escape (Uhl, 2018), and Hedy Lamarr invented a communication system that paved the way for wi-fi (Rhodes, 2012). It is worth acknowledging the substantial but often overlooked contributions of nearly 200,000 women in the U.S. who have played a vital role in creating over 600,000 patents in the past half-century, despite their names not receiving widespread recognition.
Potential reasons for gender gaps in inventing
Studies show that gender gaps in inventing may be rooted in social, cultural, economic, and political factors. Patents are primarily created within fields related to STEM (Science, Technology, Engineering, and Mathematics), in which women are less likely to work. U.S. National Science Foundation (NSF) reported that only 35% of the U.S. STEM workforce in 2021 are women (National Center for Science and Engineering Statistics (NCSES), 2023). Even with similar educational backgrounds and previous career paths, women are less likely to apply for patents than men (Whittington and Smith-Doerr, 2008).
Previous studies found that government funding is increasingly fueling innovation in the U.S., demonstrated by the increased percentage of patents relying on research funded by the government (Fleming et al., 2019a). Organizations and inventors can benefit from patenting with knowledge created by government research, especially with constrained resources. Nevertheless, gender disparity may arise within this context. Researchers found that women are less likely to obtain research grants than men, and funding awarded to women tends to be lower than to men (Head et al., 2013; RissleR et al., 2020).
Some scholars have argued that the institutional and cultural structures of invention and commercialization have created barriers to women’s full participation (Lai, 2020). For example, female academics are often disproportionately burdened with non-research tasks, such as service or advising and counseling students (Ding et al., 2006). Women’s contributions to inventions are often undervalued and they are less likely to be acknowledged as inventors, even when their contributions are comparable to their male counterparts (Ross et al., 2022). This reduces women’s opportunities and success rates in research-related work, such as applying for funding, publishing articles, and filing for patents (Ceci and Williams, 2010). Therefore, systematic reform may be necessary to help protect women’s innovation products, such as an addition to the existing patent legal regime to acknowledge and protect unregistered patent rights (Marcowitz-Bitton et al., 2020).
Gender disparity in patenting may lead to profound consequences. The inventive potential of women and the value of their ideas and inventions may be under-recognized. Funding and other resources may be inadequate to help women invent and commercialize their inventions. This can lead to a loss of potential inventions, harming economic growth. In addition, the absence of women’s participation in inventing can result in a lack of diversity in innovative products. Because women are more likely to invent for other women (Koning et al., 2020, 2021), the shortage of women inventors may lead to fewer inventions that benefit women, fulfill women’s needs, and align with women’s values.
Research gap
Despite growing attention to the underrepresentation of women in patenting, most existing research has primarily focused on the quantity of patents filed by women, paying limited attention to potential disparities in the quality, characteristics, and types of inventions they produce. Studies that do compare patent characteristics by gender often concentrate on a single technological domain, such as the life sciences, or examine only a narrow set of metrics, typically citation counts or patent scope. Few have conducted a systematic, large-scale investigation of how gender differences manifest across a comprehensive set of patent indicators, including resource utilization, knowledge characteristics, and various dimensions of impact.
This study addresses these gaps by offering a comprehensive, multi-dimensional, longitudinal analysis of U.S. utility patents, shedding new light on how gender intersects with the nature, influence, and institutional environments of technological innovation.
Research questions
While prior studies have documented women’s underrepresentation in securing public funding for research and development (R&D) in certain domains (Holliday et al., 2014; Ley and Hamilton, 2008), it remains unclear whether and how this disparity extends to the context of commercial patenting. Moreover, no prior research has examined whether women and men differ in citing scientific publications in patent documents, despite such citations often being associated with the depth and quality of technological innovation. These two aspects can serve as indicators of inventors’ access to and engagement with key resources. Thus, we ask:
-
RQ1: How do inventor teams involving women utilize resources differently in the creation of patents (specifically, public support and scientific knowledge) compared to teams composed solely of male inventors?
Investigating this question will help illuminate gender disparities in the utilization of both financial and intellectual resources in the innovation process.
Second, gender differences in the content and knowledge characteristics of patents remain underexplored. These features, often referred to as ex-ante characteristics, capture the nature of the invention prior to its market impact or citation performance. Few studies have systematically examined these characteristics from a gender perspective. Therefore, we ask:
-
RQ2: How do patents by teams with different gender compositions differ in their knowledge-related characteristics, such as technological scope, combinatorial novelty, and originality?
Investigating this question contributes to our understanding of how gender relates to the substantive content and knowledge diversity of patented inventions.
Third, previous studies have shown that women-authored papers and patents tend to receive fewer citations than those authored by men. However, technological impact extends beyond simple citation counts. It also encompasses the breadth and nature of citations, for example, whether a patent is cited across diverse technological fields or whether it disrupts existing knowledge trajectories by being cited more than the prior art it builds upon. These aspects provide a richer understanding of an invention’s influence on technological development. Therefore, we ask:
-
RQ3: Do patents created by teams with different gender compositions exert different types and levels of technological impact?
Answering this question contributes new knowledge about how gender differences manifest in the nature of impact of patents.
Fourth, previous studies have demonstrated gender disparities in specific scientific and technological fields, particularly in life sciences and engineering (Ding et al., 2006; Whittington and Smith-Doerr, 2005). While examining gender disparity within a single domain is informative, comparing disparities across a broader set of major sectors can yield deeper insights into how institutional and disciplinary contexts shape inventorship.
In this study, we focus on five domains of high economic and strategic importance, and ask:
-
RQ4: How do gender disparities in patenting vary across fields, especially in strategically significant areas such as artificial intelligence, nanotechnology, biotechnology, civil engineering, and green technology?
Investigating this research question not only expands our understanding of gender disparity in underexplored domains, such as AI and green technology, but also helps identify which fields exhibit greater gender diversity and where women inventors thrive. Such comparative insights may reveal institutional practices or cultural factors from these fields that could inform policies aimed at reducing gender gaps in other areas.
Organizations play a critical role in shaping inventors’ opportunities and access to resources. Prior research has suggested that structural features of universities (such as flexible research environments and collaborative cultures) may support women’s innovation more than hierarchical corporate settings. However, few studies have systematically compared gender disparities across different types of organizations involved in patenting, such as corporations, academic institutions, and government agencies. To address this gap, we ask:
-
RQ5: Which organizational types (e.g., firms, universities, government agencies) are associated with greater participation by women inventors?
Exploring this question helps illuminate the institutional contexts in which women are more likely to contribute as inventors. By identifying organizational environments that foster gender diversity in innovation, our findings may inform organizational and policy-level interventions to support women’s participation in patenting across sectors.
Data
Patents created by U.S. inventors
We collected data on granted patents, inventors, assignees, locations, patent-to-patent citations, and technology classifications from the PatentsViewFootnote 2 open data platform (an initiative supported by the USPTO) in December 2024 (Toole et al., 2021). The process of data collection, cleaning, merging, and final construction is illustrated in the flowchart in Fig. 1.
There are three types of patents granted in the United States: utility patents, design patents, and plant patents. Among these, utility patents, also known as “patents for invention,” are the most common and are widely used in research as a proxy for overall inventive activity in the U.S. (Griliches, 1990; Sampat, 2018; Seymore, 2013; Wang and Lobo, 2024; Youn et al., 2015).Footnote 3 Accordingly, we focus exclusively on utility patents for this study. Also, the dataset includes two types of technology classifications. The first is the Cooperative Patent Classification (CPC) system. The second is the sector and field classification scheme developed by the WIPO.Footnote 4
The initial dataset includes all utility patents filed between 1976 and September 30, 2024.Footnote 5 For our main analyses, we selected utility patents filed from 1976 through 2021. This subset includes 7,822,530 granted utility patents, 20,228,153 inventor-patent instances, and 3,834,487 unique inventors. Utility patents filed after 2021 are used only to compute forward-looking metrics such as citations, which require several years of data accumulation post-grant.
The USPTO routinely receives applications from foreign inventors (WIPO, 2021). Because this study focuses on U.S.-based inventors, we further restricted our dataset to 4,052,122 patents (51.8%) that were invented exclusively by U.S. inventors, excluding patents involving any non-U.S. inventors.Footnote 6 This selection was made possible by the inclusion of inventor country information in the dataset; we retained only those inventors whose country designation was listed as “US.” Footnote 7
Of the 3,834,487 distinct inventors in the full dataset, 1,856,323 (48.4%) are located in the United States. Among them, 1,811,777 contributed to patents created exclusively by U.S. inventors, resulting in 3,753,689 utility patents. This number was further reduced to 3,696,871 in our final dataset (highlighted by the red box in the upper middle of Fig. 1) after excluding patents involving inventors with unknown or missing gender information. This exclusion affected approximately 1.5% of the data, a proportion considered acceptable given the scope of our study.
To supplement our analyses with information not provided by the USPTO, we incorporated three additional datasets:
-
a patent-to-paper citation dataset (Marx, 2024; Marx and Fuegi, 2020, 2022), which links U.S. patents to scientific publications indexed in academic databases;
-
a dataset identifying patents that rely on public funding (Fleming et al., 2019a, b); and
-
a dataset containing the disruption index for U.S. patents (Funk et al., 2022; Park et al., 2023).
The procedures for merging and processing these datasets are detailed in Fig. 1.
Gender attributes
Identifying inventors genders
We employed a three-step approach to identify the gender of each inventor. This process builds upon the pre-generated gender codes developed by Toole et al. (2019), supplemented by an AI-assisted method, as described in this section.Footnote 8
Step #1: Pre-Generated Gender Code Using Name Databases
Because the USPTO does not collect gender information during the patent filing process, we rely on a pre-processed dataset developed by Toole et al. (2019), which uses probabilistic name-gender matching across two large databases: (1) IBM’s Global Name Recognition (GNR) system and (2) WIPO’s Worldwide Gender-Name Dictionary (WGND). The method applies a multi-step classification procedure based on first-name probabilities and inferred country-of-origin information. Full details are provided in Appendix A.
In our dataset, this baseline method successfully assigned a gender to 1,738,265 of the 1,811,777 (95.9%) distinct U.S.-based inventors. Among those identified, 1,509,486 (86.8%) were classified as male and 228,779 (13.2%) as female. The remaining 73,512 inventors either received an undefined label or were missing gender data entirely.
Step #2: Gender-Guesser Python Package
To address the 73,512 inventors with undefined gender information, we implemented two additional steps. The first involved using the GENDER-GUESSER (Pérez, 2016), a Python package for gender inference based on first names. This package draws on data from the “gender” program developed by Jörg Michael, which contains gender information for approximately 40,000 first names across multiple languages and cultures (Gecko, 2009).
The package produces one of six possible outputs for each name: male, mostly_male, female, mostly_female, andy (androgynous),Footnote 9 and unknown.Footnote 10 For the purposes of this study, we grouped male and mostly_male into a single “male” category, and female and mostly_female into a single “female” category. Names classified as andy or unknown were grouped into an “undefined” category. Applying this method to the 73,512 gender-undefined inventors, we were able to classify 15,624 as male and 4656 as female. The remaining 53,232 names remained undefined.
Step #3: LLM-Assisted Gender Classification
For the remaining 53,232 inventors whose gender could not be identified using the previous two steps, we employed a large language model (LLM) to infer gender. LLMs are a form of generative artificial intelligence (GenAI) capable of contextual text analysis, reasoning, and content generation. In recent social science research, LLMs have been increasingly adopted for tasks such as data annotation, text generation, information extraction, and analysis, ranging from sentiment analysis (Miah et al., 2024), identification of political affiliations in social media posts (Törnberg, 2024), simulation of social behavior (Kim et al., 2025; Manning et al., 2024), and occupational task analysis (Eloundou et al., 2024; Wang et al., 2025). While concerns about transparency and replicability in using LLMs for research persist (Lin and Zhang, 2025), we argue that gender inference from names is a particularly suitable use case for LLMs, given their training on vast textual corpora that include culturally contextualized names.
For this task, we used GPT-4o-mini,Footnote 11 a cost-effective LLM developed by OpenAI, accessed via API. Prior to deployment, we engaged in iterative prompt engineering using randomly sampled names in OpenAI’s Playground environment.Footnote 12 To enhance the model’s contextual understanding, we constructed a dataset that included each inventor’s full name,Footnote 13 location (city, county, state, if available), patent titles, assignee information (including organizational name and location), and other relevant metadata associated with their patent filings.
In our final prompt (see Appendix B), we instructed GPT-4o-mini to infer the inventor’s gender using all contextual information provided. The model was asked to return JSON objects including gender code—“M” for male, “F” for female, and “U” for undefined, along with a brief explanation and a confidence score ranging from 0.0 to 1.0. For example, for the first name “Michael,” GPT-4o-mini responded: “Michael is a common male name” with a confidence score of 0.9. For the name Cuiyu Zhang, it responded: “Cuiyu Zhang is likely female based on the name’s structure, which is more common for female names in Chinese culture,” with a confidence score of 0.7.
To evaluate the validity of this LLM-assisted approach, we conducted a manual validation using 100 randomly-selected names for which gender had already been assigned by the USPTO (Toole et al., 2019) (Step #1). These names are considered high-confidence cases, as the USPTO approach is more likely to classify names with strong gender associations. We queried the gender of these 100 names using our LLM-assisted method and observed a 94% agreement between GPT-4o-mini and the USPTO-assigned gender codes.Footnote 14 Therefore, we concluded that GPT-4o-mini, when used with our prompt and contextual data, offers a level of accuracy comparable to the USPTO method, demonstrating that the LLM-assisted approach can, to a reasonable degree of accuracy, infer gender based on inventor names and accompanying metadata.
Applying this approach to the remaining 53,232 inventors, GPT-4o-mini initially classified 33,940 as male (63.7%), 3470 as female (6.5%), and 15,822 (29.7%) as undefined. Given that many names are genuinely gender-neutral or culturally ambiguous, and recognizing the impossibility of perfect classification, our goal was not to label every inventor but to do so with reasonable confidence. To that end, we retained only those gender inferences with a confidence score of 0.7 or higher, and assigned all others to the “undefined” category ("U”).
After this filtering step, the revised gender composition of the 53,232 previously unclassified inventors was as follows: 33,432 male (62.8%), 3425 female (6.4%), and 16,375 undefined (30.7%). A detailed breakdown of the number of inventors identified at each step is provided in Table 1.
Gender characteristics at the patent level
To enable patent-level analyses, we aggregated inventor gender information to the level of individual patents. We first removed 56,818 patents that included at least one inventor with an undefined gender, as the gender composition of these teams could not be accurately determined. These gender-unidentifiable patents represent only 1.5% of the entire dataset, a level of missingness that is widely considered acceptable in quantitative research. While this missingness is not completely random, it is attributable to systematic limitations in gender identification tools and does not pose a significant risk of bias to our analysis.Footnote 15
For the remaining 3,696,871 patents, we assigned the following three sets of gender-related attributes:
-
To examine whether the presence of women on a team influences invention characteristics, we created a binary variable, “with_women,” which takes the value 1 if at least one inventor is female, and 0 otherwise.
-
To study gender collaboration patterns, we constructed a categorical variable, “gender_collab_type,” indicating the gender composition of the inventor team. This variable takes one of three values: all-women team, all-men team, or mixed-gender team.
-
We also counted the number of male and female inventors on each patent. If no female inventors were present, the count of female inventors was set to 0.
Assignee types
Identifying organizational assignee types
An assignee refers to the individual or organization that holds the legal ownership rights to a patent. To investigate differences in gender composition and disparities across organizational types, we classified organizational assignees into three categories: government, universities, and private companies.
We began with 8,293,198 patent-assignee pairs (see Fig. 1), involving 492,670 unique assignees. From this pool, we selected 216,149 unique assignees associated with patents granted between 1976 and 2021 that were invented exclusively by U.S.-based inventors.Footnote 16
To classify these assignee organizations, we used a multi-step process combining structured data from the PatentsView dataset with an LLM assistance. First, we utilized pre-assigned assignee type codes to group entities into corporations, governments, and individuals. Next, for assignees without a valid type code, we applied GPT-4o-mini to classify organizations based on their name, patenting activity, and geographic information. Finally, because the USPTO often classifies universities under the broader category of corporations, we implemented an additional LLM-assisted step to identify universities within this group and distinguish them from private companies. The detailed classification procedure is provided in Appendix C.
After consolidating the results from all three steps, our final classification includes 924 (0.4%) government assignees, 2595 (1.2%) universities, 212,396 (98.3%) corporations, 175 individuals, and 59 unidentifiable assignees.
Classifying patents by assignee type
A single patent may have multiple assignees, potentially spanning different assignee types. Since our analysis focuses on whether patents owned by governments and universities differ from those owned by corporations in terms of gender composition and patent quality, we prioritized identifying government and university ownership.
We first classified a patent as government-owned if at least one of its assignees was identified as a government entity. Among the remaining patents, we then classified those with at least one university assignee as university-owned. The rest were classified based on their assignee type: corporation, individual, unidentified, or null (indicating missing assignee information).
This process resulted in 40,595 (1.1%) government-owned patents, 145,698 (3.9%) university-owned patents, 2,950,992 (79.8%) corporation-owned patents, 170 patents assigned to individuals, 61 unidentified patents, and 559,355 patents with null assignee information.Footnote 17
Ex-ante patent characteristics
To examine differences in the characteristics of patents produced by inventor teams with varying gender compositions, we compiled a set of widely used patent indicators. These indicators are divided into two categories: (1) ex-ante patent characteristics, described in this section, which capture attributes known at the time of patent filing or reflect the nature of the knowledge embedded in the patent; and (2) ex-post patent characteristics (Section “Ex-post patent characteristics”), which represent outcomes or impacts observed after the patent has been granted.
Patent scope
Patent scope refers to the breadth of technological domains an invention spans, typically measured by the number of distinct classification codes assigned to a patent. Broader scope often reflects greater technological complexity and integration across multiple knowledge areas, and has been associated with increased firm value and innovation performance (Barbieri et al. 2020; Lerner, 1994; Novelli, 2015; Wang et al. 2024). Historical trends also suggest a rise in scope over time, especially since the mid-20th century (Youn et al. 2015).
To measure patent scope, we use the Cooperative Patent Classification (CPC) system adopted by the USPTO. Each patent is assigned one or more CPC codes, which represent the non-trivial technical components disclosed in the invention (USPTO, 2015).Footnote 18 Following prior literature (Barbieri et al. 2020; Strumsky et al. 2012; Wang et al. 2024; Youn et al. 2015), we define the technological scope of a patent as the number of unique 4-digit CPC subclass codes it receives. Each subclass code corresponds to a distinct technical fields.Footnote 19
While our main focus is on 4-digit CPC codes, we also counted the number of full-digit subgroup codes as a control variable to account for the level of technical granularity.Footnote 20
To account for variation in team size, we also compute per-inventor scope by dividing the number of 4-digit CPC codes by the number of inventors listed on each patent.
Originality
To assess the diversity of knowledge sources incorporated into an invention, we computed the originality index, a widely adopted metric in patent studies (Breitzman and Thomas, 2015a; Hall et al., 2001; Hasan et al., 2009; Jaffe and Trajtenberg, 2002; Squicciarini et al., 2013; Su, 2022). This index measures the extent to which a patent builds upon prior inventions from different technological fields, based on its backward citations.
The originality score for a given patent p, denoted as Op, is calculated as:
where spt represents the share of backward citations made by patent p that fall under CPC 4-digit code t, and Np is the number of distinct 4-digit CPC codes among all cited patents. A higher Op value indicates that the cited prior art spans a broader range of technical fields, suggesting greater originality. In contrast, if a patent cites only prior work from a single field, the originality score Op equals zero.
Combinatorial novelty
Inventions are often conceptualized as novel combinations of existing technologies (Arthur, 2009; Schumpeter, 1934; Wagner and Rosen, 2014; Youn et al., 2015). Although novelty is a formal requirement for patentability (35 U.S. Code § 102), its evaluation by patent examiners is inherently subjective and context-dependent, making it difficult to quantify systematically. To address this challenge, researchers have developed empirical methods to measure the novelty of knowledge recombinations in patents and scientific publications, demonstrating their predictive value for future impact (Kim et al., 2016; Mukherjee et al., 2016; Uzzi et al., 2013; Youn et al., 2015).
We adopt a classification-based method developed by Strumsky and Lobo (2015) to assess the combinatorial novelty of patents. This approach categorizes patents according to whether they introduce new technological components (defined using full-digit CPC codes) and/or novel combinations of those components. The taxonomy includes four categories: (1) Origination, where all assigned components are new to the patent system; (2) Novel Combination, which mixes new and existing components; (3) Combination, which uses only previously known components but forms new pairwise combinations; and (4) Refinement, in which both the components and their combinations have appeared in prior patents.
Strumsky and Lobo (2015) reported a steady decline in Origination and Novel Combination patents, with each comprising less than 1% of all patents after 1990. The majority of novel activity in recent decades arises from new combinations of existing components, rather than from entirely new technologies. To reflect this reality and avoid highly imbalanced categories, many studies have adopted a binary definition of novelty, classifying patents as novel if they introduce at least one new pairwise combination of components, regardless of whether the components themselves are new (Verhoeven et al., 2016; Wang et al., 2024). Under this approach, the first three categories defined by Strumsky and Lobo (2015)—Origination, Novel Combination, and Combination—are grouped together as “novel,” while only the Refinement category is considered “non-novel.”
Following this logic, we define a binary indicator of combinatorial novelty, assigning a value of 1 to patents that introduce new components or new combinations (i.e., non-Refinement patents), and 0 to those that introduce neither (i.e., Refinement patents). Using this classification, we identified 1,806,610 patents (48.9%) in our dataset as novel, and 1,890,261 patents (51.1%) as Refinements.Footnote 21
Patents relying on public support
To examine how inventors of different genders rely on publicly funded research, we utilized a dataset of government-supported patents compiled by Fleming et al. (2019b).Footnote 22 This dataset classifies a patent as relying on public research if it meets at least one of the following criteria: (1) the patent is owned by the government, (2) the patent explicitly acknowledges government funding, or (3) the patent cites a prior patent or publication that satisfies either of the first two criteria or is authored by an individual affiliated with a U.S. government institution (Fleming et al., 2019a).
The first two types are considered direct public support, as they reflect formal and financial backing from government sources. The third type captures indirect public support, representing the knowledge spillover effects of publicly funded research. In our analysis, we group both direct and indirect support under a single category (Relying on Public Support) to capture any reliance on knowledge created with public funding, support, or resource.
Using this classification, we identified 654,878 patents (17.7%) as relying on public support, while the remaining 82.3% did not show any such dependence.
Patents relying on science
Scientific research and discovery have long served as key sources of inspiration for inventions and technological advancement. One common method of mapping the flow of knowledge from science to technology is through analyzing patent citations to scientific publications (Jaffe et al., 1993, 2000; Meyer, 2000; Narin and Noma, 1985; Tussen et al., 2000).
In this study, we used an open-access dataset developed by Marx and Fuegi (2020), which links patents granted worldwide to scientific articles published since the 1800s (Marx and Fuegi, 2020, 2022). This dataset identifies approximately 22 million patent citations to science (PCS), constructed through high-precision matching of the non-patent literature (NPL) section in patent documents to entries in bibliographic databases such as MEDLINE, Web of Science, and CrossRef. The authors employed a combination of rule-based parsing, metadata alignment, and manual validation to ensure accuracy and global coverage.
Using this dataset, we examine whether the gender composition of inventor teams influences the extent to which patents rely on scientific knowledge. We focus on two key measures: (1) the number of scientific publications cited in a patent, which reflects the overall degree of reliance on science; and (2) the number of citations per inventor, which adjusts for team size and approximates individual-level engagement with scientific research.
Ex-post patent characteristics
Ex-post patent characteristics are attributes that become observable only after a patent is granted, often reflecting its technological or commercial impact. In this study, we use three widely adopted ex-post measures to assess the realized influence of patents across different inventor team gender compositions: forward citations, the generality index, and the disruption index. These capture, respectively, the extent of a patent’s influence on subsequent inventions, the breadth of that influence across fields, and whether the patent disrupts or consolidates existing citation structures.
Forward citations
Forward citations made by subsequent patents to a focal patent are a widely used proxy for a patent’s technological impact and knowledge diffusion (Barbieri et al., 2020; Fleming and Sorenson, 2004; Hall et al., 2001; Trajtenberg and Jaffe, 2002). Higher forward citation counts generally signal greater influence on future innovations and broader spillover effects (Jaffe et al., 2000).
In this study, we capture forward citation impact using two measures: (1) a count variable representing the number of forward citations a patent receives within five years of its grant date, and (2) a binary variable indicating whether the patent is a “top-cited” or “hit” patent, defined as being in the top 10% of forward citations among all patents granted in the same year. The inclusion of the binary variable accounts for the highly skewed and potentially uneven distribution of citations across patents, which may obscure differences between inventor teams of different gender compositions. This approach follows prior studies by Uzzi et al. (2013) and Mukherjee et al. (2016).
Generality
The generality index captures the extent to which a patent influences a broad range of technological domains (Barbieri et al., 2020; Trajtenberg and Jaffe, 2002). A higher generality score indicates that a patent is cited by subsequent patents from a wider variety of fields, reflecting its cross-domain applicability.
In this study, we adopt a modified version of the generality index proposed by Squicciarini et al. (2013), calculated over a five-year citation window. Let Yp represent the set of patents that cite focal patent p within five years of its grant date. For each citing patent y ∈ Yp, let My be the set of full-digit CPC codes, and Ny be the corresponding 4-digit CPC codes.
The union of all 4-digit CPC codes across citing patents is denoted as:
The generality index Gp for patent p is computed as:
where βyn represents the share of full-digit CPC codes in y that fall under the 4-digit code n, calculated as:
Here, Tyn is the number of full-digit codes in y that map to a 4-digit code n, and ∣My∣ is the total number of full-digit codes in y.
This index ranges from 0 to 1, with higher values indicating that the citing patents span a more diverse set of technological classes, suggesting broader generality of the cited patent.
Disruption
While forward citations measure the volume of a patent’s subsequent influence, and generality captures the breadth of its impact across fields, neither metric reflects how a patent affects the direction of future innovation. Specifically, they do not indicate whether a patent reinforces existing technological trajectories or breaks away from them to establish new paths. Therefore, we incorporate the Consolidating-or-Disruptive index (CD index), a measure that captures how much a patent destabilizes the existing knowledge structure by redirecting the flow of citations (Funk and Owen-Smith, 2017).
The CD index ranges from −1 to 1, with positive values indicating greater disruption. A patent is considered disruptive if future patents tend to cite it but not its predecessors, suggesting that it redirects the flow of knowledge and introduces a novel technological direction. In contrast, a consolidating patent is one that is frequently co-cited with its prior art, reinforcing existing knowledge structures and extending established technological trajectories without initiating a new path (Park et al., 2023; Wu et al., 2019).
Formally, the CD index at time t is computed as:
where fi indicates whether a later patent i cites the focal patent, bi indicates whether it cites one of the focal patent’s predecessors, and nt is the number of patents citing either the focal patent or its predecessors as of time t (Park et al., 2023).
We use a pre-computed dataset of CD indices for patents granted between 1976 and 2010 (Funk et al., 2022), which includes three time horizons: five-year, ten-year, and cumulative as of 2017.Footnote 23 To maintain consistency with our forward citation and generality analyses, we focus on the five-year CD index (CD5).
In addition to using the continuous CD index, we construct a binary “disruption hit” or “top disruptive” indicator that equals 1 if a patent’s CD5 score falls within the top 10% of its grant-year cohort, and 0 otherwise. This binary measure captures the distributional skewness of disruption and allows us to examine whether certain inventor team characteristics are associated with an increased likelihood of producing highly disruptive patents (Mukherjee et al., 2016).
Descriptive statistics for variables discussed in Section “Data” are presented in Table 2.
Methods
Regression models
To address RQs 1–4, we estimate a set of linear regression models to examine whether the presence or absence of female inventors is associated with differences in patent characteristics and outcomes. All models use patent-level indicators as dependent variables (denoted as \(Indi{c}_{p}^{A}\)) and are estimated with and without field-level fixed effects based on WIPO technology classifications.
Model #1: Baseline model: with vs. without women
Model #1-1: Baseline model without field controls
This model (Equation (6)) uses a binary indicator variable \(With.Wome{n}_{p}^{0,1}\) as the independent variable, which equals 1 if a patent includes at least one female inventor, and 0 otherwise. No field fixed effects are included.
Model #1-2: Baseline model with WIPO field controls
This model (Equation (7)) includes the same independent variables as Model #1-1, with the addition of field-level fixed effects (\(Fiel{d}_{p}^{0,1}\)) based on WIPO technology categories.Footnote 24 Field fixed effects are included to account for systematic differences across technological domains, such as variation in patenting practices, team composition norms, or citation behaviors, that may confound the relationship between gender composition and patent outcomes. Controlling for field-specific characteristics helps isolate the effect of gender composition from underlying differences across technology areas.
Model #2: Gender collaboration types
Model #2-1: Gender collaboration types, without field controls
Recognizing that patents involving women can reflect different team structures—either all-female teams or mixed-gender teams—we define \(Gender.Collab.Typ{e}_{p}^{0,1}\) as a categorical variable capturing the type of gender composition.Footnote 25 This model (Equation (8)) assesses whether gender collaboration type explains variation in patent outcomes.
Model #2-2: Gender collaboration types, with field controls
This model (Equation (9)) extends Model #2-1 by controlling for field-level differences using WIPO field indicators.Footnote 26
All models in Model #1 and Model #2 are controled for filing year (Filing. Yearp). In all these models, control variables (Controlsp) differ slightly depending on the indicator, but all models include four common controls: the number of backward citations, the number of claims, team size, and technological maturity. Backward citations capture how many prior patents are cited by the focal patent, serving as a proxy for the depth of prior knowledge integration and often correlating with patent value and significance (Harhoff et al., 2003; Thompson, 2016).Footnote 27 The number of claims can reflect the invention’s potential commercial value, and has been linked to technological breadth and firm performance (Lanjouw and Schankerman, 2004; Novelli, 2015; Tong and Frame, 1994). Team size, defined by the number of inventors, is associated with patent quality and scope, as larger teams tend to generate more influential patents (Breitzman and Thomas, 2015b; Wu et al., 2019).
Technological maturity is measured as the natural logarithm of the cumulative number of patents filed in the primary full-digit CPC subclass up to the filing year, indicating the developmental stage of the technology domain (Barbieri et al., 2020). More mature fields typically offer a broader landscape of prior technologies to build upon. To address potential issues with taking the logarithm of zero, one is added to the cumulative count prior to transformation (Feng et al., 2014).
In addition to these common controls, models for the scope, scientific papers cited, and novelty include the number of full-digit CPC codes to account for variation in technical component size and complexity. For the generality and disruption indicators, we also control for forward citations within 5 years windows to adjust for differences in exposure and citation opportunity.
Model #3: assignee type and gender disparity
To address RQ5, we estimate regression models where the dependent variable is a binary indicator of whether a patent includes at least one woman inventor, and the key independent variable is the assignee type (see Section “Assignee types”). We estimate two versions of the model: one without field-level controls (Model #3-1) and one with field-level fixed effects based on WIPO technology classifications (Model #3-2), as specified in Eqs. (10) and (11), respectively.
In both models, we control for filing year, team size, number of claims, number of full-digit CPC codes, and backward citations.
Model selection
The selection of regression models was guided by the data type and distribution of each patent indicator. Ordinary least squares (OLS) regression was used for continuous indicators, such as originality, generality, and disruption. Binary indicators, such as novelty and public support, were analyzed using logistic regression. For count-based indicators, we distinguished between two cases. Poisson regression was applied to count variables without zero values, such as scope. In contrast, for indicators that include zero counts, such as forward citations and the number of scientific papers cited, we employed negative binomial regression to account for overdispersion.Footnote 28
Sampling strategy
For Model #3, we estimate regression models only on the full dataset. For Models #1 and #2, we supplement full-sample regressions with an undersampling approach to address the significant class imbalance between patents with and without women inventors. Given that patents without women account for 82.4% of the data, this imbalance may introduce bias and reduce the sensitivity of models to detect patterns associated with underrepresented groups. Undersampling the majority class (patents without women) helps balance the sample, potentially improving both model interpretability and predictive performance (Spelmen and Porkodi, 2018; Yang et al., 2021). Specifically, we randomly sample a subset of patents without women to match the number of patents with women and re-estimate the regression models on the balanced dataset. To assess the robustness of our findings, we repeat this sampling process ten times using different random seeds, generating distinct subsets and running the same regression model on each.
To address RQ4, which examines whether gender disparities persist across different domains, we selected five representative fields and applied the same set of regression models to field-specific subsets of patents. The selected fields are (1) civil engineering, (2) artificial intelligence (AI), (3) nanotechnology, (4) biotechnology, and (5) green technology (or climate change mitigation and adaptation technology, CCMA technology). These fields were chosen to capture variation in disciplinary culture and gender representation. Civil engineering is a traditionally male-dominated field (Manesh et al., 2020);Footnote 29 AI is an emerging domain with rapidly evolving inventor demographics (Huang et al., 2023);Footnote 30 nanotechnology has received considerable policy and scholarly attention regarding its development (Hulla et al., 2015);Footnote 31 and biotechnology is known for relatively higher female participation compared to other STEM fields (McMillan, 2009).Footnote 32 Green technology is included due to its growing importance in addressing climate change and its increasing visibility in sustainability-focused innovation policy. Given the strong policy incentives and societal relevance (Wang et al. 2024), this field may attract a more diverse pool of inventors and provide insight into whether gender disparities persist even in areas aligned with global sustainability goals.Footnote 33
In total, we conducted 64 sets of regression analyses across the four main models. Each model was estimated in 16 different settings: one using the full dataset, ten using undersampled data to address class imbalance, and five applied to the five selected technology domains. Within each setting, we ran separate regressions for each of the 12 dependent variables described in Sections “Ex-ante patent characteristics” and “Ex-post patent characteristics”.
Results
Descriptive statistics
Our results show that patenting in the U.S. is overwhelmingly dominated by men. It is estimated that approximately 51% of the U.S. population is female,Footnote 34 and women account for 47% of the U.S. workforce (U.S. Census Bureau, 2023).Footnote 35 However, within the 1.78 million distinct U.S. inventors, only 232,534 (13.1%) are women. Among the 3.7 million U.S. patents in our dataset, only 651,327 (17.6%) are filed with at least one female inventor. In these 651,327 patents, most of them are created with both women and men (551,785, 84.7%), and only 99,542 are created by only women, accounting for 2.7% of the patents in our dataset.
The summarty statistics of numerical variables conditioned on gender collaboration is shown in Table 3,Footnote 36 and the time series of patents’ characteristics are presented in Fig. 3. The simplified regression results of Models #1 and #2 are presented in Table 4, with full results available in Appendices E and F. An illustrative visualization of the main findings from these two models is shown in Fig. 6. The results of Model #3 are reported in Table 6. Additionally, we provide an alternative specification as a robustness check in Appendix G.
Women inventors are growing, but still underrepresented
Undeniably, women’s participation in patenting has increased over the past four decades. The share of patents filed with at least one female inventor rose from 4.4% in 1976 to 25.8% in 2021 (blue line in Fig. 2). This growth has been driven primarily by mixed-gender collaborations—patents involving both male and female inventors—reaching 22.8% in 2021. In contrast, patents produced solely by all-women teams remain rare and have even declined during the period under analysis, consistently fluctuating below 3% and showing a noticeable drop in the 2000s (orange line in Fig. 2).
Time series of women’s participation in U.S. patenting, 1976-2021. The blue line shows the percentage of patents with at least one female inventor. The orange line shows the percentage of patents invented exclusively by all-women teams. The green line indicates the percentage of inventor-patent pairs in which the inventor is a woman. The red line reflects the share of distinct inventors who are women.
The overall percentage of female inventors has also grown over time (green line in Fig. 2). In 1976, less than 2% of inventors were women. By 2021, this figure had risen to over 12%. Despite this progress, the gender gap remains substantial. Moreover, the pace of change appears to have slowed in recent decades: while the share of women inventors more than doubled from 3.5% in 1980 to 9.1% in 2000, it increased only modestly from 9.1% to 11.8% between 2000 and 2020.
Mixed-gender teams cite more science and better leverage public support
In response to RQ1, our findings demonstrate that the gender composition of inventor teams significantly influences their use of scientific knowledge and public support. Descriptive statistics in Table 3 and time series patterns in Fig. 3D, F, and G offer preliminary insights, while Table 4 summarizes the regression results from Models #1 and #2.Footnote 37
Time series panels of 16 selected characteristics of U.S. patents, grouped by gender composition of inventor teams. Blue lines represent patents by all-female teams, green lines by all-male teams, red lines by mixed-gender teams, and orange lines by patents with at least one female inventor. Note that the orange category includes both the blue and red groups.
Reliance on science
Specifically, patents filed by all-male teams cite an average of 4.3 scientific papers. In contrast, all-female teams cite 6.2 papers on average (~44% more), and mixed-gender teams cite as many as 13.2 papers. The regression results from Model #1 confirm this pattern (Table 4). The coefficient for the variable With. Women is positive and statistically significant in the full sample and across five representative domains, with p < 0.001 in all but two fields (AI and nanotechnology). This suggests that the presence of women is consistently associated with increased use of scientific knowledge in the inventive process. Specifically, patents with at least one female inventor cite 99.5% more scientific papers than those without, holding all else constant. When controlling for technical fields, the increase remains statistically significant at 9.9%. Among the five focal domains, green technology exhibits the most substantial effect, with patents by teams including women citing 136.3% more scientific papers.
Results of Model #2, which distinguishes among all-male, all-female, and mixed-gender teams, reveals a more nuanced gradient. Mixed-gender teams consistently cite the most scientific papers, even after accounting for team size. All-female teams follow, and all-male teams cite the fewest. Without field controls, all-male teams cite 51.4% fewer papers than mixed-gender teams, and all-female teams cite 21.5% fewer. This ranking remains stable across all five technical domains.
Reliance on public support
When it comes to the use of public support, a more complex picture emerges. Mixed-gender teams are most likely to rely on public funding—30.8% of their patents reference government-sponsored research or benefit from direct government support. In comparison, only 21.9% of all-male and 18.1% of all-female patents do so.
Previous studies have documented a general rise in U.S. patents relying on federally funded science (Fleming et al., 2019a). Our data supports this trend. As shown in Fig. 3D, the proportion of gender-collaborative patents relying on public support rose from 16.4% in 1980 to 31.5% in 2012. In the same year, 25.3% of male-invented patents and 20.3% of female-invented patents were linked to public support.Footnote 38
Model #1 shows that, without controlling for field, the presence of women in the inventor team increases the odds of public support utilization by 13.3%. However, when technical fields are controlled, the effect slightly reverses: women’s presence is associated with a 1.8% reduction in the odds. Model #2 indicates that both all-female and all-male teams are less likely than mixed-gender teams to utilize public support, with reductions of 29.2% and 16.9%, respectively.Footnote 39 This suggests that mixed-gender collaboration is particularly conducive to drawing on publicly supported science.Footnote 40
Finally, we find that women inventors in universities are significantly more likely to utilize publicly supported research than their counterparts in corporate settings. Overall, 19.3% of patents associated with public support include at least one female inventor. However, in corporate-owned patents, only 18.6% of those involving public support include women, whereas in university-owned patents, this number rises to 31.9%.Footnote 41 Looking from another angle, when women are on the inventor team, only 25.8% of corporate patents use public support, while in universities, as high as 73.7% of women-participated patents are public-supported. This suggests that women in academia are significantly more likely to engage with publicly funded research in their inventive activity—nearly three times more than women in industry, and even more so than their male colleagues in academia.
In response to RQ1, our analysis shows that inventor teams with different gender compositions utilize resources in distinct ways: mixed-gender teams consistently cite the most scientific knowledge and are most likely to leverage public support. In contrast, all-female and all-male teams cite fewer scientific papers, with all-female teams generally citing more than all-male teams, while public support is least utilized by all-female teams in industry but most utilized by women in academia.
Gender composition shapes the scope, novelty, and originality of patents
Addressing RQ2, which examines how gender composition relates to the ex-ante knowledge characteristics of patents, we find systematic differences across three dimensions: scope, novelty, and originality.
Scope
Descriptive statistics suggest that patents from mixed-gender teams exhibit the broadest technological scope, as measured by both 4-digit and full-digit CPC codes (Table 3; Fig. 3H, J). All-male teams tend to have slightly broader scope than all-female teams overall.Footnote 42 Since a broader technological scope is often correlated with team size (larger teams typically bring more diverse expertise), it is important to control for this factor.Footnote 43 Mixed-gender teams, by definition, must include at least two inventors and often tend to be larger on average. After controlling for team size and other covariates in Model #1 and Model #2, we find that mixed-gender teams still exhibit slightly but significantly broader scope than same-gender teams. Specifically, patents from all-female teams have 3.9% fewer 4-digit CPC codes, and all-male teams have 3.1% fewer, compared to those from mixed-gender teams. However, this effect does not appear in certain technical fields, such as AI, biotechnology, nanotechnology, and green technology, where gender composition does not significantly affect scope.
Interestingly, when scope is normalized by team size (i.e., 4-digit CPC codes per inventor), Model #1 reveals that the inclusion of women is slightly negatively associated with per-inventor scope. However, this finding is clarified in Model #2, which shows that the lower per-inventor scope in mixed-gender teams drives this relationship. In fact, all-female teams exhibit the highest scope per inventor, followed by all-male teams, with mixed-gender teams showing the lowest. Specifically, on a balanced subset controlling for WIPO fields, all-female teams have 0.61 more 4-digit CPC codes per inventor than mixed-gender teams, while all-male teams have 0.29 more than mixed-gender teams. This trend holds consistently across the five representative domains. These findings suggest that while mixed-gender teams cover more ground collectively, all-female teams may contribute more technical breadth per individual.
Novelty
Our analysis indicates that women’s participation in inventor teams is associated with higher novelty in patents. Model #1 shows that, controlling for WIPO technical fields, including at least one female inventor increases the odds of a patent being novel by 8.7%. This effect is especially pronounced in civil engineering, where the odds increase by 15.5%.
Model #2 further reveals that while the effect of all-female teams on novelty is generally not statistically significant, all-male teams consistently reduce the likelihood of novelty. For instance, on a balanced subset of the data with WIPO field controls, patents from all-male teams are 8.8% less likely to be novel than those from mixed-gender teams. Again, this effect is strongest in civil engineering, where the reduction reaches 13.9%. These results suggest that including women in inventor teams—particularly in collaborative, mixed-gender settings—enhances the likelihood of producing novel inventions.
Originality
Originality, measured as the technological diversity of prior art cited, shows less variation across gender compositions. Figure 3K and descriptive statistics in Table 2 indicate that mixed-gender teams tend to have slightly higher originality scores (mean = 0.663), followed by all-male teams (mean = 0.650), and then all-female teams (mean = 0.642). Model #1 confirms a small but statistically significant effect: including women increases originality scores by approximately 0.005 points, with p < 0.001.
Model #2 shows that while mixed-gender teams have the highest average originality, the gap between all-male and all-female teams is minimal, with all-female teams slightly outperforming all-male teams. However, the pattern varies across technical domains. In civil engineering, including women increases originality, but the opposite is observed in AI, biotechnology, and green technology. In these latter fields, all-male teams tend to produce more original patents, though the differences are modest, and the p-values for nanotechnology are not statistically significant.
In sum, mixed-gender teams consistently produce patents with broader overall technological scope, greater novelty, and slightly higher originality than same-gender teams. However, when considering scope on a per-inventor basis, all-female teams outperform both mixed-gender and all-male teams. This indicates that while mixed-gender collaboration expands collective technical coverage, all-female teams may engage more deeply or broadly per contributor. Additionally, women’s participation, especially in collaborative or academic settings, enhances the likelihood of producing novel and original inventions, particularly in domains such as civil engineering.
Women-invented patents receive fewer citations but are more disruptive
Addressing RQ3, we examine how gender composition relates to the impact of patents, including forward citations, generality, and disruption. Our findings reveal important differences in how patents from teams with different gender compositions are recognized and influence subsequent innovation.
Forward citations
Our analysis shows that patents involving women inventors receive significantly fewer citations than those created by all-male teams. This difference is consistent over time (see Fig. 3L, O). On average, patents from all-male teams receive 3.17 citations within five years of filing, compared to only 2.08 citations for patents from all-female teams.
Regression results from Model #1 confirm this pattern (Table 4): including women in inventor teams is associated with a 10.7% reduction in forward citations. This effect is particularly pronounced in civil engineering and biotechnology, where citations decline by 14.6% and 22.2%, respectively, when women are included. However, including women slightly increases the likelihood of a patent being in the top 10% most cited patents by 0.8%. Model #2 further reveals that all-male teams receive 11.8% more citations than mixed-gender teams, while all-female teams receive 0.9% fewer citations than mixed-gender teams. Similarly, the probability of being among the top 10% cited patents is highest for all-male teams (1.3% higher than mixed-gender), and slightly lower for all-female teams, although the differences are small.
An exception to this general pattern is observed in green technology. In this domain, all-female teams produce the most cited patents, receiving 17.5% more citations than mixed-gender teams. All-male teams also perform better than mixed-gender teams in green technology, with a 9.8% citation advantage.
Generality
Generality (ranging from 0 to 1) captures the breadth of a patent’s impact across different technological fields (see Section “Generality”). Regression results from both Model #1 and Model #2 indicate that patents involving women are slightly more general, with an average increase of 0.0073. Among all team types, mixed-gender teams produce patents with the highest generality. When controlling for WIPO technology fields, all-female teams show slightly higher generality than all-male teams, though still lower than mixed-gender teams. These findings suggest that including women, particularly within gender-diverse teams, enhances the cross-domain influence of patented inventions.
Disruption
In terms of disruption (measured using the five-year disruption index, see Section “Disruption”), average values are 0.101 for all-male teams, 0.094 for all-female teams, and 0.065 for mixed-gender teams. Although the raw averages suggest that all-male teams produce slightly more disruptive patents, this relationship reverses when controlling for relevant variables. Model #1 results show that including women in inventor teams slightly increases the disruption score by 0.01. This effect is particularly evident in biotechnology and green technology, though not significant in the other three representative domains. Model #2 reveals that mixed-gender teams produce more disruptive patents than same-gender teams, even after accounting for field and team size. Among same-gender teams, all-female teams generate more disruptive patents than all-male teams.Footnote 44
Although including women slightly decreases the likelihood of a patent being in the top 10% of disruption scores (by 1.4%), all-female teams, surprisingly, have the highest odds of producing top-disruptive patents. On a balanced dataset with WIPO field controls, all-female teams are 8.7% more likely than mixed-gender teams to be in the top decile of disruption, while all-male teams are 8.1% less likely.
Women in universities are more likely to invent
Prior research has found that women in academia are more likely to engage in patenting activity than women in other sectors (Sugimoto et al., 2015). Addressing our RQ5, we extend this evidence through an updated analysis using patent data through 2021.
As shown in Table 5, corporate-owned patents—while making up 96.0% of all utility patents—have the lowest rate of women participation, with only 17.9% involving at least one female inventor. Government-owned patents show slightly higher participation at 19.8%. University-owned patents, however, exhibit substantially greater gender inclusion: 33.6% involve women inventors, nearly double the rate seen in corporate patents. This pattern is confirmed by the regression results in Model #3 (Table 6). Whether or not technological fields are controlled for, the relationship remains consistent: government-owned patents are 37.8% more likely to include women inventors, and university-owned patents are 112.3% more likely to do so, and both effects are statistically significant at p < 0.001.
Further supporting this pattern, Table 1 shows that only 13.1% of inventors in the overall dataset are women. This share drops to just 9.1% among inventors on corporate patents but rises to 16.0% for inventors on university-owned patents, representing a 76% increase relative to corporations.
This trend is also reflected at the individual organization level. On average, each university exhibits a higher rate of female inventors (14.1%) and a higher share of patents with at least one female contributor (26.7%), compared to corporations, where the corresponding averages are only 6.8% and 11.1%, respectively, less than half the levels observed in universities.
As discussed in Section “Women inventors are growing, but still underrepresented”, women’s participation in patenting has increased over time (Fig. 4), and this upward trend is evident across all types of organizations. However, the rate of growth is especially pronounced in universities, where participation is consistently higher than in corporations. As shown in Fig. 4, the percentage of academic patents involving women inventors rose from 8.3% in 1976 to over 40% in 2021. In contrast, only 26.2% of corporate-owned patents in 2021 included women inventors.
Nevertheless, some organizations stand out for their success in engaging women in the inventive process. We identified 15,257 organizations whose entire patent portfolios involve women inventors, examples include A.M. Surgical, CoAxia, Sakti3, and In2Bones, many of which are in the biotechnology or medical device sectors. Moreover, 5,101 organizations have inventor teams that are over 80% female. Among them are companies such as Chanel Parfums Beauté, Venus by Maria Tash, Hair Blast, Martha Stewart Living Omnimedia, and Lashify, many of which produce beauty, wellness, or women-oriented consumer products. This suggests that women inventors are not only participating in innovation but are particularly active in developing products that directly serve women consumers, consistent with previous findings by Koning et al. (2021).Footnote 45
Among universities, several institutions stand out for combining high patenting activity with high levels of women’s participation. For example, Hampton University leads with 84.6% (22 out of 26) of its patents involving women inventors, followed by Creighton University (52.5%), the University of North Carolina (49.0%), the University of Kansas (48.2%), and Boise State University (46.8%). Saint Louis University also stands out, with 45.2% of its patents involving women and a high female inventor rate of 32.3%. Other notable universities and research institutions with high levels of women participation include the Wistar Institute of Anatomy and Biology, Albert Einstein College of Medicine, Fox Chase Cancer Center, St. Jude Children’s Research Hospital, the Icahn School of Medicine at Mount Sinai, and Oregon Health & Science University.
The U.S. government agencies also exhibit strong women participation in certain cases. For instance, the U.S. Centers for Disease Control and Prevention (CDC) has 54.4% of its patents involving women inventors. Similarly, the U.S. Department of Veterans Affairs (50.6%), the U.S. Department of Health and Human Services (50.1%), and the U.S. National Institutes of Health, Office of Technology Transfer (46.3%), all show relatively high levels of women's participation in patenting.
Gender disparity varies across technological domains
We found that while women’s participation in all fields has increased, the magnitude of growth varies significantly across domains (Fig. 5).Footnote 46 For instance, in the chemistry sector, women’s participation is relatively high and has grown substantially, particularly in biotechnology, organic fine chemistry, nanotechnology, and pharmaceuticals. By 2021, more than half (55.0%) of biotechnology patents involved at least one female inventor, which is a remarkable milestone.
Moreover, among the top patent-active companies with the highest rates of women participation, a substantial portion operate in biotechnology, pharmaceuticals, nanotechnology, and medical devices. Many of these firms focus on areas such as molecular diagnostics, gene therapy, advanced biomaterials, ultrasound imaging, or targeted therapeutics. Others are active in emerging technology sectors like nanotechnology, energy storage, consumer electronics, entertainment, or software. A number of companies also specialize in wellness, personal care, or protective equipment, domains often associated with caregiving or women’s health. This distribution reinforces the pattern that women inventors are especially active in fields with strong academic linkages and societal relevance, particularly those oriented-toward health, well-being, and inclusive technology.
On the other hand, certain fields continue to show persistently low levels of women’s participation, even after four decades of development. These include Mechanical Elements, Civil Engineering, Machine Tools, Engines, Pumps and Turbines, and Thermal Processes and Apparatus. For example, by 2021, only 9.1% of patents in the field of Mechanical Elements involved women inventors, nearly six times lower than in Biotechnology. Civil Engineering also remains low, with just 12.0% of patents including women contributors.
This pattern aligns with prior findings that women’s representation in science and technology varies across disciplines. For example, women make up a larger share of the workforce in biology and chemistry than in fields like engineering and computer science. According to the National Science Foundation, 46% of biological, agricultural, and life scientists are women, compared to just 16% of engineers (Cheryan et al., 2017). Our analysis suggests that these workforce imbalances are echoed in patenting outcomes: fields with more women professionals tend to produce more patents with female inventors. However, even in relatively gender-balanced fields, women patent at lower rates than men, pointing to persistent structural or institutional barriers.Footnote 47
Artificial Intelligence (purple line in Fig. 5), an important and rapidly emerging field with the potential to transform nearly every sector of the economy and society, exhibits a moderate level of women’s participation. While the share of AI patents with women inventors has grown from less than 5% in 1976 to 30.8% in 2021, this still lags considerably behind fields such as Biotechnology. This indicates that gender disparities persist even in newer, high-impact technological domains. Further underscoring this disparity, we manually inspected the top 100 inventors with the most AI patents and found that only four of them are women, just 4% of the list. This is particularly concerning given AI’s increasing role in shaping not only the future of technology and society, but also the very process of invention itself.Footnote 48
Beyond participation levels, gender disparities in patent characteristics also vary notably across technological domains. Figure 6d illustrates these differences for four representative fields: AI, green technology, biotechnology, and civil engineering.Footnote 49 A consistent pattern across all domains is that patents involving women tend to receive fewer forward citations. However, the nature and magnitude of gender-based differences in other patent characteristics vary by field. For example, in green technology, patents created by women-inventor teams are significantly more likely to draw on public support and cite scientific publications, suggesting higher engagement with public and academic resources, though their impact remains relatively limited. Biotechnology patents with female inventors tend to be more disruptive, indicating a stronger break from existing technological trajectories and greater potential to open new innovation pathways. However, these patents often have narrower technological scope and attract fewer citations. Civil engineering patents involving women, on the other hand, are significantly more original, drawing from a more diverse technological base, and tend to have broader subsequent impact across fields.
a Comparison between patents with female inventors and those without (gray). b Comparison between patents with female inventors and those without (gray) under four conditions. c Comparison among patents created by all-women teams, all-men teams, and mixed-gender teams. d Comparison of patents involving female inventors across four domains with those without female inventors. All values represent normalized regression coefficients (scaled between − 1 and 1) for illustrative comparison. Filled circles indicate statistically significant coefficients (p < 0.05), while unfilled circles indicate non-significant ones.
In summary, gender disparities in patent characteristics are not uniform across technological domains. While women-invented patents tend to receive fewer citations overall, they often exhibit unique strengths in specific fields, such as higher disruptiveness in biotechnology, greater originality and broader impact in civil engineering, and stronger engagement with public and academic resources in green technology. These domain-specific patterns underscore the importance of examining gender disparities beyond aggregate metrics, as women’s contributions to innovation may manifest differently depending on the technological and institutional context.
Discussion
Using data from almost 3.7 million U.S. patents, 1.8 million distinct inventors, and over 200,000 unique patent assignees, this research found that gender disparity in patenting has improved in the past half-century but remains stubborn with respect to many aspects of inventing. In 2021, women accounted for less than 13% of the inventor population and participated in about one-quarter of patents, even though women constitute about half of the U.S. workforce. Women-invented patents receive fewer citations, indicating limited impacts. The disproportionately low participation of women in patents signals limited opportunities and insufficient incentives for women to invent. It may lead to a lack of diverse innovations in society and a shortage of products that benefit women or align with women’s values.
One possible cause for the under-representation of women lies in the systematic gender disparity in patent authorship even with the same job titles, fields, and contribution, and such under-crediting of women is particularly pronounced in highly cited spectrum (Ross et al. 2022). Women’s under-representation in patenting can also be attributed, in part, to their shorter career duration and more frequent departures, as observed in academia, science, and engineering professions (Holliday et al. 2014; Preston, 1994). Research indicates that faculty’s career length directly links to the quantity and quality of their publications (Diamond et al. 2016). Unfortunately, women face disproportionate penalties due to life transitions, resulting in more and longer career breaks that negatively impact their professional trajectories (Arun et al. 2004; Buse et al. 2013; Kyvik and Teigen, 1996). Notably, these factors do not have the same effect on men’s career paths (Xie and Shauman, 2005).
In addition to examining the level of women’s participation, we also investigated gender disparities in patent characteristics through five research questions, as discussed below.
Gender disparity in resource utilization
Addressing RQ1, we used regression analyses to examine how the inclusion of women in inventor teams (Model #1) and different gender compositions (Model #2) influence the utilization of public support and scientific knowledge. We found that all-women teams are less likely to utilize public support than all-men teams, consistent with prior research showing that women are less likely to receive public grants (Holliday et al., 2014; Ley and Hamilton, 2008). This disparity may stem from limited access to funding networks, mentorship opportunities, institutional backing, or the disproportionate assignment of non-inventive tasks to women. In contrast, mixed-gender teams are significantly more likely to utilize public support, particularly direct government funding. The inclusion of women in inventor teams also increases the likelihood of receiving direct public support. Regarding the use of scientific knowledge, all-women teams cite more scientific publications than all-men teams, suggesting stronger engagement with foundational science. Once again, mixed-gender teams lead in both dimensions, citing the most scientific literature and benefiting most from public support.
Gender disparity in patents’ knowledge charactersitics
Addressing RQ2, we compared the ex-ante characteristics of patents, namely, scope, novelty, and originality, across different gender compositions of inventor teams using regression analyses. We found that the inclusion of women significantly increases a patent’s scope, indicating broader technological knowledge. Women-contributed patents are also more likely to be novel, meaning they introduce new components or novel combinations of existing ones. Additionally, these patents exhibit greater originality, as they draw from a more diverse and wide-ranging set of prior technologies. Further analysis reveals that mixed-gender teams outperform both all-male and all-female teams across all three characteristics. This suggests that gender-diverse collaboration may enhance a team’s capacity to explore novel ideas, synthesize knowledge from a broad range of technological domains, and ultimately generate more original and wide-reaching inventions.
Gender disparity in patents’ impact
Addressing RQ3, we examined the gender differences in the impact of patents using regression analyses. Our results show that patents involving women inventors receive significantly fewer citations than those created exclusively by men, and this pattern holds consistently across technological domains. Unlike other patent indicators where mixed-gender teams typically outperform single-gender teams, citation impact appears to be highest for patents generated by all-male teams, suggesting a higher level of conventional influence. This finding aligns with prior research showing that women researchers tend to receive fewer citations than men in both scientific publications and patents. One potential explanation is that male-majority teams are more likely to cite other male-authored work, while female-authored inventions are less likely to be further developed (Ferber et al., 2011; Subramani and Saksena, 2024; Tahmooresnejad and Turkina, 2022). To advance this discussion, we extend the analysis by including two additional indicators of patent impact that are rarely examined in studies of gender disparities—disruption and generality. These metrics go beyond citation volume to capture the breadth and nature of a patent’s influence. We find that although patents with female inventors are cited less often, their inclusion tends to diversify the impact. Specifically, patents with female participation exhibit higher generality scores, indicating their influence spans a broader range of technological areas. Additionally, mixed-gender teams are more likely to produce disruptive patents—those that break with existing technological trajectories and initiate new innovation pathways. These findings suggest that while patents with women may be cited less frequently overall, they contribute meaningfully to technological diversity and long-term innovation dynamics.
Organizational patterns
Addressing RQ4, we examined women’s participation in patenting across different types of organizations—government agencies, universities, and private corporations. Our findings indicate that women in universities are significantly more likely to engage in invention than those in other sectors. Moreover, female inventors in academia appear to be more resourceful: they are substantially more likely to leverage public funding in their patented inventions. This suggests that the academic environment may be more conducive to female inventors’ success. This advantage may stem from universities’ structural characteristics, such as decentralized decision-making, flexible research environments, and cross-disciplinary collaboration, which have been shown to support women in innovation processes (Whittington and Smith-Doerr, 2008). In addition, institutions that proactively implement policies aimed at reducing gender gaps in technology transfer have demonstrated measurable improvements in women’s patenting activity (Giuri et al., 2020), suggesting that targeted interventions can be effective. Extending these insights beyond academia, prior research has shown that companies with more gender-balanced boards produce more patents, and those patents tend to be more novel and inventive (Griffin et al., 2021). This implies that increasing gender diversity in leadership and decision-making roles within corporations may help reduce disparities in inventorship and improve overall innovation outcomes.
Domain-specific characteristics
Addressing RQ5, we examined how gender disparities manifest across five representative technological fields (AI, nanotechnology, biotechnology, green technology, and civil engineering), each of which holds strategic economic value and addresses pressing global challenges such as climate change. We found that gender disparities vary substantially across domains, both in terms of participation rates and patent characteristics.
In green technologies, women inventors appear more resourceful than men, as evidenced by significantly higher utilization of public support and more frequent citation of scientific knowledge. Prior studies from China and Europe have shown that the presence of women in R&D teams and executive leadership is positively associated with the development of high-quality green innovations, particularly in green product innovation (Chu, 2024; Dal Molin and Leggerini, 2024; He and Jiang, 2019). Our findings provide complementary evidence, suggesting that women’s contributions in green technology not only reflect societal priorities but also leverage public and scientific resources more effectively.
In AI, women’s participation remains modest to low, consistent with recent findings (Giczy et al., 2024), which also suggest that women are more likely to work in diverse teams that generate higher economic value. While we observe a few differences in patent characteristics by gender in AI, the low representation of women is notable given the strategic importance of AI innovation. National advisory reports have emphasized that demographic, institutional, and geographic diversity in AI is essential for long-term competitiveness and societal alignment (Eric and Work, 2021). Expanding women’s involvement in AI could thus support a more inclusive and resilient innovation ecosystem.
In biotechnology, women’s participation in patenting is relatively high. Prior research found that although women are less likely than men to patent in life sciences, particularly in commercial biotechnology patents, there are no substantial differences in patent quality, and in some cases, women’s patents are of higher quality (McMillan, 2009; Whittington and Smith-Doerr, 2005). Our findings extend this work by showing that women-invented biotech patents are, although less cited and less resource-intensive, significantly more disruptive, indicating a stronger tendency to challenge and transform existing technological paradigms.
Prior research points to the role of early-life exposure in shaping inventive careers. For instance, growing up in regions with more female inventors in specific fields increases the likelihood that young women will pursue inventions in those same areas (Bell et al., 2019). High school invention education has also been shown to significantly encourage young women to patent (Couch et al., 2018). It is possible that the existence of a domain-specific positive feedback loop: fields with higher female participation may foster conditions that attract and retain more women inventors over time. Our findings support this hypothesis, particularly in Biotechnology, where women’s patenting rates are consistently higher and innovation tends to be more disruptive.
In civil engineering, despite being the field with the lowest female participation in patenting (Section “Gender Disparity Varies Across Technological Domains”) and historically male-dominated (Johnson, 2013; Maurer et al., 2021), women-invented patents stand out in terms of quality. These patents are more novel, draw from a broader technological base, and exert more diverse impact. Although they receive fewer citations, they show strong potential for technological advancement. This aligns with earlier findings that, among all engineering disciplines, civil engineering is the only one in which scientific articles first-authored by women receive significantly more citations (Ghiasi et al., 2015). Our results suggest that even in domains with extreme gender imbalances, women can produce high-impact, boundary-pushing innovations.
By contrast, in nanotechnology, we did not observe significant gender differences across patent indicators. This suggests a more gender-neutral performance landscape in the domain, although it may also reflect the field’s relatively recent emergence and evolving institutional dynamics.
Conclusion
This study offers one of the most comprehensive empirical investigations of gender disparities in patenting to date, examining more than four decades of U.S. utility patents with a particular focus on inventor gender composition, patent quality, institutional context, and technological domain. Our findings reveal persistent gender disparities—not only in the participation of women in patenting, but also in the types of contributions they make, the resources they utilize, and the impacts their patents generate.
We found that women inventors remain significantly underrepresented, particularly in corporate patents and male-dominated fields such as mechanical engineering and civil engineering. While participation rates have improved, women still contribute to only a minority of patents, and all-women inventor teams remain exceedingly rare. Nevertheless, our regression analyses indicate that including women, especially within mixed-gender teams, is associated with broader scope, greater originality, and, in some domains, higher disruptive potential. These results challenge simplistic assumptions that associate patent impact solely with citation volume and instead suggest that gender-diverse teams may enrich innovation by fostering more interdisciplinary and novel contributions.
Our study also highlights the importance of institutional context: women are significantly more likely to invent in university settings, where patents are more likely to cite scientific knowledge and utilize public funding. Universities may offer a more supportive environment for women due to more flexible structures and collaborative norms. In contrast, the corporate sector continues to lag behind, both in gender inclusion and in the diversity of innovation outcomes. Domain-level analysis shows that gender disparities are not uniform—some sectors such as biotechnology and green technology show greater inclusion and more distinctive forms of impact from women’s contributions, while others remain persistently exclusionary.
Limitations and future research
This study is subject to several limitations. First, while we use a large-scale patent dataset enriched with gender attribution, name-based gender identification techniques are imperfect and may underrepresent individuals with non-binary or international naming conventions. In addition, we utilized LLMs to assist in gender classification and assignee type identification. Although LLMs offer efficiency and contextual understanding, they are known to introduce potential biases or hallucinations, particularly when dealing with ambiguous or underrepresented groups, which may affect classification accuracy.
Second, while we control for technological fields and other conditions, unobserved factors (such as team dynamics, access to mentorship, or funding pathways) could also shape gendered outcomes in invention. Third, our analysis does not include inventor-level variables, such as career stage, prior patenting experience, or institutional affiliations. Incorporating such variables would allow for a more nuanced understanding of how gender disparities evolve over the course of individual careers and would help identify when and where interventions might be most impactful.
Moreover, while patents provide a useful lens into technological innovation, they are only one part of the broader innovation ecosystem. Many patents are linked to scientific projects and often arise from academic research. Future work could explore linking patents to their associated scientific publications to assess whether gender disparities differ between publishing and patenting processes within the same research trajectory, e.g., whether women who appear as authors on scientific papers are omitted from associated patents. Additionally, expanding this analysis to include international patent systems, startup formation, product commercialization, or institutional technology transfer practices could offer a more holistic and global perspective on gendered innovation dynamics.
We also note that our domain-level analysis focuses on five representative technological areas of strategic and societal importance. However, many other domains may exhibit different gender dynamics that merit further investigation. Future research should expand the analysis across a broader array of fields to determine how gender disparities manifest in less-studied or emerging technologies.
Future research could also incorporate qualitative approaches, such as interviews or ethnographic studies, to better understand how institutional cultures, organizational incentives, or informal norms perpetuate gender disparities in invention. Such mixed-methods approaches would be valuable for identifying targeted interventions that go beyond representation to promote equity in outcomes and recognition within innovation systems.
Policy implications
Reflecting on the implications of our findings, we interpret the persistent underrepresentation of women in patenting as, in part, a legacy of systemic historical barriers. These include structural gender discrimination embedded in both policy and practice, ranging from legal restrictions that once prevented women from owning property, including intellectual property, to broader institutional limitations. In the U.S., for instance, married women were not granted full rights to own and manage intellectual property until the passage of the Married Women’s Property Acts (Bedi, 1999; Khan, 1996). Combined with limited educational opportunities and sociopolitical exclusion, these historical constraints created a path-dependent disadvantage that continues to shape gender disparities in innovation.
While structural inequities persist, our findings also reveal several promising entry points for policy intervention. Certain domains, such as biotechnology, exhibit relatively balanced gender representation, suggesting that field-level dynamics and organizational contexts can moderate disparities. These patterns underscore the need for targeted policies that are sensitive to the conditions under which women are more likely to participate and thrive as inventors.
We identify multiple potential avenues for intervention. At a general level, increasing gender diversity in corporate R&D teams, promoting inclusive technology transfer practices in universities, and expanding mentorship and funding opportunities for underrepresented inventors may help reduce the gender gap. Policymakers and funding agencies should consider incentivizing gender-diverse collaborations and supporting research that addresses public needs. Innovation institutions, including patent offices, should assess whether current evaluation metrics, citation practices, and resource distribution mechanisms unintentionally reinforce gender disparities.
Field-specific strategies are also warranted. In green technologies, where women show strong engagement with scientific knowledge and public funding, targeted support for women-led sustainability projects could amplify their contributions. In biotechnology, where female inventorship is relatively high but citation rates are lower, policies that enhance translational research and academic-industry partnerships could improve the visibility and impact of women-led innovations. In artificial intelligence, where women remain underrepresented, early educational initiatives, inclusive funding programs, and equity-focused research agendas are essential. In civil engineering, where women inventors are fewer yet produce particularly high-quality patents, efforts to recruit, mentor, and amplify the work of female inventors could strengthen the innovation pipeline. Nanotechnology, where gender gaps appear less pronounced, should be monitored as a potential model for fostering gender inclusion.
Institutional context also plays a crucial role. Universities consistently demonstrate higher rates of female inventorship and more effective mobilization of public support. This highlights the value of inclusive academic environments, collaborative grant structures, and career development programs. These strategies should be sustained and expanded. In contrast, private corporations, where women are underrepresented and less likely to produce patents linked to public research, may benefit from internal reforms, such as fostering inclusive innovation cultures, ensuring gender-balanced leadership, and improving transparency in R&D resource allocation.
The gender disparity we observe in patenting is not unique to the United States but reflects a global pattern. For example, a study of 23 Ibero-American countries found that women’s participation in patent applications does not exceed 30% (Carvalho et al., 2020), closely aligning with our finding of 22.3% in the U.S. Similarly, a study of 27 European countries reported that only 7.7% of inventors are women, though the rates vary by country—for instance, Latvia, Slovenia, and Hungary show relatively higher levels of female participation, yet remain in the same general range as the U.S. (Martin, 2020). Research in China echoes these trends, but also shows that gender-diverse inventor teams tend to produce higher-quality patents, particularly in the case of radical innovations (Zhang et al., 2024; Zhang and Zong, 2023). These cross-national comparisons highlight that gender disparities in innovation are widespread and deeply rooted in structural and cultural contexts.
Ultimately, narrowing the gender gap in patenting is not only a matter of equity but also one of innovation quality and societal relevance. Encouraging women’s participation in invention can enrich the diversity of ideas, broaden the reach of technological solutions, and inspire future generations of innovators. Addressing both structural and cultural barriers is essential for building a more inclusive and effective innovation ecosystem.
Data availability
The data that support the findings of this study are openly available in Zenodo: Wang, J., & Maynard, A. (2025). Replication data for: Gender Disparity in U.S. Patenting [Data set]. Zenodo. https://doi.org/10.5281/zenodo.16929927.
Notes
According to the World Intellectual Property Indicators report by the WIPO (2024), the top countries leading global patent filings in 2023 were China (1.6 million), the United States (518,364), Japan (414,413), the Republic of Korea (287,954), and Germany (133,053).
As shown in our dataset, more than 90% of the patents granted at USPTO are utility patents. Utility patents cover novel and useful processes, machines, and compositions of matter, forms of technological innovation that are central to the study of invention and R&D. In contrast, design patents protect only ornamental aspects of products (e.g., shapes, surface designs) and are rarely used to indicate functional innovation or knowledge production. Plant patents, which apply to asexually reproduced plant varieties, account for a negligible share of total patenting and are highly domain-specific. As such, design and plant patents lack the technological scope, citation richness, and classification structure needed for generalizable innovation analysis (Griliches, 1990; Jaffe and Trajtenberg, 2002; Sampat, 2018).
WIPO’s classification scheme was developed based on the International Patent Classification (IPC) Technology Concordance Table (IPC/CE/41/5) (Schmoch, 2008) (see Appendix K).
For more details about this data release, see the PatentsView release notes: “Data update: 10/23/2024” https://patentsview.org/release-notes.
This exclusion criterion means that U.S. inventors who only collaborate with non-U.S. inventors are also excluded from our analysis.
In this study, we distinguish between inventors and distinct inventors (also referred to as unique inventors). An inventor is defined as a name listed on a single patent filing, while a distinct inventor refers to a disambiguated individual who may be associated with multiple patents. The dataset includes unique identifiers for distinct inventors, which were generated using a disambiguation method developed by Monath et al. (2021). This method employs a linear model that matches inventors based on features such as names and initials, overlapping co-inventors, and patent title similarity, the latter computed using the Sent2Vec sentence embedding model (Gupta et al., 2019; Monath et al., 2019). These identifiers enable us to compute inventor-level metrics such as patents per inventor.
The three-step procedure is labeled in gray as “Gender id. Step #1", “Gender id. Step #2", and “Gender id. Step #3" in Fig. 1.
A result of andy indicates that the name occurs with approximately equal frequency among male and female individuals, while unknown means the name is not found in the database.
Prior to applying GENDER-GUESSER, we performed a pre-processing step to clean the inventor first names. A visual inspection revealed that many first names provided by the USPTO included middle names or initials or multiple name components that could interfere with classification, for example, “Thomas Alva” instead of “Thomas” as the first name. Using regular expressions, we extracted only the first token in the first name field that did not end with a period. For example, from “John W.” we extracted “John” for evaluation by GENDER-GUESSER.
We tested both GPT-4o and GPT-4o-mini and found their performance to be comparable. GPT-4o-mini was selected for its efficiency and lower cost.
We used full names rather than first names alone for two key reasons. First, manual inspection of data samples revealed that, in some cases, possibly due to filing errors or cultural naming conventions (e.g., in Chinese, given names follow family names), first and last names were entered in the wrong order. Providing the full name gives the LLM an opportunity to detect and correct such inconsistencies. Second, full names offer greater cultural context. For example, the name “Juan” is typically male in Hispanic contexts but often female in Chinese contexts. Including the last name helps the LLM infer cultural origin more accurately, thereby improving gender prediction.
For the six cases of disagreement, we conducted manual validation using publicly available information, including search engines, Google Scholar, social media, company websites, geographic and organizational context, and patenting histories. Our heuristic review revealed that in two cases, the USPTO method appeared to have assigned the incorrect gender, while GPT-4o-mini correctly inferred it. In three other cases, GPT-4o-mini returned “undefined,” which we do not consider a misclassification. In the final case, the USPTO method inferred the inventor to be male, while GPT-4o-mini returned “undefined.” We were unable to identify the individual or confidently infer their gender through manual investigation, so we do not assess which method was correct in that instance. Nonetheless, the fact that GPT-4o-mini correctly classified two cases where the USPTO method made errors demonstrates that the LLM-assisted approach is not only comparable in accuracy but can, in some instances, outperform existing methods. This supports the robustness and practical value of using GPT-4o-mini for gender inference at scale in large patent datasets.
Manual inspection of the 16,375 inventors with unclassified gender revealed that a substantial portion have names likely originating from East Asian cultural and linguistic contexts, particularly Chinese and Japanese (e.g., Chengzhou Li, Zhitu Chen, Kanuza Tanaka). Other names suggest South or Southeast Asian (e.g., Hsin-Lei Yao, Guangyan Du) or Arabic backgrounds (e.g., Mounir Jabra Lebbad). These names are often culturally ambiguous, gender-neutral in transliteration, or underrepresented in Western name-gender datasets and large language models. As a result, the probability of missingness is associated with observable features such as name structure and cultural origin, indicating that the data is Missing at Random (MAR). Given the small proportion of missing cases (1.5%) and the absence of evidence suggesting systematic differences in key outcome variables, excluding these cases is methodologically justified. This approach is also consistent with established research standards, which consider missingness below 5% to be unlikely to introduce significant bias (Little and Rubin, 2020; Schafer, 1999).
This does not imply that all these organizations are located in the U.S. Foreign entities may own patents created solely by U.S. inventors. Since our analysis focuses on U.S.-invented patents—that is, patents developed by U.S.-based inventors—the geographic location of the assignee is not a determining factor.
In the U.S. patent system, if no assignee information is provided at the time of application, ownership of the patent defaults to the listed inventors. See https://www.uspto.gov/web/offices/pac/mpep/s325.html.
The CPC is a hierarchical system co-developed by the USPTO and the European Patent Office (EPO), with multiple levels of granularity: section (e.g., “A” for Human Necessities), class (2-digits, e.g., “G01” for measuring and testing), subclass (4-digits, e.g., “G01C” for measuring distances), and group/subgroup (full-digit codes). For example, “A21D 2/08” refers to a process for treating flour or dough by adding organic substances. “A21” represents the class of inventions about baking and edible doughs, “A21D” the subclass about treatment of flour or dough for baking, and “A21D 2/08” the full-digit code or the group representing treating flour or dough specifically by adding organic substances.
For example, U.S. Patent US9107383B2 (Tidy cat litter box) was assigned two CPC codes—A01K1/0107 (cat trays; dog urinals; toilets for pets) and A01K1/0114 (litter boxes with screens for separating excrement from litter), which are under the same subclass (A01K, animal husbandary), resulting in a scope count of one. In contrast, US10733430B2 (Toilet usage management system and cat toilet), a more complex cat toilet system with camera and cat identificatioin process, was assigned codes spanning four subclasses: A01K, G06K (graphical data reading), G06F (electric digital data processing), and G06V (image recognition), giving it a scope of four.
Both indicators were calculated using CPC assignments that include both the mandatory “inventive” codes, which reflect novel aspects of the invention, and the “additional” codes, which capture significant but non-novel technical content. Including both inventive and additional CPC codes provides a fuller representation of the technological components involved in a patent (USPTO, 2015).
The refinement classification is based on a global comparison across all U.S. utility patents granted between 1976 and 2021, regardless of inventor location. This ensures consistency in determining whether a component or combination is truly novel.
Our analysis of patents involving public support is limited to patents granted through 2017.
Disruption-related analyses in this study are limited to patents granted between 1976 and 2010 due to data availability.
WIPO categorizes technologies into thirty-five fields, which are grouped into five sectors, including chemistry, electrical engineering, instruments, mechanical engineering, and other fields (Schmoch, 2008). The categorization can be found in Table K.1 in Appendix K. The field variable is categorical and was one-hot encoded into thirty-four binary variables, each taking a value of 1 or 0. One-hot encoding allows us to include categorical information in the regression model while avoiding multicollinearity by omitting one category (Kutner et al., 2005).
As described in Section “Gender Characteristics at the Patent Level”, Gender. Collab. Type is a categorical variable with three possible values: all-women, all-men, and mixed-gender teams. We applied one-hot encoding to this variable, resulting in two binary variables: Only. Females and Only. Males, with mixed-gender teams serving as the reference category.
To test the robustness of our study, in addition to Model #1 and Model #2, we estimated an alternative model in which the independent variable is the number of female inventors on the patent team. We implemented two versions of this model—with and without field controls. The model specifications and corresponding regression results are provided in Appendix G.
In this study, only citations to valid U.S. patent numbers were included.
In the case of forward citations, we used L1-regularized (Lasso) negative binomial regression to mitigate sparsity and enhance model performance (Tibshirani, 1996). Since regularization is sensitive to variable scaling, all non-dummy variables were standardized prior to model fitting. The resulting coefficients were then re-scaled (inverse-standardized) to reflect the original units for interpretability. Because a substantial proportion of patents receive zero forward citations (52.6% within 5 years of grant and 23.7% with no citations at all as of 2021), we also tested zero-inflated models, including zero-inflated Poisson and zero-inflated negative binomial regressions. Final model selection was based on convergence criteria and model fit, evaluated using the Akaike Information Criterion (AIC) (Stoica and Selen, 2004; Taddy, 2019).
We identified civil engineering patents as those whose primary WIPO technology field is classified as 35 (Civil Engineering) (Schmoch, 2008), resulting in a total of 116,127 patents in this category.
We identified AI patents using the AI Patent Dataset (AIPD) released by the USPTO in 2021 and updated in 2023 (Giczy et al., 2021; Pairolero et al., 2024; Toole et al., 2020b). This dataset applies machine learning to assign eight prediction scores (ranging from 0 to 1) to each patent, estimating the likelihood that it pertains to one of eight AI component technologies: knowledge processing, speech, AI hardware, evolutionary computation, natural language processing, machine learning, vision, and planning and control. Toole et al. (2020b) identified AI patents as those with a prediction score above 0.5 for any of the components. In this study, we adopt a more conservative threshold and define AI patents as those with at least one prediction score equal to or greater than 0.99, indicating a 99% likelihood of being AI-related. Using this criterion, we identified a total of 334,550 AI patents.
We identified nanotechnology patents as those whose primary WIPO technology field is classified as 22 (Micro-structure and nano-technology) (Schmoch, 2008), resulting in a total of 3836 nanotechnology patents.
We identified biotechnology patents as those whose primary WIPO technology field is classified as 15 (Biotechnology) (Schmoch, 2008), resulting in a total of 68,255 nanotechnology patents.
We identified green technology patents using the CPC code Y02, which covers technologies related to climate change mitigation and adaptation (https://www.uspto.gov/web/patents/classification/cpc/html/cpc-Y.html#Y02), resulting in a total of 242,083 patents.
The summary statistics of numerical variables conditioned on With. Women is shown in Table D.1 in Appendix D.
Full regression results of Models #1-1 and #1-2 on the full dataset are provided in Tables E.1 and E.2 in Appendix E, respectively.
The decline in the 2010s in Fig. 3D is an artifact of data limitations. The public support dataset from Fleming et al. (2019a) includes only granted patents through 2017. Since our analysis is based on filing year, and patents typically take 3–7 years to be granted, many patents filed between 2010 and 2017 are excluded from this data.
Across all five representative domains, all-female teams show a greater reduction in public support utilization than all-male teams, relative to mixed-gender teams. The only exception is green technology, where women’s presence still improves engagement with public support: all-female teams show a 10.2% reduction, while all-male teams show a 21.6% reduction, compared to mixed-gender teams.
To further understand how gender composition influences the type of public support, we conducted three sets of multinomial logistic regressions (see Appendix H). First, we used a binary variable With.Women and found that patents involving women were significantly more likely to directly rely on public funding, rather than indirectly or not at all. Second, we used gender collaboration type and found that both all-women and all-men teams were less likely to directly rely on government support compared to mixed-gender teams. Third, we included female and male inventor counts as predictors and found that each additional female inventor significantly increased the odds of direct reliance on public funding, while the effect of additional male inventors was much smaller and statistically weaker. These consistent patterns suggest that greater female representation in inventor teams is associated with stronger engagement with direct government funding.
In corporate-owned patents, the share of public-reliant patents is 21.1% for all-male teams, 20.3% for all-female teams, and 26.6% for mixed-gender teams. Notably, the share for all-female teams is slightly lower than that of all-male teams. In contrast, among university-owned patents, the corresponding percentages are much higher: 66.5% for all-male teams, 69.7% for all-female teams, and 74.1% for mixed-gender teams. Here, both all-female and mixed-gender teams exhibit substantially higher reliance on public support compared to all-male teams.
However, this pattern shifts when we control for team size. Among smaller teams, all-female teams tend to exhibit broader scope. For example, among three-inventor teams, patents from all-female teams involve an average of 7.6 full-digit CPC codes, compared to 7.2 in all-male teams (Figure L.3). Nonetheless, the number of large all-female teams is very small; for instance, among 46,214 patents with seven inventors, only 7 are from all-female teams, compared to 21,312 from all-male teams and 24,895 from mixed-gender teams.
Our data show that as team size increases, the likelihood of a patent covering more technical components rises substantially. For example, the share of patents with more than three full-digit CPC codes increases from 51.1% among solo inventors to 91.5% among teams of twenty inventors (Figure L.1).
All-female teams reduce disruption by only 0.004 compared to mixed-gender teams, while all-male teams show a larger reduction of 0.012.
Among the organizations with over 80% female inventors, a large share are concentrated in the beauty, fashion, and wellness sectors. These include manufacturers of cosmetics, skincare, jewelry, and personal care products. This pattern supports prior research suggesting that women are more likely to invent in domains aligned with their lived experiences and consumer needs.
Figures J.1 and J.2 in Appendix J present detailed time series for the percentage of women-invented patents and seven selected patent indicators across sectors and domains.
For example, among biotechnology patents filed in 2020, despite the relatively high proportion of female scientists in the field, only 4.2% were created by all-women inventor teams, whereas all-men teams accounted for 47.4%. Gender-collaborative teams, including both male and female inventors, made up the remaining 48.4%. This means that male inventors contributed to 95.7% of biotechnology patents that year, while female inventors contributed to just 51.5%. The stark contrast highlights how even in fields with better gender representation in the workforce, women remain underrepresented as patent creators.
The four women among the top 100 AI inventors are: (1) Marta Karczewicz, with 388 AI patents, ranked 6th overall, and Vice President at Qualcomm; (2) Lisa Seacat DeLuca, with 353 AI patents, ranked 9th, currently Vice President at Unstoppable Domains and formerly an engineer and director at IBM; (3) Valentina Salapura, with 237 AI patents, ranked 37th, Principal Engineer at Google; and (4) Donna K. Bryon, with 173 AI patents, ranked 87th, a software engineer at IBM.
References
35 U.S. Code § 102 Conditions for patentability: novelty
Arthur WB (2009) The nature of technology: what it is and how it evolves. Free Press, New York
Arun SV, Arun TG, Borooah VK (2004) The effect of career breaks on the working lives of women. Feminist Econ 10(1):65–84
Barbieri N, Marzucchi A, Rizzo U (2020) Knowledge sources and impacts on subsequent inventions: do green technologies differ from non-green ones? Res Policy 49(2):103901
Bedi J (1999) Exploring the history of women inventors
Bell A, Chetty R, Jaravel X, Petkova N, Van Reenen J (2019) Who Becomes an Inventor in America? The Importance of Exposure to Innovation. Q J Econ 134(2):647–713
Breitzman A, Thomas P (2015a) The Emerging Clusters Model: A tool for identifying emerging technologies across multiple patent systems. Res Policy 44(1):195–205
Breitzman A, Thomas P (2015b) Inventor team size as a predictor of the future citation impact of patents. Scientometrics 103(2):631–647. Place: Dordrecht Publisher: Springer Netherlands
Buse K, Bilimoria D, Perelli S (2013) Why they stay: women persisting in US engineering careers. Career Dev Int 18(2):139–154. Publisher: Emerald Group Publishing Limited
Carvalho DS, Bares L, Silva K (2020) The Gender Patenting Gap: A Study on the Iberoamerican Countries. J Data Inf Sci 5(3):116–128
Ceci SJ, Williams WM (2010) Sex Differences in Math-Intensive Fields. Curr Direc Psychol Sci 19(5):275–279. SAGE Publications Inc
Cheryan S, Ziegler SA, Montoya AK, Jiang L (2017) Why are some STEM fields more gender balanced than others? Psychol Bull 143(1):1–35. Place: United States Publisher: American Psychological Association
Chu S (2024) Are women greener? Board gender diversity and corporate green technology innovation in China. Int Rev Econ Financ 93:1001–1020
Comedy YL, Dougherty EL (2018) Breaking barriers: female inventors blazing a path forward. Technol Innov 19(4):751–758. Place: Tampa Publisher: National Academy of Inventors
Couch S, Estabrooks LB, Skukauskaite A (2018) Addressing the gender gap among patent holders through invention education policies. Technol Innov 19(4):735–749. Tampa Publisher: National Academy of Inventors
Dal Molin M, Leggerini C (2024) The Impact of R&D Team Gender Diversity on Green Innovation in Europe
Diamond SJ, Thomas CR, Desai S, Holliday EB, Jagsi R, Schmitt C (2016) Gender differences in publication productivity, academic rank and career duration among U.S. Academic Gastroenterology Faculty. Acad Med 91(8):1158–1163
Ding WW, Murray F, Stuart TE (2006) Gender differences in patenting in the academic life sciences. Science 313(5787):665–667. American Association for the Advancement of Science
Eloundou T, Manning S, Mishkin P, Rock D (2024) GPTs are GPTs: labor market impact potential of LLMs. Science 384(6702):1306–1308. American Association for the Advancement of Science
Eric S, Work B (2021) Final Report: National Security Commission on Artificial Intelligence. Technical report
Feng C, Wang H, Lu N, Chen T, He H, Lu Y (2014) Log-transformation and its implications for data analysis. Shanghai Arch Psychiatry 26(2):105–109
Ferber MA, Brün M (2011) The gender gap in citations: does it persist? Feminist Econ 17(1):151–158
Fleming L, Sorenson O (2004) Science as a map in technological search. Strateg Manag J 25(8-9):909–928. https://onlinelibrary.wiley.com/doi/pdf/10.1002/smj.384
Fleming L, Greene H, Li G, Marx M, Yao D (2019a) Government-funded research increasingly fuels innovation. Science 364(6446):1139–1141. United States, The American Association for the Advancement of Science
Fleming L, Green H, Li G-C, Marx M and Yao D (2019b) Replication Data for: Government-funded research increasingly fuels innovation
Funk RJ, Owen-Smith J (2017) A dynamic network measure of technological change. Manag Sci 63(3):791–817. INFORMS
Funk RJ, Park M, Leahey E (2022) Papers and patents are becoming less disruptive over time (1.0)
Gecko Z (2009) Gender-verification by forename (cmd-line-tool & db) - utilities
Ghiasi G, Larivière V, Sugimoto CR (2015) On the compliance of women engineers with a gendered scientific system. PLOS ONE 10(12):e0145931. Public Library of Science
Giczy AV, Pairolero NA, Toole AA (2021) Identifying artificial intelligence (AI) invention: a novel AI patent dataset. J Technol Transf 47(2):476–505. Place: New York Publisher: Springer US
Giczy AV, Pairolero NA, Toole AA (2024) Discovering value: women’s participation in university and commercial AI invention. Nat Biotechnol 42(1):26–29
Giuri P, Grimaldi R, Kochenkova A, Munari F, Toschi L (2020) The effects of university-level policies on women’s participation in academic patenting in Italy. J Technol Transf 45(1):122–150
Griffin D, Li K, Xu T (2021) Board gender diversity and corporate innovation: international evidence. J Financ Quant Anal 56(1):123–154. New York, USA, Cambridge University Press
Griliches Z (1990) Patent statistics as economic indicators: a survey
Gupta P, Pagliardini M, Jaggi M (2019) Better word embeddings by disentangling contextual n-gram information. Proceedings of the conference of the north. Minneapolis, Minnesota Publisher: Association for Computational Linguistics, pp 933–939
Hall BH, Jaffe AB, Trajtenberg M (2001)The NBER Patent Citation Data File: Lessons, Insights and Methodological Tools. Technical Report w8498, National Bureau of Economic Research
Harhoff D, Scherer FM, Vopel K (2003) Citations, family size, opposition and the value of patent rights. Res Policy 32(8):1343–1363
Hasan MA, Spangler WS, Griffin T, Alba A (2009) COA: finding novel patents through text analysis. In Proceedings 15th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’09, New York, NY, USA. Association for Computing Machinery, pp 1175–1184
He X, Jiang S (2019) Does gender diversity matter for green innovation? Bus Strategy Environ 28(7):1341–1356. https://onlinelibrary.wiley.com/doi/pdf/10.1002/bse.2319
Head MG, Fitchett JR, Cooke MK, Wurie FB, Atun R (2013) Differences in research funding for women scientists: a systematic comparison of UK investments in global infectious disease research during 1997-2010. BMJ Open 3(12):e003362. British Medical Journal Publishing Group Section: Infectious diseases
Holliday EB, Jagsi R, Wilson LD, Choi M, Thomas CR, Fuller CD (2014) Gender differences in publication productivity, academic position, career duration and funding among U.S. Academic Radiation Oncology Faculty. Acad Med: J Assoc Am Med Coll 89(5):767–773
Huang R, Cortese A, Chamness G, Inacker A, Mataya Y, Roche C et al. The Global AI Talent Tracker 2.0. (2023)
Huang V, Finch S, Patrick C (2022) Patents and gender: a big data analysis of 15 years of Australian patent applications. Univ N South Wales Law J 45(3):927–962. Wm. W. Gaunt and Sons Inc
Hulla J, Sahu S, Hayes A (2015) Nanotechnology: history and future. Hum Exp Toxicol 34(12):1318–1321. SAGE Publications Ltd STM
Jaffe AB, Trajtenberg M (2002) Patents, citations and innovations: a window on the knowledge economy. MIT Pr, Cambridge, Mass
Jaffe AB, Trajtenberg M, Henderson R (1993) Geographic localization of knowledge spillovers as evidenced by patent citations. Q J Econ 108(3):577–598. Place: Cambridge, Mass. [etc.] Publisher: MIT Press
Jaffe AB, Trajtenberg M, Fogarty MS (2000) Knowledge spillovers and patent citations: evidence from a survey of inventors. Am Econ Rev 90(2):215–218. American Economic Association
Jensen K, Kovács B, Sorenson O (2018) Gender differences in obtaining and maintaining patent rights. Nat Biotechnol 36(4):307–309. United States, Nature Publishing Group
Johnson PA (2013) State of women in civil engineering in the United States and the role of ASCE. J Profess Issues Eng Educ Pract 139(4):275–280. American Society of Civil Engineers
Khan BZ (1996) Married women’s property laws and female commercial activity: evidence from United States Patent Records, 1790-1895. J Econ Hist 56(2):356–388
Kim D, Cerigo DB, Jeong H, Youn H (2016) Technological novelty profile and invention’s future impact. EPJ Data Sci 5(1):1–15. SpringerOpen
Kim S, Chang D, Wang J (2025) Persona alchemy: designing, evaluating and implementing psychologically-grounded LLM agents for diverse stakeholder representation. arXiv:2505.18351
Koning R, Samila S, Ferguson J-P (2020) Inventor gender and the direction of invention. AEA Pap Proc 110:250–254
Koning R, Samila S, Ferguson J-P (2021) Who do we invent for? Patents by women focus more on women’s health, but few women get to invent. Science 372(6548):1345–1348. Washington, American Association for the Advancement of Science
Kutner MH, Nachtsheim C, Neter J, Li W (2005) Applied linear statistical models. McGraw-Hill Irwin, Columbus, OH
Kyvik S, Teigen M (1996) Child care, research collaboration and gender differences in scientific productivity. Sci Technol Hum Values 21(1):54–71. SAGE Publications Inc
Lai JC (2020) Patents and gender: a contextual analysis. Queen Mary J Intellect Prop 10(3):283–305. Place: Cheltenham Publisher: Edward Elgar Publishing
Lanjouw JO, Schankerman M (2004) Patent Quality and Research Productivity: Measuring Innovation with Multiple Indicators. Econ J 114(495):441–465. Oxford, UK, Blackwell Publishing Ltd
Lerner J (1994) The importance of patent scope: an empirical analysis. RAND J Econ 25(2):319–333. [RAND Corporation, Wiley]
Ley TJ, Hamilton BH (2008) The gender gap in NIH grant applications. Science 322(5907):1472–1474. American Association for the Advancement of Science
Lin H, Zhang Y (2025) The risks of using large language models for text annotation in social science research. arXiv:2503.22040 [cs]
Little RJA, Rubin DB (2020) Statistical analysis with missing data. Hoboken, NJ, Wiley
Manesh SN, Choi JO, Shrestha BK, Lim J, Shrestha PP (2020) Spatial analysis of the gender wage gap in architecture, civil engineering and construction occupations in the United States. J Manag Eng 36(4):04020023. American Society of Civil Engineers
Manning BS, Zhu K, Horton JJ (2024) Automated social science: language models as scientist and subjects
Marcowitz-Bitton M, Kaplan Y, Morris EM (2020) Unregistered patents & gender equality. Harv J Law Gend 43(1):47–. Harvard Law School
Martin AF (2020) The patenting gender gap in Europe. Ph.D. thesis, Norwegian School of Economics
Marx M (2024) Reliance on science
Marx M, Fuegi A (2020) Reliance on science: worldwide front-page patent citations to scientific articles. Strateg Manag J 41(9):1572–1594. https://onlinelibrary.wiley.com/doi/pdf/10.1002/smj.3145
Marx M, Fuegi A (2022) Reliance on science by inventors: hybrid extraction of in-text patent-to-article citations. J Econ Manag Strategy 31(2):369–392. https://onlinelibrary.wiley.com/doi/pdf/10.1111/jems.12455
Maurer JA, Choi D, Hur H (2021) Building a diverse engineering and construction industry: public and private sector retention of women in the civil engineering workforce. J Manag Eng 37(4):04021028. American Society of Civil Engineers
McMillan GS (2009) Gender differences in patenting activity: an examination of the US biotechnology industry. Scientometrics 80(3):683–691. Dordrecht, Springer Netherlands
Meyer M (2000) Does science push technology? Patents citing scientific literature. Res Policy 29(3):409–434
Miah MSU, Kabir MM, Sarwar TB, Safran M, Alfarhood S, Mridha MF (2024) A multimodal approach to cross-lingual sentiment analysis with ensemble of transformer and LLM. Sci Rep 14(1):9603. Nature Publishing Group
Monath N, Kobren A, Krishnamurthy A, Glass MR and McCallum A (2019) Scalable hierarchical clustering with tree grafting. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, KDD’19, New York, NY, USA. Association for Computing Machinery, pp 1438–1448
Monath N, Jones C, Madhavan S (2021) PatentsView: disambiguating inventors, assignees and locations. Technical report, American Institutes for Research
Mukherjee S, Uzzi B, Jones B, Stringer M (2016) A new method for identifying recombinations of existing knowledge associated with high impact innovation. J Prod Innov Manag 33(2):224–236
Narin F, Noma E (1985) Is technology becoming science? Scientometrics 7(3):369–381
National Center for Science and Engineering Statistics (NCSES) (2023) Diversity and STEM: women, minorities and persons with disabilities 2023. Technical Report Special Report NSF 23-315, National Science Foundation, Alexandria, VA
Novelli E (2015) An examination of the antecedents and implications of patent scope. Res Policy 44(2):493–507
Pairolero NA, Giczy AV, Torres G, Erana TI, Finlayson MA, Toole AA (2024) The Artificial Intelligence Patent Dataset (AIPD) 2023 update
Park M, Leahey E, Funk RJ (2023) Papers and patents are becoming less disruptive over time. Nature 613(7942):138–144. Number: 7942 Nature Publishing Group
Preston AE (1994) Why have all the women gone? A study of exit of women from the science and engineering professions. Am Econom Rev 84(5):1446–1462. American Economic Association
Pérez IS (2016) gender-guesser: get the gender from first name
Pursell C (1981) Women inventors in America. Technol Cult 22(3):545–549. Detroit, Michigan, University of Chicago Press
Rhodes R (2012) Hedy’s Folly: the life and breakthrough inventions of Hedy Lamarr, the most beautiful woman in the world. Vintage, New York, reprint edition
RissleR LJ, Hale KL, Joffe NR, Caruso NM (2020) Gender differences in grant submissions across science and engineering fields at the NSF. BioScience 70(9):814–820
Ross MB, Glennon BM, Murciano-Goroff R, Berkes EG, Weinberg BA, Lane JI (2022) Women are credited less in science than men. Nature 608(7921):135–145
Sampat BN (2018) A survey of empirical evidence on patents and innovation
Schafer JL (1999) Multiple imputation: a primer. Stat Methods Med Res 8(1):3–15
Schmoch U (2008) Concept of a technology classification for country comparisons. Technical Report IPC/CE/41/5, WIPO, Karlsruhe, Germany
Schumpeter JA (1934) The theory of economic development: an inquiry into profits, capital, credit, interest and the business cycle. Harvard University Press, Cambridge, MA
Schuster WM, Goodman J (2025) Gender inventorship equity in patent prosecution. Sci Rep. 15(1):2070. Nature Publishing Group
Schuster WM, Davis RE, Schley K, Ravenscraft J (2020) An Empirical Study of Patent Grant Rates as a Function of Race and Gender. Am Bus law J 57(2):281–319. Place: Hoboken, USA Publisher: John Wiley & Sons, Inc
Seymore SB (2013) Making Patents Useful. Minn Law Rev 98:1046
Spelmen VS, Porkodi R (2018) A review on handling imbalanced data. In International Conference on Current Trends towards Converging Technologies (ICCTCT), pp 1–11
Squicciarini M, Dernis H, Criscuolo C (2013) Measuring patent quality: indicators of technological and economic value. Technical report, OECD, Paris
Stoica P, Selen Y (2004) Model-order selection: a review of information criterion rules. IEEE Signal Process Mag 21(4):36–47
Strumsky D, Lobo J (2015) Identifying the sources of technological novelty in the process of invention. Res Policy 44(8):1445–1461
Strumsky D, Lobo J, Leeuw SVD (2012) Using patent technology codes to study technological change. Econ Innov N Technol 21(3):267–286
Su H-N (2022) How do patent-based measures inform patent collateral? A holistic analysis on all USPTO patents between 1986 and 2016. IEEE Trans Eng Manag 69(6):1–11
Subramani G, Saksena M (2024) Untapped potential: investigating gender disparities in patent citations
Sugimoto CR, Ni C, West JD, Larivière V (2015) The academic advantage: gender disparities in patenting. PLOS ONE 10(5):e0128000. Public Library of Science
Taddy M (2019) Business data science: combining machine learning and economics to optimize, automate and accelerate business decisions
Tahmooresnejad L, Turkina E (2022) Female inventors over time: Factors affecting female Inventors’ innovation performance. J Informetr 16(1):101256
Thompson MJ (2016) Measuring patent quality: a claim and search report approach. World Pat Inf 45(June 2016):47–54. Elsevier Ltd
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B, Methodol 58(1):267–288
Tong X, Frame JD (1994) Measuring national technological performance with patent claims data. Res Policy 23(2):133–141
Toole A, Breschi S, Miguelez E, Myers A, Ferrucci E, Sterzi V et al. (2019) Progress and potential: a profile of women inventors on U.S. Patents. Technical Report 2, USPTO
Toole A, Saksena MJ, deGrazia CAW, Black KP, Lissoni F, Miguelez E et al. (2020a) Progress and potential: 2020 update on U.S. women inventor-patentees. Technical Report 4, USPTO
Toole A, Jones C, Madhavan, S (2021) PatentsView: An Open Data Platform to Advance Science and Technology Policy. SSRN Scholarly Paper ID 3874213, Social Science Research Network, Rochester, NY
Toole AA, Pairolero NA, Giczy A, Forman JQ, Pulliam C, Such M et al. (2020b) Inventing AI: tracing the diffusion of artificial intelligence with U.S. patents. Technical report, USPTO, Alexandria, VA
Trajtenberg M, Jaffe AB (2002) Patents, citations and innovations: a window on the knowledge economy
Törnberg P (2024) Large language models outperform expert coders and supervised classifiers at annotating political social media messages. Soc Sci Comput Rev https://doi.org/10.1177/08944393241286471
Tussen RJW, Buter RK, van Leeuwen TN (2000) Technological relevance of science: an assessment of citation linkages between patents and research papers. Scientometrics 47(2):389–412
Uhl XM (2018) The most influential female inventors. Buffalo, NY, The Rosen Publishing Group, Inc
U.S. Census Bureau (2023) U.S. Census Bureau QuickFacts: United States
USPTO 905 Cooperative Patent Classification [R-07.2015] (2015). In Manual of patent examining procedure
Uzzi B, Mukherjee S, Stringer M, Jones B (2013) Atypical combinations and scientific impact. Science 342(6157):468–472
Verhoeven D, Bakker J, Veugelers R (2016) Measuring technological novelty with patent-based indicators. Res Policy 45(3):707–723
Wagner A, Rosen W (2014) Spaces of the possible: universal Darwinism and the wall between technological and biological innovation. J R Soc interface 11(97):20131190–20131190
Wang J, Lobo J (2024) Extensive growth of inventions: evidence from U.S. patenting. Technol Forecast Soc Change 207:123586
Wang J, Lobo J, Shutters ST, Strumsky D (2024) Fueling a net-zero future: the influence of government-funded research on climate change mitigation inventions. Environ Innov Soc Transit 51:100836
Wang J, Kiran E, Aurora S, Simeone M, Lobo J (2025) ChatGPT on ChatGPT: an exploratory analysis of its performance in the public sector workplace. Digit Gov Res Pract 6(1):1–28. Just Accepted
Whittington KB, Smith-Doerr L (2005) Gender and commercial science: women’s patenting in the life sciences. J Technol Transf 30(4):355–370
Whittington KB, Smith-Doerr L (2008) Women inventors in context: disparities in patenting across academia and industry. Gend Soc 22(2):194–218
WIPO World Intellectual Property Indicators 2022 (2021) Technical report, World Intellectual Property Organization (WIPO), Geneva, Switzerland
WIPO (2023) WIPO IP Statistics Data Center
WIPO (2024) World intellectual property indicators technical report, World Intellectual Property Organization, Geneva, Switzerland. ISBN: 9789280536911 ISSN: 2709-5207
World Bank (2025) GDP (current US$): World Bank national accounts data and OECD National Accounts data files
Wu L, Wang D, Evans JA (2019) Large teams develop and small teams disrupt science and technology. Nature 566(7744):378–382
Xie Y, Shauman KA (2005) Women in science: career processes and outcomes. Harvard University Press, Cambridge, MA
Yang Y, Zha K, Chen Y, Wang H, Katabi D (2021) Delving into deep imbalanced regression. In Proceedings of the 38th international conference on machine learning, pp 11842–11851
Youn H, Strumsky D, Bettencourt LMA, Lobo J (2015) Invention as a combinatorial process: evidence from US patents. J R Soc Interface 12(106):20150272
Zhang X, Chen S, Zheng W (2024) Gender differences, academic patenting and tenure-track reform in China: evidence from life sciences at elite universities. PLOS ONE 19(7):e0307165. Public Library of Science
Zhang Z, Zong Q (2023) Gender diversity and patent quality: evidence from Chinese patent data. Int Stud Econ 18(4):430–453. https://onlinelibrary.wiley.com/doi/pdf/10.1002/ise3.54
Acknowledgements
We acknowledge the AI Innovation Challenge at Arizona State University, in collaboration with OpenAI, for providing the opportunity to explore generative AI technologies. We also thank the OpenAI Researcher Access Program for granting us access to their API tools, which were instrumental in conducting this study.
Author information
Authors and Affiliations
Contributions
Jieshu Wang and Andrew Maynard contributed to the conceptualization and investigation of the study. Jieshu Wang was responsible for data acquisition and curation, formal analysis, methodology design, visualization, and drafting the original manuscript. Andrew Maynard provided supervision throughout the project. Both Jieshu Wang and Andrew Maynard contributed to validation of the analysis and to reviewing and editing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
Ethical approval was not required for this study because it did not involve any intervention, experimentation, or interaction with human participants.
Informed consent
Informed consent was not required as the study did not involve direct participation of human subjects.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, J., Maynard, A. Gender disparity in U.S. patenting. Humanit Soc Sci Commun 12, 1730 (2025). https://doi.org/10.1057/s41599-025-06038-6
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1057/s41599-025-06038-6








