Abstract
Expanded potential stem cells (EPSCs) represent a distinct and developmentally primitive stem cell population characterized by their broad developmental potential, which encompasses both embryonic and extra-embryonic lineages. In this study, we employed a polycistronic cassette to directly reprogram human fibroblasts into induced Expanded Potential Stem Cells (iEPSCs). Substituting SOX2 with engineered SOX17 transcription factors resulted in an approximately five-fold increase in the average yield of iEPSC colonies, while maintaining the molecular and functional integrity of the resulting clonal lines. Notably, under feeder-free conditions, SOX2 occasionally failed to reprogram and yielded inconsistent colony numbers, whereas engineered SOX17 and miniaturized SOX17 reproducibly produced feeder-free iEPSCs. In summary, the use of engineered SOX17 enables efficient and robust reprogramming of human fibroblasts into EPSCs, allowing for modeling of early human pre-implantation development, investigating placental disorders, and expanding the toolkit for drug development with a versatile model of pluripotent stem cells that exhibit broader developmental capabilities.
Similar content being viewed by others
Data availability
All plasmids generated in this study are available in Addgene or can be obtained from the lead contact upon request with a completed Materials Transfer Agreement. The RNA-sequencing data were deposited with the ArrayExpress at https://www.ebi.ac.uk/biostudies/arrayexpress under the accession number E-MTAB-16152. The processed datasets of RNA-sequencing generated in this study are also available in Supplementary Data 3. All previously published RNA-seq datasets used in our analyses are publicly available1,9,20, and the corresponding accession numbers are: ArrayExpress accession E-MTAB-7253, specifically run ERR3153919, ERR3153920, ERR3153917, ERR3153918; SRA accession number SRP115256, specifically run SRR6001369, SRR6001370, SRR6024552, SRR6026885, SRR6049361, SRR6049358, SRR6024552, SRR6026885, SRR6007374, SRR6007372, SRR6036267, SRR6037375; GEO accession numbers GSE30567, specifically run SRR307903, SRR307904. The source data underlying the graphs in the main figures and Supplementary Figs. can be found in Supplementary Data 2. We have uploaded our step-by-step protocol for iEPSC reprogramming at protocol.io with https://doi.org/10.17504/protocols.io.6qpvrynrzgmk/v1. All other data supporting the findings of this study are available from the corresponding author upon reasonable request.
Code availability
The scripts used in this study for bulk RNA‑sequencing and reprogramming data processing are available upon request.
References
Liu, X. et al. Comprehensive characterization of distinct states of human naive pluripotency generated by reprogramming. Nat. Methods 14, 1055–1062 (2017).
Takahashi, K. et al. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131, 861–872 (2007).
Yu, J. et al. Induced pluripotent stem cell lines derived from human somatic cells. Science 318, 1917–1920 (2007).
Bayerl, J. et al. Principles of signaling pathway modulation for enhancing human naive pluripotency induction. Cell Stem Cell 28, 1549–1565.e1512 (2021).
Gafni, O. et al. Derivation of novel human ground state naive pluripotent stem cells. Nature 504, 282–286 (2013).
Guo, G. et al. Epigenetic resetting of human pluripotency. Development 144, 2748–2763 (2017).
Di Stefano, B. et al. Reduced MEK inhibition preserves genomic stability in naive human embryonic stem cells. Nat. Methods 15, 732–740 (2018).
Takashima, Y. et al. Resetting transcription factor control circuitry toward ground-state pluripotency in human. Cell 158, 1254–1269 (2014).
Gao, X. et al. Establishment of porcine and human expanded potential stem cells. Nat. Cell Biol. 21, 687–699 (2019).
Ruan, D. et al. Establishment of human expanded potential stem cell lines via preimplantation embryo cultivation and somatic cell reprogramming. Nat. Protoc. 20, 2698–2734 (2025).
Veerapandian, V. et al. Directed evolution of reprogramming factors by cell selection and sequencing. Stem Cell Rep. 11, 593–606 (2018).
Tan, D. S. et al. Directed evolution of an enhanced POU reprogramming factor for cell fate engineering. Mol. Biol. Evol. 38, 2854–2868 (2021).
Jauch, R. et al. Conversion of Sox17 into a pluripotency reprogramming factor by reengineering its association with Oct4 on DNA. Stem Cells 29, 940–951 (2011).
Hu, H. et al. Evaluation of the determinants for improved pluripotency induction and maintenance by engineered SOX17. Nucleic Acids Res. 51, 8934–8956 (2023).
Ho, S. Y. et al. An acidic residue within the OCT4 dimerization interface of SOX17 is necessary and sufficient to overcome its pluripotency-inducing activity. Stem Cell Rep. 20, https://doi.org/10.1016/j.stemcr.2025.102398 (2025).
Weng, M. et al. An engineered Sox17 induces somatic to neural stem cell fate transitions independently from pluripotency reprogramming. Sci. Adv. 9, eadh2501 (2023).
Aksoy, I. et al. Sox transcription factors require selective interactions with Oct4 and specific transactivation functions to mediate reprogramming. Stem Cells 31, 2632–2646 (2013).
Okae, H. et al. Derivation of human trophoblast stem cells. Cell Stem Cell 22, 50–63.e56 (2018).
Chen, A. C. H. et al. Expanded potential stem cells from human embryos have an open chromatin configuration with enhanced trophoblast differentiation ability. Adv. Sci. 10, 2204797 (2023).
Tilgner, H. et al. Deep sequencing of subcellular RNA fractions shows splicing to be predominantly co-transcriptional in the human genome but inefficient for lncRNAs. Genome Res. 22, 1616–1625 (2012).
Dong, C. et al. Derivation of trophoblast stem cells from naïve human pluripotent stem cells. eLife 9, e52504 (2020).
Martin, M. J., Muotri, A., Gage, F. & Varki, A. Human embryonic stem cells express an immunogenic nonhuman sialic acid. Nat. Med. 11, 228–232 (2005).
Villa-Diaz, L. G., Ross, A. M., Lahann, J. & Krebsbach, P. H. Concise review: the evolution of human pluripotent stem cell culture: from feeder cells to synthetic coatings. Stem Cells 31, 1–7 (2012).
Jo, S.-Y., Kim, E. & Kim, S. Impact of mouse contamination in genomic profiling of patient-derived models and best practice for robust analysis. Genome Biol. 20, 231 (2019).
Cunningham, F. et al. Ensembl 2022. Nucleic Acids Res. 50, D988–D995 (2021).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinforma. 12, 323 (2011).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Hutchins, A. P., Jauch, R., Dyla, M. & Miranda-Saavedra, D. glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data. Cell Regenerat. 3, https://doi.org/10.1186/2045-9769-3-1 (2014).
Risso, D., Schwartz, K., Sherlock, G. & Dudoit, S. GC-content normalization for RNA-seq data. BMC Bioinforma. 12, 480 (2011).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Acknowledgements
This study was supported by the InnoHK initiative of the Innovation and Technology Commission of the Hong Kong Special Administrative Region Government. This research was funded by the Research Grants Council of Hong Kong General Research Fund [RGC/GRF, 17106622, 17117925, and 17101120 to R.J.]. We acknowledge support and encouragement from Revive & Restore and the Applied Stem Cell Conservation Fund. We gratefully acknowledge Pik Fan Wong and the CTSCB administrative team for their support, and Yiyi Xuan for providing the SNL76/7 and SNL76/7-P cell lines.
Author information
Authors and Affiliations
Contributions
Conceptualization, H.H., D.H.H.H., D.R., and R.J.; methodology, H.H., D.R., D.H.H.H., Y.T., S.Y.H., M.W., and R.J.; software, H.H., M.W.; investigation, H.H., and S.W.Y.; writing – original draft, H.H., S.Y.H., and R.J.; writing—proofreading, H.H., S.W.Y., D.R., D.H.H.H., Y.T., S.Y.H., M.W., and R.J.; funding acquisition, R.J.; resources, R.J.; supervision, R.J.
Corresponding author
Ethics declarations
Competing interests
The authors declare the following competing interests: H.H., D.H.H.H., M.W., and R.J. are inventors on U.S. Patent Application No. 18/453,112 (published as US 2024/0084265 A1) relating to this work. The authors may be entitled to royalty or other financial benefits under their institutions’ IP policies. All other authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks Alice Jouneau and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Simona Chera and David Favero. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Hu, H., Ho, D.H.H., Yeung, S.W. et al. Enhancing reprogramming towards induced human expanded pluripotency through substitution of SOX2 with engineered SOX17 transcription factors. Commun Biol (2026). https://doi.org/10.1038/s42003-026-09963-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-026-09963-7


