Trailblazing deeper and scalable cellular biology.

Hi, my name is Duo Peng. I'm a senior computational biologist at the Chan Zuckerberg Biohub San Francisco, where I lead a team building bioinformatics solutions that:

  • Map the human subcellular proteome through organellar mass spectrometry data.
  • Automate CRISPR informatics for cell biology research at scale.
  • Mine, process, and model omics datasets at the repository scale, encompassing entire species and large-scale perturbation studies.

Education

University of Georgia, Athens, Georgia, U.S.A.

2012-2017

Dissertation: Developing CRISPR/Cas9 for Genome-Wide Gene Editing in the Human Pathogen Trypanosoma cruzi

University of Georgia, Athens, Georgia, U.S.A.

2012-2016

Dissertation: Frequent Intra-Family Recombination in the Largest Repository of Antigen Variants in The Protozoan Pathogen Trypanosoma cruzi

Wuhan University, Wuhan, Hubei, P.R.China

2006-2010

Thesis: Predicting Trans-splicing by Analysis of RNA-seq Sequencing Data

Selected Publications

corresponding author
* equal contribution

1.  M.Y. Hein*, D. Peng*, V. Todorova*, F. McCarthy*, K. Kim*, C. Liu*, ..., J.E. Elias, M.D. Leonetti (2025) Global organelle profiling reveals subcellular localization and remodeling at proteome scale. Cell [link]

2.  D. Peng, M. Vangipuram, J. Wong, M.D. Leonetti (2024) protoSpaceJAM: an open-source, customizable and web-accessible design platform for CRISPR/Cas insertional knock-in. Nucleic Acids Research [link]

3.  D. Peng, E.G. Kakani, E. Mameli, C. Vidoudez, S.N. Mitchell, G.E. Merrihew, M.J. MacCoss, K. Adams, T.A. Rinvee, W.R. Shaw, F. Catteruccia. (2022) A male steroid controls female sexual behaviour in the malaria mosquito. Nature [link]

4.  D. Peng, R. Tarleton. (2015) EuPaGDT: A Web Tool Tailored to Design CRISPR Guide RNAs for Eukaryotic Pathogens. Microbial Genomics [link]

5.  D. Peng, S.P. Kurup, P.Y. Yao, T.A. Minning, R.L. Tarleton. (2014) CRISPR-Cas9-mediated Single-gene and Gene Family Disruption in Trypanosoma cruzi. mBio [link]

6.  D. Peng, X. Gu, L.J. Xue, J.H. Leebens-Mack, C.J. Tsai. (2014) Bayesian phylogeny of sucrose transporters: Ancient Origins, Differential Expansion and Convergent Evolution in Monocots and Dicots. Frontiers in Plant Science [link]

7.  D.B. Weatherly*, D. Peng*, R.L. Tarleton. (2016) Recombination-driven Generation of the Largest Pathogen Repository of Antigen Variants in the Protozoan Trypanosoma cruzi. BMC Genomics [link]

8.  Z. Zuo*, D. Peng*, X. Yin, X. Zhou, H. Cheng, R. Zhou. (2013) Genome-wide Analysis Reveals Origin of Transfer RNA Genes From tRNA Halves. Molecular Biology and Evolution [link]

See Google Scholar for a complete list

Work Experience

Senior computational biologist
2024.07-present

1. Build machine learning models to resolve host gene signatures at different resolutions and to predict host response under perturbations.
2. Data-driven understanding of the landscape of cellular responses from multimodal assays.

Bioinformatics data scientist II
2023.01-2024.06

1. Data-driven understanding of subcellular architecture (preprint).
2. Species-wide data mining for paired host and viral gene expression; building machine learning models to resolve host gene signatures at different resolutions.

Bioinformatics data scientist I
2021.11-2022.12

1. ProtoSpaceJAM: Genome-wide CRISPR knock-in design at scale using biologically informed algorithms (paper, webapp).
2. DeepGenotype: Calculate frequencies of protein-level mutations from deep-sequencing reads of CRISPR-edited cells (codebase).

Software developed

1.  Data portal for: Global organelle profiling reveals the human proteome’s subcellular landscape and its dynamic remodeling
    online access (hosted by the Chan Zuckerberg Biohub San Francisco)

2.  Web-based App: ProtoSpaceJAM - CRISPR knock-in design at scale
    online access (hosted by the Chan Zuckerberg Biohub San Francisco)
    code base

3.  Web-based App: Eukaryotic Pathogen gRNA design tool (this webserver had 24,907 users, 49,267 visits, and 17,972 job requests from 91 countries [Google Analytics, 2021])
    online access (hosted by the University of Georgia)

4.  Automated Image Preprocessing and Malaria-oocyst Recognition Tool
    online access (hosted by AWS cloud)
    code base: Preprocessing, Recognition