Biohub Commits $500 Million to Build the Open Cellular Dataset for AI-Powered Biology
Chan Zuckerberg Biohub committed $500 million over five years to build open AI-trainable datasets of human cells, partnering with NVIDIA, the Allen Institute, Arc Institute, and the Human Cell Atlas.
Aaron Rafferty
April 30, 2026

Key Takeaways:

Chan Zuckerberg Biohub announced the Virtual Biology Initiative on April 29, 2026, committing $500 million over five years to build open AI-trainable datasets of human cells, with $400 million for internal data generation and $100 million for external research.

Partners include NVIDIA, Arc Institute, Allen Institute, and the Human Cell Atlas, with Biohub head of science Alex Rives saying current datasets cover roughly one billion cells and the field needs an order of magnitude more.

The commitment is structured as open infrastructure rather than proprietary research, with all generated data made freely available to the global scientific community.

Chan Zuckerberg Biohub announced the Virtual Biology Initiative on April 29, 2026, a five-year, $500 million commitment to build the open cellular datasets needed to train AI models that can predict how human cells behave in health and disease. The split is $400 million for internal data generation, imaging, and engineering technology, and $100 million for external research labs working on the same infrastructure problem. Renaissance Philanthropy contributed an additional undisclosed amount.

The premise is the scaling argument that worked in language models and protein structure prediction. According to Biohub head of science Alex Rives, current biological datasets cover about one billion cells, and the field needs roughly an order of magnitude more before predictive cell models become useful.

The initiative coordin