https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Tahoe-100M
Tahoe-100M is a giga-scale single-cell perturbation atlas consisting of over 100 million transcriptomic profiles from 50 cancer cell lines exposed to 1,100 small-molecule perturbations. Generated using Vevo Therapeutics' Mosaic high-throughput platform, Tahoe-100M enables deep, context-aware exploration of gene function, cellular states, and drug responses at unprecedented scale and resolution. This dataset is designed to power the development of next-generation AI… See the full description on the dataset page: https://huggingface.co/datasets/tahoebio/Tahoe-100M.
AIDO.Cell Dataset Collection
Cell Type Classification
Dataset Name Location
Citation Notes
Zheng zheng 11 Zheng et al. 2017 Human PBMCs. Same splits as Ho et al. 2024.
Segerstolpe Segerstolpe 13 Segerstople et al. 2016 Same splits as Ho et al. 2024.
scTab sctab 164 Fischer et al. 2024 TileDB version of the minimal dataset from scTab's GitHub.
Perturbation Datasets
Tahoe-100M
For demonstration purposes, we include data… See the full description on the dataset page: https://huggingface.co/datasets/genbio-ai/cell-downstream-tasks.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Tahoe-100M
Tahoe-100M is a giga-scale single-cell perturbation atlas consisting of over 100 million transcriptomic profiles from 50 cancer cell lines exposed to 1,100 small-molecule perturbations. Generated using Vevo Therapeutics' Mosaic high-throughput platform, Tahoe-100M enables deep, context-aware exploration of gene function, cellular states, and drug responses at unprecedented scale and resolution. This dataset is designed to power the development of next-generation AI… See the full description on the dataset page: https://huggingface.co/datasets/tahoebio/Tahoe-100M.