Skip to content

ArcInstitute/arc-virtual-cell-atlas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Arc Virtual Cell Atlas

Important

Data Migration Notice: Arc's Virtual Cell Atlas data has migrated to the Google Cloud Marketplace.

Note: The new bucket is subject to Requester Pays. Users can access up to 2TB of data per month for free before fees apply.

Access to the current GCS buckets (gs://arc-ctc-tahoe100/ and gs://arc-scbasecount/) will be deprecated on March 31, 2026. Please update your workflows to use the Google Marketplace bucket gs://arc-institute-virtual-cell-atlas.

The Arc Virtual Cell Atlas is a collection of high quality, curated, open datasets assembled for the purpose of accelerating the creation of virtual cell models. The atlas includes both observational and perturbational data from over 602 million cells (and growing).

The atlas is bootstrapped with Tahoe’s Tahoe-100M and Arc’s AI agent-curated scBaseCount dataset.

Tahoe-100M

Documentation

scBaseCount

Documentation

Virtual Cell Challenge

Documentation

About

Arc Virtual Cell Atlas

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors