PANGEA Cohorts and Data


The PANGEA database holds over 38,600 NGS sequence files from sub-Saharan Africa with basic epidemiological metadata associated with them. For some cohorts, extensive metadata is available; use the metadata tab to explore this data and contact us for more information.

We currently obtain full genomes from approximately 90% of high-quality samples; many of the less-complete sequences originate from participants who are likely to be virally suppressed. See the resources section for the sequencing protocols used.

Country         Sampling Period    Number of Sequences
Botswana        2004-2018          7,574
Kenya           2005-2013          1,508
South Africa    2004-2020          4,160
Tanzania        2005-2008          57
Uganda          2003-2023          17,987
Zambia          2013-2018          7,412