Pf8: an open dataset of Plasmodium falciparum genome variation in 33,325 worldwide samples

Abdel Hamid, Muzamil Mahdi; Abdelraheem, Mohamed Hassan; Acheampong, Desmond Omane; Adam, Ishag; Aide, Pedro; Ajibaye, Olusola; Ali, Mozam; Almagro-Garcia, Jacob; Amambua-Ngwa, Alfred; Amenga-Etego, Lucas; Aniebo, Ifeyinwa; Aninagyei, Enoch; Ansah, Felix; Apinjoh, Tobias O; Ariani, Cristina V; Auburn, Sarah; Awandare, Gordon A; Balmer, Andrew; Bejon, Philip; Boene, Simone; Bwire, George; Candrinho, Baltazar; Chidimatembue, Arlindo; Chindavongsa, Keobouphaphone; Comiche, Kiba; Conway, David; Dara, Antoine; Diakite, Mahamadou; Djimde, Abdoulaye; Dondorp, Arjen; Doumbia, Seydou; Drury, Eleanor; Fanello, Caterina A; Ferdig, Mike; Figueroa, Katherine; Gamboa, Dionicia; Golassa, Lemu; Gonçalves, Sónia; Guindo, Merepen dite Agnes; Hamaluba, Mainga; Hanboonkunupakarn, Borimas; Howe, Kevin; Hussien, Maazza; Imwong, Mallika; Ishengoma, Deus; Jeans, Julia; Kabaghe, Alinune; Kamuhabwa, Appolinary; Kindermans, Jean-Marie; Konate, Drissa S; Kwiatkowski, Dominic P; Lee, Chiyun; Lee, Samuel K; Lee, Sue J; Ley, Benedikt; Llanos-Cuentas, Alejandro; Marfurt, Jutta; Matambisso, Glória; Maude, Rapeephan Rattanawongnara; Maude, Richard James; Mayor, Alfredo; Mayxay, Mayfong; Maïga-Ascofaré, Oumou; McCann, Robert S; Miles, Alistair; Miotto, Olivo; Mohamed, Abdelrahim Osman; Morang’a, Collins Misita; Murie, Kathryn; Ngasala, Billy Ephraim; Nguyen, Thuy-Nhien; Nolasco, Oscar; Nosten, Francois; Noviyanti, Rintis; O'Connor, Ísla; Oboh, Mary; Ochola-Oyier, Lynette Isabella; Olufunke Falade, Catherine; Olukosi, Adeola; Olumide, Ajibola; Olusola, Fiyinfoluwa I; Onyamboko, Marie A; Oriero, Eniyou Cheryll; Oyibo, Wellington Aghoghovwia; Pannebaker, Danielle; Pearson, Richard D; Phiri, Kamija; van der Pluijm, Rob W; Price, Ric N; Quang, Huynh Hong; Rajkumar Devaraju, Vinoth; Randrianarivelojosia, Milijaona; Ranford-Cartwright, Lisa; Rayner, Julian C; Rovira-Vallbona, Eduard; Rowlands, Katherine; Ruano-Rubio, Valentin; Sanchez, Juan F; Saúte, Francisco; Shettima, Shuwaram; da Silva, Clemente; Simpson, Victoria J; Suddaby, Simon; Takken, Willem; Thu, Aung Myint; Toure, Mahamoudou; Unlu, Eyyub; Valdivia, Hugo O; van Vugt, Michele; Waithira, Naomi; Wellems, Thomas; Wendler, Jason; White, Nina; and Wuendrich Ogidan, Rachel (2025) Pf8: an open dataset of Plasmodium falciparum genome variation in 33,325 worldwide samples. Wellcome Open Research, 10. p. 325. ISSN 2398-502X. DOI: 10.12688/wellcomeopenres.24031.1

We describe the Pf8 data resource, the latest MalariaGEN release of curated genome variation data on over 33,000 Plasmodium falciparum samples from 99 partner studies and 122 locations, collected over more than 50 years. This release provides open access to raw sequencing data and genotypes at over 12 million genomic positions. For the first time, it includes copy-number variation (CNV) calls in the drug-resistance-associated genes gch1 and crt. As in Pf7, CNV calls are provided for mdr1 and plasmepsin2/3, along with calls for deletions in hrp2 and hrp3, genes associated with rapid diagnostic test failures. The resource also features derived datasets, interactive web applications for exploring patterns of drug resistance and variation in over 5,000 genes, an updated Python package providing methods for accessing and analysing the data, and open-access analysis notebooks that can be used as starting points for further analyses. In addition, example analyses show contrasting profiles of the decline of chloroquine resistance-associated mutations in Africa, and differences in copy-number variation across 10 distinct sub-populations. To the best of our knowledge, Pf8 is the largest open dataset of genome variation in any eukaryotic species, making it an invaluable foundational resource for understanding evolution, including that of pathogens.


Published Version
Available under Creative Commons: Attribution 4.0
