Skip to main content

RNA-seq research landscape in Africa: systematic review reveals disparities and opportunities

Abstract

RNA sequencing has emerged as the standard method for transcriptome profiling of several human diseases. We performed a systematic review detailing the state of RNA-seq analyses in Africa from its inception till February 2022. Our goal was to provide an update on the state of RNA-seq analyses in Africa, including research gaps, funding information, participants information, authorship and collaborations. Following the PRISMA guidelines, we performed an exhaustive literature search for RNA-seq studies conducted in Africa, using PubMed, Scopus and Academic Search Complete (EBSCOhost). The output was exported to Endnote X9 for analyses. The initial literature search yielded 10,369 articles spread across PubMed (4916), Scopus (4847) and EBSCOhost (580). By applying our exclusion criteria, 28 full-text articles remained and were thoroughly analyzed. Overall, 17 human diseases were studied, including cancers (10/28), infectious disease (4/28), parasitic disease (4/28), autoimmune disorders (2/28) and neglected tropical diseases (2/28). Majority of the articles were published in PLoS Pathogens, BioMed Central and Nature. The National Institutes of Health (42.4%), the Bill & Melinda Gates Foundation (7.5%) and the Wellcome Trust (7.5%) were the top funders of the research studies. Eleven African countries contributed to the participant group, with 57% located in Eastern Africa, 23.1% from Western and 16.7% from Southern Africa. The extremely low number of RNA-seq research studies in Africa is worrying and calls for an immediate investment in research by the African governments. The funding agencies and institutional review boards should also ensure that African collaborators are treated equitably in the course of the research projects.

Introduction

Gene expression profiling (GEP) has been widely employed in biological research, leading to substantial breakthroughs in our understanding of the molecular mechanisms behind complex human diseases such as cancer, autoimmune and heart diseases, and metabolic disorders [1]. To gain insight into a cell's transcriptome, RNA sequencing (RNA-seq) method is commonly employed. In comparison with previous gene expression measures such as microarray analyses, RNA-seq approaches can discover new transcripts, identify alternatively spliced genes, detect allele-specific expression and quantify gene expression [2]. This makes it a versatile technology for gene expression profiling in various conditions. RNA-seq was initially introduced in 2008 [3, 4] and has grown in popularity over the last decade. This could largely be attributed to the dramatic reduction in the cost of sequencing [5] and the availability of sequencing platforms in several institutions worldwide [6].

RNA-seq studies have made significant contributions to several fields, particularly cancer research [7], including differential gene expression analysis and cancer biomarkers identification [8], cancer drug resistance [9], cancer heterogeneity and evolution [10] and cancer microenvironment and immunotherapy [11]. For RNA-seq analyses to effectively contribute to biomarker identification and precision medicine development, it is incumbent on scientists to consider the different world populations. Gene expression during a diseased state varies considerably from one geographical location, diet, climate, and even gender. As a result, profiling gene expression in as many geographical locations as possible is critical.

The gene expression profiles of western populations have been thoroughly studied; however, data on the expression profiles of African patients are limited. Due to context-specific characteristics, RNA-seq analyses that improve our understanding of major diseases must be replicated in Africa. Additionally, environmental variables such as living conditions, diet, pollution, climate and genetic heterogeneity that may influence gene expression are more pronounced in Africa [12]. As a result of these differences, it may not be prudent to generalize results from RNA-seq studies conducted in only one population.

Currently, to the best of our knowledge, no study has summarized RNA-seq studies performed within Africa in order to delineate potential researchable areas, gaps in knowledge, countries at the forefront of RNA-seq and those lagging in Africa. It is an undeniable fact that Africa is burdened with diseases such as malaria, neglected tropical diseases, tuberculosis, trypanosomiasis, and onchocerciasis, different from those of the developed world. This accentuates the need for health-related research to focus on these diseases.

In this study, we performed a systematic review of RNA-seq analyses that have been conducted on African patients with various human diseases. Our goals were to (1) summarize studies that implemented RNA-seq studies on African patients; (2) identify potential research gaps and future research options in RNA-seq analyses in Africa; (3) identify countries that are championing RNA-seq in Africa; (4) assess the contribution of African-based researchers; (5) ascertain whether the target diseases are relevant to Africans; (6) provide information on funding sources and the number of papers published by African-based scientists on RNA-seq analyses.

Materials and methods

Data sources and study selection

In strict adherence to the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines [13], two of the authors conducted a comprehensive literature search. We searched for published articles in three authoritative scientific databases: PubMed, Scopus and Academic Search Complete via EBSCOhost. Hand searches using Google and Microsoft Bing search engines were also undertaken. The search terms were broadly grouped based on the type of next-generation sequencing (NGS) technology, the relevant countries, years of publication and the type of species being studied. The complete list of the search terms for each database can be found in Supplementary file 1.

Inclusion and exclusion criteria

The search results were considered eligible provided that:

  1. a.

    the studies employed bulk RNA-seq technology,

  2. b.

    the year of publication of the article was 2008 onwards,

  3. c.

    the study participants originated from Africa,

  4. d.

    the study was published in English language,

  5. e.

    the study was conducted on humans.

Articles with the following characteristics were excluded from the analyses:

  1. a.

    studies that employed gene quantification methods such as microarray,

  2. b.

    studies that performed single-cell RNA-seq,

  3. c.

    studies that analyzed publicly available RNA-seq data,

  4. d.

    studies where participants originated from countries other than an African country,

  5. e.

    studies that used animal models,

  6. f.

    studies published in languages other than English,

  7. g.

    review articles, letters to the editor, commentary, opinions, case studies and editorials.

Quality assessment

Two reviewers conducted all the steps in (Fig. 1). The citations of all the search results were imported into Endnote X9. All duplicated articles were removed. Papers whose titles and abstracts were revealed to be irrelevant to the subject and for the purpose of the study were removed. All articles that failed the inclusion criteria were also removed. Results from both reviewers were compared until a consensus was reached. All authors addressed discrepancies in the results.

Data extraction and synthesis

We extracted the following information from the relevant full-text articles obtained by the two reviewers; (a) the demographics of the study participants; (b) the author affiliations; (c) the disease type being studied; (d) the funding sources; (e) the journals in which the paper was published and; (f) the author’s information. By assessing the disease type being studied, our goal was to establish the extent to which the studies focused on diseases relevant to Africa, as defined by the Africa Centers for Disease Control and Prevention [14]. These include neglected tropical diseases, malaria, anthrax, tuberculosis, pneumonia, HIV/AIDS, Lassa fever and meningococcal meningitis. Regarding the funding sources, we focused on organizations that directly funded the project instead of scholarships awarded to one or more of the authors. We only reported on participants and researchers based in Africa in international projects.

Results

Literature search output

Following the PRISMA guidelines, an initial literature search yielded 10,369 articles consisting of PubMed (4916), Scopus (4847) and EBSCOhost (580). No article was found by hand searching using the two search engines (Google and Microsoft Bing). After duplicate removal, 9958 records remained and based on the set inclusion criteria, 8935 articles were removed. The full texts of 222 articles were assessed for eligibility, and 28 studies were finally arrived at and included in the study (Fig. 1).

Fig. 1
figure 1

Flowchart of literature search for articles that employed RNA-seq analyses in African populations

Journal information and diseases studied

The 28 articles included in this study employed RNA-seq analyses to study the gene expression profiles of African patients. The authors adhered to the conventional RNA-seq study design (case–control), where two groups of individuals were studied; one group had the disease of interest, and the other group did not. Overall, 17 different diseases were studied, including cancers (10/28), infectious disease (4/28), parasitic disease (4/28), autoimmune disorders (2/28), neglected tropical diseases (2/28) and others (6/28) (Table 1). Burkitt lymphoma was the most studied of all the cancers (5/10) (Fig. 2). Majority of the studies employed DeSeq2 (11/28) and EdgeR (5/28) for the differential gene expression analysis with utilizing various statistically methods to unraveled the genes that are differentially expressed. Table 2 contains information about the differentially expressed genes specific to African population that were identified in each of the 28 studies, their p-values / false discovery rate (FDR) cut off, as well as the downstream analyses performed.

Table 1 Study characteristics, including body sites, sample types, diseases, funding and data availability
Fig. 2
figure 2

An African map showing the distribution of studies and the various diseases across eleven countries. The colored circles represent the disease type under study, whereas the size of the circles represent the number of studies conducted within a country

Table 2 Study characteristics including Journal names, sequencing platforms and authors home countries

The articles were published in 23 different open-accessed journals, and only one is unavailable in PubMed Central. PLoS Pathogens, BioMed Central (BMC), Nature and Blood Advances contained the highest number of published articles with 17.9%, 14.3%, 14.3% and 7.1%, respectively (Table 2). The majority of the articles were published between 2017 and 2021 (22/28) (Fig. 3), and all the articles are freely available for download.

Fig. 3
figure 3

Number of published articles in a specific year

Study participants’ information and distribution across Africa

Overall, 986 participants of African descent were included in the studies. They were distributed across 11 countries in Western (233/1010), Eastern (576/1010), Southern (169/1010) and Northern (32/1010) Africa. Some studies recruited patients from two or more countries, which increased the total to 1010. The highest number of participants was recruited from Uganda (n = 298), Tanzania (150) and Guinea (128). Sudan and Mali had the least number of participants of 32 and 8, respectively. Sudan was the only Northern African country that recruited participants for the RNA-seq studies (Fig. 4). Approximately 61% of the studies had no information on the gender of the study participants. Of the remaining 39%, 62% were females, whereas 38% were males.

Fig. 4
figure 4

A map of Africa showing the number of study participants and their age groups across the various countries. The size of the circles depict the total number of study participants, while the different colors represent the different age groups

Body site and sample type

Half of the studies (49.9%) collected blood from the participants (Figs. 4 and 5). Other sample types used included tissue biopsies (39.3%), fine needle aspirates (3.6%), spinal fluid (3.6%) and sputum (3.6%). The most widely studied body site was also blood (41.9%), followed by the jaw (6.5%) and abdomen (6.5%) (Figs. 4 and 5). The remaining body sites (neck, breast, mandible, mucous, intestine, skin, spinal cord, throat and the colon) each constituted about 3.2% (Fig. 6). Studies that did not provide information on the body sites where the samples were extracted comprised approximately 16.1%.

Fig. 5
figure 5

An African map showing the different sample types (colored circle) and the number of studies (circle sizes) that used a specific sample type

Fig. 6
figure 6

Graphical representation of the body sites from which the samples were taken for RNA extraction

Authorship information

We explored information on the authors of the various articles with specific emphasis on the first, the last and the corresponding authors, as well as the extent of collaborations among African-based and non-African-based researchers (Fig. 7). Although first authors may not be authorities in their respective fields, they contribute most during manuscript writing and editing. Last authors and corresponding authors are seasoned researchers in their fields and usually serve supervisory and review roles. The corresponding authors of the 28 articles were unequally distributed across 11 countries from three continents (North America, 19; Europe, 9; and Africa, 4). Researchers in the USA accounted for over 67% of all the corresponding authors. This was followed by authors from the U.K. (12.5%) and South Africa (6.3%). Zambia, Spain, Sweden, Canada, Uganda, Germany, Switzerland and Italy had one corresponding author each. Regarding the first authorship, 53.3% were based in the USA, 13.3% were from the U.K., and 6.7% were from Zambia. The USA and the U.K. had the highest numbers of the last authors; 19 and 3, respectively. South Africa and Spain had two last authors each. In 75% of the articles, the corresponding authors doubled as the last authors (Table 2). The highest level of collaboration was observed between researchers from the USA and the following African countries; Uganda (6), Zambia (5), Tanzania (5), Nigeria (2), Malawi (2) and Kenya (2) (Fig. 7).

Fig. 7
figure 7

Graphical representation of the extent of collaborations between African-based and non-African researchers. The higher the extent of collaboration, the deeper the shade of red color

Funding

Table 1 and Fig. 8 summarize the organizations and agencies that awarded grants to the respective authors for the research projects in Africa. The grants were received from agencies within nine countries spread across three continents (North America, Europe and Africa). Funding agencies from the United States of America (USA), United Kingdom (U.K.) and Germany provided 59.1%, 11.4% and 9.1%, respectively, of the entire grants awarded to the authors. South Africa was the only African country whose government made grants available to support RNA-seq research projects, and they accounted for 4.5% of the total number of grants awarded.

Fig. 8
figure 8

A Representation of the number of studies funded by international agencies and B the countries where the agencies are located

Moreover, the funding bodies that contributed the most grants were the National Institutes of Health, NIH (42.4%), the Bill & Melinda Gates Foundation (7.5%) and the Wellcome Trust (7.5%). Five different agencies funded one study, 2 studies were supported by four agencies, 4 studies by three agencies, 4 studies by two agencies, 15 studies by one agency and 2 studies had no information on the funding sources. Grand Challenges Canada, the German Research Foundation, the Steward Foundation and the National Health Laboratory Research Trust (South Africa) were among the other funding bodies.

Sequencing technologies and data availability

Regarding the availability of the RNA-seq data in public databases, 16/28 studies made the research data available with accession numbers, one was available upon reasonable request to the corresponding authors, one could not be made available due to identifying patient information, and 10 had no information on data availability. The 28 studies used NGS platforms from Illumina [27] and Thermo Fisher Scientific (1). Illumina HiSeq2000 was employed by 10/28 authors for sequencing, followed by Illumina HiSeq2500 (9/28), Illumina Nextseq500 (3) and Illumina HiSeq4000 (2). One study used Thermo Fisher’s Ion Torrent Proton sequencer (Table 2).

Discussion

In the present systematic review, we aimed at surveying research projects within Africa that employed RNA-seq to analyze the gene expression profiles of African patients with various human diseases. Undertaking such analyses revealed the specific diseases that are widely studied and those that have been neglected. It also provided comprehensive information on the funding sources, countries with authors in the leading role, participants’ demographics, authors' affiliations and collaborations.

From 2008, when the first results of RNA-seq were published, till February 2022, 28 RNA-seq studies have been conducted where Africans formed part of the study participants. Considering the high prevalence of infectious and other parasitic diseases in Africa, we found the total number of studies to be quite disappointing and further accentuates the underrepresentation of Africans at the genomics level. Since its discovery, only eleven out of the 54 African countries have employed RNA-seq analysis to studies diseases affecting them. Interestingly, most of these studies were performed in Eastern Africa patients, while Northern Africa had a single study. This observation could be attributed to the fact that the studied diseases were not common in Northern Africa compared to the Western and Eastern parts of Africa. For instance, Burkitt lymphoma was the most widely studied disease, and it is highly prevalent in malaria-endemic regions such as Eastern and Western Africa [15]. It is possible that competent researchers in Northern Africa may prioritize other diseases using other sequencing technologies such as whole-exome or whole-genome sequencing.

Over 80% of the studies recruited less than 50 participants. Although six samples (participants) are sufficient to conclude RNA-seq studies [16], increasing the number of samples greatly increases the statistical power and culminates in concrete conclusions [17]. The inability of the researchers to include many participants may be due to financial constraints, including sequencing costs, consumables and wet lab analyses. One method that could have been employed to minimize the sequencing cost was multiplexing [18, 19], allowing simultaneous sequencing of multiple samples.

All the articles were published in either open-accessed or subscription-based journals with 27/28 were freely available through PubMed. The remaining article was also freely available for download on the Journal's website. The free availability of the articles could be due to the open-access revolution and the scientific community’s advocacy for open-access publication [20]. Most African universities and other academic institutions struggle to subscribe to first-class Journals due to the high cost involved. Open access publication will ensure that studies conducted in Africa will be freely available to students, policymakers, government officials and the entire scientific community. It also increases the visibility of African researchers on the global stage, which promotes collaborative research activities. As a result of the expensive nature of RNA-seq studies, these collaborations could acquire grants that can be used to study diseases that terrorize the African continent.

Studies that focused on the gene expression profiles of patients with cancer constituted 36% used RNA-seq. This was unexpected, as the majority of diseases that plague the African continent are parasitic diseases, infectious diseases and neglected tropical diseases, as stipulated by the African CDC [14]. Many of the funding agencies that underwrite the projects come with their already-defined objectives and diseases that can be studied. This makes it difficult for researchers based in Africa to focus on more common diseases in Africa. The various African governments and philanthropists can mitigate this by providing financial resources, which will be used to exclusively study some of these diseases that are prevalent in Africa, such as Malaria, Trypanosomiasis and other neglected tropical diseases. Although some funding agencies abroad allocate funds to study these diseases, they come with terms and conditions that may limit the nature of analyses to be performed.

The U.S. NIH made the most contribution (42%) in funding for the studies. The BMGF and Wellcome Trust also contributed some level of funding. Surprisingly, the South African government was the only African government that partially funded one of the 28 RNA-seq studies through the National Health Laboratory Research Trust. This observation is consistent with findings from Allali, Abotsi [21], who review the state of microbiome research in Africa. African governments and other funding bodies based in Africa need to understand the importance of research and its contribution to growth and development. Researchers based in Africa continue to rely on foreign grants to conduct research are less likely to enjoy the flexibility of studying diseases that threaten public health in Africa, such as malaria, neglected tropical diseases, cholera, Chikungunya, Dengue Fever, Ebola Virus Disease, Lassa Fever and typhoid fever. Additionally, these researchers may not receive the necessary recognition deemed them. This is evident in our study, where over 50% of the first, last and corresponding authors were based in the USA. On the other hand, only 12.5%, 16.7%, and 15.6% of the corresponding, the first and the last authors were based in Africa. If the African government invests in research by providing grants, African scientists can initiate and lead projects involving public health emergency diseases.

The sequencing platform used for most of the studies was Illumina. This was distributed among Hiseq2000, Hiseq2500 and Hiseq4000. The Illumina Hiseq system was launched in 2010 [22] and has since been the sequencing platform of choice for RNA-seq [23,24,25,26]. This could be attributed to the following; Illumina Hiseq has a multiplexing system, enabling thousands of different samples to be sequenced simultaneously. It also has an average error rate of 2% and a storage space of 3 T.B. (Hiseq2000). The qualities of the Illumina platform make it ideal for researchers in resource-constrained environments such as Africa.

More than 50% of the studies provided accession numbers to the RNA-seq data deposited in public repositories. To promote reproducibility, data deposition in public databases by authors has been championed by both the funding bodies [27] and Journals [28]. The data could also be used as secondary data for students and researchers, mainly those based in resource-constrained institutions such as Africa. Other research projects can pool several of these data, add to their generated data, and analyze, as seen in Kaymaz’s study on Burkitt lymphoma [29].

The 28 studies included in this review recruited 986 participants from Africa. Uganda and Tanzania had the highest number of participants, 298 and 150, respectively. Three reasons could account for this observation; the authors based in these African countries had collaborators in foreign institutions who had already secured funding for the project. Secondly, it could be possible that the studied diseases had a high prevalence in these African countries. For instance, Burkitt lymphoma was the most widely studied disease, and it is widespread in malaria and Epstein–Barr virus endemic regions such as Eastern Africa. Finally, the high number of participants from a specific country could be attributed to researchers affiliated with African and non-African institutions. This is one avenue through which resources could be available for a research project.

The highest level of collaboration was observed between African scientists and their American counterparts. Similar observations were made by Alali Allali, Abotsi [21] in their review on microbiome research in Africa. One attributing factor is that the American authors secured the grants and served as the principal investigators on the projects. Their labs perform all the web lab analyses, including library preparation and sequencing. Additionally, they contribute the most to data analyses and manuscript preparation. One significant role of the African collaborator has been patient recruitment and sample collection [30], which has been extensively criticized or perceived as “helicopter research” [31,32,33]. The funding agencies and Institutional review boards can mitigate this unfair treatment of African scientists by their foreign counterparts by including a compulsory capacity-building component in their grant calls and also for African governments should fund research in areas that disproportionally affect their populations. This will force the western researchers to equip and train their African collaborators in undertaking independent research.

Conclusion

In this review, we have explored the RNA-seq research landscape in Africa from RNA-seq inception till February 2022. The scanty number of published studies on RNA-seq in Africa highlights the need for African governments to invest in research. Funding agencies and institutional review boards should ensure capacity-building in cutting-edge research by ensuring equitable collaboration between African and non-African researchers.

Availability of data and materials

Not applicable.

References

  1. Kim K, Zakharkin SO, Allison DB. Expectations, validity, and reality in gene expression profiling. J Clin Epidemiol. 2010;63(9):950–9.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Kukurba KR, Montgomery SB. RNA sequencing and analysis. Cold Spring Harbor Protocols. 2015;2015(11):pdb-top084970.

  3. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621–8.

    Article  CAS  PubMed  Google Scholar 

  4. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008;320(5881):1344–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. McCombie WR, McPherson JD. Future promises and concerns of ubiquitous next-generation sequencing. Cold Spring Harb Perspect Med. 2019;9(9): a025783.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Koch CM, Chiu SF, Akbarpour M, Bharat A, Ridge KM, Bartom ET, et al. A beginner’s guide to analysis of RNA sequencing data. Am J Respir Cell Mol Biol. 2018;59(2):145–57.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Hong M, Tao S, Zhang L, Diao L-T, Huang X, Huang S, et al. RNA sequencing: new technologies and applications in cancer research. J Hematol Oncol. 2020;13(1):1–16.

    Article  Google Scholar 

  8. Govindarajan M, Wohlmuth C, Waas M, Bernardini MQ, Kislinger T. High-throughput approaches for precision medicine in high-grade serous ovarian cancer. J Hematol Oncol. 2020;13(1):1–20.

    Article  Google Scholar 

  9. Debruyne DN, Dries R, Sengupta S, Seruggia D, Gao Y, Sharma B, et al. BORIS promotes chromatin regulatory interactions in treatment-resistant cancer cells. Nature. 2019;572(7771):676–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Tirosh I, Izar B, Prakadan SM, Wadsworth MH, Treacy D, Trombetta JJ, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352(6282):189–96.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Pan Y, Lu F, Fei Q, Yu X, Xiong P, Yu X, et al. Single-cell RNA sequencing reveals compartmental remodeling of tumor-infiltrating immune cells induced by anti-CD47 targeting in pancreatic cancer. J Hematol Oncol. 2019;12(1):1–18.

    Article  CAS  Google Scholar 

  12. Campbell MC, Tishkoff SA. African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping. Annu Rev Genomics Hum Genet. 2008;9:403–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Internal Med. 2009;151(4):264–9.

    Article  Google Scholar 

  14. Africa CDC. Available on the world wide web at https://africacdc.org/disease/ (18 April 2022, date last accessed). 2022.

  15. Rainey JJ, Omenah D, Sumba PO, Moormann AM, Rochford R, Wilson ML. Spatial clustering of endemic Burkitt’s lymphoma in high-risk regions of Kenya. Int J Cancer. 2007;120(1):121–7.

    Article  CAS  PubMed  Google Scholar 

  16. Baccarella A, Williams CR, Parrish JZ, Kim CC. Empirical assessment of the impact of sample number and read depth on RNA-Seq analysis workflow performance. BMC Bioinformatics. 2018;19(1):1–12.

    Article  Google Scholar 

  17. Salifu SP, Nyarko HN, Doughan A, Msatsi HK, Mensah I, Bukari A-RA. RNA-seq analyses: benchmarking differential expression analyses tools reveals the effect of higher number of replicates on performance. bioRxiv. 2020.

  18. Hou Z, Jiang P, Swanson SA, Elwell AL, Nguyen BKS, Bolin JM, et al. A cost-effective RNA sequencing protocol for large-scale gene expression studies. Sci Rep. 2015;5(1):1–5.

    Article  Google Scholar 

  19. Shishkin AA, Giannoukos G, Kucukural A, Ciulla D, Busby M, Surka C, et al. Simultaneous generation of many RNA-seq libraries in a single reaction. Nat Methods. 2015;12(4):323–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Beall J. What the open-access movement doesn’t want you to know. Academe. 2015;101(3):37–40.

    Google Scholar 

  21. Allali I, Abotsi RE, Tow LA, Thabane L, Zar HJ, Mulder NM, et al. Human microbiota research in Africa: a systematic review reveals gaps and priorities for future research. Microbiome. 2021;9(1):1–54.

    Article  Google Scholar 

  22. Liu L, Li Y, Li S, Hu N, He Y, Pong R, et al. Comparison of next-generation sequencing systems. J Biomed Biotechnol. 2012;2012.

  23. Ansari-Pour N, Zheng Y, Yoshimatsu TF, Sanni A, Ajani M, Reynier JB, et al. Whole-genome analysis of Nigerian patients with breast cancer reveals ethnic-driven somatic evolution and distinct genomic subtypes. Nat Commun. 2021;12(1):6946. PubMed PMID: 34836952. Pubmed Central PMCID: PMC8626467 CancerIQ, serves as Scientific Advisor at Tempus and is on the Board of 54gene. All other authors declare no competing interest. Epub 2021/11/28. eng.

  24. Dupnik KM, Reust MJ, Vick KM, Yao B, Miyaye D, Lyimo E, et al. Gene expression differences in host response to Schistosoma haematobium infection. Infect Immun. 2019;87(1).

  25. Cummings MJ, Bakamutumaho B, Price A, Owor N, Kayiwa J, Namulondo J, et al. Multidimensional analysis of the host response reveals prognostic and pathogen-driven immune subtypes among adults with sepsis in Uganda. Crit Care. 2022;26(1):1–15.

    Article  Google Scholar 

  26. Duffy FJ, Du Y, Carnes J, Epstein JE, Hoffman SL, Abdulla S, et al. Early whole blood transcriptional responses to radiation-attenuated Plasmodium falciparum sporozoite vaccination in malaria naïve and malaria pre-exposed adult volunteers. Malaria J. 2021;20(1):308.

    Article  CAS  Google Scholar 

  27. Glover SW, Webb A, Gleghorn C. Open access publishing in the biomedical sciences: could funding agencies accelerate the inevitable changes? Health Info Libr J. 2006;23(3):197–202.

    Article  PubMed  Google Scholar 

  28. Sholler D, Ram K, Boettiger C, Katz DS. Enforcing public data archiving policies in academic publishing: a study of ecology journals. Big Data Soc. 2019;6(1):2053951719836258.

    Article  Google Scholar 

  29. Kaymaz Y, Oduor CI, Yu H, Otieno JA, Ong’echa JM, Moormann AM, et al. Comprehensive transcriptome and mutational profiling of endemic Burkitt lymphoma reveals EBV type-specific differences. Mol Cancer Res. 2017;15(5):563–76.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Boshoff N. Neo-colonialism and research collaboration in Central Africa. Scientometrics. 2009;81(2):413–34.

    Article  Google Scholar 

  31. Haelewaters D, Hofmann TA, Romero-Olivares AL. Ten simple rules for Global North researchers to stop perpetuating helicopter research in the Global South. Public Library Sci. 2021;17:e1009277.

    CAS  Google Scholar 

  32. Adame F. Meaningful collaborations can end ‘helicopter research’. Nature. 2021.

  33. Minasny B, Fiantis D. Helicopter research: who benefits from international studies in Indonesia. The Conversation [Internet]. 2018.

  34. Abate F, Ambrosio MR, Mundo L, Laginestra MA, Fuligni F, Rossi M, et al. Distinct viral and mutational spectrum of endemic Burkitt lymphoma. PLoS Pathog. 2015;11(10): e1005158.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Chama M, Amadi BC, Chandwe K, Zyambo K, Besa E, Shaikh N, et al. Transcriptomic analysis of enteropathy in Zambian children with severe acute malnutrition. EBioMedicine. 2019;45:456–63.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Duffy FJ, Du Y, Carnes J, Epstein JE, Hoffman SL, Abdulla S, et al. Early whole blood transcriptional responses to radiation-attenuated Plasmodium falciparum sporozoite vaccination in malaria naïve and malaria pre-exposed adult volunteers. Malar J. 2021;20(1):1–13.

    Article  Google Scholar 

  37. Dupnik KM, Reust MJ, Vick KM, Yao B, Miyaye D, Lyimo E, et al. Gene expression differences in host response to Schistosoma haematobium infection. Infect Immun. 2019;87(1):e00291-e318.

    Article  CAS  PubMed  Google Scholar 

  38. Estévez O, Anibarro L, Garet E, Pallares Á, Barcia L, Calviño L, et al. An RNA-seq based machine learning approach identifies latent tuberculosis patients with an active tuberculosis profile. Front Immunol. 2020;11(14):1–12.

    Google Scholar 

  39. Fedoriw Y, Selitsky S, Montgomery ND, Kendall SM, Richards KL, Du W, et al. Identifying transcriptional profiles and evaluating prognostic biomarkers of HIV-associated diffuse large B-cell lymphoma from Malawi. Mod Pathol. 2020;33(8):1482–91.

    Article  CAS  PubMed  Google Scholar 

  40. Fisher RG, Smith DM, Murrell B, Slabbert R, Kirby BM, Edson C, et al. Next generation sequencing improves detection of drug resistance mutations in infants after PMTCT failure. J Clin Virol. 2015;62:48–53.

    Article  CAS  PubMed  Google Scholar 

  41. Hatem G, Hjort L, Asplund O, Minja DTR, Msemo OA, Møller SL, et al. Mapping the cord blood transcriptome of pregnancies affected by early maternal anemia to identify signatures of fetal programming. J Clin Endocrinol Metab. 2022;107:1303.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Mahady G, Kanabar P, Los N, Lawal T, Patel S, Maienschein-Cline M, et al. Transcriptomic analysis reveals that combinations of vitamins A, D2, and D3 have synergistic effects in HCT-116 colon cancer cells by altering the expression of genes involved in multiple canonical pathways including apoptosis, regulation of the. Funct Foods Health Dis. 2021;11(4):154–78.

    Article  Google Scholar 

  43. Kelly P, Amadi B, Chandwe K, Besa E, Zyambo K, Chama M, et al. Gene expression profiles compared in environmental and malnutrition enteropathy in Zambian children and adults. EBioMedicine. 2021;70: 103509.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Lai RPJ, Cortes T, Marais S, Rockwood N, Burke ML, Garza-Garcia A, et al. Transcriptomic characterization of tuberculous sputum reveals a host Warburg effect and microbial cholesterol catabolism. MBio. 2021;12(6):e01766-e1821.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Popescu CR, Tembo B, Chifisi R, Cavanagh MMM, Lee AH-Y, Chiluzi B, et al. Whole blood genome-wide transcriptome profiling and metagenomics next-generation sequencing in young infants with suspected sepsis in a low-and middle-income country: a study protocol. Gates Open Res. 2020;4:139.

    PubMed  PubMed Central  Google Scholar 

  46. Lidenge SJ, Kossenkov AV, Tso FY, Wickramasinghe J, Privatt SR, Ngalamika O, et al. Comparative transcriptome analysis of endemic and epidemic Kaposi’s sarcoma (KS) lesions and the secondary role of HIV-1 in KS pathogenesis. PLoS Pathog. 2020;16(7): e1008681.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Liu X, Speranza E, Muñoz-Fontela C, Haldenby S, Rickett NY, Garcia-Dorival I, et al. Transcriptomic signatures differentiate survival from fatal outcomes in humans infected with Ebola virus. Genome Biol. 2017;18(1):4.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Lombardo KA, Coffey DG, Morales AJ, Carlson CS, Towlerton AM, Gerdts SE, et al. High-throughput sequencing of the B-cell receptor in African Burkitt lymphoma reveals clues to pathogenesis. Blood Adv. 2017;1(9):535–44.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Mulindwa J, Matovu E, Enyaru J, Clayton C. Blood signatures for second stage human African trypanosomiasis: a transcriptomic approach. BMC Med Genomics. 2020;13(1):14.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Panea RI, Love CL, Shingleton JR, Reddy A, Bailey JA, Moormann AM, et al. The whole-genome landscape of Burkitt lymphoma subtypes. Blood. 2019;134(19):1598–607.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Wichers JS, Tonkin-Hill G, Thye T, Krumkamp R, Kreuels B, Strauss J, et al. Common virulence gene expression in adult first-time infected malaria patients and severe cases. eLife. 2021;10.

  52. Vlasova-St Louis I, Musubire AK, Meya DB, Nabeta HW, Mohei H, Boulware DR, et al. Transcriptomic biomarker pathways associated with death in HIV-infected patients with cryptococcal meningitis. BMC Med Genomics. 2021;14(1).

  53. Tso FY, Kossenkov AV, Lidenge SJ, Ngalamika O, Ngowi JR, Mwaiselage J, et al. RNA-Seq of Kaposi’s sarcoma reveals alterations in glucose and lipid metabolism. PLoS Pathog. 2018;14(1): e1006844.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Tran TM, Jones MB, Ongoiba A, Bijker EM, Schats R, Venepally P, et al. Transcriptomic evidence for modulation of host inflammatory responses during febrile Plasmodium falciparum malaria. Sci Rep. 2016;6:31291.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Silterra J, Gillette MA, Lanaspa M, Pellé KG, Valim C, Ahmad R, et al. Transcriptional categorization of the etiology of pneumonia syndrome in pediatric patients in malaria-endemic areas. J Infect Dis. 2017;215(2):312–20.

    CAS  PubMed  Google Scholar 

  56. Schmitz R, Young RM, Ceribelli M, Jhavar S, Xiao W, Zhang M, et al. Burkitt lymphoma pathogenesis and therapeutic targets from structural and functional genomics. Nature. 2012;490(7418):116–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Rothen J, Murie C, Carnes J, Anupama A, Abdulla S, Chemba M, et al. Whole blood transcriptome changes following controlled human malaria infection in malaria pre-exposed volunteers correlate with parasite prepatent period. PLoS ONE. 2018;13(6): e0199392.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Rose TM, Bruce AG, Barcy S, Fitzgibbon M, Matsumoto LR, Ikoma M, et al. Quantitative RNAseq analysis of Ugandan KS tumors reveals KSHV gene expression dominated by transcription from the LTd downstream latency promoter. PLoS Pathog. 2018;14(12):e1007441.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Funding

This work is supported by the U.S. National Institutes of Health Common Fund [Grant numbers U24HG006941 to Samson Pandam Salifu and 1U2RTW010679 to Samson Pandam Salifu and Albert Doughan]. The content of this publication is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The funding body did not play any roles in the design of the study, collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

SPS: conceptualize paper, constructive suggestions for the manuscript, review and editing of the final paper. AD: drafting of the original manuscript, contributed to search strategy definition, literature search, data analyses, figure generation and editing of the final paper. WA: contributed to search strategy definition, literature search, data analyses and figure generation. All authors approved the final version.

Corresponding author

Correspondence to Samson Pandam Salifu.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Contains all the search terms used in the analyses.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Doughan, A., Adingo, W. & Salifu, S.P. RNA-seq research landscape in Africa: systematic review reveals disparities and opportunities. Eur J Med Res 28, 244 (2023). https://doi.org/10.1186/s40001-023-01206-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40001-023-01206-3