- Open Access
Network analysis of ChIP-Seq data reveals key genes in prostate cancer
© Zhang et al.; licensee BioMed Central Ltd. 2014
Received: 23 December 2013
Accepted: 8 August 2014
Published: 3 September 2014
Prostate cancer (PC) is the second most common cancer among men in the United States, and it imposes a considerable threat to human health. A deep understanding of its underlying molecular mechanisms is the premise for developing effective targeted therapies. Recently, deep transcriptional sequencing has been used as an effective genomic assay to obtain insights into diseases and may be helpful in the study of PC.
In present study, ChIP-Seq data for PC and normal samples were compared, and differential peaks identified, based upon fold changes (with P-values calculated with t-tests). Annotations of these peaks were performed. Protein–protein interaction (PPI) network analysis was performed with BioGRID and constructed with Cytoscape, following which the highly connected genes were screened.
We obtained a total of 5,570 differential peaks, including 3,726 differentially enriched peaks in tumor samples and 1,844 differentially enriched peaks in normal samples. There were eight significant regions of the peaks. The intergenic region possessed the highest score (51%), followed by intronic (31%) and exonic (11%) regions. The analysis revealed the top 35 highly connected genes, which comprised 33 differential genes (such as YWHAQ, tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein and θ polypeptide) from ChIP-Seq data and 2 differential genes retrieved from the PPI network: UBA52 (ubiquitin A-52 residue ribosomal protein fusion product 1) and SUMO2 (SMT3 suppressor of mif two 3 homolog 2) .
Our findings regarding potential PC-related genes increase the understanding of PC and provides direction for future research.
Prostate cancer (PC) is the second most common cancer among men worldwide . The most common symptoms are difficulty in urinating, erectile dysfunction and problems during sexual intercourse ,. Genetic background contributes to PC risk, as suggested by associations with race, family and specific gene variants ,. Many genes have been found to be involved in PC. For example, mutations in BRCA1 (breast cancer 1, early onset) and BRCA2 (breast cancer 2, early onset) are important risk factors for PC . The authors of a previous article reported that PTEN (phosphatase and tensin homolog) deletions are related to tumor aggression in PC .
Determining how proteins interact with DNA is important to fully view many biological processes and disease states. The information thus derived can lead to a deeper understanding of tumor development. Chromatin immunoprecipitation sequencing (ChIP-Seq) is used to investigate interactions between chromatin-associated protein and DNA . It provides the ability to identify the binding sites of any DNA-associated proteins . CCCTC binding factor (CTCF) binds to three regularly spaced repeats of the core sequence CCCTC, and it is widely applied in ChIP-Seq ,.
ChIP-Seq analysis has recently been used to study PC. The results of these studies provide some meaningful ChIP-Seq data in comprehending the molecular mechanisms of androgen receptors (AR) in PC cells, which may be used to develop novel drugs ,. Yu et al.  systematically mapped the genomic landscape of transcription factors and histone marks across multiple PC cell lines and tissues. ChIP-Seq has also been used to reveal direct binding of AR and ERG (v-ets avian erythroblastosis virus E26 oncogene homolog) to the promoter of glycine N-methyltransferase in VCaP (vertebral cancer of the prostate) cells . There has been little study of the identification of PC-related genes by ChIP-Seq analysis.
Protein–protein interaction (PPI) networks provide valuable information in the understanding of cellular function and biological processes. With the tremendous increase in human protein interaction data, the PPI network approach is used to understand molecular mechanisms of disease, particularly to analyze cancer-related phenomena . PPI networks also provide insights into distinct topological features of cancer genes . Therefore, in the present study, we analyzed ChIP-Seq data derived from PC samples and normal controls, and differential peaks were screened out. Annotations were given for those peaks, and PPI network analysis was performed to identify critical genes related to PC.
Two Sequence Read Archive files, SRR513122 for normal and SRR513123 for tumor, were downloaded from National Center for Biotechnology Information Gene Expression Omnibus database with accession number [GEO:GSE38684] . The original ChIP-Seq data were acquired from two normal primary prostate epithelial cells (PrEC cell lines) and three PC cell lines (LNCaP cell lines) by Bert et al. .
Processing and alignment
Reads with low quality were first discarded. To prevent high-quality reads from being rejected during quality-filtering or assembly processes, we trimmed bases from poor-quality ends of reads . ChIP-Seq reads were aligned to the human genome (hg19) using the Bowtie tool , which allows up to two mismatches in the alignment. Locations with one or more exact matches were kept for further analysis.
Screening and annotation of differential peaks
Model-based analysis of ChIP-Seq (MACS)  was applied to identify peaks (read-enriched regions) from Sequence Alignment/Map (SAM) format files as well as differential peaks between cancer and normal samples. MACS is a command line tool designed to analyze ChIP-Seq data in eukaryotes, especially in mammals . The parameters for calling peaks are as follows: (1) effective genome size = 2.70e + 09, (2) bandwidth = 300 bp and (3) P-value cutoff = 1.00e-10. To provide further functional context for the biological interpretation, peaks with fold enrichment scores above 2.95 were picked out and annotated with respect to known genomic sequence features (such as genes and transcripts) according to the hg19 refGenes data. Some peaks of various widths were assigned to a gene based on the closest transcription start sites (TSSs), then the locations of those peaks were extracted by MACS. DNA motifs were identified using MDSeqPos (P < 0.0001).
Protein–protein interaction network analysis
Proteins (genes) work together to exert certain biological functions, with the proteins exhibiting more interactions with others likely to play more important roles in the whole process. Therefore, PPI network analysis of the genes obtained above was performed with BioGRID (Biological General Repository for Interaction Datasets) , which is a public database of physical and genetic interactions for all major model organism species (combined score above 0.4). This network was then constructed using Cytoscape , which is a software package available on the internet for biological network visualization, data integration and interaction network generation.
Differential peaks between prostate cancer tissue samples and normal samples
Annotation of differential peaks
Protein–protein interaction network of peak-associated genes
Unlike sequence-specific transcription activators such as ER and p53, CTCF binding sites are ubiquitously and universally present throughout the genome, and their chromosomal distribution is strongly correlated with genes . Bert et al. used these data to identify the relationship between long-range epigenetic activation and domain gene deregulation, which is quite different from our results . We used MACS was in the present study and found 5,570 differential peaks comprising 3,726 differentially enriched peaks in the PC sample and 1,844 differentially enriched peaks in the normal sample. According to the annotation results (Figure 2), CTCF binding sites were scattered in chromosomes. About one-half of the CTCF binding sites flanked genes. In addition, some differentially expression genes were obtained with the PPI network analysis.
Figure 4 shows the top 35 highly connected genes in a network of differential genes. Some of them are biomarkers of PC. AR had the sixth most interactors in the network, and the peak of it was enriched in the intergenic region of chromosome X, which is in accordance with the results reported by Taslim et al. . The AR signaling axis plays a critical role in PC development and progression . In a previous study, researchers discovered that AR is highly expressed in PC and that it may inhibit PC progression by suppressing AR expression and activity . An international patent has been filed for the use of AR variants as biomarkers and therapeutic targets in advanced PC .
CDK9 is a regulator of cell cycles. The peaks of CDK9 were aligned upstream of chromosome 9. Shore et al. showed that one isoform of CDK9 was transcribed from an alternative upstream promoter . Gordon et al. found that CDK9 regulated AR promoter selectivity and cell growth through serine 81 phosphorylation . The combined inhibition of Cdk9 and Akt can be utilized to induce apoptosis of metastatic PC cells .
EGFR is also an important player in the PPI network. The position of differential peaks annotation was exonic at chromosome 7. EGFR is the cell-surface receptor for members of the epidermal growth factor family of extracellular protein ligands . EGFR overexpression may serve as a reasonable target for therapeutic intervention in this otherwise difficult-to-treat subset of PC patients . The progression of PC is accompanied by the overexpression of EGFR in a very large majority of cases, suggesting that it may play a crucial role in PC .
NEDD4, whose peak annotation was located in the intergenic region of chromosome 15, was in the center of the PPI network. It is a proto-oncogenic ubiquitin ligase for PTEN, and its upregulation is found in many human cancers . Farooqi et al. reported that SMURF and NEDD4 interference offers therapeutic potential in chaperoning genome integrity .
The peak of HSPA4 (heat shock protein 70 (HSP70)) was located in the intergenic region of chromosome 5. HSPA4 has been implicated in PC ,, and it may also be differentially regulated according to our analysis. Kottke et al. found that induction of hsp70-mediated Th17 autoimmunity can be exploited in immunotherapy for metastatic PC .
SUMO2 was also included in the list. In previous studies, investigators have discovered the role of desumoylation in the development of PC  . Yang et al. reported that small ubiquitin-like modifier isoforms 1, 2 and 3 are activated in human astrocytic brain tumors and are required for glioblastoma cell survival. Therefore, we speculated that SUMO2 might play a similar role in PC.
POTEH, located on chromosomes 21, may also be a candidate for the immunotherapy of PC , which is recorded in GeneCards as a PC-related gene. A previous study showed that POTEH is expressed highly in PC, but is limited in benign tissues .
Overall, we identified a number of key genes related to PC by analyzing ChIP-Seq data in the present study. These genes include AR, CDK9, EGFR, NEDD4, HSPA4 and SUMO4, and about one-half of them were located in the intergenic regions of chromosomes with differential peak annotations. These genes may help enhance the understanding of PC and also provides direction for future research. Future research is needed to define their roles in detail and subsequently develop effective molecular target therapies for PC.
YuZ and YuhZ designed and coordinated the study, participated in most of the experiments and prepared the manuscript. ZH and ZZ provided assistance with the study design, coordinated and carried out all the experiments and participated in manuscript preparation. JL and XZ provided assistance for all experiments. All authors read and approved the final manuscript.
- Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D: Global cancer statistics. CA Cancer J Clin 2011, 61: 69–90. 10.3322/caac.20107PubMedView ArticleGoogle Scholar
- Darwish-Yassine M, Berenji M, Wing D, Copeland G, Demers RY, Garlinghouse C, Fagerlin A, Newth GE, Northouse L, Holmes-Rovner M, Rovner D, Sims J, Wei JT: Evaluating long-term patient-centered outcomes following prostate cancer treatment: findings from the Michigan Prostate Cancer Survivor study. J Cancer Surviv 2014, 8: 121–130. 10.1007/s11764-013-0312-8PubMedView ArticleGoogle Scholar
- Grunfeld EA, Halliday A, Martin P, Drudge-Coates L: Andropause syndrome in men treated for metastatic prostate cancer: a qualitative study of the impact of symptoms. Cancer Nurs 2012, 35: 63–69. 10.1097/NCC.0b013e318211fa92PubMedView ArticleGoogle Scholar
- Lockett KL, Hall MC, Xu J, Zheng SL, Berwick M, Chuang SC, Clark PE, Cramer SD, Lohman K, Hu JJ: The ADPRT V762A genetic variant contributes to prostate cancer susceptibility and deficient enzyme function. Cancer Res 2004, 64: 6344–6348.Google Scholar
- Thompson IM, Ankerst DP, Chi C, Goodman PJ, Tangen CM, Lucia MS, Feng Z, Parnes HL, Coltman CA Jr: Assessing prostate cancer risk: results from the Prostate Cancer Prevention Trial. J Natl Cancer Inst 2006, 98: 529–534. 10.1093/jnci/djj131PubMedView ArticleGoogle Scholar
- Struewing JP, Hartge P, Wacholder S, Baker SM, Berlin M, McAdams M, Timmerman MM, Brody LC, Tucker MA: The risk of cancer associated with specific mutations of BRCA1 and BRCA2 among Ashkenazi Jews. N Engl J Med 1997, 336: 1401–1408.Google Scholar
- Fraser M, Zhao H, Luoto KR, Lundin C, Coackley C, Chan N, Joshua AM, Bismar TA, Evans A, Helleday T, Bristow RG: PTEN deletion in prostate cancer cells does not associate with loss of RAD51 function: implications for radiotherapy and chemotherapy. Clin Cancer Res 2012, 18: 1015–1027.Google Scholar
- Furey TS: ChIP-Seq and beyond: new and improved methodologies to detect and characterize protein–DNA interactions. Nat Rev Genet 2012, 13: 840–852. 10.1038/nrg3306PubMedPubMed CentralView ArticleGoogle Scholar
- Park PJ: ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet 2009, 10: 669–680. 10.1038/nrg2641PubMedPubMed CentralView ArticleGoogle Scholar
- Schmidt D, Wilson MD, Spyrou C, Brown GD, Hadfield J, Odom DT: ChIP-seq: using high-throughput sequencing to discover protein–DNA interactions. Methods 2009, 48: 240–248. 10.1016/j.ymeth.2009.03.001PubMedPubMed CentralView ArticleGoogle Scholar
- Lin B, Wang J, Hong X, Yan X, Hwang D, Cho JH, Yi D, Utleg AG, Fang X, Schones DE, Zhao K, Omenn GS, Hood L: Integrated expression profiling and ChIP-seq analyses of the growth inhibition response program of the androgen receptor. PLoS One 2009, 4: e6589. 10.1371/journal.pone.0006589PubMedPubMed CentralView ArticleGoogle Scholar
- Sharma NL, Massie CE, Ramos-Montoya A, Zecchini V, Scott HE, Lamb AD, MacArthur S, Stark R, Warren AY, Mills IG, Neal DE: The androgen receptor induces a distinct transcriptional program in castration-resistant prostate cancer in man. Cancer Cell 2013, 23: 35–47. 10.1016/j.ccr.2012.11.010PubMedView ArticleGoogle Scholar
- Yu J, Yu J, Mani RS, Cao Q, Brenner CJ, Cao X, Wang X, Wu L, Li J, Hu M, Gong Y, Cheng H, Laxman B, Vellaichamy A, Shankar S, Li Y, Dhanasekaran SM, Morey R, Barrette T, Lonigro RJ, Tomlins SA, Varambally S, Qin ZS, Chinnaiyan AM: An integrated network of androgen receptor, polycomb, and TMPRSS2-ERG gene fusions in prostate cancer progression. Cancer Cell 2010, 17: 443–454. 10.1016/j.ccr.2010.03.018PubMedPubMed CentralView ArticleGoogle Scholar
- Sreekumar A, Poisson LM, Rajendiran TM, Khan AP, Cao Q, Yu J, Laxman B, Mehra R, Lonigro RJ, Li Y, Nyati MK, Ahsan A, Kalyana-Sundaram S, Han B, Cao X, Byun J, Omenn GS, Ghosh D, Pennathur S, Alexander DC, Berger A, Shuster JR, Wei JT, Varambally S, Beecher C, Chinnaiyan AM: Metabolomic profiles delineate potential role for sarcosine in prostate cancer progression. Nature 2009, 457: 910–914. A published erratum appears in Nature 2013, 499: 504 10.1038/nature07762PubMedPubMed CentralView ArticleGoogle Scholar
- Kann MG: Protein interactions and disease: computational approaches to uncover the etiology of diseases. Brief Bioinform 2007, 8: 333–346. 10.1093/bib/bbm031PubMedView ArticleGoogle Scholar
- Kar G, Gursoy A, Keskin O: Human cancer protein-protein interaction network: a structural perspective. PLoS Comput Biol 2009, 5: e1000601. 10.1371/journal.pcbi.1000601PubMedPubMed CentralView ArticleGoogle Scholar
- Bert SA, Robinson MD, Strbenac D, Statham AL, Song JZ, Hulf T, Sutherland RL, Coolen MW, Stirzaker C, Clark SJ: Regional activation of the cancer genome by long-range epigenetic remodeling. Cancer Cell 2013, 23: 9–22. 10.1016/j.ccr.2012.11.006PubMedView ArticleGoogle Scholar
- Blankenberg D, Gordon A, Von Kuster G, Coraor N, Taylor J, Nekrutenko A: Manipulation of FASTQ data with Galaxy. Bioinformatics 2010, 26: 1783–1785. 10.1093/bioinformatics/btq281PubMedPubMed CentralView ArticleGoogle Scholar
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 2009, 10: R25. 10.1186/gb-2009-10-3-r25PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nussbaum C, Myers RM, Brown M, Li W, Liu XS: Model-based analysis of ChIP-Seq (MACS). Genome Biol 2008, 9: R137. 10.1186/gb-2008-9-9-r137PubMedPubMed CentralView ArticleGoogle Scholar
- Feng J, Liu T, Zhang Y: Using MACS to identify peaks from ChIP-Seq data. Curr Protoc Bioinformatics 2011, 34: 2.14:2.14.1–2.14.14.Google Scholar
- Chatr-Aryamontri A, Breitkreutz BJ, Heinicke S, Boucher L, Winter A, Stark C, Nixon J, Ramage L, Kolas N, O'Donnell L, Reguly T, Breitkreutz A, Sellam A, Chen D, Chang C, Rust J, Livstone M, Oughtred R, Dolinski K, Tyers M: The BioGRID interaction database: 2013 update. Nucleic Acids Res 2013, 41(Database issue):D816-D823. 10.1093/nar/gks1158PubMedPubMed CentralView ArticleGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003, 13: 2498–2504. 10.1101/gr.1239303PubMedPubMed CentralView ArticleGoogle Scholar
- Kim TH, Abdullaev ZK, Smith AD, Ching KA, Loukinov DI, Green RD, Zhang MQ, Lobanenkov VV, Ren B: Analysis of the vertebrate insulator protein CTCF binding sites in the human genome. Cell 2007, 128: 1231–1245. 10.1016/j.cell.2006.12.048PubMedPubMed CentralView ArticleGoogle Scholar
- Taslim C, Chen Z, Huang K, Huang THM, Wang Q, Lin S: Integrated analysis identifies a class of androgen-responsive genes regulated by short combinatorial long-range mechanism facilitated by CTCF. Nucleic Acids Res 2012, 40: 4754–4764. 10.1093/nar/gks139PubMedPubMed CentralView ArticleGoogle Scholar
- Lonergan PE, Tindall DJ: Androgen receptor signaling in prostate cancer development and progression. J Carcinog 2011, 10: 20. 10.4103/1477-3163.83937PubMedPubMed CentralView ArticleGoogle Scholar
- Kashat M, Azzouz L, Sarkar SH, Kong D, Li Y, Sarkar FH: Inactivation of AR and Notch-1 signaling by miR-34a attenuates prostate cancer aggressiveness. Am J Transl Res 2012, 4: 432–442.PubMedPubMed CentralGoogle Scholar
- Lok DJA, Van Der Meer P, de la Porte PW B-A, Lipsic E, Van Wijngaarden J, Hillege HL, van Veldhuisen DJ: Prognostic value of galectin-3, a novel marker of fibrosis, in patients with chronic heart failure: data from the DEAL-HF study. Clin Res Cardiol 2010, 99: 323–328. 10.1007/s00392-010-0125-yPubMedPubMed CentralView ArticleGoogle Scholar
- Shore SM, Byers SA, Maury W, Price DH: Identification of a novel isoform of Cdk9. Gene 2003, 307: 175–182. 10.1016/S0378-1119(03)00466-9PubMedView ArticleGoogle Scholar
- Gordon V, Bhadel S, Wunderlich W, Zhang J, Ficarro SB, Mollah SA, Shabanowitz J, Hunt DF, Xenarios I, Hahn WC, Conaway M, Carey MF, Gioeli D: CDK9 regulates AR promoter selectivity and cell growth through serine 81 phosphorylation. Mol Endocrinol 2010, 24: 2267–2280. 10.1210/me.2010-0238PubMedPubMed CentralView ArticleGoogle Scholar
- Mohapatra S, Chu B, Zhao X, Cheng J, Pledger WJ: Combination inhibition of Cdk9 and AKT induces apoptosis of metastatic prostate cancer cells [Abstract 5399].; Presented at the 2007 annual meeting of the American Association for Cancer Research (AACR), April 2007; Los Angeles, CA, USA.Google Scholar
- Herbst RS: Review of epidermal growth factor receptor biology. Int J Radiat Oncol Biol Phys 2004, 59(2 Suppl):S21-S26. 10.1016/j.ijrobp.2003.11.041View ArticleGoogle Scholar
- Marks RA, Zhang S, Montironi R, McCarthy RP, MacLennan GT, Lopez-Beltran A, Jiang Z, Zhou H, Zheng S, Davidson DD, Baldridge LA, Cheng L: Epidermal growth factor receptor (EGFR) expression in prostatic adenocarcinoma after hormonal therapy: a fluorescence in situ hybridization and immunohistochemical analysis. Prostate 2008, 68: 919–923. 10.1002/pros.20715PubMedView ArticleGoogle Scholar
- Guérin O, Fischel JL, Ferrero JM, Bozec A, Milano G: EGFR targeting in hormone-refractory prostate cancer: current appraisal and prospects for treatment. Pharmaceuticals 2010, 3: 2238–2247. 10.3390/ph3072238PubMed CentralView ArticleGoogle Scholar
- Wang X, Trotman LC, Koppie T, Alimonti A, Chen Z, Gao Z, Wang J, Erdjument-Bromage H, Tempst P, Cordon-Cardo C, Pandolfi PP, Jiang X: NEDD4–1 is a proto-oncogenic ubiquitin ligase for PTEN. Cell 2007, 128: 129–139. 10.1016/j.cell.2006.11.039PubMedPubMed CentralView ArticleGoogle Scholar
- Farooqi AA, Mansoor Q, Rana A, Mashhadi TM, Imran M, Naqi SA, Zia-ur-Rehman , Bhatti S: SMURF and NEDD4 interference offers therapeutic potential in chaperoning genome integrity. J Exp Integr Med 2011, 1: 43–50. 10.5455/jeim.030111.or.002View ArticleGoogle Scholar
- Cardile V, Scifo C, Russo A, Falsaperla M, Morgia G, Motta M, Renis M, Imbriani E, Silvestre G: Involvement of HSP70 in resveratrol-induced apoptosis of human prostate cancer. Anticancer Res 2003, 23(6C):4921–4926.PubMedGoogle Scholar
- Nylandsted J, Brand K, Jäättelä M: Heat shock protein 70 is required for the survival of cancer cells. Ann N Y Acad Sci 2000, 926: 122–125. 10.1111/j.1749-6632.2000.tb05605.xPubMedView ArticleGoogle Scholar
- Kottke T, Sanchez-Perez L, Diaz RM, Thompson J, Chong H, Harrington K, Calderwood SK, Pulido J, Georgopoulos N, Selby P, Melcher A, Vile R: Induction of hsp70-mediated Th17 autoimmunity can be exploited as immunotherapy for metastatic prostate cancer. Cancer Res 2007, 67: 11970–11979. 10.1158/0008-5472.CAN-07-2259PubMedView ArticleGoogle Scholar
- Cheng J, Bawa T, Lee P, Gong L, Yeh ET: Role of desumoylation in the development of prostate cancer. Neoplasia 2006, 8: 667–676. 10.1593/neo.06445PubMedPubMed CentralView ArticleGoogle Scholar
- Bawa-Khalfe T, Cheng J, Wang Z, Yeh ET: Induction of the SUMO-specific protease 1 transcription by the androgen receptor in prostate cancer cells. J Biol Chem 2007, 282: 37341–37349. 10.1074/jbc.M706978200PubMedView ArticleGoogle Scholar
- Wu F, Mo YY: Ubiquitin-like protein modifications in prostate and breast cancer. Front Biosci 2007, 12: 700–711. 10.2741/2094PubMedView ArticleGoogle Scholar
- Bera TK, Zimonjic DB, Popescu NC, Sathyanarayana BK, Kumar V, Lee B, Pastan I: POTE , a highly homologous gene family located on numerous chromosomes and expressed in prostate, ovary, testis, placenta, and prostate cancer. Proc Natl Acad Sci U S A 2002, 99: 16975–16980. A published erratum appears in Proc Natl Acad Sci U S A 2003, 100: 1462.Google Scholar
- Bera TK, Huynh N, Maeda H, Sathyanarayana BK, Lee B, Pastan I: Five POTE paralogs and their splice variants are expressed in human prostate and encode proteins of different lengths. Gene 2004, 337: 45–53. 10.1016/j.gene.2004.05.009PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.