6 General Discussion
When analysing microbial communities using 16S rRNA Amplicon Sequencing, it is important to note that the obtained OTU abundances are nothing more than relative measures of the read abundances of the extracted DNA sequences in the samples relative to the total amount of reads. They do not exactly represent the amount of cells present of a particular microorganism due to various biases. These biases are introduced by for example multiple gene copy numbers (GCN) within each cell, primer specificity, and DNA extraction (M. C. van Loosdrecht et al., 2016). A study in 2004 found that the 76 bacterial genomes sequenced contained between 1-15 operon copies and up to 7 are commonly found. Only 40% of the genomes had 1-2 (Acinas, Marcelino, Klepac-Ceraj, & Polz, 2004). This introduces a significant bias which can be further enhanced by multiple genome copies. Additionally, the primers used to target the 16S rRNA gene are designed based on a consensus sequence matching as many bacteria as possible, and therefore whole taxonomic groups can be overlooked. To provide a more complete picture of the microbial communities, perhaps all 9 variable regions of the 16S rRNA should be sequenced, or alternatively sequencing full-length ribosomal small subunits (SSUs) using the Oxford Nanopore instead (S. M. Karst et al., 2016). Together, these biases may have had the consequence that the true ecological differences between the WWTPs are not exactly as presented here. Because the abundances do not seem to have a large influence on the differences when comparing the results of PCA (Figure 4.1) with PCoA using the Bray-Curtis dissimilarity measure (Figure 4.2), biases in abundances are considered unimportant, however. If neccessary, tools exist to correct the abundances based on known GCNs of specific bacteria (Angly et al., 2014).
The fact that the WWTPs have many OTUs in common is supported by the work of A. M. Saunders, Albertsen, Vollertsen, & Nielsen (2016), where they found that 13 Danish WWTPs (of which many are the same as those investigated in this report) contained 63 genera making up 68% of the total reads using 16S rRNA amplicon sequencing. In one of the WWTPs, they further related these genera to what was present in the influent wastewater, concluding that 10% of the total reads were due to immigration from the influent wastewater, and that these OTUs participated little to the wastewater treatment based on their growth rate in the AS. This helps elucidate the influence of the bacteria present in the influent wastewater on the microbial community of the activated sludge. The exact influence of the influent wastewater to the WWTPs sampled here have not been investigated and is a subject for further study.
There have not previously been performed many studies of this kind using different ordination methods to describe so many samples at once, and in particular from the AS of different WWTPs. The largest similar study to date seems to be of GenBank records of 202 globally distributed environmental samples from different soil and water environments (C. A. Lozupone, Hamady, Kelley, & Knight, 2007). The authors used PCoA of UniFrac distances, which incorporates phylogenetic distances (Lozupone & Knight, 2005), to identify clusters of samples based on known physical environmental factors. They found that salinity was a major environmental determinant of the microbial community compositions, more impactful than other physical factors like temperature or pH. This supports that the halophilic genera observed in the Ribe WWTP (Figure 4.6) indeed could be due to high salinity in the wastewater. In another study of 14 geographically separated WWTPs in China and US (Zhang, Shao, & Ye, 2012), they performed both Cluster Analysis using the Bray-Curtis dissimilarity measure aswell as PCoA of weighted UniFrac distances on data obtained using Roche 454 pyrosequencing of 16S rRNA amplicons. They were able to roughly identify 3 clusters of samples from the WWTPs, however only 15 samples were analysed. Interestingly, like A. M. Saunders et al. (2016), they also found that the majority (70 to be exact) of the 744 identified genera present in all 15 samples made up a large amount of the OTU reads (63.7%), which supports that a few genera can be dominant in most WWTPs, as also observed in this study (5 out of all 364 identified genera made up ~20% of the reads). Furthermore, Zhang et al. (2012) reported that some of the most abundant genera identified in the 14 WWTPs were Zoogloea, Trichococcus, Prosthecobacter, and Dechloromonas (in decreasing order of abundance), where Zoogloea and Prosthecobacter are not observed among the 40 most abundant genera in this study (Figure 4.4). This is possibly due to a different climate or other factors, however.
The fact that the WWTPs seem similar and have many OTUs in common is further confirmed by other ordination methods than the ones presented in this report. There are numerous combinations of data transformation and filtering, ordination methods, and distance measures which can be used to analyse the microbial communities of the WWTPs, however only a handful of the most informative have been shown in this report. Several other methods showed somewhat the same patterns, and in general they all indicated that the WWTPs are similar with shared OTUs (a few additional plots can be found in Appendix A). Categorising the microbial communities of WWTPs using ordination was initially inspired by the clustering of human gut microbiomes into so-called Enterotypes (Arumugam et al., 2011). Here, the authors particularly used one measure only (Jensen-Shannon Divergence) to cluster the microbiomes with little reasoning noted, and the practice of Enterotyping has received criticism (Knights et al., 2014; Koren et al., 2013), because the clustering of microbial communities highly depends on the chosen distance measure. This is also evident in the ordination analyses in this report, where different measures revealed different results from the exact same data, which therefore highlights the importance of knowing what kind of information the particular measure is able to reveal in the data and use this as an advantage. CCA were able to reveal unique OTUs in the WWTPs, while PCA and PCoA instead highlighted differences based on OTUs generally with a high read abundance.
The eigenvalue percentages of the axes plotted are generally low in the ordination plots, especially in CCA. This is believed to be due to the large amount of similar samples being analysed at the same time. When analysing fewer samples from only one year, 2013 (Figure A.7 in Appendix A), the overall patterns are very similar to the patterns observed when all 622 samples are analysed at once (Figure 4.3). Again, the microbial communities of the Ribe and Esbjerg E+W WWTPs seem to be different from the rest of the WWTPs, but the eigenvalue percentages of the axes are significantly higher (all samples: 5.8% and 3.7% - 2013 samples: 13.9% and 9.5%). This further confirms that there are many similar OTUs between the samples, or else the eigenvalues would not be significantly lower with more samples analysed. In perspective, in the study mentioned earlier of 202 environmental samples (C. A. Lozupone et al., 2007), the axis percentages of their PCoA analyses were similarly low (between 3.1%-5.5%), which indicates that this is not unusual when analysing this many samples at once.
As a last note, it would arguably make sense to filter the OTUs that are only present in one sample to provide a completely representative picture of the microbial communities of each WWTP, as these OTUs are possibly not part of the functional “core” microbial community of the particular WWTP. As mentioned, there are many of these OTUs which are only present in one sample, and filtering them may have revealed a more ecologically meaningful representation of the WWTPs with respect to their performance.