Genomic characterisation of perinatal Western Australian Streptococcus agalactiae isolates


As a leading cause of neonatal sepsis, Streptococcus agalactiae, commonly known as Group B Streptococcus, is a major neonatal pathogen. Current global screening practices employ risk- or culture-based protocols for detection of these organisms. In Western Austra-lia (WA), universal culture-based screening is provided, with subsequent intrapartum antibi- otic prophylaxis for all S. agalactiae-positive women during labour. Widespread antibiotic exposure is not ideal and this is one of the factors driving development of vaccines against S. agalactiae. Vaccine candidates have focused on the capsule, surface proteins and pilus types, however, capsule serotypes are known to vary geographically. The aim of this study was to use genome sequencing to gain an understanding of the circulating genotypes in WA, and to assess variations in the associated gene pools. We sequenced 141 antenatal carriage (vaginal/rectal) isolates and 10 neonatal invasive disease isolates from WA. Based on the global PubMLST database, the 151 strains were characterised into 30 sequence types, with clustering of these mainly into clonal complexes 1, 12, 17, 19 and 23. Of the genes encoding eleven surface proteins that were analysed, the most prevalent were fbp, lmband scpBwhich were present in ? 98% of isolates. A cluster of non-haemolytic isolates, one of which was a neonatal invasive disease isolate, appeared to lack the entire cyllocus. Admixture analysis of population structure revealed evidence of genetic transfer among the WA isolates across structural groups. When compared against the PubMLST S. agalactiae data, WA isolates showed high levels of strain diversity with minimal apparent clustering. This is the first whole genome sequence study of WA S. agalactiaeisolates and also represents the first addition of Australian isolate data to PubMLST. This report provides insight into the distribution and diversity of vaccine targets of S. agalactiaewithin Western Australia, indicating that the most appropriate capsular vaccine for this population would be the pro- posed pentavalent (Cps Ia, Ib, II, III and V) preparation, whilst vaccines targeting surface proteins should ideally utilise Fbp, Lmb and/or ScpB.


As an opportunistic pathogen, Streptococcusagalactiae, commonly known as Group B Strepto- coccus, is able to reside as a commensal in some individuals and cause serious infection in oth- ers; commonly those who are immunocompromised, and typically the elderly or neonates. As the leading cause of neonatal sepsis, S. agalactiae is targeted in the antenatal and intrapartum period by antibiotic treatment to reduce the risk of transmission from mother to infant and effectively reduce associated disease [1]. Currently, all women in Western Australia (WA) are offered S. agalactiaetesting to assess presence at 35–37 weeks gestation and this involves cul- ture analysis. Approximately 10–30% of women will be carriers of S. agalactiae and receive antibiotics based on this presence/absence result [2]. This universal screening practice pro- vides a currently under-utilized opportunity to attain a better understanding of circulating S. agalactiaestrains in our population through serotyping and whole genome sequencing of clin- ical specimens to further refine S. agalactiaediagnostic and therapeutic approaches.

The recent comprehensive systematic review and meta-analysis by Russell et al. [2] demonstrated the substantial global geographical variation in circulating S. agalactiaeserotypes. For example, serotype III, most commonly associated with invasive disease, had an overall global prevalence rate of 25% of colonised women, yet in Central America and South-Eastern Asia the prevalence rate was less than half this at 11% and 12%, respectively. In addition, Dutraet al. [3] documented regional variation in circulating GBS serotypes in Brazil highlighting the importance of documenting population dynamics at a local level, such as in WA, where we recently described the first data on circulating GBS capsular genotypes [2–4]. However, although knowledge of capsule (CPS) serotype is of significant importance for formulating vaccines that target the S. agalactiae capsule, with the exception of CPS III, information on serotype is not particularly useful for comparing carriage and invasive disease isolates which are generally found distributed amongst all serotypes [5]. This is further complicated by reports of capsular switching [6–9] and discrepant results between serotyping methods [10,11].

While some serotypes predominate across all countries, variation can impact the potential vaccine coverage within each population. In Japan for example, the globally rarer serotypes such as VI, VIII and incidences of IX, are observed more frequently [12]. Serotype VIII in particular is strongly associated with Japan, with studies reporting high rates in Japanese pregnant women [13]. It is apparent that each region should tailor serotype vaccines to their population to optimise efficacy. The success of this approach will rely on the knowledge of the circulating serotypes within each country.

Beyond serotype description, we have the capability of delving more deeply into the charac- terisation and epidemiology of S. agalactiae isolates and a number of molecular approaches have been described [12]. A common molecular identifier is through use of the seven house- keeping genes, adhP, pheS, atr, glnA, sdhA, glcK and tkt, described by Jones and colleagues [14]. The multi locus sequence typing (MLST) database, PubMLST, is a valuable public source of S. agalactiaesequence typing data, however, it currently contains no Australian isolates.MLST profiles were often determined via PCR [14], however, the use of whole genome sequencing (WGS) is becoming more popular. Population analysis allowed by WGS enables the detection of new and emerging clusters of GBS, some of which may represent pathogenic clades, which would, therefore, have implications for public health surveillance. The depth of information that can be afforded by such methods can also clarify the origins of disease. For example, Da Cunha and others [15] used WGS to show that the emergence of S. agalactiae disease in humans can be traced through genetic lineage as a result of extensive tetracycline use.

The aim of this study was to use WGS to characterise circulating S. agalactiaestrains in WA pregnant women and neonates and to compare these data to those from other geographical areas, in an effort to inform vaccine targets.


Clinical isolates

Antenatal isolates were collected upon written consent as part of a larger cohort study, Pre- dict1000, which was approved by the Women and Newborn Health Service Human Research Ethics Committee (201535EW). This study was conducted at King Edward Memorial Hospital, Perth, the largest Tertiary Obstetrics Hospital in WA, over 2015–2017 [4]. The majority of the participants included in this study resided in the broader Perth and surrounding areas, however, rural participants were also represented in this cohort (n = 4). Specimen collection involved self-collected vaginal and rectal e-swabs (Copan, Italy) from pregnant women at 14– 22 weeks gestational age (GA) and again at 33–40 weeks GA. Specimens were processed immediately by culture and also stored at -80˚C for molecular analysis. Culture for S. agalactiae detection involved Strep B Carrot Broth™ (Hardy Diagnostics, California, USA) followed by sub-culture onto CHROMagar™ StrepB (CHROMagar, Paris, France), the latter with the ability to detect non-hemolytic S. agalactiae isolates. All S. agalactiae positive cultures were confirmed by PCR [16] and stored at -80˚C as pure isolates. Of the 151 antenatal isolates, 139 represented isolates from different participants, while 12 isolates represented multiple sampling (different gestation or sample site) of 5 participants. Ten archived invasive disease isolates collected from neonatal blood cultures at King Edward Memorial Hospital over 2012–2014 were also included in analyses, and these were cultured from pure stocks.

DNA extraction

DNA from pure S. agalactiaeisolates was extracted using the QIAGEN MagAttract Microbial DNA Extraction Kit (QIAGEN) as per manufacturer’s instructions using the KingFisher Duo (ThermoFisher Scientific) automated platform.

Whole genome sequencing

Extracted DNA was quantified on a QubitTM fluorometer (Life Technologies) with a dsDNA broad range kit (Life Technologies), and 1.0 ng of DNA was used in the Illumina Nextera XT library preparation protocol as per manufacturer’s instructions (Illumina). The genomic DNA was tagmented, indexed by PCR and purified using AMPure XP beads. Library pooling was performed after quantification of the library size using the LabChip1 GXII. The pooled DNA library (12 pM) from all 151 S. agalactiaestrains was sequenced on an Illumina Nextseq using a paired-end 300 bp read workflow.

Sequence analysis

DNA sequences were analysed using the Short Read Sequence Typing for Bacterial Pathogens (SRST2) program [17] for the detection of the Multi-Locus Sequence Typing (MLST) loci [14]. Alleles were assigned integers using the S. agalactiae database within PubMLST (http:// [18]. The seven housekeeping loci used for MLST were adhP, pheS, atr, glnA, sdhA, glcKand tkt[14]. Assembly was performed using SPAdes (version 3.10.3) [19] followed by the PubMLST database automated pipeline for annotation of genes, in addition to the online tool Genome Comparator [18]. All assembled genomes were analysed using QUAST genome assembly evaluation tool [20] to ensure quality (S2 Table). The capsulargenotypes were determined through analyses of the capsular locus as previously described [21]. Additionally, SRST2 [17] was used to identify partial genes which were not annotated by the automated pipeline (alleles: rib= 1, bac= 1–89, bca= 1, alp2= 1–2, alp3= 1–6 and alp4= AJ488912.1). The presence of alpgenes encoding a family of surface protein antigens was con- firmed by multiplex PCR, as described previously [22] and confirmed with Sanger sequencing. Those that were not confirmed by PCR were manually curated to ensure flanking regions of the gene were adjacent or that SRST2 did not detect partial genes. Population structure was analysed using BAPS (version 6) [23] with hierarchical clustering via HierBAPS admixture analysis [24].

Representative global database

The PubMLST S. agalactiaedatabase ( [18] contained 4,214 S. agalactiaepublicly available submissions; of these, a total of 2,801 isolate records contained >1500 tagged loci, indicative of genomic sequence availability. These were exported and assessed to include all variations available for continent, country, ST, source, disease and year. The resulting dataset of 1378 isolates [25–29] was used as a global representative for further analysis and comparison with our Australian isolates (S3 Table). The genome comparator tool was used to align all loci, followed by neighbor-joining tree analysis using MEGA (version 7) [30] with 500 boot-straps and visualised using the Interactive Tree of Life online software (iTOL version 3) [31].


Population structure

The majority of the clinical isolates collected from WA were observed to cluster into five clonal complexes (CCs), CC1, CC12 (also known as CC10), CC17, CC19 and CC23 (Fig 1A), how- ever, a small cluster (n = 5) of ST22 isolates (which may be considered CC17) and a single ST248 isolate were also observed. Neonatal invasive disease-causing isolates were distributed relatively evenly among the antenatal carriage isolates across each CC. The phylogenetic tree revealed diversity within the CC1 lineage with various branches and seven (Ia, Ib, II, IV, V, VI and VIII) associated capsular genotypes (n = 41 isolates). In contrast, CC17 appeared largely clonal and consisted of ST17 (all CPS III, n = 9) and a single ST291 (CPS IV) isolate. Similarly, CC23 also exhibited minimal branching in the WA population, comprising four capsular genotypes (Ia, III, V and VI) indicative of recent clonal expansion (n = 41). BAPS analysis identified six subgroups within the WA S. agalactiae population in accordance with the CCs and phylogenetic analysis. Multiple isolates were sequenced from 13 patients and included different sites (vaginal and rectal) and/or different gestational ages during pregnancy. Paired isolates clustered together as identical isolates, except for two cases. The first case included vaginal and rectal specimens from a mother at both sampling gestations (?22 weeks and ?33 weeks) where three of the four S. agalactiae isolates were identical and within CC1 (263_V, 263_R and II-263_R), while the remaining isolate collected from the vagina (II-263_V) was distinct and clustered within CC23. Another case included specimens from different sites of the same mother (198_R and II-198_V) at different gestational ages which were genetically distinct (S1 Table).

Admixture analysis

Admixture analysis exposed the extent of genetic exchange evident in this population and identified various donor subgroups (Fig 1B). CC17 and CC19 showed no evidence of

recombination within this population and appeared only as genetic donors, particularly CC19 with CC1 and CC23. CC12 appeared to have obtained the majority of genetic information from CC1. The singleton cluster appeared uniform (all ST22 isolates), except for a single ST248 isolate, which branched outside the ST22 cluster and contained recombination originating from CC1, CC12 and CC19.

Sequence types

A total of 150/151 isolates were represented by 30 currently described STs. A novel MLST pro- file was identified for an additional isolate with the allele combination of 5, 4, 6, 1, 2, 1 and 2 for adhP, pheS, atr, glnA, sdhA, glcK and tkt, respectively. Three STs (ST1, 23 and 19) accounted for >50% of the isolates. Comparison with the PubMLST database revealed several STs that had hypothetical allelic profiles and had been assigned ST numbers accordingly, but where no representative clinical isolates had currently been described (ST15, ST248, ST414 and ST509). In addition, six STs including ST861, ST1167, ST529, ST569, ST585 and ST890 were identified where only one isolate had been submitted to the database.

Virulence genes

Several virulence genes were examined across this cohort including those encoding capsules, surface proteins, pilus islands, haemolysins and hyaluronate lyase (Table 1). All capsular genotypes were represented in this study, except for CPS VII. Overall, capsular genotypes Ia, V, III and II predominated in this cross-sectional population. Neonatal disease isolates were only represented by capsular genotypes Ia–VI. Eleven surface protein genes were assessed, with fibrinogen-binding protein (SAG0152), laminin-binding protein (SAG01234), and serine peptidase (SAG1236) detected in 98% of all sequences. To a lesser extent, bac(SAG2195) was detected in 15.7% and rib(SAG0433) in 27.8% of isolates. The majority of isolates contained pilus islands 1 and 2a (1/2a) in combination followed by 2a, 1/2b and 2b. The cyl genes that form the haemolysin locus were detected in the majority of isolates, however, we were unable to detect any genes within the entire locus for six isolates, including pairs 291_V, II-291_V, 202_V and II-202_V, which comprised capsular genotype V, ST41 isolates within CC12. The other two isolates included ST585 neonatal invasive isolate (Neo_9, also capsular genotype V) within CC12 and ST22 carriage isolate (105_R, capsular genotype II). In addition, seven other strains lacked detection of different cyl genes within the locus including II-493_V, II-302_V, 469_V (capsular genotype III, ST19), II-431_V, Neo_6 (capsular genotype II and Ib, respectively, ST12), Neo_2 (capsular genotype V, ST1) and 152_V (capsular genotype Ia, ST23).

Additionally, 80% of the isolates possessed detectable hlyB gene sequences (SAG1197) encoding hyaluronate lyase; of those lacking hlyB, 20/30 were capsular genotype III, CC19 isolates.

Global comparison

In order to compare the genetic profiles of Australian isolates with those from other international cohort studies, WGS data contained within the PubMLST database was accessed. A representative global database was selected as described above and consisted of 1,378 isolates containing whole genome sequence data representative of the diversity available from the PubMLST database; these data were then used for comparison with the Australian isolates (Fig 2). A maximum likelihood phylogeny tree constructed from the combined global and

Australian isolate data was dominated by African isolates (a notable bias as these represent the majority of the S. agalactiaeisolates in the PubMLST database). Although cladal structures

were discernible, isolates from Africa, Europe and Australian origin were distributed amongst these clades suggesting there was no linkage with geographic isolation of the strains. In contrast, North American isolates clustered into a limited number of cladal groups (Fig 2). There was no apparent clustering of invasive disease isolates within any of the global data, with distribution of these amongst carriage isolates being commonplace.


Whole genome sequencing of WA S. agalactiaeisolates has revealed a clonal population struc- ture with evidence of genetic exchange between structural groups. The five main CCs identified here included CC1, CC12, CC17, CC19 and CC23. The ten neonatal disease isolates included in this analysis showed no evidence of clustering based on invasive phenotype, rather these isolates were distributed amongst carriage isolates of all CCs. Although the small sample size of neonatal isolates is a limitation, invasive neonatal isolates were distributed across multiple CCs (CC12 = 3/23 (13%); CC17 = 1/10 (10%); CC1 = 4/41 (9.7%); CC19 = 1/30

(3.3%) and 1/41 (2.4%)). Further admixture analysis of the population structure provided evi- dence of genetic exchange across all structural groups with CC1, CC12 and CC23 being the most diverse of these groups. This diversity was also evident in the capsular genotypes within CC1 (Ia, Ib, II, IV, V, VI and VIII), CC12 (Ia, Ib, II, V and IX) and CC23 (Ia, II, III, V and VI).

In contrast, here we observed no evidence of genetic recombination within CC17 (III and IV) and CC19 (II, III and V) which contained uniform structural groups, but appeared to act as potential donors of genetic material to the other CCs.

Virulence factors such as capsule and surface proteins are being assessed as vaccination tar- gets, with several vaccines in development [32,33]. It is crucial to understand the population dynamics of these targets to assess the population coverage of the candidate vaccines, in addition to further understanding their association with virulence. Capsular types are well- described among S. agalactiaestudies and our study is consistent with global data that suggest capsular genotypes Ia, Ib, II, III and V predominate [2,34,35]. Type VI is often reported in Asian countries, in particular Japan [13] and Malaysia [36,37], however, it was prevalent in similar proportions to Ib and IV in our West Australian antenatal cohort. Furthermore, type VIII and IX were also described in this population, which have rarely been reported in Austral- asia, with CPS VIII representing <2% in some studies [38–40], while IX was first described in 2007 [41] and neither VIII nor IX was detected in another Australian study performed in 2015 [42]. The differences in the genetic diversity of these data sets could be a result of introduction and expansion of founder isolates into these regions and diversification over extended periods of time by evolution or continued introductions into the same region [43].

The emergence of type IV has been reported in the literature in the last decade, particularly in the United States [44–50]. The main circulating STs of CPS IV include ST452 in CC23, ST459 and ST196 in CC1 and ST291 in CC17. We found three STs encompassing the CPS IV isolates in our study including ST196 (n = 4), ST414 (n = 1) and ST291 (n = 1). The only invasive disease isolate of this capsular genotype was an ST196 isolate (Neo_1). This has been observed in other studies [44–48] and interestingly, Lyhs et al. found both cattle and human isolates that represent ST196 of capsular genotype IV, however, genomes were unavailable for comparison [51].

All capsular genotype IV isolates were found to cluster within CC1, except for ST291 which represented the only non-ST17 isolate within CC17. Bellais et al. [6] analysed the evolution of this ST which formed from a capsular switch between a hypervirulent strain from ST17 (CC17) and ST459 (CC1, CPS IV) that provided a 35.5 Kb genomic segment comprising the type IV cpsoperon. Florindo and colleagues [52] found that 10% (9/89) of the type IV isolates were represented by ST291 within CC17. Ferrieri et al. [53] described two ST291 isolates in CC17 isolated from adults (pregnant and non-pregnant) that caused invasive disease. Similarly studies in Ireland and Canada observed type IV isolates of ST291 within CC17 also [8,54]. This has been reported as a notable strain for surveillance monitoring and isolate II-434_V is the first description from Australia. This suggests serotype replacement is a possibility and as with any capsule-specific vaccination strategy, lack of coverage may lead to the emergence of previously rare serotypes causing disease after vaccination implementation [55].

Capsule-based vaccination strategies such as the bivalent (II and III), trivalent (Ia, Ib and III) [33] and pentavalent (Ia, Ib, II, III and V) [32] combinations would potentially cover 35.1%, 51% and 89.4% of isolates in the WA cohort, respectively. The remaining capsular genotypes, as identified in this cohort, include CPS IV (4%), VI (5.3%), VIII (0.7%) and IX (0.7%) and would not be covered and as mentioned above could be a concern given emergence and expansion of previously rare capsular genotypes.

In addition to the capsular polysaccharide, surface proteins have also been shown to be immunogenic and potential vaccine candidates. Rib and alpha proteins have shown promise as surface protein targets with respect to low antibody levels and the association with invasive S. agalactiae infection [56]. While formulations are currently in development, in this WA cohort riband alpha C (bca) genes were not the most frequently identified (27.8% and 21.9%, respectively), suggesting that vaccines targeting these antigens would result in minimal coverage if used in WA women. In contrast, genes encoding surface proteins such as laminin-bind- ing protein, fibrinogen-binding protein and serine peptidase were present in ?98% of isolates and may represent ideal targets for vaccination (if these genes are expressed) with regard to isolate coverage. Lower prevalence was observed with genes bac, alp1, alp2 and alp3 with 15.2%, 29.1%, 1.3% and 19.2%, respectively, and no alp4 was detected within WA isolate genomes. In a previous study conducted examining a range of reference isolates from the US and New Zealand, in addition to 206 clinical isolates (NZ and Australia), only a single isolate was found to contain the alp4surface protein gene (n = 244) which supports our lack of detection [57]. Dangor and colleagues assessed maternal antibody levels against pilus islands, bibA and fbsA and found a weak relative association between anti-fbsA antibody concentrations and invasive S. agalactiae disease. These data suggest that these surface proteins may not be ideal candidates for neonatal disease protection [55].

A neonatal strain of interest was isolate ST585 which had no current PubMLST data, and lacked a detectable haemolysin locus, being one of six isolates (from four subjects) that were non-haemolytic. Non-haemolytic isolates of S. agalactiae are thought to be rare with descrip- tions of <5% of isolates lacking haemolysin activity [58,59]. With haemolysin recognised as a major virulence factor, non-haemolytic strains are often considered less pathogenic, however, a case of hypervirulence associated with such a strain has been reported [60]. Our data provides further evidence that non-haemolytic S. agalactiae ST585 isolates are clinically significant, ST585 (Neo_9) having been isolated from a case of neonatal disease (n = 10). The antenatal car- riers of non-haemolytic GBS, however, did not contain any indication of disease development either themselves or in their neonate according to medical record data, highlighting that this is not exclusive to invasive or carriage isolates. Screening approaches that omit the identification of non-pigmented and non-haemolytic isolates could limit the accurate detection of pathogenic

S. agalactiaestrains. Capsule, surface proteins and haemolysin production are examples of the virulence factors S. agalactiaeuse to cause disease and evade immune defences. A range of other virulence factors have also been described in this species [61–63] and we have examined several of relevance to protein vaccine targets, colonisation and persistence, and disease outcomes.

Another virulence factor, hyaluronate lyase, acts by cleaving a major component of connective tissues and enables the organism to spread. We found the hlyB gene to be present within 80% of both carriage and invasive disease isolates. Associations have been made between ele- vated production of this enzyme and neonatal disease within type III isolates [64], however, other studies have observed invasive disease caused by isolates lacking this gene [65].

The pili of S. agalactiaerepresent a potential vaccine target, but have also been linked to the colonization of the genitourinary tract and biofilm formation. Animal models have revealed that the deletion of the pilA gene results in reduced ability to persist in the vaginal tract of mice [66]. Similarly, biofilm production (suggestive of a persistent colonizer) has been associated with specific pilus types. Rinaudo et al. reported biofilm-forming capacity in isolates that carried pilus 2a and not pilus types 1 and 2b [67]. One of the two isolates we found with pilus island 2b, isolate Neo_7, was an ST17, capsular genotype III neonatal invasive disease isolate. Lazzarin and colleagues showed that ST17 isolates with pilus 2b knockouts were recovered significantly less from blood, lungs and brain tissue in mice, suggesting that this pilus type plays a role in invasion within this ST [68].

More generally, pilus island distribution was assessed among the S. agalactiaelineages and we found similar prevalence of pilus islands based on CCs [69]. CC23 was composed mainly of 2a with some 1/2a isolates, CC17 was all 1/2b except for a single 2b isolate, CC12 consisted of mainly 1/2a with several 2a isolates and CC19 (separated into CC1 and CC19 in the previous study) contained mainly 1/2a, but also 1/2b and 2a isolates. Springman reported pilus islands 2a > 1/2a > 1/2b prevalence in maternal colonising strains, with no detection of 2b [69]. We detected all pilus islands, however, with differences in prevalence including 1/2a > 2a > 1/ 2b > 2b among the antenatal carriage isolates. The single 2b isolate was a singleton of ST248 isolated from the rectum and exhibited a transient loss of colonisation over the two collection time points during pregnancy. Neonatal invasive isolates were also characterized by the previous study and had prevalence of 1/2b > 1/2a > 2a, while again, no 2b was detected [69]. Our study found an order of prevalence of 1/2a > 2a > 2b, with no 1/2b detected in the small number of neonatal invasive disease isolates characterised. Interestingly, the neonatal 2b isolate represented one of two isolates in which the 2b pilus island was detected in the entire cohort.

The small sample size of neonatal isolates in comparison to the antenatal isolates represents a limitation of this study; however, we have attempted to adjust for this through comparison to the extensive global data on invasive S. agalactiaedisease. We present genomic analysis of isolates of clinical importance to WA pregnant women and neonates and have shed light on the population structure of this group. This is the initial step toward monitoring S. agalactiaeevo- lution in our population.


Current global screening protocols, particularly those culture-based approaches (as implemented in WA) resemble an ideal surveillance tool for S. agalactiaein individual populations. We have described S. agalactiaeisolate genetic diversity within a WA cohort for the first time, and have added valuable whole genome sequence data to the public database to allow more representative global analyses. We observed similarities in CCs and capsular genotype distribution and our results are consistent with the global trend of type IV emergence. Additional description of a non-haemolytic cluster of isolates raises questions about screening detection, as some methods lack the capability to detect such isolates. The potential importance of their detection is highlighted as one of these isolates came from a case of neonatal invasive disease. In addition to monitoring prevalent circulating sequence types, our data also provide information on potential therapeutic targets that may be of use in both implementation and assessment of efficacy of future vaccination strategies. Our data indicate that the most appropriate capsular vaccine for this population would be the proposed pentavalent (Cps Ia, Ib, II, III and preparation, whilst vaccines targeting surface proteins should ideally utilise the Fbp, Lmb and/or ScpB.

Supporting information

S1 Table. Summary of 151 clinical Streptococcus agalactiae isolate characteristics including study and PubMLST identification numbers. (DOCX)

S2 Table. Quality statistics for the sequencing and assembly of the 171 Streptococcusaga lactiae isolates. (DOCX)

S3 Table. Isolates included in the global phylogenetic comparison, including characteristics and reference to publication, accession number or PubMLST contributor [25,27–29]. (DOCX)


Authors acknowledge and thank all study participants for their involvement. We also acknowl- edge midwives Narisha Pendal (Predict1000 study coordinator), Diane Loh and Jan MacFar- lane. We acknowledge Dr Anthony Keil from PathWest Laboratories (WA) and Professor Lyn Gilbert and Dr Fanrong Kong from Westmead Hospital (NSW) for providing neonatal and reference isolates, respectively. Additionally, we would like to acknowledge the assistance pro- vided by Dr Shakeel Mowlaboccus and Barakat Al Suwayyid in the sequencing and bioinfor- matics training and Professor Geoffrey Coombs, Dr Stanley Pang and Lina Lee for their assistance with sequencing involved in this project, conducted at the Antimicrobial Resistance of Infectious Diseases (AMRID) Laboratory at Murdoch University.

Author Contributions

Conceptualization:Lucy L. Furfaro, Barbara J. Chang, Charlene M. Kahler, Matthew S. Payne.

Data curation: Lucy L. Furfaro.

Formal analysis: Lucy L. Furfaro.

Funding acquisition: Lucy L. Furfaro, Barbara J. Chang, Charlene M. Kahler, Matthew S. Payne.

Investigation: Lucy L. Furfaro.

Methodology: Lucy L. Furfaro, Barbara J. Chang, Charlene M. Kahler, Matthew S. Payne.

Project administration: Matthew S. Payne.

Resources: Charlene M. Kahler, Matthew S. Payne.

Supervision: Barbara J. Chang, Charlene M. Kahler, Matthew S. Payne.

Visualization: Lucy L. Furfaro.

Writing – original draft: Lucy L. Furfaro.

Writing – review & editing: Barbara J. Chang, Charlene M. Kahler, Matthew S. Payne.


  1. Verani JR, McGee L, Schrag SJ (2010) Prevention of perinanatal group B streptococcal disease: revised guidelines from CDC, 2010. Recommendations and Reports 59: 1–32.
  2. Russell NJ, Seale AC, O’Driscoll M, O’Sullivan C, Bianchi-Jassir F, Gonzalez-Guarin J, et al. (2017) Maternal colonization with group B Streptococcus and serotype distribution worldwide: Systematic review and meta-analyses. Clinical Infectious Diseases 65: S100–S111. cix658 PMID: 29117327
  3. Dutra VG, Alves VM, Olendzki AN, Dias CA, de Bastos AF, Santos GO, et al. (2014) Streptococcus agalactiaein Brazil: serotype distribution, virulence determinants and antimicrobial susceptibility. BMC Infect Dis 14: 323. PMID: 24919844
  4. Furfaro LL, Nathan EA, Chang BJ, Payne MS (2019) Group B streptococcus prevalence, serotype distribution and colonization dynamics in Western Australian pregnant women. Journal of Medical Microbiology.
  5. Bohnsack JF, Whiting A, Gottschalk M, Dunn DM, Weiss R, Azimi PH, et al. (2008) Population structure of invasive and colonizing strains of Streptococcusagalactiaefrom neonates of six U.S. Academic Centers from 1995 to 1999. J Clin Microbiol 46: 1285–1291. PMID: 18287314
  6. Bellais S, Six A, Fouet A, Longo M, Dmytruk N, Glaser P, et al. (2012) Capsular switching in group B Streptococcus CC17 hypervirulent clone: a future challenge for polysaccharide vaccine development. J Infect Dis 206: 1745–1752. PMID: 23002446
  7. Martins ER, Melo-Cristino J, Ramirez M (2010) Evidence for rare capsular switching in Streptococcus agalactiae. J Bacteriol 192: 1361–1369. PMID: 20023016
  8. Meehan M, Cunney R, Cafferkey M (2014) Molecular epidemiology of group B streptococci in Ireland reveals a diverse population with evidence of capsular switching. Eur J Clin Microbiol Infect Dis 33: 1155–1162. PMID: 24469423
  9. Neemuchwala A, Teatero S, Athey TB, McGeer A, Fittipaldi N (2016) Capsular switching and other large-scale recombination events in invasive sequence type 1 group B Streptococcus. Emerg Infect Dis 22: 1941–1944. PMID: 27767925
  10. Kong F, Lambertsen LM, Slotved HC, Ko D, Wang H, Gilbert GL (2008) Use of phenotypic and molecu- lar serotype identification methods to characterize previously nonserotypeable group B streptococci. J Clin Microbiol 46: 2745–2750. PMID: 18562579
  11. Yao K, Poulsen K, Maione D, Rinaudo CD, Baldassarri L, Telford JL, et al. (2013) Capsular gene typing of Streptococcusagalactiaecompared to serotyping by latex agglutination. J Clin Microbiol 51: 503– 507. PMID: 23196363
  12. Furfaro LL, Chang BJ, Payne MS (2018) Perinatal Streptococcusagalactiaeepidemiology and surveil- lance targets. Clinical Microbiology Reviews 31.
  13. Lachenauer CS, Kasper DL, Shimada J, Ichiman Y, Ohtsuka H, Kaku M, et al. (1999) Serotypes VI and VIII predominate among group B streptococci isolated from pregnant Japanese women. J Infect Dis 179.
  14. Jones N, Bohnsack JF, Takahashi S, Oliver KA, Chan MS, Kunst F, et al. (2003) Multilocus sequence typing system for group B streptococcus. J Clin Microbiol 41: 2530–2536. 41.6.2530-2536.2003 PMID: 12791877
  15. Da Cunha V, Davies MR, Douarre PE, Rosinski-Chupin I, Margarit I, Spinali S, et al. (2014) Streptococ- cusagalactiaeclones infecting humans were selected and fixed through the extensive use of tetracy- cline. Nat Commun 5: 4544. PMID: 25088811
  16. Furfaro LL, Chang BJ, Payne MS (2017) A novel one-step real-time multiplex PCR assay to detect Streptococcusagalactiaepresence and serotypes Ia, Ib, and III. Diagnostic Microbiology and Infectious Disease 89: 7–12. PMID: 28669679
  17. Inouye M, Dashnow H, Raven LA, Schultz MB, Pope BJ, Tomita T, et al. (2014) SRST2: Rapid genomic surveillance for public health and hospital microbiology labs. Genome Med 6: 90. 1186/s13073-014-0090-6 PMID: 25422674
  18. Jolley KA, Maiden MC (2010) BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics 11: 595. PMID: 21143983
  19. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19.
  20. Gurevich A, Saveliev V, Vyahhi N, Tesler G (2013) QUAST: quality assessment tool for genome assem- blies. Bioinformatics 29: 1072–1075. PMID: 23422339
  21. Kapatai G, Patel D, Efstratiou A, Chalker VJ (2017) Comparison of molecular serotyping approaches of Streptococcusagalactiaefrom genomic sequences. BMC Genomics 18: 429. s12864-017-3820-5 PMID: 28571573
  22. Creti R, Fabretti F, Orefici G, von Hunolstein C (2004) Multiplex PCR assay for direct identification of group B streptococcal alpha-protein-like protein genes. J Clin Microbiol 42: 1326–1329. 10.1128/JCM.42.3.1326-1329.2004 PMID: 15004110
  23. Corander J, Marttinen P, Sire´n J, Tang J (2008) Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinformatics 9: 539. 2105-9-539 PMID: 19087322
  24. Cheng L, Connor TR, Siren J, Aanensen DM, Corander J (2013) Hierarchical and spatially explicit clus- tering of DNA sequences with BAPS software. Mol Biol Evol 30: 1224–1228. molbev/mst028 PMID: 23408797
  25. Seale AC, Koech AC, Sheppard AE, Barsosio HC, Langat J, Anyango E, et al. (2016) Maternal coloni- zation with Streptococcusagalactiaeand associated stillbirth and neonatal disease in coastal Kenya. Nat Microbiol 1: 16067. PMID: 27572968
  26. Flores AR, Galloway-Pena J, Sahasrabhojane P, Saldana M, Yao H, Su X, et al. (2015) Sequence type 1 group B Streptococcus, an emerging cause of invasive disease in adults, evolves by small genetic changes. Proc Natl Acad Sci U S A 112: 6431–6436. PMID: 25941374
  27. Teatero S, McGeer A, Low DE, Li A, Demczuk W, Martin I, et al. (2014) Characterization of invasive group B streptococcus strains from the greater Toronto area, Canada. J Clin Microbiol 52: 1441–1447. PMID: 24554752
  28. Mehershahi KS, Hsu LY, Koh TH, Chen SL (2015) Complete genome sequence of Streptococcus aga- lactiae serotype III, multilocus sequence type 283 strain SG-M1. Genome Announc 3.
  29. Singh P, Springman AC, Davies HD, Manning SD (2012) Whole-genome shotgun sequencing of a colo- nizing multilocus sequence type 17 Streptococcusagalactiaestrain. J Bacteriol 194: 6005. https://doi. org/10.1128/JB.01378-12 PMID: 23045509
  30. Kumar S, Stecher G, Tamura K (2016) MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular Biology and Evolution 33: 1870–1874. msw054 PMID: 27004904
  31. Letunic I, Bork P (2016) Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res 44: W242–245. PMID: 27095192
  32. Lin SM, Zhi Y, Ahn KB, Lim S, Seo HS (2018) Status of group B streptococcal vaccine development. Clin Exp Vaccine Res 7: 76–81. PMID: 29399583
  33. Heath PT (2016) Status of vaccine research and development of vaccines for GBS. Vaccine 34: 2876– 2879. PMID: 26988258
  34. Le Doare K, Heath PT (2013) An overview of global GBS epidemiology. Vaccine 31 Suppl 4: D7–12.
  35. Madrid L, Seale AC, Kohli-Lynch M, Edmond KM, Lawn JE, Heath PT, et al. (2017) Infant group B Streptococcal disease incidence and serotypes worldwide: Systematic review and meta-analyses. Clini- cal Infectious Diseases 65: S160–S172. PMID: 29117326
  36. Eskandarian N, Ismail Z, Neela V, van Belkum A, Desa MN, Amin Nordin S (2015) Antimicrobial sus- ceptibility profiles, serotype distribution and virulence determinants among invasive, non-invasive and colonizing Streptococcusagalactiae(group B streptococcus) from Malaysian patients. Eur J Clin Micro- biol Infect Dis 34: 579–584. PMID: 25359580
  37. Suhaimi MES, Desa MNM, Eskandarian N, Pillay SG, Ismail Z, Neela VK, et al. (2017) Characterization of a Group B Streptococcus infection based on the demographics, serotypes, antimicrobial susceptibil- ity and genotypes of selected isolates from sterile and non-sterile isolation sites in three major hospitals in Malaysia. J Infect Public Health 10: 14–21. PMID: 27095302
  38. Taylor K (2006) A study of group B streptococcus in Brisbane: the epidemiology, detection by PCR assay and serovar prevalence. Masters Thesis.
  39. Zhao Z, Kong F, Zeng X, Gidding HF, Morgan J, Gilbert GL (2008) Distribution of genotypes and antibi- otic resistance genes among invasive Streptococcusagalactiae(group B streptococcus) isolates from Australasian patients belonging to different age groups. Clin Microbiol Infect 14: 260–267. https://doi. org/10.1111/j.1469-0691.2007.01914.x PMID: 18190584
  40. Zeng X, Kong F, Wang H, Darbar A, Gilbert GL (2006) Simultaneous detection of nine antibiotic resis- tance-related genes in Streptococcusagalactiaeusing multiplex PCR and reverse line blot hybridization assay. Antimicrob Agents Chemother 50: 204–209. PMID: 16377687
  41. Slotved HC, Kong F, Lambertsen L, Sauer S, Gilbert GL (2007) Serotype IX, a proposed new Strepto- coccusagalactiaeserotype. J Clin Microbiol 45: 2929–2936. PMID: 17634306
  42. Ko DW, Zurynski Y, Gilbert GL (2015) Group B streptococcal disease and genotypes in Australian infants. J Paediatr Child Health.
  43. Gudjonsdottir MJ, Hentz E, Berg S, Backhaus E, Elfvin A, Kawash S, et al. (2015) Serotypes of group B streptococci in western Sweden and comparison with serotypes in two previous studies starting from 1988. BMC Infect Dis 15: 507. PMID: 26553333
  44. Gherardi G, Imperi M, Baldassarri L, Pataracchia M, Alfarone G, Recchia S, et al. (2007) Molecular epi- demiology and distribution of serotypes, surface proteins, and antibiotic resistance among Group B Streptococci in Italy. Journal of Clinical Microbiology 45: 2909–2916. 00999-07 PMID: 17634303
  45. Hery-Arnaud G, Bruant G, Lanotte P, Brun S, Picard B, Rosenau A, et al. (2007) Mobile genetic ele- ments provide evidence for a bovine origin of clonal complex 17 of Streptococcusagalactiae. Appl Envi- ron Microbiol 73: 4668–4672. PMID: 17526784
  46. Lartigue MF, Hery-Arnaud G, Haguenoer E, Domelier AS, Schmit PO, van der Mee-Marquet N, et al. (2009) Identification of Streptococcusagalactiaeisolates from various phylogenetic lineages by matrix- assisted laser desorption ionization-time of flight mass spectrometry. J Clin Microbiol 47: 2284–2287. PMID: 19403759
  47. Luan SL, Granlund M, Sellin M, Lagergard T, Spratt BG, Norgren M (2005) Multilocus sequence typing of Swedish invasive group B streptococcus isolates indicates a neonatally associated genetic lineage and capsule switching. J Clin Microbiol 43: 3727–3733. 2005 PMID: 16081902
  48. van der Mee-Marquet N, Fourny L, Arnault L, Domelier AS, Salloum M, Lartigue MF, et al. (2008) Molec- ular characterization of human-colonizing Streptococcusagalactiaestrains isolated from throat, skin, anal margin, and genital body sites. J Clin Microbiol 46: 2906–2911. 00421-08 PMID: 18632904
  49. Diedrick MJ, Flores AE, Hillier SL, Creti R, Ferrieri P (2010) Clonal analysis of colonizing group B Strep- tococcus, serotype IV, an emerging pathogen in the United States. J Clin Microbiol 48: 3100–3104. PMID: 20610684
  50. Campisi E, Rinaudo CD, Donati C, Barucco M, Torricelli G, Edwards MS, et al. (2016) Serotype IV StreptococcusagalactiaeST-452 has arisen from large genomic recombination events between CC23 and the hypervirulent CC17 lineages. Sci Rep 6: 29799. PMID: 27411639
  51. Lyhs U, Kulkas L, Katholm J, Waller KP, Saha K, Tomusk RJ, et al. (2016) Streptococcusagalactiae serotype IV in humans and cattle, Northern Europe. Emerg Infect Dis 22: 2097–2103. 10.3201/eid2212.151447 PMID: 27869599
  52. Florindo C, Damiao V, Silvestre I, Farinha C, Rodrigues F, Nogueira F, et al. (2014) Epidemiological sur- veillance of colonising group B Streptococcus epidemiology in the Lisbon and Tagus Valley regions, Por- tugal (2005 to 2012): emergence of a new epidemic type IV/clonal complex 17 clone. Euro Surveill 19.
  53. Ferrieri P, Lynfield R, Creti R, Flores AE (2013) Serotype IV and invasive group B Streptococcus dis- ease in neonates, Minnesota, USA, 2000–2010. Emerg Infect Dis 19: 551–558. 3201/eid1904.121572 PMID: 23628320
  54. Teatero S, McGeer A, Li A, Gomes J, Seah C, Demczuk W, et al. (2015) Population structure and anti- microbial resistance of invasive serotype IV group B Streptococcus, Toronto, Ontario, Canada. Emerg Infect Dis 21: 585–591. PMID: 25811284
  55. Dangor Z, Kwatra G, Izu A, Adrian P, Cutland CL, Velaphi S, et al. (2015) Association between maternal Group B Streptococcus surface-protein antibody concentrations and invasive disease in their infants. Expert Rev Vaccines 14: 1651–1660. PMID: 26364978
  56. Larsson C, Lindroth M, Nordin P, Stalhammar-Carlemalm M, Lindahl G, Krantz I (2006) Association between low concentrations of antibodies to protein alpha and Rib and invasive neonatal group B strep- tococcal infection. Arch Dis Child Fetal Neonatal Ed 91: F403–408. 090472 PMID: 17056838
  57. Kong F, Gowan S, Martin D, James G, Gilbert GL (2002) Molecular profiles of group B streptococcal surface protein antigen genes: relationship to molecular serotypes. J Clin Microbiol 40: 620–626. PMID: 11825981
  58. Rodriguez-Granger J, Spellerberg B, Asam D, Rosa-Fraile M (2015) Non-haemolytic & non-pigmented group B streptococcus, an infrequent cause of early onset neonatal sepsis. Pathogens and Disease.
  59. Six A, Firon A, Plainvert C, Caplain C, Bouaboud A, Touak G, et al. (2016) Molecular characterization of nonhemolytic and nonpigmented Group B Streptococci responsible for human invasive infections. J Clin Microbiol 54: 75–82. PMID: 26491182
  60. Gendrin C, Vornhagen J, Armistead B, Singh P, Whidbey C, Merillat S, et al. (2017) A non-hemolytic Group B Streptococcus strain exhibits hypervirulence. The Journal of Infectious Diseases: jix646- jix646.
  61. Herbert MA, Beveridge CJ, Saunders NJ (2004) Bacterial virulence factors in neonatal sepsis: group B streptococcus. Curr Opin Infect Dis 17: 225–229. PMID: 15166825
  62. Liu GY, Nizet V (2004) Extracellular virulence factors of group B Streptococci. Front Biosci 9: 1794– 1802. PMID: 14977587
  63. Rajagopal L (2009) Understanding the regulation of Group B Streptococcal virulence factors. Future Microbiol 4: 201–221. PMID: 19257847
  64. Milligan TW, Baker CJ, Straus DC, Mattingly SJ (1978) Association of elevated levels of extracellular neuraminidase with clinical isolates of type III group B streptococci. Infect Immun 21: 738–746. PMID: 361576
  65. Granlund M, Oberg L, Sellin M, Norgren M (1998) Identification of a novel insertion element, IS1548, in group B streptococci, predominantly in strains causing endocarditis. J Infect Dis 177: 967–976. https:// PMID: 9534970
  66. Sheen TR, Jimenez A, Wang NY, Banerjee A, van Sorge NM, Doran KS (2011) Serine-rich repeat pro- teins and pili promote Streptococcusagalactiaecolonization of the vaginal tract. J Bacteriol 193: 6834– 6842. PMID: 21984789
  67. Rinaudo CD, Rosini R, Galeotti CL, Berti F, Necchi F, Reguzzi V, et al. (2010) Specific involvement of pilus type 2a in biofilm formation in group B Streptococcus. PLoS One 5: e9216. 1371/journal.pone.0009216 PMID: 20169161
  68. Lazzarin M, Mu R, Fabbrini M, Ghezzo C, Rinaudo CD, Doran KS, et al. (2017) Contribution of pilus type 2b to invasive disease caused by a StreptococcusagalactiaeST-17 strain. BMC Microbiology 17: 148. PMID: 28673237
  69. Springman AC, Lacher DW, Waymire EA, Wengert SL, Singh P, Zadoks RN, et al. (2014) Pilus distribu- tion among lineages of group b streptococcus: an evolutionary and clinical perspective. BMC Microbiol 14.