Organization, evolution and function of fengycin biosynthesis gene clusters in the Bacillus amyloliquefaciens group

The Bacillus velezensis strain PG12, belonging to the Bacillus amyloliquefaciens group, is an endophytic bacterium known for its antimicrobial activities against crop pathogens. However, our knowledge of the molecular basis underlying its biocontrol activity and the relatedness of different strains in the Bacillus amyloliquefaciens group is limited. Here, we sequenced and analyzed the genome of PG12 to test its taxonomic affiliation and identified genes involved in the biocontrol activity. The phylogenomic analysis results indicate that PG12 belongs to B. velezensis, a subgroup of the B. amyloliquefaciens group. By comparing the genomes of 22 strains in this group, we confirmed that it comprises three different phylogenetic lineages: B. amyloliquefaciens, B. velezensis and B. siamensis. Three secondary metabolism gene clusters related to the production of lipopeptides, namely fengycin, iturin and surfactin, were identified in the genomes of the B. amyloliquefaciens group. The core genome of B. velezensis is enriched in secondary metabolism genes compared with B. siamensis and B. amyloliquefaciens. Three of the five genes pertaining to the gene cluster responsible for fengycin biosynthesis (fenBCD) were found in B. velezensis and B. siamensis, but not in B. amyloliquefaciens. Phenotypic analysis showed that the ∆fenA mutant of PG12 displayed significantly decreased biofilm formation and swarming motility, which indicates that fengycin contributes to the colonization and pathogen control abilities of PG12. Our results also suggest that B. siamensis and B. velezensis have acquired the fenBCD genes from Paenibacillus spp. by horizontal gene transfer (HGT). Taken together, the results provide insights into the evolutionary pattern of the B. amyloliquefaciens group strains and will promote further researches on their taxonomy and functional genomics.


Background
Pathogenic microorganisms affecting plant health are a major and chronic threat to food production worldwide (Compant et al. 2005). The annual yield loss of crop plants due to microbial diseases is approximately 25%. To reduce crop loss, farmers rely heavily on agrochemicals (Schäfer and Adams 2015;Wu et al. 2015). However, overuse of chemicals to enhance crop yield and control plant diseases has resulted in an irreversible loss of soil quality along with serious health and environmental problems (Prashar et al. 2013). As a result, many countries are now restricting the use of a wide range of fungicides and pesticides (Rahman 2013). In this changing context, disease control in plants by beneficial bacteria is steadily increasing as an alternative to chemical pesticides (Wu et al. 2015). Several bacterial species, such as Pseudomans spp., Bacilllus spp. and Streptomyces spp. have been commercialized as biological control agents (Paterson et al. 2017). Among them, Bacillus spp. have become increasingly important in agriculture and agro-food industry where spore-forming strains are favored due to their long-term viability (Qiao et al. 2014).
The genus Bacillus represents a large group of Grampositive bacteria belonging to the Firmicutes phylum. They are capable of forming stable dormant structures called endospores in nutrient-poor and stressful environmental conditions (Hamdache et al. 2013). The bacteria from this group can inhabit a large variety of ecological niches, including soil, water, plant surfaces and rhizosphere (Siefert et al. 2000;Feng et al. 2014;Kim et al. 2015;Gao et al. 2019). B. cereus, B. subtilis and B. amyloliquefaciens are three Bacillus species known for supporting plant growth and protecting plants from diseases (Chen et al. 2016;Fan et al. 2017b;Zeng et al. 2018). The biocontrol activity mediated by Bacillus can be divided into direct and indirect forms. Effective biological control of pathogens by Bacillus is achieved by a combination of mechanisms, including production of antibiotic, induction of host resistance and promotion of plant growth (Paterson et al. 2017;Jayapala et al. 2019). Among them, the production of antibiotics is one of the primary mechanisms for achieving biocontrol effect (Li et al. 2021). The antibiotic activity of Bacillus species is partly driven by the secretion of active lipopeptides. These cyclic lipopeptides belong to three main families: surfactin (surfactin, lychenisin, pumilacidin and halobacilin), iturin (iturin, bacillomycin and mycosubtilin) and fengycin (fengycin, plipastatin and maltacin) (Zhao et al. 2017;Fira et al. 2018). They are produced by nonribosomal peptide synthetase and exhibit broad-spectrum antimicrobial activity (Cochrane and Vederas 2016). Different lipopeptides can act synergistically to inhibit the growth of phytopathogens (Koumoutsi et al. 2014). In addition, studies have revealed that lipopeptides play an important role in maintaining colonization behaviors such as biofilm formation and swarming motility (Luo et al. 2014;Fan et al. 2017b).
According to phylogenomic analyses, the B. amyloliquefaciens group comprises three species: B. siamensis, mainly occurring in Asian food, some of which are capable of controlling phytopathogens (Fan et al. 2017a; B. velezenisis, the main source of bioformulations used in organic agriculture; B. amyloliquefaciens, known for its ability to produce industrial enzymes (Wu et al. 2015;Fan et al. 2017a). Currently, many basic characteristics of these species remain unclear. With the advent of comparative genomics and the availability of an increasing number of whole genome sequences, it is possible to understand the B. amyloliquefacies group strains at genome level. In our previous study, we isolated a B. velezensis strain PG12 from apple fruit, which exhibits distinctive inhibition to apple ring rot pathogen and broad-spectrum antagonistic activities against other pathogens. PG12 demonstrates an outstanding biocontrol performance, especially against Botryosphaeria dothidea. It produces several antagonistic compounds, including iturin and fengycin (Chen et al. 2016). Recently, the released data on genome sequences of numerous Bacillus strains have to some extent revealed the underlying molecular basis of their biocontrol performance (Kim et al. 2015;Li et al. 2021).
Based on the above background, the aims of the present study were to investigate: (1) the genomic features of PG12 and its phylogenetic relationship; (2) the significant genomic differences between the B. amyloliquefaciens group strains; (3) the difference in the specific genes involved in biocontrol activity between the B. amyloliquefaciens group strains. Consequently, we sequenced and analyzed the genome of PG12, and meanwhile, we collected previously published genomes of the B. amyloliquefaciens group strains to perform a comparative genomic analysis. The results will expand our knowledge about the B. amyloliquefaciens group strains, with emphasis on their taxonomical status. Understanding the genomic features enables us to gain novel insights into the ecology and evolution of the B. amyloliquefaciens group.

Genome features of B. velezensis PG12
To identify genetic factors involved in the biocontrol activity of B. velezensis PG12, the genome of this strain was sequenced and analyzed. The genome assembly is 3,990,845 bp with 22 scaffolds (N 50 = 2,085,242 bp, G + C content of 46.45%). The number of predicted proteincoding genes is 3,884, including 3,001 genes with a putative function and 883 hypothetical protein-encoding genes (Additional file 1: Figure S1). These protein-coding genes have an average length of 917 bp and account for 89.25% of the PG12 genome. Furthermore, a total of 56 tRNA-coding genes and 8 rRNA genes were predicted in the genome sequence (Table 1).

Phylogenomic analysis of B. velezensis PG12
Preliminary molecular identification based on 16S rDNA and gyrA sequences assigned PG12 strains to B. amyloliquefaciens (Chen et al. 2016). Here, we performed a phylogenomic reconstruction based on 624 singlecopy conserved genes using the maximum-likelihood (ML) method. PG12 was clearly clustered within the B. velezensis clade, with B. velezensis CAU B946 as its closest relative (Fig. 1). They have an average nucleotide identity (ANI) of 99.96% (Additional file 2: Table S1).
Genome comparison of several B. velezensis strains, including PG12, revealed a high sequence conservation throughout the genome of these strains (Additional file 1: Figure S2). This was confirmed by the pan-genome analysis carried out among four B. velezensis strains. Genomewide comparisons of orthologous clusters between PG12 and closely related B. velezensis strains indicated that these B. velezensis strains contained 3,226 genes in their core genome, covering from 83.1% to 88.7% of their genomes (Additional file 1: Figure S3).

Distinct phylogenetic and evolutionary history of the B. amyloliquefaciens group strains
A phylogenetic tree of the B. amyloliquefaciens group strains based on 860 conserved single-copy genes was constructed, and the result showed that this group strains were clearly separated into three different phylogenetic lineages, i.e. B. amyloliquefaciens, B. velezensis and B. siamensis (Additional file 1: Figure S4), consistent with previous reports (Fan et al. 2017a;Chun et al. 2019). Our ANI analysis further supports the distribution of B. amyloliquefaciens group strains in three clusters (Fig. 2).

Genes involved in secondary metabolism are enriched in B. velezensis strains
The pan-genome analysis of 22 B. amyloliquefaciens group strains was conducted and yielded a pan-genome with 7,753 genes. The core, accessory and unique genomes comprise 2,806, 2,287 and 2,660 genes, respectively. The strain-specific genes for these selected B. amyloliquefaciens group strains ranged from 37 to 324 (Fig. 3). Meanwhile, the sizes of pan-genome and coregenome of B. velezensis, B. siamensis and B. amyloliquefaciens were estimated using the above genome data. B. velezensis strains have a core genome of 3,138 orthologous genes (76.8%-86.3% of the protein repertoire); while B. amyloliquefaciens and B. siamensis strains have a core genome of 3,116 (73.9%-81.6% of the protein repertoire) and 3,144 orthologous genes (73.1%-85.8% of the protein repertoire), respectively. The pan-genome sizes of B. velezensis, B. siamensis and B. amyloliquefaciens appeared to reach infinity when the number of added genomes increased infinitely, suggesting that these species have open pan-genomes (Additional file 1: Figure  S5). Furthermore, we compared the core genome among the B. amyloliquefaciens group strains by using the Cluster of Orthologous Group of proteins (COG) assignments to determine whether there are differences in the proportion of the core genome attributable to particular cellular processes. The core genomes of B. velezensis strains were found to be highly enriched in secondary metabolism genes compared with those of B. amyloliquefaciens and B. siamensis strains (Table 2) (Fisher's exact test; P < 0.05).

The distribution of lipopeptide gene clusters in the B. amyloliquefaciens group strains
Three gene clusters coding for the cyclic lipopeptides: surfactin, bacillomycin and fengycin were identified in the genomes of the B. amyloliquefaciens group strains (Fig. 4). The most obvious difference between the aforementioned three subgroup strains was found in the gene cluster coding for fengycin. The fengycin gene cluster (fenA, fenB, fenC, fenD and fenE) is conserved in all the B. velezensis and B. siamensis strains that we tested. In contrast, only two genes (fenA and fenE) were found in B. amyloliquefaciens strains ( Fig. 4 and Additional file 1: Figure S6). By contrast, the surfactin and iturin gene clusters are highly conserved in the B. amyloliquefaciens group, and only slight differences were noticed between different subgroup strains (Fig. 4). In all the examined B. velezensis and B. siamensis strains, we observed striking difference (P < 0.01) in the G + C content between the average genome (46.36%) and fenBCD gene sequences (49.77%, 51.70% and 50.82% for fenB, fenC and fenD, respectively) (Additional file 1: Figure S7). The observed bias in G + C content of fenBCD suggests a potential HGT might occur in these genes (Xie et al. 2014). To gain insights into the origin of fenBCD genes in B. velezensis strains, we constructed phylogenetic trees based on the protein sequence of fenB, fenC or fenD (Additional file 1: Figure S8-S10). The fenB, fenC and fenD sequences of B. velezensis clustered with their orthologs from Paenibacillus spp., suggesting a transfer of the fengycin gene cluster from Paenibacillus spp. to B. velezensis. To investigate if the biosynthesis of fengycin also affects these features, we produced a ∆fenA mutant from the parental strain PG12. Compared with the wild-type PG12 strain, the ∆fenA mutant exhibited a substantial reduction in its swarming motility (Fig. 5a, d). PG12 formed colonies with dense wrinkles and compact structure on MSgg plate, and a thick and wrinkled floating pellicle in MSgg broth. In contrast, the ∆fenA mutant formed a flat colony on MSgg plate and a thin and less wrinkled floating layer in MSgg broth (Fig. 5b, c, e). Thus, deletion of the fenA gene significantly influenced the biofilm formation and motility of PG12. In addition, the ∆fenA mutant showed a decreased ability to inhibit the growth of plant pathogens, such as Colletotrichum sp. (strawberry) and Fusarium monififorme ( Fig. 5f and Additional file 1: Figure S11).

Discussion
In our previous study, B. velezensis PG12 was isolated from apple fruit and showed high antifungal activity, which indicates that this bacterial strain has the potential to be used to control fungal pathogens infecting crop plants (Chen et al. 2016). In the present study, we displayed that both B. velezensis strains including PG12 and B. siamensis strains contain three secondary metabolism gene clusters related to the production of lipopeptides, such as surfactin, iturin and fengycin. Identification of this large spectrum of antibiotic-biosynthetic genes in the genome of PG12 to some extent explains its high competitive ability against other microorganisms. However, it is still not known whether this biocontrol capacity is expressed in the field under different environmental conditions, as biocontrol effectiveness can be critically influenced by the resident microbial communities (Goudjal et al. 2014). Therefore, further field tests on the biocontrol effect of PG12 need to be performed in the future.
The species in the genus of Bacillus are both taxonomically and metabolically diverse, and the taxonomic status of some Bacillus strains constantly brings confusion to researchers (Hamdache et al. 2013). B. subtilis, B. amyloliquefaciens and B. pumilus, used in the agro-food industry, belong to a group of phylogenetically homogeneous species. It is therefore difficult to distinguish these species according to microbiological phenotypic characteristics. Moreover, phylogenetic analysis based on the 16S rRNA gene also fails to differentiate these species due to the highly conserved nature of the gene (Rooney et al. 2009). The B. cereus group comprises eleven closely related species, but the phylogenetic and taxonomic relationships within this group are still debated (Liu et al. 2015). With the development of sequencing technology, more and more genomic data of Bacillus strains are released. Construction of phylogenetic trees based on genome sequence will help us to correct errors and improve the taxonomy quality of Bacillus strains. For instance, the biocontrol strains that were ever classified as B. subtilis based on 16S rRNA and metabolic profiles were reclassified as B. amyloliquefaciens based on their genome sequence (Magno-Perez-Bryan et al. 2015). On this count, we used 860 single-copy core genes to obtain a robust phylogenetic analysis of the B. amyloliquefaciens group strains in this study. The result showed that the tree can be divided into three clusters (Additional file 1: Figure S4). A heat-map analysis based on ANI values also support to divide this group into three subgroups (Fig. 2). With genome sequence data, we can realize accurate identification and taxonomy of Bacillus isolates, and further reveal their distinct features at the molecular level.
The B. amyloliquefaciens group harbors a wide range of ecotypes, including soil borne B. amyloliquefaciens and plant-associated B. siamensis and B.veleznesis (Reva et al. 2004;Fan et al. 2017a). The most important biocontrol and plant-growth-promoting strains were found in B. velezensis, and have been successfully used in agriculture (Borriss 2011). There are significant differences in genome sequences between plant-associated and non-plant-associated strains. Rueckert et al. (2011) performed a genome comparison between FZB42 (a type strain of plant-associated B. velezensis) and DSM7 (a type strain of non-plant-associated B. amyloliquefaciens) and found significant differences in their genomic sequences (Rueckert et al. 2011). COG function classification and manual curation of the core genomes of the B. amyloliquefaciens group strains showed that the genes involved in the biosynthesis of secondary metabolism were more abundant in B. velezensis (Table 2). Fan et al. (2017a) also found that the gene cluster involved in macrolactin synthesis is present in B.velezensis, but not in B.siamensis nor in B. amyloliquefaciens. In addition, the number of metabolites produced by B. velezensis is higher than those by B. siamensis and B. amyloliquefaciens. As a consequence, the strains harboring more antibiotic-synthesizing genes have an increased ability to limit the colonization of crop plants by fungal pathogens through the biosynthesis of antibiotics. Horizontal gene transfer is recognized as an important factor in driving evolution and differentiation of different organisms. Three of the five genes pertaining to fengycin biosynthesis gene cluster (fenBCD) were not found in B. amyloliquefaciens, whereas were identified in B. siamensis and B. velezensis, with the possibility to acquire them from Paenibacillus spp. by HGT. The ∆fenA mutant of PG12 displayed a significantly decreased antifungal activity, swarming motility and biofilm formation (Fig. 5 and Additional file 1: Figure S11), confirming the key role of this type of lipopeptide in maintaining bacterial antagonistic ability. It also demonstrated that fengycin as an antagonistic compound plays an important role in biocontrol activities (Liu et al. 2021;Xiao et al. 2021). Additionally, fengycin was reported as an elicitor produced by B. velezensis SQR9 and B. subtilis GLB191, and it is able to induce plant immunity by triggering induced systemic resistance in Arabidopsis and grape plants in response to pathogen infection (Wu et al. 2018;Li et al. 2019). Therefore, fengycin-producing strain PG12 is a very promising candidate for the biological control of fungal diseases  Data are an average of three replicates ± the standard deviation of three independent experiments. Bars represent standard deviations. ** and * represent P < 0.01 and P < 0.05 (t-test), respectively in crop plants. Further studies will also need to characterize the role of the fenBCD genes and the regulation mechanism of fengycin biosynthesis gene clusters. Altogether, our results promote comparative genomic studies of Bacillus strains, and meanwhile allow to gain insights into the underlying mechanistic basis of their biocontrol activity.

Conclusion
The biocontrol strain PG12 was identified as B. velezensis based on its genomic sequence. Three lipopeptides gene clusters were identified in the genome of PG12. Furthermore, fengycin was found to contribute to the biocontrol activity of PG12 through maintaining its colonization properties and meanwhile inhibiting pathogen growth. Importantly, genes related to secondary metabolism are more abundant in the core genome of B. velezensis. Moreover, B. siamensis and B. velezensis probably acquired the fengycin biosynthesis cluster gene through HGT events during their evolutionary process. Our study offers novel insights into the evolution of the B. amyloliquefaciens group strains and also theoretical basis for further application of beneficial strains of this group in agriculture.

Strains used in this study
B. velezensis PG12 was isolated from apple fruit (Chen et al. 2016). Genome data of other Bacillus strains used in this study were downloaded from the National Center for Biotechnology Information (NCBI) database. Fusarium sp., Colletotrichum sp., Botryosphaeria berengeriana and Fusarium moniliforme were provided by the Department of Plant Pathology of the China Agricultural University. All these fungal strains were stored in paraffin at 4℃ before use.

Genome sequencing, assembly, and annotation
The draft sequence of the B. velezensis PG12 strain was produced using Illumina Hiseq 2500 paired-end (350 bp) sequencing platform by Berry Genomics (Berry Genomics, Beijing). The assembly was performed using SOAPdenovo (v.2.04), resulting in 22 scaffolds for PG12 (Luo et al. 2012;Zeng et al. 2019). Predictions of proteincoding genes were implemented in Prokka (v. 1.11) (Seemann 2014). Functional annotation was carried out using the Basic Local Alignment Search Tool (BLAST) against the Cluster of Orthologous Groups of proteins (COG), NCBI nr protein database, Kyoto Encyclopedia of Genes and Genomes (KEGG) database and InterPro database (Zeng et al., 2018). Ordering of contigs of PG12 was achieved using the program Mauve (Rissman et al. 2009). The genome sequence of B. velezensis CAU B946 was used as a reference sequence. The final annotated chromosome was plotted using CIRCOS software (Krzywinski et al. 2009). The secondary metabolite gene clusters were examined using the antiSMASH v4.0.0rc1 program (Blin et al. 2017).

Phylogenomic analyses
A ML phylogenetic tree of Bacillus species was constructed based on 624 or 860 single-copy core proteins shared by Bacillus and Paenibacillus polymyxa M1 according to the following methods: (1) multiple alignment of amino acid sequences by mafft (v. 7.310) (Katoh and Standley 2013); (2) conserved blocks from multiple alignment using Gblocks (Castresana 2000); (3) construction of ML tree using RAxML (v. 8.2.10) software and PROTGAMMALGX model with 100 bootstrap replicates (Zeng et al. 2018). The tree was displayed using molecular evolutionary genetic analysis (MEGA). Then, ANI value was calculated using Jspecies software with MUMmer (NUCmer) alignment (Richter and Rossello-Mora 2009). Single gene alignments were performed with MEGA 5. The neighbor-joining trees were constructed using the same software with 1,000 bootstrap replicates (Kumar et al. 2008).
For genome similarity assessment, genome comparisons were performed by using BRIG program (v. 0.95). The analysis was done by using the blastn option. The genome sequence of B. velezensis FZB42 was used as a reference sequence, whereas the genome sequences of strains (CAU B946, PG12 and UCMB5036) were used as query sequences.

Core and pan-genome analyses
Identification of the core and pan-genomes of the 22 B. amyloliquefaciens group strains were performed with the PGAP pipeline (Zhao et al. 2012). The protein similarity method from PGAP was used to detect a set of core orthologs which were clustered using at least 50% protein sequence identity to each other and 50% overlap with the longest sequence with an e-value threshold of 1e-5. The core and strain-specific genomes was extracted from the orthologous table by using a custom Perl script (Zeng et al. 2018). Functional annotation of core genes of B. velezensis, B. amyloliquefaciens and B. siamensis was performed using the COG database. The core and pan-genomes, as well as estimated respective sizes and trajectories, were assessed using models and regression algorithms proposed by Tettelin et al. (2005Tettelin et al. ( , 2008. Both the core and pan-genome were visualized through PanGP (v1.0.1) using default parameters to generate distribution plots of (i) total genes, and (ii) conserved genes found upon progressive sampling of "n" genomes (Zhao et al. 2014).

Construction of fenA deletion mutant
The fenA deletion mutant was generated from PG12 by homologous recombination. Primers UF/UR and DF/DR were used to amplify the upstream region (1,585 bp) and downstream region (1,500 bp) of fenA, respectively. The kanamycin resistance gene was amplified from plasmid pUB110 using primer pair Kan-F/ Kan-R. To construct the knockout vector, the upstream, downstream and resistance gene region were ligated using gene splicing by overlap extension (4,040 bp). Then, the target product was ligated into pMD19-T to generate the knockout plasmid pTFENA, which was subsequently transferred into B. velezensis PG12 by electroporation. Kanamycin-sensitive clones were isolated, and the mutants were identified by PCR with primer pair fenA-out-F/fenA-out-R and confirmed by Sanger sequencing (Additional file 2: Table S2, S3).

Analysis of biofilm formation
Biofilm formation was monitored in MSgg medium. PG12 was grown in LB medium at 37℃ overnight. Then, 4 µL of cell culture was added to each well of a 12-well microplate, with each well containing 4 mL of MSgg medium, and incubated statically at 28℃ for up to 60 h. To quantify biofilm formation, liquid cultures beneath the biofilm were carefully drawn off and the biofilm in each well was washed three times with 1 mL of sterile saline and fixed with 2 mL of 99% (v/v) methanol for 15 min, followed by air-drying. The dried biofilms were stained with 2 mL 0.1% crystal violet (CV) for 10 min. After excess CV was removed, the dried biofilms were dissolved with 5 mL of 33% (v/v) glacial acetic acid and then were diluted to 10 times for OD 570 detection (Weng et al. 2013;Fan et al. 2017b). For colony architecture, 2 µL of cell culture was spotted onto the surface of a Petri plate containing MSgg medium and 1.5% agar, and incubated at 28℃ for 60 h.

Assays of swarming motility
The swarming motility of PG12 was tested using standard protocol with minor modification. PG12 cultures in LB liquid medium were prepared with shaking (200 rpm) at 37℃ to an OD 600 of 0.9, one mL of cells were collected by centrifugation at 6,000 g for 5 min, washed with phosphate-buffered saline (PBS), and resuspended in 100 µL PBS. Three µL of cell suspension was spotted on the center of a Petri plate containing LB medium and 0.7% agar, and the plates were incubated at room temperature for 9 h to allow cell growth in order to clearly visualize the swarming zone (Chen et al. 2012;Fan et al. 2017b). The plates were air-dried for 2 h in a laminar flow hood, and then the diameter of the swarming zone was measured.

In vitro antagonism test
The antagonistic activities of the wild-type and mutant strains against fungal pathogens were assessed as follows. In the plate confrontation assay, the fungi were cultured on potato-dextrose-agar plates (PDA) at 28℃ for 3-5 days. A 5-mm-diameter mycelial block was then cut from the margin of an actively growing fungal culture and transferred into the center of a fresh PDA plate. After 1 day of incubation, 2 µL of overnight bacterial culture grown in LB medium was spotted on the PDA plate 2.5 cm away from the center where the mycelial block was placed. The antifungal activities of the examined bacterial strains were evaluated by measuring the radial extension of mycelium after 5-7 days of incubation at 28℃. the number of CDS unique to each strain. The total number of proteincoding genes within each genome is listed below the strain name. Figure  S4. Phylogenetic tree showing the relationship of the B. amyloliquefaciens group strains. The tree is based on the concatenated alignments of 860 single core genomes. Paenibacillus polymyxa M1 was used as an outgroup. Figure S5. Pan-and core-genome plots of the B. amyloliquefaciens group strains. a B. velezensis; b B. siamensis; c B. amyloliquefaciens. The lines in yellow and purple represent the pan-and core-genomes, respectively. Figure S6. Organization of lipopeptide gene clusters in draft genome sequence of the B. amyloliquefaciens group strains. Different genes are marked with different colors. Figure S7. Comparative analysis of G + C contents between genomes and the fengycin gene clusters (fenB, fenC and fenD) in the B. amyloliquefaciens group strains. Figure S8. Neighborjoining phylogenetic tree constructed based on the protein sequences of fenB. Figure S9. Neighbor-joining phylogenetic tree constructed based on the protein sequence of fenC. Figure S10. Neighbor-joining phylogenetic tree constructed based on the protein sequence of fenD. Figure  S11. The antifungal activity of PG12 and the ∆fenA mutant. A mycelium plug of fungal pathogen was inoculated onto the center of a PDA plate, and then 2 µL of PG12 and ∆fenA cultures in LB medium were added at a distance of 2.5 cm from the rim of the fungal plug. Clear inhibition zones were observed after incubation for 5-7 days.

Additional file 2: Table S1
Average nucleotide identity pairwise comparisons among the B. amyloliquefaciens group strains. Table S2. Strains and plasmids used in this study. Table S3. Primers used in this study.