EcOH: In silico serotyping of E. coli - Supplementary material
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This supplementary data accompanies the
manuscript "In silico serotyping of E. coli from short read data
identifies limited novel O loci but extensive diversity of O:H serotype
combinations within and between pathogenic lineages".
Sequences used in the EcOH database are given in EcOH Supplementary Table 1.
NCBI preliminary validation results are given in EcOH Supplementary Table 2.
Validation of phenotype from genotype on 197 EPEC isolates are in EcOH Supplementary Tables 3-5.
Diversity analyses results on 1547 E. coli are given in EcOH Supplementary Table 6.
Supplementary Figures 1-3 are given in EcOH
Sequences and annotations for the novel loci identified in GEMS and the ETEC and GenomeTrakr datasets are given in GEMS_6novel_Oantigen.gbk and GT_ETEC_32novel_Oantigen.gbk. Three O-antigens with variant alleles are in Variants_prototypical_Oantigens.gbk.