Streptococcus pyogenes SAMN04875527 reads
datasetposted on 03.07.2017 by RYAN WICK
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This is a sample read set for use in Unicycler. It is linked to in the Unicycler README so people who install Unicycler can get a small read set to make sure the program works.
The files contain Illumina and PacBio reads from Streptococcus pyogenes biosample SAMN04875527. The SRA run accessions are SRR4242236 and SRR4242237. These files do not contain the full set of reads from these run. Rather, they have been subsampled down to create smaller files, easier to download. The PacBio reads were subsampled based on quality and are a high-quality subset of the original reads.
The Streptococcus pyogenes genome is particularly small and simple and is relatively easy to assemble with Illumina reads. It does have a few repetitive elements, however, including five copies of the RNA operon and six copies of IS1548.