sepsis-omics.github.io - This is a tutorial for a workshop on long-read (PacBio) genome assembly.
It demonstrates how to use long PacBio sequencing reads to assemble a bacterial genome, and includes additional steps for circularising, trimming, finding plasmids, and...
github.com - Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The...
github.com - MitoZ is a Python3-based toolkit which aims to automatically filter pair-end raw data (fastq files), assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome (genbank file as result), and mitogenome...
console.cloud.google.com - List of publically available databases on google server.
More at https://software.broadinstitute.org/gatk/download/bundle
ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/VCF/GATK/.
ftp://ftp.broadinstitute.org/bundle/hg38/hg38bundle/
It is often necessary to import sample textbook data into R before you start working on your homework.
Excel File
Quite frequently, the sample data is in Excel format, and needs to be imported into R prior to use. For this, we can...
github.com - Other tools focus on getting data out of the fastq or fast5 files, which is slow and computationally intensive. The benefit of this approach is that it works on a single, small, .txt summary file. So it's a lot quicker than most other things out...