This python script is by far the best for MCscan
https://github.com/tanghaibao/jcvi/wiki/MCscan-(Python-version)
use SPAdes to assemble the data. SPAdes is a swiss-army knife of genome assembly tools, and by default includes read correction. This takes up lots of RAM, so we are going to skip it. We will also only use 3 kmers to save time:
./SPAdes-3.6.2-Linux/bin/spades.py --only-assembler
-t 4 -k 21,51,71
-1 SRR2627175_1.fastq.gz
-2 SRR2627175_2.fastq.gz
--nanopore minion.pass.2D.fastq
-o SPAdes_hybrid &
Use samtools to extract the top contig:
head -n 1 SPAdes_hybrid/contigs.fasta
samtools faidx SPAdes_hybrid/contigs.fasta
samtools faidx SPAdes_hybrid/contigs.fasta NODE_1_length_4620446_cov_135.169_ID_22238 > single_contig.fa
Finally, a quick comparison to the reference:
sudo apt-get install mummer
curl -s "http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&id=NC_000913.3&rettype=fasta&retmode=txt" > NC_000913.3.fa
nucmer NC_000913.3.fa single_contig.fa
mummerplot -png out.delta
display out.png &