Onliner to split the multifasta to singlefasta files !
#Split the multifasta to singlefasta # Multi fasta #Single fasta awk '$0 ~ "^>" { match($1, /^>([^:]+)/, id); filename=id[1]} {print >> filename".fa"}' sequence.fasta1457 days ago
1459 days ago
Sequence Ids conversion files !
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/ Name Size Date Modified ARCHIVE/ 02/01/2020, 05:30:00 ASN_BINARY/ 03/07/2020, 07:49:00 GENE_INFO/ 03/07/2020, 07:48:00 0 B 10...1459 days ago
Reformat the multifasta for sequence length !
#awk oneliner to reformat the multifasta sequences awk '!/^>/ {printf "%s", $0; n = "\n"} /^>/ {print n $0; n = ""}' file.fasta | fold -w 1001448 days ago
Bash script to handle Multifasta files
#Convert all lowercase residues to uppercase in a FASTA sequence file $ awk 'BEGIN{FS=" "}{if(!/>/){print toupper($0)}else{print $1}}' input.fasta > output.fasta #Rear...1413 days ago
1396 days ago
1260 days ago
Python script to read FASTA and FASTQ file !
# !/usr/bin/env python3 # -*- coding: utf-8 -*- from pysam import FastxFile def read_fasta_q_file(fasta_q_file): """Parse FASTA/Q file using `pysam.Fastx...1033 days ago
Remove dupcates in multifasta file !
#Using seqkit for duplicate sequence removal seqkit rmdup -n seqs.fa -o seqs_without_duplicate.fa #Awk for duplicate sequence removal awk '/^>/ { f = !a[$0]++ } f' seqs.fa1032 days ago
Tadpole is 250x faster than SPADes assembler !
lege@jit-Lenovo-ideapad-320-15ISK:~/Downloads/MyTools/Vir$ tadpole.sh Written by Brian Bushnell Last modified July 16, 2018 Description: Uses kmer counts to assembl...1032 days ago