985 days ago
Onliner to convert multi line fasta to single line fasta !
#Oneliner to convert awk '/^>/ {printf("\n%s\n",$0);next; } { printf("%s",$0);} END {printf("\n");}' < file.fa > fileres.fa #Then delete the first empty line tail -n +2 fileres.fa > fileout.fa985 days ago
Bash script to simulate a genome !
# Reference https://github.com/chhylp123/hifiasm/issues/33 # Use Drosophila melongaster PacBio assembly cd /genetics/elbers/test/fly2 wget https://ftp.ncbi.nlm.nih...974 days ago
Extract fasta sequences with ids in another file !
#Ids are in test.txt - one ids per line #sequences are in test.fa grep -w -A 2 -f test.txt test.fa --no-group-separator # seqtk seqtk subseq test.fa test.txt #faSomeRecods faSomeRecords in.fa listFile out.fa # seqkit seqkit grep -n -f list.txt sequences.fas > newfile2.fas946 days ago
Run Pango on your multifasta file !
#More at https://cov-lineages.org/resources/pangolin/usage.html (base) [jnarayan@hn1 FASTA]$ conda activate pangolin (pangolin) [jnarayan@hn1 FASTA]$ ls Input_for_...944 days ago
Extract fasta header with ids !
#Extract all the fasta header name with certain ids kraken --db ../../../../DATABASE/minikraken_20171019_8GB.tgz out.fa more out.fa_class.txt | grep "227859" | awk...933 days ago
923 days ago
Perl script to rename the fasta file !
#Script #1 #!/usr/bin/perl -w use strict; #USAGE #perl extractPattern.pl kmerfasta > uniref100_result_broad my %kHash; local $/ = '>'; my $infile2 = "$ARGV[0]"; # Kmer fasta open( FH2, '914 days ago
Extract the sequences with IDs !
#sed -i 's/\_/ /g' Delta_seqID_from_lineage_report.txt seqtk subseq genomic.fna Delta_seqID_from_lineage_report.txt > Delta.fasta #Split the fasta in 11 equal sequences subsets pyfasta split -n 11 Delta.fasta907 days ago
Extract all fasta sequences except ids !
awk 'BEGIN{while((getline0)l[">"$1]=1}/^>/{f=!l[$1]}f' genomic.fna > filtered_without_omi.fasta #extract subseq seqtk subseq omi_ids.fa omi_single_id.txt > omi_sing...884 days ago