Run Pango on your multifasta file !
...or_Cova_all_samples_combined.fa ** Running sequence QC ** Number of sequences detected: 320 Total passing...aintainability-limitations warnings.warn( processing block of 293 sequences 11/30/2021, 13:10:08 /home/j...885 days ago
Omicron Sequences accession number !
EPI_ISL_6647956 EPI_ISL_6647957 EPI_ISL_6647958 EPI_ISL_6647959 EPI_ISL_6647960 EPI_ISL_6647962 EPI_ISL_6647961 Search the IDs in https://www.epicov.org/epi3/frontend883 days ago
Extract the sequences with IDs !
#sed -i 's/\_/ /g' Delta_seqID_from_lineage_report.txt seqtk subseq genomic.fna Delta_seqID_from_lineage_report.txt > Delta.fasta #Split the fasta in 11 equal sequences subsets pyfasta split -n 11 Delta.fasta849 days ago
Extract all fasta sequences except ids !
awk 'BEGIN{while((getline0)l[">"$1]=1}/^>/{f=!l[$1]}f' genomic.fna > filtered_without_omi.fasta #extract subseq seqtk subseq omi_ids.fa omi_single_id.txt > omi_singl...826 days ago
bash script to extract sequence by ids !
Use a Perl one-liner, grep and seqtk subseq to extract the desired fasta sequences: # Create test input: cat > in.fasta BGI_novel_T016313 Solyc03g025570.2.1 TTCAAGTGTTAGTTTCACAT...821 days ago
Script to rapid genome clustering based on pairwise ANI
First, create a blast+ database: makeblastdb -in -dbtype nucl -out Next, use megablast from blast+ package to perform all-vs-all blastn of sequences: blastn -query -db -outfmt '...633 days ago
590 days ago
Identify genome-wide synteny with LASTZ alignment
#This is the walkstrough how to identifiy genome-wide synteny markers based on LASTZ alignment. Step1:Mask the repeat sequences for both genomes and chromosomes. RepeatMasker -pa...515 days ago
Perl script to find edit distance between two sequences !
#!/usr/bin/perl use strict; use warnings; sub edit_distance { my ($s1, $s2) = @_; my $len1 = length($s1); my $len2 = length($s2); my @dp; for (my $i = 0; $i423 days ago
Perl script for chi-squared test !
...################ # Perform chi-squared test ############################################################ # need total of all counts in both sequences, plus totals of 'rows' in chi...409 days ago