Bash script to split multifasta file !
#Using awk, we can easily split a file (multi.fa) into chunks of size N (here, N=500), by using the following one-liner: awk 'BEGIN {n=0;} /^>/ {if(n%500==0){file=sp...836 days ago
Commands to get the detail of disk usage on Linux !
#A simplistic approach would be du -shc /home/* du -shc /home/jnarayan #To sort it: du -smc /home/* | sort -n #There is also a wellknown Perl script that has the option of mailing disk usage reports per user: durep http://www.ubuntugeek.com/create-disk-usage-reports-with-durep.html828 days ago
Bash script to transfer files to server !
# rsync options source destination rsync -azvh --progress PacBio_clean.fa xxx@xxx.xxx.res.in:/home/ # scp source_file_name username@destination_host:destination_folder scp –rpv /datafile xxx@192.168.1.100:/home/me816 days ago
Bash script to find difference between two files !
#lines which are exist only in file2: grep -Fxvf file1 file2 > file3 #lines which are exist only in file1: grep -Fxvf file2 file1 > file3 #lines which are exist in both files: grep -Fxf file1 file2 > file3785 days ago
Bash script to convert multiline fasta to single line fasta !
#file.fa is multiline fasta awk '/^>/ {printf("\n%s\n",$0);next; } { printf("%s",$0);} END {printf("\n");}' < file.fa779 days ago
Script to rapid genome clustering based on pairwise ANI
First, create a blast+ database: makeblastdb -in -dbtype nucl -out Next, use megablast from blast+ package to perform all-vs-all blastn of sequences: blastn -query...648 days ago
Perl script to read the next line of a file !
my $line = ; while(1) { # keep looping until I say so my $nextLine = ; if ($line =~ m/>/ || !defined $nextLine) { ### Do the stuff } ### Do any other stuff; last unless defined $nextLine; $line = $nextLine; }578 days ago
Perl script to find inverted repeats !
#!/usr/bin/perl use strict; use warnings; use Bio::SeqIO; use Bio::Tools::Run::RepeatMasker; my $genome_file = "genome.fasta"; # read genome sequence my...438 days ago
Perl script to find edit distance between two sequences !
#!/usr/bin/perl use strict; use warnings; sub edit_distance { my ($s1, $s2) = @_; my $len1 = length($s1); my $len2 = length($s2); my @dp; for (my $i = 0; $i437 days ago
Raku script to find palindrome in genomes !
sub is-palindrome(Str $str) returns Bool { $str.=uc; # convert to uppercase $str.=subst:g/\s+//; # remove any spaces return $str eq $str.flip; } sub...437 days ago