1847 days ago
1651 days ago
Bash script to download SRA file !
#We can use the sratoolkit to directly pull the sequence data (in paired FASTQ format) from the archive. fastq-dump is in the SRA toolkit. It allows directly downloading d...1606 days ago
To convert just one specific read group to fastq
# Stop script on error. set -uex # The SRR BioProject number for the sequencing data. PROJECT=PRJNA257197 # The number of datasets to subselect from the project....1589 days ago
Onliner to split the multifasta to singlefasta files !
#Split the multifasta to singlefasta # Multi fasta #Single fasta awk '$0 ~ "^>" { match($1, /^>([^:]+)/, id); filename=id[1]} {print >> filename".fa"}' sequence.fasta1447 days ago
Perl script to run awk inside perl
#!/usr/local/bin/perl eval 'exec /usr/local/bin/perl -S $0 ${1+"$@"}' if $running_under_some_shell; # this emulates #! processing on NIH machines. # (remove #! line ab...1210 days ago
Split the string with underscore and store values in array with AWK !
more enriched_ids | grep "WP_" | awk '{split($2,a,"_"); print a[4]"_"a[5]}' #Other extraction more enriched_ids | grep "WP_" | awk '{split($2,a,"_"); print a[4]"_"a[5]}'> enriched_ids_list awk 'NR==FNR{tgts[$1]; next} $1 in tgts' enriched_ids_list result/GO.out > enriched_GO.out.xls941 days ago
Perl script to rename the fasta file !
#Script #1 #!/usr/bin/perl -w use strict; #USAGE #perl extractPattern.pl kmerfasta > uniref100_result_broad my %kHash; local $/ = '>'; my $infile2 = "$ARGV[0]"; # Kmer fasta open( FH2, '905 days ago
Extract the sequences with IDs !
#sed -i 's/\_/ /g' Delta_seqID_from_lineage_report.txt seqtk subseq genomic.fna Delta_seqID_from_lineage_report.txt > Delta.fasta #Split the fasta in 11 equal sequences subsets pyfasta split -n 11 Delta.fasta898 days ago
Bash script to split multifasta file !
#Using awk, we can easily split a file (multi.fa) into chunks of size N (here, N=500), by using the following one-liner: awk 'BEGIN {n=0;} /^>/ {if(n%500==0){file=sprin...870 days ago