2032 days ago
Finding Kmers from fasta sequence file
Save it in sample.fa >test TAATGCCATGGGATGTT jellyfish count -m 3 -s 100000 sample.fa -o sample.jf jellyfish dump -c sample.jf It return TGT 1 GAT 1 GGG 1 GGA 1 CAT 1 TGC 1 TAA 1 GCC 1 CCA 1 GTT 1 TGG 1 ATG 3 AAT 12000 days ago
Split the multifasta in separate files !
cat Avaga_allPalindrome.fa | awk '{ if (substr($0, 1, 1)==">") {filename=(substr($0,2) ".fa")} print $0 > filename }'1986 days ago
1633 days ago
Installing ggplot2 and its dependencies on Ubuntu !
jit@jit-HP-Pro-3335-MT:~/Downloads/MitoHunter/minidot/bin$ sudo R [sudo] password for jit: R version 3.4.4 (2018-03-15) -- "Someone to Lean On" Copyright (C) 2018...1578 days ago
To convert just one specific read group to fastq
# Stop script on error. set -uex # The SRR BioProject number for the sequencing data. PROJECT=PRJNA257197 # The number of datasets to subselect from the project...1571 days ago
1535 days ago
Script to extract the cluster detail !
$ lsb_release -a No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 18.04.1 LTS Release: 18.04 Codename: bionic $ cat /proc/cpuinfo | grep -i 'model name' | head -n 1 model name : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz1407 days ago
Bash script to get intergenic region from genome files !
#For the intergenic region, we will require the size of the chromosomes. wget http://xxx.chrom.sizes cat xxx.chrom.sizes | sed 's/^chr//' | sed 's/Cp/Pt/' > tmp mv...1394 days ago
Commands to Remove White Space In Text Or String Using Awk And Sed In Linux
text=" ATGGTV AGTGACCTAGAGTGATGA G GGRTTT" echo "$text" | sed 's/ //g' OR echo "$text" | awk '{ gsub(/ /,""); print }' Return: ATGGTVAGTGACCTAGAGTGATGAGGGRTTT...984 days ago