Samtools commands for bioinformaticians
## count mapped reads
samtools view -c -F 260 mapping_file.bam
### converting sam...q 42 -c sal_sej.bam
### sorting bam file by genome position
samtools sort sal_s...ls index sal_sej_sorted.bam.bam
### identifying genome...
1604 days ago
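The `-F 260` filter in the read-count command is a bit mask: 260 = 4 + 256, where 4 is the SAM flag bit for "read unmapped" and 256 is the bit for "not primary alignment". With `-F`, any read that has either bit set is dropped, so the count covers primary alignments of mapped reads only. A minimal sketch of that bit test in plain shell; the helper name `passes_F260` and the flag values in the loop are hypothetical, chosen only for illustration:

```shell
# Hypothetical helper: succeeds if a SAM FLAG value would survive
# `samtools view -F 260`, i.e. neither the 4 (unmapped) bit
# nor the 256 (secondary alignment) bit is set.
passes_F260() {
    [ $(( $1 & 260 )) -eq 0 ]
}

# Illustrative flag values, not taken from any real BAM file.
for flag in 0 16 4 256 99; do
    if passes_F260 "$flag"; then
        echo "$flag kept"
    else
        echo "$flag filtered"
    fi
done
```

Flags 0, 16 and 99 pass the test, while 4 and 256 are filtered.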
Bash script to align short reads against a reference genome
bwa mem -t 40 -R '@RG\tID:K12\tSM:K12' \
    E.coli_K12_MG1655.fa SRR1770413_1.fastq.gz SRR1770413_2.fastq.gz \
    | samtools view -b - >SRR1770413.raw.bam
sambamba...
1560 days ago
Pack a Perl program with its dependencies on Ubuntu
#Follow steps to create your own executable ./web jit@jit-HP-Pro-3335-MT:~/Downloads...
1512 days ago
Install Ragout genome assembler
$ conda install -c bioconda ragout
Collecting package metadata (repodata.json): done
Solving environment: done
## Package Plan ##
environment location: /home/...
1465 days ago
Perl one-liners to print only lines without uppercase letters
#Go through the file and print only the lines that contain no uppercase letters.
perl -ne 'print unless m/[A-Z]/' dna.fa > dnaOnlyLowercase.fa
#To lowercase everything
perl -pe 'tr/A-Z/a-z/' dnaUpperCase.fa > dnawithoutuppercase.fa
1385 days ago
Get GC content across the entire CDS
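#Look at GC across the entire CDS.
#GENOME.fa and ANNOTATION.gff are placeholders for your genome FASTA and annotation file.
gffread -x - -g GENOME.fa ANNOTATION.gff | \
    seqtk comp - | \
    awk -v OFS="\t" '{ print $1, "0", $2, ($4 + $5) / $2 }'
1394 days ago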
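The awk step relies on the column layout of `seqtk comp`, whose first six columns are sequence name, length, and the counts of A, C, G and T. `($4 + $5) / $2` is therefore (C + G) / length, and the printed line is BED-like: name, 0, length, GC fraction. A small sketch of just that step on a mock input line; the function name `gc_from_comp` and the numbers are invented for illustration:

```shell
# Hypothetical wrapper around the awk step from the pipeline above.
gc_from_comp() {
    awk -v OFS="\t" '{ print $1, "0", $2, ($4 + $5) / $2 }'
}

# Mock `seqtk comp` line: name, length, #A, #C, #G, #T.
# Values are invented; real output carries further columns.
printf 'cds1\t10\t2\t3\t3\t2\n' | gc_from_comp
```

Here 3 C plus 3 G over a length of 10 gives a GC fraction of 0.6.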
Bash script to get exon fragments from genome files
#Exons are already defined in the GTF file, so we simply need to print the lines that are marked as exons.
gunzip -c genome_file.gtf.gz | \
    awk 'BEGIN{OFS="\t";} $3=="exon" {print $1,$4-1,$5}' | \
    bedtools sort | \
    bedtools merge -i - | \
    gzip > my_exon.bed.gz
1367 days ago
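The `$4-1` in the awk step converts coordinates: GTF intervals are 1-based and inclusive, while BED intervals are 0-based and half-open, so the start is shifted down by one and the end is kept. A minimal sketch of the filter alone, run on two mock GTF lines; the function name `gtf_exons_to_bed` and the records are invented for illustration:

```shell
# Hypothetical wrapper around the awk filter from the pipeline above:
# keep only exon records and emit chrom, 0-based start, end.
gtf_exons_to_bed() {
    awk 'BEGIN{OFS="\t"} $3=="exon" {print $1,$4-1,$5}'
}

# Mock GTF records (one gene line, one exon line), not from a real annotation.
printf '1\tsrc\tgene\t100\t500\t.\t+\t.\tgene_id "g1";\n1\tsrc\texon\t100\t200\t.\t+\t.\tgene_id "g1";\n' \
    | gtf_exons_to_bed
```

Only the exon line survives, and its GTF interval 100-200 becomes the BED interval 99-200.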
Bash script to get intergenic regions from genome files
#For the intergenic region, we will require the size of the chromosomes.
wget http://xxx.chrom.sizes
cat xxx.ch...s | sed 's/^chr//' | sed 's/Cp/Pt/' > tmp
mv tmp xxx.chrom.sizes
gunzip -c genome_...
1367 days ago
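The two `sed` expressions rewrite the chromosome names in the sizes file: the first strips a leading `chr`, the second replaces `Cp` with `Pt`. Presumably this makes the names match those used in the annotation file, though that is an assumption since the rest of the script is not shown. A small sketch of the renaming on mock input; the function name `rename_chroms` and the sizes are invented for illustration:

```shell
# Hypothetical wrapper around the two sed steps from the script above.
rename_chroms() {
    sed 's/^chr//' | sed 's/Cp/Pt/'
}

# Mock chrom.sizes content (name, size); values are not real.
printf 'chr1\t1000\nchrCp\t150\n' | rename_chroms
```

The names `chr1` and `chrCp` come out as `1` and `Pt`, with the sizes unchanged.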
Create 10000 random SNPs in a genome
(base) ➜ dupStudy git:(master) ✗ perl ../simuG.pl -refseq SGDref.R64-2-1.dups.fa -snp_count 10000 -prefix simuSNP
[Sun Jan 10 16:05:57 2021] Starting simuG ..
[...
1212 days ago