Download the genome from NCBI using bash script/command
#!/bin/bash # Download the genome from NCBI using command # Create a Directory mkdir genome cd genome # Look for genome assembly summary and extract the URL....gov/genomes/genbank/bacteria/assembly_summary.txt' | awk '{FS="\t"}...2527 days ago
Unzip all the genome file and remove all fasta header except first one
#!/bin/bash gzip -d *.gz FILES=$(pwd)/* for f in $FILES do echo "Processing $f file..." if [[ $f =~ \.fna$ ]]; then awk ' /^>/ && FNR > 1 {next} {print...2526 days ago
Download the gff files from NCBI using bash script/command
#!/bin/bash # Download the genome from NCBI using command # Create a Directory mkdir genome_gff cd genome_gff # Look for genome assembly summary and extract the URL....gov/genomes/genbank/bacteria/assembly_summary.txt' | awk '{FS="\t"}...2518 days ago
Calculate Dinucleotide Frequency with Perl
#!/usr/bin/perl -w use strict; my ($genome, $head, $tail); my (%mono_nt, %di_nt); $/ = ">"; open my $fasta, '2333 days ago
Clump Finding Problem Solved with Perl
#Find patterns forming clumps in a string. #Given: A string Genome, and integers k, L, and t. #Return: All distinct k-mers forming (L, t)-clumps in Genome....2329 days ago
Insert the sequence at desire location in multi-fasta file with Perl
#!/usr/bin/perl use warnings; use strict; use Bio::SeqIO; use Bio::Seq; use File::Copy; #ARGV[0] should be in following format --- Keep the coordinate sorted by name+location #Genomech...2309 days ago
Create genome scaffolding with Perl
...ad1 NAME psl_scaffolder.pl - use self-mapped PSL file to scaffold a genome =head1 SYNOPSIS ./psl_s...pod2usage({-exitVal => 1, -message => "Error: No query assembly file provided",...2303 days ago
Plot the density of genes in R
#column1 = chromosome name and column2 = start position of the gene # check if ggplot2 is installed, if so, load it, # if not, install and load it if("ggplot2" %in...2279 days ago
2259 days ago
Plot custom gene density with R
library(karyoploteR) pp2244 days ago