The health care industry is expected to sustain stable growth over the next decade for a variety of reasons. Advances in medicine have prolonged the average lifespans of most people, requiring more health care treatments over longer terms. In years...
Here is a small tutorial on how to make best use of multiple processors for bioinformatics analysis. One best way is using perl threads and forks. Knowing how these threads and forks work is very important before implementing them. Getting to know...
FYI, I've found it useful to use MUMmer to extract the specific changes that Racon makes, so I can evaluate them individually:
minimap -t 24 assembly.fasta long_reads.fastq.gz | racon -t 24 long_reads.fastq.gz - assembly.fasta...
These are some of the most famous Interview Puzzles being asked in top tech companies.Here is a list of Top 25 puzzles which have been asked in top Tech Interview.
2 Eggs and 100 Floor Classic Puzzle
Five pirates and gold coin Puzzle
Six...
BLASTn output format 6
BLASTn maps DNA against DNA, for example gene sequences against a reference genomeblastn -query genes.ffn -subject genome.fna -outfmt 6
BLASTn tabular output format 6
Column headers:qseqid sseqid pident...
There are many tools to perform gap filling using Illumina short reads, for example "GapFiller: a de novo assembly approach to fill the gap within paired reads" or "Toward almost closed genomes with GapFiller". There are also some tools like...
Python Programming is a general purpose programming language that is open source, flexible, powerful and easy to use. One of the most important features of python is its rich set of utilities and libraries for data processing and analytics...
To remove all line ends (\n) from a Unix text file:
sed ':a;N;$!ba;s/\n//g' filename.txt > newfilename_oneline.txt
To get average for a column of numbers (here the second column $2):
awk '{ sum += $2; n++ } END { if (n > 0) print sum / n;...
The purpose of this cheat sheet is to introduce biologist and bioinformatician to the frequently used tools for NGS analysis as well as giving experience in writing one-liners.
File System ls — list items in current directory ls...
Genome browsers are useful not only for showing final results but also for improving analysis protocols, testing data quality, and generating result drafts. Its integration in analysis pipelines allows the optimization of parameters, which...