Many times bioinformatician needs to parse binary files like bam and sff. Advantage of binary files is that they occupy less space in memory with maximum information content.
Link for those who looking for structure of Bam and sff...
www.crops.org - Plant scientists can exploit available 50 plant complete genomic data for their resequencing or other related projects for understanding the genetic mechanism behind their different traits and molecular evolution. Complete information about these...
www.geneprof.org - GeneProf is a web-based, graphical software suite that allows users to analyse data produced using high-throughput sequencing platforms (RNA-seq and ChIP-seq; "Next-Generation Sequencing" or NGS): Next-gen analysis for next-gen data!
Some of...
sourceforge.net - AIRVF that works on flowgram, raw and mapped reads and called variants to reduce artifact-driven false variant calls. Tests on sequencing data of standard reference material showed up to ∼98% reduction of false variants when combined to...
github.com - ALPACA requires Celera Assembler 8.3 or later. It is recommended to build Celera Assembler from source. (Why? The pre-built binaries CA_8.3rc1 and CA8.3rc2 will work for any large data set.
Detail paper...
Research Scientist/Project Associate/Project Assistant Jobs opportunity in Indian Institute of Technology Madras (IIT Madras)
Research Scientist
Qualification : Ph.D in any branch of life science or bioinformatics or computational biology...
bioconductor.org - This package estimates tumor purity, copy number, and loss of heterozygosity (LOH), and classifies single nucleotide variants (SNVs) by somatic status and clonality. PureCN is designed for targeted short read sequencing data, integrates well with...
gite.lirmm.fr - An error correction method that uses long reads only. The method consists of two phases: first, we use an iterative alignment-free correction method based on de Bruijn graphs with increasing length of k-mers, and second, the corrected reads are...