samtools.sourceforge.net - In current genome era, our day to day work is to handle the huge geneome sequences, expression data, several other datasets. This link provide a comprehensive list of commonly used sofware/tools.
Many times bioinformatician needs to parse binary files like bam and sff. Advantage of binary files is that they occupy less space in memory with maximum information content.
Link for those who looking for structure of Bam and sff...
sfu-compbio.github.io - SCALCE (/skeɪlz/, a.k.a. boosting Sequence Compression Algorithms using Locally ConsistentEncoding) is a tool for compressing FASTQ files. It is designed specifically for the Illumina-generated FASTQ files, but supports any...
broadinstitute.github.io - Picard is a set of command line tools for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. These file formats are defined in the Hts-specs repository. See especially the SAM specification and the VCF...
To do sam to bam conversion, follow the following commands :-
Code:
$ samtools view -b -S file.sam > file.bam
Then you will need to use
Code:
$ samtools sort file.bam file-sorted
followed by
Code:
$ samtools index...
userweb.eng.gla.ac.uk - This webpage lists some of the one-liners that we frequently use in metagenomic analyses. You can click on the following links to browse through different topics. You can copy/paste the commands as they are in your terminal screen, provided you...
github.com - VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from...
github.com - Read one or more FASTQ files, fastqe will compute quality stats for each file and print those stats as emoji... for some reason.
Given a fastq file in Illumina 1.8+/Sanger format, calculate the mean (rounded) score for each position and...
bedtools.readthedocs.io - Collectively, the bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic: that is, set theory on the genome. For...
bamview.sourceforge.net - To run the application on UNIX from the downloaded jar file run the UNIX:
java -mx512m -jar BamView.jar
and extra command line options are given when '-h' is used:
java -jar BamView.jar -h
BAM files can be specified on the command line with the...