With the emergence of NGS technologies, and sequencing data most of the bioinformaticians mung and wrangle around massive amounts of genomics text. There are several "standardized" file formats (FASTQ, SAM, VCF, etc.) and some tools for manipulating...
Problem at http://rosalind.info/problems/1c/
#Find all occurrences of a pattern in a string.#Given: Strings Pattern and Genome.#Return: All starting positions in Genome where Pattern appears as a substring. Use 0-based indexing.use strict;use...
code.google.com - You are requested to please bookmark collection of bioinformatics tools, scripts, codes that can be pieced together in a very easy and flexible manner to perform both simple and complex bioinformatics tasks.
The next-generation sequencing included...
cpansearch.perl.org - This is a collection of libraries and high-quality end-user scripts for bioinformatic analysis, including working with gene annotation, collecting data scores from a variety of modern file formats, and conversion between file formats. The...
github.com - SNPGenie is a Perl script for estimating evolutionary parameters, mainly from pooled next-generation sequencing (NGS) single-nucleotide polymorphism (SNP) variant data. SNP reports (acceptable in a variety of formats) much each correspond to a...
Perl's second wave of adoption came from the growth of the world wide web. Dynamic web pages—the precursor to modern web applications—were easy to create with Perl and CGI. Thanks to Perl's ubiquity as a language for system...
vcftools.sourceforge.net - VCFtools contains a Perl API (Vcf.pm) and a number of Perl scripts that can be used to perform common tasks with VCF files such as file validation, file merging, intersecting, complements, etc. The Perl tools support all versions of the VCF...
There are many R software and bioconductor packages for NGS data analysis, some of them are as follows
Biostrings
The Biostrings package from Bioconductor provides an advanced environment for efficient sequence management and analysis in R. It...