BOL: Related items

CANU: Assembling Large Genomes with Single-Molecule Sequencing and Locality Sensitive Hashing.

Jit — Tue, 26 Apr 2016 11:38:10 -0500

Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing (such as the PacBio RSII or Oxford Nanopore MinION). The software is currently alpha level, feel free to use and report issues encountered.

Canu is a hierachical assembly pipeline which runs in four steps:

Detect overlaps in high-noise sequences using MHAP
Generate corrected sequence consensus
Trim corrected sequences
Assemble trimmed corrected sequences

Read the documentation

New release https://github.com/marbl/canu/releases

Address of the bookmark: https://github.com/marbl/canu

Hagfish - assess an assembly through creative use of coverage plots

Abhi — Fri, 20 May 2016 19:08:17 -0500

Hagfish is a tool that is to be used in data analysis of Next Generation Sequencing (NGS) experiments. Hagfish builds on the concept of coverage plots and aims to assist (amongst others) in quality control of de novo genome assembly or identification of structural variation in a genome re-sequencing experiment.

Hagfish requires a reference sequence and a paired end re-sequencing data set. Hagfish has more power the larger the insert size of the paired end library is.

Quick links: Installation,Operation, Read mappers, Hagfish scripts, Hagfish plots

Address of the bookmark: https://github.com/mfiers/hagfish

A5-miseq

Jit — Thu, 18 Aug 2016 04:05:23 -0500

_A5-miseq_ is a pipeline for assembling DNA sequence data generated on the Illumina sequencing platform. This README will take you through the steps necessary for running _A5-miseq_.

Point to note:

There are many situations where A5-miseq is not the right tool for the job. In order to produce accurate results, A5-miseq requires Illumina data with certain characteristics. A5-miseq will likely not work well with Illumina reads shorter than around 80nt, or reads where the base qualities are low in all or most reads before 60nt. A5-miseq assumes it is assembling homozygous haploid genomes. Use a different assembler for metagenomes and heterozygous diploid or polyploid organisms. Use a different assembler if a tool like FastQC reports your data quality is dubious. You have been warned! Datasets consisting solely of unpaired reads are not currently supported.

Address of the bookmark: https://sourceforge.net/projects/ngopt/

VAGUE:Velvet Assembler Graphical Front End

Jit — Fri, 24 Feb 2017 08:56:49 -0600

VAGUE is a vague acronym for "Velvet Assembler Graphical Front End", which means it is a GUI for the Velvet de novo assembler. The command line version of Velvet can be complicated for beginners to use, but VAGUE makes it clear and simple

More at http://www.vicbioinformatics.com/software.vague.shtml

Address of the bookmark: http://www.vicbioinformatics.com/software.vague.shtml

RepeatModeler

Jit — Thu, 18 Aug 2016 09:57:15 -0500

RepeatModeler is a de-novo repeat family identification and modeling package. At the heart of RepeatModeler are two de-novo repeat finding programs ( RECON and RepeatScout ) which employ complementary computational methods for identifying repeat element boundaries and family relationships from sequence data. RepeatModeler assists in automating the runs of RECON and RepeatScout given a genomic database and uses the output to build, refine and classify consensus models of putative interspersed repeats.

Address of the bookmark: http://www.repeatmasker.org/RepeatModeler.html

SRF Bioinformatics job position in National Institute of Plant Genome Research (NIPGR)

Mon, 19 Sep 2016 05:43:38 -0500

SRF Bioinformatics job position in National Institute of Plant Genome Research (NIPGR)
Title : “Transcriptome and small RNA diversity analysis of developing seed contrasting rice varieties”
Qualification : Candidates having M.Sc./M.Tech. degree or equivalent (with minimum 60% marks) in Bioinformatics with a minimum of two years of post M.Sc./M.Tech research experience are eligible to apply.
No. of Post : 01
How to apply
Application should reach to Dr. Pinky Agarwal, Staff Scientist, National Institute of Plant Genome Research (NIPGR) Aruna Asaf Ali Marg, P.O. Box NO. 10531, New Delhi - 110067 on or before 30/09/2016

More at http://www.nipgr.res.in/careers/vacancies_latest.php#

Gene Finding and Predictions

Poonam Mahapatra — Fri, 26 Aug 2016 07:26:27 -0500

In this exercise, a previously annotated gene will be used to measure the accuracy of different gene finding approaches. GRAIL, GENSCAN, geneid, FGENESH, GenomeScan, GrailEXP and GENEWISE will be used to annotate the sequence. Both search by signal, content and homology (protein and cDNA sequences) methods will be employed in order to improve the ab initio results. Weak conservation of Start codons will lead to wrong prediction of initial exons in most cases.

http://genome.crg.es/courses/Bioinformatics2003_genefinding/

Address of the bookmark: http://genome.crg.es/courses/Bioinformatics2003_genefinding/

Artemis Comparison Tool (ACT)

Shruti Paniwala — Wed, 07 Sep 2016 03:54:41 -0500

ACT is a Java application for displaying pairwise comparisons between two or more DNA sequences. It can be used to identify and analyse regions of similarity and difference between genomes and to explore conservation of synteny, in the context of the entire sequences and their annotation. It can read complete EMBL, GENBANK and GFF entries or sequences in FASTA or raw format.

Address of the bookmark: http://www.sanger.ac.uk/science/tools/artemis-comparison-tool-act

CIRCOS Visualize !!

Jit — Fri, 02 Sep 2016 08:29:26 -0500

Before uploading a data file, check the samples gallery to make sure that your data format is compatible.

Your file must be plain text.
Your data values must be non-negative integers.
Data must be space-separated (one or more tab or space, which will be collapsed).
No two rows or columns may have the same name.
Column and row names must begin with a letter (e.g. 'A', 'A0', 'A-0') and can only contain letters, numbers and _. No punctuation!
Maximum row + column total is 150 — if exceeded, rows and columns are limited to 75.
If you are using order, size and color rows/columns in combination they must appear in that order.

Need help? Post questions to the Circos Google Group.

http://mkweb.bcgsc.ca/tableviewer/visualize/

Address of the bookmark: http://mkweb.bcgsc.ca/tableviewer/visualize/

Sybil

Jit — Wed, 07 Sep 2016 03:20:44 -0500

The Sybil software package provides a primarily web-based front-end to comparative genome datasets warehoused in a chado relational database. It was developed by the bioinformatics department at The Institute for Genomic Research (TIGR) and development continues at the J. Craig Venter Institute (JCVI) and the Institute for Genome Sciences (IGS) at the University of Maryland: Baltimore. Sybil has been used at TIGR/JCVI, IGS, NYU, New York Medical College, Novartis Vaccines and University of Maryland: College Park to support a number of research projects that involve comparative genome analysis. The following sections provide some high-level technical details about the overall architecture and external dependencies of the Sybil package.

Address of the bookmark: http://sybil.sourceforge.net/