BOL: All Site Activity

All Site Activity

- Jit@jit.aber
Jit bookmarked INC-Seq: accurate single molecule reads using nanopore sequencing 2555 days ago

INC-Seq reads enabled accurate species-level classification, identification of species at 0.1 % abundance and robust quantification of relative abundances, providing a cheap and effective approach for pathogen detection and microbiome profiling...

https://github.com/CSB5/INC-Seq
- Jit@jit.aber
Jit bookmarked Opera: An optimal genome scaffolding program 2555 days ago

Opera (Optimal Paired-End Read Assembler) is a sequence assembly program (http://en.wikipedia.org/wiki/Sequence_assembly ). It uses information from paired-end or long reads to optimally order and orient contigs assembled from...

https://sourceforge.net/projects/operasf/
- Jit@jit.aber
Jit bookmarked RITA: Rapid identification of high-confidence taxonomic assignments for metagenomic data 2555 days ago

RITA is a standalone software package and Web server for taxonomic assignment of metagenomic sequence reads. By combining homology predictions from BLAST or UBLAST with compositional classifications from a Naive Bayes classifier, RITA is able to...

http://kiwi.cs.dal.ca/Software/RITA
- Jit@jit.aber
Jit created a page SPAdes hybrid genome assembly 2555 days ago

When you have both Illumina and Nanopore data, then SPAdes remains a good option for hybrid assembly - SPAdes was used to produce the B fragilis assembly by Mick Watson’s group. Again, running spades.py will show you the...
Comments
- Rahul Nayak@rahul
  
  Rahul Nayak 2378 days ago
  use SPAdes to assemble the data. SPAdes is a swiss-army knife of genome assembly tools, and by default includes read correction. This takes up lots of RAM, so we are going to skip it. We will also only use 3 kmers to save time:
  ./SPAdes-3.6.2-Linux/bin/spades.py --only-assembler -t 4 -k 21,51,71 -1 SRR2627175_1.fastq.gz -2 SRR2627175_2.fastq.gz --nanopore minion.pass.2D.fastq -o SPAdes_hybrid &
  Use samtools to extract the top contig:
  head -n 1 SPAdes_hybrid/contigs.fasta samtools faidx SPAdes_hybrid/contigs.fasta samtools faidx SPAdes_hybrid/contigs.fasta NODE_1_length_4620446_cov_135.169_ID_22238 > single_contig.fa
  Finally, a quick comparison to the reference:
  sudo apt-get install mummer curl -s "http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&id=NC_000913.3&rettype=fasta&retmode=txt" > NC_000913.3.fa nucmer NC_000913.3.fa single_contig.fa mummerplot -png out.delta display out.png &
- Jit@jit.aber
Jit bookmarked miniasm: very fast OLC-based de novo assembler for noisy long reads 2555 days ago

Miniasm is a very fast OLC-based de novo assembler for noisy long reads. It takes all-vs-all read self-mappings (typically by minimap) as input and outputs an assembly graph in the GFA format. Different from mainstream...

https://github.com/lh3/miniasm
- Jit@jit.aber
Jit bookmarked coursera genome assembly tutorial 2557 days ago

Solutions to Coursera Genome Sequencing (Bioinformatics II)

https://github.com/iansealy/coursera-assembly
- Rahul Nayak@rahul
Rahul Nayak posted to the wire 2557 days ago

Random forest algorithms http://blog.yhat.com/posts/random-forests-in-python.html #forest #random #algo
- Archana Malhotra@archana
Archana Malhotra posted to the wire 2558 days ago

Understanding MinION data https://porecamp.github.io/2016/tutorials/PoreCamp2016-02-MinIONData.pdf #data #understand #learn
- Archana Malhotra@archana
Archana Malhotra created a new bio-script Extract fasta sequence with Ids with Bash script 2558 days ago
- Neel@neelam
Neel bookmarked SIMA C++ Implementation: Simultaneous Multiple Alignment of LC/MS Peak Lists 2558 days ago

This is the c++ implementation for SIMA - Simultaneous Multiple Alignment of LC/MS Peak Lists. The package contains C++ source code as well as two binary files. The latter were tested under various operating systems, including Windows XP SP3 32bit,...

https://hciweb.iwr.uni-heidelberg.de/hci/softwares/sima
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh asked How to resolve this NanoPlot error NanoImportError: No module named concurrent.futures 2559 days ago

NanoPlot []Traceback (most recent call last): File "/usr/local/bin/NanoPlot", line 7, in <module> from nanoplot.NanoPlot import main File "/usr/local/lib/python2.7/dist-packages/nanoplot/NanoPlot.py", line 18, in <module> from nanoget...
- Jit@jit.aber
Jit bookmarked IONiseR: tools for the quality assessment of data produced by Oxford Nanopore’s MinION sequencer 2559 days ago

This package is intended to provide tools for the quality assessment of data produced by Oxford Nanopore’s MinION sequencer. It includes a functions to generate a number plots for examining the statistics that we think will be useful for this...

https://www.bioconductor.org/packages/devel/bioc/vignettes/IONiseR/inst/doc/IONiseR.html
- Jit@jit.aber
Jit bookmarked ONT assembly and Illumina polishing pipeline 2559 days ago

This pipeline performs the following steps: Assembly of nanopore reads using Canu. Polish canu contigs using racon (optional). Map a paired-end Illumina dataset onto the contigs obtained in the previous steps...

https://github.com/nanoporetech/ont-assembly-polish
- Jit@jit.aber
Jit bookmarked poRe: an R package for the visualization and analysis of nanopore sequencing data 2559 days ago

Motivation: The Oxford Nanopore MinION device represents a unique sequencing technology. As a mobile sequencing device powered by the USB port of a laptop, the MinION has huge potential applications. To enable these applications, the...

https://academic.oup.com/bioinformatics/article/31/1/114/2365693
Comments
- Rahul Nayak@rahul
  
  Rahul Nayak 2378 days ago
  We now need to install the poRe dependencies in R, which is very easy:
  R source("http://www.bioconductor.org/biocLite.R") biocLite("rhdf5") install.packages(c("shiny","bit64","data.table","svDialogs")) q()
  R may ask if you want to install into a local library, just say Y and accept defaults. We need to download poRe from sourecforge and we are using version 0.16
  Once downloaded, and back at the Linux command line:
  R CMD INSTALL poRe_0.16.tar.gz
  The fastq extraction scripts for poRe are in github, so let’s go get those:
  git clone https://github.com/mw55309/poRe_scripts.git
  We will assemble using SPAdes, so let’s go get that:
  wget http://spades.bioinf.spbau.ru/release3.6.2/SPAdes-3.6.2-Linux.tar.gz gunzip < SPAdes-3.6.2-Linux.tar.gz | tar xvf -
  Now, we are ready to go. First off, let’s extract the 2D sequence data as FASTQ from the MinION data. Nick’s SQK-MAP-006 data are in the old FAST5 format so we use the script in “old_format”:
  ./poRe_scripts/old_format/extract2D MAP006-1/MAP006-1_downloads/pass/ > minion.pass.2D.fastq &
- Jit@jit.aber
Jit bookmarked TULIP - The Uncorrected Long read Itegration Pipeline 2559 days ago

#Running TULIP (The Uncorrected Long-read Integration Process), version 0.4 late 2016 (European eel) TULIP currently consists of to Perl scripts, tulipseed.perl and tulipbulb.perl. These are very much intended as prototypes, and additional...

https://github.com/Generade-nl/TULIP
- Jit@jit.aber
Jit bookmarked Taxoblast : Taxoblast is a pipeline to identify contamination in genomic sequence 2559 days ago

Modern genome sequencing strategies are highly sensitive to contamination making the detection of foreign DNA sequences an important part of analysis pipelines. Here we use Taxoblast, a simple pipeline with a graphical user interface, for the...

https://sourceforge.net/projects/taxoblast/files/
- Jit@jit.aber
Jit posted to the wire 2559 days ago

Fastq 2 fasta. $ sed -n '1~4s/^@/>/p;2~4p' test.fastq > test.fasta #fastq #fasta #convert
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh bookmarked SLIDESORT-BPR 2562 days ago

Chromosomal rearrangement events are caused by abnormal breaking and rejoining of DNA molecules. They are responsible for many of the cancer related diseases. Detecting the DNA breaking and repairing mechanism, therefore, may offer vital clues about...

https://github.com/ewijaya/slidesort-bpr
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh bookmarked DiscoSnp 2562 days ago

DiscoSnp is designed for discovering all kinds of SNPs (not only isolated ones), as well as insertions and deletions, from raw set(s) of reads. The number of input read sets is not constrained, it can be one, two, or more. No reference genome is...

https://github.com/GATB/DiscoSnp
- Jit@jit.aber
Jit bookmarked CHSMiner: a GUI tool to identify chromosomal homologous segments 2563 days ago

Background The identification of chromosomal homologous segments (CHS) within and between genomes is essential for comparative genomics. Various processes including insertion/deletion and inversion could cause the degeneration of...

https://almob.biomedcentral.com/articles/10.1186/1748-7188-4-2

BOL

Our Sponsors

All Site Activity