BOL: All Site Activity

All Site Activity

- Rahul Agarwal@raag
Rahul Agarwal posted to the wire 2532 days ago

https://support.bioconductor.org/t/Jobs/ #bioinfo #jobs
- Jit@jit.aber
Jit published a news post 1mb long DNA with Nanopore technology 2532 days ago

Longest read of 1mb achievements with Nanopore
- Rahul Nayak@rahul
Rahul Nayak published a blog post String graph based genome assembly software and tools ! 2532 days ago

In graph theory, a string graph is an intersection graph of curves in the plane; each curve is called a "string". String graphs were first proposed by E. W. Myers in a 2005 publication.
- Jit@jit.aber
Jit bookmarked NanoSim: nanopore sequence read simulator based on statistical characterization. 2534 days ago

NanoSim, a fast and scalable read simulator that captures the technology-specific features of ONT data and allows for adjustments upon improvement of nanopore sequencing technology. The first step of NanoSim is read characterization, which provides...

http://www.bcgsc.ca/platform/bioinfo/software/nanosim
- Jit@jit.aber
Jit created a page Run miniasm assembler on nanopore reads ! 2534 days ago

Miniasm is a very fast OLC-based de novo assembler for noisy long reads. It takes all-vs-all read self-mappings (typically by minimap) as input and outputs an assembly graph in the GFA format. Different from mainstream...
Comments
- Rahul Nayak@rahul
  
  Rahul Nayak 2371 days ago
  Here’s the quick and dirty of what was done:
  1 Run minimap:
  This uses a pre-built set of defaults (the ava-pb in the code below) for analyzing PacBio data. Minimap only accepts two FASTQ files and you need to map your FASTQ file against itself. So, if you have multiple FASTQ sequencing files, you have to concatenate them into a single file prior to running minimap.
  minimap2 -x ava-pb -t 23 \ 20170911_oly_pacbio_cat.fastq \ 20170911_oly_pacbio_cat.fastq \ > 20170911_minimap2_pacbio_oly.paf
  2 Run miniasm:
  This uses your concatenated FASTQ file and the PAF file output from the miniasm step. The code below is taken from the example provided in the miniasm documentation; there are other options available.
  miniasm \ -f \ /home/data/20170911_oly_pacbio_cat.fastq /home/data/20170911_minimap2_pacbio_oly.paf > /home/data/20170918_oly_pacbio_miniasm_reads.gfa
  3 Convert miniasm output GFA to FASTA
  The FASTA file is needed to re-run minimap in Step 4 below.
  awk '$1 ~/S/ {print ">"$2"\n"$3}' 20170918_oly_pacbio_miniasm_reads.gfa > 20170918_oly_pacbio_miniasm_reads.fasta
  4 Run minimap with default settings
  Using the default settings maps the FASTQ reads back to the contigs (the PAF file) created in the fist step. These mappings are required for Racon assembly (Step 5).
  minimap2 \ -t 23 \ 20170918_oly_pacbio_miniasm_reads.fasta 20170905_minimap2_pacibio_oly.paf > 20170918_minimap2_mapping_fasta_oly_pacbio.paf
  5 Run racon
  The output file is the FASTA file listed below.
  racon -t 24 \ 20170911_oly_pacbio_cat.fastq \ 20170918_oly_pacbio_minimap_mappings.paf \ 20170918_oly_pacbio_miniasm_assembly.gfa \ 20170918_oly_pacbio_racon1_consensus.fasta
  from Sam’s Notebook http://ift.tt/2fKBPUN
- Rahul Nayak@rahul
  
  Rahul Nayak 2371 days ago
  
  Must read paper https://genome.cshlp.org/content/27/5/737.full
- Rahul Nayak@rahul
  
  Rahul Nayak 1000 days ago
  minimap2 –x ava-ont \ ../../trimming_practical/nanofilt/nanofilt_trimmed.fastq \ ../../trimming_practical/nanofilt/nanofilt_trimmed.fastq \ | gzip -1 > ./minimap.paf.gz
  
  miniasm -f \ ../../trimming_practical/nanofilt/nanofilt_trimmed.fastq \ ./minimap.paf.gz > miniasm.gfa
  
  awk ’/^S/{print “>”$2”\n”$3}’ miniasm.gfa > miniasm.fasta
  
  assembly-stats ./miniasm.fasta
  
  dnadiff -p dnadiff ~/course_data/precompiled/chr17.fasta miniasm.fasta
+2 more
- Jit@jit.aber
Jit bookmarked Biological file format tutorial 2534 days ago

This section explains some of the commonly used file formats in bioinformatics. The information provided here is basic and designed to help users to distinguish the difference between different formats. Please refer user manual or other information...

https://bioinformatics.uconn.edu/resources-and-events/tutorials/file-formats-tutorial/
- Jit@jit.aber
Jit created a new bio-script Convert fastq to fasta in Perl 2534 days ago
- Jit@jit.aber
Jit posted to the wire 2534 days ago

Bioinformatics blog http://onetipperday.sterding.com/ #Blog
- Rahul Agarwal@raag
Rahul Agarwal posted a new ad in the ResearchLabs Prof. Dr. rer. nat. Jeanette Erdmann's Lab 2535 days ago
- Jit@jit.aber
Jit created a new bio-script Loop over with all files in a directory in bash 2535 days ago
- Radha Agarkar@radhaagarkar
Radha Agarkar created a page Tools for bacterial whole genome annotation 2535 days ago

RAST – Web tool (upload contigs), uses the subsystems in the SEED database and provides detailed annotation and pathway analysis. Takes several hours per genome but I think this is the best way to get a high quality annotation...
- Jit@jit.aber
Jit posted to the wire 2535 days ago

Beginner’s guide to comparative bacterial genome analysis using next-generation sequence data https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3630013/ #Bacteria #NGS #Comparative #Genomics
- Rahul Agarwal@raag
Rahul Agarwal posted a new ad in the ResearchLabs Genomic approaches to study global regulation of gene expression in the mouse immune system 2536 days ago
- Robert M Willioms@robert
Robert M Willioms bookmarked SVfinder: Tool for detecting genomic rearrangement form DNA-seq data 2537 days ago

SVfinder provides genome-wide detection of structural variants from next generation paired-end sequencing reads.

https://github.com/cauyrd/SVfinder
- Jit@jit.aber
Jit created a new bio-script Clump Finding Problem Solved with Perl 2538 days ago
- Jit@jit.aber
Jit posted to the wire 2538 days ago

#Remove a #sequence by id from multifasta: cat vaga.fa | awk '{if (substr($0,1) == ">scaffold_1 1087316 bp") censor=1; else if (substr($0,1,1) == ">") censor=0; if (censor==0) print $0}' > fixed.fasta
- Jit@jit.aber
Jit posted to the wire 2538 days ago

awk 'BEGIN {RS = ">" ; FS = "\n" ; ORS = ""} {if ($2) print ">"$0}' all_p_ctg.fa > all_p_ctg_CORRECTED.fa #remove #empty #clean #fasta
- Jit@jit.aber
Jit bookmarked TeachEnG: Teaching Engine for Genomics 2538 days ago

TeachEnG (pronounced “teaching”), a Teaching Engine for Genomics, provides educational games to help students and researchers understand key bioinformatics concepts. The current version includes interactive modules for sequence alignment...

http://teacheng.illinois.edu/
- Strand@Strand
Strand commented on the news Webinar on Unique Molecular Identifier (UMI)-powered Ultra-sensitive Variant Calling using Strand... 2539 days ago

Hurry! Limited seats available. To attend, register at http://www.strand-ngs.com/webinar_registration
- Jit@jit.aber
Jit bookmarked Mash: fast genome and metagenome distance estimation using MinHash 2539 days ago

Mash is normally distributed as a dependency-free binary for Linux or OSX (see https://github.com/marbl/Mash/releases). This source distribution is intended for other operating systems or for development. Mash requires c++11 to build, which is...

https://github.com/marbl/Mash/releases

BOL

Our Sponsors

All Site Activity

1 Run minimap:

2 Run miniasm:

3 Convert miniasm output GFA to FASTA

4 Run minimap with default settings

5 Run racon