<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/43587?offset=30</link>
	<atom:link href="https://bioinformaticsonline.com/related/43587?offset=30" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data</guid>
	<pubDate>Fri, 29 Jun 2018 03:26:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data</link>
	<title><![CDATA[AfterQC: Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data]]></title>
	<description><![CDATA[Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data
AfterQC can simply go through all fastq files in a folder and then output three folders: good, bad and QC folders, which contains good reads, bad reads and the QC results of each fastq file/pair.
Currently it supports processing data from HiSeq 2000/2500/3000/4000, Nextseq 500/550, MiniSeq...and other Illumina 1.8 or newer formats

The author has reimplemented this tool in C++ with multithreading support to make it much faster. The new tool is called fastp and can be found at: https://github.com/OpenGene/fastp . If you prefer a C++ based tool, please use fastp instead.

https://github.com/OpenGene/AfterQC<p>Address of the bookmark: <a href="https://github.com/OpenGene/AfterQC" rel="nofollow">https://github.com/OpenGene/AfterQC</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/36952/getoptspl-file</guid>
	<pubDate>Fri, 15 Jun 2018 04:43:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/36952/getoptspl-file</link>
	<title><![CDATA[getopts.pl file]]></title>
	<description><![CDATA[
<p>SSPACE_longread complain for getopts.pl file. </p>

<p>To resolve this, download and have in SSPACED-Longreads folder. </p>

<p>Cheers :)</p>
]]></description>
	<dc:creator>Jit</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/36952" length="942" type="text/plain" />
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37574/simlord-a-read-simulator-for-third-generation-sequencing-reads</guid>
	<pubDate>Wed, 22 Aug 2018 10:40:27 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37574/simlord-a-read-simulator-for-third-generation-sequencing-reads</link>
	<title><![CDATA[SimLoRD: A read simulator for third generation sequencing reads]]></title>
	<description><![CDATA[<p>SimLoRD is a read simulator for third generation sequencing reads and is currently focused on the Pacific Biosciences SMRT error model.</p>
<p>Reads are simulated from both strands of a provided or randomly generated reference sequence.</p>
<div id="rst-header-features">
<ul>
<li>The reference can be read from a FASTA file or randomly generated with a given GC content. It can consist of several chromosomes, whose structure is respected when drawing reads. (Simulation of genome rearrangements may be incorporated at a later stage.)</li>
<li>The read lengths can be determined in four ways: drawing from a log-normal distribution (typical for genomic DNA), sampling from an existing FASTQ file (typical for RNA), sampling from a a text file with integers (RNA), or using a fixed length</li>
<li>Quality values and number of passes depend on fragment length.</li>
<li>Provided subread error probabilities are modified according to number of passes</li>
<li>Outputs reads in FASTQ format and alignments in SAM format</li>
</ul>
</div><p>Address of the bookmark: <a href="https://bitbucket.org/genomeinformatics/simlord/" rel="nofollow">https://bitbucket.org/genomeinformatics/simlord/</a></p>]]></description>
	<dc:creator>Aaryan Lokwani</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/10925/a-brief-bioinformatics-tutorial</guid>
	<pubDate>Wed, 21 May 2014 12:50:09 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/10925/a-brief-bioinformatics-tutorial</link>
	<title><![CDATA[A Brief Bioinformatics Tutorial]]></title>
	<description><![CDATA[<p>This is about how to use a computer to find what is known about a gene of interest and also how to get new insights about it.</p>
<p>The tutorial is divided in three main parts:</p>
<ul>
<li>In the <strong>Sequence </strong>part, you will see how to look efficiently for a particular protein sequence, how to blast it against the database of your choice to find homologues, how to perform a multiple alignment of the homologues you've selected and how to edit this alignment.</li>
<li>The <strong>Structure </strong>part is about molecular visualization, homology modeling and structural domain prediction.</li>
<li>In the <strong>Function </strong>part, you will be introduced to you 3 useful servers to investigate the function of a protein. i.e. finding interactors, co-expressed genes, see a phylogenetic profile, easily access papers citing your gene etc ...</li>
</ul>
<p>During all the three parts, we will use the <em>S. cerevisiae </em>VPS36 protein as an example.</p><p>Address of the bookmark: <a href="http://www.mrc-lmb.cam.ac.uk/rlw/text/bioinfo_tuto/introduction.html" rel="nofollow">http://www.mrc-lmb.cam.ac.uk/rlw/text/bioinfo_tuto/introduction.html</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/28200/machine-learning</guid>
	<pubDate>Fri, 01 Jul 2016 12:57:12 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/28200/machine-learning</link>
	<title><![CDATA[Machine Learning !!!]]></title>
	<description><![CDATA[<p>In machine learning, computers apply&nbsp;<strong>statistical learning</strong>&nbsp;techniques to automatically identify patterns in data. These techniques can be used to make highly accurate predictions.</p>
<p><em>Keep scrolling.</em>&nbsp;Using a data set about homes, we will create a machine learning model to distinguish homes in New York from homes in San Francisco.</p><p>Address of the bookmark: <a href="http://www.r2d3.us/visual-intro-to-machine-learning-part-1/" rel="nofollow">http://www.r2d3.us/visual-intro-to-machine-learning-part-1/</a></p>]]></description>
	<dc:creator>Gudiya Pal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35798/an-introduction-to-applied-bioinformatics</guid>
	<pubDate>Fri, 02 Mar 2018 04:26:38 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35798/an-introduction-to-applied-bioinformatics</link>
	<title><![CDATA[An Introduction to Applied Bioinformatics]]></title>
	<description><![CDATA[<p>IAB is primarily being developed by&nbsp;<a href="http://caporasolab.us/people/greg-caporaso/">Greg Caporaso</a>(GitHub/Twitter:&nbsp;<a href="https://github.com/gregcaporaso">@gregcaporaso</a>) in the&nbsp;<a href="http://www.caporasolab.us/">Caporaso Lab</a>&nbsp;at&nbsp;<a href="http://www.nau.edu/">Northern Arizona University</a>. You can find information on the courses I teach on&nbsp;<a href="http://www.caporasolab.us/teaching">my teaching website</a>&nbsp;and information on my research and lab on&nbsp;<a href="http://www.caporasolab.us/">my lab website</a>.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="http://readiab.org/" rel="nofollow">http://readiab.org/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/42810/bioinformatics-in-africa-part3-mali</guid>
	<pubDate>Sat, 06 Feb 2021 13:28:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/42810/bioinformatics-in-africa-part3-mali</link>
	<title><![CDATA[Bioinformatics in Africa: Part3 - Mali]]></title>
	<description><![CDATA[<p>International&nbsp;Center&nbsp;for&nbsp;Excellence&nbsp;in&nbsp;Research&nbsp;(ICER):</p><p>The&nbsp;ICER&nbsp;is&nbsp;a&nbsp;research&nbsp;center&nbsp;composed&nbsp;of&nbsp;the&nbsp;following&nbsp;three&nbsp;programs: 1. The&nbsp;Malaria&nbsp;Research&nbsp;and&nbsp;Training&nbsp;Center&nbsp;&shy;&nbsp;Parasitology&nbsp;Group,&nbsp; 2. The&nbsp;Malaria&nbsp;Research&nbsp;and&nbsp;Training&nbsp;Center&nbsp;&shy;&nbsp;Entomology&nbsp;Group&nbsp; 3. The&nbsp;SEREFO&nbsp;Group.</p><p>The&nbsp;first&nbsp;two&nbsp;programs&nbsp;develop&nbsp;biomedical&nbsp;researches&nbsp;in&nbsp;malaria,&nbsp;Filariasis&nbsp;and&nbsp;Leishmaniasis.&nbsp;The&nbsp; third&nbsp;program&nbsp;develops&nbsp;biomedical&nbsp;researches&nbsp;in&nbsp;tuberculosis&nbsp;and&nbsp;HIV.</p><p>Bioinformatics&nbsp;was&nbsp;introduced&nbsp;recently&nbsp;to&nbsp;the&nbsp;ICER&nbsp;and&nbsp;is&nbsp;constantly&nbsp;growing.&nbsp;The&nbsp;ICER&nbsp;has&nbsp;one&nbsp; team,&nbsp;headed&nbsp;by&nbsp;Sidy&nbsp;SOUMARE,&nbsp;which&nbsp;supports&nbsp;the&nbsp;three&nbsp;programmes&nbsp;in&nbsp;all&nbsp;their&nbsp;needs&nbsp;in&nbsp; informatics&nbsp;and&nbsp;bioinformatics.&nbsp;This&nbsp;team&nbsp;can&nbsp;beneficiate&nbsp;from&nbsp;some&nbsp;computational&nbsp;facilities&nbsp;(4&nbsp; blast&nbsp;servers,&nbsp;15&nbsp;other&nbsp;servers&nbsp;and&nbsp;around&nbsp;200&nbsp;terminals),&nbsp;but&nbsp;the&nbsp;ICER&nbsp;staff&nbsp;needs&nbsp;some&nbsp;training&nbsp;in&nbsp; order&nbsp;to&nbsp;be&nbsp;able&nbsp;to&nbsp;administrate&nbsp;these&nbsp;facilities.</p><p>Research&nbsp;Interest&nbsp;and&nbsp;Activities: The&nbsp;following&nbsp;are&nbsp;the&nbsp;present&nbsp;areas&nbsp;of&nbsp;research&nbsp;interest: 1. Functional&nbsp;genomics 2. Analysis&nbsp;of&nbsp;microarray&nbsp;data 3. Interaction&nbsp;between&nbsp;the&nbsp;vector&nbsp;and&nbsp;the&nbsp;parasite.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/42815/bioinformatics-in-africa-part7-tunisia</guid>
	<pubDate>Sat, 06 Feb 2021 21:25:09 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/42815/bioinformatics-in-africa-part7-tunisia</link>
	<title><![CDATA[Bioinformatics in Africa: Part7 - Tunisia]]></title>
	<description><![CDATA[<p>Institut Pasteur de Tunis (IPT):<br />The IPT is a research institution founded in 1883. IPT is under the supervision of the Ministry of &nbsp;Health and is part of the Universit&eacute; El Manar of Tunis (Ministry of high Education). The missions &nbsp;of the institute are: Public Health Laboratory activities (PHL), Research on infectious diseases, and &nbsp;R/D on vaccines. Research programs are mainly oriented towards local health problems such as &nbsp;leishmaniais, viral hepatitis, and scorpion venoms. The &nbsp; group &nbsp; of &nbsp; Bioinformatics &nbsp; and &nbsp; Modelling &nbsp; of &nbsp; the &nbsp; IPT &nbsp; is &nbsp; hosted &nbsp; by &nbsp; the &nbsp;Laboratoire &nbsp;d&rsquo;Immunopathologie Vaccinologie et G&eacute;n&eacute;tique Mol&eacute;culaire &nbsp;(LIVGM), and exists since the &nbsp;beginning of 2005. Its present research activities include: genome annotation, EST clustering and &nbsp;modelling of the host/parasite response to Leishmania infection. It consists of two senior scientists, &nbsp;two PhD students and one MSc student</p><p>Centre&nbsp;de&nbsp;Biotechnology&nbsp;de&nbsp;Sfax&nbsp;(CBS):<br />Bioinformatics&nbsp;activity&nbsp;started&nbsp;at&nbsp;CBS&nbsp;in&nbsp;2001&nbsp;with&nbsp;the&nbsp;setting&shy;up&nbsp;of&nbsp;a&nbsp;research&nbsp;and&nbsp;service&nbsp;unit&nbsp;of&nbsp; bioinformatics.&nbsp;This&nbsp;unit&nbsp;currently&nbsp;includes&nbsp;one&nbsp;senior&nbsp;researcher,&nbsp;one&nbsp;engineer&nbsp;and&nbsp;four&nbsp;Phd&nbsp; students.&nbsp;Activities&nbsp;include&nbsp;sequence&nbsp;annotation&nbsp;(service)&nbsp;and&nbsp;three&nbsp;research&nbsp;programs:&nbsp;ab&nbsp;initio&nbsp; prediction&nbsp;of&nbsp;short&nbsp;eukaryote&nbsp;genes,&nbsp;statistical&nbsp;modelling&nbsp;by&nbsp;Bayesian&nbsp;networks&nbsp;approach&nbsp;of&nbsp;signal&nbsp; transduction&nbsp;pathways&nbsp;and&nbsp;statistical&nbsp;analysis&nbsp;of&nbsp;human&nbsp;sequence&nbsp;variation&nbsp;data&nbsp;(haplotype&nbsp; reconstruction&nbsp;and&nbsp;linkage&nbsp;disequilibrium).&nbsp;Activities&nbsp;of&nbsp;the&nbsp;Bioinformatics&nbsp;unit&nbsp;could&nbsp;be&nbsp;found&nbsp;at&nbsp; the&nbsp;website:&nbsp;http://www.cbs.rnrt.tn/&nbsp;and&nbsp;the&nbsp;research&nbsp;activity&nbsp;report&nbsp;is&nbsp;available&nbsp;under&nbsp;request&nbsp;to&nbsp; Bioinformatics@cbs.rnrt.tn.&nbsp;Although&nbsp;the&nbsp;computing&nbsp;facilities&nbsp;are&nbsp;good,&nbsp;there&nbsp;is&nbsp;still&nbsp;a&nbsp;need&nbsp;for&nbsp; trained&nbsp;human&nbsp;resources&nbsp;to&nbsp;strengthen&nbsp;bioinformatics&nbsp;capacities&nbsp;at&nbsp;CBS,&nbsp;particularly&nbsp;in&nbsp;structural&nbsp; bioinformatics.</p><p>Web site and links: http://www.cbs.rnrt.tn</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43550/basic-structure-of-snakemake-pipeline-run</guid>
	<pubDate>Thu, 14 Oct 2021 07:01:38 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43550/basic-structure-of-snakemake-pipeline-run</link>
	<title><![CDATA[Basic Structure of Snakemake Pipeline Run !]]></title>
	<description><![CDATA[<div>/user/snakemake-demo$ ls</div><div>config.json data envs scripts slurm-240702.out Snakefile</div><ul>
<li>data = mock data for the snakefile to use</li>
<li>Snakefile = name of the snakemake &ldquo;formula&rdquo; file
<ul>
<li>Note: The default file that snakemake looks for in the current working directory is the&nbsp;<code>Snakefile</code>. If you would like to override that you can specify it following the&nbsp;<code>-s</code>
<ul>
<li><code>snakemake -s snakefile.py</code></li>
</ul>
</li>
</ul>
</li>
<li>envs = directory for storing the conda environments that the workflow will use.</li>
<li>scripts = directory for storing python scripts called by the snakemake formula.</li>
<li>config.json = json format file with extra parameters for our snakemake file to use.</li>
<li>cluster.json = json format file with specification for running on the HPC</li>
<li>samples.txt = file we will use later relating to the config.json file.</li>
</ul><p><span>Run the snakemake file as a dry run (the example workflow shown above).</span></p><ul>
<li>This will build a DAG of the jobs to be run without actually executing them.</li>
<li><code>snakemake --dry-run</code></li>
</ul><p>User can e<span>xecute rules of interest.</span></p><ul>
<li><code>snakemake --dry-run all</code>&nbsp;VS.&nbsp;<code>snakemake --dry-run call</code>&nbsp;VS.&nbsp;<code>snakemake --dry-run bwa</code></li>
</ul><p><span>Run the snakemake file in order to produce an image of the DAG of jobs to be run.</span></p><ul>
<li><code>snakemake --dag | dot -Tsvg &gt; dag.svg</code>&nbsp;OR&nbsp;<code>snakemake --dag | dot -Tsvg &gt; dag.svg</code></li>
</ul><p>Run the snakemake (this time not as a dry run)</p><ol>
<li><code>snakemake --use-conda</code></li>
</ol>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44543/seeing-theory-and-learn</guid>
	<pubDate>Tue, 04 Jun 2024 00:31:54 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44543/seeing-theory-and-learn</link>
	<title><![CDATA[Seeing Theory and Learn]]></title>
	<description><![CDATA[<p>Seeing Theory was created by Daniel Kunin while an undergraduate at Brown University. The goal of this website is to make statistics more accessible through interactive visualizations (designed using Mike Bostock&rsquo;s JavaScript library D3.js).</p><p>Address of the bookmark: <a href="https://seeing-theory.brown.edu/" rel="nofollow">https://seeing-theory.brown.edu/</a></p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>

</channel>
</rss>