<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/36857?offset=0</link>
	<atom:link href="https://bioinformaticsonline.com/related/36857?offset=0" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</guid>
	<pubDate>Thu, 31 Aug 2023 02:43:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</link>
	<title><![CDATA[Steps to find all the repeats in the genome !]]></title>
	<description><![CDATA[<div><p>To find repeats in a genome from 2 to 9 length using a Perl script, you can use the RepeatMasker tool with the "--length" option<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. Here's a step-by-step guide:</p></div><div><ol>
<li>Install RepeatMasker: First, you need to install RepeatMasker on your system. You can download it from the RepeatMasker website<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
</ol></div><div><ol>
<li>Prepare the genome sequence: Make sure you have the genome sequence in a FASTA file format. Let's assume the file is named "genome.fasta".</li>
</ol><blockquote><p>./RepeatMasker -pa &lt;number_of_processors&gt; -nolow -norna -no_is -div &lt;divergence_value&gt; -lib RepeatMaskerLib.embl -gff -xsmall -small -poly -species &lt;species_name&gt; -dir &lt;output_directory&gt; -length &lt;min_length&gt;-&lt;max_length&gt; genome.fasta</p></blockquote><div><p>Replace the following placeholders with appropriate values:</p><ul>
<li><code>&lt;number_of_processors&gt;</code>: The number of processors/threads you want to use for parallel processing.</li>
<li><code>&lt;divergence_value&gt;</code>: The divergence value for the species you are analyzing. You can find divergence values for different species in the RepeatMasker documentation<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
<li><code>&lt;species_name&gt;</code>: The name of the species you are analyzing.</li>
<li><code>&lt;output_directory&gt;</code>: The directory where you want the output files to be saved.</li>
<li><code>&lt;min_length&gt;</code>&nbsp;and&nbsp;<code>&lt;max_length&gt;</code>: The minimum and maximum lengths of the repeats you want to find (in this case, 2 and 9).</li>
</ul></div><div><ol>
<li>Analyze the output: RepeatMasker will generate several output files, including a .out file. You can parse this file to extract the information you need. There is a Perl tool called "one_code_to_find_them_all.pl" that can help you parse RepeatMasker output files<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. You can download it from the source provided.</li>
</ol></div><div><ol>
<li>Use the provided Perl script: Once you have the "one_code_to_find_them_all.pl" script, you can run it to conveniently parse the RepeatMasker output files. Here's an example of how to use it:</li>
</ol><blockquote><p>perl one_code_to_find_them_all.pl --rm &lt;RepeatMasker_out_file&gt; --length &lt;length_file&gt;</p></blockquote></div><p>&nbsp;</p></div><div><div><p>Replace&nbsp;<code>&lt;RepeatMasker_out_file&gt;</code>&nbsp;with the path to your RepeatMasker .out file, and&nbsp;<code>&lt;length_file&gt;</code>&nbsp;with the path to a file containing the lengths of the reference elements.</p></div><div><p>This script will generate several output files, including .log.txt and .copynumber.csv, which contain quantitative information about the identified repeat elements.</p></div><div><p>Remember to adjust the parameters and options according to your specific needs and the characteristics of your genome.</p></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34490/collinearity-scripts-to-parse-and-analyse-mcscanx-collinearity-output</guid>
	<pubDate>Wed, 29 Nov 2017 16:47:54 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34490/collinearity-scripts-to-parse-and-analyse-mcscanx-collinearity-output</link>
	<title><![CDATA[collinearity: scripts to parse and analyse MCScanX collinearity output]]></title>
	<description><![CDATA[<p><span>scripts to parse and analyse MCScanX collinearity output</span></p><p>Address of the bookmark: <a href="https://github.com/reubwn/collinearity" rel="nofollow">https://github.com/reubwn/collinearity</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33693/circleator</guid>
	<pubDate>Sun, 25 Jun 2017 18:04:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33693/circleator</link>
	<title><![CDATA[Circleator]]></title>
	<description><![CDATA[<p>The Charm City Circleator--or Circleator for short--is a Perl-based visualization tool developed at the&nbsp;<a href="http://igs.umaryland.edu/">Institute for Genome Sciences</a>&nbsp;in the University of Maryland's School of Medicine. Circleator produces circular plots of genome-associated data, like this one:</p>
<p><a href="https://camo.githubusercontent.com/0b414f050a7dcb672386932ee0cd83e5f42d2271/687474703a2f2f6a6f6e617468616e63726162747265652e6769746875622e696f2f436972636c6561746f722f696d616765732f43503030323732352d322d3432302e706e673f7261773d74727565" target="_blank"><img src="https://camo.githubusercontent.com/0b414f050a7dcb672386932ee0cd83e5f42d2271/687474703a2f2f6a6f6e617468616e63726162747265652e6769746875622e696f2f436972636c6561746f722f696d616765732f43503030323732352d322d3432302e706e673f7261773d74727565" alt="Sample Circleator image" title="Sample Circleator Image" style="border: 0px;"></a></p>
<p>Common uses of the tool include:</p>
<ul>
<li>Displaying the sequence and/or genes in a&nbsp;<a href="http://www.ncbi.nlm.nih.gov/genbank/">GenBank</a>&nbsp;flat file.</li>
<li>Highlighting differences and/or similarities in gene content between related organisms.</li>
<li>Comparing SNPs and indels between closely-related strains or serovars.</li>
<li>Comparing gene expression values across multiple samples or timepoints.</li>
<li>Visualizing coverage plots of RNA-Seq read alignments.</li>
</ul>
<h3><a href="https://github.com/jonathancrabtree/Circleator#key-features"></a>Key Features</h3>
<p>Circleator...</p>
<ul>
<li>Builds on&nbsp;<a href="http://www.bioperl.org/">BioPerl</a>&nbsp;and the input file formats that it supports, including:
<ul>
<li><a href="http://www.ncbi.nlm.nih.gov/genbank/">GenBank</a>&nbsp;flat files, GFF, FASTA</li>
</ul>
</li>
<li>Accepts a number of other commonly-used datatypes and file formats:
<ul>
<li><a href="http://bsr.igs.umaryland.edu/">BSR</a>&nbsp;and&nbsp;<a href="http://tandem.bu.edu/trf/trf.html">TRF</a>&nbsp;output,&nbsp;<a href="http://samtools.sourceforge.net/">SAM/BAM</a>&nbsp;files,&nbsp;<a href="http://vcftools.sourceforge.net/">VCF</a>-encoded SNPs, tab-delimited files</li>
</ul>
</li>
<li>Outputs publication-ready figures in the&nbsp;<a href="http://www.w3.org/Graphics/SVG/">SVG</a>&nbsp;(Scalable Vector Graphics) format.</li>
<li>Requires only a single configuration file whose layout mirrors that of the figure itself.
<ul>
<li>Predefined configuration files and "track" types are supplied for common datasets.</li>
<li>Advanced features allow limited analyses to be performed as a figure is drawn.</li>
</ul>
</li>
<li>Includes an extensive set of regression tests.</li>
<li>Offers a prototype web-based GUI (under the "Ringmaster" project.)</li>
</ul>
<p>https://github.com/jonathancrabtree/Circleator</p><p>Address of the bookmark: <a href="https://github.com/jonathancrabtree/Circleator" rel="nofollow">https://github.com/jonathancrabtree/Circleator</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33866/perlbrew-admin-free-perl-installation-management-tool</guid>
	<pubDate>Wed, 12 Jul 2017 03:53:08 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33866/perlbrew-admin-free-perl-installation-management-tool</link>
	<title><![CDATA[Perlbrew: admin-free perl installation management tool.]]></title>
	<description><![CDATA[<p>perlbrew is an admin-free perl installation management tool. The latest version is 0.79, read the release note:&nbsp;<a href="https://perlbrew.pl/Release-0.79.html">Release 0.79</a>.&nbsp;</p>
<p>Copy &amp; Paste this line into your terminal:</p>
<pre><code>\curl -L https://install.perlbrew.pl | bash
</code></pre>
<p>Or, if your system does not have curl but something else:</p>
<pre><code># Linux
\wget -O - https://install.perlbrew.pl | bash

# FreeBSD
\fetch -o- https://install.perlbrew.pl | sh
</code></pre>
<p>If you prefer to install with cpan, there are two steps:</p>
<pre><code>sudo cpan App::perlbrew
perlbrew init
</code></pre>
<p>If it is installed with cpan, the perlbrew executable should be installed as&nbsp;<code>/usr/bin/perlbrew</code>&nbsp;or&nbsp;<code>/usr/local/bin/perlbrew</code>. For all users who want to use perlbrew, a prior&nbsp;<code>perlbrew init</code>&nbsp;needs to be executed.</p>
<p>The default perlbrew root directory is&nbsp;<code>~/perl5/perlbrew</code>, which can be changed by setting&nbsp;<code>PERLBREW_ROOT</code>environment variable before the installation and initialization. For more advanced installation process, please read&nbsp;<a href="http://metacpan.org/module/App::perlbrew">the perlbrew document</a>.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://perlbrew.pl/" rel="nofollow">https://perlbrew.pl/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</guid>
	<pubDate>Tue, 07 Aug 2018 10:01:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</link>
	<title><![CDATA[VCFtools: perform common tasks with VCF files such as file validation, file merging, intersecting, complements]]></title>
	<description><![CDATA[<p>VCFtools contains a Perl API (<a href="http://vcftools.sourceforge.net/perl_module.html#Vcf.pm">Vcf.pm</a>) and a number of Perl scripts that can be used to perform common tasks with VCF files such as file validation, file merging, intersecting, complements, etc. The Perl tools support all versions of the VCF specification (3.2, 3.3, 4.0, 4.1 and 4.2), nevertheless, the users are encouraged to use the latest versions VCFv4.1 or VCFv4.2. The VCFtools in general have been used mainly with diploid data, but the Perl tools aim to support polyploid data as well. Run any of the Perl scripts with the&nbsp;<strong>--help</strong>&nbsp;switch to obtain more help.</p>
<p>Many of the&nbsp;<strong>Perl scripts require that the VCF files are compressed by&nbsp;<span>bgzip</span>&nbsp;and indexed by&nbsp;<span>tabix</span></strong>&nbsp;(both tools are part of the tabix package, available for&nbsp;<a href="https://sourceforge.net/projects/samtools/files/tabix/">download here</a>). The VCF files can be compressed and indexed using the following commands</p>
<p>bgzip my_file.vcf<br>tabix -p vcf my_file.vcf.gz</p>
<p>&nbsp;</p>
<p>http://vcftools.sourceforge.net/perl_module.html</p><p>Address of the bookmark: <a href="http://vcftools.sourceforge.net/perl_module.html" rel="nofollow">http://vcftools.sourceforge.net/perl_module.html</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34543/acana-an-accurate-and-consistent-alignment-tool-for-dna-sequences</guid>
	<pubDate>Wed, 06 Dec 2017 09:45:29 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34543/acana-an-accurate-and-consistent-alignment-tool-for-dna-sequences</link>
	<title><![CDATA[ACANA: An accurate and consistent alignment tool for DNA sequences]]></title>
	<description><![CDATA[<p><span>ACANA is an accurate and consistent alignment tool for DNA sequences. ACANA is specifically designed for aligning sequences that share only some moderately conserved regions and/or have a high frequency of long insertions or deletions. It attempts to combine the best of local and global alignments algorithms in searching for evolutionarily related regions of sequences in order to achieve the best alignment. ACANA is also robust to the small changes of alignment parameters, particularly the gap extension score. As an accurate alignment tool, ACANA is particularly useful in comparative sequence analysis for identifying conserved functional regulatory elements.</span></p><p>Address of the bookmark: <a href="https://www.niehs.nih.gov/research/resources/software/biostatistics/acana/index.cfm" rel="nofollow">https://www.niehs.nih.gov/research/resources/software/biostatistics/acana/index.cfm</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34734/smash-an-alignment-free-tool-to-find-and-visualise-rearrangements-between-pairs-of-dna-sequences</guid>
	<pubDate>Thu, 21 Dec 2017 08:26:57 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34734/smash-an-alignment-free-tool-to-find-and-visualise-rearrangements-between-pairs-of-dna-sequences</link>
	<title><![CDATA[SMASH: An alignment-free tool to find and visualise rearrangements between pairs of DNA sequences]]></title>
	<description><![CDATA[<p style="text-align: justify;"><span>SMASH is a completely alignment-free method to find and visualise rearrangements between pairs of DNA sequences</span>. The detection is based on&nbsp;<span>relative compression</span>, namely using a FCM, also known as Markov model, of high context order (typically 20). The method has been approached with a tool (also called SMASH). For visualization, SMASH outputs a SVG image, with an ideogram output architecture, where the patterns are represented with several HSV values (only value varies). The following image, illustrating the information maps between human and chimpanzee for the several chromosomes, depicts an example:</p>
<p><a href="https://github.com/pratas/smash/blob/master/imgs/HC.png" target="_blank"><img src="https://github.com/pratas/smash/raw/master/imgs/HC.png" alt="ScreenShot" style="border: 0px;"></a></p>
<p>&nbsp;</p>
<h2>&nbsp;</h2><p>Address of the bookmark: <a href="https://github.com/pratas/smash" rel="nofollow">https://github.com/pratas/smash</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36271/heap-a-highly-sensitive-and-accurate-snp-detection-tool-for-low-coverage-high-throughput-sequencing-data</guid>
	<pubDate>Thu, 19 Apr 2018 08:06:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36271/heap-a-highly-sensitive-and-accurate-snp-detection-tool-for-low-coverage-high-throughput-sequencing-data</link>
	<title><![CDATA[Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data]]></title>
	<description><![CDATA[<p><span>Heap, that enables robustly sensitive and accurate calling of SNPs, particularly with a low coverage NGS data, which must be aligned to the reference genome sequences in advance. To reduce false positive SNPs, Heap determines genotypes and calls SNPs at each site except for sites at the both end of reads or containing a minor allele supported by only one read. Performance comparison with existing tools showed that Heap achieved the highest F-scores with low coverage (7X) restriction-site associated DNA sequencing reads of sorghum and rice individuals. This will facilitate cost-effective GWAS and GP studies in this NGS era. Code and documentation of Heap are freely available from&nbsp;</span><a href="https://github.com/meiji-bioinf/heap">https://github.com/meiji-bioinf/heap</a><span>&nbsp;and our web site (</span><a href="http://bioinf.mind.meiji.ac.jp/lab/en/tools.html">http://bioinf.mind.meiji.ac.jp/lab/en/tools.html</a><span>).</span></p><p>Address of the bookmark: <a href="https://github.com/meiji-bioinf/heap" rel="nofollow">https://github.com/meiji-bioinf/heap</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36812/porechop-tool-for-finding-and-removing-adapters-from-oxford-nanopore-reads</guid>
	<pubDate>Tue, 29 May 2018 07:33:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36812/porechop-tool-for-finding-and-removing-adapters-from-oxford-nanopore-reads</link>
	<title><![CDATA[Porechop:  tool for finding and removing adapters from Oxford Nanopore reads]]></title>
	<description><![CDATA[<p>Porechop is a tool for finding and removing adapters from <a href="https://nanoporetech.com/">Oxford Nanopore</a> reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity.</p>
<p>Porechop also supports demultiplexing of Nanopore reads that were barcoded with the <a href="https://store.nanoporetech.com/native-barcoding-kit-1d.html">Native Barcoding Kit</a>, <a href="https://store.nanoporetech.com/pcr-barcoding-kit-96.html">PCR Barcoding Kit</a> or <a href="https://store.nanoporetech.com/rapid-barcoding-sequencing-kit.html">Rapid Barcoding Kit</a>.</p><p>Address of the bookmark: <a href="https://github.com/rrwick/Porechop" rel="nofollow">https://github.com/rrwick/Porechop</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36907/higlass-a-tool-for-exploring-genomic-contact-matrices-and-tracks</guid>
	<pubDate>Mon, 11 Jun 2018 09:44:49 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36907/higlass-a-tool-for-exploring-genomic-contact-matrices-and-tracks</link>
	<title><![CDATA[HiGlass: a tool for exploring genomic contact matrices and tracks.]]></title>
	<description><![CDATA[HiGlass is a tool for exploring genomic contact matrices and tracks. Please take a look at the examples and documentation for a description of the ways that it can be configured to explore and compare contact matrices. To load private data, HiGlass can be run locally within a Docker container. The HiC data in the examples below is from Rao et al. (2014)

http://higlass.io/<p>Address of the bookmark: <a href="http://higlass.io/" rel="nofollow">http://higlass.io/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>