<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/36857?</link>
	<atom:link href="https://bioinformaticsonline.com/related/36857?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</guid>
	<pubDate>Thu, 31 Aug 2023 02:43:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</link>
	<title><![CDATA[Steps to find all the repeats in the genome !]]></title>
	<description><![CDATA[<div><p>To find repeats in a genome from 2 to 9 length using a Perl script, you can use the RepeatMasker tool with the "--length" option<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. Here's a step-by-step guide:</p></div><div><ol>
<li>Install RepeatMasker: First, you need to install RepeatMasker on your system. You can download it from the RepeatMasker website<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
</ol></div><div><ol>
<li>Prepare the genome sequence: Make sure you have the genome sequence in a FASTA file format. Let's assume the file is named "genome.fasta".</li>
</ol><blockquote><p>./RepeatMasker -pa &lt;number_of_processors&gt; -nolow -norna -no_is -div &lt;divergence_value&gt; -lib RepeatMaskerLib.embl -gff -xsmall -small -poly -species &lt;species_name&gt; -dir &lt;output_directory&gt; -length &lt;min_length&gt;-&lt;max_length&gt; genome.fasta</p></blockquote><div><p>Replace the following placeholders with appropriate values:</p><ul>
<li><code>&lt;number_of_processors&gt;</code>: The number of processors/threads you want to use for parallel processing.</li>
<li><code>&lt;divergence_value&gt;</code>: The divergence value for the species you are analyzing. You can find divergence values for different species in the RepeatMasker documentation<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
<li><code>&lt;species_name&gt;</code>: The name of the species you are analyzing.</li>
<li><code>&lt;output_directory&gt;</code>: The directory where you want the output files to be saved.</li>
<li><code>&lt;min_length&gt;</code>&nbsp;and&nbsp;<code>&lt;max_length&gt;</code>: The minimum and maximum lengths of the repeats you want to find (in this case, 2 and 9).</li>
</ul></div><div><ol>
<li>Analyze the output: RepeatMasker will generate several output files, including a .out file. You can parse this file to extract the information you need. There is a Perl tool called "one_code_to_find_them_all.pl" that can help you parse RepeatMasker output files<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. You can download it from the source provided.</li>
</ol></div><div><ol>
<li>Use the provided Perl script: Once you have the "one_code_to_find_them_all.pl" script, you can run it to conveniently parse the RepeatMasker output files. Here's an example of how to use it:</li>
</ol><blockquote><p>perl one_code_to_find_them_all.pl --rm &lt;RepeatMasker_out_file&gt; --length &lt;length_file&gt;</p></blockquote></div><p>&nbsp;</p></div><div><div><p>Replace&nbsp;<code>&lt;RepeatMasker_out_file&gt;</code>&nbsp;with the path to your RepeatMasker .out file, and&nbsp;<code>&lt;length_file&gt;</code>&nbsp;with the path to a file containing the lengths of the reference elements.</p></div><div><p>This script will generate several output files, including .log.txt and .copynumber.csv, which contain quantitative information about the identified repeat elements.</p></div><div><p>Remember to adjust the parameters and options according to your specific needs and the characteristics of your genome.</p></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34490/collinearity-scripts-to-parse-and-analyse-mcscanx-collinearity-output</guid>
	<pubDate>Wed, 29 Nov 2017 16:47:54 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34490/collinearity-scripts-to-parse-and-analyse-mcscanx-collinearity-output</link>
	<title><![CDATA[collinearity: scripts to parse and analyse MCScanX collinearity output]]></title>
	<description><![CDATA[<p><span>scripts to parse and analyse MCScanX collinearity output</span></p><p>Address of the bookmark: <a href="https://github.com/reubwn/collinearity" rel="nofollow">https://github.com/reubwn/collinearity</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33693/circleator</guid>
	<pubDate>Sun, 25 Jun 2017 18:04:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33693/circleator</link>
	<title><![CDATA[Circleator]]></title>
	<description><![CDATA[<p>The Charm City Circleator--or Circleator for short--is a Perl-based visualization tool developed at the&nbsp;<a href="http://igs.umaryland.edu/">Institute for Genome Sciences</a>&nbsp;in the University of Maryland's School of Medicine. Circleator produces circular plots of genome-associated data, like this one:</p>
<p><a href="https://camo.githubusercontent.com/0b414f050a7dcb672386932ee0cd83e5f42d2271/687474703a2f2f6a6f6e617468616e63726162747265652e6769746875622e696f2f436972636c6561746f722f696d616765732f43503030323732352d322d3432302e706e673f7261773d74727565" target="_blank"><img src="https://camo.githubusercontent.com/0b414f050a7dcb672386932ee0cd83e5f42d2271/687474703a2f2f6a6f6e617468616e63726162747265652e6769746875622e696f2f436972636c6561746f722f696d616765732f43503030323732352d322d3432302e706e673f7261773d74727565" alt="Sample Circleator image" title="Sample Circleator Image" style="border: 0px;"></a></p>
<p>Common uses of the tool include:</p>
<ul>
<li>Displaying the sequence and/or genes in a&nbsp;<a href="http://www.ncbi.nlm.nih.gov/genbank/">GenBank</a>&nbsp;flat file.</li>
<li>Highlighting differences and/or similarities in gene content between related organisms.</li>
<li>Comparing SNPs and indels between closely-related strains or serovars.</li>
<li>Comparing gene expression values across multiple samples or timepoints.</li>
<li>Visualizing coverage plots of RNA-Seq read alignments.</li>
</ul>
<h3><a href="https://github.com/jonathancrabtree/Circleator#key-features"></a>Key Features</h3>
<p>Circleator...</p>
<ul>
<li>Builds on&nbsp;<a href="http://www.bioperl.org/">BioPerl</a>&nbsp;and the input file formats that it supports, including:
<ul>
<li><a href="http://www.ncbi.nlm.nih.gov/genbank/">GenBank</a>&nbsp;flat files, GFF, FASTA</li>
</ul>
</li>
<li>Accepts a number of other commonly-used datatypes and file formats:
<ul>
<li><a href="http://bsr.igs.umaryland.edu/">BSR</a>&nbsp;and&nbsp;<a href="http://tandem.bu.edu/trf/trf.html">TRF</a>&nbsp;output,&nbsp;<a href="http://samtools.sourceforge.net/">SAM/BAM</a>&nbsp;files,&nbsp;<a href="http://vcftools.sourceforge.net/">VCF</a>-encoded SNPs, tab-delimited files</li>
</ul>
</li>
<li>Outputs publication-ready figures in the&nbsp;<a href="http://www.w3.org/Graphics/SVG/">SVG</a>&nbsp;(Scalable Vector Graphics) format.</li>
<li>Requires only a single configuration file whose layout mirrors that of the figure itself.
<ul>
<li>Predefined configuration files and "track" types are supplied for common datasets.</li>
<li>Advanced features allow limited analyses to be performed as a figure is drawn.</li>
</ul>
</li>
<li>Includes an extensive set of regression tests.</li>
<li>Offers a prototype web-based GUI (under the "Ringmaster" project.)</li>
</ul>
<p>https://github.com/jonathancrabtree/Circleator</p><p>Address of the bookmark: <a href="https://github.com/jonathancrabtree/Circleator" rel="nofollow">https://github.com/jonathancrabtree/Circleator</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33866/perlbrew-admin-free-perl-installation-management-tool</guid>
	<pubDate>Wed, 12 Jul 2017 03:53:08 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33866/perlbrew-admin-free-perl-installation-management-tool</link>
	<title><![CDATA[Perlbrew: admin-free perl installation management tool.]]></title>
	<description><![CDATA[<p>perlbrew is an admin-free perl installation management tool. The latest version is 0.79, read the release note:&nbsp;<a href="https://perlbrew.pl/Release-0.79.html">Release 0.79</a>.&nbsp;</p>
<p>Copy &amp; Paste this line into your terminal:</p>
<pre><code>\curl -L https://install.perlbrew.pl | bash
</code></pre>
<p>Or, if your system does not have curl but something else:</p>
<pre><code># Linux
\wget -O - https://install.perlbrew.pl | bash

# FreeBSD
\fetch -o- https://install.perlbrew.pl | sh
</code></pre>
<p>If you prefer to install with cpan, there are two steps:</p>
<pre><code>sudo cpan App::perlbrew
perlbrew init
</code></pre>
<p>If it is installed with cpan, the perlbrew executable should be installed as&nbsp;<code>/usr/bin/perlbrew</code>&nbsp;or&nbsp;<code>/usr/local/bin/perlbrew</code>. For all users who want to use perlbrew, a prior&nbsp;<code>perlbrew init</code>&nbsp;needs to be executed.</p>
<p>The default perlbrew root directory is&nbsp;<code>~/perl5/perlbrew</code>, which can be changed by setting&nbsp;<code>PERLBREW_ROOT</code>environment variable before the installation and initialization. For more advanced installation process, please read&nbsp;<a href="http://metacpan.org/module/App::perlbrew">the perlbrew document</a>.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://perlbrew.pl/" rel="nofollow">https://perlbrew.pl/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</guid>
	<pubDate>Tue, 07 Aug 2018 10:01:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</link>
	<title><![CDATA[VCFtools: perform common tasks with VCF files such as file validation, file merging, intersecting, complements]]></title>
	<description><![CDATA[<p>VCFtools contains a Perl API (<a href="http://vcftools.sourceforge.net/perl_module.html#Vcf.pm">Vcf.pm</a>) and a number of Perl scripts that can be used to perform common tasks with VCF files such as file validation, file merging, intersecting, complements, etc. The Perl tools support all versions of the VCF specification (3.2, 3.3, 4.0, 4.1 and 4.2), nevertheless, the users are encouraged to use the latest versions VCFv4.1 or VCFv4.2. The VCFtools in general have been used mainly with diploid data, but the Perl tools aim to support polyploid data as well. Run any of the Perl scripts with the&nbsp;<strong>--help</strong>&nbsp;switch to obtain more help.</p>
<p>Many of the&nbsp;<strong>Perl scripts require that the VCF files are compressed by&nbsp;<span>bgzip</span>&nbsp;and indexed by&nbsp;<span>tabix</span></strong>&nbsp;(both tools are part of the tabix package, available for&nbsp;<a href="https://sourceforge.net/projects/samtools/files/tabix/">download here</a>). The VCF files can be compressed and indexed using the following commands</p>
<p>bgzip my_file.vcf<br>tabix -p vcf my_file.vcf.gz</p>
<p>&nbsp;</p>
<p>http://vcftools.sourceforge.net/perl_module.html</p><p>Address of the bookmark: <a href="http://vcftools.sourceforge.net/perl_module.html" rel="nofollow">http://vcftools.sourceforge.net/perl_module.html</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42472/maftools-summarize-analyze-and-visualize-maf-files</guid>
	<pubDate>Wed, 23 Dec 2020 05:29:33 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42472/maftools-summarize-analyze-and-visualize-maf-files</link>
	<title><![CDATA[maftools : Summarize, Analyze and Visualize MAF Files]]></title>
	<description><![CDATA[<p><span>With advances in Cancer Genomics,&nbsp;</span><a href="https://docs.gdc.cancer.gov/Data/File_Formats/MAF_Format/">Mutation Annotation Format</a><span>&nbsp;(MAF) is being widely accepted and used to store somatic variants detected.&nbsp;</span><a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas</a><span>&nbsp;Project has sequenced over 30 different cancers with sample size of each cancer type being over 200.&nbsp;</span><a href="https://wiki.nci.nih.gov/display/TCGA/TCGA+MAF+Files">Resulting data</a><span>&nbsp;consisting of somatic variants are stored in the form of&nbsp;</span><a href="https://docs.gdc.cancer.gov/Data/File_Formats/MAF_Format/">Mutation Annotation Format</a><span>. This package attempts to summarize, analyze, annotate and visualize MAF files in an efficient manner from either TCGA sources or any in-house studies as long as the data is in MAF format.</span></p><p>Address of the bookmark: <a href="https://www.bioconductor.org/packages/release/bioc/vignettes/maftools/inst/doc/maftools.html" rel="nofollow">https://www.bioconductor.org/packages/release/bioc/vignettes/maftools/inst/doc/maftools.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38004/vcfr-a-package-to-manipulate-and-visualize-vcf-data-in-r</guid>
	<pubDate>Thu, 25 Oct 2018 09:05:59 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38004/vcfr-a-package-to-manipulate-and-visualize-vcf-data-in-r</link>
	<title><![CDATA[vcfR:  a package to manipulate and visualize VCF data in R]]></title>
	<description><![CDATA[<p><span>VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from the VCF data for use with typical R functions. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file or converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and the R environment connecting familiar software with genomic data.</span></p><p>Address of the bookmark: <a href="https://github.com/knausb/vcfR" rel="nofollow">https://github.com/knausb/vcfR</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43870/quip-aggressive-compression-of-fastq-sam-and-bam-files</guid>
	<pubDate>Tue, 24 May 2022 06:31:48 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43870/quip-aggressive-compression-of-fastq-sam-and-bam-files</link>
	<title><![CDATA[Quip: Aggressive compression of FASTQ, SAM and BAM files.]]></title>
	<description><![CDATA[<p>This will help us to reduce the amount of drive space we take up and decrease data transfer times</p>
<p dir="auto">Quip compresses next-generation sequencing data with extreme prejudice. It supports input and output in the&nbsp;<a href="http://en.wikipedia.org/wiki/Fastq">FASTQ</a>&nbsp;and&nbsp;<a href="http://samtools.sourceforge.net/">SAM/BAM</a>&nbsp;formats, compressing large datasets to as little as 15% of their original size.</p><p>Address of the bookmark: <a href="https://github.com/dcjones/quip" rel="nofollow">https://github.com/dcjones/quip</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41675/gapfinisher-a-reliable-gap-filling-pipeline-for-sspace-longread-scaffolder-output</guid>
	<pubDate>Thu, 14 May 2020 15:13:30 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41675/gapfinisher-a-reliable-gap-filling-pipeline-for-sspace-longread-scaffolder-output</link>
	<title><![CDATA[gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output]]></title>
	<description><![CDATA[<p>gapFinisher to process SSPACE-LongRead output to fill gaps after the scaffolding. gapFinisher is based on the controlled use of a previously published gap filling tool FGAP and works on all standard Linux/UNIX command lines.</p>
<p>https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6733440/</p><p>Address of the bookmark: <a href="https://github.com/kammoji/gapFinisher" rel="nofollow">https://github.com/kammoji/gapFinisher</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/396/bioinformatics-introduction-to-perl</guid>
	<pubDate>Thu, 11 Jul 2013 09:49:37 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/396/bioinformatics-introduction-to-perl</link>
	<title><![CDATA[Bioinformatics: Introduction to PERL]]></title>
	<description><![CDATA[<p>This course is aimed at those new to programming and provides an introduction to programming using <strong>Perl</strong>. By the end of this course, attendees should be able to write simple <strong>Perl</strong> programs and to understand more complex <strong>Perl</strong> programs written by others. The course will be taught using the online <a href="http://sofiarobb.com/learning-perl-toc/" title="http://sofiarobb.com/learning-perl-toc/">Learning Perl</a> materials created by <a href="http://stajich.bioinformatics.ucr.edu/members/sofia-robb" title="http://stajich.bioinformatics.ucr.edu/members/sofia-robb">Sofia Robb</a> of the <a href="http://www.ucr.edu/" title="http://www.ucr.edu/">University of California Riverside</a>. Further information is <a href="http://ruddles.bio.cam.ac.uk/%7Edpjudge/Descriptions/PERL.php" title="http://ruddles.bio.cam.ac.uk/~dpjudge/Descriptions/PERL.php">available</a>.</p>]]></description>
	<dc:creator>Archana Malhotra</dc:creator>
</item>

</channel>
</rss>