<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/42319?offset=20</link>
	<atom:link href="https://bioinformaticsonline.com/related/42319?offset=20" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38381/repeatmasker-compatible-blast-tool</guid>
	<pubDate>Fri, 07 Dec 2018 08:13:03 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38381/repeatmasker-compatible-blast-tool</link>
	<title><![CDATA[RepeatMasker-compatible BLAST tool]]></title>
	<description><![CDATA[<p><span>RMBlast is a RepeatMasker compatible version of the standard NCBI blastn program. The primary difference between this distribution and the NCBI distribution is the addition of a new program "rmblastn" for use with RepeatMasker and RepeatModeler.</span></p>
<p>RMBlast supports RepeatMasker searches by adding a few necessary features to the stock NCBI blastn program. These include:</p>
<ul>
<li>Support for custom matrices (without KA-Statistics).</li>
<li>Support for cross_match-like complexity-adjusted scoring. cross_match is Phil Green's seeded Smith-Waterman search algorithm.</li>
<li>Support for cross_match-like masklevel filtering.</li>
</ul>
<p>RMBlast is also packaged on Bioconda: https://anaconda.org/bioconda/rmblast</p><p>Address of the bookmark: <a href="http://www.repeatmasker.org/RMBlast.html" rel="nofollow">http://www.repeatmasker.org/RMBlast.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43985/visualise-blast-results</guid>
	<pubDate>Tue, 11 Oct 2022 03:15:10 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43985/visualise-blast-results</link>
	<title><![CDATA[Visualise BLAST results!]]></title>
	<description><![CDATA[<p>Kablammo helps you create interactive visualizations of BLAST results from your web browser. Find your most interesting alignments, list detailed parameters for each, and export a publication-ready vector image, all without installing any software.</p>
<p>Address of the bookmark: <a href="https://kablammo.wasmuthlab.org/" rel="nofollow">https://kablammo.wasmuthlab.org/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</guid>
	<pubDate>Sat, 07 Dec 2024 22:32:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</link>
	<title><![CDATA[A Step-by-Step Guide to Running BLAST Offline]]></title>
	<description><![CDATA[<p>BLAST (Basic Local Alignment Search Tool) is a powerful algorithm used to compare nucleotide or protein sequences to sequence databases, identifying regions of similarity. Running BLAST offline provides more control, ensures data security, and allows customization for specific research needs. Here&rsquo;s a detailed guide to set up and run BLAST locally on your system.</p><hr><h3>Step 1: <strong>Install BLAST</strong></h3><ol>
<li>
<p><strong>Download BLAST</strong>:</p>
<ul>
<li>Visit the <a href="https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/">NCBI BLAST+ download page</a> to download the appropriate version for your operating system (Windows, macOS, or Linux).</li>
</ul>
</li>
<li>
<p><strong>Install BLAST</strong>:</p>
<ul>
<li>Extract the downloaded archive. For Linux/Mac, use:
<pre><code>tar -xvzf ncbi-blast-*.tar.gz
# The trailing slash makes the glob match only the extracted directory, not the archive
cd ncbi-blast-*/
</code></pre>
</li>
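<li>Add the BLAST binary folder to your system PATH for easier access. Replace <code>ncbi-blast-VERSION</code> with the actual name of the extracted directory, because the shell does not expand <code>*</code> inside a variable assignment:
<pre><code>export PATH="$PATH:/path/to/ncbi-blast-VERSION/bin"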
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Verify Installation</strong>:<br /> Run the following command to ensure BLAST is installed correctly:</p>
<pre><code>blastn -version
</code></pre>
</li>
</ol><hr><h3>Step 2: <strong>Prepare a Local Database</strong></h3><p>To run BLAST offline, you&rsquo;ll need a sequence database.</p><ol>
<li>
<p><strong>Download a Pre-Built Database (Optional)</strong>:</p>
<ul>
<li>NCBI provides ready-to-use databases such as <code>nt</code>, <code>nr</code>, and Swiss-Prot (<code>swissprot</code>). Use the <code>update_blastdb.pl</code> script (bundled with BLAST) to download these:
<pre><code>update_blastdb.pl --decompress nt
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Create a Custom Database</strong>:<br /> If you have specific sequences to use as a database:</p>
<ul>
<li>Prepare a FASTA file containing the sequences.</li>
<li>Use <code>makeblastdb</code> to create a database:
<pre><code>makeblastdb -in your_sequences.fasta -dbtype [nucl|prot] -out custom_db
</code></pre>
Replace <code>[nucl|prot]</code> with <code>nucl</code> for nucleotide sequences or <code>prot</code> for protein sequences.</li>
</ul>
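<p>Before running <code>makeblastdb</code>, it can help to sanity-check the FASTA file and confirm which <code>-dbtype</code> applies. The following is a minimal Python sketch, not part of BLAST: <code>read_fasta</code> and <code>guess_dbtype</code> are hypothetical helper names, and the 90% nucleotide-letter threshold is an assumed heuristic that can misclassify unusual sequences.</p>

```python
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is None:
            raise ValueError("sequence data found before the first header line")
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Assumption: these letters are treated as nucleotide codes for the heuristic.
NUCLEOTIDE_LETTERS = set("ACGTUN")


def guess_dbtype(sequences: Iterable[str], threshold: float = 0.9) -> str:
    """Hypothetical heuristic: 'nucl' if most letters are nucleotide codes, else 'prot'."""
    total = 0
    nucleotide = 0
    for seq in sequences:
        for letter in seq.upper():
            total += 1
            if letter in NUCLEOTIDE_LETTERS:
                nucleotide += 1
    if total == 0:
        raise ValueError("no sequence data to inspect")
    return "nucl" if nucleotide / total >= threshold else "prot"


example = [">query_sequence", "ATGCGTAGCTAGCGTAGCTAGCTAGCTA"]
records = list(read_fasta(example))
print(records[0][0], guess_dbtype(seq for _, seq in records))
```

<p>With a real file, pass an open file handle to <code>read_fasta</code> instead of the in-memory list. The guess is only a hint; choose <code>nucl</code> or <code>prot</code> yourself when you know what the file contains.</p>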
</li>
</ol><hr><h3>Step 3: <strong>Prepare the Query Sequence</strong></h3><ul>
<li>Save your query sequence(s) in FASTA format.</li>
<li>Ensure the file is properly formatted, with a header line starting with <code>&gt;</code> followed by the sequence name, and the sequence on subsequent lines.</li>
</ul><p>Example:</p><pre><code>&gt;query_sequence
ATGCGTAGCTAGCGTAGCTAGCTAGCTA
</code></pre><hr><h3>Step 4: <strong>Run BLAST</strong></h3><ol>
<li>
<p><strong>Choose the Appropriate BLAST Tool</strong>:<br /> Depending on your data type:</p>
<ul>
<li><strong>blastn</strong>: For nucleotide-nucleotide searches.</li>
<li><strong>blastp</strong>: For protein-protein searches.</li>
<li><strong>blastx</strong>: Translates a nucleotide query in all six reading frames and compares the translations to a protein database.</li>
<li><strong>tblastn</strong>: Compares a protein query to a nucleotide database that is translated in all six reading frames.</li>
<li><strong>tblastx</strong>: Translates both the nucleotide query and the nucleotide database sequences, then compares the translations.</li>
</ul>
</li>
<li>
<p><strong>Run the Command</strong>:<br /> Example command for <code>blastn</code>:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -outfmt 6 -evalue 1e-5
</code></pre>
<p><strong>Explanation of Parameters</strong>:</p>
<ul>
<li><code>-query</code>: Specifies the query file.</li>
<li><code>-db</code>: Points to the local database.</li>
<li><code>-out</code>: Output file name.</li>
<li><code>-outfmt</code>: Output format (e.g., 6 for tabular format).</li>
<li><code>-evalue</code>: E-value cutoff for significance.</li>
</ul>
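<p>When the search is launched from a script rather than typed by hand, the same parameters can be assembled programmatically. The sketch below is one possible approach in Python; <code>build_blastn_command</code> is a hypothetical helper, and it only uses the flags already described in this guide.</p>

```python
import shlex


def build_blastn_command(query, db, out, evalue=1e-5, outfmt=6, num_threads=None):
    """Return the blastn argument list for the flags described above.

    Hypothetical helper: it only assembles the command and does not run BLAST.
    """
    cmd = [
        "blastn",
        "-query", str(query),
        "-db", str(db),
        "-out", str(out),
        "-outfmt", str(outfmt),
        "-evalue", str(evalue),
    ]
    if num_threads is not None:
        cmd += ["-num_threads", str(num_threads)]
    return cmd


cmd = build_blastn_command("query.fasta", "custom_db", "results.txt")
print(shlex.join(cmd))
# To execute the search (requires BLAST+ on your PATH):
#     import subprocess
#     subprocess.run(cmd, check=True)
```

<p>Passing the arguments as a list, instead of a single shell string, avoids quoting problems when file names contain spaces.</p>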
</li>
</ol><hr><h3>Step 5: <strong>Interpret Results</strong></h3><ol>
<li>
<p><strong>Output Formats</strong>:</p>
<ul>
<li><strong>Default (outfmt 0)</strong>: Human-readable format.</li>
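<li><strong>Tabular (outfmt 6)</strong>: Tab-separated, one hit per line. By default it has twelve columns: query ID, subject ID, percent identity, alignment length, mismatches, gap openings, query start and end, subject start and end, E-value, and bit score.</li>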
</ul>
</li>
<li>
<p><strong>Analyze Results</strong>:<br /> Use tools like <code>grep</code>, Python, or R to parse and filter results for downstream analysis.</p>
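<p>As an illustration of parsing and filtering in Python, here is a minimal sketch for tabular output. It assumes the file was produced with plain <code>-outfmt 6</code>, so each line has the twelve default columns; the function names, the sample rows, and the identity and E-value thresholds are made up for the example.</p>

```python
import csv

# Default column order of -outfmt 6 (assumes no custom column list was requested).
COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def parse_tabular(lines):
    """Parse tab-separated BLAST hits into a list of dictionaries."""
    hits = []
    for row in csv.reader(lines, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        hit = dict(zip(COLUMNS, row))
        hit["pident"] = float(hit["pident"])
        hit["length"] = int(hit["length"])
        hit["evalue"] = float(hit["evalue"])
        hit["bitscore"] = float(hit["bitscore"])
        hits.append(hit)
    return hits


def filter_hits(hits, min_pident=90.0, max_evalue=1e-5):
    """Keep hits with at least min_pident identity and an E-value no larger than max_evalue."""
    return [h for h in hits if h["pident"] >= min_pident and max_evalue >= h["evalue"]]


# Made-up rows standing in for the contents of results.txt.
sample = [
    "q1\ts1\t98.5\t200\t3\t0\t1\t200\t11\t210\t1e-100\t360",
    "q1\ts2\t80.0\t150\t30\t0\t1\t150\t5\t154\t2e-20\t110",
]
good = filter_hits(parse_tabular(sample))
print([h["sseqid"] for h in good])
```

<p>For a real run, replace <code>sample</code> with an open file handle, for example <code>open("results.txt")</code>.</p>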
</li>
</ol><hr><h3>Step 6: <strong>Optimize Performance</strong></h3><p>For large datasets, BLAST can be resource-intensive. To improve performance:</p><ol>
<li>
<p><strong>Multithreading</strong>:<br /> Use the <code>-num_threads</code> option to leverage multiple CPU cores:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -num_threads 4
</code></pre>
</li>
<li>
<p><strong>Database Subsetting</strong>:<br /> Split large databases into smaller chunks for faster searches.</p>
</li>
<li>
<p><strong>Adjust Parameters</strong>:</p>
<ul>
<li>Lower the <code>-evalue</code> threshold for stricter matches.</li>
<li>Use <code>-max_target_seqs</code> to limit the number of results per query.</li>
</ul>
</li>
</ol><hr><h3>Step 7: <strong>Update Databases (Optional)</strong></h3><p>If using NCBI databases, regularly update them to ensure the inclusion of the latest sequences:</p><pre><code>update_blastdb.pl --decompress nt
</code></pre><hr><h3>Conclusion</h3><p>Running BLAST offline is a straightforward process that offers flexibility and security for bioinformaticians working with sensitive data. By following this guide, you can harness the power of BLAST to analyze sequences efficiently and gain valuable biological insights.</p><p>For advanced use cases, explore BLAST&rsquo;s customization options, such as custom scoring matrices, filtering, and iterative searches with tools like PSI-BLAST. Happy BLASTing!</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43641/refseq-viraal-genome-sequences</guid>
	<pubDate>Sat, 11 Dec 2021 08:35:18 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43641/refseq-viraal-genome-sequences</link>
	<title><![CDATA[RefSeq viral genome sequences!]]></title>
	<description><![CDATA[<p>List of all viral RefSeq genome sequences on NCBI.</p>
<p>Address of the bookmark: <a href="https://ftp.ncbi.nlm.nih.gov/refseq/release/viral/" rel="nofollow">https://ftp.ncbi.nlm.nih.gov/refseq/release/viral/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42033/seastar-systematic-evaluation-of-alternative-start-site-in-rna</guid>
	<pubDate>Thu, 13 Aug 2020 09:54:27 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42033/seastar-systematic-evaluation-of-alternative-start-site-in-rna</link>
	<title><![CDATA[SEASTAR: Systematic Evaluation of Alternative STArt site in RNA]]></title>
	<description><![CDATA[<p>SEASTAR (Systematic Evaluation of Alternative STArt site in RNA) is a software package for Transcription Start Site (TSS) identification and quantification using only RNA-seq data. It assembles novel TSSs based only on RNA-Seq data and merges them with known TSSs from a public database. This package enables high-quality TSS identification that is comparable to the highly sophisticated CAGE technology. This package is particularly useful for finding novel TSSs that contribute to transcriptome complexity along with identifying differential promoter utilization.</p>
<p>Version 1.0.0 updates several descriptions and tests. To obtain v0.9.4, download it from&nbsp;<a href="https://github.com/zhyqin/SEASTAR-0.9.4">https://github.com/zhyqin/SEASTAR-0.9.4</a>.</p><p>Address of the bookmark: <a href="https://github.com/Xinglab/SEASTAR" rel="nofollow">https://github.com/Xinglab/SEASTAR</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/2791/ncbi-psi-blast-tutorial</guid>
	<pubDate>Fri, 23 Aug 2013 02:25:02 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/2791/ncbi-psi-blast-tutorial</link>
	<title><![CDATA[NCBI PSI-BLAST Tutorial]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/T3kHEieyylk" frameborder="0" allowfullscreen></iframe>http://www.biotechnology.jhu.edu/
Tutorial for PSI-BLAST, an extension of BLAST that iteratively builds a position-specific scoring matrix (PSSM) from the hits it finds. BLAST is a cornerstone bioinformatics tool at NCBI. BLAST is the
Basic Local Alignment Search Tool and will find protein and DNA sequences that
are related to a sequence that the user provides.]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/4419/a-fast-package-to-parse-blast</guid>
	<pubDate>Tue, 10 Sep 2013 16:58:56 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/4419/a-fast-package-to-parse-blast</link>
	<title><![CDATA[A fast package to parse BLAST]]></title>
	<description><![CDATA[<p>In the current era, we handle huge amounts of genomics data and analyse it to make biological sense of it. Large-scale sequence studies that rely on BLAST-based analysis produce huge amounts of output to be parsed. Several BLAST parsers are available, but they often lack important features, such as keeping all information from the raw BLAST output, allowing direct access to single results, and performing logical operations over them.</p><p>Massimiliano Orsini and Simone Carcangiu developed a new, fast package, "BlaSTorage", to parse and store BLAST results. BlaSTorage shows speed comparable to that of more basic parsers written in compiled languages such as C++, and it can be easily integrated into web applications or software pipelines.</p><p>Find more @ http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3571973/</p><p>http://biowiki.crs4.it/biowiki/MassimilianoOrsini</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/29270/blast-ring-image-generator-brig</guid>
	<pubDate>Fri, 30 Sep 2016 09:18:50 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/29270/blast-ring-image-generator-brig</link>
	<title><![CDATA[BLAST Ring Image Generator (BRIG)]]></title>
	<description><![CDATA[<p>BRIG is a free cross-platform (Windows/Mac/Unix) application that can display circular comparisons between a large number of genomes, with a focus on handling genome assembly data. The application is available at: <a href="http://sourceforge.net/projects/brig">http://sourceforge.net/projects/brig</a></p>
<p>If you have any questions or comments, post them on <a href="http://sourceforge.net/tracker/?group_id=328245">one of the trackers</a> on BRIG&rsquo;s SourceForge page: <a href="http://sourceforge.net/tracker/?group_id=328245">http://sourceforge.net/tracker/?group_id=328245</a>.</p>
<p>Features:</p>
<ul>
<li>Images show similarity between a central reference sequence and other sequences as concentric rings.</li>
<li>BRIG will perform all BLAST comparisons and file parsing automatically via a simple GUI.</li>
<li>Contig boundaries and read coverage can be displayed for draft genomes; customized graphs and annotations can be displayed.</li>
<li>Using a user-defined set of genes as input, BRIG can display gene presence, absence, truncation or sequence variation in a set of complete genomes, draft genomes or even raw, unassembled sequence data.</li>
<li>BRIG also accepts SAM-formatted read-mapping files, enabling genomic regions present in unassembled sequence data from multiple samples to be compared simultaneously.</li>
</ul><p>Address of the bookmark: <a href="http://brig.sourceforge.net/" rel="nofollow">http://brig.sourceforge.net/</a></p>]]></description>
	<dc:creator>Anjana</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/34600/converting-blast-output-into-csv</guid>
	<pubDate>Mon, 11 Dec 2017 04:17:58 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/34600/converting-blast-output-into-csv</link>
	<title><![CDATA[Converting BLAST output into CSV]]></title>
	<description><![CDATA[<p>Suppose we wanted to do something with all this BLAST output. Generally, that&rsquo;s the case - you want to retrieve all matches, or do a reciprocal BLAST, or something.</p><p>As with most programs that run on UNIX, the text output is in some specific format. If the program is popular enough, there will be one or more parsers written for that format &ndash; these are just utilities written to help you retrieve whatever information you are interested in from the output.</p><p>Let&rsquo;s conclude this tutorial by converting the BLAST output in out.txt into a spreadsheet format, using a Python script.&nbsp;</p><p>First, we need to get the script. We&rsquo;ll do that using the &lsquo;git&rsquo; program:</p><div><div><pre>git clone <a href="https://github.com/ngs-docs/ngs-scripts.git">https://github.com/ngs-docs/ngs-scripts.git</a> /root/ngs-scripts
</pre></div></div><p>We&rsquo;ll discuss &lsquo;git&rsquo; more later; for now, just think of it as a way to get ahold of a particular set of files. In this case, we&rsquo;ve placed the files in /root/ngs-scripts/, and you&rsquo;re looking to run the script blast/blast-to-csv.py using Python:</p><div><div><pre>python /root/ngs-scripts/blast/blast-to-csv.py out.txt
</pre></div></div><p>This outputs a spreadsheet-like list of names and e-values. To save this to a file in your home directory, do:</p><div><div><pre>python /root/ngs-scripts/blast/blast-to-csv.py out.txt &gt; ~/out.csv
</pre></div></div><p>If you have Excel installed, try double-clicking on the file to open it.</p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43952/elastic-blast</guid>
	<pubDate>Tue, 06 Sep 2022 18:14:57 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43952/elastic-blast</link>
	<title><![CDATA[ElasticBLAST!]]></title>
	<description><![CDATA[<p><a href="https://blast.ncbi.nlm.nih.gov/doc/elastic-blast/elasticblast.html?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=elasticblast-top3-20220823">ElasticBLAST</a>&nbsp;is a new way to&nbsp;<a href="https://blast.ncbi.nlm.nih.gov/?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=elasticblast-top3-20220823">BLAST</a>&nbsp;large numbers of queries, faster and on the cloud. Here are the top three reasons you should use ElasticBLAST:</p>
<h6><strong><img src="https://i0.wp.com/ncbiinsights.ncbi.nlm.nih.gov/wp-content/uploads/2022/08/ElasticBLAST_Larger-e1659978198941.png?resize=150%2C120&amp;ssl=1" alt="" width="150" height="120" style="border: 0px;">1. ElasticBLAST can handle much LARGER queries!&nbsp;</strong></h6>
<p>ElasticBLAST can search query sets that have&nbsp;<em>hundreds to millions of sequences</em>&nbsp;and against BLAST databases of all sizes.</p>
<h6><span><img src="https://i0.wp.com/ncbiinsights.ncbi.nlm.nih.gov/wp-content/uploads/2022/08/ElasticBLAST_Faster.png?resize=150%2C120&amp;ssl=1" alt="" width="150" height="120" style="border: 0px;">2. ElasticBLAST is FASTER</span></h6>
<p>ElasticBLAST distributes your searches across multiple cloud instances to process them simultaneously. The ability to scale resources in this way allows you to process large numbers of queries in a shorter time than you could with BLAST+.</p>
<h6><img src="https://i0.wp.com/ncbiinsights.ncbi.nlm.nih.gov/wp-content/uploads/2022/08/ElasticBLAST_Easy.png?resize=150%2C120&amp;ssl=1" alt="" width="150" height="120" style="border: 0px;">3. ElasticBLAST is EASY to run on the cloud</h6>
<p>ElasticBLAST is easy to set up using our step-by-step instructions (<a href="https://blast.ncbi.nlm.nih.gov/doc/elastic-blast/quickstart-aws.html?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=elasticblast-top3-20220823" target="_blank">Amazon Web Services (AWS)</a>, <a href="https://blast.ncbi.nlm.nih.gov/doc/elastic-blast/quickstart-gcp.html?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=elasticblast-top3-20220823" target="_blank">Google Cloud Platform (GCP)</a>) and allows you to leverage the power of the cloud. Once configured, it manages the software and database installation, handles partitioning of the BLAST workload among the various instances, and deallocates cloud resources when the searches are done.</p>
<p>ElasticBLAST also selects the instance (i.e., machine) type for you based on database size. Of course, you can also choose the instance type manually if you prefer.</p><p>Address of the bookmark: <a href="https://blast.ncbi.nlm.nih.gov/doc/elastic-blast/" rel="nofollow">https://blast.ncbi.nlm.nih.gov/doc/elastic-blast/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>

</channel>
</rss>