<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/41586?offset=20</link>
	<atom:link href="https://bioinformaticsonline.com/related/41586?offset=20" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</guid>
	<pubDate>Sat, 07 Dec 2024 22:32:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</link>
	<title><![CDATA[A Step-by-Step Guide to Running BLAST Offline]]></title>
	<description><![CDATA[<p>BLAST (Basic Local Alignment Search Tool) is a powerful algorithm used to compare nucleotide or protein sequences to sequence databases, identifying regions of similarity. Running BLAST offline provides more control, ensures data security, and allows customization for specific research needs. Here&rsquo;s a detailed guide to set up and run BLAST locally on your system.</p><hr><h3>Step 1: <strong>Install BLAST</strong></h3><ol>
<li>
<p><strong>Download BLAST</strong>:</p>
<ul>
<li>Visit the <a href="https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/">NCBI BLAST+ download page</a> to download the appropriate version for your operating system (Windows, macOS, or Linux).</li>
</ul>
</li>
<li>
<p><strong>Install BLAST</strong>:</p>
<ul>
<li>Extract the downloaded archive. For Linux/Mac, use:
<pre><code>tar -xvzf ncbi-blast-*.tar.gz
cd ncbi-blast-*/
</code></pre>
</li>
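<li>Add the BLAST binary folder to your system PATH for easier access. Replace <code>/path/to/ncbi-blast-VERSION</code> with the actual location of the extracted directory, because the shell does not expand wildcards in a PATH assignment:
<pre><code>export PATH="$PATH:/path/to/ncbi-blast-VERSION/bin"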
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Verify Installation</strong>:<br /> Run the following command to ensure BLAST is installed correctly:</p>
<pre><code>blastn -version
</code></pre>
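<p>Beyond the single version check, it can help to confirm that every BLAST+ program used in this guide is reachable. The sketch below is one possible way to do that. The <code>check_tools</code> function is a hypothetical helper, not part of BLAST+, and it relies only on the standard shell built-in <code>command -v</code>:</p>
<pre><code># check_tools is a hypothetical helper (not part of BLAST+).
# It prints the names of any programs that are not found on the PATH.
check_tools() {
  missing=""
  for tool in "$@"; do
    # command -v prints the resolved path, or nothing if the tool is absent
    if [ -z "$(command -v "$tool")" ]; then
      missing="$missing $tool"
    fi
  done
  # Drop the leading space left by the first append
  echo "${missing# }"
}

missing_tools="$(check_tools blastn blastp blastx tblastn tblastx makeblastdb)"
if [ -n "$missing_tools" ]; then
  echo "Not found on PATH: $missing_tools"
else
  echo "All BLAST+ programs found"
fi
</code></pre>
<p>If any names are reported, re-check the PATH setting from the previous step.</p>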
</li>
</ol><hr><h3>Step 2: <strong>Prepare a Local Database</strong></h3><p>To run BLAST offline, you&rsquo;ll need a sequence database.</p><ol>
<li>
<p><strong>Download a Pre-Built Database (Optional)</strong>:</p>
<ul>
<li>NCBI provides ready-to-use databases such as <code>nt</code>, <code>nr</code>, and <code>swissprot</code> (Swiss-Prot). Use the <code>update_blastdb.pl</code> script (bundled with BLAST+) to download these:
<pre><code>update_blastdb.pl --decompress nt
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Create a Custom Database</strong>:<br /> If you have specific sequences to use as a database:</p>
<ul>
<li>Prepare a FASTA file containing the sequences.</li>
<li>Use <code>makeblastdb</code> to create a database:
<pre><code>makeblastdb -in your_sequences.fasta -dbtype [nucl|prot] -out custom_db
</code></pre>
Replace <code>[nucl|prot]</code> with <code>nucl</code> for nucleotide sequences or <code>prot</code> for protein sequences.</li>
</ul>
</li>
</ol><hr><h3>Step 3: <strong>Prepare the Query Sequence</strong></h3><ul>
<li>Save your query sequence(s) in FASTA format.</li>
<li>Ensure the file is properly formatted, with a header line starting with <code>&gt;</code> followed by the sequence name, and the sequence on subsequent lines.</li>
</ul><p>Example:</p><pre><code>&gt;query_sequence
ATGCGTAGCTAGCGTAGCTAGCTAGCTA
</code></pre><hr><h3>Step 4: <strong>Run BLAST</strong></h3><ol>
<li>
<p><strong>Choose the Appropriate BLAST Tool</strong>:<br /> Depending on your data type:</p>
<ul>
<li><strong>blastn</strong>: For nucleotide-nucleotide searches.</li>
<li><strong>blastp</strong>: For protein-protein searches.</li>
<li><strong>blastx</strong>: Translates nucleotide sequences into proteins and compares them to a protein database.</li>
<li><strong>tblastn</strong>: Compares protein sequences to a nucleotide database translated in all six reading frames.</li>
<li><strong>tblastx</strong>: Translates both nucleotide query and database sequences.</li>
</ul>
</li>
<li>
<p><strong>Run the Command</strong>:<br /> Example command for <code>blastn</code>:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -outfmt 6 -evalue 1e-5
</code></pre>
<p><strong>Explanation of Parameters</strong>:</p>
<ul>
<li><code>-query</code>: Specifies the query file.</li>
<li><code>-db</code>: Points to the local database.</li>
<li><code>-out</code>: Output file name.</li>
<li><code>-outfmt</code>: Output format (e.g., 6 for tabular format).</li>
<li><code>-evalue</code>: E-value cutoff for significance.</li>
</ul>
</li>
</ol><hr><h3>Step 5: <strong>Interpret Results</strong></h3><ol>
<li>
<p><strong>Output Formats</strong>:</p>
<ul>
<li><strong>Default (outfmt 0)</strong>: Human-readable format.</li>
<li><strong>Tabular (outfmt 6)</strong>: Includes fields like query ID, subject ID, percent identity, alignment length, etc.</li>
</ul>
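<p>The default tabular output has no header row. Its 12 tab-separated columns are <code>qseqid</code>, <code>sseqid</code>, <code>pident</code>, <code>length</code>, <code>mismatch</code>, <code>gapopen</code>, <code>qstart</code>, <code>qend</code>, <code>sstart</code>, <code>send</code>, <code>evalue</code>, and <code>bitscore</code>. As a minimal sketch, the hypothetical helper <code>label_hits</code> below prints a few of those columns with labels. The sample row is made up for illustration:</p>
<pre><code># label_hits is a hypothetical helper; it reads outfmt 6 rows
# from the files given as arguments, or from standard input.
# Default columns: qseqid sseqid pident length mismatch gapopen
#                  qstart qend sstart send evalue bitscore
label_hits() {
  awk -F '\t' '{
    printf "query=%s subject=%s identity=%s length=%s evalue=%s bitscore=%s\n",
           $1, $2, $3, $4, $11, $12
  }' "$@"
}

# Made-up example row, for illustration only
printf 'q1\ts1\t98.50\t200\t3\t0\t1\t200\t10\t209\t1e-50\t350\n' | label_hits
</code></pre>
<p>To label the hits from Step 4, pass the file name instead: <code>label_hits results.txt</code>.</p>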
</li>
<li>
<p><strong>Analyze Results</strong>:<br /> Use tools like <code>grep</code>, Python, or R to parse and filter results for downstream analysis.</p>
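<p>One common filtering step is keeping only the best hit for each query. The sketch below assumes default outfmt 6 columns (bitscore in column 12) and uses only <code>sort</code> and <code>awk</code>. The <code>best_hits</code> name and the sample rows are hypothetical:</p>
<pre><code># best_hits is a hypothetical helper; it keeps the highest-bitscore
# row for each query from outfmt 6 input (files or standard input).
best_hits() {
  tab="$(printf '\t')"
  # Sort by query ID, then by bitscore (column 12) in descending order,
  # and keep only the first row seen for each query ID.
  sort -t "$tab" -k1,1 -k12,12nr "$@" | awk -F '\t' '!seen[$1]++'
}

# Made-up rows, for illustration only
printf 'q1\ts1\t90.00\t100\t10\t0\t1\t100\t1\t100\t1e-20\t150\nq1\ts2\t99.00\t100\t1\t0\t1\t100\t1\t100\t1e-40\t190\nq2\ts3\t95.00\t80\t4\t0\t1\t80\t5\t84\t1e-15\t120\n' | best_hits
</code></pre>
<p>With the sample rows, this keeps the <code>s2</code> row for <code>q1</code> and the only row for <code>q2</code>. For real output, run <code>best_hits results.txt</code>.</p>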
</li>
</ol><hr><h3>Step 6: <strong>Optimize Performance</strong></h3><p>For large datasets, BLAST can be resource-intensive. To improve performance:</p><ol>
<li>
<p><strong>Multithreading</strong>:<br /> Use the <code>-num_threads</code> option to leverage multiple CPU cores:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -num_threads 4
</code></pre>
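<p>Rather than hard-coding the thread count, it can be derived from the machine. The sketch below is one way to do that, assuming <code>getconf _NPROCESSORS_ONLN</code> is available (it is on typical Linux and macOS systems). The <code>pick_threads</code> name is a hypothetical helper:</p>
<pre><code># pick_threads is a hypothetical helper; it prints the number of
# online CPU cores reported by getconf, or 1 if that is unavailable.
pick_threads() {
  n="$(getconf _NPROCESSORS_ONLN)"
  case "$n" in
    # Fall back to a single thread if the value is empty or not a number
    ''|*[!0-9]*) n=1 ;;
  esac
  echo "$n"
}

threads="$(pick_threads)"
echo "Suggested setting: -num_threads $threads"
</code></pre>
<p>The value can then be passed to BLAST as <code>-num_threads "$(pick_threads)"</code>. On a shared machine, a smaller number may be more considerate.</p>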
</li>
<li>
<p><strong>Database Subsetting</strong>:<br /> Split large databases into smaller chunks for faster searches.</p>
</li>
<li>
<p><strong>Adjust Parameters</strong>:</p>
<ul>
<li>Lower the <code>-evalue</code> threshold for stricter matches.</li>
<li>Use <code>-max_target_seqs</code> to limit the number of results per query.</li>
</ul>
</li>
</ol><hr><h3>Step 7: <strong>Update Databases (Optional)</strong></h3><p>If using NCBI databases, regularly update them to ensure the inclusion of the latest sequences:</p><pre><code>update_blastdb.pl --decompress nt
</code></pre><hr><h3>Conclusion</h3><p>Running BLAST offline is a straightforward process that offers flexibility and security for bioinformaticians working with sensitive data. By following this guide, you can harness the power of BLAST to analyze sequences efficiently and gain valuable biological insights.</p><p>For advanced use cases, explore BLAST&rsquo;s customization options, such as custom scoring matrices, filtering, and iterative searches with tools like PSI-BLAST. Happy BLASTing!</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42405/caretta-%E2%80%93-a-multiple-protein-structure-alignment-and-feature-extraction-suite</guid>
	<pubDate>Fri, 18 Dec 2020 02:09:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42405/caretta-%E2%80%93-a-multiple-protein-structure-alignment-and-feature-extraction-suite</link>
	<title><![CDATA[Caretta – A multiple protein structure alignment and feature extraction suite]]></title>
	<description><![CDATA[<h3>Caretta &ndash;&nbsp;a multiple protein structure alignment and feature extraction suite</h3>
<p><span>Caretta is a multiple structure alignment suite for homologous but sequentially divergent protein families. It consistently returns accurate alignments with higher coverage than current state-of-the-art tools. Caretta is available as a GUI and command-line application, and it also outputs an aligned structure feature matrix for a given set of input structures, which can be used directly in downstream supervised or unsupervised machine learning.</span></p><p>Address of the bookmark: <a href="http://www.bioinformatics.nl/caretta/" rel="nofollow">http://www.bioinformatics.nl/caretta/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/4234/ncbi-psi-blast-tutorial</guid>
	<pubDate>Wed, 04 Sep 2013 11:46:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/4234/ncbi-psi-blast-tutorial</link>
	<title><![CDATA[NCBI PSI-BLAST Tutorial]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/T3kHEieyylk" frameborder="0" allowfullscreen></iframe>http://www.biotechnology.jhu.edu/
Tutorial for PSI-BLAST, an extension of BLAST that uses matrix algebra. BLAST, the Basic Local Alignment Search Tool, is a cornerstone bioinformatics tool at NCBI. It finds protein and DNA sequences that are related to a sequence that the user provides.]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/22770/blast-updated</guid>
	<pubDate>Tue, 16 Jun 2015 16:55:24 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/22770/blast-updated</link>
	<title><![CDATA[BLAST+ updated !!!]]></title>
	<description><![CDATA[<p>A new version (2.2.31) of the stand-alone BLAST executables (Linux, Windows and MacOSX on <a href="ftp://ftp.ncbi.nlm.nih.gov/blast/executables/LATEST">FTP</a>) is now available. New features include support for BLAST-XML2 specification (information <a href="ftp://ftp.ncbi.nlm.nih.gov/blast/documents/NEWXML/xml2.pdf">here</a>) and JSON BLAST output format, as well as several bug fixes and improvements. The BLAST AMI at AWS will also be updated to 2.2.31 (see this BLAST Help page for more <a href="http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&amp;PAGE_TYPE=BlastDocs&amp;DOC_TYPE=CloudBlast">information</a>). For a full list of improvements, see the <a href="http://www.ncbi.nlm.nih.gov/books/NBK131777">release notes</a>.</p><p>More at http://www.ncbi.nlm.nih.gov/news/06-16-2015-blast-plus-update/?</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/31526/sequenceserver</guid>
	<pubDate>Fri, 10 Mar 2017 08:51:55 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/31526/sequenceserver</link>
	<title><![CDATA[sequenceserver]]></title>
	<description><![CDATA[<p><span>SequenceServer lets you rapidly set up a BLAST+ server with an intuitive user interface for use locally or over the web.</span></p>
<p><span><span>More at&nbsp;</span><a href="http://sequenceserver.com/">http://sequenceserver.com</a><span>.</span></span></p><p>Address of the bookmark: <a href="https://github.com/wurmlab/sequenceserver" rel="nofollow">https://github.com/wurmlab/sequenceserver</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37545/ncbi-magic-blast</guid>
	<pubDate>Tue, 14 Aug 2018 18:11:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37545/ncbi-magic-blast</link>
	<title><![CDATA[NCBI Magic-BLAST]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p>
<p>Magic-BLAST incorporates within the NCBI BLAST code framework ideas developed in the NCBI Magic pipeline, in particular hit extensions by local walk and jump&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26109056">(http://www.ncbi.nlm.nih.gov/pubmed/26109056)</a>, and recursive clipping of mismatches near the edges of the reads, which avoids accumulating artefactual mismatches near splice sites and is needed to distinguish short indels from substitutions near the edges.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41482/magic-blast</guid>
	<pubDate>Fri, 20 Mar 2020 15:18:36 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41482/magic-blast</link>
	<title><![CDATA[Magic-BLAST]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/44515/cleaner-blast-databases-for-more-accurate-results</guid>
	<pubDate>Tue, 23 Apr 2024 01:23:08 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/44515/cleaner-blast-databases-for-more-accurate-results</link>
	<title><![CDATA[Cleaner BLAST Databases for More Accurate Results]]></title>
	<description><![CDATA[<p>Do you use <a href="https://blast.ncbi.nlm.nih.gov/Blast.cgi?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=blast-cleaner-20240422">BLAST</a> to identify a sequence or the evolutionary scope of a gene? That can be challenging if contaminated and misclassified sequences are in the BLAST databases and show up in your search results. To address this problem, NCBI now uses the quality assurance tools listed below to systematically remove these misleading sequences from the default nucleotide (nt) and protein (nr) BLAST databases.</p><div><ul>
<li><a href="https://github.com/ncbi/fcs">Foreign Contamination Screen tool for genome cross-species screening (FCS-GX)</a>&nbsp;detects contamination from foreign organisms in genomes and other sequences using the genome cross-species aligner (GX)&nbsp;</li>
<li><a href="https://ncbiinsights.ncbi.nlm.nih.gov/2022/05/27/ani-for-assembly-validation?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=blast-cleaner-20240422">Average Nucleotide Identity (ANI)</a>&nbsp;evaluates the taxonomic classification of prokaryotic genome assemblies. Sequences from genomes marked up as &lsquo;unverified source organism&rsquo; are considered suspect and removed.&nbsp;</li>
</ul><p>Ref&nbsp;https://ncbiinsights.ncbi.nlm.nih.gov/2024/04/22/cleaner-blast-databases-more-accurate-results/</p></div>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35550/circoletto-visualizing-sequence-similarity-with-circos</guid>
	<pubDate>Fri, 09 Feb 2018 10:23:40 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35550/circoletto-visualizing-sequence-similarity-with-circos</link>
	<title><![CDATA[Circoletto: visualizing sequence similarity with Circos]]></title>
	<description><![CDATA[<p><span>Circoletto is an online visualization tool based on Circos that provides a fast, aesthetically pleasing, and informative overview of sequence similarity search results.</span></p>
<p>An online version and a downloadable software package for offline use (source code in Perl) are freely available at&nbsp;<a href="http://bat.ina.certh.gr/tools/circoletto/" target="">http://bat.ina.certh.gr/tools/circoletto/</a></p>
<p><strong>Contact:</strong> <a href="mailto:ndarz@certh.gr" target="">ndarz@certh.gr</a></p><p>Address of the bookmark: <a href="http://tools.bat.infspire.org/circoletto/" rel="nofollow">http://tools.bat.infspire.org/circoletto/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/39913/twinblast-when-two-is-better-than-one</guid>
	<pubDate>Sat, 07 Sep 2019 08:50:08 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/39913/twinblast-when-two-is-better-than-one</link>
	<title><![CDATA[TwinBLAST: When Two Is Better than One]]></title>
	<description><![CDATA[<p>TwinBLAST is a web-based tool for viewing 2 BLAST reports simultaneously side-by-side. It uses ExtJS (www.sencha.com/products/extjs/) to provide 2 independently scrollable panels. BioPerl (www.bioperl.org) is used to index raw BLAST reports and Bio::Graphics is used to draw pictograms of the BLAST hits.</p>
<p><a href="https://github.com/IGS/twinblast">https://github.com/IGS/twinblast</a></p>
<p><a href="https://mra.asm.org/content/8/35/e00842-19">https://mra.asm.org/content/8/35/e00842-19</a></p><p>Address of the bookmark: <a href="https://github.com/IGS/twinblast" rel="nofollow">https://github.com/IGS/twinblast</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>