<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/2791?offset=100</link>
	<atom:link href="https://bioinformaticsonline.com/related/2791?offset=100" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</guid>
	<pubDate>Sat, 07 Dec 2024 22:32:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44709/a-step-by-step-guide-to-running-blast-offline</link>
	<title><![CDATA[A Step-by-Step Guide to Running BLAST Offline]]></title>
	<description><![CDATA[<p>BLAST (Basic Local Alignment Search Tool) is a powerful algorithm used to compare nucleotide or protein sequences to sequence databases, identifying regions of similarity. Running BLAST offline provides more control, ensures data security, and allows customization for specific research needs. Here&rsquo;s a detailed guide to set up and run BLAST locally on your system.</p><hr><h3>Step 1: <strong>Install BLAST</strong></h3><ol>
<li>
<p><strong>Download BLAST</strong>:</p>
<ul>
<li>Visit the <a href="https://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/">NCBI BLAST+ download page</a> to download the appropriate version for your operating system (Windows, macOS, or Linux).</li>
</ul>
</li>
<li>
<p><strong>Install BLAST</strong>:</p>
<ul>
<li>Extract the downloaded archive. For Linux/Mac, use:
<pre><code>tar -xvzf ncbi-blast-*.tar.gz
cd ncbi-blast-*
</code></pre>
</li>
<li>Add the BLAST binary folder to your system PATH for easier access:
<pre><code>export PATH=$PATH:/path/to/ncbi-blast-*/bin
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Verify Installation</strong>:<br /> Run the following command to ensure BLAST is installed correctly:</p>
<pre><code>blastn -version
</code></pre>
</li>
</ol><hr><h3>Step 2: <strong>Prepare a Local Database</strong></h3><p>To run BLAST offline, you&rsquo;ll need a sequence database.</p><ol>
<li>
<p><strong>Download a Pre-Built Database (Optional)</strong>:</p>
<ul>
<li>NCBI provides ready-to-use databases such as <code>nt</code>, <code>nr</code>, and <code>Swiss-Prot</code>. Use the <code>update_blastdb.pl</code> script (bundled with BLAST) to download these:
<pre><code>update_blastdb.pl --decompress nt
</code></pre>
</li>
</ul>
</li>
<li>
<p><strong>Create a Custom Database</strong>:<br /> If you have specific sequences to use as a database:</p>
<ul>
<li>Prepare a FASTA file containing the sequences.</li>
<li>Use <code>makeblastdb</code> to create a database:
<pre><code>makeblastdb -in your_sequences.fasta -dbtype [nucl|prot] -out custom_db
</code></pre>
Replace <code>[nucl|prot]</code> with <code>nucl</code> for nucleotide sequences or <code>prot</code> for protein sequences.</li>
</ul>
</li>
</ol><hr><h3>Step 3: <strong>Prepare the Query Sequence</strong></h3><ul>
<li>Save your query sequence(s) in FASTA format.</li>
<li>Ensure the file is properly formatted, with a header line starting with <code>&gt;</code> followed by the sequence name, and the sequence on subsequent lines.</li>
</ul><p>Example:</p><pre><code>&gt;query_sequence
ATGCGTAGCTAGCGTAGCTAGCTAGCTA
</code></pre><hr><h3>Step 4: <strong>Run BLAST</strong></h3><ol>
<li>
<p><strong>Choose the Appropriate BLAST Tool</strong>:<br /> Depending on your data type:</p>
<ul>
<li><strong>blastn</strong>: For nucleotide-nucleotide searches.</li>
<li><strong>blastp</strong>: For protein-protein searches.</li>
<li><strong>blastx</strong>: Translates nucleotide sequences into proteins and compares them to a protein database.</li>
<li><strong>tblastn</strong>: Compares protein sequences to a nucleotide database.</li>
<li><strong>tblastx</strong>: Translates both nucleotide query and database sequences.</li>
</ul>
</li>
<li>
<p><strong>Run the Command</strong>:<br /> Example command for <code>blastn</code>:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -outfmt 6 -evalue 1e-5
</code></pre>
<p><strong>Explanation of Parameters</strong>:</p>
<ul>
<li><code>-query</code>: Specifies the query file.</li>
<li><code>-db</code>: Points to the local database.</li>
<li><code>-out</code>: Output file name.</li>
<li><code>-outfmt</code>: Output format (e.g., 6 for tabular format).</li>
<li><code>-evalue</code>: E-value cutoff for significance.</li>
</ul>
</li>
</ol><hr><h3>Step 5: <strong>Interpret Results</strong></h3><ol>
<li>
<p><strong>Output Formats</strong>:</p>
<ul>
<li><strong>Default (outfmt 0)</strong>: Human-readable format.</li>
<li><strong>Tabular (outfmt 6)</strong>: Includes fields like query ID, subject ID, percent identity, alignment length, etc.</li>
</ul>
</li>
<li>
<p><strong>Analyze Results</strong>:<br /> Use tools like <code>grep</code>, Python, or R to parse and filter results for downstream analysis.</p>
</li>
</ol><hr><h3>Step 6: <strong>Optimize Performance</strong></h3><p>For large datasets, BLAST can be resource-intensive. To improve performance:</p><ol>
<li>
<p><strong>Multithreading</strong>:<br /> Use the <code>-num_threads</code> option to leverage multiple CPU cores:</p>
<pre><code>blastn -query query.fasta -db custom_db -out results.txt -num_threads 4
</code></pre>
</li>
<li>
<p><strong>Database Subsetting</strong>:<br /> Split large databases into smaller chunks for faster searches.</p>
</li>
<li>
<p><strong>Adjust Parameters</strong>:</p>
<ul>
<li>Lower the <code>-evalue</code> threshold for stricter matches.</li>
<li>Use <code>-max_target_seqs</code> to limit the number of results per query.</li>
</ul>
</li>
</ol><hr><h3>Step 7: <strong>Update Databases (Optional)</strong></h3><p>If using NCBI databases, regularly update them to ensure the inclusion of the latest sequences:</p><pre><code>update_blastdb.pl --decompress nt
</code></pre><hr><h3>Conclusion</h3><p>Running BLAST offline is a straightforward process that offers flexibility and security for bioinformaticians working with sensitive data. By following this guide, you can harness the power of BLAST to analyze sequences efficiently and gain valuable biological insights.</p><p>For advanced use cases, explore BLAST&rsquo;s customization options, such as custom scoring matrices, filtering, and iterative searches with tools like PSI-BLAST. Happy BLASTing!</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/40953/explore-taxdump-files</guid>
	<pubDate>Sat, 08 Feb 2020 04:44:55 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/40953/explore-taxdump-files</link>
	<title><![CDATA[Explore taxdump files !]]></title>
	<description><![CDATA[<pre>This is an extract of taxdump-readme.txt to be found at 
ftp://ftp.ncbi.nih.gov/pub/taxonomy/

The content of the archive
--------------------------

It may look like this:

delnodes.dmp
division.dmp
gencode.dmp
merged.dmp
names.dmp
nodes.dmp
readme.txt

The readme.txt file gives a brief description of *.dmp files. These files
contain taxonomic information and are briefly described below. Each of the
files store one record in the single line that are delimited by "\t|\n"
(tab, vertical bar, and newline) characters. Each record consists of one 
or more fields delimited by "\t|\t" (tab, vertical bar, and tab) characters.
The brief description of field position and meaning for each file follows.

nodes.dmp
---------

This file represents taxonomy nodes. The description for each node includes 
the following fields:

	tax_id					-- node id in GenBank taxonomy database
 	parent tax_id				-- parent node id in GenBank taxonomy database
 	rank					-- rank of this node (superkingdom, kingdom, ...) 
 	embl code				-- locus-name prefix; not unique
 	division id				-- see division.dmp file
 	inherited div flag  (1 or 0)		-- 1 if node inherits division from parent
 	genetic code id				-- see gencode.dmp file
 	inherited GC  flag  (1 or 0)		-- 1 if node inherits genetic code from parent
 	mitochondrial genetic code id		-- see gencode.dmp file
 	inherited MGC flag  (1 or 0)		-- 1 if node inherits mitochondrial gencode from parent
 	GenBank hidden flag (1 or 0)            -- 1 if name is suppressed in GenBank entry lineage
 	hidden subtree root flag (1 or 0)       -- 1 if this subtree has no sequence data yet
 	comments				-- free-text comments and citations

names.dmp
---------
Taxonomy names file has these fields:

	tax_id					-- the id of node associated with this name
	name_txt				-- name itself
	unique name				-- the unique variant of this name if name not unique
	name class				-- (synonym, common name, ...)

division.dmp
------------
Divisions file has these fields:
	division id				-- taxonomy database division id
	division cde				-- GenBank division code (three characters)
	division name				-- e.g. BCT, PLN, VRT, MAM, PRI...
	comments

gencode.dmp
-----------
Genetic codes file:

	genetic code id				-- GenBank genetic code id
	abbreviation				-- genetic code name abbreviation
	name					-- genetic code name
	cde					-- translation table for this genetic code
	starts					-- start codons for this genetic code

delnodes.dmp
------------
Deleted nodes (nodes that existed but were deleted) file field:

	tax_id					-- deleted node id

merged.dmp
----------
Merged nodes file fields:

	old_tax_id                              -- id of nodes which has been merged
	new_tax_id                              -- id of nodes which is result of merging

</pre>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/8417/conserved-domain-database-cdd-version-311-released</guid>
	<pubDate>Wed, 19 Feb 2014 15:02:40 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/8417/conserved-domain-database-cdd-version-311-released</link>
	<title><![CDATA[Conserved Domain Database (CDD) version 3.11 released]]></title>
	<description><![CDATA[<p>National Center for Biotechnology Information (NCBI) Conserved Domain Database (CDD) version 3.11 is now available with 596 new or updated NCBI-curated and 49,641 total domain models. The new version now contains the most recent Pfam release 27.</p><p><img src="http://www.ncbi.nlm.nih.gov/Structure/cdd/docs/images/np_081086_triangles_site_features_on_query_gi255958238_mouse_mutl1.png" alt="image" width="800" height="415" style="border: 0px; border: 0px;"></p><p>Updates to the Conserved Domain Database include:</p><ul>
<li>Position-specific score matrices (PSSMs) have been recomputed for many models in CDD, and frequency tables have been added to the PSSMs;</li>
</ul><ul>
<li>The search databases distributed as part of this release can now be used with the more recent versions of RPS-BLAST (BLAST release 2.2.28 and up) using composition-based scoring. This abolishes the need to mask out compositionally biased regions in query sequences;</li>
</ul><ul>
<li>Domain annotation displays in CD-Search, BATCH CD-Search, and other services now all use a uniform display style. A new display option in CD-Search and BATCH CD-Search provides “standard” results, in addition to “concise” and “full” results. “Standard” results will provide, for each region on the query sequence, the best0-scoring domain model (if any) from each of CDD’s database providers (Pfam, SMART, COG, TIGRFAMs, Protein Clusters, and the NCBI in-house curation project), but will suppress redundancy from within a single provider's results list.</li>
</ul><p>You can access CDD at the <a href="http://www.ncbi.nlm.nih.gov/cdd">Conserved Domains homepage</a> and find updated content on the <a href="ftp://ftp.ncbi.nih.gov/pub/mmdb/cdd">CDD FTP site</a>.</p><p>Reference:</p><p>NCBI Website</p>]]></description>
	<dc:creator>Shikha Logwani</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/27691/histonedb-20-%E2%80%93-with-variants</guid>
	<pubDate>Fri, 03 Jun 2016 05:06:20 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/27691/histonedb-20-%E2%80%93-with-variants</link>
	<title><![CDATA[HistoneDB 2.0 – with variants]]></title>
	<description><![CDATA[<p><span>This histone database can be used to explore the diversity of histone proteins and their sequence variants in many organisms. The resource was established to better understand how sequence variation may affect functional and structural features of nucleosomes. To get started, select a histone type to explore its variants.</span></p>
<p><span>More at&nbsp;http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0/index.fcgi/browse/</span></p><p>Address of the bookmark: <a href="http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0/index.fcgi/browse/" rel="nofollow">http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0/index.fcgi/browse/</a></p>]]></description>
	<dc:creator>Anjana</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/32719/download-assemblies-from-ncbi</guid>
	<pubDate>Mon, 15 May 2017 06:02:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/32719/download-assemblies-from-ncbi</link>
	<title><![CDATA[Download assemblies from NCBI]]></title>
	<description><![CDATA[<p>A new &ldquo;Download assemblies&rdquo; button is now available in the&nbsp;<a href="https://www.ncbi.nlm.nih.gov/assembly" target="_blank">Assembly</a>&nbsp;database. This makes it easy to download data for multiple genomes without having to write scripts.</p><p>For example, you can run a search in Assembly and use check boxes (see left side of screenshot below) to refine the set of genome assemblies of interest. Then, just open the &ldquo;Download assemblies&rdquo; menu, choose the source database (<a href="https://www.ncbi.nlm.nih.gov/genbank/" target="_blank">GenBank</a>&nbsp;or&nbsp;<a href="https://www.ncbi.nlm.nih.gov/refseq/" target="_blank">RefSeq</a>), choose the file type, and start the download. An archive file will be saved to your computer that can be expanded into a folder containing your selected genome data files.</p><p><img src="https://ncbiinsights.files.wordpress.com/2017/05/download_button.jpg?w=584" alt="image" width="584" height="444" style="border: 0px; border: 0px;"></p><p>&nbsp;</p><p>More at&nbsp;https://ncbiinsights.ncbi.nlm.nih.gov/2017/05/08/genome-data-download-made-easy/</p>]]></description>
	<dc:creator>Bulbul</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/40596/igblast-a-popular-ncbi-package-for-classifying-and-analyzing-immunoglobulin-ig-and-t-cell-receptor-tcr-variable-domain-sequences</guid>
	<pubDate>Thu, 23 Jan 2020 11:34:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/40596/igblast-a-popular-ncbi-package-for-classifying-and-analyzing-immunoglobulin-ig-and-t-cell-receptor-tcr-variable-domain-sequences</link>
	<title><![CDATA[IgBLAST: a popular NCBI package for classifying and analyzing immunoglobulin (IG) and T cell receptor (TCR) variable domain sequences]]></title>
	<description><![CDATA[<p>NCBI team released a new version of IgBLAST with four new improvements. IgBLAST is a popular NCBI package for classifying and analyzing immunoglobulin (IG) and T cell receptor (TCR) variable domain sequences. Improvements are:<span style="text-decoration: underline;"></span><span style="text-decoration: underline;"></span></p><p>&nbsp;&nbsp;&nbsp; 1. Support for the new FWR4 annotation feature in the AIRR format, both in standard format and in the AIRR alignment format.<span style="text-decoration: underline;"></span><span style="text-decoration: underline;"></span></p><p>&nbsp;&nbsp;&nbsp; 2. The previous &ldquo;-penalty&rdquo; parameter was renamed as -V_penalty to be consistent with other IgBLAST penalty options.<span style="text-decoration: underline;"></span><span style="text-decoration: underline;"></span></p><p>&nbsp;&nbsp;&nbsp; 3. Restored constant internal BLAST search parameters for domain annotation (i.e., FWR/CDR) such that this process is not influenced by user parameters.<span style="text-decoration: underline;"></span><span style="text-decoration: underline;"></span></p><p>&nbsp;&nbsp;&nbsp; 4. Corrected FWR/CDR annotations for certain mouse VK and rat VH germline genes.<span style="font-size: 12.8px;">&nbsp;</span></p><p><span style="text-decoration: underline;"></span></p><p>IgBLAST 1.15.0 is available for&nbsp;<a href="https://ftp.ncbi.nih.gov/blast/executables/igblast/release/LATEST/" target="_blank">download</a>&nbsp;from the BLAST FTP area. See the the new&nbsp;<a href="https://ncbi.github.io/igblast/" target="_blank">manual</a>&nbsp;on GitHub for information about setting up and running IgBLAST.</p><p><span style="text-decoration: underline;"></span></p><p>&nbsp;If you have any questions or concerns, please contact&nbsp;<a href="mailto:blast-help@ncbi.nlm.nih.gov" target="_blank" title="Follow link">blast-help@ncbi.nlm.nih.gov</a><span style="text-decoration: underline;"></span><span style="text-decoration: underline;"></span></p><p><span style="text-decoration: underline;"></span>&nbsp;</p>]]></description>
	<dc:creator>BioJoker</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2422/bioinformatics-codes-search</guid>
	<pubDate>Thu, 15 Aug 2013 11:08:52 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2422/bioinformatics-codes-search</link>
	<title><![CDATA[Bioinformatics Codes Search]]></title>
	<description><![CDATA[<p>I bet, this website will be your best friend in near future. This helps us to explore the existing open source codes and learn from it.</p>
<p>You can find some useful open source bioinformatics codes for your analysis work. You can use the left bar options to filtere out or narrow down your search result. This webpage can be an useful resource for a beginners bioinformatician as it contain several bioinformatics basics script that are commonly used by biological programmers and biologist.</p>
<p>Stand on the slumped, dandruff-covered shoulders of millions of computer nerds. _/\_</p>
<p>Enjoy the code and research work.</p>
<p>http://code.ohloh.net/search?s=bioinformatics</p><p>Address of the bookmark: <a href="http://code.ohloh.net/search?s=bioinformatics" rel="nofollow">http://code.ohloh.net/search?s=bioinformatics</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/27459/tools-for-searching-repeats-and-palindromic-sequences</guid>
	<pubDate>Sat, 21 May 2016 22:32:25 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/27459/tools-for-searching-repeats-and-palindromic-sequences</link>
	<title><![CDATA[Tools for Searching Repeats And Palindromic Sequences]]></title>
	<description><![CDATA[<p>What are genomic interspersed repeats?</p><p>In the mid 1960's scientists discovered that many genomes contain stretches of highly repetitive DNA sequences ( see Reassociation Kinetics Experiments, and C-Value Paradox ). These sequences were later characterized and placed into five categories:</p><p><strong>Simple Repeats</strong> - Duplications of simple sets of DNA bases (typically 1-5bp) such as A, CA, CGG etc.<br /><strong>Tandem Repeats</strong> - Typically found at the centromeres and telomeres of chromosomes these are duplications of more complex 100-200 base sequences.<br /><strong>Segmental Duplications</strong> - Large blocks of 10-300 kilobases which are that have been copied to another region of the genome.<br /><strong>Interspersed Repeats</strong><br />Processed Pseudogenes, Retrotranscripts, SINES - Non-functional copies of RNA genes which have been reintegrated into the genome with the assitance of a reverse transcriptase.<br />DNA Transposons<br />Retrovirus Retrotransposons<br />Non-Retrovirus Retrotransposons ( LINES )</p><p>Currently up to 50% of the human genome is repetitive in nature and as improvements are made in detection methods this number is expected to increase.</p><p>On the other hand; In genetics, the term palindrome refers to a sequence of nucleotides along a DNA (deoxyribonucleic acid) or RNA (ribonucleic acid) strand that contains the same series of nitrogenous bases regardless from which direction the strand is analyzed. Akin to a language palindrome&mdash;wherein a word or phrase is spelled the same left-to-right as right-to-left (e.g., the word RADAR or the phrase "able was I ere I saw elba")&mdash;with genetic palindromes it does not matter whether the nucleic acid strand is read starting from the 3' (three prime) end or the 5' (five prime) end of the strand.</p><p>Recent research on palindromes centers on understanding palindrome formation during gene amplification. Other studies have attempted to relate palindrome formation to molecular mechanisms involved in double stranded breaks and in the formation of inverted repeats. Assisted by high speed computers, other groups of scientists link palindrome formation to the conservation of genetic information.</p><p>Related to the direction of transcription by RNA polymerase, DNA strands have upstream and downstream terminus defined by differing chemical groups at each end. The ends of each strand of DNA or RNA are termed the 5' (phosphate bound to the 5' position carbon) and 3' (phosphate bound to the 3' carbon) ends to indicate a polarity within the molecule. Using the letters A, T, C, G, to represent the nitrogenous bases adenine, thymine, cytosine, and guanine found in DNA, and the letters A, U, C, G to represent the nitrogenous bases adenine, uracil, cytosine, guanine found in RNA (Note that uracil in RNA replaces the thymine found in DNA), geneticists usually represent DNA by a series of base codes (e.g., 5' AATCGGATTGCA 3'). The base codes are usually arranged from the 5' end to the 3' end.</p><p>Because of specific base pairing in DNA (i.e., adenine (A) always bonds with (thymine (T) and cytosine (C) always bonds with guanine (G)) the complimentary stand to the sequence 5' AATCGGATTGCA 3' would be 3' TTAGCCTAACGT 5'.</p><p>With palindromes the sequences on the complimentary strands read the same in either direction. For example, a sequence of 5' GAATTC3' on one strand would be complimented by a 3' CTTAAG 5' strand. In either case, when either strand is read from the 5' prime end the sequence is GAATTC. Another example of a palindrome would be the sequence 5' CGAAGC 3' that, when reversed, still reads CGAAGC.</p><p>Palindromes are important sequences within nucleic acids. Often they are the site of binding for specific enzymes (e.g., restriction endobucleases) designed to cut the DNA strands at specific locations (i.e., at palindromes).</p><p>Palindromes may arise from brakeage and chromosomal inversions that form inverted repeats that compliment each other. When a palindrome results from an inversion, it is often referred to as an inverted repeat. For example, the sequence 5' CGAAGC 3', if inverted (reversed 180&deg;), still reads CGAAGC.</p><p>The <a href="http://emboss.open-bio.org/">European Molecular Biology Open Software Suite (EMBOSS)</a> includes some basic tools for finding tandem repeats and inverted repeats (see <a href="http://emboss.open-bio.org/html/use/apbs06.html#GroupsAppsTableNucleicrepeatsR6">B.6.22. Applications in group Nucleic:repeats</a>). There are many on-line services providing the EMBOSS tools, for example:</p><ul>
<li>Wageningen Bioinformatics Webportal <a href="http://emboss.bioinformatics.nl/">EMBOSS explorer</a></li>
<li><a href="http://mobyle.pasteur.fr/">Mobyle@Pasteur</a></li>
<li><a href="http://wsembnet.vital-it.ch/">Soaplab2 Web Services at Vital-IT</a></li>
</ul><p>For more sophisticated repeat finding you will want to look at tools using <a href="http://www.girinst.org/repbase/">Repbase</a> for example:</p><ul>
<li>CENSOR
<ul>
<li><a href="http://www.girinst.org/censor/">CENSOR@GIRI</a></li>
<li><a href="http://www.ebi.ac.uk/Tools/so/censor/">CENSOR@EMBL-EBI</a></li>
</ul>
</li>
<li><a href="http://www.repeatmasker.org/">RepeatMasker</a></li>
<li><a href="http://mummer.sourceforge.net/">MUMmer</a>&nbsp;(scan_for_match)</li>
<li><a href="http://emboss.bioinformatics.nl/cgi-bin/emboss/palindrome">Emboss Palindrome</a></li>
</ul><p>Other nucleotide repeat finding methods found by a couple of web searches:</p><ul>
<li><a href="http://tandem.bu.edu/trf/trf.html">Tandem Repeats Finder</a></li>
<li><a href="http://selab.janelia.org/recon.html">RECON</a></li>
<li><a href="http://www.yandell-lab.org/software/repeatrunner.html">RepeatRunner</a></li>
<li><a href="http://bibiserv.techfak.uni-bielefeld.de/reputer/">REPuter</a></li>
<li><a href="http://210.212.215.200/IMEX/index.html">Imperfect Microsatellite Extractor (IMEx)</a></li>
<li><a href="http://www.imtech.res.in/raghava/srf/">Spectral Repeat Finder (SRF)</a></li>
<li><a href="http://zlab.bu.edu/repfind/form.html">REPFIND</a></li>
<li><a href="http://crispr.u-psud.fr/Server/CRISPRfinder.php">CRISPRfinder</a></li>
<li><a href="http://grail.lsd.ornl.gov/grailexp/">GrailEXP</a></li>
<li><a href="http://alggen.lsi.upc.edu/recerca/search/frame-search.html">CONREPP</a></li>
<li><a href="http://www.biophp.org/minitools/find_palindromes/demo.php%20"><span>find_palindromes</span></a></li>
<li><a href="http://insilico.ehu.eus/palindromes/"><span>Palindrome</span></a></li>
<li><a href="http://emboss.bioinformatics.nl/cgi-bin/emboss/palindrome">EMBOSS Palindrome</a></li>
<li><a href="http://bioinfo.cs.technion.ac.il/projects/Engel-Freund/new.html">Palindrome Search</a></li>
</ul>]]></description>
	<dc:creator>Radha Agarkar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36026/mmseqs20-ultra-fast-and-sensitive-protein-search-and-clustering-suite</guid>
	<pubDate>Thu, 22 Mar 2018 10:40:51 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36026/mmseqs20-ultra-fast-and-sensitive-protein-search-and-clustering-suite</link>
	<title><![CDATA[MMseqs2.0: ultra fast and sensitive protein search and clustering suite]]></title>
	<description><![CDATA[<p>MMseqs2 (Many-against-Many sequence searching) is a software suite to search and cluster huge protein sequence sets. MMseqs2 is open source GPL-licensed software implemented in C++ for Linux, MacOS, and (as beta version, via cygwin) Windows. The software is designed to run on multiple cores and servers and exhibits very good scalability. MMseqs2 can run 10000 times faster than BLAST. At 100 times its speed it achieves almost the same sensitivity. It can perform profile searches with the same sensitivity as PSI-BLAST at over 400 times its speed.</p>
<p>The MMseqs2 user guide is available as&nbsp;<a href="https://github.com/soedinglab/mmseqs2/wiki">Github Wiki</a>&nbsp;or as&nbsp;<a href="https://mmseqs.com/latest/userguide.pdf">PDF file</a>&nbsp;(Thanks to&nbsp;<a href="https://github.com/jgm/pandoc">pandoc</a>!)</p>
<p>Please cite:&nbsp;<a href="https://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.3988.html">Steinegger M and Soeding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology, doi: 10.1038/nbt.3988 (2017)</a>.</p><p>Address of the bookmark: <a href="https://github.com/soedinglab/MMseqs2" rel="nofollow">https://github.com/soedinglab/MMseqs2</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/42664/common-bioinformatics-interview-questions</guid>
	<pubDate>Sat, 23 Jan 2021 06:07:50 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/42664/common-bioinformatics-interview-questions</link>
	<title><![CDATA[Common Bioinformatics Interview Questions !]]></title>
	<description><![CDATA[<p>The possibility of an interview for a bioinformatics position in the life sciences may be very disquieting, but the same concerns emerge time and again in my experience. So, it is exceedingly worthwhile to plan for future bioinformatics interview questions. Doing this will really give you the advantage in obtaining the position.</p><p>The following 5 questions are those that I have heard many times during the job-search process. There is no reason for not planning responses in such situations.</p><p><strong>1. Tell Us About Yourself</strong><br />This is a very typical opener in interviews. It's a perfect question to ask, and getting something planned will really help you concentrate and ease in the conversation. However, you need to make sure that your response is applicable to the job you're interviewing.<br />It's probably better to keep your answer professional. Try to include these in the answer as well: where did your love of science and bioinformatics come from? How the heck did you end up in this field? Why programming and scripting ?</p><p><strong>2. What is your plan for your bioinformatics career? / How do you look at yourself in five years? / How are your personal objectives to accomplish these goals / What are the plan for your research fundings ?</strong></p><p>Your CV/resume has already impressed the selection panel if you have been invited for an interview. The questions from the bioinformatics interview team provide an incentive for you to market yourself and illustrate the work in question with the most appropriate knowledge.</p><p><strong>3. What do you understand about the job description/What would your suggested research path be if you were a successful candidate?</strong><br />Summarize the specifics of the advertised bioinformatics position in your own words. Follow on with some suggestions of how you want to extend your research and create your own projects within the community.</p><p><strong>4. Will you work as a group or do you want to work on your own?</strong><br />This requirement can vary from jobs to job, so when addressing, be alert. A company/research PI may need a bioinformatician that is able to work on a single project autonomously, or they may need a person who can help direct and organize a team. In your response, refer to the job description.</p><p><strong>5. What particular methods have you used to date with your experiments?</strong><br />You might have experience with all the laboratory techniques described in the job description, but stress the ones you highly experienced with. Highlight your professional abilities and stress that you are extremely capable of mastering new techniques with others ...</p><p>At the end of the day, remember that you're questioning the jury as well as they're interviewing you. You will ought to think of any questions you would like the interview panel to pose. This indicates that you have done your homework and serious about the position.</p><p>All the best for your future job interview.</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>