<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Neel's blogs]]></title>
	<link>https://bioinformaticsonline.com/blog/owner/neelam?</link>
	<atom:link href="https://bioinformaticsonline.com/blog/owner/neelam?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44871/10-books-to-kickstart-and-level-up-your-bioinformatics-journey</guid>
	<pubDate>Tue, 12 Aug 2025 03:50:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44871/10-books-to-kickstart-and-level-up-your-bioinformatics-journey</link>
	<title><![CDATA[10 Books to Kickstart (and Level Up) Your Bioinformatics Journey]]></title>
	<description><![CDATA[<p>If you&rsquo;re starting out in bioinformatics or looking to sharpen your computational biology skills, having the right learning resources makes all the difference.<br />Here&rsquo;s my curated list of 10 must-read books &mdash; from beginner-friendly introductions to advanced computational genomics.</p><p>1️⃣ Data Analysis for the Life Sciences<br />A fantastic starting point to learn statistics, R programming, and exploratory data analysis in the context of biology. The best part? It&rsquo;s available free online from HarvardX.</p><p>2️⃣ Practical Computing for Biologists<br />The very first book I picked up when I started learning computational biology. It&rsquo;s beginner-friendly and focuses on essential computing skills every biologist needs.</p><p>3️⃣ A Primer for Computational Biology<br />An open-access, hands-on introduction to computational biology concepts and coding techniques. Perfect if you want to learn through real examples.</p><p>4️⃣ Computational Genomics with R<br />For those who already know R and want to dive deeper into genome-scale data analysis, from sequence alignment to gene expression.</p><p>5️⃣ The Biologist&rsquo;s Guide to Computing<br />Bridges the gap between biological problems and computational thinking, making it easier for life scientists to approach programming and data analysis.</p><p>6️⃣ Bioinformatics Data Skills<br />A must-read to sharpen your bioinformatics toolkit &mdash; from command-line skills to reproducible research workflows. Ideal once you&rsquo;ve covered the basics.</p><p>7️⃣ Bioinformatics Workbook<br />A practical tutorial series to help scientists design bioinformatics projects, analyze data, and understand best practices.</p><p>8️⃣ Modern Statistics for Modern Biology<br />An essential guide to modern statistical methods applied to biology, blending theory with hands-on examples in R.</p><p>9️⃣ Algorithms on Strings, Trees, and Sequences by Dan Gusfield<br />A classic reference for anyone wanting to understand the algorithms behind sequence alignment, genome assembly, and biological data structures.</p><p></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44720/a-beginners-guide-to-using-kraken-for-taxonomic-classification</guid>
	<pubDate>Fri, 13 Dec 2024 11:29:03 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44720/a-beginners-guide-to-using-kraken-for-taxonomic-classification</link>
	<title><![CDATA[A Beginner&#039;s Guide to Using Kraken for Taxonomic Classification]]></title>
	<description><![CDATA[<div>Kraken is a popular bioinformatics tool designed for fast and accurate taxonomic classification of metagenomic sequences. Its efficiency and precision make it a go-to resource for analyzing microbial communities, including bacteria, viruses, archaea, and fungi. Whether you're new to bioinformatics or experienced in the field, Kraken is an indispensable tool for taxonomic analysis.</div><div><div><div><div dir="auto"><div><div><p>In this blog, we&rsquo;ll walk through the basics of Kraken, from installation to running an analysis, and highlight its key features and applications.</p><h4><strong>What is Kraken?</strong></h4><p>Kraken is a sequence classification tool that assigns taxonomic labels to DNA sequences using exact k-mer matching. It uses a reference database of genomes, dividing sequences into k-mers and identifying matches in a computationally efficient way.</p><h4><strong>Key Features of Kraken</strong></h4><ul>
<li><strong>Speed</strong>: Kraken processes data much faster than alignment-based methods.</li>
<li><strong>Accuracy</strong>: It uses a precise k-mer matching algorithm for high-resolution taxonomic assignments.</li>
<li><strong>Scalability</strong>: It can handle large metagenomic datasets.</li>
<li><strong>Custom Databases</strong>: You can build and use custom databases tailored to your research needs.</li>
</ul><h4><strong>Installing Kraken</strong></h4><ol>
<li>
<p><strong>System Requirements</strong></p>
<ul>
<li>A Unix-based operating system (Linux/macOS).</li>
<li>Sufficient computational resources for database building (RAM and disk space).</li>
</ul>
</li>
<li>
<p><strong>Installation Steps</strong></p>
<ul>
<li>Clone the Kraken repository from GitHub:
<div>
<div>&nbsp;</div>
<div dir="ltr"><code>git <span style="font-size: 12.8px; font-weight: normal;">clone</span> https://github.com/DerrickWood/kraken.git <span style="font-size: 12.8px; font-weight: normal;">cd</span> kraken </code></div>
</div>
</li>
<li>Compile the Kraken binaries:
<div>
<div>&nbsp;</div>
<div dir="ltr"><code>make </code></div>
</div>
</li>
<li>Add Kraken to your PATH for easy access:
<div>
<div>&nbsp;</div>
<div dir="ltr"><code><span style="font-size: 12.8px; font-weight: normal;">export</span> PATH=<span style="font-size: 12.8px; font-weight: normal;">$PATH</span>:/path/to/kraken </code></div>
</div>
</li>
</ul>
</li>
</ol><h4><strong>Preparing a Database</strong></h4><p>Kraken requires a database of reference genomes. You can use a pre-built database or create a custom one.</p><ol>
<li>
<p><strong>Downloading a Pre-built Database</strong><br />Kraken offers pre-built databases, such as the <em>MiniKraken</em> database, which is lightweight and suitable for smaller datasets. Download it using:</p>
<div>
<div dir="ltr"><code>kraken-build --download-library minikraken </code></div>
</div>
</li>
<li>
<p><strong>Building a Custom Database</strong><br />To include specific genomes, download FASTA files and build the database:</p>
<div>
<div dir="ltr"><code>kraken-build --download-library bacteria --threads 4 --db my_database kraken-build --build --db my_database </code></div>
</div>
<p>This process may take considerable time and resources, depending on the size of the database.</p>
</li>
</ol><h4><strong>Running Kraken</strong></h4><p>Once the database is ready, you can classify sequences.</p><ol>
<li>
<p><strong>Basic Usage</strong><br />Use the following command to classify sequences:</p>
<div>
<div dir="ltr"><code>kraken --db my_database --threads 4 --fastq-input input_sequences.fastq --output kraken_output.txt </code></div>
</div>
<p>Key options:</p>
<ul>
<li><code>--db</code>: Specifies the database.</li>
<li><code>--threads</code>: Number of threads for parallel processing.</li>
<li><code>--fastq-input</code>: Indicates input file format (FASTQ/FASTA).</li>
</ul>
</li>
<li>
<p><strong>Interpreting Results</strong><br />Kraken generates an output file with columns for sequence IDs, taxonomic classifications, and the confidence score.</p>
</li>
</ol><h4><strong>Visualizing Kraken Results</strong></h4><p>Kraken results can be visualized using tools like <strong>Krona</strong> or converted to human-readable reports using <code>kraken-report</code>.</p><ol>
<li>
<p><strong>Generate a Report</strong></p>
<div>
<div dir="ltr"><code>kraken-report --db my_database kraken_output.txt &gt; kraken_report.txt </code></div>
</div>
</li>
<li>
<p><strong>Krona Visualization</strong><br />Install Krona and convert Kraken output for visualization:</p>
<div>
<div dir="ltr"><code>cut -f2,3 kraken_output.txt | ktImportTaxonomy -o krona_output.html </code></div>
</div>
<p>Open the HTML file in your browser to interactively explore the taxonomic classifications.</p>
</li>
</ol><h4><strong>Advanced Usage</strong></h4><ol>
<li>
<p><strong>Confidence Thresholds</strong><br />Adjust the confidence threshold for classification using the <code>--confidence</code> option. Higher values reduce false positives but may miss some true positives:</p>
<div>
<div dir="ltr"><code>kraken --db my_database --confidence 0.1 --fastq-input input.fastq </code></div>
</div>
</li>
<li>
<p><strong>Paired-End Reads</strong><br />For paired-end sequencing data, use:</p>
<div>
<div dir="ltr"><code>kraken --db my_database --paired reads_1.fastq reads_2.fastq </code></div>
</div>
</li>
<li>
<p><strong>Customizing K-mers</strong><br />Kraken allows you to set custom k-mer lengths during database building for specific applications.</p>
</li>
</ol><h4><strong>Applications of Kraken</strong></h4><ul>
<li><strong>Microbial Ecology</strong>: Characterizing microbial communities in soil, water, and the human microbiome.</li>
<li><strong>Pathogen Detection</strong>: Identifying pathogens in clinical samples.</li>
<li><strong>Fungal Research</strong>: Analyzing fungal diversity in metagenomic datasets.</li>
<li><strong>Environmental Monitoring</strong>: Tracking microbial populations in diverse habitats.</li>
</ul><h4><strong>Conclusion</strong></h4><p>Kraken is a versatile and efficient tool for taxonomic classification in metagenomics. Its speed, accuracy, and flexibility make it a favorite among bioinformaticians. By following this guide, you can set up and use Kraken to unlock insights into microbial and fungal communities, paving the way for discoveries in ecology, medicine, and biotechnology.</p></div></div></div></div></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44718/mycology-research-resources-for-bioinformaticians-unlocking-the-fungal-kingdom</guid>
	<pubDate>Fri, 13 Dec 2024 11:21:45 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44718/mycology-research-resources-for-bioinformaticians-unlocking-the-fungal-kingdom</link>
	<title><![CDATA[Mycology Research Resources for Bioinformaticians: Unlocking the Fungal Kingdom]]></title>
	<description><![CDATA[<p>Mycology, the study of fungi, is a field that bridges ecology, medicine, and biotechnology. With advancements in bioinformatics, researchers now have unprecedented opportunities to explore the fungal kingdom at molecular, genetic, and ecological levels. From understanding pathogenic fungi to harnessing fungal enzymes for industrial applications, the potential is vast.</p><p>To fully leverage these opportunities, bioinformaticians require specialized tools and databases. This blog highlights essential resources for mycology research, focusing on databases, tools, and platforms tailored for fungal biology.</p><h4><strong>1. Fungal Databases</strong></h4><h5><strong>1.1. MycoCosm</strong></h5><p><strong>Website</strong>: <a target="_new">MycoCosm</a><br />Developed by the DOE Joint Genome Institute, MycoCosm is a comprehensive portal for fungal genomics. It offers genomic and transcriptomic data for a wide range of fungi, including saprobes, pathogens, and symbionts.</p><ul>
<li><strong>Key Features</strong>: Genome browsers, comparative genomics tools, and functional annotations.</li>
<li><strong>Best For</strong>: Large-scale studies on fungal evolution and ecology.</li>
</ul><h5><strong>1.2. FungiDB</strong></h5><p><strong>Website</strong>: <a href="https://fungidb.org/" target="_new">FungiDB</a><br />FungiDB is an integrated genomic resource for fungal pathogens and non-pathogens. It provides access to genome sequences, transcriptomic data, and functional annotations.</p><ul>
<li><strong>Key Features</strong>: Advanced search options, BLAST, and pathway analysis tools.</li>
<li><strong>Best For</strong>: Studying fungal pathogenesis and host-pathogen interactions.</li>
</ul><h5><strong>1.3. Index Fungorum</strong></h5><p><strong>Website</strong>: <a href="http://www.indexfungorum.org/" target="_new">Index Fungorum</a><br />This nomenclatural database provides information on the scientific names of fungi. It&rsquo;s an essential resource for taxonomists and researchers focused on fungal biodiversity.</p><ul>
<li><strong>Key Features</strong>: Taxonomic hierarchy and synonymy tracking.</li>
<li><strong>Best For</strong>: Identifying and classifying fungal species.</li>
</ul><h5><strong>1.4. UNITE</strong></h5><p><strong>Website</strong>: <a target="_new">UNITE</a><br />UNITE is a specialized database for fungal ITS (Internal Transcribed Spacer) sequences, often used in fungal identification and phylogenetics.</p><ul>
<li><strong>Key Features</strong>: Curated reference datasets and community annotations.</li>
<li><strong>Best For</strong>: Environmental mycology and microbial ecology studies.</li>
</ul><h4><strong>2. Analytical Tools</strong></h4><h5><strong>2.1. Funannotate</strong></h5><p><strong>Repository</strong>: <a href="https://github.com/nextgenusfs/funannotate" target="_new">GitHub - Funannotate</a><br />Funannotate is a genome annotation tool designed for fungi. It supports tasks like gene prediction, functional annotation, and orthology analysis.</p><ul>
<li><strong>Best For</strong>: Annotating newly sequenced fungal genomes.</li>
</ul><h5><strong>2.2. BUSCO (Benchmarking Universal Single-Copy Orthologs)</strong></h5><p><strong>Website</strong>: <a target="_new">BUSCO</a><br />BUSCO evaluates genome assembly and annotation completeness using orthologs. It includes a fungal-specific dataset.</p><ul>
<li><strong>Best For</strong>: Assessing the quality of fungal genome assemblies.</li>
</ul><h5><strong>2.3. Pathogen-Host Interactions Database (PHI-base)</strong></h5><p><strong>Website</strong>: <a href="http://www.phi-base.org/" target="_new">PHI-base</a><br />PHI-base is a manually curated resource containing information on pathogen-host interactions, including fungal pathogens.</p><ul>
<li><strong>Best For</strong>: Exploring virulence factors and host-pathogen relationships.</li>
</ul><h4><strong>3. Visualization Platforms</strong></h4><h5><strong>3.1. Cytoscape</strong></h5><p><strong>Website</strong>: <a href="https://cytoscape.org/" target="_new">Cytoscape</a><br />A powerful tool for visualizing molecular interaction networks, Cytoscape can be used to study protein-protein interactions, gene networks, and metabolic pathways in fungi.</p><ul>
<li><strong>Best For</strong>: Network biology and functional genomics.</li>
</ul><h5><strong>3.2. iTOL (Interactive Tree of Life)</strong></h5><p><strong>Website</strong>: <a target="_new">iTOL</a><br />iTOL is an interactive tool for visualizing phylogenetic trees.</p><ul>
<li><strong>Best For</strong>: Displaying fungal phylogenies and comparing evolutionary relationships.</li>
</ul><h4><strong>4. Community Resources</strong></h4><h5><strong>4.1. Mycological Society of America (MSA)</strong></h5><p><strong>Website</strong>: <a href="https://msafungi.org/" target="_new">MSA</a><br />The MSA promotes fungal research and provides access to resources, conferences, and publications.</p><ul>
<li><strong>Best For</strong>: Networking with fungal researchers and accessing recent studies.</li>
</ul><h5><strong>4.2. OpenFungi</strong></h5><p><strong>Website</strong>: <a href="https://openfungi.org/" target="_new">OpenFungi</a><br />OpenFungi is an open-source initiative providing fungal genomic and transcriptomic datasets for research and education.</p><ul>
<li><strong>Best For</strong>: Sharing and accessing public fungal datasets.</li>
</ul><h4><strong>5. Genomics Workflows</strong></h4><h5><strong>5.1. Galaxy</strong></h5><p><strong>Website</strong>: <a href="https://usegalaxy.org/" target="_new">Galaxy Project</a><br />Galaxy offers a web-based platform for reproducible bioinformatics workflows, including tools for fungal genome and transcriptome analysis.</p><ul>
<li><strong>Best For</strong>: User-friendly analysis pipelines without requiring coding skills.</li>
</ul><h5><strong>5.2. Snakemake</strong></h5><p><strong>Repository</strong>: <a target="_new">Snakemake</a><br />A flexible pipeline management tool that supports fungal data processing and analysis.</p><ul>
<li><strong>Best For</strong>: Custom workflows for large-scale fungal datasets.</li>
</ul><h4><strong>Conclusion</strong></h4><p>Fungal research is a rapidly growing field with vast implications for medicine, agriculture, and industry. For bioinformaticians, the availability of specialized resources&mdash;databases, tools, and community platforms&mdash;opens doors to innovative discoveries. Whether you are investigating fungal genomics, studying host-pathogen interactions, or exploring fungal biodiversity, the resources outlined above will empower your research journey.</p><p>Dive into these resources and help unravel the mysteries of the fungal kingdom!</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44716/exploring-rna-sequence-analysis-tools-for-every-bioinformatician</guid>
	<pubDate>Fri, 13 Dec 2024 04:03:04 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44716/exploring-rna-sequence-analysis-tools-for-every-bioinformatician</link>
	<title><![CDATA[Exploring RNA Sequence Analysis: Tools for Every Bioinformatician]]></title>
	<description><![CDATA[<p>RNA sequence analysis has become an essential part of modern biological research. From RNA-seq pipelines to specialized tools for specific RNA types, here's a comprehensive guide to tools you can use to make sense of RNA data.</p><h4><strong>1. RNA-Seq Analysis Pipelines</strong></h4><p>RNA-seq is one of the most popular techniques for studying RNA. These tools streamline processing raw sequence data:</p><ul>
<li><strong>FASTQC</strong>: For quality control of raw RNA-seq reads.</li>
<li><strong>Trimmomatic</strong>: For trimming and filtering RNA-seq reads.</li>
<li><strong>HISAT2/STAR</strong>: High-performance aligners for RNA-seq reads.</li>
<li><strong>FeatureCounts</strong>: For quantifying gene expression.</li>
<li><strong>DESeq2/EdgeR</strong>: For differential expression analysis.</li>
</ul><h4><strong>2. Transcriptome Assembly and Annotation</strong></h4><p>For analyzing transcriptomes from non-model organisms or assembling novel transcripts:</p><ul>
<li><strong>Trinity</strong>: For de novo transcriptome assembly.</li>
<li><strong>StringTie</strong>: For transcript assembly and quantification from RNA-seq alignments.</li>
<li><strong>TransDecoder</strong>: To predict coding regions within assembled transcripts.</li>
<li><strong>TAU</strong>: Tools for annotating non-coding and coding RNAs.</li>
</ul><h4><strong>3. Exploring Non-Coding RNA (ncRNA)</strong></h4><p>Non-coding RNAs play critical regulatory roles. Dedicated tools for studying them include:</p><ul>
<li><strong>Infernal</strong>: For identifying ncRNA sequences based on covariance models.</li>
<li><strong>Rfam</strong>: Database and tools for ncRNA families.</li>
<li><strong>miRDeep</strong>: For identifying microRNAs in RNA-seq datasets.</li>
</ul><h4><strong>4. RNA Structure and Motif Analysis</strong></h4><p>Structural biology of RNA helps in understanding its function:</p><ul>
<li><strong>RNAfold (ViennaRNA)</strong>: Predicts secondary structures from RNA sequences.</li>
<li><strong>RNAstructure</strong>: Tools for RNA secondary structure prediction and analysis.</li>
<li><strong>MEME Suite</strong>: For identifying motifs in RNA sequences.</li>
<li><strong>IntaRNA</strong>: For RNA-RNA interaction prediction.</li>
</ul><h4><strong>5. RNA Editing and Modifications</strong></h4><p>Epitranscriptomics is a growing field focusing on RNA modifications:</p><ul>
<li><strong>REDItools</strong>: For RNA editing analysis.</li>
<li><strong>m6Aboost</strong>: For identifying m6A modifications in RNA.</li>
</ul><h4><strong>6. Long-Read RNA Sequencing Analysis</strong></h4><p>Long-read technologies like Nanopore and PacBio are transforming RNA research:</p><ul>
<li><strong>FLAIR</strong>: For isoform-level analysis of long-read RNA-seq data.</li>
<li><strong>NanoMod</strong>: For detecting modifications in RNA from Nanopore sequencing.</li>
</ul><h4><strong>7. RNA-Protein Interactions</strong></h4><p>To study RNA-protein interactions and complexes:</p><ul>
<li><strong>RBPmap</strong>: For identifying RNA-binding protein motifs.</li>
<li><strong>PARalyzer</strong>: For analyzing PAR-CLIP data.</li>
</ul><h4><strong>8. Functional Enrichment Analysis</strong></h4><p>Understanding biological functions and pathways from RNA-seq data:</p><ul>
<li><strong>getENRICH</strong>: A tool designed for pathway enrichment analysis of non-model organisms (hypergeometric P-value calculation with FDR correction).</li>
<li><strong>ClusterProfiler</strong>: For GO and KEGG pathway enrichment analysis.</li>
</ul><h4><strong>9. Visualization and Data Sharing</strong></h4><p>Presenting and sharing RNA sequence analysis results effectively:</p><ul>
<li><strong>IGV</strong>: Genome browser for visualizing RNA-seq alignments.</li>
<li><strong>Circos</strong>: Circular visualization of RNA-seq data.</li>
<li><strong>DashBio</strong>: A Python library for creating bioinformatics visualizations.</li>
</ul><h4><strong>Conclusion</strong></h4><p>The bioinformatics landscape for RNA sequence analysis is vast, with tools catering to specific needs. Whether you&rsquo;re studying coding RNAs, non-coding RNAs, or exploring RNA-protein interactions, the right tools can transform your data into biological insights.</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44713/understanding-rna-seq-normalization-methods-tpm-vs-fpkm-vs-cpm</guid>
	<pubDate>Wed, 11 Dec 2024 00:59:15 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44713/understanding-rna-seq-normalization-methods-tpm-vs-fpkm-vs-cpm</link>
	<title><![CDATA[Understanding RNA-Seq Normalization Methods: TPM vs. FPKM vs. CPM]]></title>
	<description><![CDATA[<p>RNA sequencing (RNA-Seq) is a powerful technology used to study transcriptomes, providing insights into gene expression levels. However, raw RNA-Seq data requires normalization to account for sequencing depth and gene length, enabling accurate comparisons between genes and samples. Among the most widely used normalization methods are TPM (Transcripts Per Million), FPKM (Fragments Per Kilobase Million), and CPM (Counts Per Million). Each method has its unique principles and applications, which we&rsquo;ll explore in this blog.</p><h2>Why Normalize RNA-Seq Data?</h2><p>Normalization is a crucial step in RNA-Seq analysis for the following reasons:</p><ul>
<li>
<p><strong>Sequencing depth:</strong> Different RNA-Seq experiments produce varying numbers of reads, making direct comparisons between samples misleading.</p>
</li>
<li>
<p><strong>Gene length:</strong> Longer genes inherently generate more reads, irrespective of their actual expression level.</p>
</li>
<li>
<p><strong>Bias reduction:</strong> Normalization mitigates technical biases, enabling meaningful biological interpretation.</p>
</li>
</ul><h2>TPM (Transcripts Per Million)</h2><p>TPM measures the proportion of reads mapped to a transcript, normalized by transcript length and sequencing depth. It is calculated as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Proportionality:</strong> TPM values sum to 1,000,000 across all transcripts in a sample, making it easier to compare between samples.</p>
</li>
<li>
<p><strong>Intuitive interpretation:</strong> TPM values directly represent the abundance of transcripts in a sample.</p>
</li>
<li>
<p><strong>Preferred for comparisons:</strong> TPM facilitates between-sample comparisons better than FPKM.</p>
</li>
</ol><h2>FPKM (Fragments Per Kilobase Million)</h2><p>FPKM normalizes read counts by transcript length and sequencing depth, but without enforcing proportionality like TPM. It is defined as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Historical significance:</strong> FPKM was one of the first normalization methods used for RNA-Seq.</p>
</li>
<li>
<p><strong>Single-end vs. paired-end:</strong> In paired-end sequencing, FPKM becomes RPKM (Reads Per Kilobase Million).</p>
</li>
<li>
<p><strong>Limited utility:</strong> FPKM values are not as robust as TPM for cross-sample comparisons due to lack of proportionality.</p>
</li>
</ol><h2>CPM (Counts Per Million)</h2><p>CPM normalizes raw read counts by sequencing depth, without considering gene length. It is expressed as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Simplicity:</strong> CPM is straightforward and computationally less intensive.</p>
</li>
<li>
<p><strong>Application:</strong> Suitable for non-length-dependent analyses, such as comparing total expression levels or differential expression analysis.</p>
</li>
<li>
<p><strong>Gene length agnostic:</strong> CPM does not correct for gene length, making it less ideal for measuring expression levels.</p>
</li>
</ol><h2>When to Use Each Method</h2><ul>
<li>
<p><strong>TPM:</strong> Best for comparing expression levels between samples, especially when transcript length and sequencing depth vary.</p>
</li>
<li>
<p><strong>FPKM:</strong> Useful for historical consistency but generally replaced by TPM.</p>
</li>
<li>
<p><strong>CPM:</strong> Ideal for differential expression analysis when gene length normalization is unnecessary.</p>
</li>
</ul><h2>Conclusion</h2><p>Choosing the right normalization method depends on the specific objectives of your RNA-Seq analysis. TPM&rsquo;s proportionality and robustness make it the preferred choice for most applications, while CPM serves well for differential expression studies. Although FPKM paved the way for RNA-Seq normalization, it has largely been supplanted by TPM in modern workflows. Understanding these methods and their nuances ensures accurate and meaningful interpretations of RNA-Seq data.</p><h3>References:</h3><ol>
<li>
<p>Li, B., &amp; Dewey, C. N. (2011). RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. <em>BMC Bioinformatics.</em></p>
</li>
<li>
<p>Trapnell, C., et al. (2010). Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. <em>Nature Biotechnology.</em></p>
</li>
<li>
<p>Law, C. W., et al. (2014). voom: precision weights unlock linear model analysis tools for RNA-seq read counts. <em>Genome Biology.</em></p>
</li>
</ol>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43940/langya-virus-update</guid>
	<pubDate>Fri, 12 Aug 2022 05:31:10 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43940/langya-virus-update</link>
	<title><![CDATA[Langya Virus Update !]]></title>
	<description><![CDATA[<p>https://www.ncbi.nlm.nih.gov/nuccore/OM101125,OM101126,OM101127,OM101128,OM101129,OM101130?</p><p>Zoonotic Henipavirus</p><p>https://pubmed.ncbi.nlm.nih.gov/35921459/</p><p>https://www.ncbi.nlm.nih.gov/nuccore/OM069646,,OM069567,OM069568,OM069569,OM069570,OM069571,OM069572,OM069573,OM069574,OM069575,OM069576,OM069577,OM069578,OM069579,OM069580,OM069581,OM069582,OM069583,OM069584,OM069585,OM069586,OM069587,OM069588,OM069589,OM069590,OM069591,OM069592,OM069593,OM069594,OM069595,OM069596,OM069597,OM069598,OM069599,OM069600,OM069601,OM069602,OM069603,OM069604,OM069605,OM069606,OM069607,OM069608,OM069609,OM069610,OM069611,OM069612,OM069613,OM069614,OM069615,OM069616,OM069617,OM069618,OM069619,OM069620,OM069621,OM069622,OM069623,OM069624,OM069625,OM069626,OM069627,OM069628,OM069629,OM069630,OM069631,OM069632,OM069633,OM069634,OM069635,OM069636,OM069637,OM069638,OM069639,OM069640,OM069641,OM069642,OM069643,OM069644,OM069645,OM069646</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43872/installing-elgg-on-ubuntu</guid>
	<pubDate>Wed, 25 May 2022 02:26:05 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43872/installing-elgg-on-ubuntu</link>
	<title><![CDATA[Installing ELGG on Ubuntu !]]></title>
	<description><![CDATA[<p>Elgg is an open-source and highly customizable framework used for building an online social environment. It provides a simple and powerful user interface that helps to manage and build your content through a web browser. Elgg offers a rich set of features including messaging, microblogging, file-sharing, RSS support, access control, groups, and many more.</p><p>&nbsp;</p><p>In this tutorial, we will show you how to install and configure Elgg social networking platform on Ubuntu 20.04.</p><h2>Prerequisites</h2><p>&bull; A fresh Ubuntu 20.04&nbsp;<a href="https://www.atlantic.net/vps-hosting/">VPS</a>&nbsp;on the Atlantic.net Cloud Platform<br />&bull; A valid domain name pointed to your server IP<br />&bull; A root password configured on your server</p><h2>Step 1 &ndash; Create Atlantic.Net Cloud Server</h2><p>First, log in to your&nbsp;<a href="https://cloud.atlantic.net/?page=userlogin" target="_blank">Atlantic.Net Cloud Server</a>. Create a new&nbsp;<a href="https://www.atlantic.net/vps-hosting/how-to-create-new-atlantic-net-cloud-server/">server</a>, choosing Ubuntu 20.04 as the operating system with at least 2GB RAM. Connect to your Cloud Server via SSH and log in using the credentials highlighted at the top of the page.</p><p>Once you are logged in to your Ubuntu 20.04 server, run the following command to update your base system with the latest available packages.</p><pre>apt-get update -y</pre><h2>Step 2 &ndash; Install Apache, MariaDB and PHP</h2><p>Elgg runs on Apache web server, is written in PHP, and uses MySQL/MariaDB as a database backend, so you will need to install the Apache, MariaDB, PHP and other required PHP extensions to your server. You can install all of them with the following command:</p><pre>apt-get install apache2 mariadb-server php libapache2-mod-php php-common php-sqlite3 php-curl 
php-intl php-mbstring php-xmlrpc php-mysql php-gd php-xml php-cli php-zip unzip wget -y</pre><p>After installing all the packages, edit the php.ini file and change some recommended settings.</p><pre>nano /etc/php/7.4/apache2/php.ini</pre><p>Change the following values:</p><pre>max_execution_time = 300
memory_limit = 512M
upload_max_filesize = 100M
date.timezone = Asia/Kolkata</pre><p>Save and close the file, then restart the Apache service to apply the configuration changes.</p><pre>systemctl restart apache2</pre><h2>Step 3 &ndash; Create a Database for Elgg</h2><p>Next, you will need to create a database and user for Elgg. First, log in to MySQL shell with the following command:</p><pre>mysql</pre><p>Once logged in, create a database and user with the following command:</p><pre>CREATE DATABASE elgg;
CREATE USER 'elgg'@'localhost' IDENTIFIED BY 'secure-password';</pre><p>Next, grant all the privileges to the elgg database with the following command:</p><pre>GRANT ALL ON elgg.* TO 'elgg'@'localhost' IDENTIFIED BY 'secure-password' WITH GRANT 
OPTION;</pre><p>Next, flush the privileges and exit from the MariaDB shell with the following command:</p><pre>FLUSH PRIVILEGES;
EXIT;</pre><p>At this point, the MariaDB database is created for Elgg.</p><h2>Step 4 &ndash; Install Elgg</h2><p>First, download the latest version of Elgg from its official website using the following command:</p><pre>wget https://elgg.org/download/elgg-3.3.13.zip</pre><p>Once the download is completed, unzip the downloaded file with the following command:</p><pre>unzip elgg-3.3.13.zip</pre><p>Next, move the extracted directory to the Apache root directory:</p><pre>mv elgg-3.3.13 /var/www/html/elgg</pre><p>Next, create a data directory and set proper ownership and permissions to the Elgg directory:</p><pre>mkdir /var/www/html/data
chown -R www-data:www-data /var/www/html/elgg
chown -R www-data:www-data /var/www/html/data
chmod -R 755 /var/www/html/elgg</pre><p>Once you are finished, you can proceed to the next step.</p><h2>Step 5 &ndash; Configure Apache for Elgg</h2><p>Next, you will need to configure Apache to serve Elgg. You can configure it by creating a new Apache virtual host configuration file:</p><pre>nano /etc/apache2/sites-available/elgg.conf</pre><p>Add the following lines:</p><pre>&lt;VirtualHost *:80&gt;
ServerAdmin admin@example.com
DocumentRoot /var/www/html/elgg/
ServerName elgg.example.com
Options FollowSymLinks
AllowOverride All
ErrorLog /var/log/apache2/elgg-error_log
CustomLog /var/log/apache2/elgg-access_log common
&lt;/VirtualHost&gt;</pre><p>Save and close the file, then enable the virtual host and Apache rewrite module with the following command:</p><pre>a2ensite elgg.conf
a2enmod rewrite</pre><p>Finally, restart the Apache service to apply the changes:</p><pre>systemctl restart apache2</pre><h2>Step 6 &ndash; Access Elgg Web Interface</h2><p>Now, open your web browser and access the Elgg web interface using the URL http://elgg.example.com. You should see the Elgg welcome screen:</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43670/useful-bioinformatics-analysis-tools</guid>
	<pubDate>Thu, 23 Dec 2021 23:10:02 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43670/useful-bioinformatics-analysis-tools</link>
	<title><![CDATA[Useful Bioinformatics Analysis Tools !]]></title>
	<description><![CDATA[<h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=cometa&amp;subpage=about">CoMeta</a></h3><p><strong>Classificier of reads from metagenomic sequencing experiments.</strong></p><p><span>&bull;&nbsp;&nbsp;Kawulok, J., Deorowicz, S.,&nbsp;</span><em>CoMeta: Classification of Metagenomes Using k-mers</em><span>,&nbsp;</span><a href="http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0121453">PLOS ONE,&nbsp;</a><span>2015; 10(4):1&ndash;23,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=CoMSA&amp;subpage=about">CoMSA</a></h3><p><strong>Compressor of multiple sequence alignments of proteins.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Walczyszyn, J., Debudaj-Grabysz, A.,&nbsp;</span><em>CoMSA: compression of protein multiple sequence alignment files</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/bty619">Bioinformatics,&nbsp;</a><span>2019; 35(2):22&ndash;234,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=dsrc&amp;subpage=about">DSRC</a></h3><p><strong>Compressor of sequencing reads.</strong></p><p><span>&bull;&nbsp;&nbsp;Roguski, L., Deorowicz, S.,&nbsp;</span><em>DSRC 2: Industry-oriented compression of FASTQ files</em><span>,&nbsp;</span><a href="http://bioinformatics.oxfordjournals.org/content/30/15/2213">Bioinformatics,&nbsp;</a><span>2014; 30(15):2213&ndash;2215,</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Grabowski, Sz.,&nbsp;</span><em>Compression of DNA sequences in FASTQ format</em><span>,&nbsp;</span><a href="http://bioinformatics.oxfordjournals.org/">Bioinformatics,&nbsp;</a><span>2011; 27(6):860&ndash;862,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=famsa&amp;subpage=about">FAMSA</a></h3><p><strong>Multiple sequence alignment designed for huge families of proteins (even containing hundreds of thousands of sequences).</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Debudaj-Grabysz, A., Gudys, A.,&nbsp;</span><em>FAMSA: Fast and accurate multiple sequence alignment of huge protein families</em><span>,&nbsp;</span><a href="http://www.nature.com/articles/srep33964">Scientific Reports,&nbsp;</a><span>2016; 6(33964):</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=fastore&amp;subpage=about">FaStore</a></h3><p><strong>Compressor of FASTQ files.</strong></p><p><span>&bull;&nbsp;&nbsp;Roguski, L., Ochoa, I., Hernaez, M., Deorowicz, S.,&nbsp;</span><em>FaStore - a space-saving solution for raw sequencing data</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/bty205">Bioinformatics,&nbsp;</a><span>2018; 34(16):2748&ndash;2756,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=fqsqueezer&amp;subpage=about">FQSqueezer</a></h3><p><strong>Experimental high-end compressor of FASTQ files.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S.,&nbsp;</span><em>FQSqueezer: k-mer-based compression of sequencing data</em><span>,&nbsp;</span><a href="https://www.nature.com/articles/s41598-020-57452-6">Scientific Reports,&nbsp;</a><span>2020; 10(578):</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=gdc&amp;subpage=about">GDC</a></h3><p><strong>Compressor of collections of genome sequences.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Danek, A., Niemiec, M.,&nbsp;</span><em>GDC 2: Compression of large collections of genomes</em><span>,&nbsp;</span><a href="http://www.nature.com/srep/2015/150625/srep11565/full/srep11565.html">Scientific Reports,&nbsp;</a><span>2015; 5(11565):1&ndash;12,</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Grabowski, Sz.,&nbsp;</span><em>Robust relative compression of genomes with random access</em><span>,&nbsp;</span><a href="http://sun.aei.polsl.pl/REFRESH/bioinformatics.oxfordjournals.org/content/27/21/2979.abstract">Bioinformatics,&nbsp;</a><span>2011; 27(21):2979&ndash;2986,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=gtc&amp;subpage=about">GTC</a></h3><p><strong>Genotype databases compressor with support for fast queries.</strong></p><p><span>&bull;&nbsp;&nbsp;Danek, A., Deorowicz, S.,&nbsp;</span><em>GTC: how to maintain huge genotype collections in a compressed form</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/bty023">Bioinformatics,&nbsp;</a><span>2018; 34(11):1834&ndash;1840,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=gtshark&amp;subpage=about">GTShark</a></h3><p><strong>Genotypes compressor.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Danek, A.,&nbsp;</span><em>GTShark: Genotype compression in large projects</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/btz508">Bioinformatics,&nbsp;</a><span>2019; 35(22):4791&ndash;4793,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=kmc&amp;subpage=about">KMC</a></h3><p><strong>Memory frugal&nbsp;<em>k</em>-mer counter.</strong></p><p><span>&bull;&nbsp;&nbsp;Kokot, M., Długosz, M., Deorowicz, S.,&nbsp;</span><em>KMC 3: counting and manipulating k -mer statistics</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/btx304">Bioinformatics,&nbsp;</a><span>2017; 33(17):2759&ndash;2761,</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Kokot, M., Grabowski, Sz., Debudaj-Grabysz, A.,&nbsp;</span><em>KMC 2: Fast and resource-frugal k-mer counting</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/btv022">Bioinformatics,&nbsp;</a><span>2015; 31(10):1569&ndash;1576,</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Debudaj-Grabysz, A., Grabowski, Sz.,&nbsp;</span><em>Disk-based k-mer counting on a PC</em><span>,&nbsp;</span><a href="http://www.biomedcentral.com/1471-2105/14/160">BMC Bioinformatics,&nbsp;</a><span>2013; 14():Article no. 160,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=kmer-db&amp;subpage=about">Kmer-db</a></h3><p><strong>Tool for estimation of evolutionary distances in a collection of genomes.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Gudys, A., Dlugosz, M., Kokot, M., Danek, A.,&nbsp;</span><em>Kmer-db: instant evolutionary distance estimation</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/bty610">Bioinformatics,&nbsp;</a><span>2019; 35(1):133&ndash;136,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=mugi&amp;subpage=about">MuGI</a></h3><p><strong>Index allowing queries for a collection of multiple genome sequences.</strong></p><p><span>&bull;&nbsp;&nbsp;Danek, A., Deorowicz, S., Grabowski, Sz.,&nbsp;</span><em>Indexes of Large Genome Collections on a PC</em><span>,&nbsp;</span><a href="http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0109384">PLOS ONE,&nbsp;</a><span>2014; 9(10):e109384,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=orcom&amp;subpage=about">ORCOM</a></h3><p><strong>Experimental compressor of sequencing reads.</strong></p><p><span>&bull;&nbsp;&nbsp;Grabowski, Sz., Deorowicz, S., Roguski, L.,&nbsp;</span><em>Disk-based compression of data from genome sequencing</em><span>,&nbsp;</span><a href="http://bioinformatics.oxfordjournals.org/content/early/2014/12/22/bioinformatics.btu844.abstract">Bioinformatics,&nbsp;</a><span>2014; 31(9):1389&ndash;1395,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=pgsa&amp;subpage=about">PgSA</a></h3><p><strong>Index allowing queries for a collection of sequencing reads.</strong></p><p><span>&bull;&nbsp;&nbsp;Kowalski, T., Grabowski, Sz., Deorowicz, S.,&nbsp;</span><em>Indexing arbitrary-length k-mers in sequencing reads</em><span>,&nbsp;</span><a href="http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0133198">PLOS ONE,&nbsp;</a><span>2015; 10(7):1&ndash;16,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=quickprobs&amp;subpage=about">QuickProbs</a></h3><p><strong>Multiple sequence alignment designed especially for GPU.</strong></p><p><span>&bull;&nbsp;&nbsp;Gudys, A., Deorowicz, S.,&nbsp;</span><em>QuickProbs 2: towards rapid construction of high-quality alignments of large protein families</em><span>,&nbsp;</span><a href="http://www.nature.com/articles/srep41553">Scientific Reports,&nbsp;</a><span>2017; 7(41553):</span><br /><span>&bull;&nbsp;&nbsp;Gudys, A., Deorowicz, S.,&nbsp;</span><em>QuickProbs &ndash; A Fast Multiple Sequence Alignment Algorithm Designed for Graphics Processors</em><span>,&nbsp;</span><a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0088901">PLOS ONE,&nbsp;</a><span>2014; 9(2):e88901,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=reckoner&amp;subpage=about">RECKONER</a></h3><p><strong>Read error corrector.</strong></p><p><span>&bull;&nbsp;&nbsp;Maciej Długosz, M., Deorowicz, S.,&nbsp;</span><em>RECKONER: read error corrector based on KMC</em><span>,&nbsp;</span><a href="https://academic.oup.com/bioinformatics/article-abstract/33/7/1086/2843893/RECKONER-read-error-corrector-based-on-KMC">Bioinformatics,&nbsp;</a><span>2017; 33(7):1086&ndash;1089,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=tgc&amp;subpage=about">TGC</a></h3><p><strong>Compressor of collections of genomes given in Variant Call Format (VCF) files.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Danek, A., Grabowski, Sz.,&nbsp;</span><em>Genome compression: a novel approach for large collections</em><span>,&nbsp;</span><a href="http://bioinformatics.oxfordjournals.org/content/early/2013/08/29/bioinformatics.btt460">Bioinformatics,&nbsp;</a><span>2013; 29(20):2572&ndash;2578,</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=vcfshark&amp;subpage=about">VCFShark</a></h3><p><strong>Compressor of VCF files.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Danek, A.,&nbsp;</span><em>GTShark: Genotype compression in large projects</em><span>,&nbsp;</span><a href="https://www.biorxiv.org/content/10.1101/2020.12.18.423437v1">biorxiv.org,&nbsp;</a><span>2020; ():</span></p><h3><a href="http://sun.aei.polsl.pl/REFRESH/index.php?page=projects&amp;project=whisper&amp;subpage=about">Whisper</a></h3><p><strong>Experimental mapper of whole genome sequencing data.</strong></p><p><span>&bull;&nbsp;&nbsp;Deorowicz, S., Gudys, A.,&nbsp;</span><em>Whisper 2: indel-sensitive short read mapping</em><span>,&nbsp;</span><a href="https://doi.org/10.1101/2019.12.18.881292">bioRxiv.org,&nbsp;</a><span>2019; :</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Debudaj-Grabysz, A., Gudys, A., Grabowski, Sz.,&nbsp;</span><em>Whisper: read sorting allows robust robust mapping of DNA sequencing data</em><span>,&nbsp;</span><a href="https://doi.org/10.1093/bioinformatics/bty927">Bioinformatics,&nbsp;</a><span>2019; 35(12):2043&ndash;2050,</span><br /><span>&bull;&nbsp;&nbsp;Deorowicz, S., Debudaj-Grabysz, A., Gudys, A., Grabowski, Sz.,&nbsp;</span><em>Robust mapping of whole genome sequencing data</em><span>,&nbsp;</span><a href="https://meetings.cshl.edu/abstracts.aspx?meet=GENOME&amp;year=17">Poster at The Biology of Genomes Conference,&nbsp;</a><span>2017;</span></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43424/rest-api</guid>
	<pubDate>Mon, 04 Oct 2021 12:46:40 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43424/rest-api</link>
	<title><![CDATA[REST API]]></title>
	<description><![CDATA[<h3 id="PSIBLASTHelpandDocumentation-RESTAPI">REST API</h3><p>The&nbsp;<a href="https://www.ebi.ac.uk/seqdb/confluence/pages/viewpage.action?pageId=68165098">Representational State Transfer (REST)</a>&nbsp;sample clients are provided for a number of programming languages. For details of how to use these clients,&nbsp;<a href="https://github.com/ebi-wp/webservice-clients">download</a>&nbsp;the client and run the program without any arguments.</p><div><table><colgroup><col><col><col></colgroup>
<thead>
<tr><th scope="col">
<div>Language</div>
</th><th scope="col">
<div>Download</div>
</th><th scope="col">
<div>Requirements</div>
</th></tr>
</thead>
<tbody>
<tr><th>Perl</th>
<td><a href="https://raw.githubusercontent.com/ebi-wp/webservice-clients/master/perl/psiblast.pl">psiblast.pl</a></td>
<td><a href="http://search.cpan.org/perldoc?LWP">LWP</a>&nbsp;and&nbsp;<a href="http://search.cpan.org/perldoc?XML::Simple">XML::Simple</a></td>
</tr>
<tr><th colspan="1">
<h4 id="PSIBLASTHelpandDocumentation-Python">Python</h4>
</th>
<td colspan="1">
<p><a href="https://raw.githubusercontent.com/ebi-wp/webservice-clients/master/python/psiblast.py">psiblast.py</a></p>
</td>
<td colspan="1"><a href="https://pypi.python.org/pypi/xmltramp2/3.0.10" title="https://pypi.python.org/pypi/xmltramp2/3.0.10">xmltramp2</a></td>
</tr>
</tbody>
</table></div><p>For details see&nbsp;<a href="https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/Environment+setup+for+REST+Web+Services">Environment setup for REST Web Services</a>&nbsp;and&nbsp;<a href="https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/Examples+for+Perl+REST+Web+Services+Clients">Examples for Perl REST Web Services Clients</a>&nbsp;pages.</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43084/frequently-used-bioinformatics-tools-for-viral-genome-analysis</guid>
	<pubDate>Wed, 23 Jun 2021 07:40:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43084/frequently-used-bioinformatics-tools-for-viral-genome-analysis</link>
	<title><![CDATA[Frequently used bioinformatics tools for viral genome analysis !]]></title>
	<description><![CDATA[<p><strong>IVA: accurate de novo assembly of RNA virus genomes.</strong><br /> Hunt M, Gall A, Ong SH, Brener J, Ferns B, Goulder P, Nastouli E, Keane JA, Kellam P, Otto TD.<br /> Bioinformatics. 2015 Jul 15;31(14):2374-6. doi: <a href="http://bioinformatics.oxfordjournals.org/content/31/14/2374.long">10.1093/bioinformatics/btv120</a>. Epub 2015 Feb 28.</p><p><a href="http://www.nature.com/nmeth/journal/v9/n1/full/nmeth.1814.html"><strong>Adapter sequences</strong></a>:<br /> <strong>Optimal enzymes for amplifying sequencing libraries.</strong><br /> Quail, M. a et al. Nat. Methods 9, 10-1 (2012).</p><p><a href="http://genome.cshlp.org/content/early/2012/01/12/gr.131383.111"><strong>GAGE</strong></a>:<br /> <strong>GAGE: A critical evaluation of genome assemblies and assembly algorithms.</strong><br /> Salzberg, S. L. et al. Genome Res. 22, 557-67 (2012).</p><p><a href="http://www.biomedcentral.com/1471-2105/14/160"><strong>KMC</strong></a>:<br /> <strong>Disk-based k-mer counting on a PC.</strong><br /> Deorowicz, S., Debudaj-Grabysz, A. &amp; Grabowski, S. BMC Bioinformatics 14, 160 (2013).</p><p><a href="http://genomebiology.com/2014/15/3/R46"><strong>Kraken</strong></a>:<br /> <strong>Kraken: ultrafast metagenomic sequence classification using exact alignments.</strong><br /> Wood, D. E. &amp; Salzberg, S. L. Genome Biol. 15, R46 (2014).</p><p><a href="http://genomebiology.com/2004/5/2/r12"><strong>MUMmer</strong></a>:<br /> <strong>Versatile and open software for comparing large genomes.</strong><br /> Kurtz, S. et al. Genome Biol. 5, R12 (2004).</p><p><strong>R</strong>:<br /> <strong>R: A language and environment for statistical computing.</strong><br /> R Core Team (2013). R Foundation for Statistical Computing, Vienna, Austria. URL <a href="http://www.R-project.org/">http://www.R-project.org/</a>.</p><p><a href="http://nar.oxfordjournals.org/content/39/9/e57"><strong>RATT</strong></a>:<br /> <strong>RATT: Rapid Annotation Transfer Tool.</strong><br /> Otto, T. D., Dillon, G. P., Degrave, W. S. &amp; Berriman, M. Nucleic Acids Res. 39, e57 (2011).</p><p><a href="http://bioinformatics.oxfordjournals.org/content/25/16/2078.abstract"><strong>SAMtools</strong></a>:<br /> <strong>The Sequence Alignment/Map format and SAMtools.</strong><br /> Li, H. et al. Bioinformatics 25, 2078-9 (2009).</p><p><a href="http://bioinformatics.oxfordjournals.org/content/early/2014/04/12/bioinformatics.btu170"><strong>Trimmomatic</strong></a>:<br /> <strong>Trimmomatic: A flexible trimmer for Illumina Sequence Data.</strong><br /> Bolger, A. M., Lohse, M. &amp; Usadel, B. Bioinformatics 1-7 (2014).</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>

</channel>
</rss>