github.com - With advances in Cancer Genomics, Mutation Annotation Format (MAF) is being widely accepted and used to store somatic variants detected. The Cancer Genome Atlas Project has sequenced over 30 different cancers with sample size of each cancer type...
If we only had Illumina reads, we could also assemble these using the tool Spades.
You can try this here, or try it later on your own data.
Get data
We will use the same Illumina data as we used above:
illumina_R1.fastq.gz: the Illumina...
Applications are invited for the following post of Junior Research Fellow (temporary position coterminous with the project) under DBT funded research project on ““Understanding the functions of α1β1γ1/α2β1γ1 selective AMPK Modulators in dissecting...
github.com - Phylogenomic Analysis Pipeline for Herbarium Specimens
What is PhyloHerb: PhyloHerb is a wrapper program to process genome skimming data collected from plant materials. The outcomes include the plastid genome (plastome) assemblies,...
The Regular Expression are the sole of Perl language, and for bioinformatician it is just a magical stick to resolve gingatic string data. We did not find any good and user friendly regular expression cheat sheet, hence write our own cheat...
Ever since a monk called Mendel started breeding pea plants we've been learning about our genomes. In 1953, Watson, Crick and Franklin described the structure of the molecule that makes up our genomes: the DNA double helix. Then, in 2001, scientists...
UC Davis's Bart Weimer describes foodborne pathogens and their proclivity for rapid genome rearrangement. The 100K Pathogen Genome Project he leads is using PacBio long-read sequencing to close genomes and analyze methylation; Weimer reports that...
The genetic code consists of 64 triplets of nucleotides. These triplets are called codons.With three exceptions, each codon encodes for one of the 20 amino acids used in the synthesis of proteins. That produces some redundancy in the code: most of...
www.ncbi.nlm.nih.gov - We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia...