NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and...
Peng Lab at Janelia Farm Research Campus, Howard Hughes Medical Institute focuses on data mining for bioinformatics and computational molecular biology, particularly, bioimage data mining and informatics. These bioimages include cellular and...
FYI, I've found it useful to use MUMmer to extract the specific changes that Racon makes, so I can evaluate them individually:
minimap -t 24 assembly.fasta long_reads.fastq.gz | racon -t 24 long_reads.fastq.gz - assembly.fasta...
Qualifications: Candidates must have a Ph.D. and a strong background in Molecular and Cellular Biology, protein expression, FACS, or computational biology, and ability to work collaboratively.
This position will have a significant focus on...
gwct.github.io - Modern genome sequencing technologies provide a succint measure of quality at each position in every read, however all of this information is lost in the assembly process. Referee summarizes the quality information from the reads that map to a site...
An opportunity to perform research in DST supported project that involves building of mathematical models to understand the functional relationship between circadian rhythms and memory formation under stressful condition. In this project,...
github.com - jackalope simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants...
EVOLUTIONARY AND INTEGRATIVE CELL BIOLOGY
Our research is at the crossroad between cell biology, ecological genomics, systems biology, molecular evolution and population genetics. We study the architecture and evolution of protein and signalling...
github.com - Welcome to kevlar, software for predicting de novo genetic variants without mapping reads to a reference genome! kevlar's k-mer abundance based method calls single nucleotide variants (SNVs), multinucleotide variants (MNVs),...
We are involved in the development of methods and software in chemoinformatics. Current main projects are:
1.automatic learning of chemical reactivity and metabolism,
2.simulation of NMR spectra,
3.modelling of properties of ionic liquids,...