BOL: Related items

Common methods to discover tandem repeats

BioStar — Thu, 09 Mar 2023 02:40:52 -0600

Tandem repeats are DNA sequences that are repeated in a contiguous manner in the genome. These sequences are often used as genetic markers and are important in many areas of genetics and genomics research. Here are some methods for discovering tandem repeats in genomes:

Tandem Repeats Finder: Tandem Repeats Finder (TRF) is a software tool that identifies tandem repeats in DNA sequences. It is freely available and works on nucleotide sequences; it is not designed for protein sequences. The tool uses a statistical model of the alignment between adjacent copies, so the repeat pattern and its size do not have to be specified in advance, and it reports features such as period size and copy number for each repeat.
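RepeatMasker: RepeatMasker is another software tool that can report tandem repeats in DNA sequences. It is designed primarily to screen for interspersed repeats and low-complexity sequence by comparing the input to a library of known repeats. Simple repeats are reported as well, but the tool is better suited to annotating known repeat families than to discovering novel tandem repeats.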
PCR-based methods: Polymerase chain reaction (PCR) can be used to amplify and detect tandem repeats in genomic DNA. PCR primers are designed to flank the tandem repeat region, and the amplified fragment is visualized on a gel, where differences in copy number appear as differences in fragment length. Because primer design requires known flanking sequence, this method is best suited to genotyping known repeat loci rather than discovering new ones.
Southern blotting: Southern blotting is a classic method for detecting DNA fragments in a sample. It can be used to detect tandem repeats by digesting genomic DNA with a restriction enzyme, separating the fragments by gel electrophoresis, and then probing the blot with a tandem repeat-specific probe.

Overall, a combination of these methods can be used to comprehensively identify tandem repeats in genomes.
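To make the idea of a computational scan concrete, here is a minimal Python sketch that finds exact (perfect) tandem repeats with a regular expression. It is an illustration only and not the algorithm used by Tandem Repeats Finder or RepeatMasker, both of which tolerate mismatches and indels between copies. The function name and its default thresholds are assumptions made for this example.

```python
import re


def find_tandem_repeats(seq, min_unit=2, max_unit=6, min_copies=3):
    """Return (start, unit, copies) for exact tandem repeats in a DNA string.

    Hypothetical helper for illustration: only perfect copies are detected,
    and only the characters A, C, G and T are considered.
    """
    # Group 1 captures the shortest candidate unit (lazy quantifier);
    # \1{k,} then requires at least k further identical copies right after it.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_copies - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        hits.append((match.start(), unit, copies))  # start is 0-based
    return hits


print(find_tandem_repeats("GGCATCATCATCATTT"))  # [(2, 'CAT', 4)]
```

A scan like this is only a first pass; real genomes contain degenerate repeats that need an alignment-based tool.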

List of motif discovery tools !

Neel — Tue, 20 Nov 2018 03:54:26 -0600

In genetics, a sequence motif is a nucleotide or amino-acid sequence pattern that is widespread and has, or is conjectured to have, a biological significance. For proteins, a sequence motif is distinguished from a structural motif, a motif formed by the three-dimensional arrangement of amino acids which may not be adjacent.
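As a small illustration of what a sequence motif looks like in practice, the Python sketch below translates a simple PROSITE-style pattern into a regular expression and scans a protein sequence with it. The pattern used, N-{P}-[ST]-{P}, is the classic N-glycosylation site signature. The helper names are assumptions for this example, the peptide is made up, and only a subset of the PROSITE syntax is handled (no `<` or `>` anchors).

```python
import re


def prosite_to_regex(pattern):
    """Translate a simple PROSITE-style pattern into a Python regex string.

    Hypothetical helper: handles x, [..], {..} and (n) / (n,m) repetitions.
    """
    parts = []
    for element in pattern.rstrip(".").split("-"):
        m = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, low, high = m.group(1), m.group(2), m.group(3)
        if core == "x":                 # x means any residue
            core = "."
        elif core.startswith("{"):      # {P} means any residue except P
            core = "[^" + core[1:-1] + "]"
        if low:                         # (n) or (n,m) repetition
            core += "{" + low + ("," + high if high else "") + "}"
        parts.append(core)
    return "".join(parts)


def find_motif(sequence, pattern):
    """Return (1-based position, matched text) for all hits, overlaps included."""
    # The lookahead makes each match zero-width so overlapping hits are found.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + 1, m.group(1)) for m in regex.finditer(sequence)]


print(prosite_to_regex("N-{P}-[ST]-{P}"))             # N[^P][ST][^P]
print(find_motif("MKNGTAANPSL", "N-{P}-[ST]-{P}"))    # [(3, 'NGTA')]
```

A pattern match alone does not prove that a site is functional; it only flags candidates for further analysis.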

The following is a list of tools for motif discovery:

2Dsweep -- protein annotation by secondary structure elements

Perform secondary structure predictions on protein sequences.

3D-footprint -- database of DNA-binding protein structures

Find binding specificity information about DNA-protein complexes.

3D-footprint: DNA-binding protein database

Find information about the binding specificity of DNA-binding proteins.

3D-partner -- a web server to infer interacting partners and binding models

Predict interacting partners and binding models.

3MOTIF -- a protein structure visualization system for conserved sequence motifs

Use this web-based sequence motif visualization system to display sequence motif information in its appropriate three-dimensional (3D) context.

AFAWE -- Automatic functional annotation in a distributed Web Services Environment

Protein function prediction and annotation in an integrated environment powered by web service.

ANCHOR -- Prediction of Protein Binding Regions in Disordered Proteins

Find information about protein binding.

ANNIE -- ANNotation and Interpretation Environment for Protein Sequences

Use to predict function from de novo protein sequences.

Active Sequences Collection (ASC) database -- A new tool to assign functions to protein sequences

Search for short active protein sequences with demonstrated biological activities.

Blocks -- Ungapped segments in conserved protein sequences

Search for ungapped segments corresponding to the most highly conserved regions of proteins.

CASTp -- computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues

Identify and measure surface accessible pockets as well as interior inaccessible cavities, for proteins and other molecules.

CSA -- The Catalytic Site Atlas

Search for catalytic residue annotation for enzymes in the Protein Data Bank.

ConFunc -- Conserved residue Protein Function Prediction Server

Predict protein function using Gene Ontology.

ConSurf-DB -- evolutionary conservation profiles of protein structures database

Automatically calculate evolutionary conservation scores of key amino acid residues and map them on protein structures.

DBAli -- A Database of Structure Alignments

Mine the protein structure space.

DILIMOT -- discovery of linear motifs in proteins

Predict short linear motifs (3-8 residues) in a set of protein sequences.

Dasty2 -- an Ajax protein DAS client

A web client for visualizing protein sequence feature information using DAS.

DomainSweep -- protein annotation by domain analysis

Identify the domain architecture within a protein sequence.

E1DS -- catalytic site prediction based on 1D signatures of concurrent conservation

Predict enzyme catalytic site.

ELM -- Eukaryotic Linear Motif Resource

Predict functional sites in eukaryotic proteins.

EXPASY Proteome Tools Collection

Use a collection of tools for protein analyses.

EXPASY-Findmod

Predict potential protein post-translational modifications and find potential single amino acid substitutions in peptides.

EzCatDB -- the Enzyme Catalytic-mechanism Database

Search for information related to the catalytic mechanisms of enzymes.

FFPred -- feature-based function prediction

An integrated feature-based function prediction server for vertebrate proteomes.

FingerPRINT Scan

Identify the closest matching PRINTS sequence motif fingerprints in a protein sequence.

FireDB -- a database of functionally important residues from proteins of known structure

Search for functional annotation of important sites in proteins with known structures.

Frog2 -- a FRee Online druG 3D conformation generator

Produce 3D conformations of small drug compounds.

HGPD -- Human Gene and Protein Database

A database presenting experiment-based results in human proteomics.

HHsenser -- exhaustive transitive profile search using HMM-HMM comparison

Conduct exhaustive intermediate profile searches of a set of homologous protein sequences.

HotSpot Wizard -- Substrate Specificity Hot Spot Identification web server

Design protein mutations in site-directed mutagenesis.

INTREPID -- INformation-theoretic TREe traversal for Protein functional site IDentification

Use for protein functional site identification.

Integrating protein annotation resources through the Distributed Annotation System

Annotate proteins using this integrated annotation resource.

InterProScan -- protein domains identifier

Identify domains, patterns, motifs, protein families, and functional sites in protein (and DNA) sequences.

KFC -- Knowledge-based FADE and Contacts

Interactive forecasting of protein interaction hot spots.

MAGIIC-PRO -- detecting functional signatures by efficient discovery of long patterns in protein sequences

Discover long patterns in protein sequences.

MALISAM -- Manual ALIgnments for Structurally Analogous Motifs

Database containing pairs of structural analogs and their alignments.

MEME -- discovering and analyzing DNA and protein sequence motifs

Find sequence patterns in DNA and protein sequences.

MODPROPEP -- a program for knowledge-based modeling of protein-peptide complexes

A web server for knowledge-based modeling of protein-peptide complexes, specifically peptides in complex with major histocompatibility complex (MHC) proteins and kinases.

MeMo -- a web tool for prediction of protein methylation modifications

Predict protein methylation sites.

MegaMotifBase -- a database of structural motifs in protein families and superfamilies

Find structural segments or motifs for protein structures.

Minimotif Miner -- a tool for investigating protein function

Find motifs in a protein sequence.

Motif3D -- Relating protein sequence motifs to 3D structure

Visualize protein sequence motifs on the 3D protein structures.

MotifScan

Find presence of any known protein motif (Prosite and Pfam) in a protein sequence.

MultiBind -- Multiple Alignment of Protein Binding Sites

Recognize spatial chemical binding patterns common to a set of protein structures.

NMT -- The MYR Predictor

Analyze proteins for the presence of N-terminal N-myristoylation site.

NetNGlyc -- N-Glycosylation sites prediction tool

Find the presence of N-Glycosylation sites in human proteins.

NetOGlyc 3.1 -- O-glycosylation sites prediction tool

Find the presence of O-GalNAc (mucin type) glycosylation sites in mammalian proteins.

NetPhos 2.0 -- Phosphorylation sites predictions

Analyze eukaryotic proteins for the presence of serine, threonine and tyrosine phosphorylation sites.

NetPhosK 1.0 Server -- kinase specific eukaryotic protein phosphorylation sites prediction tool

Find possible kinase specific phosphorylation sites in eukaryotic proteins.

NetworKIN -- a resource for exploring cellular phosphorylation networks

NeuroPred -- a tool to predict cleavage sites in neuropeptide precursors and provide the masses of the resulting peptides

Predict cleavage sites at basic amino acid locations in neuropeptide precursor sequences.

Non-Redundant Patent Sequences - Patented Sequence Database

Find information about patented nucleotide and protein sequences.

O-GLYCBASE

Search for information about glycoproteins with O-linked and C-linked glycosylation sites.

PANDORA -- Protein ANnotation Diagram ORiented Analysis

Find information about protein sequence annotations.

PAR-3D -- Protein Active site Residue - 3D structural motif

A server to predict protein active site residues.

PDBSite -- a database of the 3D structure of protein functional sites

Search for structural and functional information on the protein functional sites.

PDBSiteScan -- A program for searching for active, binding and posttranslational modification sites in the 3D structures of proteins

Search 3D protein fragments similar in structure to known active, binding and posttranslational modification sites.

PEDANT -- Protein Extraction, Description and ANalysis Tool

Conduct genome wide functional and structural analysis.

PHOSIDA -- Phosphorylation site database

Search for phosphorylation data of any protein of interest.

PHOSPHORYLATION SITE DATABASE

Search for information on prokaryotic proteins that undergo serine, threonine, or tyrosine phosphorylation.

PNU -- Protein Naming Utility

Determine correct names for proteins.

POODLE-S -- Prediction Of Order and Disorder by machine LEarning

Web application for predicting protein disorder by using physicochemical features and reduced amino acid set of a position-specific scoring matrix.

PPISearch -- Protein-Protein Interaction Search

Find homologous protein-protein interactions across multiple species.

PPSearch

Search your query sequence against PROSITE pattern database for protein motifs.

PRIDB -- Protein-RNA Interface DataBase

Find information about protein-RNA complexes from the Protein Data Bank (PDB).

PRINTS and its automatic supplement, prePRINTS -- A compendium of protein fingerprints

Search for protein fingerprints.

PROSITE

Identify protein families and domains for a given protein sequence.

PRRDB -- Pattern Recognition Receptor Database

A comprehensive database of pattern-recognition receptors and their ligands.

PatMatch -- a program for finding patterns in peptide and nucleotide sequences

Search for short nucleotide or peptide sequences such as cis-elements in nucleotide sequences or small domains and motifs in protein sequences.

PepCyber:P~PEP -- a database of human protein-protein interactions mediated by phosphoprotein-binding domains

Database specialized in documenting human PPBD-containing proteins and PPBD-mediated interactions.

PeptideCutter -- protein cleavage sites prediction tool

Predicts potential protease cleavage sites and sites cleaved by chemicals in a given protein sequence.

Phobius -- A combined transmembrane topology and signal peptide predictor

Predict combined transmembrane topology and signal peptides.

Phospho.ELM -- a database of phosphorylation sites

Search for eukaryotic phosphorylation sites.

Phospho3D -- a database of three-dimensional structures of protein phosphorylation sites

Search for 3D structure and functional annotation of phosphorylation sites in proteins.

PhosphoSite -- A bioinformatics resource dedicated to physiological protein phosphorylation.

Search the database of in vivo phosphorylation sites of human and mouse proteins

PolyQ -- Polyglutamine Database

Find information about polyglutamine (polyQ) repeats.

Pratt Protein motif and pattern discovery

Find the presence of protein motifs and patterns in an amino acid sequence.

PrediSi -- Prediction of Signal Peptides and their Cleavage Positions

Predict signal peptide sequences and their cleavage positions in bacterial and eukaryotic amino acid sequences.

ProFunc -- a server for predicting protein function from 3D structure

Predict protein functions based on known structures.

ProMateus -- an open research approach to protein-binding sites analysis

Predict the location of potential protein-protein binding sites for unbound proteins.

ProTeus -- identifying signatures in protein termini

Identify short linear signatures in protein termini.

ProtSweep -- protein annotation by homology

Analyze and identify newly obtained protein sequences.

Protemot -- prediction of protein binding sites with automatically extracted geometrical templates

Predict protein binding sites in a protein sequence based on geometrical analysis of protein tertiary substructures.

QuasiMotiFinder -- protein annotation by searching for evolutionarily conserved motif-like patterns

Search for evolutionarily conserved motif-like patterns in protein sequences.

RNABindR -- software for prediction of RNA binding residues in proteins

Web-based server for analyzing and predicting RNA binding sites in proteins.

SCANMOT -- searching for similar sequences using a simultaneous scan of multiple sequence motifs

Search for similarities between proteins by simultaneous matching of multiple motifs.

SDPpred -- A Tool for Prediction of Amino Acid Residues that Determine Differences in Functional Specificity of Homologous Proteins

Predict residues in protein sequences that determine the proteins' functional specificity.

SDR -- Specificity Determining Residues Database

Predict specificity-determining residues in protein families.

SLiMDisc -- Short, Linear Motif Discovery

Find shared motifs in proteins with a common attribute.

SUMOsp -- a web server for sumoylation site prediction

Conduct in silico sumoylation sites prediction.

SWAKK -- a web server for detecting positive selection in proteins using a sliding window substitution rate analysis

Detect regions of a protein sequence that are under positive selection.

ScanProsite

Search for motifs and patterns within protein sequences.

ScanProsite -- detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins

Detect patterns, profiles and motifs in a protein sequence.

ScanSite 2.0 -- Proteome-wide prediction of cell signaling interactions using short sequence motifs

Search for motifs within proteins that are likely to be phosphorylated by specific protein kinases or bind to domains such as SH2 domains, 14-3-3 domains or PDZ domains.

SePreSA -- SErver for the PREdiction of populations susceptible to Serious Adverse drug reaction

Find information about populations carrying polymorphisms within protein binding pockets that make them susceptible to serious adverse drug reaction (SADR).

Sequence Motif Search

Search for the presence of a motif in either an amino acid sequence or a nucleotide sequence.

Signal-3L -- A 3-layer approach for predicting signal peptides

Predict signal peptides.

SignalP -- Machine learning approaches to the prediction of signal peptides, their cleavage sites, and other protein sorting signals

Predict signal peptides and their cleavage sites.

Sulfinator -- tyrosine sulfation sites prediction tool

Predict the presence of tyrosine sulfation sites in protein sequences

SuperSite -- Ligand Binding Site Database

Look at protein structure from a ligand and binding site perspective.

Swiss EMBnet node web server

Use a collection of bioinformatics tools at this portal site.

T-REKS -- identification of Tandem REpeats in sequences with a K-meanS based algorithm

Find information about tandem repeats in proteins that carry fundamental biological functions and are related to a number of human diseases.

TMFunction -- The Functional Database of Membrane Proteins

Find information about functional residues in alpha-helical and beta-barrel membrane proteins.

TOPDOM -- Conservatively Located Domains and Motifs in Transmembrane Proteins

Database of domains and motifs with conservative location in transmembrane proteins.

The EMOTIF database

Search for highly conserved and specific protein sequence motifs.

TreeDet -- Predicting Functional Residues in Protein Sequence Alignments

Predict functional sites in protein sequence alignments using different methodologies.

W-ChIPMotifs -- ChIP-based protein Motif discovery web server

Find de novo protein motifs from chromatin immunoprecipitation data.

WebFEATURE -- an interactive web tool for identifying and visualizing functional sites on macromolecular structures

Scan query structures for functional sites in both proteins and nucleic acids.

WebProAnalyst -- an interactive tool for analysis of quantitative structure-activity relationships in protein families

Analyze quantitative structure-activity relationship of related protein families.

eBLOCKs -- enumerating conserved protein blocks to achieve maximal sensitivity and specificity

Search for ungapped alignments of highly conserved regions among a protein family or superfamily.

eF-seek -- prediction of the functional sites of proteins by searching for similar electrostatic potential and molecular surface shape

Predict the functional sites of proteins.

firestar -- prediction of functionally important residues using structural templates and alignment reliability

An expert system for predicting ligand-binding residues in protein structures.

iMOTdb -- a comprehensive collection of spatially interacting motifs in proteins

Automatically identify spatially interacting motifs among distantly related proteins sharing similar folds and possessing common ancestral lineage.

Frontend: Perl Web framework documentation - Andrej Sali Lab

Jit — Mon, 08 Jan 2018 22:32:03 -0600

The frontend is a set of Perl classes that displays the web interface, allowing a user to upload their input files, start a job, display a list of all jobs in the system, and get back job results. The main saliweb::frontend class must be subclassed for each web service. This class is then used to display the web pages using a set of CGI scripts that are set up automatically by the build system.

Address of the bookmark: https://saliweb.readthedocs.io/en/latest/frontend.html

PilonGrid: parallel wrapper around the Pilon framework

Rahul Nayak — Thu, 13 Dec 2018 09:35:40 -0600

The distribution is a parallel wrapper around the Pilon framework. The pipeline is composed of bash scripts, an example mapping.fofn that shows how to input your FASTQ files (you give paths to the R1 file), and instructions on how to launch the pipeline.

Address of the bookmark: https://github.com/skoren/PilonGrid

ALF--a simulation framework for genome evolution.

Jit — Tue, 22 Oct 2019 22:05:58 -0500

Artificial Life Framework (ALF) simulates the evolution of a root genome into a number of related genomes. Result files include the resulting gene sequences, the true tree, and the true MSAs. A description of ALF can be found in the following article:

Daniel A Dalquen, Maria Anisimova, Gaston H Gonnet, Christophe Dessimoz: ALF - A Simulation Framework for Genome Evolution. Mol Biol Evol, 29(4):1115-1123, April 2012.
http://mbe.oxfordjournals.org/content/29/4/1115

Address of the bookmark: http://alfsim.org/#index

Snakemake—a scalable bioinformatics workflow engine

Jit — Sun, 02 Sep 2018 16:32:42 -0500

Snakemake is a workflow engine that provides a readable Python-based workflow definition language and a powerful execution environment that scales from single-core workstations to compute clusters without modifying the workflow.

Address of the bookmark: https://bioconda.github.io/recipes/snakemake/README.html

Find certain files/documents in Linux OS

Rahul Nayak — Sun, 06 Apr 2014 23:56:18 -0500

As a bioinformatician, I know that we usually handle large datasets and easily get lost in huge numbers of files and folders. Tracking down a missing file requires a powerful search command. The Linux find command is one of the most important and most used commands on Linux systems. It searches for and lists the files and directories that match the conditions you specify. You can search by permissions, user, group, file type, date, size, and other criteria.

In this article we share our day-to-day experience with the Linux find command in the form of examples, covering 35 of the most used find commands. The examples are divided into five parts, from basic to advanced usage.

Part I – Basic Find Commands for Finding Files with Names
1. Find Files Using Name in Current Directory

Find all the files named gene.txt in the current working directory.

# find . -name gene.txt

./gene.txt

2. Find Files Under Home Directory

Find all the files under /home directory with name gene.txt.

# find /home -name gene.txt

/home/gene.txt

3. Find Files Using Name and Ignoring Case

Find all the files under the /home directory whose name matches gene.txt regardless of upper or lower case.

# find /home -iname gene.txt

/home/gene.txt
/home/Gene.txt

4. Find Directories Using Name

Find all directories whose name is Gene in / directory.

# find / -type d -name Gene

/Gene

5. Find fasta Files Using Name

Find all regular files named gene.fasta in the current working directory.

# find . -type f -name gene.fasta

./gene.fasta

6. Find all fasta Files in Directory

Find all fasta files in a directory.

# find . -type f -name "*.fasta"

./gene.fasta
./cancer.fasta
./allgene.fasta
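
For readers who want the same name-based searches inside a script, the following Python sketch mirrors the -name, -iname and -type f tests from Part I using only the standard library. It is an approximation rather than a reimplementation of find: the function name and its parameters are assumptions for this example, and unlike find it never reports the starting directory itself.

```python
import fnmatch
import os


def find_by_name(root, pattern, ignore_case=False, only_files=False):
    """Walk root recursively and return paths whose base name matches pattern.

    Hypothetical helper: ignore_case roughly corresponds to -iname and
    only_files to -type f (directories are skipped when it is set).
    """
    matches = []
    for dirpath, dirnames, filenames in os.walk(root):
        names = filenames if only_files else dirnames + filenames
        for name in names:
            if ignore_case:
                ok = fnmatch.fnmatchcase(name.lower(), pattern.lower())
            else:
                ok = fnmatch.fnmatchcase(name, pattern)
            if ok:
                matches.append(os.path.join(dirpath, name))
    return sorted(matches)


if __name__ == "__main__":
    # Roughly equivalent to: find . -type f -name "*.fasta"
    for path in find_by_name(".", "*.fasta", only_files=True):
        print(path)
```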

Part II – Find Files Based on their Permissions
7. Find Files With 777 Permissions

Find all the files whose permissions are 777.

# find . -type f -perm 0777 -print

8. Find Files Without 777 Permissions

Find all the files without permission 777.

# find / -type f ! -perm 777

9. Find SGID Files with 644 Permissions

Find all the SGID bit files whose permissions set to 644.

# find / -perm 2644

10. Find Sticky Bit Files with 551 Permissions

Find all the Sticky Bit set files whose permission are 551.

# find / -perm 1551

11. Find SUID Files

Find all SUID set files.

# find / -perm /u=s

12. Find SGID Files

Find all SGID set files.

# find / -perm /g+s

13. Find Read Only Files

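Find all files that their owner can read. Note that this test matches any file with the owner read bit set, including files that are also writable, so it does not select strictly read-only files.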

# find / -perm /u=r

14. Find Executable Files

Find all Executable files.

# find / -perm /a=x

15. Find Files with 777 Permissions and Chmod to 644

Find all 777 permission files and use chmod command to set permissions to 644.

# find / -type f -perm 0777 -print -exec chmod 644 {} \;

16. Find Directories with 777 Permissions and Chmod to 755

Find all 777 permission directories and use chmod command to set permissions to 755.

# find / -type d -perm 777 -print -exec chmod 755 {} \;

17. Find and remove single File

To find a single file called gene.txt and remove it.

# find . -type f -name "gene.txt" -exec rm -f {} \;

18. Find and remove Multiple File

To find and remove multiple files, such as all .fa or all .gb files, use:

# find . -type f -name "*.fa" -exec rm -f {} \;

OR

# find . -type f -name "*.gb" -exec rm -f {} \;

19. Find all Empty Files

To find all empty files under a certain path.

# find /tmp -type f -empty

20. Find all Empty Directories

To find all empty directories under a certain path.

# find /tmp -type d -empty

21. Find all Hidden Files

To find all hidden files, use below command.

# find /tmp -type f -name ".*"

Part III – Search Files Based On Owners and Groups
22. Find Single File Based on User

To find every file named gene.txt under the root directory / that is owned by user root.

# find / -user root -name gene.txt

23. Find all Files Based on User

To find all files that belong to user rahul under the /home directory.

# find /home -user rahul

24. Find all Files Based on Group

To find all files that belong to group developer under the /home directory.

# find /home -group developer

25. Find Particular Files of User

To find all .txt files owned by user rahul under the /home directory.

# find /home -user rahul -iname "*.txt"

Part IV – Find Files and Directories Based on Date and Time
26. Find Last 50 Days Modified Files

To find all the files that were last modified exactly 50 days ago (that is, between 50 and 51 full 24-hour periods before now).

# find / -mtime 50

27. Find Last 50 Days Accessed Files

To find all the files that were last accessed exactly 50 days ago (that is, between 50 and 51 full 24-hour periods before now).

# find / -atime 50

28. Find Last 50-100 Days Modified Files

To find all the files which are modified more than 50 days back and less than 100 days.

# find / -mtime +50 -mtime -100
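
The day-based tests are easy to misread, so here is a small Python sketch of how they are evaluated. It assumes the default GNU find behaviour (no -daystart): the file's age is divided into whole 24-hour periods with the fraction discarded, then compared with n. The function names are made up for this illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def age_in_days(mtime, now):
    """Whole 24-hour periods between mtime and now, fraction discarded."""
    return int((now - mtime) // SECONDS_PER_DAY)


def matches_mtime(mtime, now, test):
    """Evaluate a test written like find's -mtime argument: "50", "+50", "-100".

    Hypothetical helper, assuming the default GNU find rounding rules.
    """
    age = age_in_days(mtime, now)
    n = int(test.lstrip("+-"))
    if test.startswith("+"):
        return age > n    # strictly more than n whole days
    if test.startswith("-"):
        return age < n    # strictly fewer than n whole days
    return age == n       # exactly n whole days


now = 1_000_000_000                      # arbitrary reference timestamp
modified = now - 50.5 * SECONDS_PER_DAY  # 50 days and 12 hours ago
print(matches_mtime(modified, now, "50"))   # True
print(matches_mtime(modified, now, "+50"))  # False
```

Under these rules, combining +50 with -100 selects files whose age is between 51 and 99 whole days.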

29. Find Changed Files in Last 1 Hour

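To find all the files whose status was changed in the last hour (-cmin tests the inode change time, which is updated by content changes as well as by changes to permissions or ownership).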

# find / -cmin -60

30. Find Modified Files in Last 1 Hour

To find all the files which are modified in last 1 hour.

# find / -mmin -60

31. Find Accessed Files in Last 1 Hour

To find all the files which are accessed in last 1 hour.

# find / -amin -60

Part V – Find Files and Directories Based on Size
32. Find 50MB Files

To find all 50MB files, use.

# find / -size 50M

33. Find Size between 50MB – 100MB

To find all the files which are greater than 50MB and less than 100MB.

# find / -size +50M -size -100M

34. Find and Delete Files Larger than 100MB

To find all files larger than 100MB and delete them using one single command.

# find / -type f -size +100M -exec rm -f {} \;

35. Find Specific Files and Delete

Find all .gb files larger than 10MB and delete them using one single command. The pattern is quoted so that the shell passes it to find unexpanded.

# find / -type f -name "*.gb" -size +10M -exec rm {} \;
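
Because deletions cannot be undone, it can help to list the candidates first. The Python sketch below selects regular files by suffix and size and only deletes them when asked to. It is an illustration with made-up names, not a replacement for find, and it assumes that M means units of 1,048,576 bytes, as in GNU find.

```python
import os

MIB = 1024 * 1024  # assumed size of find's "M" unit, in bytes


def select_large_files(root, suffix, min_mib, delete=False):
    """Return files under root that end with suffix and exceed min_mib MiB.

    Hypothetical helper: with delete=False (the default) nothing is removed,
    so the list can be reviewed before a second call with delete=True.
    """
    selected = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symbolic links so only regular files are considered.
            if not name.endswith(suffix) or os.path.islink(path):
                continue
            if os.path.getsize(path) > min_mib * MIB:
                selected.append(path)
    if delete:
        for path in selected:
            os.remove(path)
    return sorted(selected)


if __name__ == "__main__":
    # Dry run: print what would be deleted, but delete nothing.
    for path in select_large_files(".", ".gb", 10):
        print(path)
```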

Π-cyc: A Reference-free SNP Discovery Application using Parallel Graph Search

Jit — Tue, 28 Jan 2020 03:34:23 -0600

Reference-free SNP search for comparative population genomics: multiple samples are run simultaneously. Note: the tool is in an experimental phase and compiles and runs with OpenMPI-1.8.8 with the Intel Compiler only.

Cycle enumeration (also known as bubble detection) as part of de novo de Bruijn graph assembly using colours can be impractical for large, error-prone genomes, because the assembly process then produces an excessive number of false-positive cycles. Our solution is to search the graph in multicore shared-memory parallel mode using graph decomposition, then apply a filtering method to generate good-quality SNPs.

https://arxiv.org/abs/1809.06700

https://github.com/redayounsi/2KP2P

/2kp2omp/bin/main_2kp2_K63_C2 -i fastq_files.txt -o fungus_bub.fasta -r stat_fungus.txt -c cov_fungus_hash.txt -k 63 -h 20 -b 100 -g 600 -l 100 -f 16 -t 5.0 -x 1 -v 0 -p 1 -y 1 -u 1

Address of the bookmark: https://github.com/redayounsi/2KP2P

gSearch: a fast and flexible general search tool for whole-genome sequencing

Jit — Mon, 06 Aug 2018 17:19:15 -0500

gSearch compares sequence variants in the Genome Variation Format (GVF) or Variant Call Format (VCF) with a pre-compiled annotation or with variants in other genomes. Its search algorithms are subsequently optimized and implemented in a multi-threaded manner.

Address of the bookmark: http://ml.ssu.ac.kr/gSearch/index.html

NGS Platforms launched by BGI’s MGI Tech

Jit — Thu, 10 Jan 2019 04:42:06 -0600

MGI Tech Co., Ltd. (MGI), a subsidiary of BGI Group, is committed to enabling effective and affordable healthcare solutions for all. Based on its proprietary technology, MGI produces sequencing devices, equipment, consumables and reagents to support life science research, medicine and healthcare. MGI's multi-omics platforms include genetic sequencing, mass spectrometry and medical imaging. Providing real-time, comprehensive, life-long solutions, its mission is to develop and promote advanced life science tools for future healthcare.

MGI announced pricing and its first early-access customer for its new ultra-high-throughput sequencer, the MGISEQ-T7, saying it has driven down sequencing cost to $5 per gigabase, with exceptionally high accuracy. Such innovations are helping more people to realize the benefits of genomic information.

In October, MGI launched the MGISEQ-T7, a highly flexible production-scale platform that is the most powerful sequencer to date. It can produce as many as 60 whole human genomes in one day. The instrument sells for $1 million.

The T7 enables simultaneous but independent operation of up to four flow cells, which means different applications such as single-cell RNA sequencing, whole exome sequencing and whole genome sequencing can be run in different flow cells at the same time. This helps to reduce costs, allowing MGI to offer the most competitive sequencing price in the market.

Powered by DNBseq™, MGISEQ delivers quality data, with SNP and indel calling accuracy of 99.9% and 99%, respectively, a duplication rate below 2 percent, and an almost zero index mis-assignment rate.

SOURCE MGI

https://www.bgi.com/global/company/news/bgis-mgi-tech-launches-two-new-ngs-platforms/

http://en.mgitech.cn/