<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/44559?offset=280</link>
	<atom:link href="https://bioinformaticsonline.com/related/44559?offset=280" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44227/common-methods-to-discover-tandem-repeats</guid>
	<pubDate>Thu, 09 Mar 2023 02:40:52 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44227/common-methods-to-discover-tandem-repeats</link>
	<title><![CDATA[Common methods to discover tandem repeats]]></title>
	<description><![CDATA[<div><div><div><div><div><div><div><div><div><div><p>Tandem repeats are DNA sequences that are repeated in a contiguous manner in the genome. These sequences are often used as genetic markers and are important in many areas of genetics and genomics research. Here are some methods for discovering tandem repeats in genomes:</p><ol>
<li>
<p>Tandem Repeat Finder: Tandem Repeat Finder is a software tool that identifies tandem repeats in DNA sequences. It is available for free download and can be used on both nucleotide and protein sequences. The tool uses a statistical algorithm to identify repeats based on their length, copy number, and overall composition.</p>
</li>
<li>
<p>RepeatMasker: RepeatMasker is another software tool that can identify tandem repeats in DNA sequences. It works by comparing the input sequence to a database of known repeats and then identifies any tandem repeats that match those in the database.</p>
</li>
<li>
<p>PCR-based methods: Polymerase chain reaction (PCR) can be used to amplify and detect tandem repeats in genomic DNA. PCR primers are designed to flank the tandem repeat region, and amplification of the target DNA fragment can be visualized on a gel. This method can be useful for detecting novel tandem repeats and for genotyping.</p>
</li>
<li>
<p>Southern blotting: Southern blotting is a classic method for detecting DNA fragments in a sample. It can be used to detect tandem repeats by digesting genomic DNA with a restriction enzyme, separating the fragments by gel electrophoresis, and then probing the blot with a tandem repeat-specific probe.</p>
</li>
</ol><p>Overall, a combination of these methods can be used to comprehensively identify tandem repeats in genomes.</p></div></div></div></div></div></div></div></div></div></div>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/38238/list-of-motif-discovery-tools</guid>
	<pubDate>Tue, 20 Nov 2018 03:54:26 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/38238/list-of-motif-discovery-tools</link>
	<title><![CDATA[List of motif discovery tools !]]></title>
	<description><![CDATA[<div><div>In genetics, a sequence motif is a nucleotide or amino-acid sequence pattern that is widespread and has, or is conjectured to have, a biological significance. For proteins, a sequence motif is distinguished from a structural motif, a motif formed by the three-dimensional arrangement of amino acids which may not be adjacent.</div><div>&nbsp;</div><div>Following are the list of tools for motif discovery:</div><div>&nbsp;</div><div><a href="http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar/">2Dsweep -- protein annotation by secondary structure elements</a></div><p>Perform secondary structure predictions on protein sequences.</p></div><div><div><a href="http://floresta.eead.csic.es/3dfootprint/">3D-footprint -- database of DNA-binding protein structures</a></div><p>Find binding specificity information about DNA-protein complexes.</p></div><div><div><a href="http://floresta.eead.csic.es/3dfootprint/">3D-footprint: DNA-binding protein database</a></div><p>Find information about the binding specificity of DNA-binding proteins.</p></div><div><div><a href="http://3d-partner.life.nctu.edu.tw/">3D-partner -- a web server to infer interacting partners and binding models</a></div><p>Predict interacting partners and binding models.</p></div><div><div><a href="http://motif.stanford.edu/distributions/3motif/">3MOTIF -- a protein structure visualization system for conserved sequence motifs</a></div><p>Use this web-based sequence motif visualization system to display sequence motif information in its appropriate three-dimensional (3D) context.</p></div><div><div><a href="http://bioinfo.mpiz-koeln.mpg.de/afawe/">AFAWE -- Automatic functional annotation in a distributed Web Services Environment</a></div><p>Protein function prediction and annotation in an integrated environment powered by web service.</p></div><div><div><a href="http://anchor.enzim.hu/">ANCHOR -- Prediction of Protein Binding Regions in Disordered Proteins</a></div><p>Find information about protein binding.</p></div><div><div><a href="http://annie.bii.a-star.edu.sg/annie/home.do">ANNIE -- ANNotation and Interpretation Environment for Protein Sequences</a></div><p>Use to predict function from de novo protein sequences.</p></div><div><div><a href="http://bioinformatica.isa.cnr.it/ASC/">Active Sequences Collection (ASC) database -- A new tool to assign functions to protein sequences</a></div><p>Search for short active protein sequences with demonstrated biological activities.</p></div><div><div><a href="http://blocks.fhcrc.org/">Blocks -- Ungapped segments in conserved protein sequences</a></div><p>Search for ungapped segments corresponding to the most highly conserved regions of proteins.</p></div><div><div><a href="http://cast.engr.uic.edu/">CASTp -- computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues</a></div><p>Identify and measure surface accessible pockets as well as interior inaccessible cavities, for proteins and other molecules.</p></div><div><div><a href="http://www.ebi.ac.uk/thornton-srv/databases/CSA">CSA -- The Catalytic Site Atlas</a></div><p>To search for catalytic residue annotation for enzymes in the Protein Data Bank.</p></div><div><div><a href="http://www.sbg.bio.ic.ac.uk/~confunc/">ConFunc -- Conserved residue Protein Function Prediction Server</a></div><p>Predict protein function using Gene Ontology.</p></div><div><div><a href="http://consurf.tau.ac.il/">ConSurf-DB -- evolutionary conservation profiles of protein structures database</a></div><p>Automatically calculate evolutionary conservation scores of key amino acid residues and map them on protein structures.</p></div><div><div><a href="http://salilab.org/DBAli/">DBAli -- A Database of Structure Alignments</a></div><p>Mine the protein structure space.</p></div><div><div><a href="http://dilimot.embl.de/">DILIMOT -- discovery of linear motifs in proteins</a></div><p>Predict short linear motifs (3-8 residues) in a set of protein sequences.</p></div><div><div><a href="http://www.ebi.ac.uk/dasty/">Dasty2 -- an Ajax protein DAS client</a></div><p>A web client for visualizing protein sequence feature information using DAS.</p></div><div><div><a href="http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar/">DomainSweep -- protein annotation by domain analysis</a></div><p>Identify the domain architecture within a protein sequence.</p></div><div><div><a href="http://e1ds.csbb.ntu.edu.tw/">E1DS -- catalytic site prediction based on 1D signatures of concurrent conservation</a></div><p>Predict enzyme catalytic site.</p></div><div><div><a href="http://elm.eu.org/">ELM -- Eukarotic Linear Motif Resource</a></div><p>Predict functional sites in eukaryotic proteins.</p></div><div><div><a href="http://us.expasy.org/tools/#proteome">EXPASY Proteome Tools Collection</a></div><p>Use a collection of tools for protein analyses.</p></div><div><div><a href="http://us.expasy.org/tools/findmod/">EXPASY-Findmod</a></div><p>Predict potential protein post-translational modifications and find potential single amino acid substitutions in peptides.</p></div><div><div><a href="http://mbs.cbrc.jp/EzCatDB/">EzCatDB -- the Enzyme Catalytic-mechanism Database</a></div><p>Search for information related to the catalytic mechanisms of enzymes.</p></div><div><div><a href="http://bioinf.cs.ucl.ac.uk/ffpred/">FFPred -- feature-based function prediction</a></div><p>An integrated feature-based function prediction server for vertebrate proteomes.</p></div><div><div><a href="http://www.ebi.ac.uk/printsscan/">FingerPRINT Scan</a></div><p>Identify the closest matching PRINTS sequence motif fingerprints in a protein sequence.</p></div><div><div><a href="http://firedb.bioinfo.cnio.es/">FireDB -- a database of functionally important residues from proteins of known structure</a></div><p>Search for functional annotation of important sites in proteins with known structures.</p></div><div><div><a href="http://bioserv.rpbs.univ-paris-diderot.fr/cgi-bin/Frog2">Frog2 -- a FRee Online druG 3D conformation generator</a></div><p>Produce 3D conformations of small drug compounds.</p></div><div><div><a href="http://www.hgpd.jp/">HGPD -- Human Gene and Protein Database</a></div><p>A database presenting experiment-based results in human proteomics.</p></div><div><div><a href="http://hhsenser.tuebingen.mpg.de/">HHsenser -- exhaustive transitive profile search using HMMx96HMM comparison</a></div><p>Conduct exhaustive intermediate profile searches of a set of homologous protein sequences.</p></div><div><div><a href="http://loschmidt.chemi.muni.cz/hotspotwizard/">HotSpot Wizard -- Substrate Specificity Hot Spot Identification web server</a></div><p>Design protein mutations in site-directed mutagenesis.</p></div><div><div><a href="http://phylogenomics.berkeley.edu/intrepid/">INTREPID -- INformation-theoretic TREe traversal for Protein functional site IDentification</a></div><p>Use for protein functional site identification.</p></div><div><div><a href="http://www.cbs.dtu.dk/">Integrating protein annotation resources through the Distributed Annotation System</a></div><p>Annotate protein using this integrated annotation resource.</p></div><div><div><a href="http://www.ebi.ac.uk/InterProScan/">InterProScan -- protein domains identifier</a></div><p>Identify protein family (and DNA) domains, patterns, motifs, protein families, and functional sites.</p></div><div><div><a href="http://kfc.mitchell-lab.org/">KFC -- Knowledge-based FADE and Contacts</a></div><p>Interactive forecasting of protein interaction hot spots.</p></div><div><div><a href="http://biominer.bime.ntu.edu.tw/magiicpro/">MAGIIC-PRO -- detecting functional signatures by efficient discovery of long patterns in protein sequences</a></div><p>Discover long patterns in protein sequences.</p></div><div><div><a href="http://prodata.swmed.edu/malisam">MALISAM -- Manual ALIgnments for Structurally Analogous Motifs</a></div><p>Database containing pairs of structural analogs and their alignments.</p></div><div><div><a href="http://meme.nbcr.net/">MEME -- discovering and analyzing DNA and protein sequence motifs</a></div><p>Find sequence patterns in DNA and protein sequences.</p></div><div><div><a href="http://www.nii.res.in/modpropep.html">MODPROPEP -- a program for knowledge-based modeling of protein-peptide complexes</a></div><p>A web server for knowledge-based modeling of protein-peptide complexes, specifically peptides in complex with major histocompatibility complex (MHC) proteins and kinases.</p></div><div><div><a href="http://www.bioinfo.tsinghua.edu.cn/~tigerchen/memo.html">MeMo -- a web tool for prediction of protein methylation modifications</a></div><p>Predict protein methylation sites.</p></div><div><div><a href="http://caps.ncbs.res.in/MegaMotifbase/index.html">MegaMotifBase -- a database of structural motifs in protein families and superfamilies</a></div><p>Find structural segments or motifs for protein structures.</p></div><div><div><a href="http://mnm.engr.uconn.edu/MNM/SMSSearchServlet">Minimotif Miner -- a tool for investigating protein function</a></div><p>Find motifs in a protein sequence.</p></div><div><div><a href="http://umber.sbs.man.ac.uk/dbbrowser/motif3d/motif3d.html">Motif3D -- Relating protein sequence motifs to 3D structure</a></div><p>Visualize protein sequence motifs on the 3D protein structures.</p></div><div><div><a href="http://myhits.isb-sib.ch/cgi-bin/motif_scan">MotifScan</a></div><p>Find presence of any known protein motif (Prosite and Pfam) in a protein sequence.</p></div><div><div><a href="http://bioinfo3d.cs.tau.ac.il/MultiBind">MultiBind -- Multiple Alignment of Protein Binding Sites</a></div><p>Recognize spatial chemical binding patterns common to a set of protein structures.</p></div><div><div><a href="http://mendel.imp.univie.ac.at/myristate/SUPLpredictor.htm">NMT -- The MYR Predictor</a></div><p>Analyze proteins for the presence of N-terminal N-myristoylation site.</p></div><div><div><a href="http://www.cbs.dtu.dk/services/NetNGlyc/">NetNGlyc -- N-Glycosylation sites prediction tool</a></div><p>Find the presence of N-Glycosylation sites in human proteins.</p></div><div><div><a href="http://www.cbs.dtu.dk/services/NetOGlyc/">NetOGly 3.1 -- O-glycosylation sites prediction tool</a></div><p>Find the presence of O-GalNAc (mucin type) glycosylation sites in mammalian proteins.</p></div><div><div><a href="http://www.cbs.dtu.dk/services/NetPhos/">NetPhos 2.0 -- Phosphorylation sites predictions</a></div><p>Analyze eukaryotic proteins for the presence of serine, threonine and tyrosine phosphorylation sites.</p></div><div><div><a href="http://www.cbs.dtu.dk/services/NetPhosK/">NetPhosK 1.0 Server -- kinase specific eukaryotic protein phosphorylation sites prediction tool</a></div><p>Find possible kinase specific phosphorylation sites in eukaryotic proteins.</p></div><div><div><a href="http://networkin.info/search.php">NetworKIN -- a resource for exploring cellular phosphorylation networks</a></div><div>&nbsp;</div></div><div><div><a href="http://neuroproteomics.scs.uiuc.edu/neuropred.html">NeuroPred -- a tool to predict cleavage sites in neuropeptide precursors and provide the masses of the resulting peptides</a></div><p>Predict cleavage sites at basic amino acid locations in neuropeptide precursor sequences.</p></div><div><div><a href="http://www.ebi.ac.uk/patentdata/nr/">Non-Redundant Patent Sequences - Patented Sequence Database</a></div><p>Find information about patented nucleotide and protein sequences.</p></div><div><div><a href="http://www.cbs.dtu.dk/databases/OGLYCBASE/">O-GLYCBASE</a></div><p>Search for information about glycoproteins with O-linked and C-linked glycosylation sites.</p></div><div><div><a href="http://www.pandora.cs.huji.ac.il/">PANDORA -- Protein ANnotation Diagram ORiented Analysis</a></div><p>Find information about protein sequence annotations.</p></div><div><div><a href="http://sunserver.cdfd.org.in:8080/protease/PAR_3D/index.html">PAR-3D -- Protein Active site Residue - 3D structural motif</a></div><p>A server to predict protein active site residues.</p></div><div><div><a href="http://wwwmgs.bionet.nsc.ru/mgs/gnw/pdbsite/">PDBSite -- a database of the 3D structure of protein functional sites</a></div><p>Search for structural and functional information on the protein functional sites.</p></div><div><div><a href="http://wwwmgs.bionet.nsc.ru/mgs/systems/fastprot/pdbsitescan.html">PDBSiteScan -- A program for searching for active, binding and posttranslational modification sites in the 3D structures of proteins</a></div><p>Search 3D protein fragments similar in structure to known active, binding and posttranslational modification sites.</p></div><div><div><a href="http://pedant.gsf.de/">PEDANT -- Protein Extraction, Description and ANalysis Tool</a></div><p>Conduct genome wide functional and structural analysis.</p></div><div><div><a href="http://phosida.org/">PHOSIDA -- Phosphorylation site database</a></div><p>Search for phosphorylation data of any protein of interest.</p></div><div><div><a href="http://www.phosphorylation.biochem.vt.edu/">PHOSPHORYLATION SITE DATABASE</a></div><p>Search for information on prokaryotic proteins that undergo serine, threonine, or tyrosine phosphorylation.</p></div><div><div><a href="http://www.jcvi.org/pn-utility/web/smarty_wrapper/about.php">PNU -- Protein Naming Utility</a></div><p>Determine correct names for proteins.</p></div><div><div><a href="http://mbs.cbrc.jp/poodle/poodle-s.html">POODLE-S -- Predicition Of Order and Disorder by machine LEarning</a></div><p>Web application for predicting protein disorder by using physicochemical features and reduced amino acid set of a position-specific scoring matrix.</p></div><div><div><a href="http://gemdock.life.nctu.edu.tw/ppisearch/">PPISearch -- Protein-Protein Interaction Search</a></div><p>Find homologous protein-protein interactions across multiple species.</p></div><div><div><a href="http://www.ebi.ac.uk/ppsearch/">PPSearch</a></div><p>Search your query sequence against PROSITE pattern database for protein motifs.</p></div><div><div><a href="http://pridb.gdcb.iastate.edu/">PRIDB -- Protein-RNA Interface DataBase</a></div><p>Find information about protein-RNA complexes from the Protein Data Bank (PDB).</p></div><div><div><a href="http://umber.sbs.man.ac.uk/dbbrowser/PRINTS/">PRINTS and its automatic supplement, prePRINTS -- A compendium of protein fingerprints</a></div><p>Search for protein fingerprints.</p></div><div><div><a href="http://www.expasy.org/prosite/">PROSITE</a></div><p>Identify protein families and domains for a given protein sequence.</p></div><div><div><a href="http://www.imtech.res.in/raghava/prrdb/">PRRDB -- Pattern Recognition Receptor Database</a></div><p>A comprehensive database of pattern-recognition receptors and their ligands.</p></div><div><div><a href="http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl">PatMatch -- a program for finding patterns in peptide and nucleotide sequences</a></div><p>Search for short nucleotide or peptide sequences such as cis-elements in nucleotide sequences or small domains and motifs in protein sequences.</p></div><div><div><a href="http://pepcyber.umn.edu/PPEP/">PepCyber:P~PEP -- a database of human protein protein interactions mediated by phosphoprotein-binding domains</a></div><p>Database specialized in documenting human PPBD-containing proteins and PPBD-mediated interactions.</p></div><div><div><a href="http://us.expasy.org/tools/peptidecutter/">PeptideCutter -- protein cleavage sites prediction tool</a></div><p>Predicts potential protease cleavage sites and sites cleaved by chemicals in a given protein sequence.</p></div><div><div><a href="http://phobius.binf.ku.dk/">Phobius -- A combined transmembrane topology and signal peptide predictor</a></div><p>Predict combined transmembrane topology and signal peptides.</p></div><div><div><a href="http://phospho.elm.eu.org/">Phospho.ELM -- a database of phosphorylation sites</a></div><p>Search for eukaryotic phosphorylation sites.</p></div><div><div><a href="http://www.phospho3d.org/">Phospho3D -- a database of three-dimensional structures of protein phosphorylation sites</a></div><p>Search for 3D structure and functional annotation of phosphorylation sites in proteins.</p></div><div><div><a href="http://www.phosphosite.org/">PhosphoSite -- A bioinformatics resource dedicated to physiological protein phosphorylation.</a></div><p>Search the database of in vivo phosphorylation sites of human and mouse proteins</p></div><div><div><a href="http://pxgrid.med.monash.edu.au/polyq/">PolyQ -- Polyglutamine Database</a></div><p>Find information about polyglutamine (polyQ) repeats.</p></div><div><div><a href="http://www.ebi.ac.uk/pratt/">Pratt Protein motif and pattern discovery</a></div><p>Find the presence of protein motifs and patterns in an amino acid sequence.</p></div><div><div><a href="http://www.predisi.de/">PrediSi -- Prediction of Signal Peptides and their Cleavage Positions</a></div><p>Predict signal peptide sequences and their cleavage positions in bacterial and eukaryotic amino acid sequences.</p></div><div><div><a href="http://www.ebi.ac.uk/thornton-srv/databases/ProFunc/">ProFunc -- a server for predicting protein function from 3D structure</a></div><p>Predict protein functions based on known structures.</p></div><div><div><a href="http://bioinfo41.weizmann.ac.il/promate/promateus.html">ProMateus--an open research approach to protein-binding sites analysis</a></div><p>Predict the location of potential protein-protein binding sites for unbound proteins.</p></div><div><div><a href="http://www.proteus.cs.huji.ac.il/">ProTeus -- identifying signatures in protein termini</a></div><p>Identify short linear signatures in protein termini.</p></div><div><div><a href="http://genius.embnet.dkfz-heidelberg.de/menu/cgi-bin/w2h-open/w2h.open/w2h.startthis?SIMGO=w2h%2ewelcome">ProtSweep -- protein annotation by homology</a></div><p>Analyze and identify newly obtained protein sequences.</p></div><div><div><a href="http://protemot.csbb.ntu.edu.tw/">Protemot -- prediction of protein binding sites with automatically extracted geometrical templates</a></div><p>Predict protein binding sites in a protein sequence based on geometrical analysis of protein tertiary substructures.</p></div><div><div><a href="http://quasimotifinder.tau.ac.il/">QuasiMotiFinder -- protein annotation by searching for evolutionarily conserved motif-like patterns</a></div><p>Search for evolutionarily conserved motif-like patterns in protein sequences.</p></div><div><div><a href="http://bindr.gdcb.iastate.edu/RNABindR">RNABindR -- software for prediction of RNA binding residues in proteins</a></div><p>Web-based server for analyzing and predicting RNA binding sites in proteins.</p></div><div><div><a href="http://caps.ncbs.res.in/scanmot/scanmot.html">SCANMOT -- searching for similar sequences using a simultaneous scan of multiple sequence motifs</a></div><p>Search for similarities between proteins by simultaneous matching of multiple motifs.</p></div><div><div><a href="http://bioinf.fbb.msu.ru/SDPpred/">SDPpred -- A Tool for Prediction of Amino Acid Residues that Determine Differences in Functional Specificity of Homologous Proteins</a></div><p>Predict residues in protein sequences that determine the proteins' functional specificity.</p></div><div><div><a href="http://tamm.mit.edu/SDR/">SDR -- Specificity Determining Residues Database</a></div><p>Predict specificity-determining residues in protein families.</p></div><div><div><a href="http://bioware.ucd.ie/~slimdisc/">SLiMDisc -- Short, Linear Motif Discovery</a></div><p>Find shared motifs in proteins with a common attribute.</p></div><div><div><a href="http://sumosp.biocuckoo.org/">SUMOsp -- a web server for sumoylation site prediction</a></div><p>Conduct in silico sumoylation sites prediction.</p></div><div><div><a href="http://oxytricha.princeton.edu/SWAKK/">SWAKK -- a web server for detecting positive selection in proteins using a sliding window substitution rate analysis</a></div><p>Detect protein sequence section under positive evolution selection.</p></div><div><div><a href="http://www.expasy.org/tools/scanprosite/">ScanProsite</a></div><p>Search for motifs and patterns within protein sequences.</p></div><div><div><a href="http://www.expasy.org/tools/scanprosite/">ScanProsite -- detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins</a></div><p>Detect patterns, profiles and motifs in a protein sequence.</p></div><div><div><a href="http://scansite.mit.edu/">ScanSite 2.0 -- Proteome-wide prediction of cell signaling interactions using short sequence motifs</a></div><p>Search for motifs within proteins that are likely to be phosphorylated by specific protein kinases or bind to domains such as SH2 domains, 14-3-3 domains or PDZ domains.</p></div><div><div><a href="http://sepresa.bio-x.cn/">SePreSA -- SErver for the PREdiction of populations susceptible to Serious Adverse drug reaction</a></div><p>Find information about populations carrying polymorphisms within protein binding pockets that make them susceptible to serious adverse drug reaction (SADR).</p></div><div><div><a href="http://motif.genome.jp/">Sequence Motif Search</a></div><p>Search the presence of a motif in either amino acid sequence or nucleotide sequence.</p></div><div><div><a href="http://www.csbio.sjtu.edu.cn/bioinf/Signal-3L/">Signal-3L -- A 3-layer approach for predicting signal peptides</a></div><p>Predict signal peptides.</p></div><div><div><a href="http://www.cbs.dtu.dk/services/SignalP/">SignalP -- Machine learning approaches to the prediction of signal peptides, their cleavage sites, and other protein sorting signals</a></div><p>Predict signal peptides and their cleavage sites.</p></div><div><div><a href="http://us.expasy.org/tools/sulfinator/">Sulfinator -- tyrosine sulfation sites prediction tool</a></div><p>Predict the presence of tyrosine sulfation sites in protein sequences</p></div><div><div><a href="http://bioinf-services.charite.de/supersite/">SuperSite -- Ligand Binding Site Database</a></div><p>Look at protein structure from a ligand and binding site perspective.</p></div><div><div><a href="http://www.ch.embnet.org/">Swiss EMBnet node web server</a></div><p>Use a collection of bioinformatics tools at this portal site.</p></div><div><div><a href="http://bioinfo.montp.cnrs.fr/?r=t-reks">T-REKS -- identification of Tandem REpeats in sequences with a K-meanS based algorithm</a></div><p>Find information about tandem repeats in proteins that carry fundamental biological functions and are related to a number of human diseases.</p></div><div><div><a href="http://tmbeta-genome.cbrc.jp/TMFunction/">TMFunction -- The Functional Database of Membrane Proteins</a></div><p>Find information about functional residues in alpha-helical and beta-barrel membrane proteins.</p></div><div><div><a href="http://topdom.enzim.hu/">TOPDOM -- Conservatively Located Domains and Motifs in Transmembrane Proteins</a></div><p>Database of domains and motifs with conservative location in transmembrane proteins.</p></div><div><div><a href="http://motif.stanford.edu/distributions/emotif/">The EMOTIF database</a></div><p>Search for highly conserved and specific protein sequence motifs.</p></div><div><div><a href="http://treedetv2.bioinfo.cnio.es/treedet/index.html">TreeDet -- Predicting Functional Residues in Protein Sequence Alignments</a></div><p>Predict functional sites in protein sequence alignments use different methodologies.</p></div><div><div><a href="http://motif.bmi.ohio-state.edu/ChIPMotifs/">W-ChIPMotifs -- ChIP-based protein Motif discovery web server</a></div><p>Find de novo protein motifs from chromatin immunoprecipitation data.</p></div><div><div><a href="http://feature.stanford.edu/webfeature/">WebFEATURE -- an interactive web tool for identifying and visualizing functional sites on macromolecular structures</a></div><p>Scan query structures for functional sites in both proteins and nucleic acids.</p></div><div><div><a href="http://wwwmgs.bionet.nsc.ru/mgs/programs/panalyst/">WebProAnalyst -- an interactive tool for analysis of quantitative structurex96activity relationships in protein families</a></div><p>Analyze quantitative structure-activity relationship of related protein families.</p></div><div><div><a href="http://motif.stanford.edu/distributions/eblocks/">eBLOCKs -- enumerating conserved protein blocks to achieve maximal sensitivity and specificity</a></div><p>Search for ungapped alignments of highly conserved regions among a protein family or superfamily.</p></div><div><div><a href="http://ef-site.hgc.jp/eF-seek/">eF-seek -- prediction of the functional sites of proteins by searching for similar electrostatic potential and molecular surface shape</a></div><p>Predict the functional sites of proteins.</p></div><div><div><a href="http://firedb.bioinfo.cnio.es/Php/FireStar.php">firestar -- prediction of functionally important residues using structural templates and alignment reliability</a></div><p>An expert system for predicting ligand-binding residues in protein structures.</p></div><div><div><a href="http://caps.ncbs.res.in/imotdb/">iMOTdb -- a comprehensive collection of spatially interacting motifs in proteins</a></div><p>Automatically identify spatially interacting motifs among distantly related proteins sharing similar folds and possessing common ancestral lineage.</p></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35119/frontend-perl-web-framework-documentation-andrej-sali-lab</guid>
	<pubDate>Mon, 08 Jan 2018 22:32:03 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35119/frontend-perl-web-framework-documentation-andrej-sali-lab</link>
	<title><![CDATA[Frontend: Perl Web framework documentation - Andrej Sali Lab]]></title>
	<description><![CDATA[<p><span>The frontend is a set of Perl classes that displays the web interface, allowing a user to upload their input files, start a job, display a list of all jobs in the system, and get back job results. The main&nbsp;</span><a href="https://saliweb.readthedocs.io/en/latest/modules/frontend.html#saliwebfrontend" title="saliwebfrontend"><code><span>saliwebfrontend</span></code></a><span>&nbsp;class must be subclassed for each web service. This class is then used to display the web pages using a set of CGI scripts that are set up automatically by the build system.</span></p><p>Address of the bookmark: <a href="https://saliweb.readthedocs.io/en/latest/frontend.html" rel="nofollow">https://saliweb.readthedocs.io/en/latest/frontend.html</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38457/pilongrid-parallel-wrapper-around-the-pilon-framework</guid>
	<pubDate>Thu, 13 Dec 2018 09:35:40 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38457/pilongrid-parallel-wrapper-around-the-pilon-framework</link>
	<title><![CDATA[PilonGrid: parallel wrapper around the Pilon framework]]></title>
	<description><![CDATA[<p>The distribution is a parallel wrapper around the&nbsp;<a href="https://github.com/broadinstitute/pilon">Pilon</a>&nbsp;framework The pipeline is composed of bash scripts, an example mapping.fofn which shows how to input your fastq files (you give paths to the R1 file), and how to launch the pipeline.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://github.com/skoren/PilonGrid" rel="nofollow">https://github.com/skoren/PilonGrid</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40140/alf-a-simulation-framework-for-genome-evolution</guid>
	<pubDate>Tue, 22 Oct 2019 22:05:58 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40140/alf-a-simulation-framework-for-genome-evolution</link>
	<title><![CDATA[ALF--a simulation framework for genome evolution.]]></title>
	<description><![CDATA[<p style="color: #000000; font-size: small; font-style: normal; font-weight: 400; text-align: -webkit-left;"><span style="color: #4d4d4d; font-size: small; font-style: normal; font-weight: 400; text-align: left; background-color: #ffffff; float: none;">Artificial Life Framework (ALF)</span> simulates a root genome into a number of related genomes. Result files include the resulting gene sequences, true tree and true MSAs. A description of ALF can be found in the following article:</p>
<p style="color: #000000; font-size: small; font-style: normal; font-weight: 400; text-align: -webkit-left;">Daniel A Dalquen, Maria Anisimova, Gaston H Gonnet, Christophe Dessimoz: ALF - A Simulation Framework for Genome Evolution.<span>&nbsp;</span><em>Mol Biol Evol</em>, 29(4):1115-1123, April 2012.<br><a href="http://mbe.oxfordjournals.org/content/29/4/1115" target="_blank">http://mbe.oxfordjournals.org/content/29/4/1115</a></p><p>Address of the bookmark: <a href="http://alfsim.org/#index" rel="nofollow">http://alfsim.org/#index</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37618/snakemake%E2%80%94a-scalable-bioinformatics-workflow-engine</guid>
	<pubDate>Sun, 02 Sep 2018 16:32:42 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37618/snakemake%E2%80%94a-scalable-bioinformatics-workflow-engine</link>
	<title><![CDATA[Snakemake—a scalable bioinformatics workflow engine]]></title>
	<description><![CDATA[<p><span>Snakemake is a workflow engine that provides a readable Python-based workflow definition language and a powerful execution environment that scales from single-core workstations to compute clusters without modifying the workflow.&nbsp;</span></p><p>Address of the bookmark: <a href="https://bioconda.github.io/recipes/snakemake/README.html" rel="nofollow">https://bioconda.github.io/recipes/snakemake/README.html</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/9639/find-certain-filesdocuments-in-linux-os</guid>
	<pubDate>Sun, 06 Apr 2014 23:56:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/9639/find-certain-filesdocuments-in-linux-os</link>
	<title><![CDATA[Find certain files/documents in Linux OS]]></title>
	<description><![CDATA[<p>As bioinformatician I know the fact that we usually handle the large dataset and lost in the huge numbers of files and folders. In order to search the missing file a strong search command is required. The Linux Find Command is one of the most important and much used command in Linux sytems. Find command used to search and locate list of files and directories based on conditions you specify for files that match the arguments. Find can be used in variety of conditions like you can find files by permissions, users, groups, file type, date, size and other possible criteria.<br /><br />Through this article we are sharing our day-to-day Linux find command experience and its usage in the form of examples. In this article we will show you the most used 35 Find Commands examples in Linux. We have divided the section into Five parts from basic to advance usage of find command.</p><p><strong>Part I &ndash; Basic Find Commands for Finding Files with Names</strong><br />1. Find Files Using Name in Current Directory<br /><br />Find all the files whose name is gene.txt in a current working directory.<br /><br /># find . -name gene.txt<br /><br />./gene.txt<br /><br />2. Find Files Under Home Directory<br /><br />Find all the files under /home directory with name gene.txt.<br /><br /># find /home -name gene.txt<br /><br />/home/gene.txt<br /><br />3. Find Files Using Name and Ignoring Case<br /><br />Find all the files whose name is gene.txt and contains both capital and small letters in /home directory.<br /><br /># find /home -iname gene.txt<br /><br />./gene.txt<br />./Gene.txt<br /><br />4. Find Directories Using Name<br /><br />Find all directories whose name is Gene in / directory.<br /><br /># find / -type d -name Gene<br /><br />/Gene<br /><br />5. Find fasta Files Using Name<br /><br />Find all php files whose name is gene.fasta in a current working directory.<br /><br /># find . -type f -name gene.fasta<br /><br />./gene.fasta<br /><br />6. Find all PHP Files in Directory<br /><br />Find all fasta files in a directory.<br /><br /># find . -type f -name "*.fasta"<br /><br />./gene.fasta<br />./cancer.fasta<br />./allgene.fasta<br /><br /><strong>Part II &ndash; Find Files Based on their Permissions</strong><br />7. Find Files With 777 Permissions<br /><br />Find all the files whose permissions are 777.<br /><br /># find . -type f -perm 0777 -print<br /><br />8. Find Files Without 777 Permissions<br /><br />Find all the files without permission 777.<br /><br /># find / -type f ! -perm 777<br /><br />9. Find SGID Files with 644 Permissions<br /><br />Find all the SGID bit files whose permissions set to 644.<br /><br /># find / -perm 2644<br /><br />10. Find Sticky Bit Files with 551 Permissions<br /><br />Find all the Sticky Bit set files whose permission are 551.<br /><br /># find / -perm 1551<br /><br />11. Find SUID Files<br /><br />Find all SUID set files.<br /><br /># find / -perm /u=s<br /><br />12. Find SGID Files<br /><br />Find all SGID set files.<br /><br /># find / -perm /g+s<br /><br />13. Find Read Only Files<br /><br />Find all Read Only files.<br /><br /># find / -perm /u=r<br /><br />14. Find Executable Files<br /><br />Find all Executable files.<br /><br /># find / -perm /a=x<br /><br />15. Find Files with 777 Permissions and Chmod to 644<br /><br />Find all 777 permission files and use chmod command to set permissions to 644.<br /><br /># find / -type f -perm 0777 -print -exec chmod 644 {} \;<br /><br />16. Find Directories with 777 Permissions and Chmod to 755<br /><br />Find all 777 permission directories and use chmod command to set permissions to 755.<br /><br /># find / -type d -perm 777 -print -exec chmod 755 {} \;<br /><br />17. Find and remove single File<br /><br />To find a single file called gene.txt and remove it.<br /><br /># find . -type f -name "gene.txt" -exec rm -f {} \;<br /><br />18. Find and remove Multiple File<br /><br />To find and remove multiple files such as .fa or .gb, then use.<br /><br /># find . -type f -name "*.fa" -exec rm -f {} \;<br /><br />OR<br /><br /># find . -type f -name "*.gb" -exec rm -f {} \;<br /><br />19. Find all Empty Files<br /><br />To file all empty files under certain path.<br /><br /># find /tmp -type f -empty<br /><br />20. Find all Empty Directories<br /><br />To file all empty directories under certain path.<br /><br /># find /tmp -type d -empty<br /><br />21. File all Hidden Files<br /><br />To find all hidden files, use below command.<br /><br /># find /tmp -type f -name ".*"<br /><br /><strong>Part III &ndash; Search Files Based On Owners and Groups</strong><br />22. Find Single File Based on User<br /><br />To find all or single file called gene.txt under / root directory of owner root.<br /><br /># find / -user root -name gene.txt<br /><br />23. Find all Files Based on User<br /><br />To find all files that belongs to user Rahul under /home directory.<br /><br /># find /home -user rahul<br /><br />24. Find all Files Based on Group<br /><br />To find all files that belongs to group Developer under /home directory.<br /><br /># find /home -group developer<br /><br />25. Find Particular Files of User<br /><br />To find all .txt files of user Rahul under /home directory.<br /><br /># find /home -user rahul -iname "*.txt"<br /><br /><strong>Part IV &ndash; Find Files and Directories Based on Date and Time</strong><br />26. Find Last 50 Days Modified Files<br /><br />To find all the files which are modified 50 days back.<br /><br /># find / -mtime 50<br /><br />27. Find Last 50 Days Accessed Files<br /><br />To find all the files which are accessed 50 days back.<br /><br /># find / -atime 50<br /><br />28. Find Last 50-100 Days Modified Files<br /><br />To find all the files which are modified more than 50 days back and less than 100 days.<br /><br /># find / -mtime +50 &ndash;mtime -100<br /><br />29. Find Changed Files in Last 1 Hour<br /><br />To find all the files which are changed in last 1 hour.<br /><br /># find / -cmin -60<br /><br />30. Find Modified Files in Last 1 Hour<br /><br />To find all the files which are modified in last 1 hour.<br /><br /># find / -mmin -60<br /><br />31. Find Accessed Files in Last 1 Hour<br /><br />To find all the files which are accessed in last 1 hour.<br /><br /># find / -amin -60<br /><br /><strong>Part V &ndash; Find Files and Directories Based on Size</strong><br />32. Find 50MB Files<br /><br />To find all 50MB files, use.<br /><br /># find / -size 50M<br /><br />33. Find Size between 50MB &ndash; 100MB<br /><br />To find all the files which are greater than 50MB and less than 100MB.<br /><br /># find / -size +50M -size -100M<br /><br />34. Find and Delete 100MB Files<br /><br />To find all 100MB files and delete them using one single command.<br /><br /># find / -size +100M -exec rm -rf {} \;<br /><br />35. Find Specific Files and Delete<br /><br />Find all .gb files with more than 10MB and delete them using one single command.<br /><br /># find / -type f -name *.gb -size +10M -exec rm {} \;</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40703/%CF%80-cyc-a-reference-free-snp-discovery-application-using-parallel-graph-search</guid>
	<pubDate>Tue, 28 Jan 2020 03:34:23 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40703/%CF%80-cyc-a-reference-free-snp-discovery-application-using-parallel-graph-search</link>
	<title><![CDATA[Π-cyc: A Reference-free SNP Discovery Application using Parallel Graph Search]]></title>
	<description><![CDATA[<p>Reference free SNP search for comparative population genomics: multiple samples run simultanously. **experimental phase, compiles and runs with OpenMPI-1.8.8 with Intel Compiler only</p>
<p><span>Cycles enumeration (aka Bubbles) as part of de novo de bruijn graphs assembly using colours can be unpractical for large error prone genomes which makes the assembly process produce an excessive number of false positive cycles.&nbsp; Our solution is to search the graph in multicores shared memory parallel mode using graph decomposition then use filtering method to generate good quality SNPs.</span></p>
<p><a href="https://arxiv.org/abs/1809.06700">https://arxiv.org/abs/1809.06700</a></p>
<p><a href="https://github.com/redayounsi/2KP2P">https://github.com/redayounsi/2KP2P</a></p>
<blockquote>
<p>/2kp2omp/bin/main_2kp2_K63_C2 -i fastq_files.txt -o fungus_bub.fasta -r stat_fungus.txt -c cov_fungus_hash.txt -k 63 -h 20 -b 100 -g 600 -l 100 -f 16 -t 5.0 -x 1 -v 0 -p 1 -y 1 -u 1</p>
<p>&nbsp;</p>
</blockquote><p>Address of the bookmark: <a href="https://github.com/redayounsi/2KP2P" rel="nofollow">https://github.com/redayounsi/2KP2P</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37496/gsearch-a-fast-and-flexible-general-search-tool-for-whole-genome-sequencing</guid>
	<pubDate>Mon, 06 Aug 2018 17:19:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37496/gsearch-a-fast-and-flexible-general-search-tool-for-whole-genome-sequencing</link>
	<title><![CDATA[gSearch: a fast and flexible general search tool for whole-genome sequencing]]></title>
	<description><![CDATA[<p><span>gSearch compares sequence variants in the Genome Variation Format (GVF) or Variant Call Format (VCF) with a pre-compiled annotation or with variants in other genomes. Its search algorithms are subsequently optimized and implemented in a multi-threaded manner.&nbsp;</span></p><p>Address of the bookmark: <a href="http://ml.ssu.ac.kr/gSearch/index.html" rel="nofollow">http://ml.ssu.ac.kr/gSearch/index.html</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</guid>
	<pubDate>Thu, 10 Jan 2019 04:42:06 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</link>
	<title><![CDATA[NGS Platforms launched by BGI’s MGI Tech]]></title>
	<description><![CDATA[<p>MGI Tech Co., Ltd. (MGI), a subsidiary of BGI Group, is committed to enabling effective and affordable healthcare solutions for all. Based on its proprietary technology, MGI produces sequencing devices, equipment, consumables and reagents to support life science research, medicine and healthcare. MGI's multi-omics platforms include genetic sequencing, mass spectrometry and medical imaging. Providing real-time, comprehensive, life-long solutions, its mission&nbsp;is to&nbsp;develop and promote advanced life science tools for future healthcare.</p><p>MGI, a subsidiary of global genomics leader BGI Group, announced pricing and its first early access customer for the new ultra high-throughput sequencer, MGISEQ-T7, saying it has driven down sequencing cost to&nbsp;$5&nbsp;per gigabyte, with exceptionally high accuracy. Such innovations are helping more people to realize the benefits of genomic information.</p><p>In October, MGI launched the MGISEQ-T7, a highly flexible production-scale platform that is the most powerful sequencer to date. It can produce as many as 60 whole human genomes in one day. The instrument sells for&nbsp;$1 million.</p><p>The T7 enables simultaneous but independent operation of up to four flow cells, which means different applications such as single-cell RNA sequencing, whole exome sequencing and whole genome sequencing can be run in different flow cells at the same time. This helps to reduce costs, allowing MGI to offer the most competitive sequencing price in the market.</p><p><span>Powered by DNBseq&trade;, MGISEQ delivers quality data with accuracy for SNP and Indel calling rate of 99.9% and 99%, respectively, along with decreased duplication rate down to less than 2 percent, and almost zero Index mis-assignment rate.</span></p><p><span><span>SOURCE MGI</span></span></p><p>https://www.bgi.com/global/company/news/bgis-mgi-tech-launches-two-new-ngs-platforms/</p><p>http://en.mgitech.cn/</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>