BOL: Related items

Bioinformatician at QUB, UK

Tue, 01 Oct 2024 21:43:23 -0500

The post-holder will work under the direction of the Precision Medicine Centre of Excellence's (PMC) Bioinformatics lead and collaborate closely with the Scientific and Clinical leads. The primary responsibilities will be to develop, validate and maintain data analysis pipelines and algorithms that enable the comprehensive analysis of genomic information derived from cancer specimens, within the context of clinical studies. The PMC is an ISO 15189:2012 accredited medical laboratory (Ref 20634), providing an integrated cancer diagnostic and clinical research service that combines high throughput genomics and digital pathology (www.qub.ac.uk/research-centres/PMC).

About the person:

Essential criteria:

Hold or be about to obtain* a PhD in Computational biology, Bioinformatics, computing science or related subjects. (*must be obtained within 3 months of the closing date for the post) or MSc equivalent with at least 3 years' work experience in a relevant role.
Significant relevant research experience in genomics or work experience in a relevant technical/scientific role.
Significant experience in managing and analysing NGS data and other big data.
Experience in developing and maintaining analysis pipelines.
Experience working with Linux/UNIX environments.
Proficiency with python, bash, R and/or equivalent languages.
To be successful at shortlisting stage, please ensure you clearly evidence in your application how you meet the essential and, where applicable, desirable criteria listed in the Candidate Information document linked on our website.

More at https://hrwebapp.qub.ac.uk/tlive_webrecruitment/wrd/run/ETREC107GF.open

The Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine (CIBM)

Sun, 14 Jul 2013 12:31:38 -0500

The Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine (CIBM) is committed to shortening the process of obtaining novel discoveries to achieve distinctively better outcomes in clinical practice and translational individualised medicine.

Link @ http://www.newcastle.edu.au/research-and-innovation/centre/cibm/about-us

Postdoc in Comparative Single Cell Genomics at University of Basel

Fri, 06 Dec 2024 23:41:20 -0600

A fully funded 4-year Postdoc position is available in the lab of Patrick
Tschopp at the University of Basel, Switzerland, study the molecular and
tissue-scale dynamics during the embryonic formation of the vertebrate
skeleton and compare it across different vertebrate species with distinct
habitats.

We are looking for a highly motivated candidate with a PhD degree in
Bioinformatics or a related field. Candidates are expected to have a
strong background in evolutionary biology and/or comparative functional
genomics. Additional experiences in single cell functional genomics
analyses, statistics and computational data analyses are a plus, as is
an interest in comparative developmental (EvoDevo) questions.

We offer a dynamic and interactive research environment with state-of-the
art research facilities, good research funding and internationally
competitive salaries.

The Tschopp lab (www.evolution.unibas.ch/tschopp/research/)
studies the gene regulatory mechanisms of cell type
specification and evolution in vertebrates. See also our
preprints at https://doi.org/10.1101/2024.03.26.586769 and
https://doi.org/10.1101/2024.11.28.625862 Applications should include
a motivation letter, a CV, a list of publications, a statement about
research interests, as well as the names and contact details of at
least two referees. Applications (in the form of a single .pdf file)
should be sent to Patrick Tschopp (patrick.tschopp@unibas.ch); review
of applications will begin on January 1st 2025, and will continue until
the position is filled.

Patrick Tschopp

BioRuby :Ruby packages for biologist

Jitendra Narayan — Mon, 15 Jul 2013 01:36:28 -0500

BioRuby is a package of Open Source Ruby code, with classes for DNA and protein sequence analysis, alignment, database parsing, and other Bioinformatics tools.
BioRuby project provides an integrated environment in bioinformatics for the Ruby language. This project is supported by University of Tokyo (Human Genome Center), Kyoto University(Bioinformatics Center) and the Open Bio Foundation. The project was supported by Information-technology Promotion Agency (IPA) as an Exploratory Software Project in 2005
RubyForge is a home for open source Ruby projects: RubyForge is a home for open source Ruby projects. BioRuby project was started in late 2000, and is still in progress. Currently, there are over 80 files and 15,000 lines (except comment-only lines) in our source code. This might be equivalent to twice or more lines of other languages because of Ruby's extremely high descriptive power.

Classes for
Multiple alignment (Bio::Alignment),
Gene Ontology(Bio::GO),
PDB (Bio::PDB),
FANTOM database(Bio::FANTOM),
GFF (Bio::GFF) and KEGG
Orthology (Bio::KEGG::KO).

They also added support for many applications such as PSORT, SOSUI, TargetP, TMHMM, GenScan, ClustalW, MAFFT, and KEGG API.

Wiki Links
http://bioruby.open-bio.org/wiki/BioRubyOnRails
http://dev.bioruby.org/en/

BioRuby in Anger
http://dev.bioruby.org/en/?BioRuby+in+Anger

BioRuby RDocs
http://bioruby.org/rdoc/

BioRuby Tutorial Website
http://dev.bioruby.org/en/?Tutorial.rd

Why BioRuby Hub for BioRuby
http://www.linuxjournal.com/article/5915

Social Coding Hub for BioRuby
http://www.linuxjournal.com/article/5915

Bioinformatics on Rails: BioRuby Tutorial
http://bioinforuby.blogspot.com/2008/02/bioruby-tutorial.html

RRA BioRuby
http://raa.ruby-lang.org/project/bioruby/

BioRuby Project Discussion Group
http://portal.open-bio.org/mailman/listinfo/bioruby

BioRuby related Projects: Project tree
http://rubyforge.org/softwaremap/trove_list.php?form_cat=252

Reference
http://www.jsbi.org/journal/GIW03/GIW03P191.pdf

Mycology Research Resources for Bioinformaticians: Unlocking the Fungal Kingdom

Neel — Fri, 13 Dec 2024 11:21:45 -0600

Mycology, the study of fungi, is a field that bridges ecology, medicine, and biotechnology. With advancements in bioinformatics, researchers now have unprecedented opportunities to explore the fungal kingdom at molecular, genetic, and ecological levels. From understanding pathogenic fungi to harnessing fungal enzymes for industrial applications, the potential is vast.

To fully leverage these opportunities, bioinformaticians require specialized tools and databases. This blog highlights essential resources for mycology research, focusing on databases, tools, and platforms tailored for fungal biology.

1. Fungal Databases

1.1. MycoCosm

Website: MycoCosm
Developed by the DOE Joint Genome Institute, MycoCosm is a comprehensive portal for fungal genomics. It offers genomic and transcriptomic data for a wide range of fungi, including saprobes, pathogens, and symbionts.

Key Features: Genome browsers, comparative genomics tools, and functional annotations.
Best For: Large-scale studies on fungal evolution and ecology.

1.2. FungiDB

Website: FungiDB
FungiDB is an integrated genomic resource for fungal pathogens and non-pathogens. It provides access to genome sequences, transcriptomic data, and functional annotations.

Key Features: Advanced search options, BLAST, and pathway analysis tools.
Best For: Studying fungal pathogenesis and host-pathogen interactions.

1.3. Index Fungorum

Website: Index Fungorum
This nomenclatural database provides information on the scientific names of fungi. It’s an essential resource for taxonomists and researchers focused on fungal biodiversity.

Key Features: Taxonomic hierarchy and synonymy tracking.
Best For: Identifying and classifying fungal species.

1.4. UNITE

Website: UNITE
UNITE is a specialized database for fungal ITS (Internal Transcribed Spacer) sequences, often used in fungal identification and phylogenetics.

Key Features: Curated reference datasets and community annotations.
Best For: Environmental mycology and microbial ecology studies.

2. Analytical Tools

2.1. Funannotate

Repository: GitHub - Funannotate
Funannotate is a genome annotation tool designed for fungi. It supports tasks like gene prediction, functional annotation, and orthology analysis.

Best For: Annotating newly sequenced fungal genomes.

2.2. BUSCO (Benchmarking Universal Single-Copy Orthologs)

Website: BUSCO
BUSCO evaluates genome assembly and annotation completeness using orthologs. It includes a fungal-specific dataset.

Best For: Assessing the quality of fungal genome assemblies.

2.3. Pathogen-Host Interactions Database (PHI-base)

Website: PHI-base
PHI-base is a manually curated resource containing information on pathogen-host interactions, including fungal pathogens.

Best For: Exploring virulence factors and host-pathogen relationships.

3. Visualization Platforms

3.1. Cytoscape

Website: Cytoscape
A powerful tool for visualizing molecular interaction networks, Cytoscape can be used to study protein-protein interactions, gene networks, and metabolic pathways in fungi.

Best For: Network biology and functional genomics.

3.2. iTOL (Interactive Tree of Life)

Website: iTOL
iTOL is an interactive tool for visualizing phylogenetic trees.

Best For: Displaying fungal phylogenies and comparing evolutionary relationships.

4. Community Resources

4.1. Mycological Society of America (MSA)

Website: MSA
The MSA promotes fungal research and provides access to resources, conferences, and publications.

Best For: Networking with fungal researchers and accessing recent studies.

4.2. OpenFungi

Website: OpenFungi
OpenFungi is an open-source initiative providing fungal genomic and transcriptomic datasets for research and education.

Best For: Sharing and accessing public fungal datasets.

5. Genomics Workflows

5.1. Galaxy

Website: Galaxy Project
Galaxy offers a web-based platform for reproducible bioinformatics workflows, including tools for fungal genome and transcriptome analysis.

Best For: User-friendly analysis pipelines without requiring coding skills.

5.2. Snakemake

Repository: Snakemake
A flexible pipeline management tool that supports fungal data processing and analysis.

Best For: Custom workflows for large-scale fungal datasets.

Conclusion

Fungal research is a rapidly growing field with vast implications for medicine, agriculture, and industry. For bioinformaticians, the availability of specialized resources—databases, tools, and community platforms—opens doors to innovative discoveries. Whether you are investigating fungal genomics, studying host-pathogen interactions, or exploring fungal biodiversity, the resources outlined above will empower your research journey.

Dive into these resources and help unravel the mysteries of the fungal kingdom!

Postdoctoral position in bioinformatics @ Sweden

Sun, 14 Jul 2013 13:49:57 -0500

Information about the department
The Department of Mathematical Sciences at Chalmers University of Technology and the University of Gothenburg has about 170 faculty and staff and is the largest department of mathematical sciences in the Nordic countries. The department belongs to both Chalmers University of Technology and the University of Gothenburg (for more information see http://www.chalmers.se/math/).

Job description
We are looking for a motivated, self-driven post-doctoral researcher to work with large-scale sequence data analysis. The position is for 24 months and located at Mathematical Statistics, Department of Mathematical Sciences in Erik Kristiansson’s research group. We are focused on methods development for and analysis of next generation DNA sequencing, in particular comparative metagenomics and gene expression analysis (RNA-seq). We have strong interdisciplinary profile and are actively collaborating with several experimental groups, especially within the environmental sciences, ecology, infectious diseases and cancer genomics. More information is available at http://bioinformatics.math.chalmers.se.

The Post-doctoral position is an appointment that offers an opportunity to qualify for further research positions within academia or industry. The majority of your working time is devoted to your own research, normally as a member of a research group. Included in your work is also to take part in supervision of Ph.D. students and M.Sc thesis students. Teaching of undergraduate students may also be included to a small extent.

The employment is limited to a maximum of 2 years (1+1).

Qualifications
The applicant should have Ph.D. degree preferably in bioinformatics, mathematics, statistics, computer science or equivalent by the start of the appointment. Experience from analysis of large-scale data, in particular from next generation DNA sequencing, is highly valued. The applicant should also be proficient in programming (e.g. Python/Java/C) and comfortable with Unix/Linux systems. Interaction with experimental biologists is central and good collaborative skills are therefore important. Fluency in written and spoken English is a strong requirement. As a post-doctoral researcher you are expected to work independently and to be able to supervise/co-supervise PhD and Master’s students.

Application procedure
The application should be marked with Ref 20130126 and written in English. The application should be sent electronically via Chalmers webpage.

Application deadline: September 8, 2013.

For questions, please contact:
Ass prof. Erik Kristiansson, Matematiska Vetenskaper, erik.kristiansson@chalmers.se, +46 31-772 3521, +46 70-5259751.

Chalmers continuously strive to be an attractive employer. Equality and diversity are substantial foundations in all activities at Chalmers.

Cracking the Code: A Guide to Bioinformatics Job Hunting

Abhi — Mon, 23 Dec 2024 19:36:41 -0600

Entering the world of bioinformatics is an exciting journey, filled with opportunities to combine biology, data science, and technology to address some of the most pressing scientific challenges. However, securing a position in this competitive field can be daunting, especially for newcomers. Here’s a guide to help you navigate the job-hunting process and land your dream role in bioinformatics.

1. Understand the Landscape

Before diving into applications, take the time to understand the bioinformatics job market. Common roles include:

Bioinformatics Analyst/Scientist: Focused on data analysis and interpretation.
Computational Biologist: Combines computational techniques with biological research.
Data Scientist in Genomics: Applies machine learning and statistical models to genomic data.
Software Developer in Bioinformatics: Designs and develops tools and pipelines for biological research.

Familiarize yourself with the key industries hiring bioinformaticians, such as academia, biotech, pharmaceuticals, healthcare, and agriculture.

2. Build a Strong Foundation

Bioinformatics demands a diverse skill set. Ensure you have a solid foundation in the following areas:

Programming Skills: Proficiency in Python, R, or Perl is often required. Familiarity with tools like Bash scripting and version control systems (e.g., Git) is a plus.
Statistics and Data Analysis: Knowledge of statistical methods, machine learning, and data visualization is crucial.
Biological Knowledge: Understanding genomics, transcriptomics, and proteomics will help you communicate effectively with biologists.
Specialized Tools and Databases: Be comfortable using tools like BLAST, Bowtie, and databases like NCBI and Ensembl.

3. Create a Winning Resume and Portfolio

Highlight your technical skills, biological knowledge, and relevant experience. Tips for a standout application:

Tailor your resume to each job, emphasizing skills mentioned in the job description.
Showcase your experience with real-world datasets by linking to your GitHub profile or online portfolio.
Include details of any publications, presentations, or significant projects.

4. Network Actively

Networking is often the key to discovering opportunities. Here’s how to build connections:

Attend Conferences and Workshops: Events like ISMB or specialized bioinformatics workshops are great for meeting professionals.
Engage Online: Join LinkedIn groups, participate in bioinformatics forums, and follow relevant hashtags on Twitter.
Leverage Alumni Networks: Connect with alumni from your university who are working in the field.

5. Gain Relevant Experience

Experience is a major factor for hiring managers. Ways to enhance your profile include:

Internships: Seek out internships in research labs or biotech companies.
Collaborations: Volunteer to work on projects with professors or peers.
Open Source Contributions: Participate in bioinformatics software development on platforms like GitHub.

6. Prepare for Interviews

Bioinformatics interviews often combine technical and behavioral questions. Prepare by:

Reviewing Key Concepts: Refresh your knowledge of algorithms, sequence analysis, and statistical methods.
Practicing Coding: Be ready to solve coding challenges or discuss code snippets.
Understanding the Organization: Research their recent projects, publications, or products.
Preparing Questions: Demonstrate interest by asking about their tools, workflows, or team structure.

7. Stay Resilient and Persistent

Job hunting can be a long process, but persistence pays off. Tips to keep moving forward:

Keep improving your skills by taking online courses or certifications.
Stay updated with advancements in bioinformatics by following journals and blogs.
Apply to multiple positions and don’t get discouraged by rejections. Each application is a learning experience.

Closing Thoughts

Landing a bioinformatics job requires a mix of technical expertise, networking, and resilience. By understanding the market, showcasing your skills effectively, and continuously learning, you’ll be well on your way to a rewarding career in this dynamic field. Remember, the key to cracking the code is perseverance—stay curious, stay determined, and success will follow.

STUDENTSHIP and TRAINEESHIP @ University of Madras

Sat, 16 Nov 2013 19:27:40 -0600

Bioinformatics Infrastructure Facility
University of Madras
Chennai 600 025

Applications are invited for the STUDENTSHIP and TRAINEESHIP vacancies to carry out project/research work in the DBT - Bioinformatics Infrastructure Facility with consolidated stipend of Rs.5,000/- per month.

Essential Qualification

Student Trainee: Those who have completed M.Sc., Bioinformatics/Biophysics/Life sciences or Pursuing M.Tech., Bioinformatics/Biotechnology

Duration : 3-4 Months

Student Trainee: Those who are pursuing M.Sc Bioinformatics/Biophysics/ Life sciences/others

Duration : 2-3 Months

Mail your CV on or before 25th November 2013 to shirai2011@gmail.com and hard copy to "Dr. D. Velmurugan, Professor & Head, CAS in Crystallography and Biophysics, University of Madras, Guindy Campus, Chennai 600 025". Also, the applicants are requested to attend the interview on 29th November, 2013 at 11 A.M.

www.unom.ac.in/uploads/announcements/bifadvertisement_20131114080003_23240.pdf

What is Data Science? — A Bioinformatics Perspective

Abhi — Mon, 16 Jun 2025 01:44:34 -0500

In today’s era of big biology, we’re generating more data than ever before—genomes, transcriptomes, proteomes, metabolomes, microbiomes… you name it. But raw biological data doesn’t speak for itself. Making sense of it requires more than traditional biology. This is where data science steps in.

So, What Is Data Science?
At its core, data science is the interdisciplinary field that extracts knowledge and insights from data using programming, statistics, and domain expertise. In bioinformatics, data science enables us to turn gigabytes of sequence data into biological meaning.

Imagine trying to understand gene regulation in cancer by analyzing thousands of RNA-seq samples, or predicting antibiotic resistance from bacterial genomes—these challenges are not solvable through wet lab experiments alone. They require data-driven thinking.

Data Science Meets Bioinformatics
Bioinformatics is inherently a data science domain. From genomics to systems biology, every field in modern biology relies on data science techniques to:

Clean and process massive datasets

Discover patterns in high-dimensional data

Build predictive models (e.g., for disease classification)

Visualize complex biological networks and trends

Integrate diverse data types (e.g., transcriptomic + epigenomic data)

The Bioinformatics Toolkit
Here’s what data science typically looks like in bioinformatics:

Task Data Science Role
Sequence alignment Efficient algorithms, indexing, parallel processing
Gene expression analysis Statistical modeling (e.g., DESeq2, limma)
Variant calling Data filtering, probabilistic models
Clustering of cells in single-cell data Unsupervised learning
Protein structure prediction Deep learning models (e.g., AlphaFold)
Metagenomics Data integration, classification, dimensionality reduction

Common tools include Python, R, Bioconductor, scikit-learn, Pandas, Seurat, and TensorFlow—often working together in reproducible workflows.

It's Not Just About Coding
A common misconception is that bioinformatics is just programming or scripting. But being a data scientist in bioinformatics also means:

Understanding experimental design

Asking biologically meaningful questions

Choosing the right statistical or machine learning models

Communicating findings effectively (e.g., plots, dashboards, papers)

In other words, data science in bioinformatics is where biology, statistics, and computer science converge.

Why It Matters
The real power of data science in bioinformatics is its ability to scale discovery.

Instead of studying one gene, we can study thousands.

Instead of analyzing one species, we can explore entire ecosystems.

Instead of waiting months for lab results, we can generate hypotheses in days.

From personalized medicine and cancer diagnostics to agricultural genomics and pandemic surveillance, data science is at the heart of the bioinformatics revolution.

Final Thoughts
If you’re a biologist who’s curious about code, or a data enthusiast fascinated by life sciences, bioinformatics is your playground—and data science is your toolkit.

In bioinformatics, data science isn’t just useful. It’s essential.

Data Mining in Bioinformatics

Jitendra Narayan — Tue, 16 Jul 2013 03:21:28 -0500

Data mining, the extraction of hidden predictive information from large databases. Data mining is becoming an increasingly important tool to transform this data into information. It is commonly used in a wide range of profiling practices, such as marketing, surveillance, fraud detection and scientific discovery. Data Mining for Bioinformatics enables researchers to meet the challenge of mining vast amounts of biomolecular data to discover real knowledge. In other words, you’re a bioinformatician, and data has been dumped in your lap. Find the patterns, trend, answers, or what ever meaningful knowledge the data is hiding. They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations.This page Covering theory, algorithms, and methodologies, as well as data mining technologies. Unfortunately life is never simple. In molecular biology, it’s becoming more common to generate reams of data then ask someone in bioinformatics to produce an answer. This is exploratory data analysis, one of the most difficult things to do well. Especially if you’re thrown in at the deep end.

Data mining commonly involves four classes of tasks:

Classification - Arranges the data into predefined groups. For example, an email program might attempt to classify an email as legitimate or spam. Common algorithms include decision tree learning, nearest neighbor, naive Bayesian classification and neural networks.
Clustering - Is like classification but the groups are not predefined, so the algorithm will try to group similar items together.
Regression - Attempts to find a function which models the data with the least error.
Association rule learning - Searches for relationships between variables. For example a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes. This is sometimes referred to as market basket analysis.
From experience, I can say that is one of the most frustrating positions to be in. Data mining is a huge field and can easily be bewildering for a beginner. However, high through-put techniques in molecular biology require, more and more, that bioinformatics is required to interpret the data. Furthermore, people working in bioinformatics generally come from computer science, or biology backgrounds. Data mining, however, involves statistics to one degree or another, which means entering a field that is may not be your strong point.
Excel is fine for creating graphs. If you’re serious about data mining though, you’ll need something more heavy weight. I use R, free, and with good data mining packages such as vegan and labdsv. For beginners R can be impenetrable, I recommend this book an introduction to R as well as the underlying statistics.
Any of us can rush head on into a land of support vector machines, hidden markov models and neural networks. But coming back to the first point, what are you trying to prove? Always question what are you doing, how does it fit in to the wider picture? Try to regularly review, and keep track of where you are going? This will prevent you from falling into data mining despair.

Data Mining Resources on the net:

A laboratory of data mining and bioinformatics is headed by Prof. Ambuj Singh. There are currently seven graduate students in the research group. Our research focuses on image informatics and scalable querying and mining of graphs.For more detail visit: http://www.cs.ucsb.edu/~dbl/

Here are the materials (Lecture notes) from several past courses on data mining and/or Web mining by Stanford: For detail visit: http://infolab.stanford.edu/~ullman/mining/mining.html
Statistical Data Mining Tutorial Slides by Andrew Moore The following links point to a set of tutorials on many aspects of statistical data mining, including the foundations of probability, the foundations of statistical data analysis, and most of the classic machine learning and data mining algorithms. For detail visit: http://www.autonlab.org/tutorials/

A tutorial on Introduction to Data Mining for Discovering hidden value in your data warehouse:http://www.thearling.com/text/dmwhite/dmwhite.htm
Wiki Links: http://en.wikipedia.org/wiki/Data_mining
Bioinformatics with Clementine http://www.spss.ch/upload/1051192224_inseratClemBio.pdf
Causal Data Mining in Bioinformatics by Ioannis Tsamardinos: http://www.forth.gr/ics/bmi/In_the_News/2007/EN69-4.pdf

Report on ACM Text Mining in Bioinformatics (TMBIO 006) http://www.sigir.org/forum/2007J/2007j_sigirforum_song.pdf
BIOKDD 2002: Recent Advances in Data Mining for
Bioinformatics: http://www.acm.org/sigs/sigkdd/explorations/issue4-2/zaki.pdf

Bioinformatics and Medical Informatics:

Tools for Mining and Applying Genetic Information in Patient Care:http://www.biomedtechalliance.org/pdfs/03_03_05/03_03_05.pdf

DATA MINING OF MICROARRAY DATABASES FOR HUMAN LUNG CANCER: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.106.385&rep=rep1&type=pdf

Towards knowledge-based gene expression data mining: http://www.ailab.si/blaz/papers/2007-JBI-BellazziZupan.pdf

DRAFT Accepted for publication in 'Data Mining in Bioinformatics'
Jason Wang, Mohammed Zaki, Hannu Toivonen, and Dennis Shasha (Eds.), Springer:http://www.cs.helsinki.fi/u/htoivone/pubs/gene_mapping_by_pattern_discovery.pdf

Data Mining and Text Mining for Bioinformatics: Proceedings of the European Workshop: http://www.rok.informatik.hu-berlin.de/wbi/research/publications/2003/proceedings_ws_mining.pdf

Biological Network Analysis:

Graph Mining in Bioinformatics: http://agbs.kyb.tuebingen.mpg.de/wikis/bg/BNA-5.pdf.

Text mining in bioinformatics: http://agbs.kyb.tuebingen.mpg.de/wikis/bg/4.pdf

Some datamining books that are available on google books:

Data mining and bioinformatics: first international workshop, VDMB 2006 By Mehmet M. Dalkilic

Data mining: concepts and techniques By Jiawei Han, Micheline Kamber