In
biology,
bioinformatics ˌbaɪ.oʊˌɪnfɚˈmætɪks is an interdisciplinary field that develops and improves upon methods for storing, retrieving, organizing and analyzing biological data. A major activity in bioinformatics is to develop software tools to generate useful biological knowledge.
Bioinformatics has become an important part of many areas of biology. In experimental molecular biology, bioinformatics techniques such as image and signal processing allow extraction of useful results from large amounts of raw data. In the field of
genetics and
genomics, it aids in sequencing and annotating genomes and their observed
mutations. It plays a role in the textual mining of biological literature and the development of biological and gene ontologies to organize and query biological data. It plays a role in the analysis of gene and protein expression and regulation. Bioinformatics tools aid in the comparison of genetic and genomic data and more generally in the understanding of evolutionary aspects of molecular biology. At a more integrative level, it helps analyze and catalogue the biological pathways and networks that are an important part of systems biology. In structural biology, it aids in the simulation and modeling of DNA, RNA, and protein structures as well as molecular interactions.
Bioinformatics uses many areas of
computer science,
mathematics and
engineering to process biological data. Complex machines are used to read in biological data at a much faster rate than before.
Databases and
information systems are used to store and organize biological data. Analyzing biological data may involve
algorithms in
artificial intelligence,
soft computing,
data mining, and
simulation. The algorithms in turn depend on theoretical foundations such as
discrete mathematics,
control theory,
system theory,
information theory, and
statistics. Commonly used software tools and technologies in the field include
Java,
C#,
XML,
Perl,
C,
C++,
Python,
R,
SQL,
CUDA,
MATLAB, and
spreadsheet applications.