Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho...
-
Upload
sancha-hinojos -
Category
Documents
-
view
224 -
download
0
Transcript of Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho...
![Page 1: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/1.jpg)
Introducción a la Bioinformática2002
Universidad Nacional San Cristobal de Huamanga, Ayacucho
Mirko Zimic
![Page 2: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/2.jpg)
Tópicos de interés en la bioinformática
• Análisis de secuencias
• Filogenia y evolución molecular
• Modelamiento molecular
• Plegamiento de Proteínas
• Genómica y Proteómica
• Genética estadística
• Microarreglos
• Programación científica
![Page 3: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/3.jpg)
Pongamos un ejemplo …
Cisteíno proteasa de la fasciola hepática: En busca de un péptido
inmunogénico
![Page 4: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/4.jpg)
VPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGR ISLSEQNLVDCSRPQGNAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYM KNERTSISFSEQQLVDCSRPWGN
_____ROJO_________
QGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNY KPEYSVANDTGFVDIPQREKA LMKNGCGGGLMENAYQYLKQF GLETESSYPYTAVGGQCRYNKQLG VAKVTGYYTV QSGSEVEL KN _VIOLETA____ _AMARILLO_______
AVATVGPISVAIDAGHSFQFYKSGIYYEPDCSSKDLDHGVLVVGYGFEG TDSNNNKYW IVKNSWLIGSEGPSAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTD YW IVKNSW_____ _VERDE_____
GPEWGM-GYVKMAKDRNNH CGIATAASYPTVGLSWGERGYIRMV RNRGNMCGIASLASLPMVARFP
Alineamiento: cisteíno proteasas de mamífero Vs. cisteíno
proteasa de Fasciola hepatica.
AA Idénticos AA divergentes
![Page 5: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/5.jpg)
Epítope Discontinuo, formado por porciones distantes de la secuencia.
Denaturación
El epítope se pierde con la denaturación.
![Page 6: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/6.jpg)
Denaturación
El epítope se conserva como tal.
Epítope Continuo, formado por una porción de la secuencia
![Page 7: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/7.jpg)
Modelaje tridimensional por homología. Identidad de secuencia de 56% con quimopapaína (1YAL)
![Page 8: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/8.jpg)
AA idénticos AA divergentes
Análisis de Superficie: vista de átomos por radio de van der Waals
![Page 9: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/9.jpg)
TMEGQYMKNERTSISFS
YYTVQSGSEVELKNLIGSE
QSQTCSPLRVN
RYNKQLGVAKV
Selección de secuencias (1)divergentes, (2)accesibles al solvente y (3)contínuas.
![Page 10: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/10.jpg)
Evaluación de la estabilidad conformacional de los péptidos por minimización de energía.
H2O “backbone”
TMEGQYMKNERTSISFS YYTVQSGSEVELKNLIGSE
![Page 11: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/11.jpg)
Pongamos otro ejemplo…
Sensibilidad de la aspartyl proteasa del HIV-1 a los inhibidores más
frecuentes
![Page 12: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/12.jpg)
Representación en “cartoon” de la enzima proteasa de HIV-1
![Page 13: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/13.jpg)
MONOMERO PROTEASA HIV
![Page 14: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/14.jpg)
Enzima proteasa de HIV-1 mostrando los elementos de estructura secundaria, flaps y
sitio activo
![Page 15: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/15.jpg)
Enzima proteasa de HIV-1 indicando los residuos consenso de unión inhibidor-enzima
![Page 16: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/16.jpg)
INDINAVIR
![Page 17: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/17.jpg)
RITONAVIR
![Page 18: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/18.jpg)
Asociación de indinavir a la proteasa de HIV-1
![Page 19: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/19.jpg)
Proteasa de HIV-1 mutante modelada en complejo con
Ritonavir
![Page 20: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/20.jpg)
COMPARACION ENTRE UNA ENZIMA SENSIBLE Y UNA
RESISTENTE A RITONAVIR
![Page 21: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/21.jpg)
Un ejemplo más…
Ordenamiento filogenético y el contenido de GC en tripanosomátidos
![Page 22: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/22.jpg)
Reported %GC variation for each codon position in Trypanosomatids
(Alonso et al,1992)
4 2 4 4 4 6 4 8 5 0 5 2 5 4 5 6 5 8 6 04 0
4 5
5 0
5 5
6 0
6 5
7 0
7 5
8 0
8 5
9 0
C r i t h i d i aL e i s h m a n i a
T . c r u z iT . b r u c e i
1 s t2 n d3 r d
% G Cc o d o np o s i t i o n
% G C t o t a l D N A
![Page 23: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/23.jpg)
Codon usage in Trypanosomatids leucine
0
10
20
30
40
50
60
70
TTA
CTA
TTG
CTT
CTC
CTG
TTA
CTA
TTG
CTT
CTC
CTG
TTA
CTA
TTG
CTT
CTC
CTG
TTA
CTA
TTG
CTT
CTC
CTG
T.brucei T.cruzi Leishmania Critidia
![Page 24: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/24.jpg)
Codon usage in Trypanosomatids serine
0
5
10
15
20
25
30
35
40
AG
T
TCA
TCT
TCC
AG
C
TCG
AG
T
TCA
TCT
TCC
AG
C
TCG
AG
T
TCA
TCT
TCC
AG
C
TCG
AG
T
TCA
TCT
TCC
AG
C
TCG
T.brucei T.cruzi Leishmania Critidia
![Page 25: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/25.jpg)
Phylogeny of Trypanosomatid lineage (Maslov & Simpson)
![Page 26: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/26.jpg)
“Hole” formation by DNA replication
![Page 27: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/27.jpg)
GC content variation in timeRestriction: AA family conservation
and AA conservation
![Page 28: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/28.jpg)
%GC variation in Trypanosomatid lineage(Nuclear coding DNA)
![Page 29: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/29.jpg)
GC variation in trypanosomatidae lineage Nuclear DNA
0.00
0.100.20
0.300.40
0.50
0.600.70
0.800.90
1.00
T.b
ruce
i
T.c
ruzi
Le
ishm
ani
a
Cri
thid
ia
% G
C
P1
P2
P3
P3*
P
![Page 30: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/30.jpg)
I. Proyecto Genoma Humano
La secuencia del genoma está casi completa!– aproximadamente 3.5 billones de pares de bases.
![Page 31: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/31.jpg)
All the Genes
• Any human gene can now be found in the genome by similarity searching with over 90% certainty.
• However, the sequence still has many gaps– one is unlikely to find a complete and
uninterrupted genomic segment for any gene – still can’t identify pseudogenes with certainty
• This will improve as more sequence data accumulates
![Page 32: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/32.jpg)
Raw Genome Data:
![Page 33: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/33.jpg)
The next step is obviously to locate all of the genes and describe their functions. This will probably take another 15-20 years!
![Page 34: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/34.jpg)
– so why are there 60,000 human genes on Affymetrix GeneChips?
– Why does GenBank have 49,000 gene coding sequence and UniGene have 89,000 clusters of unique ESTs?
• Clearly we are in desperate need of a theoretical framework to go with all of this data
…Algunos años atrás…Celera sostenía que sólo
habrían 30,000 genes
![Page 35: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/35.jpg)
Implications for Biomedicine
• Physicians will use genetic information to diagnose and treat disease.– Virtually all medical conditions (other than
trauma) have a genetic component.
• Faster drug development research– Individualized drugs– Gene therapy
• All Biologists will use gene sequence information in their daily work
![Page 36: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/36.jpg)
II. Bioinformatics Challenges
Lots of new sequences being added- automated sequencers
- Human Genome Project
- EST sequencing
GenBank has over 10 Billion bases and is doubling every year!!
(problem of exponential growth...)
How can computers keep up?
The huge dataset
![Page 37: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/37.jpg)
New Types of Biological Data
• Microarrays - gene expression
• Multi-level maps: genetic, physical, sequence, annotation
• Networks of Protein-protein interactions
• Cross-species relationships– Homologous genes– Chromosome organization
![Page 38: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/38.jpg)
Similarity Searching the Databanks
What is similar to my sequence?
Searching gets harder as the databases get bigger - and quality degrades
Tools: BLAST and FASTA = time saving heuristics (approximate)
Statistics + informed judgement of the biologist
![Page 39: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/39.jpg)
Alignment Alignment is the basis for finding similarity Pairwise alignment = dynamic
programming Multiple alignment: protein families and
functional domains Multiple alignment is "impossible" for lots
of sequences Another heuristic - progressive pairwise
alignment
![Page 40: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/40.jpg)
Sample Multiple Alignment
![Page 41: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/41.jpg)
Structure- Function Relationships Can we predict the function of protein
molecules from their sequence?
sequence > structure > function Conserved functional domains = motifs
Prediction of some simple 3-D structures (-helix, -sheet, membrane spanning, etc.)
![Page 42: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/42.jpg)
Protein domains
![Page 43: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/43.jpg)
DNA Sequencing Automated sequencers > 40 KB per day 500 bp reads must be assembled into
complete genes- errors especially insertions and deletions
- error rate is highest at the ends where we want to overlap the reads
- vector sequences must be removed from ends
Faster sequencing relies on better software
overlapping deletions vs. shotgun approaches: TIGR
![Page 44: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/44.jpg)
![Page 45: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/45.jpg)
Finding Genes in genome Sequence is
Not Easy • About 2% of human DNA encodes
functional genes.
• Genes are interspersed among long stretches of non-coding DNA.
• Repeats, pseudo-genes, and introns confound matters
![Page 46: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/46.jpg)
Pattern Finding Tools• It is possible to use DNA sequence patterns
to predict genes:• promoters• translational start and stop codes (ORFs)• intron splice sites• codon bias
• Can also use similarity to known genes/ESTs
![Page 47: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/47.jpg)
Phylogenetics Evolution = mutation of DNA (and
protein) sequences
Can we define evolutionary relationships between organisms by comparing DNA sequences- is there one molecular clock?- phenetic vs. cladisitic approaches- lots of methods and software, what is the
"correct" analysis?
![Page 48: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/48.jpg)
![Page 49: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/49.jpg)
II. El papel del Biólogo en la Era de la
Información
![Page 50: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/50.jpg)
El Internet provee abundante información biologica
Puede resultar abrumador…
- Web
Necesidad de nuevas habilidades = localizar información necesaria de manera eficiente
![Page 51: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/51.jpg)
Computing in the lab - everyday tasks (vs. computational biology)
ordering supplies reference books lab notes literature
searching
![Page 52: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/52.jpg)
Training "computer" scientists
Know the right tool for the job
Get the job done with tools available
Network connection is the lifeline of the scientist
Jobs change, computers change, projects change, scientists need to be adaptable
![Page 53: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/53.jpg)
The job of the biologist is changing
• As more biological information becomes available …– The biologist will spend more time using
computers– The biologist will spend more time on data
analysis (and less doing lab biochemistry)– Biology will become a more quantitative science
(think how the periodic table and atomic theory affected chemistry)
![Page 54: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/54.jpg)
III. Molecular Biology Software Tools
![Page 55: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/55.jpg)
GCG (Wisconsin Package) The most popular and most
comprehensive set of tools for the molecular biologist.
- Runs on mainframe computers: (UNIX)
- Web, X-Windows (SeqLab) interfaces
- Inexpensive for large numbers of users
- Requires local databases (on the mainframe computer)
- Allows for custom databases and programming
![Page 56: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/56.jpg)
The Web Many of the best tools are free over the Web
BLAST ENTREZ/PUBMED Protein motifs databases
Bioinformatics “service providers” DoubleTwist™, Celera, BioNavigator™
Hodgepodge collection of other tools PCR primer design Pairwise and Multiple Alignment
![Page 57: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/57.jpg)
Personal Computer Programs Macintosh and Windows applications
- Commercial: Vector NTI™, MacVector™, OMIGA™, Sequencher™
- Freeware: Phylip, Fasta, Clustal, etc.
Better graphics, easier to use Can't access very large databases or perform
demanding calculations Integration with web databases and computing
services
![Page 58: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/58.jpg)
![Page 59: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/59.jpg)
Putting it all together The current state of the art requires the
biologist to jump around from Web to mainframe to personal computer
The trend is for integration
– Web + personal computer will replace text interface to mainframe ?
– Will the Web become the ultimate interface for all computing ??
![Page 60: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/60.jpg)
IV. Genómica
![Page 61: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/61.jpg)
Genomics Technologies
• Automated DNA sequencing• Automated annotation of sequences• DNA microarrays
– gene expression (measure RNA levels)– single nucleotide polymorphisms (SNPs)
• Protein chips (SELDI, etc.)• Protein-protein interactions
![Page 62: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/62.jpg)
cDNA spotted microarrays
![Page 63: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/63.jpg)
Affymetrix Gene Chips
![Page 64: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/64.jpg)
Impact on Bioinformatics
• Genomics produces high-throughput, high-quality data, and bioinformatics provides the analysis and interpretation of these massive data sets.
• It is impossible to separate genomics laboratory technologies from the computational tools required for data analysis.
![Page 65: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/65.jpg)
Pharmacogenomics • The use of DNA sequence information to
measure and predict the reaction of individuals to drugs.
• Personalized drugs
• Faster clinical trials– Selected trail populations
• Less drug side effects– toxicogenomics
![Page 66: Introducción a la Bioinformática 2002 Universidad Nacional San Cristobal de Huamanga, Ayacucho Mirko Zimic.](https://reader036.fdocuments.es/reader036/viewer/2022062409/5665b4331a28abb57c8fea72/html5/thumbnails/66.jpg)