A.A. 2015-2016
CORSO DI
BIOINFORMATICA 2
per il CLM in BIOLOGIA EVOLUZIONISTICA
Scuola di Scienze, Università di Padova
Docenti:
Prof. Giorgio Valle
Prof. Stefania Bortoluzzi
SVOLGIMENTO DEL CORSO
• I semestre del I anno
• Impegno didattico di 6 crediti:
• 32 ore di lezione frontale
• 32 ore di esercitazioni al computer
Così suddivise:
Insegnamento
Lezione
Esercitazione
Valle
16
16
Bortoluzzi
16
16
MODALITÀ D’ESAME
Valutazione finale:
• Esito delle esercitazioni
• Verifica scritta per entrambe le
parti del corso integrato
• Voto basato su media
ponderata
WEB SITE DEDICATO AL CORSO
http://compgen.bio.unipd.it/~stefania/Didattica/
ORGANIZZAZIONE
DEL CORSO
Valle
Bortoluzzi
• Database e data retrieval (biosequenze, database secondari e di
conoscenza, strutture)
• Allineamento di sequenze di acidi nucleici e proteine, matrici di
sostituzione, metodi di allineamento esatto e euristici. Ricerca di
similarità, BLAST.
• Allineamento multiplo di sequenze, Clustal Omega e Tcoffee.
• Predizione della struttura tridimensionale delle biomolecole. Folding
delle proteine (metodi ab inizio, comparative modeling e threading).
• Evoluzione molecolare, determinazione delle distanze genetiche tra
sequenze, filogenesi molecolare.
• Genome sequencing, assembly, and annotation. Genome resequencing.
Risorse Genomiche (Navigare i genomi: NCBI, UCSC Genome Browser,
ENSEMBL). Cenni su Metodi di mappaggio per l'analisi di dati NGS e
transcriptome assembly.
• Genomica comparativa
• Metagenomica
TESTI e materiali CONSIGLIATI
• Materiale lezioni
• Risorse e tutorial online
• Bioinformatica. Dalla
sequenza alla struttura
delle proteine
Stefano Pascarella e
Alessandro Paiardini
Zanichelli
2014
THE BIG DATA ERA
“Researchers need
to be obliged to
document
and
manage their data
with
as
much
professionalism as
they devote to their
experiments.”
Nature journal Issue of 4 2008
 Importance of data:
• Retrieval
• Integration
• Analysis
 An at least basic knowledge of bioinformatic
methods in unavoidable also for experimental
researchers
 Bioinformatics
from basic methods for managing biosequences to
systems biology models
NIH BIG DATA to Knowledge (BD2K)
With advances in technologies,
investigators are increasingly
generating and using large,
complex, and diverse datasets.
Consequently, the biomedical
research
enterprise
is
increasingly becoming dataintensive and data-driven.
However, the ability of researchers to locate, analyze, and use Big Data (and
more generally all biomedical and behavioral data) is often limited for reasons
related to access to relevant software and tools, expertise, and other factors.
D2K aims to develop the new approaches, standards, methods, tools, software,
and competencies that will enhance the use of biomedical Big Data by
supporting research, implementation, and training in data science and other
Importance of Bioinformatics
Deep sequencing data analysis
ORARIO E AULE
ORARIO E AULE