| Home
syllabus
general information
homework
lectures
websites
Will Terzaghi's Homepage
Membership
Login |
|
|
|
week 8 homework
Biology 398INA: Topics in Bioinformatics
Homework # 8
Using 2-dimensional modeling programs
Due March 15
Please send me your answers by email. You can either create a new file, or
download the ms word file and type in your answers.
"week8homework"
Part I: Predicting RNA secondary structure
- Go to http://bioinfo.math.rpi.edu/~zukerm/
- Under research, select "An mfold manual: RNA folding
by energy minimization."
- Go to the introduction.
- What determines the activity of RNA?
- What is partially responsible for translational controls in mRNA and
replication controls in single-stranded RNA viruses?
- Why is there a need for modeling RNA secondary structure?
- Now go to Loops and Nearest neighbor
rules.
- What are free energies assigned to?
- Why are pseudoknots and base triples excluded?
- Scroll to the bottom of the page. What are DNA folding parameters based
on?
- Now go to Entrez (http://www.ncbi.nlm.nih.gov/Entrez)
and select nucleotide.
- Type in " BC005462"
- Copy the complete DNA sequence (remember that it is a cDNA).
- Go back to http://bioinfo.math.rpi.edu/~zukerm/
- Under research, select The
mfold server - Fold your RNA sequences.
- Paste your sequence into the box, select a batch job
then submit your query using the default settings (be sure to give them your
email address).
- When you get your results back,
- What percentage of the base pairs in the energy dot-plot were included
in the foldings?
- What is the free energy for structure 1?
- What is the free energy for structure 26?
- Select the jpg image for structure 1 and attach it to your homework
(you will get many structures).
Part II: Predicting Protein secondary structure
- What is protein secondary structure?
- What are the four types of secondary structures?
- How does the Chou-Fasman algorithm predict secondary structures?
- Copy the complete amino acid sequence for BC005462 and load it
into biologist's workbench.
- Run it through CHOFAS (under protein tools)
- How many alpha-helices does it predict?
- How many amino acids are in alpha-helices?
- How many beta-pleated sheets(only count ones with 5 or more aa
in a row)?
- How many amino acids are in beta-pleated sheets?
- How many of the helices overlap with beta-pleated sheets?
- How can you decide which prediction is correct?
- How does the GOR algorithm predict secondary structure?
- Run BC005462 through GOR4 (under protein tools).
- How many alpha-helices does it predict (do not count anything shorter
than 5)?
- How many beta-pleated sheets(only count ones with 5 or more aa
in a row)?
- How many of the helices overlap with beta-pleated sheets?
- Run BC005462 through PELE (under protein tools).
- How many different programs does PELE use?
- How many involve neural networks or machine learning?
- What is JOI?
- How many alpha-helices were predicted by JOI?
- How many beta-sheets?
- How many beta turns?
- How do neural networks work?
- How are they trained?
- Go to http://www.cmpharm.ucsf.edu/~nomi/nnpredict-instrucs.html
- How does nnPREDICT predict secondary structure?
- How were the weights determined?
- Now run the amino acid sequence of BC005462 through nnPREDICT at
http://www.cmpharm.ucsf.edu/~nomi/nnpredict.html
- How many alpha-helices does it predict?
- How many beta-pleated sheets?
- Now go to http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_server.html
and select references.
- How does DPM work?
- How does DSC work?
- How does HNN work?
- How does PHD work?
- How does Predator work?
- How does Simpa 96 work?
- How does SOPM work?
- Now, select NPS@ at the top of the page
- Scroll down the NPS@ to SECONDARY STRUCTURE CONSENSUS PREDICTION.
- Run the amino acid sequence of BC005462 using the defaults.
- How many alpha-helices were predicted by the consensus?
- How many beta-pleated sheets were predicted by the consensus?
- Which program was the most different from the consensus?
- Which program was the closest?
- Now go to http://cubic.bioc.columbia.edu/predictprotein/submit_def.html
and try running it through PredictProtein.
- How many alpha-helices?
- How many beta-pleated sheets?
Part III: Predicting other kinds of structures
- How does the Kyte-Doolittle algorithm predict hydrophobicity?
- Now go to Entrez (http://www.ncbi.nlm.nih.gov/Entrez)
and select nucleotide.
- Type in " NM_079687" (this is the sequence for a rhodopsin.
So far we have been dealing with a soluble protein because the secondary structure
prediction programs work better with those, but now were switching to
membrane bound ones)
- Copy the complete amino acid sequence for NM_079687 and load it
into biologist's workbench.
- Run it through GREASE.
- What does the Y-axis represent?
- How many hydrophobic domains do you predict? (Count the number
of peaks that reach at least 1.0 on the Y axis).
- Now run NM_079687 through TMAP
- How many trans-membrane domains does it predict and where are they?
- How do they compare with the hydrophobic peaks predicted by GREASE?
- Now run NM_079687 through TMHMM
- How many trans-membrane domains does it predict and where are they?
- How do they compare with the trans-membrane domains predicted by
TMAP?
- Now go to http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_server.html
- Select transmembrane helices prediction under miscellaneous
analysis tools.
- How does PHDhtm predict transmembrane domains?
- Now run NM_079687 through PHDhtm
- How many trans-membrane domains does it predict and where are they?
- How do they compare with the trans-membrane domains predicted by
TMHMM?
- Return to http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_server.html
- Select coiled-coil under miscellaneous analysis
tools.
- How does it predict coiled colis?
- Now run NM_079687 through Coiled-coil
- How many coiled-coils does it find?
- When you get back your Predict protein, you should also get output
from PROSITE Motif seaches.
- Does BC005462 have any glycosylation sites?
- Any Protein Kinase C phosphorylation sites?
- Any Casein Kinase 2 phosphorylation sites?
- Any Tyrosine Kinase phosphorylation sites?
- Any myristoylation sites?
- Any ATP binding sites?
- Now go to Entrez (http://www.ncbi.nlm.nih.gov/Entrez)
and select protein
- Type in " AAG50089" (this is the sequence for a plastocyanin).
- Copy the amino acid sequence, then go to http://www.cbs.dtu.dk/services/
and select TargetP.
- Click on abstract.
- How does targetP work?
- What percentage of plant proteins are mitochondrial?
- Go back to the TargetP server, paste your sequence into the window,
select plant under origin of sequences and perform
cleavage site predictions, then submit.
- Where does it predict plastocyanin will be localized?
- What is the probability that it has a chloroplast transit peptide?
- What is the predicted length of the transit peptide?

|