1 <& /page/page_title.mas, title => 'Instructions for using the <span class="secretary">SecreTary</span> signal peptide predictor' &>
3 <& /secretom/secretary/nav.mas, curr_page => 'instructions' &>
5 <div class="indented_content">
8 <span class="secretary">SecreTary</span> is for prediction of signal peptides from the amino acid
9 sequence of a protein. Signal peptides generally possess a common
10 structural design, which is characterized by a short, positively
11 charged n-region followed by a hydrophobic region (h-region) and a
12 somewhat polar and uncharged cleavage site (c-region). It should be
13 more emphasized that the h-region is known to be important for the
14 translocation of the signal peptides to the ER. To improve the
15 performance of the signal peptide prediction, we have developed new
16 parameters such as the overall hydrophobicity and amino acid
17 composition in the h-region as input. <span class="secretary">SecreTary</span> is mainly trained
18 and tested on the plant secretome that has been identified through
19 functional screens including a yeast secretion trap.
23 Questions or suggestions about <span class="secretary">SecreTary</span> should be addressed to
24 Jocelyn K.C. Rose (<a href="mailto:jr286@cornell.edu">
25 jr286@cornell.edu</a>).
30 <&| /page/info_section.mas, title => '1. Input Sequences' &>
33 The amino acid sequences can be input by pasting one or more
34 protein sequences, in <b>FASTA format</b> into the <span class="secretary">SecreTary</span> input
35 page. <span class="secretary">SecreTary</span> analyses proteins at a rate of approximate
36 80/second. At present there is a limit of 10000 sequences. All the
37 input sequences must be in one-letter amino acid code, not case-sensitive.
38 All letters of the alphabet are accepted, but <b>B J O U </b> and <b>Z</b>
39 are replaced by <b>X</b> before analysis.
43 <& /sequence/with_markup.mas,
44 seq => Bio::PrimarySeq->new(
45 -id => 'P05117_Tomato',
46 -seq => 'MVIQRNSILLLIIIFASSISTCRSNVIDDNLFKQVYDNILEQEFAHDFQAYLSYLSKNIESNNNIDKVDKNGIKVINVLSFGAKGDGKTYDNIAFEQAWNEACSSRTPVQFVVPKNKNYLLKQITFSGPCRSSISVKIFGSLEASSKISDYKDRRLWIAFDSVQNLVVGGGGTINGNGQVWWPSSCKINKSLPCRDAPTALTFWNCKNLKVNNLKSKNAQQIHIKFESCTNVVASNLMINASAKSPNTDGVHVSNTQYIQISDTIIGTGDDCISIVSGSQNVQATNITCGPGHGISIGSLGSGNSEAYVSNVTVNEAKIIGAENGVRIKTWQGGSGQASNIKFLNVEMQDVKYPIIIDQNYCDRVEPCIQQFSAVQVKNVVYENIKGTSATKVAIKFDCSTNFPCEGIIMENINLVGESGKPSEATCKNVHFNNAEHVTPHCTSLEISEDEALLYNY',
54 <&| /page/info_section.mas, title => '2. Submission' &>
56 Paste the sequence(s) into the window, or click browse to select a
57 fasta file on your disk for analysis. Then, click on the "Submit"
58 button. Expect to wait about 10-15 seconds to see the results of
59 analyzing 1000 sequences.
63 <style type="text/css">
64 .yellow { background: yellow }
65 .blue { background: #aaf }
68 <&| /page/info_section.mas, title => '3. Output' &>
71 For each sequence <span class="secretary">SecreTary</span> gives a score; sequences with scores
72 greater than or equal to 0.75 are predicted to have a signal peptide.
73 For each predicted signal peptide the
74 predicted length is shown, and the location of the hydrophobic
76 <FONT style=BACKGROUND-COLOR: #AAAAFF">blue</FONT>
77 ) within the predicted signal peptide (in
78 <FONT style=BACKGROUND-COLOR: #FFDD66">yellow</FONT>
79 ). On the input page you can choose
80 to sort the output by score, with the higher scores (strongest
81 candidates) at the top, and you can suppress output for sequences
82 with no predicted signal peptide.
88 <pre class="secretary_results">
89 Identifier SP Score Length Sequence 10 20 30 40 50 60
92 AT1G50920.1 NO 0 - MVQYNFKRITVVPNGKEFVDIILSRTQRQTPTVVHKGYKINRLRQFYMRKVKYTQTNFHAKL...
93 AT1G36960.1 NO 0 - MTRLLPYKGGDFLGPDFLTFIDLCVQVRGIPLPYLSELTVSFIAGTLGPILEMEFNQDTSTY...
94 AT1G44020.1 NO 0 - MDSESESKLISFISQLVSRNNTDSENISCMIQTISLVSSMDLKSQPKPESKLMSLVTQTISL...
95 AT1G15970.1 NO 0 - MSVPPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTPAS...
96 AT1G73440.1 NO 0 - MARGESEGESSGSERESSSSSSGNESEPTKGTISKYEKQRLSRIAENKARLDALGISKAAKA...
97 AT1G75120.1 YES 0.8875 31 <FONT style="BACKGROUND-COLOR: #FFDD66">MAVRKEKVQPFRECG</FONT><FONT style="BACKGROUND-COLOR: #AAAAFF">IAIAVLVGIFI</FONT><FONT style="BACKGROUND-COLOR: #FFDD66">GCVCT</FONT>ILIPNDFVNFRSSKVASASCESPERVKMFKA...
98 AT1G17600.1 NO 0 - MVSSSAPRVSKYDVFLSFRGEDTRKTIVSHLYAALDSRGIVTFKDDQRLEIGDHISDELHRA...
99 AT1G51380.1 NO 0 - MEGTLDEENLVFETTKGIKPIKSFDDMGMNDKVLRGVYDYGYKKPSEIQQRALVPILKGRDV...
100 AT1G77370.1 YES 0.7625 26 <FONT style="BACKGROUND-COLOR: #FFDD66">MVDQSPRR</FONT><FONT style="BACKGROUND-COLOR: #AAAAFF">VVVAALLLFVVL</FONT><FONT style="BACKGROUND-COLOR: #FFDD66">CDLSNS</FONT>AGAANSVSAFVQNAILSNKIVIFSKSYCPYCLRSKR...
101 AT1G44090.1 NO 0 - MCIYASRQTVCPYLTPFKVKRPKSREMNSSDVNFSLLQSQPNVPAEFFWPEKDVAPSEGDLD...
103 2 secreted sequences predicted out of 10.
107 <&| /page/info_section.mas, title => '4. Getting help' &>
108 Please contact: <a href="mailto:sgn-feedback@sgn.cornell.edu">sgn-feedback@sgn.cornell.edu</a>.
111 <a href="/secretom/secretary">Return to <span class="secretary">SecreTary</span> input</a>