Bio::DB::TFBS namespace has been moved to its own distribution named after itself
[bioperl-live.git] / t / data / genemark.out
blob6c1015d2d10ec3e6076993e2c9fca299eab283e8
1 GeneMark.hmm (Version 2.2a)
2 Sequence name: Hvrn.contig8
3 Sequence length: 50124 bp
4 G+C content: 44.82%
5 Matrices file: /home/software/analysis/gene-prediction/genemark/matdir/osativa.mtx (Oryza sativa)
6 Thu Mar 22 10:25:00 2001
8 Predicted genes/exons
10 Gene Exon Strand Exon           Exon Range     Exon      Start/End
11   #    #         Type                         Length       Frame
12   1     1   -  Initial       1805      2176     372          3 1
14   2     5   -  Terminal      3108      3229     122          3 2
15   2     4   -  Internal      3869      4501     633          1 2
16   2     3   -  Internal      4820      4888      69          1 2
17   2     2   -  Internal      4981      5061      81          1 2
18   2     1   -  Initial       5296      5656     361          1 1
20   3     2   -  Terminal      7171      7288     118          3 3
21   3     1   -  Initial       7540      7787     248          2 1
23   4     1   +  Single       15431     15757     327          1 3
25   5     1   +  Initial      17526     17696     171          1 3
26   5     2   +  Internal     17772     17887     116          1 2
27   5     3   +  Internal     18005     18074      70          3 3
28   5     4   +  Internal     18456     18539      84          1 3
29   5     5   +  Internal     18628     18714      87          1 3
30   5     6   +  Internal     18807     18870      64          1 1
31   5     7   +  Internal     19944     20038      95          2 3
32   5     8   +  Internal     20139     20293     155          1 2
33   5     9   +  Terminal     20779     20788      10          3 3
35   6     5   -  Terminal     23000     23061      62          3 2
36   6     4   -  Internal     23397     24101     705          1 2
37   6     3   -  Internal     24708     24821     114          1 2
38   6     2   -  Internal     25079     25356     278          1 3
39   6     1   -  Initial      26970     26977       8          2 1
41   7     3   -  Terminal     34218     34310      93          3 1
42   7     2   -  Internal     35900     36301     402          3 1
43   7     1   -  Initial      36392     36448      57          3 1
45   8     1   +  Initial      36531     37064     534          1 3
46   8     2   +  Terminal     37153     37161       9          1 3
48   9     3   -  Terminal     37880     37917      38          3 2
49   9     2   -  Internal     38938     39006      69          1 2
50   9     1   -  Initial      39080     40214    1135          1 1
52  10     2   -  Terminal     41091     41554     464          3 2
53  10     1   -  Initial      41635     41713      79          1 1
54  11     1   -  Single       41744     42061     318          3 1
56  12     1   +  Initial      42171     42212      42          1 3
57  12     2   +  Terminal     42432     42824     393          1 3
59  13     7   -  Terminal     43798     43932     135          3 1
60  13     6   -  Internal     44220     44297      78          3 1
61  13     5   -  Internal     47595     47685      91          3 3
62  13     4   -  Internal     48393     48526     134          2 1
63  13     3   -  Internal     48643     49024     382          3 3
64  13     2   -  Internal     49118     49149      32          2 1
65  13     1   -  Initial      49457     49507      51          3 1
67 Predicted gene sequence(s):
69 >Hvrn.contig8|GeneMark.hmm|gene 1|124_aa
70 MEVAVKGYADASFDTDPDDSKSQTGYVFILNGGAVSWCSSKQSVVADSRCEAEYMAALEA
71 AKEGVWMKQFMTDLGVVSSALDPLTLLCDNTRAIALAKEPRFHNKTRHIKRRFNLIRDYV
72 EGED
74 >Hvrn.contig8|GeneMark.hmm|gene 2|421_aa
75 MAHAKVTLNFNTFLEKAKLKDDGSNFVDWARNLKLLLQAGKKDYVLNVALGDEPPAAADQ
76 DAKNAWLACKEDYSVVQCAVLYGLEPGLQRCFERHGAYEMFQELKFIFQKNARIERYETS
77 ESELRKEHQVLMVNKATSFKRSGKGKKGYGSLEAQLSKYLAGKKAAKEKSENNGCSISMS
78 NIFYGHAPNVRGLFILNLDSDNTHIHNIETKRVRVNNDSAMFLWHCRLGHIGVKRMKKLH
79 TDGLLESLDFDSLDTCEPCLMGKMTKTPFSGTMERASDLLEIIHTDVCGPMSAEARGGYR
80 YFLTFIDDLSRYGYVYLMKHKSETFEKFKQFQSEVENHRNKKIKFLRSDHGGEYLSFEFG
81 AHLRQCGIVSQLTPLGTPQRNEAMVGPDSNKWLEAMKSEIGSMYGNKVWTLEVLPEGRKA
84 >Hvrn.contig8|GeneMark.hmm|gene 3|121_aa
85 MVRRQRLIYRMTSFDYRKVFGHYRECTESDEWVPNVHREGPTHPGKPIGPRGGAPALGGL
86 VGQPKRALCAKDRKSKRKKKRKRSRYFTTTGAPSRCRRTHLLIRLACWIKKAEIIIELYV
89 >Hvrn.contig8|GeneMark.hmm|gene 4|108_aa
90 MFTTPKAGGGMYLCLSVGWGIVGRRRVMSGCGQGSEMGLVGLRTRRHWAKTGRGGAAGGA
91 ASIGDGPRRAADKATLGEDGPGRGVGRGGVGRRRVASGGGDREEDEWS
93 >Hvrn.contig8|GeneMark.hmm|gene 5|283_aa
94 MDAAVQEAKLLRQVNALIVAHLRDQNLTQAAAAVAAATMTPKADASLPNHLLRLVAKGLA
95 AEREEAARGGGAPPAFDSAGGGGLARPLGTSAVDFSVQNVRGPSKTFPKHETRHISDHKN
96 VARCAKFSPDGKHFATGSGDTSIKFFEVSKIKQTMLGDSKEGPGRPVVRTFYDHVQLLTQ
97 LLVHSTDKVSSFVTNIPGTDHPVAHLYDVNTFTCFLSANPQDSSAAINQVRYSGTGSMYV
98 TASKDGSLRIWDGVSAECVRPIIGAHGSVEATSAIFTKDESGF
100 >Hvrn.contig8|GeneMark.hmm|gene 6|388_aa
101 MGSVVFLEGSEGNLQALKDTLQAYQVASAQKVNLQKSSILDGKGCRDEDKGTLKQTIGID
102 SEALSERYSGLPTVVGRLKDGSFEYVRERSKGKVSGSVGKASVALQFPSSLCARVLKARY
103 FKECTIMNTTCPNAMFWKVLSSEKWVPVAIPPVSEGPHGELASWLLRWFAEVGDPERELM
104 VHAVYGLWLARNEARDGKRIVDPRVVEENVYQHIIEWNAIHMKKPRSTTPTLAVRWSPPE
105 QGWLKANSDGALAKLRDRGGGGVVLRDHDGAYRGGACYVFRDVSDPEVVEILACRKAVHL
106 AVQTGATRVHVEVDSKGMAAMLNDQAKNLSAAGPIVEEIKLLGRTLQGFIVSRVRRSGNH
107 GAHLLAREVRSVYTHVILKQPLFDTCRL
109 >Hvrn.contig8|GeneMark.hmm|gene 7|183_aa
110 MVLTEKEAKGFVFSGPVEEAWGLHHDAQFRDLGNNLFLVHFGGEGDWKHSRNNGPWQFDF
111 MILKGYDGKTRPSEMVFDSVEAWVRVEDLPLDRRTREFGEALGNWLGEVVKVDVERDGFA
112 KGKYLRVRAKIFVYEPVVRYFNLKESVDDEVETAEGQAGPLEAEAEARRGASVSAHSFGR
115 >Hvrn.contig8|GeneMark.hmm|gene 8|180_aa
116 MASTVSPWSETPQDILGLVIDRLHSSPDHEEPRLSAAWSRFLLAVPVAAANRRGFQRARR
117 TRHSAAADRARFRAVCRSWHLAMRQHVSTPRVLPWIILSDGYFFTPSDNGCRAPRRLPSL
118 PKNARCIGSTDGWLALDCTDARNVHTYLLHNPFSDTTVPLPELDPIIANVSEFFAVRKAA
121 >Hvrn.contig8|GeneMark.hmm|gene 9|413_aa
122 MPLKFWDETFSTAVYLINRVPSRVIHNQTPLERLFGLTPNYTFLRIFGCAVWPNLRPFNK
123 HKLEYRSKQCVFIGYNYLHKGYKCLDVSTGRVYVSQDVIFDEHIFPFASLHPNAGAQLRA
124 ELVLLPPTLLNLSSPLTPSAAPNDPMAISTIYAPTSANSVQDSAGISHDFMQPNVSTDLV
125 ATENPGLHASESATAAPGAGDPPLQASGSAAAAPGSSPGFVHQPAASVGRSPASTSDPAR
126 QPDASAARPPVSDPVRPTTVATALFPASDLVRSPQEIRLQRRAPPTAPWIGRGLPRVVGP
127 PCLLPWTREISLDVVTRYRLLRLRPMQRRRCPMQRPPRLLFLLVCHLIRYLLTLRCPVVS
128 STICNPCNQHLHPLGLILGEPENLKEAIADPKWKAAMDEEFDWAGCPDDRRST
130 >Hvrn.contig8|GeneMark.hmm|gene 10|180_aa
131 MAAAGKPLDDDELVSYILQGLDSDYNPEARIDAQNGSNTNSFSINLASKGGSRNNNDTRP
132 SGPGGGNPAAYRGAGGGFFPNTLVAPPPSGGRDETCQICKRQGHATWHCFKRYDKNFNPP
133 PKRQGGGGGNNSGGGGNSSGGNTKSANTVPAAYDVDTNWYLDTGAMDHVTGELEKLAMHD
136 >Hvrn.contig8|GeneMark.hmm|gene 11|105_aa
137 MGYLDGTMAEPPAVLTTETDVAGKKEISSTPNPAHVLWYTQDQQVLTFLLASLSRDVLLQ
138 VHSLASATGVWTAIQQMFASHSRARHIQLRGQLGNTKKGDSPVAI
140 >Hvrn.contig8|GeneMark.hmm|gene 12|144_aa
141 MVELEEEDDMSMEEVALMTNNSNYLIILIRPGKGVWLPKPDTAPFNLFIDIVFLQGKLYG
142 ITQAEDLASVSIDFDDCGMPTVTTVERLIKHPPLESCEFDVWSDAGEKLEADGDMGDEDQ
143 VENGGEDHDEALNEVDARIQKENR
145 >Hvrn.contig8|GeneMark.hmm|gene 13|300_aa
146 MSTATSLWDKAALMMREELAVAAVVAGCLDMTKLYVVGAGMFSCVTVALYPVSVIKTRMQ
147 VASGEAMRRNALATFKNILKVDGVPGLYRGFGTVITGAIPARIIFLTALEKTKATSLKLV
148 EPLQLSESMEAALANGLGGLTASLCSQAVFVPIDVVSQKLMVQGYSGHVRYKGGIDVVQK
149 IMKADGPRGLYRGFGLSVMTALGRLDDKEDTPSQLKIVGVQATGGMVAGATSLEDNPLSD
150 NVPQFAETSSAGSPLEKERVRQRASATISVTRDCQCSRRPTIGGVRQLGRSLPMRRDGAT