BLASTP 2.0MP-WashU [12-Feb-2001] [linux-i686 01:36:08 31-Jan-2001]
Copyright (C) 1996-2000 Washington University, Saint Louis, Missouri USA.
Reference: Gish, W. (1996-2000) http://blast.wustl.edu
Query= gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I, homoserine
dehydrogenase I [Escherichia coli]
4289 sequences; 1,358,990 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs:                Score  P(N)     N
gb|AAC73113.1| (AE000111) aspartokinase I, homoserine deh...    4141  0.0      1
gb|AAC76922.1| (AE000468) aspartokinase II and homoserine...     844  3.1e-86  1
gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensi...     483  2.8e-47  1
gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia c...      97  0.0010   1
>gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogenase I
Score = 4141 (1462.8 bits), Expect = 0.0, P = 0.0
Identities = 820/820 (100%), Positives = 820/820 (100%)
Query: 1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60
MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA
Sbjct: 1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60
Query: 61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120
LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA
Sbjct: 61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120
Query: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180
ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP
Sbjct: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180
Query: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV
Sbjct: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD
Sbjct: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
Query: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF
Sbjct: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
Query: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420
CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL
Sbjct: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420
Query: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480
ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL
Sbjct: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480
Query: 481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540
LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL
Sbjct: 481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540
Query: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600
VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR
Sbjct: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600
Query: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660
RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA
Sbjct: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660
Query: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720
REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA
Sbjct: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720
Query: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780
NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF
Sbjct: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780
Query: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820
YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV
Sbjct: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820
>gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydrogenase II
Score = 844 (302.2 bits), Expect = 3.1e-86, P = 3.1e-86
Identities = 222/705 (31%), Positives = 356/705 (50%)
Query: 116 DSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIA 175
D++ A ++ GE S +M+ VL +G +D E L A + VD S +
Sbjct: 119 DAVYAEVVGHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAE-RAAQPQVDEGLSYPLLQ 177
Query: 176 ASRI--PADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVY 233
+ P +V+ GF + N GE V+LGRNGSDYSA + A IW+DV GVY
Sbjct: 178 QLLVQHPGKRLVV-TGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVY 236
Query: 234 TCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGT 293
+ DPR+V DA LL + EA EL+ A VLH RT+ P++ +I ++ + P T
Sbjct: 237 SADPRKVKDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQGST 296
Query: 294 LIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSS 353
I + +++ +++ + P + + + RA++ + + +
Sbjct: 297 RIERVLASGT-GARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHN 355
Query: 354 SEYSISFCVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGIS 413
+ FC A + + E GL L + + LA++++VG G+ T +
Sbjct: 356 DRQLLQFCYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGV-TRNPLH 407
Query: 414 A-KFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIG 472
+F+ L + Q S+ V+ + ++ HQ +F ++ I + + G
Sbjct: 408 CHRFWQQLKGQPVEFTW--QSDDGISLVAVLRTGPTESLIQGLHQSVFRAEKRIGLVLFG 465
Query: 473 VGGVGGALLEQLKRQQSWLKNKH-IDLRVCGVANSKALLTNVHGLN----LENWQEELAQ 527
G +G LE R+QS L + + + GV +S+ L + GL+ L + +E +
Sbjct: 466 KGNIGSRWLELFAREQSTLSARTGFEFVLAGVVDSRRSLLSYDGLDASRALAFFNDEAVE 525
Query: 528 AKEPFNLGRLIRLVKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMD 587
E L ++ + + V++D T+SQ +ADQY DF GFHV++ NK A S +
Sbjct: 526 QDEE----SLFLWMRAHPYDDLVVLDVTASQQLADQYLDFASHGFHVISANKLAGASDSN 581
Query: 588 YYHQLRYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKL 647
Y Q+ A EK+ R +LY+ VGAGLP+ +++L+++GD ++ SGI SG+LS++F +
Sbjct: 582 KYRQIHDAFEKTGRHWLYNATVGAGLPINHTVRDLIDSGDTILSISGIFSGTLSWLFLQF 641
Query: 648 DEGMSFSEATTLAREMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPA 707
D + F+E A + G TEPDPRDDLSG DV RKL+ILARE G +E + +E ++PA
Sbjct: 642 DGSVPFTELVDQAWQQGLTEPDPRDDLSGKDVMRKLVILAREAGYNIEPDQVRVESLVPA 701
Query: 708 EFNAEGDVAAFMANLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDP 767
G + F N +L++ R+ AR+ G VLRYV D +G RV + V + P
Sbjct: 702 HCEG-GSIDHFFENGDELNEQMVQRLEAAREMGLVLRYVARFDANGKARVGVEAVREDHP 760
Query: 768 LFKVKNGENALAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLR 812
L + +N A S +Y+ PLV+RG GAG DVTA + +D+ R
Sbjct: 761 LASLLPCDNVFAIESRWYRDNPLVIRGPGAGRDVTAGAIQSDINR 805
Score = 321 (118.1 bits), Expect = 3.6e-27, P = 3.6e-27
Identities = 108/406 (26%), Positives = 191/406 (47%)
Query: 5 KFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDALPNI 64
KFGG+S+A+ + +LRVA I+ ++ + V+SA TN L+ ++ + + + + +
Sbjct: 16 KFGGSSLADVKCYLRVAGIMAEYSQPDDMM-VVSAAGSTTNQLINWLKLSQTDRLSAHQV 74
Query: 65 SDAERIF-AELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLH-GISLLGQCPDSINAAL 122
R + +L++GL A+ L + FV + ++ +L GI+ D++ A +
Sbjct: 75 QQTLRRYQCDLISGLLPAEEADSL--ISAFVS-DLERLAALLDSGIN------DAVYAEV 125
Query: 123 ICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRI--P 180
+ GE S +M+ VL +G +D E L A + VD S + + P
Sbjct: 126 VGHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAE-RAAQPQVDEGLSYPLLQQLLVQHP 184
Query: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
+V+ GF + N GE V+LGRNGSDYSA + A IW+DV GVY+ DPR+V
Sbjct: 185 GKRLVV-TGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVYSADPRKV 243
Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
DA LL + EA EL+ A VLH RT+ P++ +I ++ + P T I
Sbjct: 244 KDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQGSTRIERVLA 303
Query: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
+ +++ +++ + P + + + RA++ + + + + F
Sbjct: 304 SGT-GARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHNDRQLLQF 362
Query: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGM 406
C A + + E GL L + + LA++++VG G+
Sbjct: 363 CYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGV 401
>gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [Escherichia
Score = 483 (175.1 bits), Expect = 2.8e-47, P = 2.8e-47
Identities = 149/467 (31%), Positives = 233/467 (49%)
Query: 3 VLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQ---- 58
V KFGGTSVA+ + R ADI+ S+A V VLSA A ITN LVA+ E G+
Sbjct: 6 VSKFGGTSVADFDAMNRSADIVLSDANVRLV--VLSASAGITNLLVALAEGLEPGERFEK 63
Query: 59 -DALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDS 117
DA+ NI FA +L L +P ++ +++ I VL + L P +
Sbjct: 64 LDAIRNIQ-----FA-ILERLR-----YPNV-IREEIERLLENIT-VLAEAAALATSP-A 109
Query: 118 INAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVG-HYLESTVDIAESTRRIAA 176
+ L+ GE MS + +L R D V K++ + + DIA A
Sbjct: 110 LTDELVSHGELMSTLLFVEILRERDVQAQWFD-VRKVMRTNDRFGRAEPDIAALAELAAL 168
Query: 177 SRIPA--DHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYT 234
+P + +V+ GF KG LGR GSDY+AA+LA L A +IWTDV G+YT
Sbjct: 169 QLLPRLNEGLVITQGFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVPGIYT 228
Query: 235 CDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTL 294
DPR V A+ + +++ EA E++ FGAKVLHP T+ P + IP + ++ +P+A GTL
Sbjct: 229 TDPRVVSAAKRIDEIAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRAGGTL 288
Query: 295 IGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSS 354
+ ++ E+ + ++ N + ++ M G A VF ++R ISV LIT +S
Sbjct: 289 V-CNKTENPPLFRALALRRNQTLLTLHSLNMLHSRGFLAEVFGILARHNISVDLIT--TS 345
Query: 355 EYSISFCVPQSDCVRA-ERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGIS 413
E S++ + + + + + +EL L + V E LA+++++G+ + G+
Sbjct: 346 EVSVALTLDTTGSTSTGDTLLTQSLLMELSA--LCRVEVEEGLALVALIGNDLSKACGVG 403
Query: 414 AKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLF 460
+ F L NI + I G+S ++ +V +DA V+ H LF
Sbjct: 404 KEVFGVLEPFNIRM--ICYGASSHNLCFLVPGEDAEQVVQKLHSNLF 448
>gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli]
Score = 97 (39.2 bits), Expect = 0.0010, P = 0.0010
Identities = 30/98 (30%), Positives = 45/98 (45%)
Query: 199 LVVLGRN-GSDYSAAVLAACLR-----ADCCEIWTDVDGVYTCDPRQVPDARLLKSMSYQ 252
+V+L G+ + AACLR AD T VDGV+T DP + P A + + ++Y
Sbjct: 131 VVILSAGTGNPFFTTDSAACLRGIEIEADVVLKATKVDGVFTADPAKDPTATMYEQLTYS 190
Query: 253 EAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290
E +E KV+ T ++P + N P A
Sbjct: 191 EVLEKEL---KVMDLAAFTLARDHKLPIRVFNMNKPGA 225
  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID  Matrix name    Lambda    K       H      Lambda    K       H
   +0      0    BLOSUM62       0.319   0.136   0.384    same    same    same
                Q=9,R=2        0.244   0.0300  0.180    n/a     n/a     n/a
  Frame  MatID  Length  Eff.Length    E      S   W   T   X    E2    S2
   +0      0    820     820           0.010  93  3   11  22   0.19  34
Database: /home/jes12/db/ecoli.aa
Posted: 2:52:35 PM EST Nov 18, 2001
Created: 9:46:47 AM EST Nov 18, 2001
# of letters in database: 1,358,990
# of sequences in database: 4289
# of database sequences satisfying E: 4
No. of states in DFA: 573 (61 KB)
Total size of DFA: 281 KB (1149 KB)
Time to generate neighborhood: 0.00u 0.02s 0.02t Elapsed: 00:00:00
No. of threads or processors used: 1
Search cpu time: 1.58u 0.00s 1.58t Elapsed: 00:00:01
Total cpu time: 1.59u 0.02s 1.61t Elapsed: 00:00:01
Start: Thu Dec 6 11:09:14 2001 End: Thu Dec 6 11:09:15 2001