1BLASTP 2.1.3 [Apr-11-2001] 2 3 4Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 5Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 6"Gapped BLAST and PSI-BLAST: a new generation of protein database search 7programs", Nucleic Acids Res. 25:3389-3402. 8 9Query= CATH_RAT 10 (333 letters) 11 12Database: /data_2/jason/blastdb/wormpep62 13 20,085 sequences; 8,813,425 total letters 14 15Searching..................................................done 16 17 Score E 18Sequences producing significant alignments: (bits) Value 19 20T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O4573... 196 2e-50 21F41E6.6 CE10254 cysteine protease and a protease inhibitor... 166 2e-41 22R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id... 162 3e-40 23R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 pr... 126 2e-29 24Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 pr... 123 1e-28 25 26>T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O45734 27 protein_id:CAB07275.1 28 Length = 337 29 30 Score = 196 bits (498), Expect = 2e-50 31 Identities = 122/318 (38%), Positives = 174/318 (54%), Gaps = 21/318 (6%) 32 33Query: 26 NAIEKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRNH-----TFKMGLNQF 80 34 +AIEK+ + + K YS E ++ F N I+ HN R+H TF+MGLN 35Sbjct: 27 SAIEKWD--DYKEDFDKEYSESEEQTYMEAFVKNMIHIENHN-RDHRLGRKTFEMGLNHI 83 36 37Query: 81 SDMSFAEIK----HKYLWSEPQNCSATKSNYLRGTG-PYPSSMDWRKKGNVVSPVKNQGA 135 38 +D+ F++ + ++ L+ + + S++L P +DWR ++V+ VKNQG 39Sbjct: 84 ADLPFSQYRKLNGYRRLFGDSR--IKNSSSFLAPFNVQVPDEVDWRDT-HLVTDVKNQGM 140 40 41Query: 136 CGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNK 195 42 CGSCW FS TGALE A G++++L+EQ LVDC+ + NHGC GGL QAFEYI N 43Sbjct: 141 CGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNH 200 44 45Query: 196 GIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEV-T 254 46 G+ E+SYPY G++ +C FN + A K V+ DE + AVA P+S A + 47Sbjct: 201 GVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGH 260 48 
49Query: 255 EDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYG-EQNGLLYWIVKXXXXXXXXXXXYFLI 313 50 F +YK GVY C + ++++H VL VGYG + YWIVK Y I 51Sbjct: 261 RSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIRI 318 52 53Query: 314 ERGK-NMCGLAACASYPI 330 54 R + N CG+A ASYP+ 55Sbjct: 319 ARNRNNHCGVATKASYPL 336 56 57 58>F41E6.6 CE10254 cysteine protease and a protease inhibitor (ST.LOUIS) 59 TR:O16454 protein_id:AAB65956.1 60 Length = 498 61 62 Score = 166 bits (419), Expect = 2e-41 63 Identities = 108/325 (33%), Positives = 155/325 (47%), Gaps = 35/325 (10%) 64 65Query: 33 FTSWMKQHQKTYSS-REYSHRLQVFANNWRKI-QAHNQRNHTFKMGLNQFSDMSFAEIKH 90 66 F ++ +H+K Y++ RE R +VF N + I + T G +FSDM+ E K 67Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKK 233 68 69Query: 91 ---KYLWSEP----QNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFS 143 70 Y W +P + + K + P S DWR+KG V+ VKNQG CGSCW FS 71Sbjct: 234 IMLPYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKG-AVTQVKNQGNCGSCWAFS 292 72 73Query: 144 TTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFE------------YI 191 74 TTG +E A IA K+++L+EQ+LVDC + + GC GGLPS A++ + 75Sbjct: 293 TTGNVEGAWFIAKNKLVSLSEQELVDC--DSMDQGCNGGLPSNAYKIGKFVVSDNYCFLV 350 76 77Query: 192 LYNK---------GIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVA 242 78 Y+K G+ ED+YPY G+ C + ++ V + +DE M + + 79Sbjct: 351 FYHKTTKEIIRMGGLEPEDAYPYDGRGETCHLVRKDIAVYINGSVELP-HDEVEMQKWLV 409 80 81Query: 243 LYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKXXXX 302 82 P+S Y+ GV P +NH VL VGYG+ YWIVK 83Sbjct: 410 TKGPISIGLN-ANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWG 468 84 85Query: 303 XXXXXXXYFLIERGKNMCGLAACAS 327 86 YF + RGKN+CG+ A+ 87Sbjct: 469 PNWGEAGYFKLYRGKNVCGVQEMAT 493 88 89 90>R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id:AAC69091.2 91 Length = 383 92 93 Score = 162 bits (410), Expect = 3e-40 94 Identities = 97/304 (31%), Positives = 157/304 (50%), Gaps = 19/304 (6%) 95 96Query: 37 MKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEIKH------ 90 97 +K +K S E+ +R Q+F N + +A +RN + 
+N+F+D + E++ 98Sbjct: 87 LKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQKMVQENK 146 99 100Query: 91 --KYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGAL 148 101 KY + P+ + +YL P+S+DWR++G + +P+KNQG CGSCW F+T ++ 102Sbjct: 147 YTKYDFDTPK----FEGSYLETGVIRPASIDWREQGKL-TPIKNQGQCGSCWAFATVASV 201 103 104Query: 149 ESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIG- 207 105 E+ AI GK+++L+EQ++VDC + N+GC GG A +++ N G+ E YPY 106Sbjct: 202 EAQNAIKKGKLVSLSEQEMVDC--DGRNNGCSGGYRPYAMKFVKEN-GLESEKEYPYSAL 258 107 108Query: 208 KNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSS 267 109 K+ QC F+ + + N+E + V PV+F V + Y+SG+++ 110Sbjct: 259 KHDQCFLKENDTRVFIDD-FRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRSGIFNP 317 111 112Query: 268 NSCHKTPDKVN-HAVLAVGYGEQNGLLYWIVKXXXXXXXXXXXYFLIERGKNMCGLAACA 326 113 + T + HA+ +GYG + YWIVK YF + RG N CGLA 114Sbjct: 318 SVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARGVNSCGLANTV 377 115 116Query: 327 SYPI 330 117 PI 118Sbjct: 378 VAPI 381 119 120 121>R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 122 protein_id:CAA89070.1 123 Length = 402 124 125 Score = 126 bits (316), Expect = 2e-29 126 Identities = 97/325 (29%), Positives = 152/325 (45%), Gaps = 31/325 (9%) 127 128Query: 20 TAELTVNAIEKFHFTSWMKQHQKTYS-SREYSHRLQVFANNWRKIQAHNQRNH--TFKMG 76 129 T E + I K + ++ ++ K+Y+ S+E RL + N I N +N + + G 130Sbjct: 78 TNERGIQNIAK-EYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYG 136 131 132Query: 77 LNQFSDMSFAEIK--------HKYLWSE-------PQNCSATKSNYLRGTGPYPSSMDWR 121 133 N SD + E + +K L E P++ +A K + P+P DWR 134Sbjct: 137 HNDMSDWTDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGE---SSSPFPDFFDWR 193 135 136Query: 122 KKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQG 181 137 K NV++PVK QG CGSCW F++T +E+A AIA G+ L+EQ L+DC + ++ C G 138Sbjct: 194 DK-NVITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDC--DLVDNACDG 250 139 140Query: 182 GLPSQAFEYILYNKGIMGEDSYPYIG-KNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEA 240 141 G +AF YI + G+ PY+ + C N +K +DE +++ 142Sbjct: 251 
GDEDKAFRYI-HRNGLANAVDLPYVAHRQNGCAVNDHWNTTRIK-AAYFLHHDEDSIINW 308 143 144Query: 241 VALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVN-HAVLAVGYG-EQNGLLYWIVK 298 145 + + PV+ V + YK GV++ + + + HA+L GYG + G YWIVK 146Sbjct: 309 LVNFGPVNIGMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVK 368 147 148Query: 299 XX-XXXXXXXXXYFLIERGKNMCGL 322 149 Y RG N CG+ 150Sbjct: 369 NSWGNTWGVEHGYIYFARGINACGI 393 151 152 153>Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 154 protein_id:CAA22062.1 155 Length = 343 156 157 Score = 123 bits (309), Expect = 1e-28 158 Identities = 92/304 (30%), Positives = 145/304 (47%), Gaps = 26/304 (8%) 159 160Query: 26 NAIEKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRNH---TFKMGLNQFSD 82 161 NA + F +++++ Y E R +F+ N ++ +N+ + T++ LN FSD 162Sbjct: 49 NAFQNF-LVKYLREYPNEY---EIVKRFTIFSRNLDLVERYNKEDAGKVTYE--LNDFSD 102 163 164Query: 83 MSFAEIKHKYLWSEPQNCSAT-KSNYLRGTGPYPSSMDWRKKG--NVVSPVKNQGACGSC 139 165 ++ E K + +P + + K L P+S+DWR N V+ +K QG CGSC 166Sbjct: 103 LTEEEWKKYLMTPKPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSC 162 167 168Query: 140 WTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMG 199 169 W F+T A+ESAV+I+ G + +L+ QQL+DC + C GG P +A +Y + GI 170Sbjct: 163 WAFATAAAIESAVSISGGGLQSLSSQQLLDC--TVVSDKCGGGEPVEALKY-AQSHGITT 219 171 172Query: 200 EDSYPYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNP-VSFAFEVTEDFM 258 173 +YPY +C+ VA + + + DE M + VAL P + A T 174Sbjct: 220 AHNYPYYFWTTKCR-ETVPTVARISSWMKAESEDE--MAQIVALNGPMIVCANFATNKNR 276 175 176Query: 259 MYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKXXXXXXXXXXXYFLIERGKN 318 177 Y SG+ C P HA++ +GYG YWI+K Y ++R N 178Sbjct: 277 FYHSGIAEDPDCGTEP---THALIVIGYGPD----YWILKNTYSKVWGEKGYMRVKRDVN 329 179 180Query: 319 MCGL 322 181 CG+ 182Sbjct: 330 WCGI 333 183 184 185>Y113G7B.15 CE23295 (HINXTON) TR:Q9U2X1 protein_id:CAB54334.1 186 Length = 328 187 188 Score = 120 bits (302), Expect = 9e-28 189 Identities = 94/317 (29%), Positives = 140/317 (43%), Gaps = 40/317 (12%) 190 191Query: 40 
HQKTYSS-REYSHRLQVFANNWRKIQAHNQ------RNHTFKMGLNQFSDMSFAEIKHKY 92 192 H+K Y + E RL FA N +KIQ N RN TF G N+F+D + E+ + 193Sbjct: 3 HKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTF--GWNKFADKNRQELSARN 60 194 195Query: 93 LWSEPQNCSAT---KSNYLRGTGPYPSSMDWRKKGN----------------VVSPVKNQ 133 196 P+N + K + RG+ + + R+ G+ VV PVK+Q 197Sbjct: 61 SKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKDQ 120 198 199Query: 134 GACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILY 193 200 CG CW F+TT E+A + S +L++Q++ DCA + + GC GG P + +++ 201Sbjct: 121 EQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVH 179 202 203Query: 194 NKGIMGEDSYPY----IGKNGQCKFNPEKAVAFVKNVVNITLND-----EAAMVEAVALY 244 204 +G + YPY G C EK+ +N+ D E M + 205Sbjct: 180 LRGQSSDGDYPYEEYRANTTGNC-VGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNH 238 206 207Query: 245 NPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYG-EQNGLLYWIVKXXXXX 303 208 P + F V E+F Y SGV S C++ H+V VGYG +G+ YW+V+ 209Sbjct: 239 IPTAVYFRVGENFEWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNS 298 210 211Query: 304 XXXXXXYFLIERGKNMC 320 212 Y I RG N C 213Sbjct: 299 DWGLHGYVKIRRGVNWC 315 214 215 216>K02E7.10 CE11640 protease (ST.LOUIS) TR:O17255 protein_id:AAB71030.1 217 Length = 299 218 219 Score = 114 bits (284), Expect = 1e-25 220 Identities = 70/216 (32%), Positives = 107/216 (49%), Gaps = 8/216 (3%) 221 222Query: 118 MDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIAS-GKMMTLAEQQLVDCAQNFNN 176 223 +DWR+KG +V PVK+QG C + + F+ A+ES A A+ GK+++ +EQQ++DCA NF N 224Sbjct: 84 LDWREKG-IVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCA-NFTN 141 225 226Query: 177 HGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKN--GQCKFNPEKAVAFVKNVVNITLNDE 234 227 CQ L + L G+ E YPY+GK G+C+++ K + +++ N+E 228Sbjct: 142 P-CQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSK-MKLRPTYIDVYPNEE 199 229 230Query: 235 AAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLY 294 231 A + + F F YK+G+Y+ ++ VGYG+ Y 232Sbjct: 200 WARAH-ITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAEKY 258 233 234Query: 295 WIVKXXXXXXXXXXXYFLIERGKNMCGLAACASYPI 
330 235 WIVK Y + R N CG+A S PI 236Sbjct: 259 WIVKGSFGTSWGEHGYMKLARNVNACGMAESISIPI 294 237 238 239>C32B5.7 CE08515 cathepsin-like peptidase (ST.LOUIS) TR:P91111 240 protein_id:AAB37963.1 241 Length = 250 242 243 Score = 108 bits (270), Expect = 5e-24 244 Identities = 69/197 (35%), Positives = 104/197 (52%), Gaps = 18/197 (9%) 245 246Query: 106 NYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQ 165 247 NY P+ +DWR +G VV PVK+QG C + + F+ A+ES AIA+G++++ +EQ 248Sbjct: 63 NYKNAKKPF---LDWRDEG-VVGPVKDQGNCNASYAFAAISAIESMYAIANGQLLSFSEQ 118 249 250Query: 166 QLVDCAQNFNNHGCQ-GGLPSQAFEYILYNKGIMGEDSYPYIG-KNGQCKFNPEKAVAFV 223 251 Q++DC GC P A Y L KGI YP++G KN +C+++ +KA + 252Sbjct: 119 QIIDCL-----GGCAIESDPMMAMTY-LERKGIETYTDYPFVGKKNEKCEYDSKKAYLIL 172 253 254Query: 224 KNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVY--SSNSCHKTPDKVNHAV 281 255 + + ++DE+ + + P F F YKSG+Y + C T +K A+ 256Sbjct: 173 DDTYD--MSDESLALVFIDERGPGLFTMNTPPSFFNYKSGIYNPTEEECKSTNEK--RAL 228 257 258Query: 282 LAVGYGEQNGLLYWIVK 298 259 VGYG G YWIVK 260Sbjct: 229 TIVGYGNDKGQNYWIVK 245 261 262 263>Y51A2D.8 CE19204 Cysteine proteases (2 domains) (HINXTON) TR:Q9XXQ7 264 protein_id:CAA16407.1 265 Length = 386 266 267 Score = 106 bits (265), Expect = 2e-23 268 Identities = 82/330 (24%), Positives = 137/330 (40%), Gaps = 44/330 (13%) 269 270Query: 33 FTSWMKQHQKTYSSR-EYSHRLQVFANNWRKIQAHNQRN----HTFKMGLNQFSDMSFAE 87 271 F + K++ + Y E R F ++ + N ++ + + G+N+FSD+S AE 272Sbjct: 43 FEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAE 102 273 274Query: 88 IKHKYLWSEPQN------------------CSATKSNYLRGTGPYPSSMDWRKKG----N 125 275 + P N K+ + R + YP D R + 276Sbjct: 103 FHGRLSNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRSTRYPDYFDLRNEKINGRY 162 277 278Query: 126 VVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPS 185 279 +V P+K+QG C CW F+ T +E+ A SGK +L++Q++ DC GC+GG + 280Sbjct: 163 IVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTE-GTPGCKGGSLT 221 281 282Query: 186 QAFEYILYNKGIMGEDSYPY----IGKNGQCKFNPEKAV----AFVKNVVNITLNDEAAM 237 283 
+Y+ G+ G++ YPY + +C+ + AF V+N +E + 284Sbjct: 222 LGVQYV-KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQII 280 285 286Query: 238 VEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYG---EQNGLL- 293 287 PV+ F+V + F YK GV + C + HA VGY + G 288Sbjct: 281 QVLTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQW--HAGAIVGYDTVEDSRGRSH 338 289 290Query: 294 -YWIVKXXXXXXXXXXXYFLIERGKNMCGL 322 291 YWI+K Y + RG++ C + 292Sbjct: 339 DYWIIKNSWGGDWAESGYVRVVRGRDWCSI 368 293 294 295>Y71H2AR.2 CE22930 (ST.LOUIS) protein_id:AAK29985.1 296 Length = 345 297 298 Score = 103 bits (257), Expect = 1e-22 299 Identities = 72/235 (30%), Positives = 105/235 (44%), Gaps = 16/235 (6%) 300 301Query: 91 KYLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALES 150 302 ++ W P + T +L DWR+KG +V PVK+QG C + F+ T ++ES 303Sbjct: 69 RFQWETPIHMDRTTEEFL----------DWREKG-IVGPVKDQGKCNASHAFAITSSIES 117 304 305Query: 151 AVAIAS-GKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGK- 208 306 A A+ G +++ +EQQL+DC GC+ A Y L GI E YPY+ K 307Sbjct: 118 MYAKATNGTLLSFSEQQLIDCNDQ-GYKGCEEQFAMNAIGY-LATHGIETEADYPYVDKT 175 308 309Query: 209 NGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSN 268 310 N +C F+ K+ +K V N+ V V Y P F YK G+Y+ + 311Sbjct: 176 NEKCTFDSTKSKIHLKKGVVAEGNEVLGKVY-VTNYGPAFFTMRAPPSLYDYKIGIYNPS 234 312 313Query: 269 SCHKTPDKVNHAVLAVGYGEQNGLLYWIVKXXXXXXXXXXXYFLIERGKNMCGLA 323 314 T +++ VGYG + YWIVK Y + R N C +A 315Sbjct: 235 IEECTSTHEIRSMVIVGYGIEGEQKYWIVKGSFGTSWGEQGYMKLARDVNACAMA 289 316 317 318>C50F4.3 CE05468 thiol protease (HINXTON) TR:Q18740 protein_id:CAA94738.1 319 Length = 374 320 321 Score = 102 bits (254), Expect = 3e-22 322 Identities = 86/319 (26%), Positives = 129/319 (39%), Gaps = 31/319 (9%) 323 324Query: 33 FTSWMKQHQKTYSSR-EYSHRLQVFANNWRKI----QAHNQRNHTFKMGLNQFSDMSFAE 87 325 F ++ ++++ Y E R Q F ++ +A + H K G+N+FSD+S E 326Sbjct: 47 FEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSKKE 106 327 328Query: 88 IKHKYLWSEP--QNCSATKSNYL-----RGTGPYPSSMDWRKKG----NVVSPVKNQGAC 136 329 I Y P N + K N R P + D R K 
++ P+K Q +C 330Sbjct: 107 IHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKTQDSC 166 331 332Query: 137 GSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKG 196 333 CW F+ T E+A+ + K M L+EQ++ DCA + GC GG P EYI G 334Sbjct: 167 ACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPK-HGPGCNGGDPVDGLEYI-KEMG 224 335 336Query: 197 IMGEDSYPY-------IGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYN-PVS 248 337 + G YP+ +G+ K++ E + N E M + L N P+S 338Sbjct: 225 LTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLNLPIS 284 339 340Query: 249 FAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNG-----LLYWIVKXXXXX 303 341 AF Y SG+ C H+ VGYG + YWI + 342Sbjct: 285 VAFRTGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWT 344 343 344Query: 304 XXXXXXYFLIERGKNMCGL 322 345 Y I RG++ C + 346Sbjct: 345 DWGDDGYARIVRGEDWCSI 363 347 348 349>F26E4.3 CE17714 cysteine protease (HINXTON) TR:P90850 350 protein_id:CAB03007.1 351 Length = 491 352 353 Score = 87.4 bits (215), Expect = 1e-17 354 Identities = 70/237 (29%), Positives = 102/237 (42%), Gaps = 33/237 (13%) 355 356Query: 115 PSSMDWRKK-GNVVSPVKNQGACGSCWTFSTTGALESAVAIAS-GKM-MTLAEQQLVDCA 171 357 P D R K G ++ PV +QG CGS W+ STT +AI S G++ TL+ QQL+ C 358Sbjct: 224 PEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCN 283 359 360Query: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG-------------------QC 212 361 Q+ GC+GG +A+ YI G++G+ YPY+ +C 362Sbjct: 284 QH-RQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTNRQGLRC 341 363 364Query: 213 KFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVY--SSNSC 270 365 + + AF + E + + PV F V EDF MY GVY S + 366Sbjct: 342 PSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAA 401 367 368Query: 271 HKTPDKV---NHAVLAVGYGEQNG----LLYWIVKXXXXXXXXXXXYFLIERGKNMC 320 369 K V H+V +G+G + + YW+ YF + RG+N C 370Sbjct: 402 QKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHC 458 371 372 373>C52E4.1 CE08943 locus:cpr-1 cathepsin-like cysteine protease (HINXTON) 374 TR:Q18783 protein_id:CAB01410.1 375 Length = 340 376 377 Score = 85.1 
bits (209), Expect = 5e-17 378 Identities = 67/269 (24%), Positives = 111/269 (40%), Gaps = 33/269 (12%) 379 380Query: 82 DMSFAEIKHKYLWSEPQNCSATKSNYLRGTGP--YPSSMDWRKKGNVVSPVKNQGACGSC 139 381 +M F + KY + AT+ + + P + S W + ++ +++Q CGSC 382Sbjct: 66 EMKFKLMDGKYAAAHSDEIRATEQEVVLASVPATFDSRTQWSECKSI-KLIRDQATCGSC 124 383 384Query: 140 WTFSTTGALESAVAIAS--GKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGI 197 385 W F + I + + ++ L+ C + +GC+GG P QA + +KG+ 386Sbjct: 125 WAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRW-WDSKGV 183 387 388Query: 198 MGEDSYPYIG-----------------KNGQCKFNPEK--AVAFVKN----VVNITLNDE 234 389 + Y G K C + + + A+ K+ V + 390Sbjct: 184 VTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKN 243 391 392Query: 235 AAMVEAVALYN-PVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL 293 393 AA ++A N PV AF V EDF YKSGVY + HA+ +G+G ++G 394Sbjct: 244 AASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLG---GHAIKIIGWGTESGSP 300 395 396Query: 294 YWIVKXXXXXXXXXXXYFLIERGKNMCGL 322 397 YW+V +F I RG + CG+ 398Sbjct: 301 YWLVANSWGVNWGESGFFKIYRGDDQCGI 329 399 400 401>M04G12.2 CE12424 cysteine protease (HINXTON) TR:P92005 402 protein_id:CAB03209.1 403 Length = 467 404 405 Score = 75.1 bits (183), Expect = 6e-14 406 Identities = 68/224 (30%), Positives = 100/224 (44%), Gaps = 44/224 (19%) 407 408Query: 101 SATKSNYLRGTGPYPSSMDWRKKG--NVVSPVKNQGA---CGSCWTFSTTGALESAVAIA 155 409 S+ KSN L P+ DWR N SP +NQ CGSCW F TTGAL +A 410Sbjct: 214 SSFKSNDL------PTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVA 267 411 412Query: 156 -SGK--MMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQC 212 413 G+ M L+ Q+++DC N CQGG E+ +G++ E Y NG+C 414Sbjct: 268 RKGRWPMTQLSPQEIIDCNGKGN---CQGGEIGNVLEHAKI-QGLVEEGCNVYRATNGEC 323 415 416Query: 213 KFNPEKAVA----------------FVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256 417 NP +VK+ + D+ ++ + P++ A T+ 418Sbjct: 324 --NPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDK--IMSEIKKGGPIACAIGATKK 379 419 420Query: 257 F-MMYKSGVYSSNSCHKTPDKVNHAVLAVGYG-EQNGLLYWIVK 298 421 F Y GVYS K+ + NH + G+G ++NG+ YWI + 
422Sbjct: 380 FEYEYVKGVYS----EKSDLESNHIISLTGWGVDENGVEYWIAR 419 423 424 425>F15D4.4 CE28917 cysteine protease (HINXTON) TR:Q93512 426 protein_id:CAB02487.1 427 Length = 622 428 429 Score = 75.1 bits (183), Expect = 6e-14 430 Identities = 85/332 (25%), Positives = 130/332 (38%), Gaps = 51/332 (15%) 431 432Query: 25 VNAIEKFH--------FTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRNH----T 72 433 ++ +EKF+ F S M +++E R V++ +++ HN + 434Sbjct: 119 LSPLEKFNEAMNNDGAFKSLMDVINFNSTAKEGLKRFNVYSKVKKEVDEHNIMYELGMSS 178 435 436Query: 73 FKMGLNQFSDMSFAEIKHKYLWSEPQNCSAT------KSNYLRGTGPYPSSMDWRKKGNV 126 437 +KM NQFS E+ L + +AT S R T P ++DWR 438Sbjct: 179 YKMSTNQFSVALDGEVAPLTLNLDALTPTATVIPATISSRKKRDTEP---TVDWRP---F 232 439 440Query: 127 VSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQL------VDCAQNFNNHGCQ 180 441 + P+ +Q CG CW FS +ES AI +L+ QQL VD N GC+ 442Sbjct: 233 LKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQLLTCDTKVDSTYGLANVGCK 292 443 444Query: 181 GGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCK---FNPEKAVAFVKNVVNITLNDEAAM 237 445 GG A Y L P+ ++ C F P + + I+ N AA 446Sbjct: 293 GGYFQIAGSY-LEVSAARDASLIPFDLEDTSCDSSFFPPVVPTILLFDDGYISGNFTAAQ 351 447 448Query: 238 -------VEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQN 290 449 +E P++ D Y GVY + C +NHAV+ VG+ + 450Sbjct: 352 LITMEQNIEDKVRKGPIAVGMAAGPDIYKYSEGVYDGD-CGTI---INHAVVIVGFTDD- 406 451 452Query: 291 GLLYWIVKXXXXXXXXXXXYFLIER--GKNMC 320 453 YWI++ YF ++R GK+ C 454Sbjct: 407 ---YWIIRNSWGASWGEAGYFRVKRTPGKDPC 435 455 456 457>Y71H2AM.3 CE26272 (ST.LOUIS) protein_id:AAK29976.1 458 Length = 716 459 460 Score = 73.9 bits (180), Expect = 1e-13 461 Identities = 52/176 (29%), Positives = 77/176 (43%), Gaps = 32/176 (18%) 462 463Query: 92 YLWSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESA 151 464 + W P+ T +L DWR KG +V PVK+QG C + F+ + ++ES 465Sbjct: 70 FQWKTPKYTIQTTEEFL----------DWRDKG-IVGPVKDQGKCNASHAFAISSSIESM 118 466 467Query: 152 VAIA-SGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG 210 468 A A +G +++ +EQQL+DC + GC+ A Y +++ GI E YPY GK 469Sbjct: 119 
YAKATNGSLLSFSEQQLIDC-DDHGFKGCEEQPAINAVSYFIFH-GIETEADYPYAGKE- 175 470 471Query: 211 QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYS 266 472 N L++E E V Y P F YK G+Y+ 473Sbjct: 176 -----------------NGKLSNETQGKELVTNYGPAFFTMRAPPSLYDYKIGIYN 214 474 475 476>C32B5.13 CE08521 (ST.LOUIS) TR:P91110 protein_id:AAB37968.1 477 Length = 150 478 479 Score = 71.6 bits (174), Expect = 6e-13 480 Identities = 45/143 (31%), Positives = 73/143 (50%), Gaps = 10/143 (6%) 481 482Query: 159 MMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGK-NGQCKFNPE 217 483 +++ +EQQ++DC NF + CQ + S F + G++ E YPY+GK N +CK++ 484Sbjct: 10 VLSFSEQQIIDCG-NFTSP-CQENILSHEF---IKKNGVVTEADYPYVGKENEKCKYDEN 64 485 486Query: 218 KAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYS--SNSCHKTPD 275 487 K + N++ + E + + + P F + F YK+G+YS C K D 488Sbjct: 65 KIKLWPTNMLLVGNLPETLLKLFIKEHGPGYFRMKAPPSFFNYKTGIYSPTQEECGKATD 124 489 490Query: 276 KVNHAVLAVGYGEQNGLLYWIVK 298 491 ++ VGYG + G YWIVK 492Sbjct: 125 A--RSLTIVGYGIEGGQNYWIVK 145 493 494 495 Database: /data_2/jason/blastdb/wormpep62 496 Posted date: Sep 3, 2001 2:17 PM 497 Number of letters in database: 8,813,425 498 Number of sequences in database: 20,085 499 500Lambda K H 501 0.319 0.131 0.412 502 503Gapped 504Lambda K H 505 0.267 0.0410 0.140 506 507 508Matrix: BLOSUM62 509Gap Penalties: Existence: 11, Extension: 1 510Number of Hits to DB: 5933049 511Number of Sequences: 20085 512Number of extensions: 243404 513Number of successful extensions: 614 514Number of sequences better than 1.0e-10: 17 515Number of HSP's better than 0.0 without gapping: 1 516Number of HSP's successfully gapped in prelim test: 16 517Number of HSP's that attempted gapping in prelim test: 568 518Number of HSP's gapped (non-prelim): 17 519length of query: 333 520length of database: 8,813,425 521effective HSP length: 46 522effective length of query: 287 523effective length of database: 7,889,515 524effective search space: 2264290805 525effective search space used: 2264290805 526T: 11 527A: 40 
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 155 (64.3 bits)
BLASTP 2.1.3 [Apr-11-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= CATL_HUMAN
         (333 letters)

Database: /data_2/jason/blastdb/wormpep62
           20,085 sequences; 8,813,425 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O4573...   334  4e-92
F41E6.6 CE10254 cysteine protease and a protease inhibitor...   194  6e-50
R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id...   176  2e-44
Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 pr...   133  1e-31
R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 pr...
130 1e-30 557 558>T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O45734 559 protein_id:CAB07275.1 560 Length = 337 561 562 Score = 334 bits (857), Expect = 4e-92 563 Identities = 164/341 (48%), Positives = 228/341 (66%), Gaps = 12/341 (3%) 564 565Query: 1 MNPTLILAAFCLGIASATLTFDHSLEA---QWTKWKAMHNRLYGMNEEGWRRAVWEKNMK 57 566 MN ++LA +A + +E+ +W +K ++ Y +EE + KNM 567Sbjct: 1 MNRFILLALVAAVVAVNSAKLSRQIESAIEKWDDYKEDFDKEYSESEEQTYMEAFVKNMI 60 568 569Query: 58 MIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ----NRKPRKGKVFQEPLFYE 113 570 IE HN+++R G+ +F M +N D+ ++R+ +NG++ + + + F P + 571Sbjct: 61 HIENHNRDHRLGRKTFEMGLNHIADLPFSQYRK-LNGYRRLFGDSRIKNSSSFLAPFNVQ 119 572 573Query: 114 APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ 173 574 P VDWR+ VT VKNQG CGSCWAFSATGALEGQ RK G+L+SLSEQNLVDCS 575Sbjct: 120 VPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY 179 576 577Query: 174 GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QE 232 578 GN GCNGGLMD AF+Y++DN G+D+EESYPY+ + C +N K A+D G+VD P+ E 579Sbjct: 180 GNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDE 239 580 581Query: 233 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDN 292 582 + L AVAT GPIS+AIDAGH SF YK+G+Y++ +CSSE++DHGVL+VGYG T+ ++ 583Sbjct: 240 EQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYG---TDPEH 296 584 585Query: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333 586 YW+VKNSWG WG GY+++A++R NHCG+A+ ASYP V 587Sbjct: 297 GDYWIVKNSWGAGWGEKGYIRIARNRNNHCGVATKASYPLV 337 588 589 590>F41E6.6 CE10254 cysteine protease and a protease inhibitor (ST.LOUIS) 591 TR:O16454 protein_id:AAB65956.1 592 Length = 498 593 594 Score = 194 bits (493), Expect = 6e-50 595 Identities = 124/330 (37%), Positives = 171/330 (51%), Gaps = 53/330 (16%) 596 597Query: 36 HNRLYGMNEEGWRR-AVWEKNMKMI-ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMN 93 598 H + Y E +R V++KN K+I EL E + FT F DMT+ EF+++M 599Sbjct: 181 HEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTK----FSDMTTMEFKKIML 236 600 601Query: 94 
GFQNRKP-----------RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142 602 +Q +P + +E L P S DWREKG VT VKNQG CGSCWAFS 603Sbjct: 237 PYQWEQPVYPMEQANFEKHDVTINEEDL----PESFDWREKGAVTQVKNQGNCGSCWAFS 292 604 605Query: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQ----YVQDN----- 193 606 TG +EG F +L+SLSEQ LVDC ++GCNGGL A++ V DN 607Sbjct: 293 TTGNVEGAWFIAKNKLVSLSEQELVDCDSM--DQGCNGGLPSNAYKIGKFVVSDNYCFLV 350 608 609Query: 194 ------------GGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVAT 241 610 GGL+ E++YPY+ E+C K G V++P E + K + T 611Sbjct: 351 FYHKTTKEIIRMGGLEPEDAYPYDGRGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVT 410 612 613Query: 242 VGPISVAIDAGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVK 299 614 GPIS+ ++A + FY+ G+ F+ C ++HGVL+VGYG + YW+VK 615Sbjct: 411 KGPISIGLNA--NTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYG----KDGRKPYWIVK 464 616 617Query: 300 NSWGEEWGMGGYVKMAKDRRNHCGIASAAS 329 618 NSWG WG GY K+ + +N CG+ A+ 619Sbjct: 465 NSWGPNWGEAGYFKLYRG-KNVCGVQEMAT 493 620 621 622>R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id:AAC69091.2 623 Length = 383 624 625 Score = 176 bits (446), Expect = 2e-44 626 Identities = 113/309 (36%), Positives = 171/309 (54%), Gaps = 39/309 (12%) 627 628Query: 42 MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR 101 629 + E +R ++ +N+ IE +E R + +N F D T EE ++++ Q K 630Sbjct: 96 VEEFEYRYQIFLRNV--IEFEAEEERN--LGLDLDVNEFTDWTDEELQKMV---QENKYT 148 631 632Query: 102 KGKVFQEPLFYEA--------PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR 153 633 K F P F + P S+DWRE+G +TP+KNQGQCGSCWAF+ ++E Q 634Sbjct: 149 KYD-FDTPKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAI 207 635 636Query: 154 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY 213 637 K G+L+SLSEQ +VDC G N GC+GG YA ++V++N GL+SE+ YPY A K+ 638Sbjct: 208 KKGKLVSLSEQEMVDCDG--RNNGCSGGYRPYAMKFVKEN-GLESEKEYPYSA----LKH 260 639 640Query: 214 NPKYSVANDTG-FVD----IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEP- 267 641 + + NDT F+D + E+ + V T GP++ ++ ++ Y+ GI F P 642Sbjct: 261 
DQCFLKENDTRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNV-VKAMYSYRSGI-FNPS 318 643 644Query: 268 --DCSSEDMD-HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 324 645 DC+ + M H + ++GYG E + YW+VKNSWG WG GY ++A+ N CG+ 646Sbjct: 319 VEDCTEKSMGAHALTIIGYGGEG----ESAYWIVKNSWGTSWGASGYFRLARG-VNSCGL 373 647 648Query: 325 ASAASYPTV 333 649 A+ P + 650Sbjct: 374 ANTVVAPII 382 651 652 653>Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 654 protein_id:CAA22062.1 655 Length = 343 656 657 Score = 133 bits (335), Expect = 1e-31 658 Identities = 91/284 (32%), Positives = 146/284 (51%), Gaps = 28/284 (9%) 659 660Query: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107 661 R ++ +N+ ++E +N+E GK T +N F D+T EE+++ + KP + 662Sbjct: 71 RFTIFSRNLDLVERYNKE-DAGK--VTYELNDFSDLTEEEWKKYL---MTPKPDHSEKSL 124 663 664Query: 108 EPLFY----EAPRSVDWRE---KGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS 160 665 +P P SVDWR +VT +K QG CGSCWAF+ A+E + G L S 666Sbjct: 125 KPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQS 184 667 668Query: 161 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA 220 669 LS Q L+DC+ ++ C GG A +Y Q + G+ + +YPY C+ +VA 670Sbjct: 185 LSSQQLLDCT--VVSDKCGGGEPVEALKYAQSH-GITTAHNYPYYFWTTKCRETVP-TVA 240 671 672Query: 221 NDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLV 280 673 + ++ + E + + VA GP+ V + FY GI +PDC +E H ++V 674Sbjct: 241 RISSWMK-AESEDEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPDCGTEP-THALIV 298 675 676Query: 281 VGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 324 677 +GYG + YW++KN++ + WG GY+++ +D N CGI 678Sbjct: 299 IGYGPD--------YWILKNTYSKVWGEKGYMRVKRD-VNWCGI 333 679 680 681>R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 682 protein_id:CAA89070.1 683 Length = 402 684 685 Score = 130 bits (327), Expect = 1e-30 686 Identities = 89/265 (33%), Positives = 130/265 (48%), Gaps = 27/265 (10%) 687 688Query: 78 NAFGDMTSEEFRQVM--NGFQNRKPRKGKVFQEPLFYEA-----------PRSVDWREKG 124 689 N D T EEF + + F R ++ + F EP+ P DWR+K 690Sbjct: 138 
NDMSDWTDEEFEKTLLPKSFYKRLHKEAE-FIEPIPESLTAKKGESSSPFPDFFDWRDKN 196 691 692Query: 125 YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMD 184 693 +TPVK QGQCGSCWAF++T +E G +LSEQ L+DC + C+GG D 694Sbjct: 197 VITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCD--LVDNACDGGDED 254 695 696Query: 185 YAFQYVQDNGGLDSEESYPYEA-TEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVG 243 697 AF+Y+ N GL + PY A + C N ++ + E +++ + G 698Sbjct: 255 KAFRYIHRN-GLANAVDLPYVAHRQNGCAVNDHWNTTRIKAAYFLHHDEDSIINWLVNFG 313 699 700Query: 244 PISVAIDAGHESFLFYKEGIY--FEPDCSSEDMD-HGVLVVGYGFESTESDNNKYWLVKN 300 701 P+++ + A + YK G++ E C +E + H +L+ GYG T KYW+VKN 702Sbjct: 314 PVNIGM-AVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYG---TSKTGEKYWIVKN 369 703 704Query: 301 SWGEEWGM-GGYVKMAKDRRNHCGI 324 705 SWG WG+ GY+ A+ N CGI 706Sbjct: 370 SWGNTWGVEHGYIYFARG-INACGI 393 707 708 709>Y51A2D.8 CE19204 Cysteine proteases (2 domains) (HINXTON) TR:Q9XXQ7 710 protein_id:CAA16407.1 711 Length = 386 712 713 Score = 123 bits (308), Expect = 2e-28 714 Identities = 87/322 (27%), Positives = 145/322 (45%), Gaps = 39/322 (12%) 715 716Query: 32 WKAMHNRLYGMNEEGWRRAV-WEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ 90 717 +K +NR Y E +R + K+ ++ N + + + +N F D+++ EF 718Sbjct: 46 FKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEFHG 105 719 720Query: 91 VMNG-------------FQNRKP-----------RKGKVFQEPLFYEAPRSVDWREKGYV 126 721 ++ F +KP K + + P +++ R+ + V 722Sbjct: 106 RLSNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRSTRYPDYFDL-RNEKINGRYIV 164 723 724Query: 127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYA 186 725 P+K+QGQC CW F+ T +E +G+ SLS+Q + DC G +G GC GG + 726Sbjct: 165 GPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDC-GTEGTPGCKGGSLTLG 223 727 728Query: 187 FQYVQDNGGLDSEESYPYEATEES----CKYNPKYSVANDTGF---VDIPKQEKALMKAV 239 729 QYV+ GL +E YPY+ + C+ + F V P++ + + V 730Sbjct: 224 VQYVK-KYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQV 282 731 732Query: 240 ATVG--PISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG-FESTESDNNKYW 296 733 T P++V G + F YKEG+ E DC H +VGY E + ++ 
YW 734Sbjct: 283 LTEWKVPVAVYFKVG-DQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYW 341 735 736Query: 297 LVKNSWGEEWGMGGYVKMAKDR 318 737 ++KNSWG +W GYV++ + R 738Sbjct: 342 IIKNSWGGDWAESGYVRVVRGR 363 739 740 741>K02E7.10 CE11640 protease (ST.LOUIS) TR:O17255 protein_id:AAB71030.1 742 Length = 299 743 744 Score = 119 bits (298), Expect = 3e-27 745 Identities = 73/219 (33%), Positives = 112/219 (50%), Gaps = 14/219 (6%) 746 747Query: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR-KTGRLISLSEQNLVDCSGPQGNE 176 748 +DWREKG V PVK+QG+C + +AF+A A+E + G+L+S SEQ ++DC+ 749Sbjct: 84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCA--NFTN 141 750 751Query: 177 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE--SCKYNPKYSVANDTGFVDIPKQEKA 234 752 C L + G+ +E YPY E C+Y+ T ++D+ E+ 753Sbjct: 142 PCQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLRPT-YIDVYPNEEW 200 754 755Query: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDN 292 756 + T G + SF YK GIY + +C + + + +VGYG + E 757Sbjct: 201 ARAHITTFGTGYFRM-RSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAE--- 256 758 759Query: 293 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331 760 KYW+VK S+G WG GY+K+A++ N CG+A + S P 761Sbjct: 257 -KYWIVKGSFGTSWGEHGYMKLARN-VNACGMAESISIP 293 762 763 764>Y71H2AR.2 CE22930 (ST.LOUIS) protein_id:AAK29985.1 765 Length = 345 766 767 Score = 119 bits (297), Expect = 3e-27 768 Identities = 79/219 (36%), Positives = 115/219 (52%), Gaps = 12/219 (5%) 769 770Query: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT-GRLISLSEQNLVDCSGPQGNE 176 771 +DWREKG V PVK+QG+C + AF+ T ++E + T G L+S SEQ L+DC+ QG + 772Sbjct: 86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCN-DQGYK 144 773 774Query: 177 GCNGGLMDYAFQYVQDNGGLDSEESYPY-EATEESCKYNPKYSVANDTGFVDIPKQEKAL 235 775 GC A Y+ + G+++E YPY + T E C ++ S + V E 776Sbjct: 145 GCEEQFAMNAIGYLATH-GIETEADYPYVDKTNEKCTFDSTKSKIHLKKGVVAEGNEVLG 203 777 778Query: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFE--PDCSSEDMDHGVLVVGYGFESTESDNN 293 779 V GP + A S YK GIY +C+S +++VGYG E + 780Sbjct: 204 
KVYVTNYGPAFFTMRA-PPSLYDYKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQ---- 258 781 782Query: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332 783 KYW+VK S+G WG GY+K+A+D N C +A+ + T 784Sbjct: 259 KYWIVKGSFGTSWGEQGYMKLARD-VNACAMATTIAVLT 296 785 786 787>Y51A2D.1 CE18411 Cysteine proteases (2 domains) (HINXTON) TR:O62484 788 protein_id:CAA16404.1 789 Length = 382 790 791 Score = 105 bits (262), Expect = 4e-23 792 Identities = 95/353 (26%), Positives = 148/353 (41%), Gaps = 76/353 (21%) 793 794Query: 28 QWTKWKAMHNRLYGMNEEGWRRA---VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84 795 ++ ++K +R Y E R V +N ++ L+ + G++S A+N F D+T 796Sbjct: 43 EFVEFKKKFSRTYKSEAENQLRLQNFVKSRN-NVVRLNKNAQKAGRNS-NFAVNQFSDLT 100 797 798Query: 85 SEEFRQVMNGF-----------QNRKPRKGKVFQEPLFYEAPRSVDWREKGY-----VTP 128 799 + E Q ++ F +N K GK + E R+ D R + V P 800Sbjct: 101 TSELHQRLSRFPPNLTENSVFHKNFKKLLGKTRTKRQNSEFARNFDLRSQKVNGRYIVGP 160 801 802Query: 129 VKNQGQCGSCWAFSATGALEG------------------------------QMFRKTGRL 158 803 +KNQGQC CW F+ T LE + K 804Sbjct: 161 IKNQGQCACCWGFAVTAMLETIYAVNVGRFKLMSHIPALAPNFSDFDFFFFEFLAKLNMF 220 805 806Query: 159 ISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS 218 807 +S S+Q + DC+ GC GG + + +Y +N GL SE YP + + + 808Sbjct: 221 LSFSDQEMCDCATDGTKAGCAGGGLMWGVEYAINN-GLASEFDYPEFDQNRATRPGTCEA 279 809 810Query: 219 VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS-SEDMDHG 277 811 + +D T P++ A AG +FL YK G+ DC + + H 812Sbjct: 280 MDDD-----------------KTFPPVNFA--AG-TAFLQYKSGVLVTEDCDLAGTVWHA 319 813 814Query: 278 VLVVGYGFES-TESDNNKYWLVKNSWG-EEWGMGGYVKMAKDRRNHCGIASAA 328 815 +VGYG E+ + ++W++KNSWG WG GGYVK+ + +N CGI A 816Sbjct: 320 GAIVGYGEENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRG-KNWCGIERGA 371 817 818 819>F26E4.3 CE17714 cysteine protease (HINXTON) TR:P90850 820 protein_id:CAB03007.1 821 Length = 491 822 823 Score = 100 bits (250), Expect = 1e-21 824 Identities = 73/245 (29%), Positives = 110/245 (44%), Gaps = 35/245 (14%) 825 826Query: 113 EAPRSVDWREKG--YVTPVKNQGQCGSCWAFSATGALEGQM-FRKTGRLIS-LSEQNLVD 168 827 E P 
D R+K + PV +QG CGS W+ S T ++ GR+ S LS Q L+ 828Sbjct: 222 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLS 281 829 830Query: 169 CSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPY---EATEESCKYNPKYSVANDTGF 225 831 C+ + +GC GG +D A+ Y++ G + + YPY ++ E PK N G 832Sbjct: 282 CNQHR-QKGCEGGYLDRAWWYIRKLGVV-GDHCYPYVSGQSREPGHCLIPKRDYTNRQGL 339 833 834Query: 226 -----------------VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPD 268 835 + +E+ + + T GP+ HE F Y G+Y D 836Sbjct: 340 RCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVV-HEDFFMYAGGVYQHSD 398 837 838Query: 269 CSSE-------DMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNH 321 839 +++ + H V V+G+G + + KYWL NSWG +WG GY K+ + NH 840Sbjct: 399 LAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG-ENH 457 841 842Query: 322 CGIAS 326 843 C I S 844Sbjct: 458 CEIES 462 845 846 847>Y113G7B.15 CE23295 (HINXTON) TR:Q9U2X1 protein_id:CAB54334.1 848 Length = 328 849 850 Score = 100 bits (248), Expect = 2e-21 851 Identities = 92/317 (29%), Positives = 130/317 (40%), Gaps = 37/317 (11%) 852 853Query: 44 EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNG--------- 94 854 E+ R A + KN + I+ N + R + T N F D +E + 855Sbjct: 12 EKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKNRQELSARNSKIHPKNHTDL 71 856 857Query: 95 --FQNRKPRKGKVFQEPLFY----EAPRSVDWRE-----KGYVTPVKNQGQCGSCWAFSA 143 858 ++ R PR + + P D R+ V PVK+Q QCG CWAF+ 859Sbjct: 72 PIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFAT 131 860 861Query: 144 TGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYP 203 862 T E + SLS+Q + DC+ GC GG + V G S+ YP 863Sbjct: 132 TAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLKMVHLR-GQSSDGDYP 190 864 865Query: 204 YEA----TEESCKYNPKYSV-----ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHE 254 866 YE T +C + K +V N F +E + P +V G E 867Sbjct: 191 YEEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVG-E 249 868 869Query: 255 SFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV 312 870 +F +Y G+ DC + H V +VGYG T D YWLV+NSW +WG+ GYV 871Sbjct: 250 
NFEWYTSGVLQSEDCYQMTPAEWHSVAIVGYG---TSDDGVPYWLVRNSWNSDWGLHGYV 306 872 873Query: 313 KMAKDRRNHCGIASAAS 329 874 K+ + N C I S A+ 875Sbjct: 307 KIRRG-VNWCLIESHAA 322 876 877 878>F15D4.4 CE28917 cysteine protease (HINXTON) TR:Q93512 879 protein_id:CAB02487.1 880 Length = 622 881 882 Score = 97.8 bits (242), Expect = 8e-21 883 Identities = 77/296 (26%), Positives = 127/296 (42%), Gaps = 39/296 (13%) 884 885Query: 44 EEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRK 102 886 +EG +R V+ K K ++ HN Y G S+ M+ N F E + P 887Sbjct: 149 KEGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTA 208 888 889Query: 103 GKV---FQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI 159 890 + + +VDWR ++ P+ +Q CG CWAFS +E + 891Sbjct: 209 TVIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTS 266 892 893Query: 160 SLSEQNLVDCSGP------QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK- 212 894 SLS Q L+ C N GC GG A Y++ + D+ P++ + SC 895Sbjct: 267 SLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYLEVSAARDA-SLIPFDLEDTSCDS 325 896 897Query: 213 ------------YNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYK 260 898 ++ Y N T I ++ ++ GPI+V + AG + + Y 899Sbjct: 326 SFFPPVVPTILLFDDGYISGNFTAAQLITMEQN--IEDKVRKGPIAVGMAAGPDIYK-YS 382 900 901Query: 261 EGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK 316 902 EG+Y + DC + ++H V++VG+ + YW+++NSWG WG GY ++ + 903Sbjct: 383 EGVY-DGDCGT-IINHAVVIVGF--------TDDYWIIRNSWGASWGEAGYFRVKR 428 904 905 906>C50F4.3 CE05468 thiol protease (HINXTON) TR:Q18740 protein_id:CAA94738.1 907 Length = 374 908 909 Score = 94.4 bits (233), Expect = 9e-20 910 Identities = 80/292 (27%), Positives = 125/292 (42%), Gaps = 36/292 (12%) 911 912Query: 63 NQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ-----------NRKPRKGKVFQEPLF 111 913 N+ ++ H +N F D++ +E + + F N K + K E L 914Sbjct: 82 NKAAKKAGHDTKYGINKFSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGL- 140 915 916Query: 112 YEAPRSVDWREKGY-----VTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNL 166 917 P++ D R K + P+K Q C CW F+AT E + + ++LSEQ + 918Sbjct: 141 
---PKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEV 197 919 920Query: 167 VDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE-------ESCKYNPKYS- 218 921 DC+ P+ GCNGG +Y+++ GL + YP+ ES KY+ + + 922Sbjct: 198 CDCA-PKHGPGCNGGDPVDGLEYIKEM-GLTGGKEYPFNVNRSTQLGRCESEKYDRELNP 255 923 924Query: 219 VANDTGFVDIPKQEKALMKAVATVG-PISVAIDAGHESFLFYKEGIYFEPDCSSEDMD-- 275 925 + D +D E + + + PISVA G S Y GI DC E 926Sbjct: 256 LELDYYAIDPFNAEYQMTHHLYLLNLPISVAFRTG-ASLSSYLSGILELADCDDEKGGHW 314 927 928Query: 276 HGVLVVGYGFESTESDNN-KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIAS 326 929 H +VGYG + YW+ +NSW +WG GY ++ + + C I S 930Sbjct: 315 HSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGDDGYARIVRG-EDWCSIES 365 931 932 933>C32B5.7 CE08515 cathepsin-like peptidase (ST.LOUIS) TR:P91111 934 protein_id:AAB37963.1 935 Length = 250 936 937 Score = 94.0 bits (232), Expect = 1e-19 938 Identities = 63/191 (32%), Positives = 100/191 (51%), Gaps = 18/191 (9%) 939 940Query: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177 941 +DWR++G V PVK+QG C + +AF+A A+E G+L+S SEQ ++DC G E 942Sbjct: 72 LDWRDEGVVGPVKDQGNCNASYAFAAISAIESMYAIANGQLLSFSEQQIIDCLGGCAIES 131 943 944Query: 178 CNGGLMDYAFQYVQDNGGLDSEESYPYEATE-ESCKYNPK--YSVANDTGFVDIPKQEKA 234 945 M Y + G+++ YP+ + E C+Y+ K Y + +DT D+ + A 946Sbjct: 132 DPMMAMTYL-----ERKGIETYTDYPFVGKKNEKCEYDSKKAYLILDDT--YDMSDESLA 184 947 948Query: 235 LMKAVATVGPISVAIDAGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDN 292 949 L+ + GP ++ SF YK GIY E +C S + + +VGYG + ++ 950Sbjct: 185 LV-FIDERGPGLFTMNT-PPSFFNYKSGIYNPTEEECKSTNEKRALTIVGYGNDKGQN-- 240 951 952Query: 293 NKYWLVKNSWG 303 953 YW+VK S+G 954Sbjct: 241 --YWIVKGSFG 249 955 956 957>M04G12.2 CE12424 cysteine protease (HINXTON) TR:P92005 958 protein_id:CAB03209.1 959 Length = 467 960 961 Score = 92.0 bits (227), Expect = 4e-19 962 Identities = 82/288 (28%), Positives = 133/288 (45%), Gaps = 45/288 (15%) 963 964Query: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRK-GKVFQE---PLFYEA------- 114 965 Y E + M++ + +SEE+ + + +K GKVF+ P +E+ 966Sbjct: 161 
YYEPNDEALVDMSSESEESSEEWEEARPYLKCGCLKKSGKVFESKTAPREWESSSFKSND 220 967 968Query: 115 -PRSVDWREKG---YVTPVKNQG---QCGSCWAFSATGALEGQM-FRKTGR--LISLSEQ 164 969 P DWR Y +P +NQ CGSCW F TGAL + + GR + LS Q 970Sbjct: 221 LPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQ 280 971 972Query: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE---------SCKYNP 215 973 ++DC+G +GN C GG + ++ + G L E Y AT SC N 974Sbjct: 281 EIIDCNG-KGN--CQGGEIGNVLEHAKIQG-LVEEGCNVYRATNGECNPYHRCGSCWPNE 336 975 976Query: 216 KYSVANDTGFV-----DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS 270 977 +S+ N T + + ++K +M + GPI+ AI A + Y +G+Y E S 978Sbjct: 337 CFSLTNYTRYYVKDYGQVQGRDK-IMSEIKKGGPIACAIGATKKFEYEYVKGVYSEK--S 393 979 980Query: 271 SEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 318 981 + +H + + G+G + + +YW+ +NSWGE WG G+ ++ + 982Sbjct: 394 DLESNHIISLTGWG---VDENGVEYWIARNSWGEAWGELGWFRVVTSK 438 983 984 985>C52E4.1 CE08943 locus:cpr-1 cathepsin-like cysteine protease (HINXTON) 986 TR:Q18783 protein_id:CAB01410.1 987 Length = 340 988 989 Score = 88.6 bits (218), Expect = 5e-18 990 Identities = 66/251 (26%), Positives = 104/251 (41%), Gaps = 37/251 (14%) 991 992Query: 107 QEPLFYEAPRSVD----WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT--GRLIS 160 993 QE + P + D W E + +++Q CGSCWAF A + + +T + 994Sbjct: 89 QEVVLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPI 148 995 996Query: 161 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY-----PYE---------- 205 997 +S +L+ C G GC GG A ++ G + + + PY 998Sbjct: 149 ISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCP 208 999 1000Query: 206 -----ATEESCKYNPKYSVANDTGF----VDIPKQEKALMKAVATVGPISVAIDAGHESF 256 1001 + SC+ + A D F +PK ++ + GP+ A +E F 1002Sbjct: 209 ESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSV-YEDF 267 1003 1004Query: 257 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK 316 1005 YK G+Y + H + ++G+G ES + YWLV NSWG WG G+ K+ + 1006Sbjct: 268 YKYKSGVY-KHTAGKYLGGHAIKIIGWGTES----GSPYWLVANSWGVNWGESGFFKIYR 322 1007 1008Query: 317 DRRNHCGIASA 327 1009 + 
CGI SA 1010Sbjct: 323 G-DDQCGIESA 332 1011 1012 1013>F32B5.8 CE09855 cysteine proteinase (ST.LOUIS) TR:O01850 1014 protein_id:AAB54210.1 1015 Length = 427 1016 1017 Score = 88.2 bits (217), Expect = 6e-18 1018 Identities = 85/288 (29%), Positives = 130/288 (44%), Gaps = 54/288 (18%) 1019 1020Query: 75 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF---YEA--------PRSVDWREK 123 1021 +A +A+G + R N + + G+VF+ + YE P++ DWR+ 1022Sbjct: 137 LASSAYGKVRKYSNRNRYN-LKGCYKQTGRVFEHKRYDRIYETEDFDSEDLPKTWDWRDA 195 1023 1024Query: 124 G---YVTPVKNQG---QCGSCWAFSATGALEGQMFRKTGRL---ISLSEQNLVDCSGP-- 172 1025 Y + +NQ CGSCWAF AT AL ++ K LS Q ++DCSG 1026Sbjct: 196 NGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSGAGT 255 1027 1028Query: 173 --QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK-YN-------------PK 216 1029 G E GG+ YA ++ G+ E Y+A + C YN 1030Sbjct: 256 CVMGGEP--GGVYKYAHEH-----GIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKN 308 1031 1032Query: 217 YSVANDTGFVDIPKQEKALMKA-VATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD 275 1033 Y++ + + + EK MKA + GPI+ I A ++F Y GIY E + ED+D 1034Sbjct: 309 YTLYKVSEYGTVHGYEK--MKAEIYHKGPIACGI-AATKAFETYAGGIYKE--VTDEDID 363 1035 1036Query: 276 HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCG 323 1037 H + V G+G + +YW+ +NSWGE WG G+ K+ + + G 1038Sbjct: 364 HIISVHGWGVD--HESGVEYWIGRNSWGEPWGEHGWFKIVTSQYKNAG 409 1039 1040 1041>F57F5.1 CE05999 cysteine protease (HINXTON) TR:Q20950 1042 protein_id:CAB00098.1 1043 Length = 400 1044 1045 Score = 85.9 bits (211), Expect = 3e-17 1046 Identities = 72/280 (25%), Positives = 114/280 (40%), Gaps = 51/280 (18%) 1047 1048Query: 89 RQVMNGFQNRKPRKGKVFQ--EPLFYEA--PRSVD----WREKGYVTPVKNQGQCGSCWA 140 1049 +Q+M P + +VF+ P +A P S D W ++ +++Q CGSCWA 1050Sbjct: 117 KQLMGAKMVEIPEEYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWA 176 1051 1052Query: 141 FSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY--------- 189 1053 SA + + + ++S+S ++ C G GCNGG A+++ 1054Sbjct: 177 VSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG 236 1055 1056Query: 190 
--VQDNGGLD-------------------SEESYPYEATEESCKYNPKYSVANDTGF--- 225 1057 QD G YP + E SC+ + D F 1058Sbjct: 237 GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQS 296 1059 1060Query: 226 -VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG 284 1061 + K+ + K + T GP+ VA +E F Y G+Y +S H V ++G+G 1062Sbjct: 297 AYAVSKKAAEIQKEIMTHGPVEVAFTV-YEDFEHYSGGVYVHTAGASLG-GHAVKMLGWG 354 1063 1064Query: 285 FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 324 1065 + + YWL NSW E+WG GY ++ + N CGI 1066Sbjct: 355 VD----NGTPYWLCANSWNEDWGENGYFRIIRG-VNECGI 389 1067 1068 1069>T10H4.12 CE27590 locus:cpr-3 protease (HINXTON) TR:Q9TW93 1070 protein_id:CAB61024.2 1071 Length = 370 1072 1073 Score = 77.4 bits (189), Expect = 1e-14 1074 Identities = 80/345 (23%), Positives = 131/345 (37%), Gaps = 76/345 (22%) 1075 1076Query: 12 LGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKH 71 1077 +G + + DH Q T W A HN + + +E K++++ 1078Sbjct: 27 IGQSPQKVLVDHVNTVQ-TSWVAEHNEI----------SEFEMKFKVMDV---------- 65 1079 1080Query: 72 SFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK----GYVT 127 1081 F + D+ SE F +G++ EPL P + D REK + 1082Sbjct: 66 KFAEPLEKDSDVASELFV------------RGEIVPEPL----PDTFDAREKWPDCNTIK 109 1083 1084Query: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTG--RLISLSEQNLVDCSGPQGNEGCNGGLMDY 185 1085 ++NQ CGSCWAF A + ++ ++ + +S ++++ C G GC GG 1086Sbjct: 110 LIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIE 169 1087 1088Query: 186 AFQYVQDNGGLDSEE-----SYPY----------EATEESCK-----------YNPKYSV 219 1089 A ++ +G + + PY E+T SCK Y 1090Sbjct: 170 ALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTTPSCKTTCQSSYKTEEYKKDKHY 229 1091 1092Query: 220 ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVL 279 1093 V K + + GP+ + +E F YK G+Y H V 1094Sbjct: 230 GASAYKVTTTKSVTEIQTEIYHYGPVEASYKV-YEDFYHYKSGVYHYTSGKLVG-GHAVK 287 1095 1096Query: 280 VVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 324 1097 ++G+G E + YWL+ NSWG +G G+ K+ + N C I 1098Sbjct: 288 IIGWGVE----NGVDYWLIANSWGTSFGEKGFFKIRRG-TNECQI 327 1099 1100 
1101>F36D3.9 CE15973 cysteine protease (HINXTON) TR:O45466 1102 protein_id:CAB04322.1 1103 Length = 345 1104 1105 Score = 77.0 bits (188), Expect = 1e-14 1106 Identities = 65/245 (26%), Positives = 100/245 (40%), Gaps = 36/245 (14%) 1107 1108Query: 109 PLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS--LSEQNL 166 1109 PL ++A W + + ++ Q CGSCWAFS + + + +S +L 1110Sbjct: 102 PLNFDA--RTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDL 159 1111 1112Query: 167 VDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEE-------SYPYEATEE---------- 209 1113 + C G EGC+GG AFQ+ G + + YP 1114Sbjct: 160 LTCCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPYPIRPCNSDNCVNLQTPP 219 1115 1116Query: 210 ---SCKYNPKYSVANDTGF-----VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKE 261 1117 SC+ + + ND + +P+ A+ + GP+ VA +E F YK 1118Sbjct: 220 CRLSCQPGYRTTYTNDKNYGSNSAYPVPRTVAAIQADIYYNGPV-VAAFIVYEDFEKYKS 278 1119 1120Query: 262 GIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNH 321 1121 GIY S+ H V ++G+G E YWL NSWG +WG G ++ + + 1122Sbjct: 279 GIYRHIAGRSKG-GHAVKLIGWGTER----GTPYWLAVNSWGSQWGESGTFRILRG-VDE 332 1123 1124Query: 322 CGIAS 326 1125 CGI S 1126Sbjct: 333 CGIES 337 1127 1128 1129>C25B8.3 CE04078 locus:cpr-6 (ST.LOUIS) protein_id:AAK39189.1 1130 Length = 379 1131 1132 Score = 77.0 bits (188), Expect = 1e-14 1133 Identities = 67/255 (26%), Positives = 106/255 (41%), Gaps = 49/255 (19%) 1134 1135Query: 113 EAPRSVD----WREKGYVTPVKNQGQCGSCWAFSATGALEGQM-FRKTGRL-ISLSEQNL 166 1136 + P S D W + + +++Q CGSCWAF A A+ ++ G L ++LS +L 1137Sbjct: 104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDL 163 1138 1139Query: 167 VDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSE--------ESYPYEATEESCKYN---- 214 1140 + C G GCNGG A++Y +G + + YP+ E K 1141Sbjct: 164 LSCCKSCGF-GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDP 222 1142 1143Query: 215 --------PKYSVANDTGFVDIPKQE---------------KALMKAVATVGPISVAIDA 251 1144 PK + + D E +A+ K + T GP+ +A + 1145Sbjct: 223 CPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEV 282 1146 1147Query: 252 
GHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY 311 1148 +E FL Y G+Y H V ++G+G + D YW V NSW +WG G+ 1149Sbjct: 283 -YEDFLNYDGGVYVHTG-GKLGGGHAVKLIGWGID----DGIPYWTVANSWNTDWGEDGF 336 1150 1151Query: 312 VKMAKDRRNHCGIAS 326 1152 ++ + + CGI S 1153Sbjct: 337 FRILRG-VDECGIES 350 1154 1155 1156>W07B8.4 CE14680 thiol protease (ST.LOUIS) TR:O16288 protein_id:AAB65345.1 1157 Length = 335 1158 1159 Score = 75.9 bits (185), Expect = 3e-14 1160 Identities = 66/249 (26%), Positives = 99/249 (39%), Gaps = 47/249 (18%) 1161 1162Query: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS--LSEQNLVDCSGPQGN-- 175 1163 W + V +++Q CGSCWA +A A+ + + ++ LS ++++ C + N 1164Sbjct: 83 WPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNCG 142 1165 1166Query: 176 EGCNGGLMDYAFQYVQDNG---GLDSEESY---PYEAT---------------------- 207 1167 +GC GG A++Y NG G E Y PY 1168Sbjct: 143 DGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTP 202 1169 1170Query: 208 --EESCKYNPKYSVAND------TGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFY 259 1171 E C N Y + D I + K + + GP+ V +E F Y 1172Sbjct: 203 KCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIV-YEDFYLY 261 1173 1174Query: 260 KEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 319 1175 K GIY E H V ++G+G ++ YWL NSW WG GY ++ + 1176Sbjct: 262 KTGIYTHV-AGGELGGHAVKMLGWGVDN----GTPYWLAANSWNTVWGEKGYFRILRG-V 315 1177 1178Query: 320 NHCGIASAA 328 1179 + CGI SAA 1180Sbjct: 316 DECGIESAA 324 1181 1182 1183>Y71H2AM.3 CE26272 (ST.LOUIS) protein_id:AAK29976.1 1184 Length = 716 1185 1186 Score = 68.6 bits (166), Expect = 5e-12 1187 Identities = 55/168 (32%), Positives = 81/168 (47%), Gaps = 23/168 (13%) 1188 1189Query: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT-GRLISLSEQNLVDCSGPQGNE 176 1190 +DWR+KG V PVK+QG+C + AF+ + ++E + T G L+S SEQ L+DC G + 1191Sbjct: 86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCD-DHGFK 144 1192 1193Query: 177 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM 236 1194 GC A Y + G+++E YPY E ++N+T Q K L 1195Sbjct: 145 
GCEEQPAINAVSYFIFH-GIETEADYPYAGKENG-------KLSNET-------QGKEL- 188 1196 1197Query: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFE--PDCSSEDMDHGVLVVG 282 1198 V GP + A S YK GIY +C+S +++VG 1199Sbjct: 189 --VTNYGPAFFTMRA-PPSLYDYKIGIYNPSIEECTSTHEIRSMVIVG 233 1200 1201 1202 Database: /data_2/jason/blastdb/wormpep62 1203 Posted date: Sep 3, 2001 2:17 PM 1204 Number of letters in database: 8,813,425 1205 Number of sequences in database: 20,085 1206 1207Lambda K H 1208 0.317 0.133 0.417 1209 1210Gapped 1211Lambda K H 1212 0.267 0.0410 0.140 1213 1214 1215Matrix: BLOSUM62 1216Gap Penalties: Existence: 11, Extension: 1 1217Number of Hits to DB: 6230268 1218Number of Sequences: 20085 1219Number of extensions: 270881 1220Number of successful extensions: 651 1221Number of sequences better than 1.0e-10: 23 1222Number of HSP's better than 0.0 without gapping: 4 1223Number of HSP's successfully gapped in prelim test: 19 1224Number of HSP's that attempted gapping in prelim test: 588 1225Number of HSP's gapped (non-prelim): 27 1226length of query: 333 1227length of database: 8,813,425 1228effective HSP length: 45 1229effective length of query: 288 1230effective length of database: 7,909,600 1231effective search space: 2277964800 1232effective search space used: 2277964800 1233T: 11 1234A: 40 1235X1: 16 ( 7.3 bits) 1236X2: 38 (14.6 bits) 1237X3: 64 (24.7 bits) 1238S1: 41 (21.6 bits) 1239S2: 155 (64.3 bits) 1240BLASTP 2.1.3 [Apr-11-2001] 1241 1242 1243Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 1244Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 1245"Gapped BLAST and PSI-BLAST: a new generation of protein database search 1246programs", Nucleic Acids Res. 25:3389-3402. 
1247 1248Query= CATL_RAT 1249 (334 letters) 1250 1251Database: /data_2/jason/blastdb/wormpep62 1252 20,085 sequences; 8,813,425 total letters 1253 1254Searching..................................................done 1255 1256 Score E 1257Sequences producing significant alignments: (bits) Value 1258 1259T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O4573... 325 2e-89 1260F41E6.6 CE10254 cysteine protease and a protease inhibitor... 203 1e-52 1261R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id... 192 2e-49 1262R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 pr... 139 2e-33 1263Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 pr... 131 5e-31 1264 1265>T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O45734 1266 protein_id:CAB07275.1 1267 Length = 337 1268 1269 Score = 325 bits (834), Expect = 2e-89 1270 Identities = 159/311 (51%), Positives = 208/311 (66%), Gaps = 9/311 (2%) 1271 1272Query: 28 QWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEE 87 1273 +W +K + Y +EE+ + KNM I+ HN ++ G+ F M +N D+ + 1274Sbjct: 31 KWDDYKEDFDKEYSESEEQTYMEAFVKNMIHIENHNRDHRLGRKTFEMGLNHIADLPFSQ 90 1275 1276Query: 88 FRQIVNGYRH----QKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSA 143 1277 +R++ NGYR + K F P +Q+P VDWR+ VT VKNQG CGSCWAFSA 1278Sbjct: 91 YRKL-NGYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSA 149 1279 1280Query: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203 1281 +G LEGQ K G+L+SLSEQNLVDCS GN GCNGGLMD AF+YI++N G+D+EESYP 1282Sbjct: 150 TGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYP 209 1283 1284Query: 204 YEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSG 262 1285 Y+ +D C + + A+D G+VD P+ E+ L AVAT GPIS+A+DA H S Q Y G 1286Sbjct: 210 YKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKG 269 1287 1288Query: 263 IYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHC 322 1289 +YY+ CSS++LDHGVL+VGY GTD YW+VKNSWG WG GYI+IA++RNNHC 1290Sbjct: 270 
VYYDEECSSEELDHGVLLVGY---GTDPEHGDYWIVKNSWGAGWGEKGYIRIARNRNNHC 326 1291 1292Query: 323 GLATAASYPIV 333 1293 G+AT ASYP+V 1294Sbjct: 327 GVATKASYPLV 337 1295 1296 1297>F41E6.6 CE10254 cysteine protease and a protease inhibitor (ST.LOUIS) 1298 TR:O16454 protein_id:AAB65956.1 1299 Length = 498 1300 1301 Score = 203 bits (516), Expect = 1e-52 1302 Identities = 122/331 (36%), Positives = 183/331 (54%), Gaps = 45/331 (13%) 1303 1304Query: 36 HRRLYGTNEEEWRR-AVWEKNMRMI-QLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVN 93 1305 H + Y E +R V++KN ++I +L E +GFT F DMT EF++I+ 1306Sbjct: 181 HEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTK----FSDMTTMEFKKIML 236 1307 1308Query: 94 GYRHQKH----KKGRLFQEPLMLQ---IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGC 146 1309 Y+ ++ ++ + + + +P++ DWREKG VT VKNQG CGSCWAFS +G 1310Sbjct: 237 PYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGN 296 1311 1312Query: 147 LEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQ----YIKEN--------- 193 1313 +EG F+ KL+SLSEQ LVDC D +QGCNGGL A++ + +N 1314Sbjct: 297 VEGAWFIAKNKLVSLSEQELVDC--DSMDQGCNGGLPSNAYKIGKFVVSDNYCFLVFYHK 354 1315 1316Query: 194 --------GGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPI 245 1317 GGL+ E++YPY+ + +C + G V++P E + K + T GPI 1318Sbjct: 355 TTKEIIRMGGLEPEDAYPYDGRGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPI 414 1319 1320Query: 246 SVAMDASHPSLQFYSSGIY--YEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWG 303 1321 S+ ++A+ +LQFY G+ ++ C L+HGVL+VGYG +G + YW+VKNSWG 1322Sbjct: 415 SIGLNAN--TLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG----RKPYWIVKNSWG 468 1323 1324Query: 304 KEWGMDGYIKIAKDRNNHCGLATAASYPIVN 334 1325 WG GY K+ + + N CG+ A+ +VN 1326Sbjct: 469 PNWGEAGYFKLYRGK-NVCGVQEMATSALVN 498 1327 1328 1329>R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id:AAC69091.2 1330 Length = 383 1331 1332 Score = 192 bits (488), Expect = 2e-49 1333 Identities = 116/310 (37%), Positives = 176/310 (56%), Gaps = 29/310 (9%) 1334 1335Query: 37 RRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYR 96 1336 R+ E E+R ++ +N+ I+ E N G +++N F D T+EE +++V + 
1337Sbjct: 91 RKYTSVEEFEYRYQIFLRNV--IEFEAEEERN--LGLDLDVNEFTDWTDEELQKMVQENK 146 1338 1339Query: 97 HQKHKKGRLFQEPLMLQI----PKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMF 152 1340 + K+ E L+ P ++DWRE+G +TP+KNQGQCGSCWAF+ +E Q 1341Sbjct: 147 YTKYDFDTPKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNA 206 1342 1343Query: 153 LKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCK 212 1344 +K GKL+SLSEQ +VDC D N GC+GG +A +++KEN GL+SE+ YPY A K 1345Sbjct: 207 IKKGKLVSLSEQEMVDC--DGRNNGCSGGYRPYAMKFVKEN-GLESEKEYPYSA----LK 259 1346 1347Query: 213 YRAEYAVANDTG-FVD----IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYE- 266 1348 + + NDT F+D + E+ + V T GP++ M+ ++ Y SGI+ 1349Sbjct: 260 HDQCFLKENDTRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVV-KAMYSYRSGIFNPS 318 1350 1351Query: 267 -PNCSSKDLD-HGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGL 324 1352 +C+ K + H + ++GYG EG + YW+VKNSWG WG GY ++A+ N+ CGL 1353Sbjct: 319 VEDCTEKSMGAHALTIIGYGGEG----ESAYWIVKNSWGTSWGASGYFRLARGVNS-CGL 373 1354 1355Query: 325 ATAASYPIVN 334 1356 A PI+N 1357Sbjct: 374 ANTVVAPIIN 383 1358 1359 1360>R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 1361 protein_id:CAA89070.1 1362 Length = 402 1363 1364 Score = 139 bits (351), Expect = 2e-33 1365 Identities = 96/307 (31%), Positives = 154/307 (49%), Gaps = 36/307 (11%) 1366 1367Query: 40 YGTNEEEWRRAV----WEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIV--N 93 1368 Y T++E +R ++N+ + N E+ + ++G N D T+EEF + + 1369Sbjct: 101 YATSQESLKRLNAYYNTDENIANWNIQN-EHGSAEYGH----NDMSDWTDEEFEKTLLPK 155 1370 1371Query: 94 GYRHQKHKKGRLFQEPLMLQI-----------PKTVDWREKGCVTPVKNQGQCGSCWAFS 142 1372 + + HK+ F EP+ + P DWR+K +TPVK QGQCGSCWAF+ 1373Sbjct: 156 SFYKRLHKEAE-FIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFA 214 1374 1375Query: 143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202 1376 ++ +E + G+ +LSEQ L+DC D + C+GG D AF+YI N GL + 1377Sbjct: 215 STATVEAAWAIAHGEKRNLSEQTLLDC--DLVDNACDGGDEDKAFRYIHRN-GLANAVDL 271 1378 1379Query: 203 PYEA-KDGSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSS 261 1380 PY A 
+ C + + E +++ + GP+++ M P ++ Y 1381Sbjct: 272 PYVAHRQNGCAVNDHWNTTRIKAAYFLHHDEDSIINWLVNFGPVNIGMAVIQP-MRAYKG 330 1382 1383Query: 262 GIY--YEPNCSSKDLD-HGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMD-GYIKIAKD 317 1384 G++ E C ++ + H +L+ GY GT +KYW+VKNSWG WG++ GYI A+ 1385Sbjct: 331 GVFTPSEYACKNEVIGLHALLITGY---GTSKTGEKYWIVKNSWGNTWGVEHGYIYFARG 387 1386 1387Query: 318 RNNHCGL 324 1388 N CG+ 1389Sbjct: 388 -INACGI 393 1390 1391 1392>Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 1393 protein_id:CAA22062.1 1394 Length = 343 1395 1396 Score = 131 bits (330), Expect = 5e-31 1397 Identities = 88/284 (30%), Positives = 152/284 (52%), Gaps = 24/284 (8%) 1398 1399Query: 48 RRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQ 107 1400 R ++ +N+ +++ +N E + GK T E+N F D+T EE+++ + + H + L 1401Sbjct: 71 RFTIFSRNLDLVERYNKEDA-GK--VTYELNDFSDLTEEEWKKYLMTPKPD-HSEKSLKP 126 1402 1403Query: 108 EPLM--LQIPKTVDWRE---KGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLS 162 1404 + L+ +P +VDWR VT +K QG CGSCWAF+ + +E + + G L SLS 1405Sbjct: 127 KTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQSLS 186 1406 1407Query: 163 EQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVAND 222 1408 Q L+DC+ + C GG A +Y + + G+ + +YPY C+ VA 1409Sbjct: 187 SQQLLDCT--VVSDKCGGGEPVEALKYAQSH-GITTAHNYPYYFWTTKCRETVP-TVARI 242 1410 1411Query: 223 TGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVG 282 1412 + ++ + E + + VA GP+ V + + +FY SGI +P+C ++ H ++V+G 1413Sbjct: 243 SSWMK-AESEDEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPDCGTEP-THALIVIG 300 1414 1415Query: 283 YGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLAT 326 1416 YG + YW++KN++ K WG GY+++ +D N CG+ T 1417Sbjct: 301 YGPD--------YWILKNTYSKVWGEKGYMRVKRD-VNWCGINT 335 1418 1419 1420>K02E7.10 CE11640 protease (ST.LOUIS) TR:O17255 protein_id:AAB71030.1 1421 Length = 299 1422 1423 Score = 128 bits (321), Expect = 6e-30 1424 Identities = 81/222 (36%), Positives = 125/222 (55%), Gaps = 18/222 (8%) 1425 1426Query: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLK--TGKLISLSEQNLVDCSHDQGN 175 
1427 +DWREKG V PVK+QG+C + +AF+A +E M+ K GKL+S SEQ ++DC++ 1428Sbjct: 84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIE-SMYAKANNGKLLSFSEQQIIDCAN--FT 140 1429 1430Query: 176 QGCNGGLMD-FAFQYIKENGGLDSEESYPYEAKD--GSCKYRAEYAVANDTGFVDIPQQE 232 1431 C L + + +++KEN G+ +E YPY K+ G C+Y + T ++D+ E 1432Sbjct: 141 NPCQENLENVLSNRFLKEN-GVGTEADYPYVGKENVGKCEYDSSKMKLRPT-YIDVYPNE 198 1433 1434Query: 233 KALMKAVATVGPISVAMDASHPSLQFYSSGIY--YEPNCSSKDLDHGVLVVGYGYEGTDS 290 1435 + + T G M S PS Y +GIY + C + + + +VGYG +G 1436Sbjct: 199 EWARAHITTFGTGYFRM-RSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGA-- 255 1437 1438Query: 291 NKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPI 332 1439 +KYW+VK S+G WG GY+K+A++ N CG+A + S PI 1440Sbjct: 256 --EKYWIVKGSFGTSWGEHGYMKLARN-VNACGMAESISIPI 294 1441 1442 1443>Y71H2AR.2 CE22930 (ST.LOUIS) protein_id:AAK29985.1 1444 Length = 345 1445 1446 Score = 120 bits (301), Expect = 1e-27 1447 Identities = 81/214 (37%), Positives = 114/214 (52%), Gaps = 14/214 (6%) 1448 1449Query: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLK--TGKLISLSEQNLVDCSHDQGN 175 1450 +DWREKG V PVK+QG+C + AF+ + +E M+ K G L+S SEQ L+DC +DQG 1451Sbjct: 86 LDWREKGIVGPVKDQGKCNASHAFAITSSIE-SMYAKATNGTLLSFSEQQLIDC-NDQGY 143 1452 1453Query: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAK-DGSCKYRAEYAVANDTGFVDIPQQEKA 234 1454 +GC A Y+ + G+++E YPY K + C + + + + V E 1455Sbjct: 144 KGCEEQFAMNAIGYLATH-GIETEADYPYVDKTNEKCTFDSTKSKIHLKKGVVAEGNEVL 202 1456 1457Query: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYE--PNCSSKDLDHGVLVVGYGYEGTDSNK 292 1458 V GP M A PSL Y GIY C+S +++VGYG EG + 1459Sbjct: 203 GKVYVTNYGPAFFTMRAP-PSLYDYKIGIYNPSIEECTSTHEIRSMVIVGYGIEG----E 257 1460 1461Query: 293 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLAT 326 1462 KYW+VK S+G WG GY+K+A+D N C +AT 1463Sbjct: 258 QKYWIVKGSFGTSWGEQGYMKLARD-VNACAMAT 290 1464 1465 1466>Y51A2D.8 CE19204 Cysteine proteases (2 domains) (HINXTON) TR:Q9XXQ7 1467 protein_id:CAA16407.1 1468 Length = 386 1469 1470 Score = 108 bits (271), Expect = 4e-24 1471 Identities = 64/203 (31%), Positives = 99/203 (48%), Gaps = 11/203 (5%) 1472 
1473Query: 126 VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDF 185 1474 V P+K+QGQC CW F+ + +E +GK SLS+Q + DC +G GC GG + 1475Sbjct: 164 VGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCG-TEGTPGCKGGSLTL 222 1476 1477Query: 186 AFQYIKENGGLDSEESYPYE---AKDG-SCKYRAEYAVANDTGF---VDIPQQEKALMKA 238 1478 QY+K+ GL +E YPY+ A G C+ R + F V P++ + + 1479Sbjct: 223 GVQYVKKY-GLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQ 281 1480 1481Query: 239 VATVGPISVAMDAS-HPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYG-YEGTDSNKDKYW 296 1482 V T + VA+ + Y G+ E +C H +VGY E + YW 1483Sbjct: 282 VLTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYW 341 1484 1485Query: 297 LVKNSWGKEWGMDGYIKIAKDRN 319 1486 ++KNSWG +W GY+++ + R+ 1487Sbjct: 342 IIKNSWGGDWAESGYVRVVRGRD 364 1488 1489 1490>Y113G7B.15 CE23295 (HINXTON) TR:Q9U2X1 protein_id:CAB54334.1 1491 Length = 328 1492 1493 Score = 99.4 bits (246), Expect = 3e-21 1494 Identities = 87/321 (27%), Positives = 127/321 (39%), Gaps = 47/321 (14%) 1495 1496Query: 36 HRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNG 94 1497 H++ Y T E+ RR A + KN + IQ N + T N F D +E N 1498Sbjct: 3 HKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKNRQEL-SARNS 61 1499 1500Query: 95 YRHQKHKKGRLFQEPLMLQ----------------IPKTVDWRE-----KGCVTPVKNQG 133 1501 H K+ +P + IP D R+ V PVK+Q 1502Sbjct: 62 KIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKDQE 121 1503 1504Query: 134 QCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKEN 193 1505 QCG CWAF+ + E L + SLS+Q + DC+ GC GG + + 1506Sbjct: 122 QCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLKMVHLR 181 1507 1508Query: 194 GGLDSEESYPYEA----KDGSCKYRAEYAVAN---------DTGFVDIPQQEKALMKAVA 240 1509 G S+ YPYE G+C + V D + + E + + 1510Sbjct: 182 -GQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIP 240 1511 1512Query: 241 TVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLD--HGVLVVGYGYEGTDSNKDKYWLV 298 1513 T V + ++Y+SG+ +C H V +VGY GT + YWLV 1514Sbjct: 241 TAVYFRVG-----ENFEWYTSGVLQSEDCYQMTPAEWHSVAIVGY---GTSDDGVPYWLV 292 1515 
1516Query: 299 KNSWGKEWGMDGYIKIAKDRN 319 1517 +NSW +WG+ GY+KI + N 1518Sbjct: 293 RNSWNSDWGLHGYVKIRRGVN 313 1519 1520 1521>C50F4.3 CE05468 thiol protease (HINXTON) TR:Q18740 protein_id:CAA94738.1 1522 Length = 374 1523 1524 Score = 98.2 bits (243), Expect = 6e-21 1525 Identities = 80/270 (29%), Positives = 119/270 (43%), Gaps = 27/270 (10%) 1526 1527Query: 71 HGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKG-------RLFQEPLMLQIPKTVDWREK 123 1528 H +N F D++ +E + + + K+ L + M +PKT D R K 1529Sbjct: 90 HDTKYGINKFSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNK 149 1530 1531Query: 124 GC-----VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGC 178 1532 + P+K Q C CW F+A+ E + + K ++LSEQ + DC+ G GC 1533Sbjct: 150 KVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHG-PGC 208 1534 1535Query: 179 NGGLMDFAFQYIKENGGLDSEESYPYEAKD----GSC---KYRAEY-AVANDTGFVDIPQ 230 1536 NGG +YIKE GL + YP+ G C KY E + D +D 1537Sbjct: 209 NGGDPVDGLEYIKEM-GLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFN 267 1538 1539Query: 231 QEKALMKAVATVG-PISVAMDASHPSLQFYSSGIYYEPNCSSKDLD--HGVLVVGYGYEG 287 1540 E + + + PISVA + SL Y SGI +C + H +VGYG 1541Sbjct: 268 AEYQMTHHLYLLNLPISVAF-RTGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTK 326 1542 1543Query: 288 TDSNKD-KYWLVKNSWGKEWGMDGYIKIAK 316 1544 + + YW+ +NSW +WG DGY +I + 1545Sbjct: 327 NSAGRTVDYWIFRNSWWTDWGDDGYARIVR 356 1546 1547 1548>F26E4.3 CE17714 cysteine protease (HINXTON) TR:P90850 1549 protein_id:CAB03007.1 1550 Length = 491 1551 1552 Score = 97.8 bits (242), Expect = 8e-21 1553 Identities = 68/241 (28%), Positives = 113/241 (46%), Gaps = 35/241 (14%) 1554 1555Query: 113 QIPKTVDWREKG--CVTPVKNQGQCGSCWAFSASGCLEGQM-FLKTGKLIS-LSEQNLVD 168 1556 ++P+ D R+K + PV +QG CGS W+ S + ++ + G++ S LS Q L+ 1557Sbjct: 222 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLS 281 1558 1559Query: 169 CSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEA----KDGSCKY----------- 213 1560 C+ + +GC GG +D A+ YI++ G + + YPY + + G C 1561Sbjct: 282 CNQHR-QKGCEGGYLDRAWWYIRKLGVV-GDHCYPYVSGQSREPGHCLIPKRDYTNRQGL 339 1562 
1563Query: 214 RAEYAVANDTGFVDIP-----QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN 268 1564 R + T F P +E+ + + T GP+ H Y+ G+Y + 1565Sbjct: 340 RCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATF-VVHEDFFMYAGGVYQHSD 398 1566 1567Query: 269 CSSK-------DLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321 1568 +++ + H V V+G+G + + KYWL NSWG +WG DGY K+ + NH 1569Sbjct: 399 LAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG-ENH 457 1570 1571Query: 322 C 322 1572 C 1573Sbjct: 458 C 458 1574 1575 1576>F15D4.4 CE28917 cysteine protease (HINXTON) TR:Q93512 1577 protein_id:CAB02487.1 1578 Length = 622 1579 1580 Score = 96.7 bits (239), Expect = 2e-20 1581 Identities = 65/219 (29%), Positives = 102/219 (45%), Gaps = 35/219 (15%) 1582 1583Query: 117 TVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDC------S 170 1584 TVDWR + P+ +Q CG CWAFS +E ++ SLS Q L+ C + 1585Sbjct: 226 TVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQLLTCDTKVDST 283 1586 1587Query: 171 HDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCK-------------YRAEY 217 1588 + N GC GG A Y++ + D+ P++ +D SC + Y 1589Sbjct: 284 YGLANVGCKGGYFQIAGSYLEVSAARDAS-LIPFDLEDTSCDSSFFPPVVPTILLFDDGY 342 1590 1591Query: 218 AVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHG 277 1592 N T I ++ K GPI+V M A+ P + YS G+Y + +C + ++H 1593Sbjct: 343 ISGNFTAAQLITMEQNIEDKV--RKGPIAVGM-AAGPDIYKYSEGVY-DGDCGTI-INHA 397 1594 1595Query: 278 VLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAK 316 1596 V++VG+ D YW+++NSWG WG GY ++ + 1597Sbjct: 398 VVIVGF--------TDDYWIIRNSWGASWGEAGYFRVKR 428 1598 1599 1600>C32B5.7 CE08515 cathepsin-like peptidase (ST.LOUIS) TR:P91111 1601 protein_id:AAB37963.1 1602 Length = 250 1603 1604 Score = 90.1 bits (222), Expect = 2e-18 1605 Identities = 63/191 (32%), Positives = 98/191 (50%), Gaps = 18/191 (9%) 1606 1607Query: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQG 177 1608 +DWR++G V PVK+QG C + +AF+A +E + G+L+S SEQ ++DC G 1609Sbjct: 72 LDWRDEGVVGPVKDQGNCNASYAFAAISAIESMYAIANGQLLSFSEQQIIDC---LGGCA 128 1610 1611Query: 178 
CNGGLMDFAFQYIKENGGLDSEESYPYEA-KDGSCKY--RAEYAVANDTGFVDIPQQEKA 234 1612 M A Y+ E G+++ YP+ K+ C+Y + Y + +DT D+ + A 1613Sbjct: 129 IESDPM-MAMTYL-ERKGIETYTDYPFVGKKNEKCEYDSKKAYLILDDT--YDMSDESLA 184 1614 1615Query: 235 LMKAVATVGPISVAMDASHPSLQFYSSGIY--YEPNCSSKDLDHGVLVVGYGYEGTDSNK 292 1616 L+ + GP M+ + PS Y SGIY E C S + + +VGYG + 1617Sbjct: 185 LV-FIDERGPGLFTMN-TPPSFFNYKSGIYNPTEEECKSTNEKRALTIVGYG----NDKG 238 1618 1619Query: 293 DKYWLVKNSWG 303 1620 YW+VK S+G 1621Sbjct: 239 QNYWIVKGSFG 249 1622 1623 1624>Y51A2D.1 CE18411 Cysteine proteases (2 domains) (HINXTON) TR:O62484 1625 protein_id:CAA16404.1 1626 Length = 382 1627 1628 Score = 87.8 bits (216), Expect = 8e-18 1629 Identities = 87/350 (24%), Positives = 139/350 (38%), Gaps = 76/350 (21%) 1630 1631Query: 31 QWKSTHRRLYGTNEEEWRRA---VWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEE 87 1632 ++K R Y + E R V +N +++L+ G++ +N F D+T E 1633Sbjct: 46 EFKKKFSRTYKSEAENQLRLQNFVKSRN-NVVRLNKNAQKAGRNS-NFAVNQFSDLTTSE 103 1634 1635Query: 88 FRQIV---------NGYRHQKHKK--GRLFQEPLMLQIPKTVDWREKGC-----VTPVKN 131 1636 Q + N H+ KK G+ + + + D R + V P+KN 1637Sbjct: 104 LHQRLSRFPPNLTENSVFHKNFKKLLGKTRTKRQNSEFARNFDLRSQKVNGRYIVGPIKN 163 1638 1639Query: 132 QGQCGSCWAFSASGCLEG------------------------------QMFLKTGKLISL 161 1640 QGQC CW F+ + LE + K +S 1641Sbjct: 164 QGQCACCWGFAVTAMLETIYAVNVGRFKLMSHIPALAPNFSDFDFFFFEFLAKLNMFLSF 223 1642 1643Query: 162 SEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVAN 221 1644 S+Q + DC+ D GC GG + + +Y N GL SE YP ++ + + A+ + 1645Sbjct: 224 SDQEMCDCATDGTKAGCAGGGLMWGVEY-AINNGLASEFDYPEFDQNRATRPGTCEAMDD 282 1646 1647Query: 222 DTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCS-SKDLDHGVLV 280 1648 D T P++ A A LQ Y SG+ +C + + H + 1649Sbjct: 283 D-----------------KTFPPVNFA--AGTAFLQ-YKSGVLVTEDCDLAGTVWHAGAI 322 1650 1651Query: 281 VGYGYEG-TDSNKDKYWLVKNSWG-KEWGMDGYIKIAKDRNNHCGLATAA 328 1652 VGYG E ++W++KNSWG WG GY+K+ + + N CG+ A 1653Sbjct: 323 VGYGEENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRGK-NWCGIERGA 371 1654 1655 1656>C52E4.1 CE08943 locus:cpr-1 
cathepsin-like cysteine protease (HINXTON) 1657 TR:Q18783 protein_id:CAB01410.1 1658 Length = 340 1659 1660 Score = 87.4 bits (215), Expect = 1e-17 1661 Identities = 66/252 (26%), Positives = 110/252 (43%), Gaps = 39/252 (15%) 1662 1663Query: 107 QEPLMLQIPKTVD----WREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKT--GKLIS 160 1664 QE ++ +P T D W E + +++Q CGSCWAF A+ + + ++T + 1665Sbjct: 89 QEVVLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPI 148 1666 1667Query: 161 LSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESY-----PY----------- 204 1668 +S +L+ C GC GG A ++ G + + + PY 1669Sbjct: 149 ISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCP 208 1670 1671Query: 205 EAKDGSCKYRAEY----AVANDTGF----VDIPQQEKALMKAVATVGPISVAMDASHPSL 256 1672 E+K SC + A A D F +P+ ++ + GP+ A + 1673Sbjct: 209 ESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSV-YEDF 267 1674 1675Query: 257 QFYSSGIYYEPNCSSKDLD-HGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIA 315 1676 Y SG+Y + + K L H + ++G+G E + YWLV NSWG WG G+ KI 1677Sbjct: 268 YKYKSGVY--KHTAGKYLGGHAIKIIGWGTE----SGSPYWLVANSWGVNWGESGFFKIY 321 1678 1679Query: 316 KDRNNHCGLATA 327 1680 + ++ CG+ +A 1681Sbjct: 322 RG-DDQCGIESA 332 1682 1683 1684>F32B5.8 CE09855 cysteine proteinase (ST.LOUIS) TR:O01850 1685 protein_id:AAB54210.1 1686 Length = 427 1687 1688 Score = 85.9 bits (211), Expect = 3e-17 1689 Identities = 73/261 (27%), Positives = 123/261 (46%), Gaps = 36/261 (13%) 1690 1691Query: 88 FRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPV---KNQG---QCGSCWAF 141 1692 ++Q + H+++ + ++ +PKT DWR+ + +NQ CGSCWAF 1693Sbjct: 160 YKQTGRVFEHKRYDRIYETEDFDSEDLPKTWDWRDANGINYASADRNQHIPQYCGSCWAF 219 1694 1695Query: 142 SASGCLEGQMFLKTGKL---ISLSEQNLVDCSHDQGNQGC-NGGLMDFAFQYIKENGGLD 197 1696 A+ L ++ +K LS Q ++DCS G C GG ++Y E+G + 1697Sbjct: 220 GATSALADRINIKRKNAWPQAYLSVQEVIDCS---GAGTCVMGGEPGGVYKYAHEHG-IP 275 1698 1699Query: 198 SEESYPYEAKDGSCK-YRA-------------EYAVANDTGFVDIPQQEKALMKA-VATV 242 1700 E Y+A+DG C Y Y + + + + EK MKA + 1701Sbjct: 276 
HETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTVHGYEK--MKAEIYHK 333 1702 1703Query: 243 GPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSW 302 1704 GPI+ + A+ + + Y+ GIY E + +D+DH + V G+G + + +YW+ +NSW 1705Sbjct: 334 GPIACGIAATK-AFETYAGGIYKE--VTDEDIDHIISVHGWGVD--HESGVEYWIGRNSW 388 1706 1707Query: 303 GKEWGMDGYIKIAKDRNNHCG 323 1708 G+ WG G+ KI + + G 1709Sbjct: 389 GEPWGEHGWFKIVTSQYKNAG 409 1710 1711 1712>M04G12.2 CE12424 cysteine protease (HINXTON) TR:P92005 1713 protein_id:CAB03209.1 1714 Length = 467 1715 1716 Score = 83.2 bits (204), Expect = 2e-16 1717 Identities = 62/228 (27%), Positives = 107/228 (46%), Gaps = 33/228 (14%) 1718 1719Query: 114 IPKTVDWREKGCV---TPVKNQG---QCGSCWAFSASGCLEGQMFL-KTGK--LISLSEQ 164 1720 +P DWR V +P +NQ CGSCW F +G L + + + G+ + LS Q 1721Sbjct: 221 LPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQ 280 1722 1723Query: 165 NLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLD---------SEESYPYEAKDGSCKYRA 215 1724 ++DC+ G C GG + ++ K G ++ + E PY + GSC 1725Sbjct: 281 EIIDCN---GKGNCQGGEIGNVLEHAKIQGLVEEGCNVYRATNGECNPYH-RCGSCWPNE 336 1726 1727Query: 216 EYAVANDTGFV-----DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCS 270 1728 +++ N T + + ++K +M + GPI+ A+ A+ Y G+Y E S 1729Sbjct: 337 CFSLTNYTRYYVKDYGQVQGRDK-IMSEIKKGGPIACAIGATKKFEYEYVKGVYSEK--S 393 1730 1731Query: 271 SKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDR 318 1732 + +H + + G+G D N +YW+ +NSWG+ WG G+ ++ + 1733Sbjct: 394 DLESNHIISLTGWG---VDENGVEYWIARNSWGEAWGELGWFRVVTSK 438 1734 1735 1736>Y71H2AM.3 CE26272 (ST.LOUIS) protein_id:AAK29976.1 1737 Length = 716 1738 1739 Score = 75.5 bits (184), Expect = 4e-14 1740 Identities = 60/169 (35%), Positives = 83/169 (48%), Gaps = 25/169 (14%) 1741 1742Query: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLK--TGKLISLSEQNLVDCSHDQGN 175 1743 +DWR+KG V PVK+QG+C + AF+ S +E M+ K G L+S SEQ L+DC D G 1744Sbjct: 86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIE-SMYAKATNGSLLSFSEQQLIDCD-DHGF 143 1745 1746Query: 176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQQEKAL 235 1747 +GC A Y + 
G+++E YPY K+ ++N+T Q K L 1748Sbjct: 144 KGCEEQPAINAVSYFIFH-GIETEADYPYAGKENG-------KLSNET-------QGKEL 188 1749 1750Query: 236 MKAVATVGPISVAMDASHPSLQFYSSGIYYE--PNCSSKDLDHGVLVVG 282 1751 V GP M A PSL Y GIY C+S +++VG 1752Sbjct: 189 ---VTNYGPAFFTMRAP-PSLYDYKIGIYNPSIEECTSTHEIRSMVIVG 233 1753 1754 1755>T10H4.12 CE27590 locus:cpr-3 protease (HINXTON) TR:Q9TW93 1756 protein_id:CAB61024.2 1757 Length = 370 1758 1759 Score = 74.7 bits (182), Expect = 7e-14 1760 Identities = 60/250 (24%), Positives = 102/250 (40%), Gaps = 42/250 (16%) 1761 1762Query: 102 KGRLFQEPLMLQIPKTVDWREK----GCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTG- 156 1763 +G + EPL P T D REK + ++NQ CGSCWAF A+ + ++ +++ 1764Sbjct: 84 RGEIVPEPL----PDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNG 139 1765 1766Query: 157 -KLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEE-----SYPY------ 204 1767 + +S ++++ C GC GG A ++ +G + + PY 1768Sbjct: 140 TQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCT 199 1769 1770Query: 205 ----EAKDGSCK-----------YRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAM 249 1771 E+ SCK Y+ + V + + + GP+ + 1772Sbjct: 200 KNCPESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASY 259 1773 1774Query: 250 DASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMD 309 1775 + Y SG+Y+ + H V ++G+G E N YWL+ NSWG +G 1776Sbjct: 260 KV-YEDFYHYKSGVYHYTSGKLVG-GHAVKIIGWGVE----NGVDYWLIANSWGTSFGEK 313 1777 1778Query: 310 GYIKIAKDRN 319 1779 G+ KI + N 1780Sbjct: 314 GFFKIRRGTN 323 1781 1782 1783>F36D3.9 CE15973 cysteine protease (HINXTON) TR:O45466 1784 protein_id:CAB04322.1 1785 Length = 345 1786 1787 Score = 71.6 bits (174), Expect = 6e-13 1788 Identities = 63/235 (26%), Positives = 98/235 (40%), Gaps = 40/235 (17%) 1789 1790Query: 120 WREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLIS--LSEQNLVDCSHDQGNQG 177 1791 W + + ++ Q CGSCWAFS + + + + + +S +L+ C +G 1792Sbjct: 111 WPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEG 170 1793 1794Query: 178 CNGGLMDFAFQYIKENGGLDSEE-------SYPYEAKDG-------------SCK--YRA 215 1795 C+GG AFQ+ G + + YP + 
SC+ YR 1796Sbjct: 171 CDGGFPYRAFQWWARRGVVTGGDYLGTGCKPYPIRPCNSDNCVNLQTPPCRLSCQPGYRT 230 1797 1798Query: 216 EYAVANDTGF-----VDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCS 270 1799 Y ND + +P+ A+ + GP+ VA + + Y SGIY 1800Sbjct: 231 TYT--NDKNYGSNSAYPVPRTVAAIQADIYYNGPV-VAAFIVYEDFEKYKSGIYRHIAGR 287 1801 1802Query: 271 SKDLDHGVLVVGYGYE-GTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGL 324 1803 SK H V ++G+G E GT YWL NSWG +WG G +I + + CG+ 1804Sbjct: 288 SKG-GHAVKLIGWGTERGTP-----YWLAVNSWGSQWGESGTFRILRG-VDECGI 335 1805 1806 1807 Database: /data_2/jason/blastdb/wormpep62 1808 Posted date: Sep 3, 2001 2:17 PM 1809 Number of letters in database: 8,813,425 1810 Number of sequences in database: 20,085 1811 1812Lambda K H 1813 0.317 0.134 0.426 1814 1815Gapped 1816Lambda K H 1817 0.267 0.0410 0.140 1818 1819 1820Matrix: BLOSUM62 1821Gap Penalties: Existence: 11, Extension: 1 1822Number of Hits to DB: 6241552 1823Number of Sequences: 20085 1824Number of extensions: 276768 1825Number of successful extensions: 629 1826Number of sequences better than 1.0e-10: 20 1827Number of HSP's better than 0.0 without gapping: 4 1828Number of HSP's successfully gapped in prelim test: 16 1829Number of HSP's that attempted gapping in prelim test: 578 1830Number of HSP's gapped (non-prelim): 20 1831length of query: 334 1832length of database: 8,813,425 1833effective HSP length: 44 1834effective length of query: 290 1835effective length of database: 7,929,685 1836effective search space: 2299608650 1837effective search space used: 2299608650 1838T: 11 1839A: 40 1840X1: 16 ( 7.3 bits) 1841X2: 38 (14.6 bits) 1842X3: 64 (24.7 bits) 1843S1: 41 (21.6 bits) 1844S2: 156 (64.7 bits) 1845BLASTP 2.1.3 [Apr-11-2001] 1846 1847 1848Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 1849Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 1850"Gapped BLAST and PSI-BLAST: a new generation of protein database search 1851programs", Nucleic Acids Res. 25:3389-3402. 
1852 1853Query= PAPA_CARPA 1854 (345 letters) 1855 1856Database: /data_2/jason/blastdb/wormpep62 1857 20,085 sequences; 8,813,425 total letters 1858 1859Searching..................................................done 1860 1861 Score E 1862Sequences producing significant alignments: (bits) Value 1863 1864R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id... 174 7e-44 1865T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O4573... 171 5e-43 1866Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 pr... 160 8e-40 1867F41E6.6 CE10254 cysteine protease and a protease inhibitor... 156 2e-38 1868Y51A2D.8 CE19204 Cysteine proteases (2 domains) (HINXTON) ... 127 1e-29 1869 1870>R09F10.1 CE28755 peptidase (ST.LOUIS) TR:Q23030 protein_id:AAC69091.2 1871 Length = 383 1872 1873 Score = 174 bits (441), Expect = 7e-44 1874 Identities = 107/348 (30%), Positives = 173/348 (48%), Gaps = 18/348 (5%) 1875 1876Query: 7 ISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKI 66 1877 +++L + L + + LSF F + + +L Q+F ++LK ++ Y +++E 1878Sbjct: 45 LTQLFSGLVLLTMLILLSFFVFQRLNHKMENLKHE----QMFNDFILKFDRKYTSVEEFE 100 1879 1880Query: 67 YRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEV 126 1881 YR++IF N+ + ++N L +N F D +++E ++ + Y +E 1882Sbjct: 101 YRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQKMVQENKYTKYDFDTPKFEGS 160 1883 1884Query: 127 LNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQEL 186 1885 + V P +DWR++G +TP+KNQG CGSCWAF+ V ++E I+ G L SEQE+ 1886Sbjct: 161 YLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEM 220 1887 1888Query: 187 LDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQV 246 1889 +DCD R+ GC+GGY A++ V + G+ YPY ++ ++ D R + 1890Sbjct: 221 VDCDGRNNGCSGGYRPYAMKFVKENGLESEKEYPYSALKHDQCFLKENDTRVFIDDFRML 280 1891 1892Query: 247 QPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIF---VGPCGNKV--DHAVAAVGYGP 301 1893 E + PV+ + K YR GIF V C K HA+ +GYG 1894Sbjct: 281 SNNEEDIANWVGTKGPVTFGMNVV-KAMYSYRSGIFNPSVEDCTEKSMGAHALTIIGYGG 339 1895 1896Query: 302 
N----YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 345 1897 Y ++KNSWGT WG +GY R+ RG + CGL + P+ N 1898Sbjct: 340 EGESAYWIVKNSWGTSWGASGYFRLARGVNS----CGLANTVVAPIIN 383 1899 1900 1901>T03E6.7 CE16333 cathepsin-like protease (HINXTON) TR:O45734 1902 protein_id:CAB07275.1 1903 Length = 337 1904 1905 Score = 171 bits (434), Expect = 5e-43 1906 Identities = 107/319 (33%), Positives = 163/319 (50%), Gaps = 25/319 (7%) 1907 1908Query: 42 ERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNN----SYWLGLNVFA 97 1909 E I+ ++ + +K Y +E+ Y E F N+ +I+ N+ + ++ +GLN A 1910Sbjct: 26 ESAIEKWDDYKEDFDKEYSESEEQTY-MEAFVKNMIHIENHNRDHRLGRKTFEMGLNHIA 84 1911 1912Query: 98 DMSNDEFKEK--YTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 155 1913 D+ ++++ Y + S+ N V +P+ VDWR VT VKNQG C 1914Sbjct: 85 DLPFSQYRKLNGYRRLFGDSRIKNSSSFLAPFN---VQVPDEVDWRDTHLVTDVKNQGMC 141 1915 1916Query: 156 GSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR--SYGCNGGYPWSALQLVA-QYG 212 1917 GSCWAFSA +EG + G L SEQ L+DC + ++GCNGG A + + +G 1918Sbjct: 142 GSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHG 201 1919 1920Query: 213 IHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQ-PVSVVLEAAG 271 1921 + +YPY+G C +K A G +E L ++A Q P+S+ ++A 1922Sbjct: 202 VDTEESYPYKGRDMKCHFNKK-TVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGH 260 1923 1924Query: 272 KDFQLYRGGIFVGP--CGNKVDHAVAAVGYGP-----NYILIKNSWGTGWGENGYIRIKR 324 1925 + FQLY+ G++ ++DH V VGYG +Y ++KNSWG GWGE GYIRI R 1926Sbjct: 261 RSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIRIAR 320 1927 1928Query: 325 GTGNSYGVCGLYTSSFYPV 343 1929 N CG+ T + YP+ 1930Sbjct: 321 NRNNH---CGVATKASYPL 336 1931 1932 1933>Y40H7A.10 CE21821 Cysteine protease (HINXTON) TR:Q9XWA4 1934 protein_id:CAA22062.1 1935 Length = 343 1936 1937 Score = 160 bits (406), Expect = 8e-40 1938 Identities = 100/295 (33%), Positives = 153/295 (50%), Gaps = 15/295 (5%) 1939 1940Query: 48 FESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKN-NSYWLGLNVFADMSNDEFKE 106 1941 F+++++K+ + Y N E + RF IF NL ++ NK++ LN F+D++ +E+K 1942Sbjct: 51 
FQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWK- 109 1943 1944Query: 107 KYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGA---VTPVKNQGSCGSCWAFSA 163 1945 KY + +++ L + +++ N+P VDWR VT +K QG CGSCWAF+ 1946Sbjct: 110 KYLMTPKPDHSEKSLKPKTLIDKK--NLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFAT 167 1947 1948Query: 164 VVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEG 223 1949 IE + I G L S Q+LLDC S C GG P AL+ +GI + YPY 1950Sbjct: 168 AAAIESAVSISGGGLQSLSSQQLLDCTVVSDKCGGGEPVEALKYAQSHGITTAHNYPYYF 227 1951 1952Query: 224 VQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFV 283 1953 C RE P A+ + + +E A + ++ N P+ V A + Y GI 1954Sbjct: 228 WTTKC--RETVPTVARISSWMKAESEDEMAQIVAL-NGPMIVCANFATNKNRFYHSGIAE 284 1955 1956Query: 284 GP-CGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYT 337 1957 P CG + HA+ +GYGP+Y ++KN++ WGE GY+R+KR CG+ T 1958Sbjct: 285 DPDCGTEPTHALIVIGYGPDYWILKNTYSKVWGEKGYMRVKR----DVNWCGINT 335 1959 1960 1961>F41E6.6 CE10254 cysteine protease and a protease inhibitor (ST.LOUIS) 1962 TR:O16454 protein_id:AAB65956.1 1963 Length = 498 1964 1965 Score = 156 bits (395), Expect = 2e-38 1966 Identities = 110/327 (33%), Positives = 156/327 (47%), Gaps = 51/327 (15%) 1967 1968Query: 48 FESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWL-GLNVFADMSNDEFKE 106 1969 F ++ +H K Y N E + RF +FK N K I E K + G F+DM+ EFK+ 1970Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKK 233 1971 1972Query: 107 -----KYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAF 161 1973 ++ + ++ +N+ D +PE DWR+KGAVT VKNQG+CGSCWAF 1974Sbjct: 234 IMLPYQWEQPVYPMEQANFEKHDVTINEED--LPESFDWREKGAVTQVKNQGNCGSCWAF 291 1975 1976Query: 162 SAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSAL---------------- 205 1977 S +EG I L SEQEL+DCD GCNGG P +A 1978Sbjct: 292 STTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKIGKFVVSDNYCFLVF 351 1979 1980Query: 206 ------QLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGAL-LYSI 258 1981 +++ G+ + YPY+G C K A +G ++ P++E + + + 1982Sbjct: 352 
YHKTTKEIIRMGGLEPEDAYPYDGRGETCHLVRK-DIAVYINGSVEL-PHDEVEMQKWLV 409 1983 1984Query: 259 ANQPVSVVLEAAGKDFQLYRGG------IFVGPCGNKVDHAVAAVGYGPN----YILIKN 308 1985 P+S+ L A Q YR G IF P ++H V VGYG + Y ++KN 1986Sbjct: 410 TKGPISIGLNA--NTLQFYRHGVVHPFKIFCEPF--MLNHGVLIVGYGKDGRKPYWIVKN 465 1987 1988Query: 309 SWGTGWGENGYIRIKRGTGNSYGVCGL 335 1989 SWG WGE GY ++ RG VCG+ 1990Sbjct: 466 SWGPNWGEAGYFKLYRGK----NVCGV 488 1991 1992 1993>Y51A2D.8 CE19204 Cysteine proteases (2 domains) (HINXTON) TR:Q9XXQ7 1994 protein_id:CAA16407.1 1995 Length = 386 1996 1997 Score = 127 bits (318), Expect = 1e-29 1998 Identities = 95/332 (28%), Positives = 148/332 (43%), Gaps = 44/332 (13%) 1999 2000Query: 37 DLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYW----LG 92 2001 D E+L + FE + K+N+ YK+ E RF F + +D+ N K+ + G 2002Sbjct: 32 DRDHPEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFG 91 2003 2004Query: 93 LNVFADMSNDEFKEKYTGSIAGNYTTTE-LSYEEVLND---GDVN----------IPEYV 138 2005 +N F+D+S EF + + + N T L++++ D D+N P+Y 2006Sbjct: 92 INKFSDLSTAEFHGRLSNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRSTRYPDYF 151 2007 2008Query: 139 DWRQKGA-----VTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192 2009 D R + V P+K+QG C CW F+ +E + +G S+QE+ DC 2010Sbjct: 152 DLRNEKINGRYIVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEG 211 2011 2012Query: 193 SYGCNGGYPWSALQLVAQYGIHYRNTYPYE----GVQRYCRSREKGPYA-AKTDGVRQVQ 247 2013 + GC GG +Q V +YG+ YPY+ R CR RE A+ + 2014Sbjct: 212 TPGCKGGSLTLGVQYVKKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVIN 271 2015 2016Query: 248 PYNEGALLYSIANQ---PVSVVLEAAGKDFQLYRGGIFV-GPCGNKVD-HAVAAVGY--- 299 2017 P + + + PV+V + G F+ Y+ G+ + C HA A VGY 2018Sbjct: 272 PRRAEEQIIQVLTEWKVPVAVYFK-VGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTV 330 2019 2020Query: 300 ------GPNYILIKNSWGTGWGENGYIRIKRG 325 2021 +Y +IKNSWG W E+GY+R+ RG 2022Sbjct: 331 EDSRGRSHDYWIIKNSWGGDWAESGYVRVVRG 362 2023 2024 2025>R07E3.1 CE02295 cysteine proteinase (HINXTON) TR:Q21810 2026 protein_id:CAA89070.1 2027 Length = 402 2028 2029 Score = 114 bits (286), Expect = 
7e-26 2030 Identities = 90/309 (29%), Positives = 139/309 (44%), Gaps = 41/309 (13%) 2031 2032Query: 54 KHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNN--SYWLGLNVFADMSNDEFKEKYTGS 111 2033 K +K Y E + R + + + I N +N S G N +D +++EF++ 2034Sbjct: 96 KFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEEFEKTLLPK 155 2035 2036Query: 112 IAGNYTTTELSYEEVL--------NDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSA 163 2037 E + E + + P++ DWR K +TPVK QG CGSCWAF++ 2038Sbjct: 156 SFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFAS 215 2039 2040Query: 164 VVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEG 223 2041 T+E I G SEQ LLDCD C+GG A + + + G+ PY 2042Sbjct: 216 TATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNGLANAVDLPYVA 275 2043 2044Query: 224 VQR--------YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDF 274 2045 ++ + +R K Y D E +++ + N PV++ + A + 2046Sbjct: 276 HRQNGCAVNDHWNTTRIKAAYFLHHD---------EDSIINWLVNFGPVNIGM-AVIQPM 325 2047 2048Query: 275 QLYRGGIFVG---PCGNKVD--HAVAAVGYGPN-----YILIKNSWGTGWG-ENGYIRIK 323 2049 + Y+GG+F C N+V HA+ GYG + Y ++KNSWG WG E+GYI 2050Sbjct: 326 RAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFA 385 2051 2052Query: 324 RGTGNSYGV 332 2053 RG N+ G+ 2054Sbjct: 386 RGI-NACGI 393 2055 2056 2057>C50F4.3 CE05468 thiol protease (HINXTON) TR:Q18740 protein_id:CAA94738.1 2058 Length = 374 2059 2060 Score = 114 bits (286), Expect = 7e-26 2061 Identities = 97/357 (27%), Positives = 152/357 (42%), Gaps = 39/357 (10%) 2062 2063Query: 6 SISKLLFVAICLFVYMGLSFG-DFSIVGYSQN-DLTSTERLIQLFESWMLKHNKIYKNID 63 2064 S+ L F+ I +F G +F + N D + E+L + FE +++K+ + YK+ 2065Sbjct: 3 SLLALFFIQIFIFTVTSFDVGANFEDSFFEINIDRNNPEKLYKEFEDFIVKYKRNYKDEI 62 2066 2067Query: 64 EKIYRFEIFKDNLKYIDETNKK----NNSYWLGLNVFADMSNDEFKEKYT--GSIAGNYT 117 2068 EK +RF+ F + + NK + G+N F+D+S E Y+ G N 2069Sbjct: 63 EKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSKKEIHGMYSKFGPPKNNTN 122 2070 2071Query: 118 TTELSYEEVLNDGDVN-IPEYVDWRQKGA-----VTPVKNQGSCGSCWAFSAVVTIEGII 171 2072 + + + + + +P+ D R K + P+K Q SC CW F+A E + 
2073Sbjct: 123 VPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVAEAAL 182 2074 2075Query: 172 KIRTGNLNEYSEQELLDC-DRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQR---- 226 2076 + SEQE+ DC + GCNGG P L+ + + G+ YP+ V R 2077Sbjct: 183 TVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGLEYIKEMGLTGGKEYPF-NVNRSTQL 241 2078 2079Query: 227 -YCRS----REKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGI 281 2080 C S RE P + + + N P+SV G Y GI 2081Sbjct: 242 GRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLNLPISVAFR-TGASLSSYLSGI 300 2082 2083Query: 282 F-VGPCGNKVD---HAVAAVGYGP---------NYILIKNSWGTGWGENGYIRIKRG 325 2084 + C ++ H+ A VGYG +Y + +NSW T WG++GY RI RG 2085Sbjct: 301 LELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGDDGYARIVRG 357 2086 2087 2088>F15D4.4 CE28917 cysteine protease (HINXTON) TR:Q93512 2089 protein_id:CAB02487.1 2090 Length = 622 2091 2092 Score = 110 bits (276), Expect = 1e-24 2093 Identities = 84/290 (28%), Positives = 127/290 (42%), Gaps = 34/290 (11%) 2094 2095Query: 64 EKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIA------GNYT 117 2096 E + RF ++ K +DE N Y LG++ + MS ++F G +A T 2097Sbjct: 150 EGLKRFNVYSKVKKEVDE---HNIMYELGMSSYK-MSTNQFSVALDGEVAPLTLNLDALT 205 2098 2099Query: 118 TTELSYEEVLNDGDVNIPE-YVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTG 176 2100 T ++ E VDWR + P+ +Q +CG CWAFS + IE I+ 2101Sbjct: 206 PTATVIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGY 263 2102 2103Query: 177 NLNEYSEQELLDCDRR--------SYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYC 228 2104 N + S Q+LL CD + + GC GGY A + + P++ C 2105Sbjct: 264 NTSSLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYLEVSAARDASLIPFDLEDTSC 323 2106 2107Query: 229 RSREKGP-----------YAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLY 277 2108 S P Y + Q+ + + + P++V + AAG D Y 2109Sbjct: 324 DSSFFPPVVPTILLFDDGYISGNFTAAQLITMEQN-IEDKVRKGPIAVGM-AAGPDIYKY 381 2110 2111Query: 278 RGGIFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTG 327 2112 G++ G CG ++HAV VG+ +Y +I+NSWG WGE GY R+KR G 2113Sbjct: 382 SEGVYDGDCGTIINHAVVIVGFTDDYWIIRNSWGASWGEAGYFRVKRTPG 431 2114 2115 2116>Y113G7B.15 CE23295 
(HINXTON) TR:Q9U2X1 protein_id:CAB54334.1 2117 Length = 328 2118 2119 Score = 103 bits (256), Expect = 2e-22 2120 Identities = 89/312 (28%), Positives = 132/312 (41%), Gaps = 40/312 (12%) 2121 2122Query: 53 LKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK----NNSYWLGLNVFADMSNDEFKEKY 108 2123 + H K Y+ EK R F N + I E N K + G N FAD + E + 2124Sbjct: 1 MHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKNRQELSARN 60 2125 2126Query: 109 TGSIAGNYTTTELSYEEVLNDGDVN------------IPEYVDWRQ-----KGAVTPVKN 151 2127 + N+T + Y+ G N IP+Y D R V PVK+ 2128Sbjct: 61 SKIHPKNHTDLPI-YKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKD 119 2129 2130Query: 152 QGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDC--DRRSYGCNGGYPWSALQLVA 209 2131 Q CG CWAF+ E + + + S+QE+ DC + GC GG P + L++V 2132Sbjct: 120 QEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLKMVH 179 2133 2134Query: 210 QYGIHYRNTYPYE----GVQRYCRSREKGP--YAAKTDGVRQVQPYNEGALLYSI-ANQP 262 2135 G YPYE C EK + R Q Y E ++ ++ N 2136Sbjct: 180 LRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHI 239 2137 2138Query: 263 VSVVLEAAGKDFQLYRGGIFVGPCGNKVD----HAVAAVGYGPN-----YILIKNSWGTG 313 2139 + V G++F+ Y G+ ++ H+VA VGYG + Y L++NSW + 2140Sbjct: 240 PTAVYFRVGENFEWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSD 299 2141 2142Query: 314 WGENGYIRIKRG 325 2143 WG +GY++I+RG 2144Sbjct: 300 WGLHGYVKIRRG 311 2145 2146 2147>Y71H2AR.2 CE22930 (ST.LOUIS) protein_id:AAK29985.1 2148 Length = 345 2149 2150 Score = 93.6 bits (231), Expect = 2e-19 2151 Identities = 65/213 (30%), Positives = 106/213 (49%), Gaps = 29/213 (13%) 2152 2153Query: 131 DVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGI-IKIRTGNLNEYSEQELLDC 189 2154 D E++DWR+KG V PVK+QG C + AF+ +IE + K G L +SEQ+L+DC 2155Sbjct: 79 DRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDC 138 2156 2157Query: 190 DRRSY-GCNGGYPWSALQLVAQYGIHYRNTYPY-EGVQRYC-----RSR---EKGPYAAK 239 2158 + + Y GC + +A+ +A +GI YPY + C +S+ +KG A 2159Sbjct: 139 NDQGYKGCEEQFAMNAIGYLATHGIETEADYPYVDKTNEKCTFDSTKSKIHLKKGVVAEG 198 2160 2161Query: 240 
TDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIF---VGPCGNKVD-HAVA 295 2162 + + +V N G +++ P Y+ GI+ + C + + ++ 2163Sbjct: 199 NEVLGKVYVTNYGPAFFTMRAPP----------SLYDYKIGIYNPSIEECTSTHEIRSMV 248 2164 2165Query: 296 AVGYG----PNYILIKNSWGTGWGENGYIRIKR 324 2166 VGYG Y ++K S+GT WGE GY+++ R 2167Sbjct: 249 IVGYGIEGEQKYWIVKGSFGTSWGEQGYMKLAR 281 2168 2169 2170>K02E7.10 CE11640 protease (ST.LOUIS) TR:O17255 protein_id:AAB71030.1 2171 Length = 299 2172 2173 Score = 93.6 bits (231), Expect = 2e-19 2174 Identities = 64/219 (29%), Positives = 104/219 (47%), Gaps = 15/219 (6%) 2175 2176Query: 136 EYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGI-IKIRTGNLNEYSEQELLDCDRRSY 194 2177 +++DWR+KG V PVK+QG C + +AF+A+ IE + K G L +SEQ+++DC + 2178Sbjct: 82 DFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTN 141 2179 2180Query: 195 GCNGGYP-WSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGA 253 2181 C + + + + G+ YPY G + + V P E A 2182Sbjct: 142 PCQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLRPTYIDVYPNEEWA 201 2183 2184Query: 254 LLYSIANQPVSVVLEAAGKDFQLYRGGIF---VGPCGNKVD-HAVAAVGYGPN----YIL 305 2185 + I + F Y+ GI+ CGN + ++A VGYG + Y + 2186Sbjct: 202 RAH-ITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAEKYWI 260 2187 2188Query: 306 IKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344 2189 +K S+GT WGE+GY+++ R + CG+ S P+K 2190Sbjct: 261 VKGSFGTSWGEHGYMKLAR----NVNACGMAESISIPIK 295 2191 2192 2193>F57F5.1 CE05999 cysteine protease (HINXTON) TR:Q20950 2194 protein_id:CAB00098.1 2195 Length = 400 2196 2197 Score = 91.3 bits (225), Expect = 8e-19 2198 Identities = 84/315 (26%), Positives = 127/315 (39%), Gaps = 72/315 (22%) 2199 2200Query: 79 IDETNKKNNSYWLGLNVFADMSNDEFKEKYTGS----IAGNYTTTELSYEEVLNDGDVNI 134 2201 +D NK S+ L + D K++ G+ I Y E+++ EV D + 2202Sbjct: 90 VDYVNKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEV---EDAAV 146 2203 2204Query: 135 PEYVD----WRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTG--NLNEYSEQELLD 188 2205 P+ D W +++ +++Q SCGSCWA SA TI I I + + S ++ 2206Sbjct: 147 
PDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDINA 206 2207 2208Query: 189 CDRR--SYGCNGGYPWSALQLVAQ---------------------------YGIHYR--- 216 2209 C GCNGGYP A + + G HY+ 2210Sbjct: 207 CCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCP 266 2211 2212Query: 217 -NTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLE------- 268 2213 N YP + +R C++ Y Q + G Y+++ + + E 2214Sbjct: 267 SNMYPTDKCERSCQAGYALTYQ---------QDLHFGQSAYAVSKKAAEIQKEIMTHGPV 317 2215 2216Query: 269 ----AAGKDFQLYRGGIFVGPCGNKV-DHAVAAVGYGPN----YILIKNSWGTGWGENGY 319 2217 +DF+ Y GG++V G + HAV +G+G + Y L NSW WGENGY 2218Sbjct: 318 EVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCANSWNEDWGENGY 377 2219 2220Query: 320 IRIKRGTGNSYGVCG 334 2221 RI RG N G+ G 2222Sbjct: 378 FRIIRGV-NECGIEG 391 2223 2224 2225>T10H4.12 CE27590 locus:cpr-3 protease (HINXTON) TR:Q9TW93 2226 protein_id:CAB61024.2 2227 Length = 370 2228 2229 Score = 85.9 bits (211), Expect = 3e-17 2230 Identities = 67/233 (28%), Positives = 104/233 (43%), Gaps = 42/233 (18%) 2231 2232Query: 134 IPEYVDWRQK----GAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNE--YSEQELL 187 2233 +P+ D R+K + ++NQ +CGSCWAF A I + I++ + S +++L 2234Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151 2235 2236Query: 188 DCDRRS--YGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRY----------------CR 229 2237 C + YGC GGY AL+ A G Y G Y C+ 2238Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTTPSCK 211 2239 2240Query: 230 SREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVV--------LEAAGK---DFQLYR 278 2241 + + Y KT+ ++ + Y A + + +EA+ K DF Y+ 2242Sbjct: 212 TTCQSSY--KTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYK 269 2243 2244Query: 279 GGIFVGPCGNKVD-HAVAAVGYGP----NYILIKNSWGTGWGENGYIRIKRGT 326 2245 G++ G V HAV +G+G +Y LI NSWGT +GE G+ +I+RGT 2246Sbjct: 270 SGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGT 322 2247 2248 2249>C52E4.1 CE08943 locus:cpr-1 cathepsin-like cysteine protease (HINXTON) 2250 TR:Q18783 protein_id:CAB01410.1 2251 Length = 340 2252 2253 Score = 85.5 bits 
(210), Expect = 4e-17 2254 Identities = 73/252 (28%), Positives = 102/252 (39%), Gaps = 36/252 (14%) 2255 2256Query: 107 KYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVT 166 2257 KY + + TE E VL W + ++ +++Q +CGSCWAF A 2258Sbjct: 75 KYAAAHSDEIRATE--QEVVLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEM 132 2259 2260Query: 167 IEGIIKIRTGNLNE--YSEQELLDCDRRSYG--CNGGYPWSALQLVAQYGIHYRNTYPYE 222 2261 I I T + S +LL C S G C GGYP AL+ G+ Y 2262Sbjct: 133 ISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA 192 2263 2264Query: 223 GVQRY---------------------CRSREKGPYAA-KTDGVRQVQ-PYNEGALLYSI- 258 2265 G + Y C+S YA K GV P N ++ I 2266Sbjct: 193 GCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIY 252 2267 2268Query: 259 ANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD-HAVAAVGYGPN----YILIKNSWGTG 313 2269 AN PV +DF Y+ G++ G + HA+ +G+G Y L+ NSWG 2270Sbjct: 253 ANGPVEAAFSVY-EDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVANSWGVN 311 2271 2272Query: 314 WGENGYIRIKRG 325 2273 WGE+G+ +I RG 2274Sbjct: 312 WGESGFFKIYRG 323 2275 2276 2277>F26E4.3 CE17714 cysteine protease (HINXTON) TR:P90850 2278 protein_id:CAB03007.1 2279 Length = 491 2280 2281 Score = 80.5 bits (197), Expect = 1e-15 2282 Identities = 66/233 (28%), Positives = 98/233 (41%), Gaps = 42/233 (18%) 2283 2284Query: 134 IPEYVDWRQKGA--VTPVKNQGSCGSCWAFSAV-VTIEGIIKIRTGNLNE-YSEQELLDC 189 2285 +PE+ D R K + PV +QG CGS W+ S ++ + + I G +N S Q+LL C 2286Sbjct: 223 LPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSC 282 2287 2288Query: 190 DR-RSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRY-------------------CR 229 2289 ++ R GC GGY A + + G+ + YPY Q C 2290Sbjct: 283 NQHRQKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTNRQGLRCP 342 2291 2292Query: 230 SREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFV------ 283 2293 S + A K +V E + N PV +DF +Y GG++ 2294Sbjct: 343 SGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATF-VVHEDFFMYAGGVYQHSDLAA 401 2295 2296Query: 284 ---GPCGNKVDHAVAAVGYGPN--------YILIKNSWGTGWGENGYIRIKRG 325 2297 + H+V +G+G + Y L NSWGT WGE+GY ++ RG 
2298Sbjct: 402 QKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG 454 2299 2300 2301>W07B8.4 CE14680 thiol protease (ST.LOUIS) TR:O16288 protein_id:AAB65345.1 2302 Length = 335 2303 2304 Score = 78.2 bits (191), Expect = 7e-15 2305 Identities = 71/260 (27%), Positives = 108/260 (41%), Gaps = 60/260 (23%) 2306 2307Query: 133 NIPEYVD----WRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRT-GNLNE-YSEQEL 186 2308 +IP+ D W Q +V +++Q CGSCWA +A I I + G++N S +++ 2309Sbjct: 72 SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131 2310 2311Query: 187 LDCDRRSY----GCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDG 242 2312 L C + GC GGYP A + + G+ ++ Q C+ P DG 2313Sbjct: 132 LTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFE---SQYGCKPYSIAPCGETIDG 188 2314 2315Query: 243 VRQVQ-----------------------PYNE----GALLYSIANQPVSVVLEAAG---- 271 2316 V + PY++ GA Y+I + E 2317Sbjct: 189 VTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPV 248 2318 2319Query: 272 -------KDFQLYRGGIFVGPCGNKV-DHAVAAVGYGPN----YILIKNSWGTGWGENGY 319 2320 +DF LY+ GI+ G ++ HAV +G+G + Y L NSW T WGE GY 2321Sbjct: 249 EVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGY 308 2322 2323Query: 320 IRIKRGTGNSYGVCGLYTSS 339 2324 RI RG CG+ +++ 2325Sbjct: 309 FRILRGVDE----CGIESAA 324 2326 2327 2328>C32B5.7 CE08515 cathepsin-like peptidase (ST.LOUIS) TR:P91111 2329 protein_id:AAB37963.1 2330 Length = 250 2331 2332 Score = 73.9 bits (180), Expect = 1e-13 2333 Identities = 57/189 (30%), Positives = 87/189 (45%), Gaps = 22/189 (11%) 2334 2335Query: 137 YVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGC 196 2336 ++DWR +G V PVK+QG+C + +AF+A+ IE + I G L +SEQ+++DC GC 2337Sbjct: 71 FLDWRDEGVVGPVKDQGNCNASYAFAAISAIESMYAIANGQLLSFSEQQIIDC---LGGC 127 2338 2339Query: 197 N-GGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPY---NEG 252 2340 P A+ + + GI YP+ G + EK Y +K + Y +E 2341Sbjct: 128 AIESDPMMAMTYLERKGIETYTDYPFVG-----KKNEKCEYDSKKAYLILDDTYDMSDES 182 2342 2343Query: 253 ALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPC-----GNKVDHAVAAVGY----GPNY 
303 2344 L I + + F Y+ GI+ P A+ VGY G NY 2345Sbjct: 183 LALVFIDERGPGLFTMNTPPSFFNYKSGIY-NPTEEECKSTNEKRALTIVGYGNDKGQNY 241 2346 2347Query: 304 ILIKNSWGT 312 2348 ++K S+GT 2349Sbjct: 242 WIVKGSFGT 250 2350 2351 2352>Y71H2AM.3 CE26272 (ST.LOUIS) protein_id:AAK29976.1 2353 Length = 716 2354 2355 Score = 71.2 bits (173), Expect = 9e-13 2356 Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 9/116 (7%) 2357 2358Query: 136 EYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGI-IKIRTGNLNEYSEQELLDCDRRSY 194 2359 E++DWR KG V PVK+QG C + AF+ +IE + K G+L +SEQ+L+DCD + 2360Sbjct: 84 EFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDHGF 143 2361 2362Query: 195 -GCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPY 249 2363 GC +A+ +GI YPY G +E G + +T G V Y 2364Sbjct: 144 KGCEEQPAINAVSYFIFHGIETEADYPYAG-------KENGKLSNETQGKELVTNY 192 2365 2366 2367>F32B5.8 CE09855 cysteine proteinase (ST.LOUIS) TR:O01850 2368 protein_id:AAB54210.1 2369 Length = 427 2370 2371 Score = 69.7 bits (169), Expect = 2e-12 2372 Identities = 58/217 (26%), Positives = 91/217 (41%), Gaps = 28/217 (12%) 2373 2374Query: 133 NIPEYVDWRQKGAVTPV---KNQGS---CGSCWAFSAVVTIEGIIKIRTGNL---NEYSE 183 2375 ++P+ DWR + +NQ CGSCWAF A + I I+ N S 2376Sbjct: 185 DLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSV 244 2377 2378Query: 184 QELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYC----RSREKGP---Y 236 2379 QE++DC GG P + ++GI + Y+ C R P + 2380Sbjct: 245 QEVIDCSGAGTCVMGGEPGGVYKYAHEHGIPHETCNNYQARDGKCDPYNRCGSCWPGECF 304 2381 2382Query: 237 AAKTDGVRQVQPYN-----EGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD 291 2383 + K + +V Y E P++ + AA K F+ Y GGI+ +D 2384Sbjct: 305 SIKNYTLYKVSEYGTVHGYEKMKAEIYHKGPIACGI-AATKAFETYAGGIYKEVTDEDID 363 2385 2386Query: 292 HAVAAVGYGPN------YILIKNSWGTGWGENGYIRI 322 2387 H ++ G+G + Y + +NSWG WGE+G+ +I 2388Sbjct: 364 HIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKI 400 2389 2390 2391 Database: /data_2/jason/blastdb/wormpep62 2392 Posted date: Sep 3, 2001 2:17 PM 2393 Number of letters in database: 8,813,425 2394 Number of 
sequences in database: 20,085

Lambda     K      H
   0.318    0.138    0.428

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6611257
Number of Sequences: 20085
Number of extensions: 311359
Number of successful extensions: 788
Number of sequences better than 1.0e-10: 19
Number of HSP's better than 0.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 741
Number of HSP's gapped (non-prelim): 23
length of query: 345
length of database: 8,813,425
effective HSP length: 44
effective length of query: 301
effective length of database: 7,929,685
effective search space: 2386835185
effective search space used: 2386835185
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 156 (64.7 bits)