1BLASTP 2.0.14 [Jun-29-2000] 2 3 4Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 5Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 6"Gapped BLAST and PSI-BLAST: a new generation of protein database search 7programs", Nucleic Acids Res. 25:3389-3402. 8 9Query= CYS1_DICDI 10 (351 letters) 11 12Database: /home/peter/blast/data/swissprot 13 88,780 sequences; 31,984,247 total letters 14 15Searching...................................................................................................................................................... 163 occurrence(s) of pattern in query 17 CYS1_DICDI; PATTERN. 18 pattern P-E-E-Q at position 23 of query sequence 19effective database length=3.2e+07 20 pattern probability=8.9e-06 21lengthXprobability=2.8e+02 22 23Number of occurrences of pattern in the database is 349 24 CYS1_DICDI; PATTERN. 25 pattern P-E-E-Q at position 120 of query sequence 26effective database length=3.2e+07 27 pattern probability=8.9e-06 28lengthXprobability=2.8e+02 29 30Number of occurrences of pattern in the database is 349 31 CYS1_DICDI; PATTERN. 32 pattern P-E-E-Q at position 237 of query sequence 33effective database length=3.2e+07 34 pattern probability=8.9e-06 35lengthXprobability=2.8e+02 36 37Number of occurrences of pattern in the database is 349 38done 39 40 41Results from round 1 42 43 Score E 44 (bits) Value 45 46Significant matches for pattern occurrence 1 at position 23 47 48 49sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 688 0.0 50sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE 8 4.8 51sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST... 7 6.0 52sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 7 7.6 53sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7... 7 9.6 54 55 56Significant matches for pattern occurrence 2 at position 120 57 58 59sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT 13 0.13 60sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT... 11 0.43 61sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) 11 0.55 62sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNI... 10 1.1 63sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 I... 8 3.0 64sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURS... 7 6.0 65sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 7 7.6 66sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] 7 7.6 67 68 69Significant matches for pattern occurrence 3 at position 237 70 71 72sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, ... 9 1.4 73sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, ... 9 1.4 74sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 8 4.8 75sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROT... 7 6.0 76sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 7 9.6 77 78 79Significant alignments for pattern occurrence 1 at position 23 80 81>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 82 Length = 343 83 84 Score = 688 bits (1789), Expect = 0.0 85 Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%) 86 87Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 88pattern 23 **** 89 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 90Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 91 92Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120 93pattern 120 * 94 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 95Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119 96 97Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 98pattern 121 *** 99 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 100Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 101 102Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 103pattern 237 **** 104 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 105Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232 106 107Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300 108 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 109Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292 110 111Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 112 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 113Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 114 115 116>sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE 117 Length = 4969 118 119 Score = 7.8 bits (25), Expect = 4.8 120 Identities = 14/39 (35%), Positives = 19/39 (47%) 121 122Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 123pattern 23 **** 124 PEEQ +F E + K +K EE E + G+ EE 125Sbjct: 4414 PEEQEKFQEQKTKEEEKEEKEETKSEPEKAEGEDGEKEE 4452 126 127 128>sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST CLASS-ALPHA) 129 Length = 221 130 131 Score = 7.4 bits (24), Expect = 6.0 132 Identities = 19/67 (28%), Positives = 35/67 (51%), Gaps = 12/67 (17%) 133 134Query: 21 IPPEEQ-SQFLEFQDKFNKKY---------SH-EEYLERFEIFKSNLGKIEEL-NLIAIN 68 135pattern 23 **** 136 +PPEEQ ++ + +DK +Y SH ++YL ++ K+++ +E L N+ +N 137Sbjct: 112 LPPEEQEAKLAQIKDKAKNRYFPAFEKVLKSHGQDYLVGNKLSKADILLVELLYNVEELN 171 138 139Query: 69 HKADTKF 75 140 A F 141Sbjct: 172 PGATASF 178 142 143 144>sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 145 Length = 356 146 147 Score = 7.1 bits (23), Expect = 7.6 148 Identities = 14/67 (20%), Positives = 32/67 (46%), Gaps = 5/67 (7%) 149 150Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGK---IEELNLIAINHKADTKFGVNK 79 151pattern 23 **** 152 PEEQ++ ++D+ N + ++Y + + L K +LN + ++A ++ + 153Sbjct: 75 PEEQAK--TYKDEGNDYFKEKDYKKAVISYTEGLKKKCADPDLNAVLYTNRAAAQYYLGN 132 154 155Query: 80 FADLSSD 86 156 F +D 157Sbjct: 133 FRSALND 139 158 159 160>sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7 INTERGENIC REGION 161 Length = 725 162 163 Score = 6.8 bits (22), Expect = 9.6 164 Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 21/99 (21%) 165 166Query: 21 IPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 78 167pattern 23 **** 168 + PEEQ L+F ++ H ER + +++G +N + + G+ 169Sbjct: 213 LTPEEQKDKDLLQFAEQI-----HSMRTER--LSGAHIGNSPAIN------RLRGELGLQ 259 170 171Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 172 DL +E ++ + +DD+ ++ DEF++S 173Sbjct: 260 AMEDLPEEEITDH------KVLSDDIDLSQATIDEFVHS 292 174 175 176 177Significant alignments for pattern occurrence 2 at position 120 178 179>sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT 180 Length = 555 181 182 Score = 13.0 bits (40), Expect = 0.13 183 Identities = 16/28 (57%), Positives = 18/28 (64%), Gaps = 3/28 (10%) 184 185Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 186pattern 120 **** 187 IFT D +AD LDD F IN + PEEQ 188Sbjct: 170 IFTGDDELADELDDRFVIDINKLFPEEQ 197 189 190 191>sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT (MCR I ALPHA) 192 Length = 553 193 194 Score = 11.2 bits (35), Expect = 0.43 195 Identities = 14/28 (50%), Positives = 18/28 (64%), Gaps = 3/28 (10%) 196 197Query: 99 IFTDDLPVADYLDDEFINSIP---PEEQ 123 198pattern 120 **** 199 I T DL +AD +DD+F+ I PEEQ 200Sbjct: 168 IITGDLELADEIDDKFLIDIEKLFPEEQ 195 201 202 203>sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) 204 Length = 101 205 206 Score = 10.9 bits (34), Expect = 0.55 207 Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 1/23 (4%) 208 209Query: 114 FINSIPPEEQTAF-DWRTRGAVT 135 210pattern 120 **** 211 F S+ PEEQ AF +W+TR +T 212Sbjct: 78 FGKSLTPEEQRAFEEWKTRYGIT 100 213 214 215>sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNIT (MCR II ALPHA) 216 Length = 553 217 218 Score = 9.8 bits (31), Expect = 1.1 219 Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 3/28 (10%) 220 221Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123 222pattern 120 **** 223 IFT D +AD +D F IN + PEEQ 224Sbjct: 168 IFTGDDELADEIDKRFLIDINKLFPEEQ 195 225 226 227>sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 INTERGENIC REGION 228 Length = 462 229 230 Score = 8.5 bits (27), Expect = 3.0 231 Identities = 13/39 (33%), Positives = 21/39 (53%), Gaps = 9/39 (23%) 232 233Query: 112 DEFINSIP-------PEEQT--AFDWRTRGAVTPVKNQG 141 234pattern 120 **** 235 DEF+N+ P PEEQ+ A++W + + + N G 236Sbjct: 308 DEFLNTSPSPEVFTLPEEQSGMAWEWHDKDWMLDLTNDG 346 237 238 239>sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURSOR (MAGP) (MAGP-1) 240 Length = 183 241 242 Score = 7.4 bits (24), Expect = 6.0 243 Identities = 11/37 (29%), Positives = 18/37 (47%) 244 245Query: 100 FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTP 136 246pattern 120 **** 247 + D + ADY D + ++ PEEQ + + V P 248Sbjct: 37 YGDQIDNADYYDYQEVSPRTPEEQFQSQQQVQQEVIP 73 249 250 251>sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 252 Length = 199 253 254 Score = 7.1 bits (23), Expect = 7.6 255 Identities = 11/27 (40%), Positives = 15/27 (54%), Gaps = 1/27 (3%) 256 257Query: 105 PVADYLDDE-FINSIPPEEQTAFDWRT 130 258pattern 120 **** 259 PV+ Y DE + + PEEQ D+ T 260Sbjct: 171 PVSSYSSDEGSYDPLSPEEQELLDFTT 197 261 262 263>sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] 264 Length = 812 265 266 Score = 7.1 bits (23), Expect = 7.6 267 Identities = 8/13 (61%), Positives = 11/13 (84%) 268 269Query: 112 DEFINSIPPEEQT 124 270pattern 120 **** 271 D+ +S+PPEEQT 272Sbjct: 359 DQSDSSVPPEEQT 371 273 274 275 276Significant alignments for pattern occurrence 3 at position 237 277 278>sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, MITOCHONDRIAL PRECURSOR 279 (GLYCINE DECARBOXYLASE B) (GLYCINE CLEAVAGE SYSTEM 280 P-PROTEIN B) 281 Length = 1034 282 283 Score = 9.5 bits (30), Expect = 1.4 284 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) 285 286Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 287pattern 237 **** 288 NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + 289Sbjct: 80 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 290 291Query: 291 SLDHGILIVGYSAKNTIFR 309 292 D ++KN IF+ 293Sbjct: 134 MQD-------LASKNKIFK 145 294 295 296>sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, MITOCHONDRIAL PRECURSOR 297 (GLYCINE DECARBOXYLASE A) (GLYCINE CLEAVAGE SYSTEM 298 P-PROTEIN A) 299 Length = 1037 300 301 Score = 9.5 bits (30), Expect = 1.4 302 Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%) 303 304Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 305pattern 237 **** 306 NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + + 307Sbjct: 83 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 136 308 309Query: 291 SLDHGILIVGYSAKNTIFR 309 310 D ++KN IF+ 311Sbjct: 137 MQD-------LASKNKIFK 148 312 313 314>sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR 315 (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM 316 P-PROTEIN) 317 Length = 1034 318 319 Score = 7.8 bits (25), Expect = 4.8 320 Identities = 21/79 (26%), Positives = 38/79 (47%), Gaps = 13/79 (16%) 321 322Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 323pattern 237 **** 324 NSA PEEQ K++ F +++ I +T P AI D++++ + G+ + + 325Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKAIRLDSMKYSKFDEGLTESQMIAH 133 326 327Query: 291 SLDHGILIVGYSAKNTIFR 309 328 D ++KN IF+ 329Sbjct: 134 MQD-------LASKNKIFK 145 330 331 332>sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROTEIN 6 333 Length = 1081 334 335 Score = 7.4 bits (24), Expect = 6.0 336 Identities = 25/93 (26%), Positives = 37/93 (38%), Gaps = 17/93 (18%) 337 338Query: 159 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI-IKNGGIQTESS 217 339 +F S+N+ +S L E M + E C L P ++I N I +S+ 340Sbjct: 642 NFTSKNEQEKISNDKL-----EVMVIKTVSTLCETCREELTPYLMHFISFLNTVIMPDSN 696 341 342Query: 218 YPYTAETG--------TQCNFNSANIGPEEQAK 242 343pattern 237 **** 344 + T QC ++ GPEEQAK 345Sbjct: 697 VSHFTRTKLVRSIGYVVQCQVSN---GPEEQAK 726 346 347 348>sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR 349 (GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM 350 P-PROTEIN) 351 Length = 1034 352 353 Score = 6.8 bits (22), Expect = 9.6 354 Identities = 20/79 (25%), Positives = 38/79 (47%), Gaps = 13/79 (16%) 355 356Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 357pattern 237 **** 358 NSA PEEQ K++ F +++ I +T P +I D++++ + G+ + + 359Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133 360 361Query: 291 SLDHGILIVGYSAKNTIFR 309 362 D ++KN IF+ 363Sbjct: 134 MQD-------LASKNKIFK 145 364 365 366Searching..................................................done 367 368 369Results from round 2 370 371 372 Score E 373Sequences producing significant alignments: (bits) Value 374Sequences used in model and found again: 375 376Sequences not found previously or not previously below threshold: 377 378sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 709 0.0 379sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 273 4e-73 380sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RES... 270 2e-72 381sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 266 6e-71 382sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 252 6e-67 383sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK C... 250 2e-66 384sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 238 1e-62 385sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 236 4e-62 386sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 233 3e-61 387sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE... 233 3e-61 388sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 231 1e-60 389sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 221 1e-57 390sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEIN... 221 2e-57 391sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 216 5e-56 392sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 215 1e-55 393sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 214 2e-55 394sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 214 2e-55 395sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN... 212 7e-55 396sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 212 1e-54 397sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 209 8e-54 398sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 209 8e-54 399sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 208 1e-53 400sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 207 2e-53 401sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE... 207 3e-53 402sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 206 4e-53 403sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 206 4e-53 404sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 206 5e-53 405sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 204 3e-52 406sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 203 6e-52 407sp|Q10991|CATL_SHEEP CATHEPSIN L 201 1e-51 408sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 201 2e-51 409sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 200 3e-51 410sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 199 7e-51 411sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 196 5e-50 412sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 196 5e-50 413sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 194 2e-49 414sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 193 4e-49 415sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 193 5e-49 416sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II... 192 1e-48 417sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPS... 192 1e-48 418sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 190 5e-48 419sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 188 2e-47 420sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA... 187 2e-47 421sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 187 2e-47 422sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 187 4e-47 423sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 186 5e-47 424sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 185 9e-47 425sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEP... 185 1e-46 426sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPA... 184 3e-46 427sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 183 3e-46 428sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 183 5e-46 429sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 183 6e-46 430sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 182 8e-46 431sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHE... 180 5e-45 432sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 178 2e-44 433sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 177 3e-44 434sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 176 6e-44 435sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 173 4e-43 436sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 173 7e-43 437sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 171 3e-42 438sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 167 2e-41 439sp|P25326|CATS_BOVIN CATHEPSIN S 165 1e-40 440sp|P80884|ANAN_ANACO ANANAIN 161 2e-39 441sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 158 1e-38 442sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE... 158 2e-38 443sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 152 1e-36 444sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 150 4e-36 445sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 150 6e-36 446sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 150 6e-36 447sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE P... 149 9e-36 448sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 149 9e-36 449sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 145 1e-34 450sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 145 1e-34 451sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 143 5e-34 452sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR ... 141 3e-33 453sp|P14518|BROM_ANACO BROMELAIN, STEM 139 6e-33 454sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR... 138 1e-32 455sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 129 1e-29 456sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR... 121 3e-27 457sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPP... 111 3e-24 458sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 109 9e-24 459sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN... 108 2e-23 460sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 108 3e-23 461sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 107 3e-23 462sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 100 7e-21 463sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 95 2e-19 464sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PREC... 91 4e-18 465sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PREC... 90 5e-18 466sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 90 5e-18 467sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 89 2e-17 468sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 87 4e-17 469sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 87 5e-17 470sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP S... 86 9e-17 471sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 2e-16 472sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 2e-16 473sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PREC... 85 2e-16 474sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 3e-16 475sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 3e-16 476sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 80 9e-15 477sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PREC... 78 2e-14 478sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 78 4e-14 479sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PREC... 73 7e-13 480sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P1... 70 6e-12 481sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 61 4e-09 482sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 60 9e-09 483sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 59 1e-08 484sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 58 3e-08 485sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 56 1e-07 486sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 52 2e-06 487sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 42 0.002 488sp|P05689|CATX_BOVIN CATHEPSIN 40 0.006 489sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 39 0.019 490sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (G... 36 0.16 491sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTEC... 35 0.22 492sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 I... 32 1.9 493sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) 32 1.9 494sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-D... 31 3.2 495sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2 31 4.2 496sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN 31 4.2 497sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDRO... 30 5.5 498sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 30 5.5 499sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 30 7.2 500sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 30 7.2 501sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4 502sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (... 30 9.4 503sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4 504 505>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 506 Length = 343 507 508 Score = 709 bits (1811), Expect = 0.0 509 Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%) 510 511Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 512 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 513Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 514 515Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120 516 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 517Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119 518 519Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 520 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 521Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 522 523Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 524pattern 237 **** 525 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 526Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232 527 528Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300 529 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 530Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292 531 532Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 533 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 534Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 535 536 537>sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 538 Length = 313 539 540 Score = 273 bits (691), Expect = 4e-73 541 Identities = 149/324 (45%), Positives = 194/324 (58%), Gaps = 26/324 (8%) 542 543Query: 32 FQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLSSDE 87 544 F+ KF K Y S EE+ RF +FK+NL L A+ H+ + GV +F+DL+ E 545Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANL-------LRAMRHQKMDPSARHGVTQFSDLTRSE 55 546 547Query: 88 FKNYYLNNKEAI-FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 548 F+ +L K D A L + + PEE FDWR RGAVTPVKNQG CGSC 549Sbjct: 56 FRRKHLGVKGGFKLPKDANQAPILPTQNL----PEE---FDWRDRGAVTPVKNQGSCGSC 108 550 551Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 552 WSFSTTG +EG HF++ KLVSLSEQ LVDCDHEC + E E +CD GCNGGL +A+ Y 553Sbjct: 109 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHEC-DPEEEGSCDSGCNGGLMNSAFEYT 167 554 555Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 556pattern 237 **** 557 +K GG+ E YPYT G C + + I A +SNF+++ NE +A ++ GPL 558Sbjct: 168 LKTGGLMREKDYPYTGTDGGSCKLDRSKI----VASVSNFSVVSINEDQIAANLIKNGPL 223 559 560Query: 267 AIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAK--NTIFRKNMPYWIVKNSWGAD 324 561 A+A +A Q YIGGV L+HG+L+VGY + + K PYWI+KNSWG 562Sbjct: 224 AVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGES 283 563 564Query: 325 WGEQGYIYLRRGKNTCGVSNFVST 348 565 WGE G+ + +G+N CGV + VST 566Sbjct: 284 WGENGFYKICKGRNICGVDSLVST 307 567 568 569>sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A) 570 Length = 363 571 572 Score = 270 bits (684), Expect = 2e-72 573 Identities = 144/327 (44%), Positives = 201/327 (61%), Gaps = 20/327 (6%) 574 575Query: 26 QSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 576 + F F+ KF+K Y+ EE+ RF +FKSNL K + + N + G+ KF+DL+ 577Sbjct: 45 EHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAK----LHQNRDPTAEHGITKFSDLT 100 578 579Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144 580 + EF+ +L K+ + LP + PE+ FDWR +GAVTPVK+QG CG 581Sbjct: 101 ASEFRRQFLGLKKRL---RLPAHAQKAPILPTTNLPED---FDWREKGAVTPVKDQGSCG 154 582 583Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 584 SCW+FSTTG +EG H+++ KLVSLSEQ LVDCDH C + E +CD GCNGGL NA+ 585Sbjct: 155 SCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVC-DPEQAGSCDSGCNGGLMNNAFE 213 586 587Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 588pattern 237 **** 589 Y++++GG+ E Y YT G+ C F+ + + A +SNF+++ +E +A +V G 590Sbjct: 214 YLLESGGVVQEKDYAYTGRDGS-CKFDKSKV----VASVSNFSVVTLDEDQIAANLVKNG 268 591 592Query: 265 PLAIAADAVEWQFYIGGV-FDIPCNPNSLDHGILIVGY--SAKNTIFRKNMPYWIVKNSW 321 593 PLA+A +A Q Y+ GV C + LDHG+L+VG+ A I K PYWI+KNSW 594Sbjct: 269 PLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSW 328 595 596Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348 597 G +WGEQGY + RG+N CGV + VST 598Sbjct: 329 GQNWGEQGYYKICRGRNVCGVDSMVST 355 599 600 601>sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 602 Length = 368 603 604 Score = 266 bits (672), Expect = 6e-71 605 Identities = 156/367 (42%), Positives = 206/367 (55%), Gaps = 42/367 (11%) 606 607Query: 6 LFVLAVFTVFVSSR---------------GIPPE---EQSQFLEFQDKFNKKY-SHEEYL 46 608 +FVL+ F V VSS G P+ + F F+ KF K Y S+EE+ 609Sbjct: 10 VFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHD 69 610 611Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTK--FGVNKFADLSSDEFKNYYLNNKEAI-FTDD 103 612 RF +FK+NL + + K D GV +F+DL+ EF+ +L + D 613Sbjct: 70 YRFSVFKANLRRARR------HQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKD 123 614 615Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 616 A L E + PE+ FDWR GAVTPVKNQG CGSCWSFS TG +EG +F++ 617Sbjct: 124 ANKAPILPTENL----PED---FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176 618 619Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 620 KLVSLSEQ LVDCDHEC + E ++CD GCNGGL +A+ Y +K GG+ E YPYT + 621Sbjct: 177 GKLVSLSEQQLVDCDHEC-DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGK 235 622 623Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVF 283 624pattern 237 **** 625 G C + + I A +SNF++I +E +A +V GPLA+A +A Q YIGGV 626Sbjct: 236 DGKTCKLDKSKI----VASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVS 291 627 628Query: 284 DIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 629 L+HG+L+VGY A K PYWI+KNSWG WGE G+ + +G+N CG 630Sbjct: 292 CPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICG 351 631 632Query: 342 VSNFVST 348 633 V + VST 634Sbjct: 352 VDSMVST 358 635 636 637>sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 638 Length = 371 639 640 Score = 252 bits (638), Expect = 6e-67 641 Identities = 138/332 (41%), Positives = 190/332 (56%), Gaps = 23/332 (6%) 642 643Query: 26 QSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 644 +S FL F +F K Y +E+ R +FK NL + L+ + GV KF+DL+ 645Sbjct: 45 ESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLL----DPSAEHGVTKFSDLT 100 646 647Query: 85 SDEFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQG 141 648 EF+ YL ++ A+ + A + +P + FDWR GAV PVKNQG 649Sbjct: 101 PAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDD----FDWRDHGAVGPVKNQG 156 650 651Query: 142 QCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPN 201 652 CGSCWSFS +G +EG H+++ KL LSEQ VDCDHEC E ++CD GCNGGL 653Sbjct: 157 SCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSE-PDSCDSGCNGGLMTT 215 654 655Query: 202 AYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261 656pattern 237 **** 657 A++Y+ K GG+++E YPYT G +C F+ + I A + NF+++ +E ++ ++ 658Sbjct: 216 AFSYLQKAGGLESEKDYPYTGSDG-KCKFDKSKI----VASVQNFSVVSVDEAQISANLI 270 659 660Query: 262 STGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKN 319 661 GPLAI +A Q YIGGV LDHG+L+VGY A I K+ PYWI+KN 662Sbjct: 271 KHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKN 330 663 664Query: 320 SWGADWGEQGYIYLRRG---KNTCGVSNFVST 348 665 SWG +WGE GY + RG +N CGV + VST 666Sbjct: 331 SWGENWGENGYYKICRGSNVRNKCGVDSMVST 362 667 668 669>sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK CATHEPSIN) 670 Length = 376 671 672 Score = 250 bits (633), Expect = 2e-66 673 Identities = 147/391 (37%), Positives = 213/391 (53%), Gaps = 63/391 (16%) 674 675Query: 1 MKVILLFVLAVFTVFVSSRGIP-------PEEQSQFLEFQDKFNKKYSHEEYLERFEIFK 53 676 M++++ +L +F F + P + ++ F E+ KFN++YS E+ R+ IFK 677Sbjct: 1 MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFK 60 678 679Query: 54 SNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK-EAIFTDDLPVADYLDD 112 680 SN+ ++ N + T G+N FAD++++E++ YL + A + + L+ 681Sbjct: 61 SNMDYVDNWNS---KGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNV 117 682 683Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 684 E + + P + DWRT+ AVTP+K+QGQCGSCWSFSTTG+ EG H + KLVSLSEQ 685Sbjct: 118 EDLQTNPK----SIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQ 173 686 687Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNS 232 688 NLVDC G E + GC+GGL NA++YIIKN GI TESSYPYTAETG+ C FN 689Sbjct: 174 NLVDC-------SGPEE-NFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNK 225 690 691Query: 233 ANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNP 289 692pattern 237 **** 693 ++IG A I + I + GP+++A DA +Q Y G++ P C+P 694Sbjct: 226 SDIG----ATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSP 281 695 696Query: 290 NSLDHGILIVGY--------------------------------SAKNTIFRKNMPYWIV 317 697 LDHG+L+VGY + +++ K YWIV 698Sbjct: 282 TELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIV 341 699 700Query: 318 KNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347 701 KNSWG WG +GYI + + KN CG+++ S 702Sbjct: 342 KNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372 703 704 705>sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 706 Length = 344 707 708 Score = 238 bits (601), Expect = 1e-62 709 Identities = 139/370 (37%), Positives = 201/370 (53%), Gaps = 45/370 (12%) 710 711Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59 712 MKV+ L VL V + + ++ F ++ K Y+ EE+ R+ IF +N+ + 713Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYV 60 714 715Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 716 ++ N + ++T G+N FAD++++E++N YL K F + + NS 717Sbjct: 61 QQWN----SKGSETVLGLNNFADITNEEYRNTYLGTK---FDASSLIGTQEEKVHTNSSA 113 718 719Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 720 + DWR+ GAVTPVKNQGQCG CWSFSTTG+ EG HF S+ +LVSLSEQNL+DC 721Sbjct: 114 ASK----DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCST 169 722 723Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 724pattern 237 *** 725 E + GC+GGL A+ YII N GI TESSYPY AE G +C + S N G 726Sbjct: 170 E----------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG-KCEYKSENSG--- 215 727 728Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGI 296 729pattern 240 * 730 A +S++ + V+ P+++A DA +Q Y G++ P C+ +LDHG+ 731Sbjct: 216 -ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGV 274 732 733Query: 297 LIVGY--------------SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCG 341 734 L VGY S+ N + YWIVKNSWG WG +GYI + R + N CG 735Sbjct: 275 LAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCG 334 736 737Query: 342 VSNFVSTSII 351 738 +++ S ++ 739Sbjct: 335 IASSASFPVV 344 740 741 742>sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 743 Length = 450 744 745 Score = 236 bits (597), Expect = 4e-62 746 Identities = 137/354 (38%), Positives = 193/354 (53%), Gaps = 34/354 (9%) 747 748Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEE 61 749 V+L + +V + S + + +F F+ K+ K Y +E RF F+ N+ E+ 750Sbjct: 15 VLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENM---EQ 71 751 752Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 753 + A + T FGV F+D++ +EF+ Y N A + +N 754Sbjct: 72 AKIQAAANPYAT-FGVTPFSDMTREEFRARYRNGASYF-----AAAQKRLRKTVNVTTGR 125 755 756Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 757 A DWR +GAVTPVK QGQCGSCW+FST GN+EGQ ++ N LVSLSEQ LV CD 758Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCD--- 182 759 760Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETG--TQCNFNSANIGP 237 761pattern 237 * 762 D GCNGGL NA+N+I+ + G + TE+SYPY + G QC N IG 763Sbjct: 183 -------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIG- 234 764 765Query: 238 EEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGIL 297 766pattern 238 *** 767 A I++ +P++E +A Y+ GPLAIA DA + Y GG+ C LDHG+L 768Sbjct: 235 ---AAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGIL-TSCTSKQLDHGVL 290 769 770Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 771 +VGY+ + N PYWI+KNSW WGE GYI + +G N C ++ VS++++ 772Sbjct: 291 LVGYNDNS-----NPPYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339 773 774 775>sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 776 Length = 319 777 778 Score = 233 bits (589), Expect = 3e-61 779 Identities = 128/334 (38%), Positives = 190/334 (56%), Gaps = 30/334 (8%) 780 781Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 782 +P ++++F+ K+ K+Y E RF IFKSN+ K + L + + +GV + 783Sbjct: 12 LPGNVDEKYVQFKLKYRKQYHETEDEIRFNIFKSNILKAQ---LYQVFVRGSAIYGVTPY 68 784 785Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 786 +DL++DEF +L + + L E +N+IP FDWR +GAVT VKNQ 787Sbjct: 69 SDLTTDEFARTHLTASWVVPSSRSNTPTSLGKE-VNNIPKN----FDWREKGAVTEVKNQ 123 788 789Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 200 790 G CGSCW+FSTTGNVE Q F KL+SLSEQ LVDCD D+GCNGGL 791Sbjct: 124 GMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCD----------GLDDGCNGGLPS 173 792 793Query: 201 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYI 260 794pattern 237 **** 795 NAY IIK GG+ E +YPY A+ +C+ + + I++ + ++ET +A ++ 796Sbjct: 174 NAYESIIKMGGLMLEDNYPYDAK-NEKCHLKTDGVA----VYINSSVNLTQDETELAAWL 228 797 798Query: 261 VSTGPLAIAADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIV 317 799 +++ +A+ QFY G+ + I C+ LDH +L+VGY + KN P+WIV 800Sbjct: 229 YHNSTISVGMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG----VSEKNEPFWIV 284 801 802Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 803 KNSWG +WGE GY + RG +CG++ ++++I 804Sbjct: 285 KNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318 805 806 807>sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-1) 808 Length = 354 809 810 Score = 233 bits (589), Expect = 3e-61 811 Identities = 144/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%) 812 813Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 814 LLF + V +FV G PP + + + F+ + K + + E RF F 815Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 816 817Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 818 K N+ LN + D KFADL+ EF YLN + D+ +D 819Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKDHKED 119 820 821Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 822 ++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ 823Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179 824 825Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230 826 LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ 827Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229 828 829Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 830pattern 237 **** 831 + E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C 832Sbjct: 230 HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285 833 834Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 835 SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ 836Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 837 838 839>sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 840 Length = 354 841 842 Score = 231 bits (584), Expect = 1e-60 843 Identities = 143/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%) 844 845Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 846 LLF + V +FV G PP + + + F+ + K + + E RF F 847Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 848 849Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 850 K N+ LN + D KFADL+ EF YLN + ++ +D 851Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKNHKED 119 852 853Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172 854 ++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ 855Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179 856 857Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230 858 LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ 859Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229 860 861Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290 862pattern 237 **** 863 + E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C 864Sbjct: 230 HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285 865 866Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 867 SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ 868Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 869 870 871>sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 872 Length = 322 873 874 Score = 221 bits (558), Expect = 1e-57 875 Identities = 132/349 (37%), Positives = 184/349 (51%), Gaps = 41/349 (11%) 876 877Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59 878 MKV+ LF+ + + + EF+ KF +KY EE R +F NL I 879Sbjct: 1 MKVVALFLFGLALAAANP---------SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYI 51 880 881Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 882 EE N + +N+F+D+++++F K+ P A F ++ 883Sbjct: 52 EEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKGYKKG----PRPAA-----VFTSTDA 102 884 885Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 886 E T DWRT+GAVTPVK+QGQCGSCW+FSTTG +EGQHF+ +LVSLSEQ LVDC 887Sbjct: 103 APESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-- 160 888 889Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 890pattern 237 *** 891 G ++GCNGG A Y+ NGG+ TESSYPY A T C FNS IG 892Sbjct: 161 -----AGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNT-CRFNSNTIG--- 211 893 894Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAVEWQF---YIGGVFDIPCNPNSLDHG 295 895pattern 240 * 896 A + + I + +E+ + GP+++A DA F Y G ++ C+ + LDH 897Sbjct: 212 -ATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHA 270 898 899Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVS 343 900 +L VGY ++ +W+VKNSW WGE GYI + R + N CG++ 901Sbjct: 271 VLAVGYGSEG-----GQDFWLVKNSWATSWGESGYIKMARNRNNNCGIA 314 902 903 904>sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEINASE) (CRUZAINE) 905 Length = 467 906 907 Score = 221 bits (557), Expect = 2e-57 908 Identities = 134/358 (37%), Positives = 189/358 (52%), Gaps = 38/358 (10%) 909 910Query: 3 VILLFVLAVFTVFV--SSRGIPPEEQ--SQFLEFQDKFNKKY-SHEEYLERFEIFKSNLG 57 911 ++L VL V V ++ + EE SQF EF+ K + Y S E R +F+ NL 912Sbjct: 8 LLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF 67 913 914Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 915 + L+ A H FGV F+DL+ +EF++ Y N + E + + 916Sbjct: 68 -LARLHAAANPHAT---FGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVKVEVVGA 123 917 918Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 177 919 A DWR RGAVT VK+QGQCGSCW+FS GNVE Q F++ + L +LSEQ LV C 920Sbjct: 124 -----PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSC 178 921 922Query: 178 DHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNSA 233 923 D D GC+GGL NA+ +I++ NG + TE SYPY + G C + 924Sbjct: 179 D----------KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGH 228 925 926Query: 234 NIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLD 293 927pattern 237 **** 928 +G A I+ +P++E +A ++ GP+A+A DA W Y GGV C LD 929Sbjct: 229 TVG----ATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM-TSCVSEQLD 283 930 931Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 932 HG+L+VGY+ + PYWI+KNSW WGE+GYI + +G N C V S++++ 933Sbjct: 284 HGVLLVGYNDSAAV-----PYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336 934 935 936>sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 937 Length = 323 938 939 Score = 216 bits (545), Expect = 5e-56 940 Identities = 131/349 (37%), Positives = 181/349 (51%), Gaps = 32/349 (9%) 941 942Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 943 +LF L V+ V S+ P + + F EF +FNK YS E E L RF+IF+ NL +I 944Sbjct: 4 ILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI---- 59 945 946Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123 947 I N K+ +NKF+DLS DE Y T + LD P + 948Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPTQTQNFCKVILLDQP-----PGKGP 113 949 950Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 951 FDWR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ ++DCD 952Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDF---- 169 953 954Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 955pattern 237 **** 956 D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + + 957Sbjct: 170 ------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219 958 959Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 960 + I E + + GP+ +A DA + Y G+ C + L+H +L+VGY 961Sbjct: 220 DCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQGIIKY-CFDSGLNHAVLLVGYGV 278 962 963Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351 964 +N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I 965Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 966 967 968>sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 969 Length = 323 970 971 Score = 215 bits (541), Expect = 1e-55 972 Identities = 132/357 (36%), Positives = 189/357 (51%), Gaps = 40/357 (11%) 973 974Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKI 59 975 MKV +LF+ V S + F+ K+ ++Y EE R IF+ N I 976Sbjct: 1 MKVAVLFLCGVALAAASP---------SWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYI 51 977 978Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 979 EE N N + +NKF D++ +EF N I PV+ + + 980Sbjct: 52 EEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGN---IPRRSAPVSVFYPKKETGP-- 106 981 982Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 983 + T DWRT+GAVTPVK+QGQCGSCW+FSTTG++EGQHF+ L+SL+EQ LVDC 984Sbjct: 107 --QATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDC-- 162 985 986Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 987pattern 237 *** 988 +GCNGG +A++YI N GI TE++YPY A G+ C F+S ++ 989Sbjct: 163 ------SRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGS-CRFDSNSVA--- 212 990 991Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHG 295 992pattern 240 * 993 A S T I +ET + + GP+++ DA +QFY GV+ P C+P+ LDH 994Sbjct: 213 -ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHA 271 995 996Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351 997 +L VGY ++ +W+VKNSW WG+ GYI + R + N CG++ S ++ 998Sbjct: 272 VLAVGYGSEG-----GQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323 999 1000 1001>sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 1002 Length = 324 1003 1004 Score = 214 bits (540), Expect = 2e-55 1005 Identities = 130/351 (37%), Positives = 188/351 (53%), Gaps = 33/351 (9%) 1006 1007Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 1008 M I+L++L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I 1009Sbjct: 1 MNKIVLYLLVYGAVQCAAYDVL-KAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59 1010 1011Query: 60 EELNLIAINHKADT-KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI 118 1012 I NH T ++ +NKFADLS DE + Y + T + LD 1013Sbjct: 60 -----INKNHNDSTAQYEINKFADLSKDETISKYTGLSLPLQTQNFCEVVVLDRP----- 109 1014 1015Query: 119 PPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 178 1016 P + FDWR VT VKNQG CG+CW+F+T G++E Q I N+ ++LSEQ L+DCD 1017Sbjct: 110 PDKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCD 169 1018 1019Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238 1020pattern 237 ** 1021 D GC+GGL A+ ++ GGIQ ES YPY A G C N+A + 1022Sbjct: 170 F----------VDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNG-DCRANAAKFVVK 218 1023 1024Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILI 298 1025pattern 239 ** 1026 + T+ E + + S GP+ +A DA + Y G+ C + L+H +L+ 1027Sbjct: 219 VKKCYRYITVF---EEKLKDLLRSVGPIPVAIDASDIVNYKRGIMKY-CANHGLNHAVLL 274 1028 1029Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 1030 VGY+ +N +P+WI+KN+WGADWGEQGY +++ N CG+ N + +S 1031Sbjct: 275 VGYAVEN-----GVPFWILKNTWGADWGEQGYFRVQQNINACGIQNELPSS 320 1032 1033 1034>sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 1035 Length = 321 1036 1037 Score = 214 bits (539), Expect = 2e-55 1038 Identities = 125/326 (38%), Positives = 184/326 (56%), Gaps = 47/326 (14%) 1039 1040Query: 32 FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-- 88 1041 F+ ++ +KY +E L R +F+ N IE+ N N + K +N+F D++++EF 1042Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82 1043 1044Query: 89 --KNYYLNNK---EAIFTDDL-PVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 1045 K Y ++ +A+FT + P+A DWRT+ VTPVK+Q Q 1046Sbjct: 83 VMKGYKKGSRGEPKAVFTAEAGPMA----------------ADVDWRTKALVTPVKDQEQ 126 1047 1048Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 1049 CGSCW+FS TG +EGQHF+ ++LVSLSEQ LVDC + ++GC GG +A 1050Sbjct: 127 CGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDC--------STDYGNDGCGGGWMTSA 178 1051 1052Query: 203 YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262 1053pattern 237 **** 1054 ++YI NGGI TESSYPY AE C F++ +IG A + + E + + 1055Sbjct: 179 FDYIKDNGGIDTESSYPYEAE-DRSCRFDANSIG----AICTGSVEVQHTEEALQEAVSG 233 1056 1057Query: 263 TGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKN 319 1058 GP+++A DA +QFY GV ++ C+P LDHG+L VGY ++T YW+VKN 1059Sbjct: 234 VGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTEST-----KDYWLVKN 288 1060 1061Query: 320 SWGADWGEQGYIYLRRGK-NTCGVSN 344 1062 SWG+ WG+ GYI + R + N CG+++ 1063Sbjct: 289 SWGSSWGDAGYIKMSRNRDNNCGIAS 314 1064 1065 1066>sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) (CYCLIC 1067 PROTEIN-2) (CP-2) 1068 Length = 334 1069 1070 Score = 212 bits (535), Expect = 7e-55 1071 Identities = 127/359 (35%), Positives = 195/359 (53%), Gaps = 39/359 (10%) 1072 1073Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 1074 ++LL VL + T + + +Q+ +++ + Y E R +++ N+ I+ 1075Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLH 62 1076 1077Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 1078 N N K +N F D++++EF+ + + K +F + L + 1079Sbjct: 63 NGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML---------- 112 1080 1081Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 1082 IP DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD 1083Sbjct: 113 QIPK----TVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168 1084 1085Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 1086 C H+ +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + 1087Sbjct: 169 CSHD----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215 1088 1089Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293 1090pattern 237 **** 1091 A + F IP+ E + + + GP+++A DA QFY G++ P C+ LD 1092Sbjct: 216 EYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLD 275 1093 1094Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTSII 351 1095 HG+L+VGY + T K+ YW+VKNSWG +WG GYI + + +N CG++ S I+ 1096Sbjct: 276 HGVLVVGYGYEGTDSNKD-KYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333 1097 1098 1099>sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 1100 Length = 334 1101 1102 Score = 212 bits (533), Expect = 1e-54 1103 Identities = 126/359 (35%), Positives = 198/359 (55%), Gaps = 39/359 (10%) 1104 1105Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 1106 ++LL VL + T + + +++ +++ + Y E R I++ N+ I+ 1107Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLH 62 1108 1109Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 1110 N N + +N F D++++EF+ + + K +F + L + 1111Sbjct: 63 NGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML---------- 112 1112 1113Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 1114 IP + DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD 1115Sbjct: 113 KIPK----SVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168 1116 1117Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 1118 C H +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + 1119Sbjct: 169 CSHA----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215 1120 1121Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293 1122pattern 237 **** 1123 A + F IP+ E + + + GP+++A DA QFY G++ P C+ +LD 1124Sbjct: 216 EFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD 275 1125 1126Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351 1127 HG+L+VGY + T KN YW+VKNSWG++WG +GYI + + + N CG++ S ++ 1128Sbjct: 276 HGVLLVGYGYEGTDSNKN-KYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333 1129 1130 1131>sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE) 1132 (SULFHYDRYL-ENDOPEPTIDASE) (SH-EP) 1133 Length = 362 1134 1135 Score = 209 bits (526), Expect = 8e-54 1136 Identities = 127/313 (40%), Positives = 179/313 (56%), Gaps = 35/313 (11%) 1137 1138Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 1139 +RF +FK+N+ + N + +K +NKFAD+++ EF++ Y +K +F 1140Sbjct: 58 KRFNVFKANVMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHHKMFRGS 113 1141 1142Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 1143 + E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I 1144Sbjct: 114 QHGSGTFMYEKVGSVP----ASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT 169 1145 1146Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 1147 NKLVSLSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPYTA+ 1148Sbjct: 170 NKLVSLSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQ 220 1149 1150Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 1151pattern 237 **** 1152 GT C+ + N + I +P N+ V+ P+++A DA ++QFY G 1153Sbjct: 221 EGT-CDESKVN---DLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEG 276 1154 1155Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337 1156 VF CN L+HG+ IVGY T+ N YWIV+NSWG +WGEQGYI ++R + 1157Sbjct: 277 VFTGDCN-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEQGYIRMQRNISKKE 331 1158 1159Query: 338 NTCGVSNFVSTSI 350 1160 CG++ S I 1161Sbjct: 332 GLCGIAMMASYPI 344 1162 1163 1164>sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 1165 Length = 323 1166 1167 Score = 209 bits (526), Expect = 8e-54 1168 Identities = 129/349 (36%), Positives = 179/349 (50%), Gaps = 32/349 (9%) 1169 1170Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 1171 +LF L V+ V S+ + + F EF +FNK Y E E L RF+IF+ NL +I 1172Sbjct: 4 ILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI---- 59 1173 1174Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123 1175 I N K+ +NKF+DLS DE Y I T + LD P + 1176Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQP-----PGKGP 113 1177 1178Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 1179 FDWR VT VKNQG CG+CW+F+T ++E Q I N+L++LSEQ ++DCD 1180Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---- 169 1181 1182Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 1183pattern 237 **** 1184 D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + + 1185Sbjct: 170 ------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219 1186 1187Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 1188 + I E + + GP+ +A DA + Y G+ C + L+H +L+VGY 1189Sbjct: 220 DCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIKY-CFNSGLNHAVLLVGYGV 278 1190 1191Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351 1192 +N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I 1193Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 1194 1195 1196>sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 1197 Length = 334 1198 1199 Score = 208 bits (525), Expect = 1e-53 1200 Identities = 126/351 (35%), Positives = 184/351 (51%), Gaps = 35/351 (9%) 1201 1202Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 1203 F L V + V+S + P + + +++ + Y E R +++ N I+ N 1204Sbjct: 5 FFLTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQ 64 1205 1206Query: 65 IAINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 1207 K + +N F D++++EF+ N + N K + + +P 1208Sbjct: 65 EYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-------KGKLFHEPLLVDVPK- 116 1209 1210Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 1211 + DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC 1212Sbjct: 117 ---SVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA- 172 1213 1214Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQ 240 1215pattern 237 ** ** 1216 +G ++GCNGGL NA+ YI NGG+ +E SYPY A CN+ PE 1217Sbjct: 173 ---QG----NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-----PECSA 220 1218 1219Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGIL 297 1220 A + F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L 1221Sbjct: 221 ANDTGFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVL 280 1222 1223Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 1224 +VGY + T N +WIVKNSWG +WG GY+ + + +N CG++ S 1225Sbjct: 281 VVGYGFEGTDSNNN-KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAAS 330 1226 1227 1228>sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 1229 Length = 356 1230 1231 Score = 207 bits (522), Expect = 2e-53 1232 Identities = 129/331 (38%), Positives = 181/331 (53%), Gaps = 40/331 (12%) 1233 1234Query: 29 FLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 1235 F F + K+Y S EE +RFEIF NL I N +++K G+N+F DL+ DE 1236Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYK----LGINEFTDLTWDE 112 1237 1238Query: 88 FKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144 1239 F+ + L N A +L + N + PE + DWR G V+PVK QG+CG 1240Sbjct: 113 FRKHKLGASQNCSATTKGNLKLT--------NVVLPETK---DWRKDGIVSPVKAQGKCG 161 1241 1242Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 1243 SCW+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ 1244Sbjct: 162 SCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNF--------GCNGGLPSQAFE 213 1245 1246Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 1247pattern 237 **** 1248 YI NGG+ TE +YPYT + G C F+ ANIG + + + N T+ + E A +V 1249Sbjct: 214 YIKFNGGLDTEEAYPYTGKNGI-CKFSQANIGVKVISSV-NITLGAEYELKYAVALVR-- 269 1250 1251Query: 265 PLAIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 1252 P+++A + V+ ++ Y GV+ + P ++H +L VGY +N PYW++KNS 1253Sbjct: 270 PVSVAFEVVKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVEN-----GTPYWLIKNS 324 1254 1255Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 1256 WGADWGE GY + GKN CGV+ S I+ 1257Sbjct: 325 WGADWGEDGYFKMEMGKNMCGVATCASYPIV 355 1258 1259 1260>sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-2) 1261 Length = 444 1262 1263 Score = 207 bits (521), Expect = 3e-53 1264 Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 39/327 (11%) 1265 1266Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 1267 F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS 1268Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 1269 1270Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 1271 EF YLN A + ++++P A DWR +GAVTPVK+QG 1272Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146 1273 1274Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 1275 CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A 1276Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196 1277 1278Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 1279pattern 237 **** 1280 ++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA 1281Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEEL--VVGAQIDGHVLIGSSEKAMAA 254 1282 1283Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 1284 ++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K 1285Sbjct: 255 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 308 1286 1287Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345 1288 NSWG DWGEQGY+ + G N C +S + 1289Sbjct: 309 NSWGGDWGEQGYVRVVMGVNACLLSEY 335 1290 1291 1292>sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 1293 Length = 443 1294 1295 Score = 206 bits (520), Expect = 4e-53 1296 Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 40/327 (12%) 1297 1298Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 1299 F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS 1300Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 1301 1302Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142 1303 EF YLN A + ++++P A DWR +GAVTPVK+QG 1304Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146 1305 1306Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202 1307 CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A 1308Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196 1309 1310Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 1311pattern 237 **** 1312 ++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA 1313Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSELV---VGAQIDGHVLIGSSEKAMAA 253 1314 1315Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 1316 ++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K 1317Sbjct: 254 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 307 1318 1319Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345 1320 NSWG DWGEQGY+ + G N C +S + 1321Sbjct: 308 NSWGGDWGEQGYVRVVMGVNACLLSEY 334 1322 1323 1324>sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 1325 Length = 333 1326 1327 Score = 206 bits (520), Expect = 4e-53 1328 Identities = 125/349 (35%), Positives = 187/349 (52%), Gaps = 34/349 (9%) 1329 1330Query: 8 VLAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 1331 +LA F + ++S + + ++Q+ +++ N+ Y E R +++ N+ IE N 1332Sbjct: 6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQE 65 1333 1334Query: 66 AINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 1335 K +N F D++S+EF+ N + N K F + E 1336Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVFQEPLFYEA 114 1337 1338Query: 123 QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182 1339 + DWR +G VTPVKNQGQCGSCW+FS TG +EGQ F +L+SLSEQNLVDC 1340Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDC----- 169 1341 1342Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242 1343pattern 237 **** 1344 G + +EGCNGGL A+ Y+ NGG+ +E SYPY A T C +N A 1345Sbjct: 170 --SGPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNP----KYSVAN 221 1346 1347Query: 243 ISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIV 299 1348 + F IPK E + + + GP+++A DA + FY G+ F+ C+ +DHG+L+V 1349Sbjct: 222 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVV 281 1350 1351Query: 300 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347 1352 GY ++T N YW+VKNSWG +WG GY+ + + +N CG+++ S 1353Sbjct: 282 GYGFEST-ESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 329 1354 1355 1356>sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 1357 Length = 334 1358 1359 Score = 206 bits (519), Expect = 5e-53 1360 Identities = 121/316 (38%), Positives = 167/316 (52%), Gaps = 33/316 (10%) 1361 1362Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---NYYLNNK 96 1363 Y E R +++ N+ IE N K +N F D++++EF+ N + N K 1364Sbjct: 40 YGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQK 99 1365 1366Query: 97 EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVE 156 1367 F S+ E + DWR +G VT VKNQGQCGSCW+FS TG +E 1368Sbjct: 100 HK-----------KGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALE 148 1369 1370Query: 157 GQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTES 216 1371 GQ F KLVSLSEQNLVDC +G ++GCNGGL NA+ Y+ NGG+ TE 1372Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSRP----QG----NQGCNGGLMDNAFQYVKDNGGLDTEE 200 1373 1374Query: 217 SYPYTAETGTQCNFNSANIGPE-EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--V 273 1375pattern 237 ** ** 1376 SYPY C + PE A + F IP+ E + + + GP+++A DA 1377Sbjct: 201 SYPYLGRETNSCTYK-----PECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHS 255 1378 1379Query: 274 EWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332 1380 +QFY G+ +D C+ LDHG+L+VGY + T + +WIVKNSWG +WG GY+ 1381Sbjct: 256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT-DSNSSKFWIVKNSWGPEWGWNGYVK 314 1382 1383Query: 333 LRRGKNT-CGVSNFVS 347 1384 + + +N CG+S S 1385Sbjct: 315 MAKDQNNHCGISTAAS 330 1386 1387 1388>sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 1389 Length = 380 1390 1391 Score = 204 bits (513), Expect = 3e-52 1392 Identities = 124/334 (37%), Positives = 178/334 (53%), Gaps = 41/334 (12%) 1393 1394Query: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADT----KFGVN 78 1395 E ++ + + K+ K Y S E+ RFEIFK L I+E H ADT K G+N 1396Sbjct: 37 EVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE-------HNADTNRSYKVGLN 89 1397 1398Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138 1399 +FADL+ +EF++ YL ++ V++ + F +P + DWR+ GAV +K 1400Sbjct: 90 QFADLTDEEFRSTYLGFTSG--SNKTKVSNRYEPRFGQVLP----SYVDWRSAGAVVDIK 143 1401 1402Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 1403 +QG+CG CW+FS VEG + I L+SLSEQ L+DC G GCNGG 1404Sbjct: 144 SQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC--------GRTQNTRGCNGGY 195 1405 1406Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 1407pattern 237 **** 1408 + + +II NGGI TE +YPYTA+ G +CN + N E+ I + +P N 1409Sbjct: 196 ITDGFQFIINNGGINTEENYPYTAQDG-ECNLDLQN---EKYVTIDTYENVPYNNEWALQ 251 1410 1411Query: 259 YIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWI 316 1412 V+ P+++A DA ++ Y G+F PC ++DH + IVGY + I YWI 1413Sbjct: 252 TAVTYQPVSVALDAAGDAFKHYSSGIFTGPCG-TAIDHAVTIVGYGTEGGI-----DYWI 305 1414 1415Query: 317 VKNSWGADWGEQGYIYLRR---GKNTCGVSNFVS 347 1416 VKNSW WGE+GY+ + R G TCG++ S 1417Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPS 339 1418 1419 1420>sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE EP-C1) 1421 Length = 362 1422 1423 Score = 203 bits (510), Expect = 6e-52 1424 Identities = 125/313 (39%), Positives = 177/313 (55%), Gaps = 35/313 (11%) 1425 1426Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 1427 +RF +FK+NL + N + +K +NKFAD+++ EF++ Y +K +F 1428Sbjct: 58 KRFNVFKANLMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHPRMFRGT 113 1429 1430Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 1431 E + S+PP + DWR +GAVT VK+QGQCGSCW+FST VEG + I 1432Sbjct: 114 PHENGAFMYEKVVSVPP----SVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKT 169 1433 1434Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 1435 NKLV+LSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPY A+ 1436Sbjct: 170 NKLVALSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQ 220 1437 1438Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 1439pattern 237 **** 1440 GT C+ + N + I +P N+ V+ P+++A DA ++QFY G 1441Sbjct: 221 EGT-CDASKVN---DLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEG 276 1442 1443Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337 1444 VF C+ L+HG+ IVGY T+ N YWIV+NSWG +WGE GYI ++R + 1445Sbjct: 277 VFTGDCS-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEHGYIRMQRNISKKE 331 1446 1447Query: 338 NTCGVSNFVSTSI 350 1448 CG++ S I 1449Sbjct: 332 GLCGIAMLPSYPI 344 1450 1451 1452>sp|Q10991|CATL_SHEEP CATHEPSIN L 1453 Length = 217 1454 1455 Score = 201 bits (507), Expect = 1e-51 1456 Identities = 105/226 (46%), Positives = 139/226 (61%), Gaps = 23/226 (10%) 1457 1458Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 1459 DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVD 1460Sbjct: 6 DWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD--------SS 57 1461 1462Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQAKISN 245 1463pattern 237 ** ** 1464 ++GCNGGL NA+ YI +NGG+ +E SYPY A T T CN+ PE AK + 1465Sbjct: 58 RPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEA-TDTSCNYK-----PEYSAAKDTG 111 1466 1467Query: 246 FTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYS 302 1468 F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY 1469Sbjct: 112 FVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG 171 1470 1471Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 1472 + T N +WIVKNSWG +WG +GY+ + + +N CG++ S 1473Sbjct: 172 FEGT----NNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAAS 213 1474 1475 1476>sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 1477 Length = 360 1478 1479 Score = 201 bits (506), Expect = 2e-51 1480 Identities = 121/307 (39%), Positives = 161/307 (52%), Gaps = 28/307 (9%) 1481 1482Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102 1483 +E RF +FK N+ I E N A K +NKF D+++ EF++ Y +K 1484Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQ---KKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRS 110 1485 1486Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162 1487 + ++ + DWR +GAVT VK+QGQCGSCW+FST +VEG + I 1488Sbjct: 111 QRGIQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIK 170 1489 1490Query: 163 QNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA 222 1491 +LVSLSEQ LVDCD + +EGCNGGL A+ +I KN GI TE SYPY 1492Sbjct: 171 TGELVSLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAE 220 1493 1494Query: 223 ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIG 280 1495pattern 237 **** 1496 + GT C N N I +P N V+ P++++ +A +QFY 1497Sbjct: 221 QDGT-CASNLLN---SPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSE 276 1498 1499Query: 281 GVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG---- 336 1500 GVF C LDHG+ IVGY A R YWIVKNSWG +WGE GYI ++RG 1501Sbjct: 277 GVFTGRCG-TELDHGVAIVGYGAT----RDGTKYWIVKNSWGEEWGESGYIRMQRGISDK 331 1502 1503Query: 337 KNTCGVS 343 1504 + CG++ 1505Sbjct: 332 RGKCGIA 338 1506 1507 1508>sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 1509 Length = 442 1510 1511 Score = 200 bits (504), Expect = 3e-51 1512 Identities = 117/308 (37%), Positives = 169/308 (53%), Gaps = 32/308 (10%) 1513 1514Query: 4 ILLFVLAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 1515 +L F+ + + S++ E Q + F + + YS EE+ R++IFKSN+ + + 1516Sbjct: 3 VLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQ 62 1517 1518Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121 1519 N + +T G+N FAD+++ E++ YL F + ++E I S P 1520Sbjct: 63 WN----SKGGETVLGLNVFADITNQEYRTTYLGTP---FDGSALIGT--EEEKIFSTPAP 113 1521 1522Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI---SQNKLVSLSEQNLVDCD 178 1523 DWR +GAVTP+KNQGQCG CWSFSTTG+ EG HFI ++ LVSLSEQNL+DC 1524Sbjct: 114 ---TVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC- 169 1525 1526Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238 1527pattern 237 ** 1528 + + GC GGL + YII N GI TESSYPYTAE G +C F ++NIG 1529Sbjct: 170 -------SKSYGNNGCEGGLMTLGFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIG-- 220 1530 1531Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHG 295 1532pattern 239 ** 1533 A+I ++ + + P+++A DA +Q Y G++ P C P LDHG 1534Sbjct: 221 --AQIVSYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACTPTQLDHG 278 1535 1536Query: 296 ILIVGYSA 303 1537 +L+VGY + 1538Sbjct: 279 VLVVGYGS 286 1539 1540 1541 Score = 48.8 bits (114), Expect = 2e-05 1542 Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 1/35 (2%) 1543 1544Query: 314 YWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 1545 YWIVKNSWG WG GYI++ + + N CG++ S 1546Sbjct: 401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMAS 435 1547 1548 1549>sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 1550 Length = 334 1551 1552 Score = 199 bits (501), Expect = 7e-51 1553 Identities = 127/357 (35%), Positives = 191/357 (52%), Gaps = 43/357 (12%) 1554 1555Query: 5 LLFVLAVFTVFVSSRGIPPEEQS---QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 1556 L VLA F + ++S +P +Q+ ++ +++ + Y E R +++ N+ IE 1557Sbjct: 3 LSLVLAAFCLGIAS-AVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61 1558 1559Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNY---YLNNK---EAIFTDDLPVADYLDDEFI 115 1560 N K +N F D++++EF+ + N K +F + L +LD 1561Sbjct: 62 HNGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFRKGKVFREPL----FLD---- 113 1562 1563Query: 116 NSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLV 175 1564 +P + DWR +G VTPVKNQ QCGSCW+FS TG +EGQ F KLVSLSEQNLV 1565Sbjct: 114 --LPK----SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167 1566 1567Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI 235 1568 DC +G ++GCNGG A+ Y+ +NGG+ +E SYPY A C + N 1569Sbjct: 168 DCSRP----QG----NQGCNGGFMARAFQYVKENGGLDSEESYPYVA-VDEICKYRPEN- 217 1570 1571Query: 236 GPEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNS 291 1572pattern 237 **** 1573 A + FT++ P E + + + GP+++A DA +QFY G+ F+ C+ + 1574Sbjct: 218 ---SVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKN 274 1575 1576Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347 1577 LDHG+L+VGY + N YW+VKNSWG +WG GY+ + + KN CG++ S 1578Sbjct: 275 LDHGVLVVGYGFEGA-NSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAAS 330 1579 1580 1581>sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 1582 Length = 324 1583 1584 Score = 196 bits (494), Expect = 5e-50 1585 Identities = 116/322 (36%), Positives = 168/322 (52%), Gaps = 30/322 (9%) 1586 1587Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 1588 F +F KFNK YS E E L RF+IF+ NL +I N + + ++ +NKF+DLS +E 1589Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKN----QNDSTAQYEINKFSDLSKEE 83 1590 1591Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147 1592 + Y T + LD P FDWR VT VKNQG CG+CW 1593Sbjct: 84 AISKYTGLSLPHQTQNFCEVVILDRP-----PDRGPLEFDWRQFNKVTSVKNQGVCGACW 138 1594 1595Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207 1596 +F+T G++E Q I N+L++LSEQ +DCD + GC+GGL A+ + 1597Sbjct: 139 AFATLGSLESQFAIKYNRLINLSEQQFIDCDR----------VNAGCDGGLLHTAFESAM 188 1598 1599Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 1600pattern 237 **** 1601 + GG+Q ES YPY G QC N ++ M E + + + GP+ 1602Sbjct: 189 EMGGVQMESDYPYETANG-QCRINPNRFVVGVRSCRRYIVMF---EEKLKDLLRAVGPIP 244 1603 1604Query: 268 IAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 1605 +A DA + Y G+ C + L+H +L+VGY+ +N N+PYWI+KN+WG DWGE 1606Sbjct: 245 VAIDASDIVNYRRGIMR-QCANHGLNHAVLLVGYAVEN-----NIPYWILKNTWGTDWGE 298 1607 1608Query: 328 QGYIYLRRGKNTCGVSNFVSTS 349 1609 GY +++ N CG+ N + +S 1610Sbjct: 299 DGYFRVQQNINACGIRNELVSS 320 1611 1612 1613>sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 1614 Length = 471 1615 1616 Score = 196 bits (494), Expect = 5e-50 1617 Identities = 115/310 (37%), Positives = 166/310 (53%), Gaps = 31/310 (10%) 1618 1619Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD 103 1620 E+ RF +F NL ++ N A + + G+N+FADL+++EF+ +L K A 1621Sbjct: 69 EHERRFLVFWDNLKFVDAHNARA-DEGGGFRLGMNRFADLTNEEFRATFLGAKVA--ERS 125 1622 1623Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163 1624 + + + +P + DWR +GAV PVKNQGQCGSCW+FS VE + + 1625Sbjct: 126 RAAGERYRHDGVEELPE----SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVT 181 1626 1627Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223 1628 ++++LSEQ LV+C + GCNGGL +A+++IIKNGGI TE YPY A 1629Sbjct: 182 GEMITLSEQELVEC--------STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAV 233 1630 1631Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281 1632pattern 237 **** 1633 G +C+ N N + I F +P+N+ V+ P+++A +A E+Q Y G 1634Sbjct: 234 DG-KCDINREN---AKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289 1635 1636Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-- 339 1637 VF C SLDHG++ VGY N YWIV+NSWG WGE GY+ + R N 1638Sbjct: 290 VFSGRCG-TSLDHGVVAVGYGTDN-----GKDYWIVRNSWGPKWGESGYVRMERNINVTT 343 1639 1640Query: 340 --CGVSNFVS 347 1641 CG++ S 1642Sbjct: 344 GKCGIAMMAS 353 1643 1644 1645>sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 1646 Length = 458 1647 1648 Score = 194 bits (488), Expect = 2e-49 1649 Identities = 124/355 (34%), Positives = 183/355 (50%), Gaps = 43/355 (12%) 1650 1651Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ--FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 1652 ++LL LA + + S G EE+++ + E++ + K Y+ E R+ F+ NL I 1653Sbjct: 12 LLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYI 71 1654 1655Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-----NKEAIFTDDLPVADYLDDEF 114 1656 +E N A + G+N+FADL+++E+++ YL +E +D AD 1657Sbjct: 72 DEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAAD------ 125 1658 1659Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 1660 N PE + DWRT+GAV +K+QG CGSCW+FS VE + I L+SLSEQ L 1661Sbjct: 126 -NEALPE---SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQEL 181 1662 1663Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 1664 VDCD + +EGCNGGL A+++II NGGI TE YPY + +C+ N N 1665Sbjct: 182 VDCD---------TSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGK-DERCDVNRKN 231 1666 1667Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292 1668pattern 237 **** 1669 + I ++ + N V P+++A +A +Q Y G+F C +L 1670Sbjct: 232 ---AKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCG-TAL 287 1671 1672Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343 1673 DHG+ VGY +N YWIV+NSWG WGE GY+ + R CG++ 1674Sbjct: 288 DHGVAAVGYGTEN-----GKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIA 337 1675 1676 1677>sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 1678 Length = 462 1679 1680 Score = 193 bits (486), Expect = 4e-49 1681 Identities = 122/321 (38%), Positives = 168/321 (52%), Gaps = 43/321 (13%) 1682 1683Query: 35 KFNKKYSHEEYLE---RFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNY 91 1684 K K S +E RFEIFK NL ++E N ++++ G+ +FADL++DE+++ 1685Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYR----LGLTRFADLTNDEYRSK 111 1686 1687Query: 92 YLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWS 148 1688 YL K+ L + DE SI DWR +GAV VK+QG CGSCW+ 1689Sbjct: 112 YLGAKMEKKGERRTSLRYEARVGDELPESI--------DWRKKGAVAEVKDQGGCGSCWA 163 1690 1691Query: 149 FSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 208 1692 FST G VEG + I L++LSEQ LVDCD + +EGCNGGL A+ +IIK 1693Sbjct: 164 FSTIGAVEGINQIVTGDLITLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIIK 214 1694 1695Query: 209 NGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAI 268 1696pattern 237 **** 1697 NGGI T+ YPY GT C+ N + I ++ +P V+ P++I 1698Sbjct: 215 NGGIDTDKDYPYKGVDGT-CDQIRKN---AKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270 1699 1700Query: 269 AADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 1701 A +A +Q Y G+FD C LDHG++ VGY +N YWIV+NSWG WG 1702Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCG-TQLDHGVVAVGYGTEN-----GKDYWIVRNSWGKSWG 324 1703 1704Query: 327 EQGYIYLRR----GKNTCGVS 343 1705 E GY+ + R CG++ 1706Sbjct: 325 ESGYLRMARNIASSSGKCGIA 345 1707 1708 1709>sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 1710 Length = 360 1711 1712 Score = 193 bits (485), Expect = 5e-49 1713 Identities = 115/329 (34%), Positives = 172/329 (51%), Gaps = 32/329 (9%) 1714 1715Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 1716 +F F ++ K Y S E +RF IF +L + N ++++ G+N+FAD+S + 1717Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYR----LGINRFADMSWE 113 1718 1719Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 1720 EF+ L + A + + + DWR G V+PVKNQG CGSC 1721Sbjct: 114 EFRATRLGAAQNCS------ATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSC 167 1722 1723Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 1724 W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI 1725Sbjct: 168 WTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNF--------GCNGGLPSQAFEYI 219 1726 1727Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 1728pattern 237 **** 1729 NGG+ TE SYPY G C F + N+G + + N T+ ++E A +V P+ 1730Sbjct: 220 KYNGGLDTEESYPYQGVNGI-CKFKNENVGVKVLDSV-NITLGAEDELKDAVGLVR--PV 275 1731 1732Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 1733 ++A + + ++ Y GV+ P ++H +L VGY ++ +PYW++KNSWG 1734Sbjct: 276 SVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVED-----GVPYWLIKNSWG 330 1735 1736Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 1737 ADWG++GY + GKN CGV+ S I+ 1738Sbjct: 331 ADWGDEGYFKMEMGKNMCGVATCASYPIV 359 1739 1740 1741>sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II) (PPII) 1742 Length = 352 1743 1744 Score = 192 bits (482), Expect = 1e-48 1745 Identities = 128/319 (40%), Positives = 169/319 (52%), Gaps = 43/319 (13%) 1746 1747Query: 35 KFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 1748 K NK Y S +E + RFEIF+ NL I+E N K + + G+N FADLS+DEFK 1749Sbjct: 54 KHNKIYESIDEKIYRFEIFRDNLMYIDETN------KKNNSYWLGLNGFADLSNDEFKKK 107 1750 1751Query: 92 YLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150 1752 Y+ +D ++ D+E F + DWR +GAVTPVKNQG CGSCW+FS 1753Sbjct: 108 YVG----FVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFS 163 1754 1755Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210 1756 T VEG + I L+ LSEQ LVDCD GC GG Q + Y + N 1757Sbjct: 164 TIATVEGINKIVTGNLLELSEQELVDCDKH----------SYGCKGGYQTTSLQY-VANN 212 1758 1759Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVSTGPLAIA 269 1760pattern 237 **** 1761 G+ T YPY A+ +C A P + KI+ + +P N ET G + + PL++ 1762Sbjct: 213 GVHTSKVYPYQAKQ-YKCR---ATDKPGPKVKITGYKRVPSNCETSFLGALANQ-PLSVL 267 1763 1764Query: 270 ADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 1765 +A +Q Y GVFD PC LDH + VGY + KN Y I+KNSWG +WGE 1766Sbjct: 268 VEAGGKPFQLYKSGVFDGPCG-TKLDHAVTAVGYGTSD---GKN--YIIIKNSWGPNWGE 321 1767 1768Query: 328 QGYIYLRR----GKNTCGV 342 1769 +GY+ L+R + TCGV 1770Sbjct: 322 KGYMRLKRQSGNSQGTCGV 340 1771 1772 1773>sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 1774 Length = 333 1775 1776 Score = 192 bits (482), Expect = 1e-48 1777 Identities = 121/333 (36%), Positives = 173/333 (51%), Gaps = 38/333 (11%) 1778 1779Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 1780 E+ F + + K YS EY R ++F +N KI+ N NH K G+N+F+D+S 1781Sbjct: 29 EKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHN--QRNHTF--KMGLNQFSDMS 84 1782 1783Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143 1784 E K+ YL ++ ++ P ++ DWR +G V+PVKNQG C 1785Sbjct: 85 FAEIKHKYLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136 1786 1787Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 1788 GSCW+FSTTG +E I+ K+++L+EQ LVDC + + GC GGL A+ 1789Sbjct: 137 GSCWTFSTTGALESAVAIASGKMMTLAEQQLVDC--------AQNFNNHGCQGGLPSQAF 188 1790 1791Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ-AKISNFTMIPKN-ETVMAGYIV 261 1792pattern 237 **** 1793 YI+ N GI E SYPY + G QC FN PE+ A + N I N E M + 1794Sbjct: 189 EYILYNKGIMGEDSYPYIGKNG-QCKFN-----PEKAVAFVKNVVNITLNDEAAMVEAVA 242 1795 1796Query: 262 STGPLAIAADAVE-WQFYIGGVFDI-PCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIV 317 1797 P++ A + E + Y GV+ C+ P+ ++H +L VGY +N + YWIV 1798Sbjct: 243 LYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIV 297 1799 1800Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 1801 KNSWG++WG GY + RGKN CG++ S I 1802Sbjct: 298 KNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330 1803 1804 1805>sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 1806 Length = 328 1807 1808 Score = 190 bits (477), Expect = 5e-48 1809 Identities = 114/304 (37%), Positives = 164/304 (53%), Gaps = 29/304 (9%) 1810 1811Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPV 106 1812 ERF IFK NL I+ N N A K G+ FA+L++DE+++ YL + + 1813Sbjct: 27 ERFNIFKDNLRFIDLHN--ENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRR-ITK 83 1814 1815Query: 107 ADYLDDEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK 165 1816 A ++ ++ ++ +E DWR +GAV +K+QG CGSCW+FST VEG + I + 1817Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143 1818 1819Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225 1820 LVSLSEQ LVDCD ++ ++GCNGGL A+ +I+KNGG+ TE YPY G 1821Sbjct: 144 LVSLSEQELVDCD---------KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNG 194 1822 1823Query: 226 TQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF 283 1824pattern 237 **** 1825 +CN N I + +P + VS P+++A DA +Q Y G+F 1826Sbjct: 195 -KCNSLLKN---SRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIF 250 1827 1828Query: 284 DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNT 339 1829 C N +DH ++ VGY ++N + YWIV+NSWG WGE GYI + R 1830Sbjct: 251 TGKCGTN-MDHAVVAVGYGSEN-----GVDYWIVRNSWGTRWGEDGYIRMERNVASKSGK 304 1831 1832Query: 340 CGVS 343 1833 CG++ 1834Sbjct: 305 CGIA 308 1835 1836 1837>sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 1838 Length = 335 1839 1840 Score = 188 bits (472), Expect = 2e-47 1841 Identities = 123/332 (37%), Positives = 170/332 (51%), Gaps = 36/332 (10%) 1842 1843Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 1844 E+ F + K K YS EEY R + F SN KI N N K +N+F+D+S 1845Sbjct: 31 EKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHN----NGNHTFKMALNQFSDMS 86 1846 1847Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VTPVKNQGQC 143 1848 E K+ YL ++ ++YL PP + DWR +G V+PVKNQG C 1849Sbjct: 87 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPP----SVDWRKKGNFVSPVKNQGAC 138 1850 1851Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 1852 GSCW+FSTTG +E I+ K++SL+EQ LVDC + Y GC GGL A+ 1853Sbjct: 139 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNY--------GCQGGLPSQAF 190 1854 1855Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSAN-IGPEEQAKISNFTMIPKNETVMAGYIVS 262 1856pattern 237 **** 1857 YI+ N GI E +YPY + G C F IG + ++N T+ +E M + 1858Sbjct: 191 EYILYNKGIMGEDTYPYQGKDG-YCKFQPGKAIGFVKD--VANITIY--DEEAMVEAVAL 245 1859 1860Query: 263 TGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 1861 P++ A + ++ Y G++ C+ P+ ++H +L VGY KN I PYWIVK 1862Sbjct: 246 YNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGI-----PYWIVK 300 1863 1864Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 1865 NSWG WG GY + RGKN CG++ S I 1866Sbjct: 301 NSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 1867 1868 1869>sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA) (PAPAYA PROTEINASE III) 1870 (PPIII) (PAPAYA PEPTIDASE A) 1871 Length = 348 1872 1873 Score = 187 bits (471), Expect = 2e-47 1874 Identities = 121/319 (37%), Positives = 161/319 (49%), Gaps = 38/319 (11%) 1875 1876Query: 37 NKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNYYL 93 1877 NK Y + +E L RFEIFK NL I+E N K + + G+N+FADLS+DEF Y+ 1878Sbjct: 56 NKFYENVDEKLYRFEIFKDNLNYIDETN------KKNNSYWLGLNEFADLSNDEFNEKYV 109 1879 1880Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153 1881 + D + D+EFIN DWR +GAVTPV++QG CGSCW+FS 1882Sbjct: 110 GS-----LIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVA 164 1883 1884Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 1885 VEG + I KLV LSEQ LVDC+ GC GG P A Y+ KN GI 1886Sbjct: 165 TVEGINKIRTGKLVELSEQELVDCERR----------SHGCKGGYPPYALEYVAKN-GIH 213 1887 1888Query: 214 TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273 1889pattern 237 **** 1890 S YPY A+ GT C GP K S + N ++ P+++ ++ 1891Sbjct: 214 LRSKYPYKAKQGT-CRAKQVG-GP--IVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESK 269 1892 1893Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 1894 +Q Y GG+F+ PC +DH + VGY Y ++KNSWG WGE+GYI 1895Sbjct: 270 GRPFQLYKGGIFEGPCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGTAWGEKGYI 323 1896 1897Query: 332 YLRRGK-NTCGVSNFVSTS 349 1898 ++R N+ GV +S 1899Sbjct: 324 RIKRAPGNSPGVCGLYKSS 342 1900 1901 1902>sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 1903 Length = 362 1904 1905 Score = 187 bits (471), Expect = 2e-47 1906 Identities = 112/329 (34%), Positives = 170/329 (51%), Gaps = 33/329 (10%) 1907 1908Query: 28 QFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 1909 +F F + K+Y E RF IF +L + N + ++ G+N+FAD+S + 1910Sbjct: 61 RFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYR----LGINRFADMSWE 116 1911 1912Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 1913 EF+ L + A + + P +T DWR G V+PVK+QG CGSC 1914Sbjct: 117 EFQASRLGAAQNCS------ATLAGNHRMRDAPALPETK-DWREDGIVSPVKDQGHCGSC 169 1915 1916Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 1917 W FSTTG++E ++ + VSLSEQ L DC + GC+GGL A+ YI 1918Sbjct: 170 WPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNF--------GCSGGLPSQAFEYI 221 1919 1920Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 1921pattern 237 **** 1922 NGG+ TE +YPYT G C++ N G + + N T++ ++E A +V P+ 1923Sbjct: 222 KYNGGLDTEEAYPYTGVNGI-CHYKPENAGVKVLDSV-NITLVAEDELKNAVGLVR--PV 277 1924 1925Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 1926 ++A + ++ Y GV+ +P ++H +L VGY +N +PYW++KNSWG 1927Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 332 1928 1929Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 1930 ADWG+ GY + GKN CG++ S I+ 1931Sbjct: 333 ADWGDNGYFTMEMGKNMCGIATCASYPIV 361 1932 1933 1934>sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 1935 Length = 333 1936 1937 Score = 187 bits (469), Expect = 4e-47 1938 Identities = 115/356 (32%), Positives = 184/356 (51%), Gaps = 30/356 (8%) 1939 1940Query: 3 VILLFVLAVFTVFVSSRGIPPEEQS--QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 1941 +I + LA+ + V S P+ ++ E++ K K Y+ E + +++ N IE 1942Sbjct: 1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIE 60 1943 1944Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIP 119 1945 N + + D +N F DL++ EF ++ I + + D +F+ +P 1946Sbjct: 61 LHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHI----FQDHQFLY-VP 115 1947 1948Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179 1949 DWR G VTPVKNQG C S W+FS TG++EGQ F +L+ LSEQNL+DC 1950Sbjct: 116 KR----VDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMG 171 1951 1952Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239 1953pattern 237 *** 1954 + + GC+GG A+ Y+ NGG+ TE SYPY + G +C +++ N 1955Sbjct: 172 SNVTH--------GCSGGFMQYAFQYVKDNGGLATEESYPYRGQ-GRECRYHAEN----S 218 1956 1957Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGI 296 1958pattern 240 * 1959 A + +F IP +E + + GP+++A DA +QFY G++ P C L+H + 1960Sbjct: 219 AANVRDFVQIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAV 278 1961 1962Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351 1963 L+VGY + N +W+VKNSWG +WG +GY+ L + N CG++ + + I+ 1964Sbjct: 279 LVVGYGFEGEESDGN-SFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333 1965 1966 1967>sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 1968 Length = 335 1969 1970 Score = 186 bits (468), Expect = 5e-47 1971 Identities = 124/343 (36%), Positives = 176/343 (51%), Gaps = 42/343 (12%) 1972 1973Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76 1974 S+ + E+ F + + KKYS EEY R ++F SN KI N A NH K G 1975Sbjct: 23 SNLAVSSFEKLHFKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHN--AGNHTF--KLG 78 1976 1977Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VT 135 1978 +N+F+D+S DE ++ YL ++ +YL PP + DWR +G V+ 1979Sbjct: 79 LNQFSDMSFDEIRHKYLWSEPQ--NCSATKGNYLRGT--GPYPP----SMDWRKKGNFVS 130 1980 1981Query: 136 PVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCN 195 1982 PVKNQG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + + GC 1983Sbjct: 131 PVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDC--------AQNFNNHGCQ 182 1984 1985Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ----AKISNFTMIPK 251 1986pattern 237 **** 1987 GGL A+ YI N GI E +YPY + C F P++ ++N TM 1988Sbjct: 183 GGLPSQAFEYIRYNKGIMGEDTYPYKGQ-DDHCKFQ-----PDKAIAFVKDVANITM--N 234 1989 1990Query: 252 NETVMAGYIVSTGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTI 307 1991 +E M + P++ A + ++ Y G++ C+ P+ ++H +L VGY +N I 1992Sbjct: 235 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI 294 1993 1994Query: 308 FRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 1995 PYWIVKNSWG WG GY + RGKN CG++ S I 1996Sbjct: 295 -----PYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 1997 1998 1999>sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 2000 Length = 362 2001 2002 Score = 185 bits (466), Expect = 9e-47 2003 Identities = 111/329 (33%), Positives = 169/329 (50%), Gaps = 33/329 (10%) 2004 2005Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 2006 +F F ++ K Y S E RF IF +L ++ N + ++ G+N+F+D+S + 2007Sbjct: 60 RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWE 115 2008 2009Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146 2010 EF+ L + A + + +T DWR G V+PVKNQ CGSC 2011Sbjct: 116 EFQATRLGAAQTCS------ATLAGNHLMRDAAALPETK-DWREDGIVSPVKNQAHCGSC 168 2012 2013Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206 2014 W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI 2015Sbjct: 169 WTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYI 220 2016 2017Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 2018pattern 237 **** 2019 NGGI TE SYPY G C++ + N + + N T+ ++E A +V P+ 2020Sbjct: 221 KYNGGIDTEESYPYKGVNGV-CHYKAENAAVQVLDSV-NITLNAEDELKNAVGLVR--PV 276 2021 2022Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322 2023 ++A ++ ++ Y GV+ P+ ++H +L VGY +N +PYW++KNSWG 2024Sbjct: 277 SVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 331 2025 2026Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 2027 ADWG+ GY + GKN C ++ S ++ 2028Sbjct: 332 ADWGDNGYFKMEMGKNMCAIATCASYPVV 360 2029 2030 2031>sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEPSIN X) (CATHEPSIN O2) 2032 Length = 329 2033 2034 Score = 185 bits (465), Expect = 1e-46 2035 Identities = 123/350 (35%), Positives = 185/350 (52%), Gaps = 39/350 (11%) 2036 2037Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 2038 L V + V S + PEE + + ++ K+Y+++ + + R I++ NL I NL 2039Sbjct: 4 LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63 2040 2041Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTA 125 2042 A + +N D++S+E K +P++ ++ + IP E A 2043Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLK-------VPLSHSRSNDTLY-IPEWEGRA 115 2044 2045Query: 126 ---FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182 2046 D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 2047Sbjct: 116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-- 173 2048 2049Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242 2050pattern 237 **** 2051 ++GC GG NA+ Y+ KN GI +E +YPY + C +N + AK 2052Sbjct: 174 --------NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE-ESCMYNPTG----KAAK 220 2053 2054Query: 243 ISNFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILI 298 2055 + IP+ NE + + GP+++A DA +QFY GV +D CN ++L+H +L 2056Sbjct: 221 CRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLA 280 2057 2058Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 2059 VGY +K +WI+KNSWG +WG +GYI + R K N CG++N S 2060Sbjct: 281 VGYG-----IQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325 2061 2062 2063>sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPAYA PEPTIDASE B) (GLYCYL 2064 ENDOPEPTIDASE) 2065 Length = 348 2066 2067 Score = 184 bits (462), Expect = 3e-46 2068 Identities = 116/315 (36%), Positives = 162/315 (50%), Gaps = 37/315 (11%) 2069 2070Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYL 93 2071 K NK Y + +E L RFEIFK NL I+E N + + G+N+F+DLS+DEFK Y+ 2072Sbjct: 54 KHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYW----LGLNEFSDLSNDEFKEKYV 109 2073 2074Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153 2075 + +T+ D+EF+N + + DWR +GAVTPVK+QG C SCW+FST 2076Sbjct: 110 GSLPEDYTNQP-----YDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVA 164 2077 2078Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 2079 VEG + I LV LSEQ LVDCD + GCN G Q + Y+ +N GI 2080Sbjct: 165 TVEGINKIKTGNLVELSEQELVDCDKQ----------SYGCNRGYQSTSLQYVAQN-GIH 213 2081 2082Query: 214 TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273 2083pattern 237 **** 2084 + YPY A+ T C N GP + K + + N ++ P+++ ++ 2085Sbjct: 214 LRAKYPYIAKQQT-CRANQVG-GP--KVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESA 269 2086 2087Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 2088 ++Q Y GG+F+ C +DH + VGY Y ++KNSWG WGE GYI 2089Sbjct: 270 GRDFQNYKGGIFEGSCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGPGWGENGYI 323 2090 2091Query: 332 YLRRGK----NTCGV 342 2092 +RR CGV 2093Sbjct: 324 RIRRASGNSPGVCGV 338 2094 2095 2096>sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 2097 Length = 373 2098 2099 Score = 183 bits (461), Expect = 3e-46 2100 Identities = 125/349 (35%), Positives = 171/349 (48%), Gaps = 40/349 (11%) 2101 2102Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 2103 VLAV V + S IP E++ E +Q + H E RF FKSN 2104Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 2105 2106Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLP-VADYLDDEF- 114 2107 I + N + D + + N+F D+ EF+ ++ + P V ++ 2108Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALN 130 2109 2110Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 2111 ++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L 2112Sbjct: 131 VSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQEL 186 2113 2114Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 2115 +DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN A 2116Sbjct: 187 IDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNVARAA 236 2117 2118Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292 2119pattern 237 **** 2120 I +P N V+ P+++A +A + FY GVF C L 2121Sbjct: 237 QNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECG-TEL 295 2122 2123Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 2124 DHG+ +VGY + YW VKNSWG WGEQGYI + + G 2125Sbjct: 296 DHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 2126 2127 2128>sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 2129 Length = 371 2130 2131 Score = 183 bits (460), Expect = 5e-46 2132 Identities = 126/353 (35%), Positives = 170/353 (47%), Gaps = 48/353 (13%) 2133 2134Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 2135 VLAV V + S IP E++ E +Q + H E RF FKSN 2136Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 2137 2138Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF-- 114 2139 I + N + D + + N+F D+ EF+ ++ + D P F 2140Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRR----DTPAKPPSVPGFMY 126 2141 2142Query: 115 ----INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170 2143 ++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLS 2144Sbjct: 127 AALNVSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLS 182 2145 2146Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF 230 2147 EQ L+DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN 2148Sbjct: 183 EQELIDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNV 232 2149 2150Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 288 2151pattern 237 **** 2152 A I +P N V+ P+++A +A + FY GVF C 2153Sbjct: 233 ARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCG 292 2154 2155Query: 289 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 2156 LDHG+ +VGY + YW VKNSWG WGEQGYI + + G 2157Sbjct: 293 -TELDHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 2158 2159 2160>sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 2161 Length = 329 2162 2163 Score = 183 bits (459), Expect = 6e-46 2164 Identities = 119/348 (34%), Positives = 181/348 (51%), Gaps = 35/348 (10%) 2165 2166Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 2167 L V + V S + PEE +Q+ ++ ++K+Y+ + + + R I++ NL I NL 2168Sbjct: 4 LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63 2169 2170Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQT 124 2171 A + +N D++S+E K P + +D +I 2172Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVP------PSRSHSNDTLYIPDWEGRTPD 117 2173 2174Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 2175 + D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 2176Sbjct: 118 SIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---- 173 2177 2178Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 2179pattern 237 **** 2180 + GC GG NA+ Y+ +N GI +E +YPY + C +N + AK 2181Sbjct: 174 ------NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQ-DESCMYNPTG----KAAKCR 222 2182 2183Query: 245 NFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVG 300 2184 + IP+ NE + + GP+++A DA +QFY GV +D C+ ++++H +L VG 2185Sbjct: 223 GYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVG 282 2186 2187Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 2188 Y +K +WI+KNSWG WG +GYI + R K N CG++N S 2189Sbjct: 283 YG-----IQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLAS 325 2190 2191 2192>sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 2193 Length = 379 2194 2195 Score = 182 bits (458), Expect = 8e-46 2196 Identities = 110/322 (34%), Positives = 173/322 (53%), Gaps = 38/322 (11%) 2197 2198Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLNNKE 97 2199 ++HEE +R EIFK+N I ++N N K+ + G+NKFAD++ EF YL + 2200Sbjct: 56 HNHEEEAKRLEIFKNNSNYIRDMNA---NRKSPHSHRLGLNKFADITPQEFSKKYLQAPK 112 2201 2202Query: 98 AIFTDDLPVAD--YLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 2203 + + + +A+ +++ PP ++DWR +G +T VK QG CG W+FS TG + 2204Sbjct: 113 DV-SQQIKMANKKMKKEQYSCDHPP---ASWDWRKKGVITQVKYQGGCGRGWAFSATGAI 168 2205 2206Query: 156 EGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTE 215 2207 E H I+ LVSLSEQ LVDC E EG G Q ++ +++++GGI T+ 2208Sbjct: 169 EAAHAIATGDLVSLSEQELVDCVEE----------SEGSYNGWQYQSFEWVLEHGGIATD 218 2209 2210Query: 216 SSYPYTAETGTQCNFN----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271 2211pattern 237 **** 2212 YPY A+ G +C N I E +S+ + + E I+ P++++ D 2213Sbjct: 219 DDYPYRAKEG-RCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILEQ-PISVSID 276 2214 2215Query: 272 AVEWQFYIGGVFDIP--CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 329 2216 A ++ Y GG++D +P ++H +L+VGY + + + YWI KNSWG DWGE G 2217Sbjct: 277 AKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSAD-----GVDYWIAKNSWGFDWGEDG 331 2218 2219Query: 330 YIYLRRGK----NTCGVSNFVS 347 2220 YI+++R CG++ F S 2221Sbjct: 332 YIWIQRNTGNLLGVCGMNYFAS 353 2222 2223 2224>sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 2225 Length = 333 2226 2227 Score = 180 bits (451), Expect = 5e-45 2228 Identities = 115/332 (34%), Positives = 166/332 (49%), Gaps = 36/332 (10%) 2229 2230Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 2231 E+ F + + K YS EY R ++F +N KI+ N NH K +N+F+D+S 2232Sbjct: 29 EKFHFKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHN--QRNHTF--KMALNQFSDMS 84 2233 2234Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143 2235 E K+ +L ++ ++ P ++ DWR +G V+PVKNQG C 2236Sbjct: 85 FAEIKHKFLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136 2237 2238Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 2239 SCW+FSTTG +E I+ K++SL+EQ LVDC + + GC GGL A+ 2240Sbjct: 137 ASCWTFSTTGALESAVAIASGKMLSLAEQQLVDC--------AQAFNNHGCKGGLPSQAF 188 2241 2242Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVS 262 2243pattern 237 **** 2244 YI+ N GI E SYPY + + C FN + A + N I N E M + 2245Sbjct: 189 EYILYNKGIMEEDSYPYIGK-DSSCRFNP----QKAVAFVKNVVNITLNDEAAMVEAVAL 243 2246 2247Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPC---NPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318 2248 P++ A + E + Y GV+ P+ ++H +L VGY +N + YWIVK 2249Sbjct: 244 YNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVK 298 2250 2251Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 2252 NSWG+ WGE GY + RGKN CG++ S I 2253Sbjct: 299 NSWGSQWGENGYFLIERGKNMCGLAACASYPI 330 2254 2255 2256>sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 2257 Length = 329 2258 2259 Score = 178 bits (447), Expect = 2e-44 2260 Identities = 117/352 (33%), Positives = 182/352 (51%), Gaps = 43/352 (12%) 2261 2262Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 2263 L V + + S + PEE +Q+ ++ K+Y+ + + + R I++ NL +I NL 2264Sbjct: 4 LKVLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLE 63 2265 2266Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD-----EFINSIPP 120 2267 A + +N D++S+E + P Y +D E+ +P 2268Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIP------PSRSYSNDTLYTPEWEGRVPD 117 2269 2270Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 2271 + D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 2272Sbjct: 118 ----SIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE 173 2273 2274Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 2275pattern 237 **** 2276 + GC GG A+ Y+ +NGGI +E ++PY + C +N+ + 2277Sbjct: 174 ----------NYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQ-DESCMYNAT----AKA 218 2278 2279Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGI 296 2280 AK + IP NE + + GP++++ DA +QFY GV +D C+ ++++H + 2281Sbjct: 219 AKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAV 278 2282 2283Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 2284 L+VGY +K +WI+KNSWG WG +GY L R K N CG++N S 2285Sbjct: 279 LVVGYGT-----QKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325 2286 2287 2288>sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 2289 Length = 376 2290 2291 Score = 177 bits (445), Expect = 3e-44 2292 Identities = 112/351 (31%), Positives = 171/351 (47%), Gaps = 47/351 (13%) 2293 2294Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 2295 P E + F FQ +FN+ Y S EE+ R +IF NL + + L + +FGV F 2296Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLG---TAEFGVTPF 91 2297 2298Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAF--DWR-TRGAVTPV 137 2299 +DL+ +EF Y + A + I S PEE F DWR GA++P+ 2300Sbjct: 92 SDLTEEEFGQLYGYRRAAGGVPSM-------GREIRSEEPEESVPFSCDWRKVAGAISPI 144 2301 2302Query: 138 KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGG 197 2303 K+Q C CW+ + GN+E IS V +S L+DC C +GC+GG 2304Sbjct: 145 KDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCGR----------CGDGCHGG 194 2305 2306Query: 198 LQPNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVM 256 2307pattern 237 **** 2308 +A+ ++ N G+ +E YP+ + +C+ ++ A I +F M+ NE + 2309Sbjct: 195 FVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY----QKVAWIQDFIMLQNNEHRI 250 2310 2311Query: 257 AGYIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSA--------KN 305 2312 A Y+ + GP+ + + Q Y GV C+P +DH +L+VG+ + 2313Sbjct: 251 AQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAE 310 2314 2315Query: 306 TIFRKNMP-------YWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 2316 T+ ++ P YWI+KNSWGA WGE+GY L RG NTCG++ F T+ 2317Sbjct: 311 TVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTA 361 2318 2319 2320>sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 2321 Length = 371 2322 2323 Score = 176 bits (442), Expect = 6e-44 2324 Identities = 110/346 (31%), Positives = 166/346 (47%), Gaps = 40/346 (11%) 2325 2326Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 2327 P E + F FQ +FN+ Y + EY R IF NL + + L + +FG F 2328Sbjct: 33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLG---TAEFGETPF 89 2329 2330Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWR-TRGAVTPVKN 139 2331 +DL+ +EF Y + T ++ + + S+P DWR + ++ VKN 2332Sbjct: 90 SDLTEEEFGQLYGQERSPERTPNM-TKKVESNTWGESVP----RTCDWRKAKNIISSVKN 144 2333 2334Query: 140 QGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQ 199 2335 QG C CW+ + N++ I + V +S Q L+DC E C GCNGG 2336Sbjct: 145 QGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC----------ERCGNGCNGGFV 194 2337 2338Query: 200 PNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 2339pattern 237 **** 2340 +AY ++ N G+ +E YP+ + +C ++ A I +FTM+ NE +A 2341Sbjct: 195 WDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKY----KKVAWIQDFTMLSNNEQAIAH 250 2342 2343Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSAKN------TIF- 308 2344 Y+ GP+ + + Q Y GV C+P +DH +L+VG+ K T+ 2345Sbjct: 251 YLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLS 310 2346 2347Query: 309 -----RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349 2348 R + PYWI+KNSWGA WGE+GY L RG NTCGV+ + T+ 2349Sbjct: 311 HSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTA 356 2350 2351 2352>sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 2353 Length = 321 2354 2355 Score = 173 bits (435), Expect = 4e-43 2356 Identities = 100/304 (32%), Positives = 152/304 (49%), Gaps = 30/304 (9%) 2357 2358Query: 52 FKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLD 111 2359 F+ +L + LN + + + +G+N+F+ L +EFK YL +K + F 2360Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPR-------YS 96 2361 2362Query: 112 DEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170 2363 E SIP FDWR + VT V+NQ CG CW+FS G VE + I L LS 2364Sbjct: 97 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS 156 2365 2366Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK-NGGIQTESSYPYTAETGTQCN 229 2367 Q ++DC + + GCNGG NA N++ K + +S YP+ A+ G C+ 2368Sbjct: 157 VQQVIDCSYN----------NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGL-CH 205 2369 2370Query: 230 FNSANIGPEEQAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPC 287 2371pattern 237 **** 2372 + S G I ++ E MA +++ GPL + DAV WQ Y+GG+ C 2373Sbjct: 206 YFS---GSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC 262 2374 2375Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVS 347 2376 + +H +LI G+ + PYWIV+NSWG+ WG GY +++ G N CG+++ VS 2377Sbjct: 263 SSGEANHAVLITGFDKTG-----STPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVS 317 2378 2379Query: 348 TSII 351 2380 + + 2381Sbjct: 318 SIFV 321 2382 2383 2384>sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 2385 Length = 345 2386 2387 Score = 173 bits (433), Expect = 7e-43 2388 Identities = 119/322 (36%), Positives = 163/322 (49%), Gaps = 43/322 (13%) 2389 2390Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 2391 K NK Y + +E + RFEIFK NL I+E N K + + G+N FAD+S+DEFK 2392Sbjct: 54 KHNKIYKNIDEKIYRFEIFKDNLKYIDETN------KKNNSYWLGLNVFADMSNDEFKEK 107 2393 2394Query: 92 YLNNKEAIFTD-DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150 2395 Y + +T +L + L+D +N PE DWR +GAVTPVKNQG CGSCW+FS 2396Sbjct: 108 YTGSIAGNYTTTELSYEEVLNDGDVNI--PEY---VDWRQKGAVTPVKNQGSCGSCWAFS 162 2397 2398Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210 2399 +EG I L SEQ L+DCD GCNGG +A ++ 2400Sbjct: 163 AVVTIEGIIKIRTGNLNEYSEQELLDCDRR----------SYGCNGGYPWSALQ-LVAQY 211 2401 2402Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAA 270 2403pattern 237 **** 2404 GI ++YPY G Q S GP + P NE + Y ++ P+++ 2405Sbjct: 212 GIHYRNTYPY---EGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALL-YSIANQPVSVVL 267 2406 2407Query: 271 DAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 328 2408 +A ++Q Y GG+F PC N +DH + VGY Y ++KNSWG WGE 2409Sbjct: 268 EAAGKDFQLYRGGIFVGPCG-NKVDHAVAAVGYGPN---------YILIKNSWGTGWGEN 317 2410 2411Query: 329 GYIYLRRGK-NTCGVSNFVSTS 349 2412 GYI ++RG N+ GV ++S 2413Sbjct: 318 GYIRIKRGTGNSYGVCGLYTSS 339 2414 2415 2416>sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 2417 Length = 331 2418 2419 Score = 171 bits (428), Expect = 3e-42 2420 Identities = 116/351 (33%), Positives = 175/351 (49%), Gaps = 35/351 (9%) 2421 2422Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYS--HEEYLERFEIFKSNLGKIEEL 62 2423 L+ VL V + V+ P + ++ + K+Y +EE + R I++ NL + 2424Sbjct: 4 LVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRL-IWEKNLKFVMLH 62 2425 2426Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 2427 NL G+N D++S+E + T L V P 2428Sbjct: 63 NLEHSMGMHSYDLGMNHLGDMTSEEVMS---------LTSSLRVPSQWQRNITYKSNPNR 113 2429 2430Query: 123 --QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180 2431 + DWR +G VT VK QG CG+CW+FS G +E Q + KLV+LS QNLVDC 2432Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDC--- 170 2433 2434Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240 2435pattern 237 **** 2436 E+ ++GCNGG A+ YII N GI +++SYPY A +C ++S 2437Sbjct: 171 ----STEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKA-MDQKCQYDS----KYRA 221 2438 2439Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGIL 297 2440 A S +T +P E V+ + + GP+++ DA F++ GV+ P +++HG+L 2441Sbjct: 222 ATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVL 281 2442 2443Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 2444 +VGY N YW+VKNSWG ++GE+GYI + R K N CG+++F S 2445Sbjct: 282 VVGYGDLN-----GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPS 327 2446 2447 2448>sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 2449 Length = 176 2450 2451 Score = 167 bits (420), Expect = 2e-41 2452 Identities = 87/179 (48%), Positives = 115/179 (63%), Gaps = 16/179 (8%) 2453 2454Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 2455 DWR +G VTPVK+QGQCGSCW+FSTTG +EGQHF ++ KLVSLSEQNLVDC EG 2456Sbjct: 6 DWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP----EG 61 2457 2458Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246 2459pattern 237 **** 2460 ++GCNGGL A+ Y+ NGGI +E SYPYTA+ C + + A + F 2461Sbjct: 62 ----NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKA----EYNAANDTGF 113 2462 2463Query: 247 TMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 301 2464 IP+ +E + + S GP+++A DA +QFY G++ P C+ LDHG+L+VGY 2465Sbjct: 114 VDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGY 172 2466 2467 2468>sp|P25326|CATS_BOVIN CATHEPSIN S 2469 Length = 217 2470 2471 Score = 165 bits (413), Expect = 1e-40 2472 Identities = 90/227 (39%), Positives = 129/227 (56%), Gaps = 21/227 (9%) 2473 2474Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 2475 + DWR +G VT VK QG CGSCW+FS G +E Q + KLVSLS QNLVDC 2476Sbjct: 4 SMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC------- 56 2477 2478Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 2479pattern 237 **** 2480 + ++GCNGG A+ YII N GI +E+SYPY A G +C ++ N A S 2481Sbjct: 57 STAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG-KCQYDVKN----RAATCS 111 2482 2483Query: 245 NFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGILIVGY 301 2484 + +P +E + + + GP+++ DA F++ GV+ P +++HG+L+VGY 2485Sbjct: 112 RYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGY 171 2486 2487Query: 302 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347 2488 + YW+VKNSWG +G+QGYI + R N CG++N+ S 2489Sbjct: 172 GNLD-----GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPS 213 2490 2491 2492>sp|P80884|ANAN_ANACO ANANAIN 2493 Length = 216 2494 2495 Score = 161 bits (403), Expect = 2e-39 2496 Identities = 93/224 (41%), Positives = 123/224 (54%), Gaps = 26/224 (11%) 2497 2498Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 2499 + DWR GAVT VKNQG+CGSCW+F++ VE + I + LVSLSEQ ++DC 2500Sbjct: 4 SIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC------- 56 2501 2502Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 2503pattern 237 **** 2504 A GC GG AY++II N G+ + + YPY A GT C N G A I+ 2505Sbjct: 57 ----AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGT-CKTN----GVPNSAYIT 107 2506 2507Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 2508 +T + +N Y VS P+A A DA +Q Y GVF PC L+H I+I+GY 2509Sbjct: 108 RYTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCG-TRLNHAIVIIGYGQ 166 2510 2511Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343 2512 + +WIV+NSWGA WGE GYI L R ++ CG++ 2513Sbjct: 167 DSA----GKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGICGIA 206 2514 2515 2516>sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 2517 Length = 330 2518 2519 Score = 158 bits (396), Expect = 1e-38 2520 Identities = 89/226 (39%), Positives = 128/226 (56%), Gaps = 22/226 (9%) 2521 2522Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186 2523 DWR +G VT VK QG CGSCW+FS G +EGQ + KLVSLS QNLVDC E 2524Sbjct: 118 DWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE------ 171 2525 2526Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246 2527pattern 237 **** 2528 E+ ++GC GG A+ YII + I +E+SYPY A +C ++ N A S + 2529Sbjct: 172 EKYGNKGCGGGFMTEAFQYII-DTSIDSEASYPYKA-MDEKCLYDPKN----RAATCSRY 225 2530 2531Query: 247 TMIP-KNETVMAGYIVSTGPLAIAADAV---EWQFYIGGVFDIPCNPNSLDHGILIVGYS 302 2532 +P +E + + + GP+++ D + Y GV+D P +++HG+L+VGY 2533Sbjct: 226 IELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYG 285 2534 2535Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYL-RRGKNTCGVSNFVS 347 2536 + YW+VKNSWG +G+QGYI + R KN CG++++ S 2537Sbjct: 286 TLD-----GKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCS 326 2538 2539 2540>sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE PRECURSOR 2541 Length = 346 2542 2543 Score = 158 bits (395), Expect = 2e-38 2544 Identities = 87/238 (36%), Positives = 130/238 (54%), Gaps = 25/238 (10%) 2545 2546Query: 112 DEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSE 171 2547 D ++ + + DWR +G + VK+QG CGSCW+FS +E + I L+SLSE 2548Sbjct: 8 DRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSE 67 2549 2550Query: 172 QNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFN 231 2551 Q LVDCD + +EGC+GGL A+ ++IKNGGI TE YPY G C+ 2552Sbjct: 68 QELVDCD---------RSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGV-CDQY 117 2553 2554Query: 232 SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNP 289 2555pattern 237 **** 2556 N + KI ++ +P N V+ P++IA +A ++Q Y G+F C 2557Sbjct: 118 RKN---AKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCG- 173 2558 2559Query: 290 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343 2560 ++DHG++I GY +N M YWIV+NSWGA+ E GY+ ++R ++ CG++ 2561Sbjct: 174 TAVDHGVVIAGYGTEN-----GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLA 226 2562 2563 2564>sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 2565 Length = 308 2566 2567 Score = 152 bits (379), Expect = 1e-36 2568 Identities = 105/320 (32%), Positives = 151/320 (46%), Gaps = 48/320 (15%) 2569 2570Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 2571 F ++ NK +++ EYL RF +F N +E A+ +N FAD++ +E 2572Sbjct: 18 FKQWAATHNKVFANRAEYLYRFAVFLDNKKFVE----------ANANTELNVFADMTHEE 67 2573 2574Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147 2575 F +L T ++P + + P + DWR+ + P K+QGQCGSCW 2576Sbjct: 68 FIQTHLG-----MTYEVPETTSNVKAAVKAAPE----SVDWRS--IMNPAKDQGQCGSCW 116 2577 2578Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207 2579 +F TT +EG+ KL S SEQ LVDCD A D GC GG N+ +I 2580Sbjct: 117 TFCTTAVLEGRVNKDLGKLYSFSEQQLVDCD----------ASDNGCEGGHPSNSLKFIQ 166 2581 2582Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 2583pattern 237 **** 2584 +N G+ ES YPY A GT C N+ ++ + +ET + I GP+A 2585Sbjct: 167 ENNGLGLESDYPYKAVAGT-CK-KVKNVATVTGSR----RVTDGSETGLQTIIAENGPVA 220 2586 2587Query: 268 IAADA--VEWQFYIGGVF--DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 323 2588 + DA +Q Y G D C ++H + VGY + + N YWI++NSWG 2589Sbjct: 221 VGMDASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNS-----NGKYWIIRNSWGT 275 2590 2591Query: 324 DWGEQGYIYLRR-GKNTCGV 342 2592 WG+ GY L R N CG+ 2593Sbjct: 276 SWGDAGYFLLARDSNNMCGI 295 2594 2595 2596>sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 2597 Length = 315 2598 2599 Score = 150 bits (375), Expect = 4e-36 2600 Identities = 103/317 (32%), Positives = 163/317 (50%), Gaps = 47/317 (14%) 2601 2602Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADT-KFGVN-KFADLSSDEFKNYYLN 94 2603 NK ++ E L R IF N ++A N++ +T K V+ FA ++++E+ + 2604Sbjct: 24 NKHFTAVESLRRRAIFNMNA------RIVAENNRKETFKLSVDGPFAAMTNEEYNSLLKL 77 2605 2606Query: 95 NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 154 2607 + ++ ++N P+ A DWR +G VTP+++QG CGSC++F + 2608Sbjct: 78 KRSGEEKGEV--------RYLNIQAPK---AVDWRKKGKVTPIRDQGNCGSCYTFGSIAA 126 2609 2610Query: 155 VEGQHFISQ---NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 211 2611 +EG+ I + ++ + LSE+++V C E +G + GCNGGL N YNYI++N G 2612Sbjct: 127 LEGRLLIEKGGDSETLDLSEEHMVQCTRE----DG----NNGCNGGLGSNVYNYIMEN-G 177 2613 2614Query: 212 IQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271 2615pattern 237 **** 2616 I ES YPYT T + AKI ++ + +N V +S G + ++ D 2617Sbjct: 178 IAKESDYPYTGSDST------CRSDVKAFAKIKSYNRVARNNEVELKAAISQGLVDVSID 231 2618 2619Query: 272 A--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 2620 A V++Q Y G + D C N +L+H + VGY + WIV+NSWG WG 2621Sbjct: 232 ASSVQFQLYKSGAYTDTQCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWG 286 2622 2623Query: 327 EQGYIYLRRGKNTCGVS 343 2624 E+GYI + NTCGV+ 2625Sbjct: 287 EKGYINMVIEGNTCGVA 303 2626 2627 2628>sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 2629 Length = 395 2630 2631 Score = 150 bits (374), Expect = 6e-36 2632 Identities = 101/331 (30%), Positives = 157/331 (46%), Gaps = 29/331 (8%) 2633 2634Query: 26 QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2635 ++++ ++ K Y +E R IF+SN E +N +N ADL+ 2636Sbjct: 88 ETEWKDYVTALGKHYDQKENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTD 147 2637 2638Query: 86 DEF--KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQC 143 2639 +EF +N + +++ + +P + DWRT+GAVTPV+NQG+C 2640Sbjct: 148 EEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQ----VDWRTKGAVTPVRNQGEC 203 2641 2642Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 2643 GSC++F+T +E H +L+ LS QN+VDC + GC+GG P A+ 2644Sbjct: 204 GSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCT--------RNLGNNGCSGGYMPTAF 255 2645 2646Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI-PKNETVMAGYIVS 262 2647pattern 237 **** 2648 Y + GI ES YPY T +C + + + + F I P +E + + 2649Sbjct: 256 QYASRY-GIAMESRYPYVG-TEQRCRWQQSIAVVTD----NGFNEIQPGDELALKHAVAK 309 2650 2651Query: 263 TGP--LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 2652 GP + I+ ++FY GV+ N DH +L VGY + YWIVKNS 2653Sbjct: 310 RGPVVVGISGSKRSFRFYKDGVYS-EGNCGRPDHAVLAVGYGTHPSY----GDYWIVKNS 364 2654 2655Query: 321 WGADWGEQGYIYLRRGK-NTCGVSNFVSTSI 350 2656 WG DWG+ GY+Y+ R + N C +++ S I 2657Sbjct: 365 WGTDWGKDGYVYMARNRGNMCHIASAASFPI 395 2658 2659 2660>sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 2661 Length = 506 2662 2663 Score = 150 bits (374), Expect = 6e-36 2664 Identities = 116/363 (31%), Positives = 180/363 (48%), Gaps = 64/363 (17%) 2665 2666Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2667 S+F ++ + NKKY + +E L+RFE FK K ++ N + + VN+++D S 2668Sbjct: 160 SKFFKYMKENNKKYENMDEQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSK 219 2669 2670Query: 86 DEFKNYYLNNKEAIFTDDL------PVADYLDDEFINSIPPEEQT---AFDWRTRGAVTP 136 2671 +EF NY+ K DL P+ +L + + S+ + + + D+R++ P 2672Sbjct: 220 EEFDNYF--KKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLP 277 2673 2674Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSEQNLVDCDHECMEYEGEEACDEGCN 195 2675 K+QG CGSCW+F+ GN E + +++++ +S SEQ +VDC E + GC+ 2676Sbjct: 278 PKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE----------NYGCD 327 2677 2678Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQC-NFNSANIGPEEQAKISNFTMIPKNET 254 2679pattern 237 **** 2680 GG A+ Y+I NG + YPY C N+ + +G ++ + NE 2681Sbjct: 328 GGNPFYAFLYMINNG-VCLGDEYPYKGHEDFFCLNYRCSLLG-----RVHFIGDVKPNEL 381 2682 2683Query: 255 VMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSA---------- 303 2684 +MA V GP+ IA A E + Y GGVFD CNP L+H +L+VGY 2685Sbjct: 382 IMALNYV--GPVTIAVGASEDFVLYSGGVFDGECNPE-LNHSVLLVGYGQVKKSLAFEDS 438 2686 2687Query: 304 -----KNTI--FRKNMP---------YWIVKNSWGADWGEQGYIYLRRGK----NTCGVS 343 2688 N I +++N+ YWIV+NSWG +WGE GYI ++R K CGV 2689Sbjct: 439 HSNVDSNLIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVG 498 2690 2691Query: 344 NFV 346 2692 + V 2693Sbjct: 499 SDV 501 2694 2695 2696>sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE PROTEINASE ACP3) 2697 Length = 308 2698 2699 Score = 149 bits (372), Expect = 9e-36 2700 Identities = 103/316 (32%), Positives = 159/316 (49%), Gaps = 45/316 (14%) 2701 2702Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDEFKNYYLNN 95 2703 NK ++ E L R IF N + E N K K V+ FA ++++E++ L + 2704Sbjct: 17 NKHFTAVEALRRRAIFNMNARFVAEFN-----KKGSFKLSVDGPFAAMTNEEYRTL-LKS 70 2705 2706Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 2707 K + + ++N PE + DWR +G VTP+++Q QCGSC++F + + 2708Sbjct: 71 KRTVEENGKVT-------YLNIQAPE---SVDWRAQGKVTPIRDQAQCGSCYTFGSLAAL 120 2709 2710Query: 156 EGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 212 2711 EG+ I + + LSE++LV C + + GCNGGL N Y+YII+N G+ 2712Sbjct: 121 EGRLLIEKGGNANTLDLSEEHLVQCT--------RDNGNNGCNGGLGSNVYDYIIQN-GV 171 2713 2714Query: 213 QTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 272 2715pattern 237 **** 2716 ES YPYT T + C N + AKI+ + +P+N +S G + ++ DA 2717Sbjct: 172 AKESDYPYTG-TDSTCKTN-----VKAFAKITGYNKVPRNNEAELKAALSQGLVDVSIDA 225 2718 2719Query: 273 --VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327 2720 ++Q Y G + D C N +L+H + VGY + WIV+NSWG WG+ 2721Sbjct: 226 SSAKFQLYKSGAYSDTKCKNNFFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGD 280 2722 2723Query: 328 QGYIYLRRGKNTCGVS 343 2724 +GYI + NTCGV+ 2725Sbjct: 281 KGYINMVIEGNTCGVA 296 2726 2727 2728>sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 2729 Length = 315 2730 2731 Score = 149 bits (372), Expect = 9e-36 2732 Identities = 102/324 (31%), Positives = 161/324 (49%), Gaps = 45/324 (13%) 2733 2734Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87 2735 F + K NK ++ E L R IF N ++ N I K V+ FA ++++E 2736Sbjct: 16 FNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDGPFAAMTNEE 70 2737 2738Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147 2739 ++ + + T++ YL+ + S+ DWR G VTP+++Q QCGSC+ 2740Sbjct: 71 YRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPIRDQAQCGSCY 119 2741 2742Query: 148 SFSTTGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204 2743 +F + +EG+ I + + LSE+++V C + + GCNGGL N Y+ 2744Sbjct: 120 TFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCT--------RDNGNNGCNGGLGSNVYD 171 2745 2746Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264 2747pattern 237 **** 2748 YII++ G+ ES YPYT T C N + AKI+ +T +P+N +S G 2749Sbjct: 172 YIIEH-GVAKESDYPYTGSDST-CKTNVKSF-----AKITGYTKVPRNNEAELKAALSQG 224 2750 2751Query: 265 PLAIAADA--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKN 319 2752 + ++ DA ++Q Y G + D C N +L+H + VGY + WIV+N 2753Sbjct: 225 LVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRN 279 2754 2755Query: 320 SWGADWGEQGYIYLRRGKNTCGVS 343 2756 SWG WG++GYI + NTCGV+ 2757Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVA 303 2758 2759 2760>sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 2761 Length = 310 2762 2763 Score = 145 bits (363), Expect = 1e-34 2764 Identities = 102/330 (30%), Positives = 160/330 (47%), Gaps = 40/330 (12%) 2765 2766Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN- 78 2767 GI F + K NK ++ E L R IF N ++ N I K V+ 2768Sbjct: 3 GIRIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDG 57 2769 2770Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138 2771 FA ++++E++ + + T++ YL+ + S+ DWR G VTP++ 2772Sbjct: 58 PFAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPLR 106 2773 2774Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 2775 +Q QCGSC++F + +EG+ I + + N +D E M+ + + GCNGGL 2776Sbjct: 107 DQAQCGSCYTFGSLAALEGRLLIEKG-----GDANTLDLSEEHMQCTRDNG-NNGCNGGL 160 2777 2778Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258 2779pattern 237 **** 2780 N Y+YII++G + ES YPYT T C N + KI+ +T +P+N 2781Sbjct: 161 GSNVYDYIIEHG-VAKESDYPYTGSDST-CKTNVKSF-----RKITGYTKVPRNNEAELK 213 2782 2783Query: 259 YIVSTGPLAIAAD--AVEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMP 313 2784 +S G L ++ D + ++Q Y G + D C N +L+H + VGY + 2785Sbjct: 214 AALSQGLLDVSIDVSSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKE 268 2786 2787Query: 314 YWIVKNSWGADWGEQGYIYLRRGKNTCGVS 343 2788 WIV+NSWG WG++GYI + NTCGV+ 2789Sbjct: 269 CWIVRNSWGTSWGDKGYINMVIEGNTCGVA 298 2790 2791 2792>sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 2793 Length = 441 2794 2795 Score = 145 bits (362), Expect = 1e-34 2796 Identities = 107/345 (31%), Positives = 165/345 (47%), Gaps = 58/345 (16%) 2797 2798Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--NKFADLS 84 2799 +F F +K+ K + S ++ ++RF F+ N ++ HK + + NKF+DLS 2800Sbjct: 119 EFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVK-------THKPTEPYSLDLNKFSDLS 171 2801 2802Query: 85 SDEFKNYY--------------------LNNKEAIFTDDLPVADYLDDEFINSIPPEEQT 124 2803 +EFK Y +++K I+ L A +++ S+ E 2804Sbjct: 172 DEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSLITGEN- 230 2805 2806Query: 125 AFDWRTRGAVTPVKNQGQ-CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183 2807 +W AV+P K+QG CGSCW+FS+ +VE + + +NK LSEQ LV+CD M 2808Sbjct: 231 -LNWARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCDKSSM- 288 2809 2810Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243 2811pattern 237 **** 2812 GC GGL A Y I + G+ ES PYT + C + N + I 2813Sbjct: 289 ---------GCAGGLPITALEY-IHSKGVSFESEVPYTGIV-SPCKPSIKN-----KVFI 332 2814 2815Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 2816 + +++ N+ V ++S + IA E + Y GG+F C L+H +L+VG 2817Sbjct: 333 DSISILKGNDVVNKSLVISPTVVGIAV-TKELKLYSGGIFTGKCG-GELNHAVLLVGEGV 390 2818 2819Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNF 345 2820 + M YWI+KNSWG DWGE G++ L+R G + CG+ F 2821Sbjct: 391 DH---ETGMRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTF 432 2822 2823 2824>sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 2825 Length = 439 2826 2827 Score = 143 bits (357), Expect = 5e-34 2828 Identities = 105/351 (29%), Positives = 163/351 (45%), Gaps = 72/351 (20%) 2829 2830Query: 24 EEQSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKF 80 2831 E +F EF K+N++++ +E L R F+SN +++E K D + G+N+F 2832Sbjct: 119 EVYREFEEFNSKYNRRHATQQERLNRLVTFRSNYLEVKE-------QKGDEPYVKGINRF 171 2833 2834Query: 81 ADLSSDEF--------------------------KNYYLNNKEAIFTDDLPVADYLDDEF 114 2835 +DL+ EF K Y N K+A+ TD+ D 2836Sbjct: 172 SDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDE--------DVD 223 2837 2838Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174 2839 + + E DWR +VT VK+Q CG CW+FST G+VEG + +K LS Q L 2840Sbjct: 224 LAKLTGEN---LDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQEL 280 2841 2842Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234 2843 +DCD + GC GGL +AY Y+ K G+ + P+ + +C+ A 2844Sbjct: 281 LDCD----------SFSNGCQGGLLESAYEYVRKY-GLVSAKDLPF-VDKARRCSVPKA- 327 2845 2846Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH 294 2847pattern 237 **** 2848 ++ + ++ + K + VM + S+ + + E Y GVF C SL+H 2849Sbjct: 328 ----KKVSVPSYHVF-KGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECG-KSLNH 381 2850 2851Query: 295 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGV 342 2852 +++VG ++ YW+V+NSWG DWGE GY+ L R G + CGV 2853Sbjct: 382 AVVLVGEGYDEVTKKR---YWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 429 2854 2855 2856>sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR (TCP) 2857 Length = 569 2858 2859 Score = 141 bits (351), Expect = 3e-33 2860 Identities = 107/367 (29%), Positives = 169/367 (45%), Gaps = 62/367 (16%) 2861 2862Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2863 S+F +F + NK Y + +E + +FEIFK N I+ N +N A K VN+F+D S 2864Sbjct: 223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN--KLNKNAMYKKKVNQFSDYSE 280 2865 2866Query: 86 DEFKNYYLN----NKEAIFTDDLPVADYLDD-----EFINSIPPEEQTAF-------DWR 129 2867 +E K Y+ I P ++L D EF + E+ F D+R 2868Sbjct: 281 EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340 2869 2870Query: 130 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 189 2871 +G V K+QG CGSCW+F++ GN+E ++S SEQ +VDC + 2872Sbjct: 341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--------- 391 2873 2874Query: 190 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI 249 2875pattern 237 **** 2876 + GC+GG ++ Y+++N + Y Y A+ C N + + +S+ + 2877Sbjct: 392 -NFGCDGGHPFYSFLYVLQN-ELCLGDEYKYKAKDDMFC----LNYRCKRKVSLSSIGAV 445 2878 2879Query: 250 PKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY------- 301 2880 +N+ ++A + GPL++ ++ Y GV++ C+ L+H +L+VGY 2881Sbjct: 446 KENQLILA--LNEVGPLSVNVGVNNDFVAYSEGVYNGTCS-EELNHSVLLVGYGQVEKTK 502 2882 2883Query: 302 -------SAKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKN----TCGVSN 344 2884 NT N P YWI+KNSW WGE G++ L R KN CG+ 2885Sbjct: 503 LNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE 562 2886 2887Query: 345 FVSTSII 351 2888 V I+ 2889Sbjct: 563 EVFYPIL 569 2890 2891 2892>sp|P14518|BROM_ANACO BROMELAIN, STEM 2893 Length = 212 2894 2895 Score = 139 bits (348), Expect = 6e-33 2896 Identities = 81/224 (36%), Positives = 113/224 (50%), Gaps = 31/224 (13%) 2897 2898Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184 2899 + DWR GAVT VKNQ CG+CW+F+ VE + I + L LSEQ ++DC 2900Sbjct: 5 SIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC------- 57 2901 2902Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 2903pattern 237 **** 2904 A GC GG + A+ +II N G+ + + YPY A GT C + G A I+ 2905Sbjct: 58 ----AKGYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTD----GVPNSAYIT 108 2906 2907Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303 2908 + +P+N Y VS P+ +A DA +Q+Y GVF+ PC SL+H + +GY 2909Sbjct: 109 GYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCG-TSLNHAVTAIGYGQ 167 2910 2911Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343 2912 + I+ K WGA WGE GYI + R CG++ 2913Sbjct: 168 DSIIYPK---------KWGAKWGEAGYIRMARDVSSSSGICGIA 202 2914 2915 2916>sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR (DER F I) 2917 Length = 321 2918 2919 Score = 138 bits (345), Expect = 1e-32 2920 Identities = 115/352 (32%), Positives = 157/352 (43%), Gaps = 52/352 (14%) 2921 2922Query: 7 FVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 2923 FVLA+ ++ V S P F EF+ FNK Y+ +E E+ + N +E L + 2924Sbjct: 3 FVLAIASLLVLSTVYARPASIKTFEEFKKAFNKNYAT---VEEEEVARKNF--LESLKYV 57 2925 2926Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----INSIPPE 121 2927 N K +N +DLS DEFKN YL + EA + L L+ E INS+ 2928Sbjct: 58 EAN-----KGAINHLSDLSLDEFKNRYLMSAEAF--EQLKTQFDLNAETSACRINSVNVP 110 2929 2930Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181 2931 + D R+ VTP++ QG CGSCW+FS E + +N + LSEQ LVDC 2932Sbjct: 111 SE--LDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDC---- 164 2933 2934Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQA 241 2935pattern 237 **** 2936 A GC+G P YI +NG ++ E SYPY A NS + G 2937Sbjct: 165 -------ASQHGCHGDTIPRGIEYIQQNGVVE-ERSYPYVAREQRCRRPNSQHYG----- 211 2938 2939Query: 242 KISNFTMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLD 293 2940 ISN+ I + ++ AIA D +Q Y G D PN 2941Sbjct: 212 -ISNYCQIYPPDVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNY-- 268 2942 2943Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345 2944 H + IVGY + + YWIV+NSW WG+ GY Y + G N + + 2945Sbjct: 269 HAVNIVGYGS-----TQGDDYWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQY 315 2946 2947 2948>sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 2949 Length = 583 2950 2951 Score = 129 bits (320), Expect = 1e-29 2952 Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 84/370 (22%) 2953 2954Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2955 S+F F +K+ + Y E +E+++ FK N KI++ N K VN+F+D S 2956Sbjct: 235 SKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHN----ETNQMYKMKVNQFSDYSK 290 2957 2958Query: 86 DEFKNYYLNNKEAIFTDDLPVADYLDDEFI--------------------NSIPPEEQTA 125 2959 +F++Y F +P+ D+L +++ ++ + 2960Sbjct: 291 KDFESY--------FRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEI 342 2961 2962Query: 126 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK-LVSLSEQNLVDCDHECMEY 184 2963 D+R +G V K+QG CGSCW+F++ GNVE + NK +++LSEQ +VDC 2964Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDC------- 395 2965 2966Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244 2967pattern 237 **** 2968 + GC+GG ++ Y I+N GI Y Y A C N + + +S 2969Sbjct: 396 ---SKLNFGCDGGHPFYSFIYAIEN-GICMGDDYKYKAMDNLFC----LNYRCKNKVTLS 447 2970 2971Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYS- 302 2972 + + +NE + A + GP+++ ++ FY GG+F+ C L+H +L+VGY 2973Sbjct: 448 SVGGVKENELIRA--LNEVGPVSVNVGVTDDFSFYGGGIFNGTCT-EELNHSVLLVGYGQ 504 2974 2975Query: 303 -AKNTIFRKN-------------------------MPYWIVKNSWGADWGEQGYIYLRRG 336 2976 + IF++ YWI+KNSW WGE G++ + R 2977Sbjct: 505 VQSSKIFQEKNAYDDASGVTKKGALSYPSKADDGIQYYWIIKNSWSKFWGENGFMRISRN 564 2978 2979Query: 337 KN----TCGV 342 2980 K CG+ 2981Sbjct: 565 KEGDNVFCGI 574 2982 2983 2984>sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR (DER P I) 2985 Length = 320 2986 2987 Score = 121 bits (300), Expect = 3e-27 2988 Identities = 111/345 (32%), Positives = 151/345 (43%), Gaps = 57/345 (16%) 2989 2990Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 2991 MK++L + V +R P F E++ FNK Y+ E E + N +E 2992Sbjct: 1 MKIVLAIASLLALSAVYAR---PSSIKTFEEYKKAFNKSYAT---FEDEEAARKNF--LE 52 2993 2994Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----IN 116 2995 + + N A +N +DLS DEFKN +L + EA + L L+ E IN 2996Sbjct: 53 SVKYVQSNGGA-----INHLSDLSLDEFKNRFLMSAEAF--EHLKTQFDLNAETNACSIN 105 2997 2998Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 2999 P E D R VTP++ QG CGSCW+FS E + +N+ + L+EQ LVD 3000Sbjct: 106 GNAPAE---IDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNQSLDLAEQELVD 162 3001 3002Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 3003 C A GC+G P YI NG +Q ES Y Y A + N+ G 3004Sbjct: 163 C-----------ASQHGCHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFG 210 3005 3006Query: 237 PEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPC 287 3007pattern 237 **** 3008 ISN+ I P N + + T AIA D ++ Y G D 3009Sbjct: 211 ------ISNYCQIYPPNVNKIREALAQTHS-AIAVIIGIKDLDAFRHYDGRTIIQRDNGY 263 3010 3011Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332 3012 PN H + IVGYS + + YWIV+NSW +WG+ GY Y 3013Sbjct: 264 QPNY--HAVNIVGYSN-----AQGVDYWIVRNSWDTNWGDNGYGY 301 3014 3015 3016>sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 3017 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 3018 Length = 462 3019 3020 Score = 111 bits (274), Expect = 3e-24 3021 Identities = 83/260 (31%), Positives = 128/260 (48%), Gaps = 34/260 (13%) 3022 3023Query: 105 PVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGSCWSFSTTGNVEGQHFI 161 3024 P+ D + + + S+P ++DWR RG V+PV+NQ CGSC+SF++ G +E + I 3025Sbjct: 218 PITDEIQQQIL-SLPE----SWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRI 272 3026 3027Query: 162 SQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYP 219 3028 N + LS Q +V C +GC+GG ++ G+ E+ +P 3029Sbjct: 273 LTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEENCFP 322 3030 3031Query: 220 YTAETGTQCN--FNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQ 276 3032pattern 237 **** 3033 YTA T C N E + F NE +M +V GP+A+A + + + 3034Sbjct: 323 YTA-TDAPCKPKENCLRYYSSEYYYVGGFYG-GCNEALMKLELVKHGPMAVAFEVHDDFL 380 3035 3036Query: 277 FYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGY 330 3037 Y G++ P NP L +H +L+VGY K+ + + YWIVKNSWG+ WGE GY 3038Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYG-KDPV--TGLDYWIVKNSWGSQWGESGY 437 3039 3040Query: 331 IYLRRGKNTCGVSNFVSTSI 350 3041 +RRG + C + + +I 3042Sbjct: 438 FRIRRGTDECAIESIAMAAI 457 3043 3044 3045>sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 3046 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 3047 Length = 462 3048 3049 Score = 109 bits (270), Expect = 9e-24 3050 Identities = 91/335 (27%), Positives = 155/335 (46%), Gaps = 42/335 (12%) 3051 3052Query: 34 DKFNKKYSH-----EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF 88 3053 +K N +H E Y ER ++ N ++ +N + K+ T ++ +S + 3054Sbjct: 147 EKVNMNAAHLGGLQERYSER--LYTHNHNFVKAINTV---QKSWTATAYKEYEKMSLRDL 201 3055 3056Query: 89 KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGS 145 3057 +++ P+ D + + +N PE ++DWR +G V+PV+NQ CGS 3058Sbjct: 202 IRRSGHSQRIPRPKPAPMTDEIQQQILNL--PE---SWDWRNVQGVNYVSPVRNQESCGS 256 3059 3060Query: 146 CWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203 3061 C+SF++ G +E + I N + LS Q +V C +GC+GG 3062Sbjct: 257 CYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIA 306 3063 3064Query: 204 NYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262 3065pattern 237 **** 3066 ++ G+ ES +PYTA ++ + N + + F NE +M +V 3067Sbjct: 307 GKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYYVGGFYG-GCNEALMKLELVK 365 3068 3069Query: 263 TGPLAIAADAVE-WQFYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYW 315 3070 GP+A+A + + + Y G++ P NP L +H +L+VGY + YW 3071Sbjct: 366 HGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVT---GIEYW 422 3072 3073Query: 316 IVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 3074 I+KNSWG++WGE GY +RRG + C + + +I 3075Sbjct: 423 IIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAI 457 3076 3077 3078>sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN) (PDP) 3079 Length = 139 3080 3081 Score = 108 bits (267), Expect = 2e-23 3082 Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 9/145 (6%) 3083 3084Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETV 255 3085pattern 237 **** 3086 GGL +A+ Y+ NGG+ +E SYPY A+ G C + N A ++++ IP E 3087Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQ-GDSCKYRPEN----SVANVTDYWDIPSKENE 55 3088 3089Query: 256 MAGYIVSTGPLAIAADAV--EWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNM 312 3090 + + + GP++ A DA ++FY G++ D C+ +DHG+L+VGY A T +N 3091Sbjct: 56 LMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTE-TENK 114 3092 3093Query: 313 PYWIVKNSWGADWGEQGYIYLRRGK 337 3094 YWI+KNSWG DWG GYI + + + 3095Sbjct: 115 KYWIIKNSWGTDWGMDGYIKMAKDR 139 3096 3097 3098>sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 3099 Length = 454 3100 3101 Score = 108 bits (266), Expect = 3e-23 3102 Identities = 75/238 (31%), Positives = 109/238 (45%), Gaps = 33/238 (13%) 3103 3104Query: 126 FDWRT-----RGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCD 178 3105 FDW + R VTP++NQG CGSC++ + +E + + N + LS Q +VDC 3106Sbjct: 222 FDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCS 281 3107 3108Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF--NSANIG 236 3109 EGCNGG ++ G+ + PYT E +C N 3110Sbjct: 282 ----------PYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCTRYY 331 3111 3112Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC-------- 287 3113pattern 237 **** 3114 + + I + NE +M ++S GP + + E +QFY G++ 3115Sbjct: 332 TTDYSYIGGYYGAT-NEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNF 390 3116 3117Query: 288 NPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 3118 NP L +H +L+VGY PYW VKNSWG +WGEQGY + RG + CGV + 3119Sbjct: 391 NPFELTNHAVLLVGYGVDKL---SGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445 3120 3121 3122>sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 3123 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 3124 Length = 463 3125 3126 Score = 107 bits (265), Expect = 3e-23 3127 Identities = 75/235 (31%), Positives = 111/235 (46%), Gaps = 29/235 (12%) 3128 3129Query: 124 TAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCD 178 3130 T++DWR V+PV+NQ CGSC+SF++ G +E + I N + LS Q +V C 3131Sbjct: 233 TSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS 292 3132 3133Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSA--NIG 236 3134 +GC GG ++ G+ E+ +PYT T + C 3135Sbjct: 293 QYA----------QGCEGGFPYLIAGKYAQDFGLVEEACFPYTG-TDSPCKMKEDCFRYY 341 3136 3137Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDI-----PCNPN 290 3138pattern 237 **** 3139 E + F NE +M +V GP+A+A + + + Y G++ P NP 3140Sbjct: 342 SSEYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPF 400 3141 3142Query: 291 SL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 3143 L +H +L+VGY + M YWIVKNSWG WGE GY +RRG + C + + 3144Sbjct: 401 ELTNHAVLLVGYGTDSA---SGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452 3145 3146 3147>sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 3148 Length = 211 3149 3150 Score = 99.8 bits (245), Expect = 7e-21 3151 Identities = 73/228 (32%), Positives = 102/228 (44%), Gaps = 33/228 (14%) 3152 3153Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176 3154 S+P E D R+ VTP++ QG CGSCW+FS + E + +N + L+EQ LVD 3155Sbjct: 10 SLPSE----LDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAYRNMSLDLAEQELVD 65 3156 3157Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236 3158 C A GC+G P YI +NG +Q E YPY A + N+ G 3159Sbjct: 66 C-----------ASQNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSCHRPNAQRYG 113 3160 3161Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAI---AADAVEWQFYIGGVF---DIPCNPN 290 3162pattern 237 **** 3163 + +IS P + + + +A+ D ++ Y G D PN 3164Sbjct: 114 LKNYCQISP----PDSNKIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPN 169 3165 3166Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN 338 3167 H + IVGY NT + + YWIV+NSW WG+ GY Y N 3168Sbjct: 170 Y--HAVNIVGYG--NT---QGVDYWIVRNSWDTTWGDNGYGYFAANIN 210 3169 3170 3171>sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 3172 Length = 151 3173 3174 Score = 94.8 bits (232), Expect = 2e-19 3175 Identities = 60/158 (37%), Positives = 87/158 (54%), Gaps = 15/158 (9%) 3176 3177Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 3178 +H+E++ R+E FK N+ + N + + T G+N+ ADLS++E++ YL + I 3179Sbjct: 1 THKEFMPRYEEFKKNMDYVHNWN----SKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIK 56 3180 3181Query: 101 TDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHF 160 3182 + + +N ++ DWR + AVTPVK+QGQCGSC STTG+VEG 3183Sbjct: 57 LNGYHKRNL--GLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTA 113 3184 3185Query: 161 ISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 3186 I KLVSLSEQN++ +EGCNGGL 3187Sbjct: 114 IKTGKLVSLSEQNILRL--------SSSFGNEGCNGGL 143 3188 3189 3190>sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PRECURSOR 3191 Length = 344 3192 3193 Score = 90.9 bits (222), Expect = 4e-18 3194 Identities = 69/272 (25%), Positives = 111/272 (40%), Gaps = 47/272 (17%) 3195 3196Query: 108 DYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 167 3197 D + E ++IP W ++ +++Q CGSCW+F+ + + I+ N V 3198Sbjct: 72 DIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAV 131 3199 3200Query: 168 S--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY------- 218 3201 + LS ++L+ C G +C GC GG A+ + +K+G + T SY 3202Sbjct: 132 NTLLSSEDLLSC------CTGMFSCGNGCEGGYPIQAWKWWVKHG-LVTGGSYETQFGCK 184 3203 3204Query: 219 PY-----------------------TAETGTQCNFNSANIGPEEQAKISNFTM--IPKNE 253 3205pattern 237 **** 3206 PY T + C + P Q K T + K 3207Sbjct: 185 PYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKV 244 3208 3209Query: 254 TVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNM 312 3210 + I++ GP+ +A E + Y GV+ + H + I+G+ N 3211Sbjct: 245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDN-----GT 299 3212 3213Query: 313 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 3214 PYW+V NSW WGE+GY + RG N CG+ + 3215Sbjct: 300 PYWLVANSWNVAWGEKGYFRIIRGLNECGIEH 331 3216 3217 3218>sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PRECURSOR 3219 Length = 335 3220 3221 Score = 90.5 bits (221), Expect = 5e-18 3222 Identities = 73/299 (24%), Positives = 124/299 (41%), Gaps = 50/299 (16%) 3223 3224Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 3225 D++ ++ K + + A T D+ V + +E ++IP W ++ +++Q 3226Sbjct: 46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE--DTIPATFDARTQWPNCMSINNIRDQ 103 3227 3228Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 3229 CGSCW+F+ + I+ N V+ LS ++++ C C C GC GG 3230Sbjct: 104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSN------CGYGCEGGY 154 3231 3232Query: 199 QPNAYNYIIKNG---GIQTESSYPYTAETGTQCNFNSANI--------GPEEQAKISNFT 247 3233pattern 237 **** 3234 NA+ Y++K+G G E+ + + C N+ G + A ++ T 3235Sbjct: 155 PINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCT 214 3236 3237Query: 248 -------------------MIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC 287 3238 + K + + I++ GP+ A E + Y GV+ 3239Sbjct: 215 NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTT 274 3240 3241Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346 3242 H I I+G+ N PYW+V NSW +WGE GY + RG N CG+ + V 3243Sbjct: 275 GQELGGHAIRILGWGTDN-----GTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328 3244 3245 3246>sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 3247 Length = 96 3248 3249 Score = 90.5 bits (221), Expect = 5e-18 3250 Identities = 43/87 (49%), Positives = 55/87 (62%), Gaps = 2/87 (2%) 3251 3252Query: 264 GPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSW 321 3253 GPLA+A +A Q YIGGV L+HG+L+VGY + I K PYW++KNSW 3254Sbjct: 1 GPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGYAPIRLKEKPYWVIKNSW 60 3255 3256Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348 3257 G +WGE GY + RG+N CGV + VST 3258Sbjct: 61 GENWGENGYYKICRGRNICGVDSMVST 87 3259 3260 3261>sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 3262 Length = 335 3263 3264 Score = 88.5 bits (216), Expect = 2e-17 3265 Identities = 65/259 (25%), Positives = 105/259 (40%), Gaps = 47/259 (18%) 3266 3267Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL---SEQNL 174 3268 +P W + +++QG CGSCW+F + + I N V++ +E L 3269Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139 3270 3271Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ--------------------- 213 3272 C EC +GCNGG A+N+ K G + 3273Sbjct: 140 TCCGGEC---------GDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHH 190 3274 3275Query: 214 -TESSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 3276pattern 237 **** 3277 S P T E T +CN S + ++ S++++ + +MA I GP+ 3278Sbjct: 191 VNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAE-IYKNGPV 249 3279 3280Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 3281 A ++ Y GV+ H I I+G+ +N PYW+V NSW DW 3282Sbjct: 250 EGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVEN-----GTPYWLVGNSWNTDW 304 3283 3284Query: 326 GEQGYIYLRRGKNTCGVSN 344 3285 G+ G+ + RG++ CG+ + 3286Sbjct: 305 GDNGFFKILRGQDHCGIES 323 3287 3288 3289>sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 3290 Length = 339 3291 3292 Score = 87.4 bits (213), Expect = 4e-17 3293 Identities = 66/265 (24%), Positives = 113/265 (41%), Gaps = 45/265 (16%) 3294 3295Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNL 174 3296 ++P W + +++QG CGSCW+F + + I N V++ S ++L 3297Sbjct: 79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138 3298 3299Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK----NGGIQTE--------------- 215 3300 + C C C +GCNGG A+N+ + +GG+ 3301Sbjct: 139 LTC---C-----GIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHH 190 3302 3303Query: 216 ---SSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 3304pattern 237 **** 3305 S P T E T +CN S + ++ +++++ + +MA I GP+ 3306Sbjct: 191 VNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAE-IYKNGPV 249 3307 3308Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 3309 A ++ Y GV+ H I I+G+ +N + PYW+V NSW DW 3310Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGV-----PYWLVANSWNVDW 304 3311 3312Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350 3313 G+ G+ + RG+N CG+ + + I 3314Sbjct: 305 GDNGFFKILRGENHCGIESEIVAGI 329 3315 3316 3317>sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 3318 Length = 329 3319 3320 Score = 87.0 bits (212), Expect = 5e-17 3321 Identities = 66/288 (22%), Positives = 117/288 (39%), Gaps = 38/288 (13%) 3322 3323Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140 3324 +++ +E K ++ K A +D++ + + + S+P + W ++ +++Q 3325Sbjct: 50 EITEEEMKFKLMDGKYAAAHSDEIRATE--QEVVLASVPATFDSRTQWSECKSIKLIRDQ 107 3326 3327Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198 3328 CGSCW+F + + I +S +L+ C C +C GC GG 3329Sbjct: 108 ATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSC---C-----GSSCGNGCEGGY 159 3330 3331Query: 199 QPNAYNY-----IIKNGGIQTESSYPYTAETGTQ----------CNFNSANIGPEEQAKI 243 3332pattern 237 **** 3333 A + ++ G PY T C+ + + AK 3334Sbjct: 160 PIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKD 219 3335 3336Query: 244 SNFTM----IPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILI 298 3337 +F + +PKN + I + GP+ A E + Y GV+ H I I 3338Sbjct: 220 KHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKI 279 3339 3340Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346 3341 +G+ ++ PYW+V NSWG +WGE G+ + RG + CG+ + V 3342Sbjct: 280 IGWGTES-----GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAV 322 3343 3344 3345>sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP SECRETASE) 3346 Length = 339 3347 3348 Score = 86.2 bits (210), Expect = 9e-17 3349 Identities = 68/285 (23%), Positives = 110/285 (37%), Gaps = 55/285 (19%) 3350 3351Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155 3352 + +FT+DL +P W + +++QG CGSCW+F + 3353Sbjct: 70 QRVMFTEDL------------KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 117 3354 3355Query: 156 EGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213 3356 + I N VS+ S ++L+ C C C +GCNGG A+N+ + G + 3357Sbjct: 118 SDRICIHTNAHVSVEVSAEDLLTC---C-----GSMCGDGCNGGYPAEAWNFWTRKGLVS 169 3358 3359Query: 214 ----------------------TESSYPYTAETGTQ-----CNFNSANIGPEEQAKISNF 246 3360pattern 237 **** 3361 S P T E T C + +++ N 3362Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229 3363 3364Query: 247 TMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 305 3365 + +E + I GP+ A ++ Y GV+ H I I+G+ +N 3366Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVEN 289 3367 3368Query: 306 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350 3369 PYW+V NSW DWG+ G+ + RG++ CG+ + V I 3370Sbjct: 290 -----GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 329 3371 3372 3373>sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SJ31) 3374 Length = 342 3375 3376 Score = 85.4 bits (208), Expect = 2e-16 3377 Identities = 64/271 (23%), Positives = 109/271 (39%), Gaps = 57/271 (21%) 3378 3379Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ--NKLVSLSEQNLV 175 3380 IP + + W +++ +++Q +CGSCW+F + + I + LS +L+ 3381Sbjct: 90 IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149 3382 3383Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI---------------------QT 214 3384 C C + C +GC GG A++Y +K G + T 3385Sbjct: 150 SC---CKD------CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHT 200 3386 3387Query: 215 ESSYP-------------YTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261 3388pattern 237 **** 3389 + YP T + G + + +E + N NE V+ I+ 3390Sbjct: 201 KGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN------NEKVIQRDIM 254 3391 3392Query: 262 STGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320 3393 GP+ A D E + Y G++ H I I+G+ + K PYW++ NS 3394Sbjct: 255 MYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-----KRTPYWLIANS 309 3395 3396Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 3397 W DWGE+G + RG++ C + + V +I 3398Sbjct: 310 WNEDWGEKGLFRMVRGRDECSIESDVVAGLI 340 3399 3400 3401>sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 3402 Length = 340 3403 3404 Score = 85.4 bits (208), Expect = 2e-16 3405 Identities = 66/265 (24%), Positives = 111/265 (40%), Gaps = 46/265 (17%) 3406 3407Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLV 175 3408 +P T W ++ +++QG CGSCW+F + + + N VS+ S ++L+ 3409Sbjct: 80 LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139 3410 3411Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------- 213 3412 C C G E C GCNGG A+ Y + G + 3413Sbjct: 140 SC---C----GFE-CGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHV 191 3414 3415Query: 214 TESSYPYTAETGT--QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266 3416pattern 237 **** 3417 S P T E G +C+ + S + ++ I+++ +P++E + I GP+ 3418Sbjct: 192 NGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYG-VPRSEKEIMAEIYKNGPV 250 3419 3420Query: 267 AIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 3421 A E + Y GV+ H I I+G+ +N PYW+ NSW DW 3422Sbjct: 251 EGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVEN-----GTPYWLAANSWNTDW 305 3423 3424Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350 3425 G G+ + RG++ CG+ + + + 3426Sbjct: 306 GITGFFKILRGEDHCGIESEIVAGV 330 3427 3428 3429>sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PRECURSOR 3430 Length = 379 3431 3432 Score = 85.0 bits (207), Expect = 2e-16 3433 Identities = 71/265 (26%), Positives = 116/265 (42%), Gaps = 53/265 (20%) 3434 3435Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK--LVSLSEQNLV 175 3436 IP + +W ++ +++Q CGSCW+F + + I+ + V+LS +L+ 3437Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164 3438 3439Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ------CN 229 3440 C C ++C GCNGG A+ Y +K+G I T S+Y TA G + C 3441Sbjct: 165 SC---C------KSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY--TANNGCKPYPFPPCE 212 3442 3443Query: 230 FNSANIGPE------------EQAKISNFTMIPKNETVMAGY---------------IVS 262 3444pattern 237 ** ** 3445 +S + E+ +S++T +E G +++ 3446Sbjct: 213 HHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMT 272 3447 3448Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 321 3449 GPL IA + E + Y GGV+ H + ++G+ + I PYW V NSW 3450Sbjct: 273 HGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGI-----PYWTVANSW 327 3451 3452Query: 322 GADWGEQGYIYLRRGKNTCGVSNFV 346 3453 DWGE G+ + RG + CG+ + V 3454Sbjct: 328 NTDWGEDGFFRILRGVDECGIESGV 352 3455 3456 3457>sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SM31) 3458 Length = 340 3459 3460 Score = 84.6 bits (206), Expect = 3e-16 3461 Identities = 64/260 (24%), Positives = 107/260 (40%), Gaps = 45/260 (17%) 3462 3463Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 3464 IP + W ++ +++Q +CGSCWSF + + I + V LS +L+ 3465Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148 3466 3467Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA---ETGTQCNFNS 232 3468 C C E+C GC GG+ A++Y +K G + S +T +C ++ 3469Sbjct: 149 TC---C------ESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHT 199 3470 3471Query: 233 ANIGPEEQAKISN---------------FTM----------IPKNETVMAGYIVSTGPLA 267 3472pattern 237 **** 3473 P +KI N +T + +E + I+ GP+ 3474Sbjct: 200 KGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 259 3475 3476Query: 268 IAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326 3477 + E + Y G++ H I I+G+ +N PYW++ NSW DWG 3478Sbjct: 260 ASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVEN-----KTPYWLIANSWNEDWG 314 3479 3480Query: 327 EQGYIYLRRGKNTCGVSNFV 346 3481 E GY + RG++ C + + V 3482Sbjct: 315 ENGYFRIVRGRDECSIESEV 334 3483 3484 3485>sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 3486 Length = 339 3487 3488 Score = 84.6 bits (206), Expect = 3e-16 3489 Identities = 66/253 (26%), Positives = 108/253 (42%), Gaps = 43/253 (16%) 3490 3491Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYE 185 3492 W + +++QG CGSCW+F + + I N V++ S ++L+ C C 3493Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTC---C---- 142 3494 3495Query: 186 GEEACDEGCNGGLQPNAYNYIIK----NGGIQTE------------------SSYPYTAE 223 3496 C +GCNGG A+++ K +GG+ S P T E 3497Sbjct: 143 -GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGE 201 3498 3499Query: 224 TGT-QCNFN-SANIGPE-EQAKISNFTMIPKNETV--MAGYIVSTGPLAIAADAV-EWQF 277 3500pattern 237 ** ** 3501 T +CN + A P ++ K +T + +V + I GP+ A ++ 3502Sbjct: 202 GDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLT 261 3503 3504Query: 278 YIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK 337 3505 Y GV+ H I I+G+ +N + PYW+ NSW DWG+ G+ + RG+ 3506Sbjct: 262 YKSGVYKHEAGDMMGGHAIRILGWGVENGV-----PYWLAANSWNLDWGDNGFFKILRGE 316 3507 3508Query: 338 NTCGVSNFVSTSI 350 3509 N CG+ + + I 3510Sbjct: 317 NHCGIESEIVAGI 329 3511 3512 3513>sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 3514 Length = 341 3515 3516 Score = 79.6 bits (193), Expect = 9e-15 3517 Identities = 63/270 (23%), Positives = 106/270 (38%), Gaps = 46/270 (17%) 3518 3519Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162 3520 D V D +E + IP W ++ + +Q CGSCW+ S+ + + I+ 3521Sbjct: 76 DEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIA 135 3522 3523Query: 163 QN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY-----IIKNGGIQTE 215 3524 K V +S Q++V C C C +GC GG +A+ + ++ G T+ 3525Sbjct: 136 SKGAKQVLISAQDVVSC---CTW------CGDGCEGGWPISAFRFHADEGVVTGGDYNTK 186 3526 3527Query: 216 SSY-PYTAET----GTQCNFNSANIGPEEQAKISNFTMI------PKNETVMAGYIVSTG 264 3528pattern 237 **** 3529 S PY G + + +G + + ++ P + Y + 3530Sbjct: 187 GSCRPYEIHPCGHHGNETYYGEC-VGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNS 245 3531 3532Query: 265 PLAIAADAV-------------EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKN 311 3533 AI D + ++ Y G++ + H + ++G+ + K 3534Sbjct: 246 VKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEE-----KG 300 3535 3536Query: 312 MPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341 3537 PYWIV NSW DWGE G+ + RG N CG 3538Sbjct: 301 TPYWIVANSWHDDWGENGFFRMHRGSNDCG 330 3539 3540 3541>sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PRECURSOR 3542 Length = 342 3543 3544 Score = 78.4 bits (190), Expect = 2e-14 3545 Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%) 3546 3547Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 3548 IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++ 3549Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145 3550 3551Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220 3552 C C C +GC GG A+ Y I +G + + PY 3553Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197 3554 3555Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 3556pattern 237 **** 3557 T +C + ++ + ++ ++ + I+ GP+ 3558Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPV- 256 3559 3560Query: 268 IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 3561 +A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW 3562Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311 3563 3564Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351 3565 GE+GY + RG N CG+ ++ I+ 3566Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAGIV 337 3567 3568 3569>sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 3570 Length = 342 3571 3572 Score = 77.6 bits (188), Expect = 4e-14 3573 Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%) 3574 3575Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175 3576 IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++ 3577Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145 3578 3579Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220 3580 C C C +GC GG A+ Y I +G + + PY 3581Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197 3582 3583Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267 3584pattern 237 **** 3585 T +C + ++ + ++ ++ + I+ GP+ 3586Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPV- 256 3587 3588Query: 268 IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325 3589 +A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW 3590Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311 3591 3592Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351 3593 GE+GY + RG N CG+ ++ I+ 3594Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGIV 337 3595 3596 3597>sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PRECURSOR 3598 Length = 370 3599 3600 Score = 73.3 bits (177), Expect = 7e-13 3601 Identities = 56/248 (22%), Positives = 98/248 (38%), Gaps = 39/248 (15%) 3602 3603Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYE 185 3604 W + ++NQ CGSCW+F + + I N +S ++++ C C 3605Sbjct: 102 WPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSC---C---- 154 3606 3607Query: 186 GEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------TESSYPYTAET 224 3608 C GC GG A + +G + ES+ P + +T 3609Sbjct: 155 -GTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTTP-SCKT 212 3610 3611Query: 225 GTQCNFNSANIGPEEQAKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVE-WQFYIGGV 282 3612pattern 237 **** 3613 Q ++ + ++ S + + K+ T + I GP+ + E + Y GV 3614Sbjct: 213 TCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGV 272 3615 3616Query: 283 FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGV 342 3617 + H + I+G+ +N + YW++ NSWG +GE+G+ +RRG N C + 3618Sbjct: 273 YHYTSGKLVGGHAVKIIGWGVENGV-----DYWLIANSWGTSFGEKGFFKIRRGTNECQI 327 3619 3620Query: 343 SNFVSTSI 350 3621 V I 3622Sbjct: 328 EGNVVAGI 335 3623 3624 3625>sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P126) (111 KD ANTIGEN) 3626 Length = 989 3627 3628 Score = 70.2 bits (169), Expect = 6e-12 3629 Identities = 63/247 (25%), Positives = 102/247 (40%), Gaps = 46/247 (18%) 3630 3631Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE--EACDEGC 194 3632 V++QG C + W F++ ++E + + +S + +C Y+GE + CDEG 3633Sbjct: 579 VEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANC------YKGEHKDRCDEGS 632 3634 3635Query: 195 NGGLQPNAYNYIIKNGG-IQTESSYPYT-AETGTQC------------------NFNSAN 234 3636 + P + II++ G + ES+YPY + G QC N N N 3637Sbjct: 633 S----PMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPN 688 3638 3639Query: 235 I----------GPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFD 284 3640pattern 237 **** 3641 + F I K E + G +++ I A+ V + G 3642Sbjct: 689 SLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAY----IKAENVMGYEFSGKKVQ 744 3643 3644Query: 285 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344 3645 C ++ DH + IVGY + YWIV+NSWG WG++GY + T N 3646Sbjct: 745 NLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFN 804 3647 3648Query: 345 FVSTSII 351 3649 F+ + +I 3650Sbjct: 805 FIHSVVI 811 3651 3652 3653>sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 3654 Length = 43 3655 3656 Score = 60.9 bits (145), Expect = 4e-09 3657 Identities = 24/33 (72%), Positives = 27/33 (81%) 3658 3659Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 3660 + DWR +GAVTPVKNQG CGSCW+FST VEG 3661Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEG 36 3662 3663 3664>sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 3665 Length = 43 3666 3667 Score = 59.7 bits (142), Expect = 9e-09 3668 Identities = 24/33 (72%), Positives = 27/33 (81%) 3669 3670Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 3671 + DWR +GAVTPVKNQG CGSCW+FST VEG 3672Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIVTVEG 36 3673 3674 3675>sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 3676 Length = 174 3677 3678 Score = 59.3 bits (141), Expect = 1e-08 3679 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%) 3680 3681Query: 249 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 308 3682 I KN V+AG+IV ++ Y G++ + H + I+G+ + 3683Sbjct: 87 IMKNGPVVAGFIVYE----------DFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---- 132 3684 3685Query: 309 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351 3686 K PYW++ NSW DWGE+G+ + RG N C + V I+ 3687Sbjct: 133 -KGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGIV 174 3688 3689 3690>sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 3691 Length = 43 3692 3693 Score = 57.8 bits (137), Expect = 3e-08 3694 Identities = 22/33 (66%), Positives = 27/33 (81%) 3695 3696Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 3697 + DWR +GAVTPV+NQG CGSCW+FS+ VEG 3698Sbjct: 4 SIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 36 3699 3700 3701>sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 3702 Length = 43 3703 3704 Score = 56.2 bits (133), Expect = 1e-07 3705 Identities = 22/31 (70%), Positives = 25/31 (79%) 3706 3707Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157 3708 DWR +GAVTPVK+Q CGSCW+FST VEG 3709Sbjct: 6 DWRQKGAVTPVKDQNPCGSCWAFSTVATVEG 36 3710 3711 3712>sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 3713 Length = 42 3714 3715 Score = 51.9 bits (122), Expect = 2e-06 3716 Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%) 3717 3718Query: 314 YWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351 3719 YWIVKNSWG WG++GYIY+ + KN CG++ S ++ 3720Sbjct: 4 YWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 42 3721 3722 3723>sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 3724 Length = 136 3725 3726 Score = 41.8 bits (96), Expect = 0.002 3727 Identities = 31/101 (30%), Positives = 50/101 (48%), Gaps = 4/101 (3%) 3728 3729Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIA 66 3730 L + + + S PP+ +++ E++ KF K Y+ E R +++ N KIE N 3731Sbjct: 17 LLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNADY 76 3732 3733Query: 67 INHKADTKFGVNKFADLSSDEFK-NYYLNN-KEAIFTDDLP 105 3734 K G+N+F+DL+ +EFK N Y N+ DLP 3735Sbjct: 77 EQGKTSFYMGLNQFSDLTPEEFKTNCYGNSLNRGEMAPDLP 117 3736 3737 3738>sp|P05689|CATX_BOVIN CATHEPSIN 3739 Length = 73 3740 3741 Score = 40.2 bits (92), Expect = 0.006 3742 Identities = 15/40 (37%), Positives = 24/40 (59%), Gaps = 5/40 (12%) 3743 3744Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331 3745 ++H + + G+ + M YWIV+NSWG WGE G++ 3746Sbjct: 9 INHIVSVAGWGVSD-----GMEYWIVRNSWGEPWGEHGWM 43 3747 3748 3749>sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 3750 Length = 141 3751 3752 Score = 38.7 bits (88), Expect = 0.019 3753 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%) 3754 3755Query: 6 LFVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 3756 +F+L + +S+ P P +++ E++ F K YS +E R +++ N KIE N 3757Sbjct: 20 VFLLILCLGMMSAAPSPDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHNA 79 3758 3759Query: 65 IAINHKADTKFGVNKFADLSSDEFK 89 3760 K G+N+F+DL+ +EF+ 3761Sbjct: 80 DYERGKTSFYMGLNQFSDLTPEEFR 104 3762 3763 3764>sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (GC-C) (INTESTINAL 3765 GUANYLATE CYCLASE) (STA RECEPTOR) 3766 Length = 1072 3767 3768 Score = 35.6 bits (80), Expect = 0.16 3769 Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 19/120 (15%) 3770 3771Query: 15 FVSSRGIPPEEQSQFLEFQDK----FNKKYSHEEYLERFEIFKSNL-GKIEELNLIAINH 69 3772 +V G PE+ +L + F++ S ++ L R E F+ L G+ + N+I + 3773Sbjct: 190 YVYKNGSEPEDCFWYLNALEAGVSYFSEVLSFKDVLRRSEQFQEILMGRNRKSNVIVMCG 249 3774 3775Query: 70 KADTKFGVN---KFAD----LSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122 3776 +T + V K AD + D F N+Y F DD +Y+D+ + ++PPE+ 3777Sbjct: 250 TPETFYNVKGDLKVADDTVVILVDLFSNHY-------FEDDTRAPEYMDNVLVLTLPPEK 302 3778 3779 3780>sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTECTIVE ANTIGEN) 3781 Length = 650 3782 3783 Score = 35.2 bits (79), Expect = 0.22 3784 Identities = 24/81 (29%), Positives = 36/81 (43%), Gaps = 5/81 (6%) 3785 3786Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDC----DHECMEYEGEEACDEGCNGGLQPNAYNYI 206 3787 TT N + KL + + + +C DHEC +++C E NG Q + + 3788Sbjct: 533 TTCNPKEIQECQDKKLECVYKNHKAECECPDDHECYREPAKDSCSEEDNGKCQSSGQRCV 592 3789 3790Query: 207 IKNG-GIQTESSYPYTAETGT 226 3791 I+NG + E S TA T T 3792Sbjct: 593 IENGKAVCKEKSEATTAATTT 613 3793 3794 3795>sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 INTERGENIC REGION 3796 Length = 396 3797 3798 Score = 32.0 bits (71), Expect = 1.9 3799 Identities = 39/191 (20%), Positives = 77/191 (39%), Gaps = 39/191 (20%) 3800 3801Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE-------------- 122 3802 VNKF D++++E + ++ + P+ADYL F + ++ 3803Sbjct: 42 VNKFKDITNNESCTCEVGDRVWFSGKNAPLADYLSVHFRGPLKLKQFAFYTSPGFTVNNS 101 3804 3805Query: 123 QTAFDW----------RTRGAVTPVKNQGQCGSCW-------SFSTTGNVEGQHFISQNK 165 3806 +++ DW +T VT + + G+ C S + TG+ ++ 3807Sbjct: 102 RSSSDWNRLAYYESSSKTADNVTFLNHGGEASPCLGNALSYASSNGTGSASEATVLADGT 161 3808 3809Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225 3810 L+S ++ ++ + C + ++ C +G P Y Y GG T + + E 3811Sbjct: 162 LISSDQEYIIYSNVSCPKSGYDKGCGVYRSG--IPAYYGY----GG--TTKMFLFEFEMP 213 3812 3813Query: 226 TQCNFNSANIG 236 3814 T+ NS++IG 3815Sbjct: 214 TETEKNSSSIG 224 3816 3817 3818>sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) 3819 Length = 239 3820 3821 Score = 32.0 bits (71), Expect = 1.9 3822 Identities = 24/93 (25%), Positives = 36/93 (37%), Gaps = 7/93 (7%) 3823 3824Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNG 196 3825 ++ G G C G V + + L + + N+V C C + ++ C G N 3826Sbjct: 137 IRPSGGSGDC---KYAGCVSDLNAACPDMLKVMDQNNVVACKSACERFNTDQYCCRGAND 193 3827 3828Query: 197 GLQ---PNAYNYIIKNGGIQTESSYPYTAETGT 226 3829 + P Y+ I KN SY Y ET T 3830Sbjct: 194 KPETCPPTDYSRIFKN-ACPDAYSYAYDDETST 225 3831 3832 3833>sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-DIRECTED RNA POLYMERASE ; 3834 THIOL PROTEASE 3C ; HELICASE (2C LIKE PROTEIN)] 3835 Length = 1699 3836 3837 Score = 31.3 bits (69), Expect = 3.2 3838 Identities = 13/31 (41%), Positives = 21/31 (66%) 3839 3840Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLE 47 3841 SS+G+ EE ++ +++ N KYS EEYL+ 3842Sbjct: 893 SSKGLSDEEYDEYKRIREERNGKYSIEEYLQ 923 3843 3844 3845>sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2 3846 Length = 185 3847 3848 Score = 30.9 bits (68), Expect = 4.2 3849 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 6/99 (6%) 3850 3851Query: 30 LEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF---GVNKF-ADLSS 85 3852 L+ K KK ++ ++ + K+NL ++ +++HK +K ++KF D S 3853Sbjct: 6 LKLGSKTLKKNISKKTKKKNSLQKANLFDWDDAETASLSHKPQSKIKIQSIDKFDLDEES 65 3854 3855Query: 86 DEFKNYYLNNKEAIFT--DDLPVADYLDDEFINSIPPEE 122 3856 K + E T +D P+ +Y+ ++ N +P EE 3857Sbjct: 66 SSKKKLVIKLSENADTKKNDAPLVEYVTEKEYNEVPVEE 104 3858 3859 3860>sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN 3861 Length = 512 3862 3863 Score = 30.9 bits (68), Expect = 4.2 3864 Identities = 17/58 (29%), Positives = 29/58 (49%), Gaps = 9/58 (15%) 3865 3866Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 3867 + +NLI + K+D L+ +E KN+ +E I D+PV + DE +N+ 3868Sbjct: 237 KRVNLIPVIAKSDL---------LTKEELKNFKTQVREIIRVQDIPVCFFFGDEVLNA 285 3869 3870 3871>sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDROLASE) (BLM HYDROLASE) 3872 Length = 454 3873 3874 Score = 30.5 bits (67), Expect = 5.5 3875 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 11/66 (16%) 3876 3877Query: 111 DDEFINS--IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS 168 3878 DD +N + ++ F+ TPV NQ G CW F+ T +Q +L 3879Sbjct: 36 DDALLNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAAT---------NQLRLNV 86 3880 3881Query: 169 LSEQNL 174 3882 LSE NL 3883Sbjct: 87 LSELNL 92 3884 3885 3886>sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 3887 Length = 527 3888 3889 Score = 30.5 bits (67), Expect = 5.5 3890 Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 7/52 (13%) 3891 3892Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNN 95 3893 +YL + I+K K +L L IN K T F LSS FKNYYL + 3894Sbjct: 466 DYLAKNSIYKMKNLKFMDLFLNNINSKGYTLF-------LSSGMFKNYYLKS 510 3895 3896 3897>sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 3898 Length = 1427 3899 3900 Score = 30.1 bits (66), Expect = 7.2 3901 Identities = 22/89 (24%), Positives = 44/89 (48%), Gaps = 10/89 (11%) 3902 3903Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--- 77 3904 +PP + S F++ + Y EE ++ E F NLG + ++ I H+ + K+ + 3905Sbjct: 1314 LPPFQVSSFVKETKLHSGDYGEEEDADQEESFSLNLG----IGIVEIAHENEQKWLIYDK 1369 3906 3907Query: 78 --NKFADLSSDEFKNYYLNNKEAIFTDDL 104 3908 +K+ S E ++++N +TDD+ 3909Sbjct: 1370 KDHKYVCTFSME-PYHFISNYNTKYTDDM 1397 3910 3911 3912>sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 3913 Length = 436 3914 3915 Score = 30.1 bits (66), Expect = 7.2 3916 Identities = 11/20 (55%), Positives = 14/20 (70%) 3917 3918Query: 311 NMPYWIVKNSWGADWGEQGY 330 3919 N W V+NSWG D G++GY 3920Sbjct: 370 NSTKWKVENSWGKDAGQKGY 389 3921 3922 3923>sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 3924 Length = 455 3925 3926 Score = 29.7 bits (65), Expect = 9.4 3927 Identities = 10/17 (58%), Positives = 13/17 (75%) 3928 3929Query: 315 WIVKNSWGADWGEQGYI 331 3930 W V+NSWG D G +GY+ 3931Sbjct: 392 WRVENSWGEDHGHKGYL 408 3932 3933 3934>sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (AMINOPEPTIDASE H) 3935 Length = 455 3936 3937 Score = 29.7 bits (65), Expect = 9.4 3938 Identities = 10/19 (52%), Positives = 14/19 (73%) 3939 3940Query: 315 WIVKNSWGADWGEQGYIYL 333 3941 W V+NSWG D G +GY+ + 3942Sbjct: 392 WRVENSWGEDRGNKGYLIM 410 3943 3944 3945>sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 3946 Length = 454 3947 3948 Score = 29.7 bits (65), Expect = 9.4 3949 Identities = 10/17 (58%), Positives = 13/17 (75%) 3950 3951Query: 315 WIVKNSWGADWGEQGYI 331 3952 W V+NSWG D G +GY+ 3953Sbjct: 392 WRVENSWGEDHGHKGYL 408 3954 3955 3956 Database: /home/peter/blast/data/swissprot 3957 Posted date: Oct 10, 2000 10:43 AM 3958 Number of letters in database: 31,984,247 3959 Number of sequences in database: 88,780 3960 3961Lambda K H 3962 0.317 0.136 0.414 3963 3964Lambda K H 3965 0.270 0.0477 0.230 3966 3967 3968Matrix: BLOSUM62 3969Gap Penalties: Existence: 11, Extension: 1 3970Number of Hits to DB: 23348054 3971Number of Sequences: 88780 3972Number of extensions: 1039466 3973Number of successful extensions: 3135 3974Number of sequences better than 10.0: 162 3975Number of HSP's better than 10.0 without gapping: 118 3976Number of HSP's successfully gapped in prelim test: 8 3977Number of HSP's that attempted gapping in prelim test: 2557 3978Number of HSP's gapped (non-prelim): 148 3979length of query: 351 3980length of database: 31,984,247 3981effective HSP length: 50 3982effective length of query: 301 3983effective length of database: 27,545,247 3984effective search space: 8291119347 3985effective search space used: 8291119347 3986T: 11 3987A: 40 3988X1: 16 ( 7.3 bits) 3989X2: 38 (14.8 bits) 3990X3: 64 (24.9 bits) 3991S1: 41 (21.6 bits) 3992S2: 65 (29.7 bits) 3993