BLASTP 2.0.14 [Jun-29-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= CYS1_DICDI
         (343 letters)

Database: /home/peter/blast/data/swissprot.pr
           88,780 sequences; 31,984,247 total letters

Searching..................................................done


Results from round 1


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR  721  0.0
sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR  281  1e-75
sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RES...  278  1e-74
sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR  275  1e-73
sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR  262  7e-70
sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK C...  262  7e-70
sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR  249  7e-66
sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1)  244  2e-64
sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR  243  4e-64
sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE...  236  5e-62
sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR  235  9e-62
sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEIN...  232  1e-60
sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR  225  1e-58
sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR  224  2e-58
sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN...  224  3e-58
sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE...  223  5e-58
sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR  221  1e-57
sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH)  221  2e-57
sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH)  220  4e-57
sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR  218  1e-56
sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR  218  2e-56
sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE...  218  2e-56
sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS...  217  3e-56
sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE...  215  8e-56
sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR  214  2e-55
sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH)  213  3e-55
sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR  212  7e-55
sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V)  210  5e-54
sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR  209  6e-54
sp|Q10991|CATL_SHEEP CATHEPSIN L  209  6e-54
sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS...  208  2e-53
sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN)  206  4e-53
sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR  203  6e-52
sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR  203  6e-52
sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR  201  2e-51
sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPS...  200  3e-51
sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH)  199  9e-51
sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR  198  2e-50
sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23)  198  2e-50
sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR  198  2e-50
sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II...  194  2e-49
sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEP...  194  2e-49
sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR  194  3e-49
sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR  194  3e-49
sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR  192  1e-48
sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR  191  2e-48
sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR  191  2e-48
sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN)  191  2e-48
sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA...  190  3e-48
sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHE...  189  6e-48
sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR  188  1e-47
sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN)  185  9e-47
sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR  184  2e-46
sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR  184  3e-46
sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPA...  183  3e-46
sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR  183  3e-46
sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR  182  8e-46
sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN)  182  1e-45
sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR  177  3e-44
sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI)  176  6e-44
sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L  175  1e-43
sp|P25326|CATS_BOVIN CATHEPSIN S  173  5e-43
sp|P80884|ANAN_ANACO ANANAIN  167  3e-41
sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR  167  4e-41
sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE...  164  3e-40
sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR  162  7e-40
sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR  159  8e-39
sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR  156  5e-38
sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE P...  153  4e-37
sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR  153  5e-37
sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR  153  6e-37
sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR  150  5e-36
sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR  146  5e-35
sp|P14518|BROM_ANACO BROMELAIN, STEM  146  6e-35
sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR  145  1e-34
sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR...  144  3e-34
sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR ...  144  3e-34
sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR  132  1e-30
sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR...  123  7e-28
sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPP...  117  3e-26
sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D...  116  8e-26
sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D...  113  4e-25
sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR  113  7e-25
sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN...  113  7e-25
sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I)  105  1e-22
sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II)  96  1e-19
sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PREC...  94  3e-19
sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13)  90  5e-18
sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR  90  5e-18
sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PREC...  89  1e-17
sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2)  88  3e-17
sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR  87  5e-17
sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP S...  86  1e-16
sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR...  85  3e-16
sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR...  85  3e-16
sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1)  84  5e-16
sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PREC...  83  6e-16
sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1)  83  6e-16
sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC...  76  1e-13
sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PREC...  75  2e-13
sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC...  75  3e-13
sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PREC...  73  7e-13
sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P1...  71  4e-12
sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III)  63  8e-10
sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV)  62  2e-09
sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3  59  1e-08
sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I)  59  2e-08
sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II)  58  3e-08
sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L  52  2e-06
sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR  42  0.002
sp|P21381|THPA_THADA THAUMATOPAIN  41  0.005
sp|P05689|CATX_BOVIN CATHEPSIN  40  0.006
sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR  39  0.018
sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTEC...  35  0.21
sp|Q11121|PLB1_TORDE LYSOPHOSPHOLIPASE PRECURSOR (PHOSPHOLIPASE B)  35  0.27
sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 I...  34  0.61
sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5)  32  1.8
sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN  32  2.4
sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-D...  31  3.1
sp|P21173|DNAA_MICLU CHROMOSOMAL REPLICATION INITIATOR PROTEIN DNAA  31  3.1
sp|P89263|Y022_GVXN HYPOTHETICAL ORF22 HOMOLOG  31  5.3
sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5  31  5.3
sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8  30  7.0
sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C  30  7.0
sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH)  30  9.1
sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (...  30  9.1
sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH)  30  9.1

>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR
          Length = 343

 Score = 721 bits (1841), Expect = 0.0
 Identities = 343/343 (100%), Positives = 343/343 (100%)

Query: 1   MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
           MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE
Sbjct: 1   MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60

Query: 61  ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120
           ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT
Sbjct: 61  ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120

Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180
           AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY
Sbjct: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180

Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240
           EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM
Sbjct: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240

Query: 241 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300
           IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF
Sbjct: 241 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300

Query: 301 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
           RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII
Sbjct: 301
RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 181 182 183>sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 184 Length = 313 185 186 Score = 281 bits (712), Expect = 1e-75 187 Identities = 147/316 (46%), Positives = 193/316 (60%), Gaps = 18/316 (5%) 188 189Query: 32 FQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLSSDE 87 190 F+ KF K Y S EE+ RF +FK+NL L A+ H+ + GV +F+DL+ E 191Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANL-------LRAMRHQKMDPSARHGVTQFSDLTRSE 55 192 193Query: 88 FKNYYLNNKEAI-FTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 194 F+ +L K D A L + ++P FDWR RGAVTPVKNQG CGSCWSFS 195Sbjct: 56 FRRKHLGVKGGFKLPKDANQAPILPTQ---NLPEEFDWRDRGAVTPVKNQGSCGSCWSFS 112 196 197Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 198 TTG +EG HF++ KLVSLSEQ LVDCDHEC + E E +CD GCNGGL +A+ Y +K G 199Sbjct: 113 TTGALEGAHFLATGKLVSLSEQQLVDCDHEC-DPEEEGSCDSGCNGGLMNSAFEYTLKTG 171 200 201Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE 266 202 G+ E YPYT G C + + I A +SNF+++ NE +A ++ GPLA+A +A 203Sbjct: 172 GLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAY 231 204 205Query: 267 WQFYIGGVFDIPCNPNSLDHGILIVGYSAK--NTIFRKNMPYWIVKNSWGADWGEQGYIY 324 206 Q YIGGV L+HG+L+VGY + + K PYWI+KNSWG WGE G+ 207Sbjct: 232 MQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYK 291 208 209Query: 325 LRRGKNTCGVSNFVST 340 210 + +G+N CGV + VST 211Sbjct: 292 ICKGRNICGVDSLVST 307 212 213 214>sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A) 215 Length = 363 216 217 Score = 278 bits (703), Expect = 1e-74 218 Identities = 144/320 (45%), Positives = 201/320 (62%), Gaps = 14/320 (4%) 219 220Query: 26 QSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 221 + F F+ KF+K Y+ EE+ RF +FKSNL K + + N + G+ KF+DL+ 222Sbjct: 45 EHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAK----LHQNRDPTAEHGITKFSDLT 100 223 224Query: 85 SDEFKNYYLNNKEAIFTDDLPV-ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143 225 + EF+ +L K+ + LP A ++P FDWR 
+GAVTPVK+QG CGSCW 226Sbjct: 101 ASEFRRQFLGLKKRL---RLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCW 157 227 228Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 229 +FSTTG +EG H+++ KLVSLSEQ LVDCDH C + E +CD GCNGGL NA+ Y++ 230Sbjct: 158 AFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVC-DPEQAGSCDSGCNGGLMNNAFEYLL 216 231 232Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 263 233 ++GG+ E Y YT G+ C F+ + + A +SNF+++ +E +A +V GPLA+A + 234Sbjct: 217 ESGGVVQEKDYAYTGRDGS-CKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAIN 275 235 236Query: 264 AVEWQFYIGGV-FDIPCNPNSLDHGILIVGY--SAKNTIFRKNMPYWIVKNSWGADWGEQ 320 237 A Q Y+ GV C + LDHG+L+VG+ A I K PYWI+KNSWG +WGEQ 238Sbjct: 276 AAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQ 335 239 240Query: 321 GYIYLRRGKNTCGVSNFVST 340 241 GY + RG+N CGV + VST 242Sbjct: 336 GYYKICRGRNVCGVDSMVST 355 243 244 245>sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 246 Length = 368 247 248 Score = 275 bits (695), Expect = 1e-73 249 Identities = 155/359 (43%), Positives = 205/359 (56%), Gaps = 34/359 (9%) 250 251Query: 6 LFVLAVFTVFVSSR---------------GIPPE---EQSQFLEFQDKFNKKY-SHEEYL 46 252 +FVL+ F V VSS G P+ + F F+ KF K Y S+EE+ 253Sbjct: 10 VFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHD 69 254 255Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTK--FGVNKFADLSSDEFKNYYLNNKEAI-FTDD 103 256 RF +FK+NL + + K D GV +F+DL+ EF+ +L + D 257Sbjct: 70 YRFSVFKANLRRARR------HQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKD 123 258 259Query: 104 LPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 260 A L E ++P FDWR GAVTPVKNQG CGSCWSFS TG +EG +F++ KLV 261Sbjct: 124 ANKAPILPTE---NLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLV 180 262 263Query: 164 SLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ 223 264 SLSEQ LVDCDHEC + E ++CD GCNGGL +A+ Y +K GG+ E YPYT + G 265Sbjct: 181 SLSEQQLVDCDHEC-DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKT 239 266 267Query: 224 CNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNS 283 268 C 
+ + I A +SNF++I +E +A +V GPLA+A +A Q YIGGV 269Sbjct: 240 CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRR 299 270 271Query: 284 LDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVST 340 272 L+HG+L+VGY A K PYWI+KNSWG WGE G+ + +G+N CGV + VST 273Sbjct: 300 LNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358 274 275 276>sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 277 Length = 371 278 279 Score = 262 bits (663), Expect = 7e-70 280 Identities = 138/324 (42%), Positives = 189/324 (57%), Gaps = 15/324 (4%) 281 282Query: 26 QSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 283 +S FL F +F K Y +E+ R +FK NL + L+ + GV KF+DL+ 284Sbjct: 45 ESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLL----DPSAEHGVTKFSDLT 100 285 286Query: 85 SDEFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGS 141 287 EF+ YL ++ A+ + A + +P FDWR GAV PVKNQG CGS 288Sbjct: 101 PAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGS 160 289 290Query: 142 CWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY 201 291 CWSFS +G +EG H+++ KL LSEQ VDCDHEC E ++CD GCNGGL A++Y 292Sbjct: 161 CWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSE-PDSCDSGCNGGLMTTAFSY 219 293 294Query: 202 IIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIA 261 295 + K GG+++E YPYT G +C F+ + I A + NF+++ +E ++ ++ GPLAI 296Sbjct: 220 LQKAGGLESEKDYPYTGSDG-KCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIG 278 297 298Query: 262 ADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGE 319 299 +A Q YIGGV LDHG+L+VGY A I K+ PYWI+KNSWG +WGE 300Sbjct: 279 INAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGE 338 301 302Query: 320 QGYIYLRRG---KNTCGVSNFVST 340 303 GY + RG +N CGV + VST 304Sbjct: 339 NGYYKICRGSNVRNKCGVDSMVST 362 305 306 307>sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK CATHEPSIN) 308 Length = 376 309 310 Score = 262 bits (663), Expect = 7e-70 311 Identities = 147/383 (38%), Positives = 213/383 (55%), Gaps = 55/383 (14%) 312 313Query: 1 
MKVILLFVLAVFTVFVSSRGIP-------PEEQSQFLEFQDKFNKKYSHEEYLERFEIFK 53 314 M++++ +L +F F + P + ++ F E+ KFN++YS E+ R+ IFK 315Sbjct: 1 MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFK 60 316 317Query: 54 SNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK-EAIFTDDLPVADYLDD 112 318 SN+ ++ N + T G+N FAD++++E++ YL + A + + L+ 319Sbjct: 61 SNMDYVDNWNS---KGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNV 117 320 321Query: 113 EFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 172 322 E + + P + DWRT+ AVTP+K+QGQCGSCWSFSTTG+ EG H + KLVSLSEQNLVD 323Sbjct: 118 EDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVD 177 324 325Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 232 326 C G E + GC+GGL NA++YIIKN GI TESSYPYTAETG+ C FN ++IG 327Sbjct: 178 C-------SGPEE-NFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIG 229 328 329Query: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGIL 289 330 A I + I + GP+++A DA +Q Y G++ P C+P LDHG+L 331Sbjct: 230 ATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVL 289 332 333Query: 290 IVGY--------------------------------SAKNTIFRKNMPYWIVKNSWGADW 317 334 +VGY + +++ K YWIVKNSWG W 335Sbjct: 290 VVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIVKNSWGTSW 349 336 337Query: 318 GEQGYIYLRRG-KNTCGVSNFVS 339 338 G +GYI + + KN CG+++ S 339Sbjct: 350 GIKGYILMSKDRKNNCGIASVSS 372 340 341 342>sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 343 Length = 344 344 345 Score = 249 bits (629), Expect = 7e-66 346 Identities = 139/362 (38%), Positives = 201/362 (55%), Gaps = 37/362 (10%) 347 348Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59 349 MKV+ L VL V + + ++ F ++ K Y+ EE+ R+ IF +N+ + 350Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYV 60 351 352Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 353 ++ N + ++T G+N FAD++++E++N YL K F + + NS 354Sbjct: 61 QQWN----SKGSETVLGLNNFADITNEEYRNTYLGTK---FDASSLIGTQEEKVHTNSSA 113 355 356Query: 120 
TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 357 + DWR+ GAVTPVKNQGQCG CWSFSTTG+ EG HF S+ +LVSLSEQNL+DC E 358Sbjct: 114 ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--- 170 359 360Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 361 + GC+GGL A+ YII N GI TESSYPY AE G +C + S N GA +S++ 362Sbjct: 171 -------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG-KCEYKSENSGATLSSYK 222 363 364Query: 240 MIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY--- 293 365 + V+ P+++A DA +Q Y G++ P C+ +LDHG+L VGY 366Sbjct: 223 TVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSG 282 367 368Query: 294 -----------SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 369 S+ N + YWIVKNSWG WG +GYI + R + N CG+++ S 370Sbjct: 283 SGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFP 342 371 372Query: 342 II 343 373 ++ 374Sbjct: 343 VV 344 375 376 377>sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 378 Length = 319 379 380 Score = 244 bits (617), Expect = 2e-64 381 Identities = 128/326 (39%), Positives = 190/326 (58%), Gaps = 22/326 (6%) 382 383Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 384 +P ++++F+ K+ K+Y E RF IFKSN+ K + L + + +GV + 385Sbjct: 12 LPGNVDEKYVQFKLKYRKQYHETEDEIRFNIFKSNILKAQ---LYQVFVRGSAIYGVTPY 68 386 387Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140 388 +DL++DEF +L + + L E +N+IP FDWR +GAVT VKNQG CG 389Sbjct: 69 SDLTTDEFARTHLTASWVVPSSRSNTPTSLGKE-VNNIPKNFDWREKGAVTEVKNQGMCG 127 390 391Query: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 200 392 SCW+FSTTGNVE Q F KL+SLSEQ LVDCD D+GCNGGL NAY 393Sbjct: 128 SCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCD----------GLDDGCNGGLPSNAYE 177 394 395Query: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAI 260 396 IIK GG+ E +YPY A+ +C+ + + I++ + ++ET +A ++ +++ 397Sbjct: 178 SIIKMGGLMLEDNYPYDAK-NEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISV 236 398 399Query: 261 
AADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 400 +A+ QFY G+ + I C+ LDH +L+VGY + KN P+WIVKNSWG +W 401Sbjct: 237 GMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG----VSEKNEPFWIVKNSWGVEW 292 402 403Query: 318 GEQGYIYLRRGKNTCGVSNFVSTSII 343 404 GE GY + RG +CG++ ++++I 405Sbjct: 293 GENGYFRMYRGDGSCGINTVATSAMI 318 406 407 408>sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 409 Length = 450 410 411 Score = 243 bits (614), Expect = 4e-64 412 Identities = 136/346 (39%), Positives = 193/346 (55%), Gaps = 26/346 (7%) 413 414Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEE 61 415 V+L + +V + S + + +F F+ K+ K Y +E RF F+ N+ E+ 416Sbjct: 15 VLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENM---EQ 71 417 418Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTA 121 419 + A + T FGV F+D++ +EF+ Y N + ++ P A 420Sbjct: 72 AKIQAAANPYAT-FGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTVNVT-TGRAPAA 129 421 422Query: 122 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYE 181 423 DWR +GAVTPVK QGQCGSCW+FST GN+EGQ ++ N LVSLSEQ LV CD 424Sbjct: 130 VDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCD------- 182 425 426Query: 182 GEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETG--TQCNFNSANIGAKISN 237 427 D GCNGGL NA+N+I+ + G + TE+SYPY + G QC N IGA I++ 428Sbjct: 183 ---TIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITD 239 429 430Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 297 431 +P++E +A Y+ GPLAIA DA + Y GG+ C LDHG+L+VGY+ + 432Sbjct: 240 HVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGIL-TSCTSKQLDHGVLLVGYNDNS 298 433 434Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 435 N PYWI+KNSW WGE GYI + +G N C ++ VS++++ 436Sbjct: 299 -----NPPYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339 437 438 439>sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE 440 A-1) 441 Length = 354 442 443 Score = 236 bits (596), Expect = 5e-62 444 Identities = 146/350 (41%), Positives = 196/350 (55%), Gaps = 38/350 (10%) 
445 446Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 447 LLF + V +FV G PP + + + F+ + K + + E RF F 448Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 449 450Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-NKEAIFTDDLPVADYLD 111 451 K N+ LN + D KFADL+ EF YLN + A D ++D 452Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYARHLKDHKEDVHVD 123 453 454Query: 112 DEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLV 171 455 D + + + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ LV 456Sbjct: 124 DSAPSGVMSV-DWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLV 182 457 458Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFN 227 459 CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ C+ + 460Sbjct: 183 SCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-D 231 461 462Query: 228 SANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHG 287 463 +GAKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C SL+HG 464Sbjct: 232 EGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAWSLNHG 290 465 466Query: 288 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 337 467 +LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ 468Sbjct: 291 VLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 469 470 471>sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 472 Length = 354 473 474 Score = 235 bits (594), Expect = 9e-62 475 Identities = 146/359 (40%), Positives = 195/359 (53%), Gaps = 56/359 (15%) 476 477Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 478 LLF + V +FV G PP + + + F+ + K + + E RF F 479Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 480 481Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN----------NKEAIFTD 102 482 K N+ LN + D KFADL+ EF YLN +KE + D 483Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYARHLKNHKEDVHVD 123 484 485Query: 103 DLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKL 162 486 D + + + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + L 487Sbjct: 124 
DSAPSGVM----------SVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173 488 489Query: 163 VSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAET 220 490 VSLSEQ LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ 491Sbjct: 174 VSLSEQMLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGG 223 492 493Query: 221 GTQ--CNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIP 278 494 GT+ C+ + +GAKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + 495Sbjct: 224 GTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL- 281 496 497Query: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 337 498 C SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+ 499Sbjct: 282 CLAWSLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335 500 501 502>sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEINASE) (CRUZAINE) 503 Length = 467 504 505 Score = 232 bits (585), Expect = 1e-60 506 Identities = 137/350 (39%), Positives = 191/350 (54%), Gaps = 30/350 (8%) 507 508Query: 3 VILLFVLAVFTVFV--SSRGIPPEEQ--SQFLEFQDKFNKKY-SHEEYLERFEIFKSNLG 57 509 ++L VL V V ++ + EE SQF EF+ K + Y S E R +F+ NL 510Sbjct: 8 LLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF 67 511 512Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 513 + L+ A H FGV F+DL+ +EF++ Y +N A F A + 514Sbjct: 68 -LARLHAAANPHAT---FGVTPFSDLTREEFRSRY-HNGAAHFAAAQERARVPVKVEVVG 122 515 516Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 517 P A DWR RGAVT VK+QGQCGSCW+FS GNVE Q F++ + L +LSEQ LV CD 518Sbjct: 123 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD--- 179 519 520Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNSANIGA 233 521 D GC+GGL NA+ +I++ NG + TE SYPY + G C + +GA 522Sbjct: 180 -------KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGA 232 523 524Query: 234 KISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGY 293 525 I+ +P++E +A ++ GP+A+A DA W Y GGV C LDHG+L+VGY 526Sbjct: 233 TITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM-TSCVSEQLDHGVLLVGY 291 527 
528Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 529 + + PYWI+KNSW WGE+GYI + +G N C V S++++ 530Sbjct: 292 NDSAAV-----PYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336 531 532 533>sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 534 Length = 322 535 536 Score = 225 bits (567), Expect = 1e-58 537 Identities = 130/341 (38%), Positives = 182/341 (53%), Gaps = 33/341 (9%) 538 539Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59 540 MKV+ LF+ + + + EF+ KF +KY EE R +F NL I 541Sbjct: 1 MKVVALFLFGLALAAANP---------SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYI 51 542 543Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 544 EE N + +N+F+D+++++F K+ P A + + 545Sbjct: 52 EEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKGYKKG----PRPAAVFTSTDAAPE-S 106 546 547Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 548 T DWRT+GAVTPVK+QGQCGSCW+FSTTG +EGQHF+ +LVSLSEQ LVDC 549Sbjct: 107 TEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC------ 160 550 551Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 552 G ++GCNGG A Y+ NGG+ TESSYPY A T C FNS IGA + + 553Sbjct: 161 -AGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNT-CRFNSNTIGATCTGYV 218 554 555Query: 240 MIPK-NETVMAGYIVSTGPLAIAADAVEWQF---YIGGVFDIPCNPNSLDHGILIVGYSA 295 556 I + +E+ + GP+++A DA F Y G ++ C+ + LDH +L VGY + 557Sbjct: 219 GIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGS 278 558 559Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVS 335 560 + +W+VKNSW WGE GYI + R + N CG++ 561Sbjct: 279 EG-----GQDFWLVKNSWATSWGESGYIKMARNRNNNCGIA 314 562 563 564>sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 565 Length = 323 566 567 Score = 224 bits (566), Expect = 2e-58 568 Identities = 132/349 (37%), Positives = 188/349 (53%), Gaps = 32/349 (9%) 569 570Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKI 59 571 MKV +LF+ V S + F+ K+ ++Y EE R IF+ N I 572Sbjct: 1 
MKVAVLFLCGVALAAASP---------SWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYI 51 573 574Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 575 EE N N + +NKF D++ +EF N I PV+ + + 576Sbjct: 52 EEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGN---IPRRSAPVSVFYPKKETGPQA 108 577 578Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 579 T DWRT+GAVTPVK+QGQCGSCW+FSTTG++EGQHF+ L+SL+EQ LVDC 580Sbjct: 109 TEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDC------ 162 581 582Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 583 +GCNGG +A++YI N GI TE++YPY A G+ C F+S ++ A S T 584Sbjct: 163 --SRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGS-CRFDSNSVAATCSGHT 219 585 586Query: 240 MIPK-NETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSA 295 587 I +ET + + GP+++ DA +QFY GV+ P C+P+ LDH +L VGY + 588Sbjct: 220 NIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGS 279 589 590Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 591 + +W+VKNSW WG+ GYI + R + N CG++ S ++ 592Sbjct: 280 EG-----GQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323 593 594 595>sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) (CYCLIC 596 PROTEIN-2) (CP-2) 597 Length = 334 598 599 Score = 224 bits (564), Expect = 3e-58 600 Identities = 127/351 (36%), Positives = 195/351 (55%), Gaps = 31/351 (8%) 601 602Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 603 ++LL VL + T + + +Q+ +++ + Y E R +++ N+ I+ 604Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLH 62 605 606Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 607 N N K +N F D++++EF+ + + K +F + L + 608Sbjct: 63 NGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML---------- 112 609 610Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 611 IP DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVDC H+ 612Sbjct: 113 QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD 172 613 614Query: 177 
CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 615 +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + A + 616Sbjct: 173 ----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRAEYAVANDT 223 617 618Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 293 619 F IP+ E + + + GP+++A DA QFY G++ P C+ LDHG+L+VGY 620Sbjct: 224 GFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGY 283 621 622Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTSII 343 623 + T K+ YW+VKNSWG +WG GYI + + +N CG++ S I+ 624Sbjct: 284 GYEGTDSNKD-KYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333 625 626 627>sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 628 Length = 334 629 630 Score = 223 bits (562), Expect = 5e-58 631 Identities = 126/351 (35%), Positives = 198/351 (55%), Gaps = 31/351 (8%) 632 633Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 634 ++LL VL + T + + +++ +++ + Y E R I++ N+ I+ 635Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLH 62 636 637Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116 638 N N + +N F D++++EF+ + + K +F + L + 639Sbjct: 63 NGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML---------- 112 640 641Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 642 IP + DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVDC H 643Sbjct: 113 KIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHA 172 644 645Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 646 +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + + A + 647Sbjct: 173 ----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRAEFAVANDT 223 648 649Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 293 650 F IP+ E + + + GP+++A DA QFY G++ P C+ +LDHG+L+VGY 651Sbjct: 224 GFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGY 283 652 653Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 654 + T KN YW+VKNSWG++WG +GYI + + + N CG++ S 
++ 655Sbjct: 284 GYEGTDSNKN-KYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333 656 657 658>sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 659 Length = 321 660 661 Score = 221 bits (558), Expect = 1e-57 662 Identities = 123/317 (38%), Positives = 182/317 (56%), Gaps = 37/317 (11%) 663 664Query: 32 FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-- 88 665 F+ ++ +KY +E L R +F+ N IE+ N N + K +N+F D++++EF 666Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82 667 668Query: 89 --KNYYLNNK---EAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143 669 K Y ++ +A+FT + + DWRT+ VTPVK+Q QCGSCW 670Sbjct: 83 VMKGYKKGSRGEPKAVFTAEA-----------GPMAADVDWRTKALVTPVKDQEQCGSCW 131 671 672Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 673 +FS TG +EGQHF+ ++LVSLSEQ LVDC + ++GC GG +A++YI 674Sbjct: 132 AFSATGALEGQHFLKNDELVSLSEQQLVDC--------STDYGNDGCGGGWMTSAFDYIK 183 675 676Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 263 677 NGGI TESSYPY AE C F++ +IGA + + E + + GP+++A D 678Sbjct: 184 DNGGIDTESSYPYEAE-DRSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAID 242 679 680Query: 264 A--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 320 681 A +QFY GV ++ C+P LDHG+L VGY ++T YW+VKNSWG+ WG+ 682Sbjct: 243 ASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTEST-----KDYWLVKNSWGSSWGDA 297 683 684Query: 321 GYIYLRRGK-NTCGVSN 336 685 GYI + R + N CG+++ 686Sbjct: 298 GYIKMSRNRDNNCGIAS 314 687 688 689>sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 690 Length = 323 691 692 Score = 221 bits (557), Expect = 2e-57 693 Identities = 131/342 (38%), Positives = 181/342 (52%), Gaps = 26/342 (7%) 694 695Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 696 +LF L V+ V S+ P + + F EF +FNK YS E E L RF+IF+ NL +I 697Sbjct: 4 ILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI---- 59 698 699Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 700 I N K+ +NKF+DLS DE Y T + LD P FD 701Sbjct: 60 
-INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPTQTQNFCKVILLDQP-PGKGPLEFD 117 702 703Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 704 WR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ ++DCD 705Sbjct: 118 WRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDF-------- 169 706 707Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN-FTMIP 242 708 D GCNGGL A+ IIK GG+Q ES YPY A+ C NS ++ + + I 709Sbjct: 170 --VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVKDCYRYII 226 710 711Query: 243 KNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 712 E + + GP+ +A DA + Y G+ C + L+H +L+VGY +N 713Sbjct: 227 VYEEKLKDLLPLVGPIPMAIDAADIVNYKQGIIKY-CFDSGLNHAVLLVGYGVEN----- 280 714 715Query: 303 NMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 343 716 N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I 717Sbjct: 281 NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 718 719 720>sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 721 Length = 324 722 723 Score = 220 bits (554), Expect = 4e-57 724 Identities = 131/344 (38%), Positives = 188/344 (54%), Gaps = 27/344 (7%) 725 726Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 727 M I+L++L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I 728Sbjct: 1 MNKIVLYLLVYGAVQCAAYDVL-KAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59 729 730Query: 60 EELNLIAINHKADT-KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI 118 731 I NH T ++ +NKFADLS DE + Y + T + LD + 732Sbjct: 60 -----INKNHNDSTAQYEINKFADLSKDETISKYTGLSLPLQTQNFCEVVVLDRP-PDKG 113 733 734Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 178 735 P FDWR VT VKNQG CG+CW+F+T G++E Q I N+ ++LSEQ L+DCD 736Sbjct: 114 PLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF--- 170 737 738Query: 179 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN- 237 739 D GC+GGL A+ ++ GGIQ ES YPY A G C N+A K+ 740Sbjct: 171 -------VDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNG-DCRANAAKFVVKVKKC 222 741 742Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 297 743 + I E + + S 
GP+ +A DA + Y G+ C + L+H +L+VGY+ +N 744Sbjct: 223 YRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKRGIMKY-CANHGLNHAVLLVGYAVEN 281 745 746Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 747 +P+WI+KN+WGADWGEQGY +++ N CG+ N + +S 748Sbjct: 282 -----GVPFWILKNTWGADWGEQGYFRVQQNINACGIQNELPSS 320 749 750 751>sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 752 Length = 334 753 754 Score = 218 bits (550), Expect = 1e-56 755 Identities = 124/342 (36%), Positives = 182/342 (52%), Gaps = 25/342 (7%) 756 757Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 758 F L V + V+S + P + + +++ + Y E R +++ N I+ N 759Sbjct: 5 FFLTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQ 64 760 761Query: 65 IAINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPTA 121 762 K + +N F D++++EF+ N + N K + + +P + 763Sbjct: 65 EYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-------KGKLFHEPLLVDVPKS 117 764 765Query: 122 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYE 181 766 DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC + 767Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA----Q 173 768 769Query: 182 GEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMI 241 770 G ++GCNGGL NA+ YI NGG+ +E SYPY A CN+ A + F I 771Sbjct: 174 G----NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI 229 772 773Query: 242 PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNT 298 774 P+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY + T 775Sbjct: 230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGT 289 776 777Query: 299 IFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 339 778 N +WIVKNSWG +WG GY+ + + +N CG++ S 779Sbjct: 290 DSNNN-KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAAS 330 780 781 782>sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 783 Length = 443 784 785 Score = 218 bits (549), Expect = 2e-56 786 Identities = 123/320 (38%), Positives = 177/320 (54%), Gaps = 34/320 (10%) 787 788Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 789 F EF+ + + Y E 
+R F+ NL + E H+A +FG+ KF DLS 790Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 791 792Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPTAFDWRTRGAVTPVKNQGQCGSC 142 793 EF YLN A + ++++P A DWR +GAVTPVK+QG CGSC 794Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150 795 796Query: 143 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 202 797 W+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A++++ 798Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQAFDWL 200 799 800Query: 203 IK--NGGIQTESSYPYTAETG--TQC-NFNSANIGAKISNFTMIPKNETVMAGYIVSTGP 257 801 ++ NG + TE SYPY + G +C N + +GA+I +I +E MA ++ GP 802Sbjct: 201 LQNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGP 260 803 804Query: 258 LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 805 +AIA DA + Y GV C L+HG+L+VGY + PYW++KNSWG DW 806Sbjct: 261 IAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIKNSWGGDW 314 807 808Query: 318 GEQGYIYLRRGKNTCGVSNF 337 809 GEQGY+ + G N C +S + 810Sbjct: 315 GEQGYVRVVMGVNACLLSEY 334 811 812 813>sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE 814 A-2) 815 Length = 444 816 817 Score = 218 bits (549), Expect = 2e-56 818 Identities = 123/321 (38%), Positives = 178/321 (55%), Gaps = 35/321 (10%) 819 820Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84 821 F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS 822Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90 823 824Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPTAFDWRTRGAVTPVKNQGQCGSC 142 825 EF YLN A + ++++P A DWR +GAVTPVK+QG CGSC 826Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150 827 828Query: 143 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 202 829 W+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A++++ 830Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQAFDWL 200 831 832Query: 203 
IK--NGGIQTESSYPYTAETG--TQCNFNSAN--IGAKISNFTMIPKNETVMAGYIVSTG 256 833 ++ NG + TE SYPY + G +C+ +S +GA+I +I +E MA ++ G 834Sbjct: 201 LQNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNG 260 835 836Query: 257 PLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGAD 316 837 P+AIA DA + Y GV C L+HG+L+VGY + PYW++KNSWG D 838Sbjct: 261 PIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIKNSWGGD 314 839 840Query: 317 WGEQGYIYLRRGKNTCGVSNF 337 841 WGEQGY+ + G N C +S + 842Sbjct: 315 WGEQGYVRVVMGVNACLLSEY 335 843 844 845>sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE) 846 (SULFHYDRYL-ENDOPEPTIDASE) (SH-EP) 847 Length = 362 848 849 Score = 217 bits (547), Expect = 3e-56 850 Identities = 127/306 (41%), Positives = 179/306 (57%), Gaps = 29/306 (9%) 851 852Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 853 +RF +FK+N+ + N + +K +NKFAD+++ EF++ Y +K +F 854Sbjct: 58 KRFNVFKANVMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHHKMFRGS 113 855 856Query: 104 LPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 857 + E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I NKLV 858Sbjct: 114 QHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLV 173 859 860Query: 164 SLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ 223 861 SLSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPYTA+ GT 862Sbjct: 174 SLSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGT- 223 863 864Query: 224 CNFNSAN-IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 280 865 C+ + N + I +P N+ V+ P+++A DA ++QFY GVF CN 866Sbjct: 224 CDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCN 283 867 868Query: 281 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSN 336 869 L+HG+ IVGY T+ N YWIV+NSWG +WGEQGYI ++R + CG++ 870Sbjct: 284 -TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAM 338 871 872Query: 337 FVSTSI 342 873 S I 874Sbjct: 339 MASYPI 344 875 876 877>sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 
878 Length = 333 879 880 Score = 215 bits (543), Expect = 8e-56 881 Identities = 124/341 (36%), Positives = 186/341 (54%), Gaps = 26/341 (7%) 882 883Query: 8 VLAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 884 +LA F + ++S + + ++Q+ +++ N+ Y E R +++ N+ IE N 885Sbjct: 6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQE 65 886 887Query: 66 AINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPTAF 122 888 K +N F D++S+EF+ N + N K + P + 889Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-------KGKVFQEPLFYEAPRSV 118 890 891Query: 123 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 182 892 DWR +G VTPVKNQGQCGSCW+FS TG +EGQ F +L+SLSEQNLVDC G 893Sbjct: 119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDC-------SG 171 894 895Query: 183 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP 242 896 + +EGCNGGL A+ Y+ NGG+ +E SYPY A T C +N A + F IP 897Sbjct: 172 PQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNPKYSVANDTGFVDIP 229 898 899Query: 243 KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTI 299 900 K E + + + GP+++A DA + FY G+ F+ C+ +DHG+L+VGY ++T 901Sbjct: 230 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST- 288 902 903Query: 300 FRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 339 904 N YW+VKNSWG +WG GY+ + + +N CG+++ S 905Sbjct: 289 ESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 329 906 907 908>sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 909 Length = 334 910 911 Score = 214 bits (540), Expect = 2e-55 912 Identities = 117/307 (38%), Positives = 165/307 (53%), Gaps = 23/307 (7%) 913 914Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---NYYLNNK 96 915 Y E R +++ N+ IE N K +N F D++++EF+ N + N K 916Sbjct: 40 YGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQK 99 917 918Query: 97 EAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHF 156 919 + + +P + DWR +G VT VKNQGQCGSCW+FS TG +EGQ F 920Sbjct: 100 HK-------KGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMF 152 921 922Query: 157 
ISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPY 216 923 KLVSLSEQNLVDC +G ++GCNGGL NA+ Y+ NGG+ TE SYPY 924Sbjct: 153 RKTGKLVSLSEQNLVDCSRP----QG----NQGCNGGLMDNAFQYVKDNGGLDTEESYPY 204 925 926Query: 217 TAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV 274 927 C + A + F IP+ E + + + GP+++A DA +QFY G+ 928Sbjct: 205 LGRETNSCTYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGI 264 929 930Query: 275 -FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-C 332 931 +D C+ LDHG+L+VGY + T + +WIVKNSWG +WG GY+ + + +N C 932Sbjct: 265 YYDPDCSSKDLDHGVLVVGYGFEGT-DSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHC 323 933 934Query: 333 GVSNFVS 339 935 G+S S 936Sbjct: 324 GISTAAS 330 937 938 939>sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 940 Length = 323 941 942 Score = 213 bits (538), Expect = 3e-55 943 Identities = 129/342 (37%), Positives = 179/342 (51%), Gaps = 26/342 (7%) 944 945Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 946 +LF L V+ V S+ + + F EF +FNK Y E E L RF+IF+ NL +I 947Sbjct: 4 ILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI---- 59 948 949Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 950 I N K+ +NKF+DLS DE Y I T + LD P FD 951Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQP-PGKGPLEFD 117 952 953Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 954 WR VT VKNQG CG+CW+F+T ++E Q I N+L++LSEQ ++DCD 955Sbjct: 118 WRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDF-------- 169 956 957Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN-FTMIP 242 958 D GCNGGL A+ IIK GG+Q ES YPY A+ C NS ++ + + I 959Sbjct: 170 --VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVKDCYRYIT 226 960 961Query: 243 KNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 962 E + + GP+ +A DA + Y G+ C + L+H +L+VGY +N 963Sbjct: 227 VYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIKY-CFNSGLNHAVLLVGYGVEN----- 280 964 965Query: 303 NMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 343 966 N+PYW 
KN+WG DWGE G+ +++ N CG+ N ST++I 967Sbjct: 281 NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 968 969 970>sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 971 Length = 356 972 973 Score = 212 bits (535), Expect = 7e-55 974 Identities = 126/324 (38%), Positives = 176/324 (53%), Gaps = 34/324 (10%) 975 976Query: 29 FLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 977 F F + K+Y S EE +RFEIF NL I N +++K G+N+F DL+ DE 978Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYK----LGINEFTDLTWDE 112 979 980Query: 88 FKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWS 144 981 F+ + L N A +L + + + +P DWR G V+PVK QG+CGSCW+ 982Sbjct: 113 FRKHKLGASQNCSATTKGNLKLTNVV-------LPETKDWRKDGIVSPVKAQGKCGSCWT 165 983 984Query: 145 FSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 204 985 FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI 986Sbjct: 166 FSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNF--------GCNGGLPSQAFEYIKF 217 987 988Query: 205 NGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVS-TGPLAIAAD 263 989 NGG+ TE +YPYT + G C F+ ANIG K+ + I Y V+ P+++A + 990Sbjct: 218 NGGLDTEEAYPYTGKNGI-CKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFE 276 991 992Query: 264 AVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 993 V+ ++ Y GV+ + P ++H +L VGY +N PYW++KNSWGADWGE 994Sbjct: 277 VVKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVEN-----GTPYWLIKNSWGADWGE 331 995 996Query: 320 QGYIYLRRGKNTCGVSNFVSTSII 343 997 GY + GKN CGV+ S I+ 998Sbjct: 332 DGYFKMEMGKNMCGVATCASYPIV 355 999 1000 1001>sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 1002 Length = 334 1003 1004 Score = 210 bits (528), Expect = 5e-54 1005 Identities = 127/349 (36%), Positives = 191/349 (54%), Gaps = 35/349 (10%) 1006 1007Query: 5 LLFVLAVFTVFVSSRGIPPEEQS---QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 1008 L VLA F + ++S +P +Q+ ++ +++ + Y E R +++ N+ IE 1009Sbjct: 3 LSLVLAAFCLGIAS-AVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61 1010 1011Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNY---YLNNK---EAIFTDDLPVADYLDDEFI 115 1012 N K +N F 
D++++EF+ + N K +F + L +LD 1013Sbjct: 62 HNGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFRKGKVFREPL----FLD---- 113 1014 1015Query: 116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 1016 +P + DWR +G VTPVKNQ QCGSCW+FS TG +EGQ F KLVSLSEQNLVDC 1017Sbjct: 114 --LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171 1018 1019Query: 176 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI 235 1020 +G ++GCNGG A+ Y+ +NGG+ +E SYPY A C + N A 1021Sbjct: 172 P----QG----NQGCNGGFMARAFQYVKENGGLDSEESYPYVA-VDEICKYRPENSVAND 222 1022 1023Query: 236 SNFTMI-PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIV 291 1024 + FT++ P E + + + GP+++A DA +QFY G+ F+ C+ +LDHG+L+V 1025Sbjct: 223 TGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVV 282 1026 1027Query: 292 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 339 1028 GY + N YW+VKNSWG +WG GY+ + + KN CG++ S 1029Sbjct: 283 GYGFEGA-NSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAAS 330 1030 1031 1032>sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 1033 Length = 442 1034 1035 Score = 209 bits (527), Expect = 6e-54 1036 Identities = 116/300 (38%), Positives = 167/300 (55%), Gaps = 24/300 (8%) 1037 1038Query: 4 ILLFVLAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61 1039 +L F+ + + S++ E Q + F + + YS EE+ R++IFKSN+ + + 1040Sbjct: 3 VLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQ 62 1041 1042Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTA 121 1043 N + +T G+N FAD+++ E++ YL F + + F PT 1044Sbjct: 63 WN----SKGGETVLGLNVFADITNQEYRTTYLGTP---FDGSALIGTEEEKIFSTPAPTV 115 1045 1046Query: 122 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS---QNKLVSLSEQNLVDCDHECM 178 1047 DWR +GAVTP+KNQGQCG CWSFSTTG+ EG HFI+ + LVSLSEQNL+DC 1048Sbjct: 116 -DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCS---- 170 1049 1050Query: 179 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNF 238 1051 + + GC GGL + YII N GI TESSYPYTAE G +C F ++NIGA+I ++ 1052Sbjct: 171 
----KSYGNNGCEGGLMTLGFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSY 226 1053 1054Query: 239 TMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGYSA 295 1055 + + P+++A DA +Q Y G++ P C P LDHG+L+VGY + 1056Sbjct: 227 QNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACTPTQLDHGVLVVGYGS 286 1057 1058 1059 Score = 48.8 bits (114), Expect = 2e-05 1060 Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 1/35 (2%) 1061 1062Query: 306 YWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 1063 YWIVKNSWG WG GYI++ + + N CG++ S 1064Sbjct: 401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMAS 435 1065 1066 1067>sp|Q10991|CATL_SHEEP CATHEPSIN L 1068 Length = 217 1069 1070 Score = 209 bits (527), Expect = 6e-54 1071 Identities = 104/226 (46%), Positives = 140/226 (61%), Gaps = 17/226 (7%) 1072 1073Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 1074 +P + DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVD 1075Sbjct: 1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD----- 55 1076 1077Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 1078 ++GCNGGL NA+ YI +NGG+ +E SYPY A T T CN+ AK + 1079Sbjct: 56 ---SSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEA-TDTSCNYKPEYSAAKDTG 111 1080 1081Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYS 294 1082 F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY 1083Sbjct: 112 FVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG 171 1084 1085Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 339 1086 + T N +WIVKNSWG +WG +GY+ + + +N CG++ S 1087Sbjct: 172 FEGT----NNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAAS 213 1088 1089 1090>sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE EP-C1) 1091 Length = 362 1092 1093 Score = 208 bits (523), Expect = 2e-53 1094 Identities = 124/306 (40%), Positives = 176/306 (56%), Gaps = 29/306 (9%) 1095 1096Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103 1097 +RF +FK+NL + N + +K +NKFAD+++ EF++ Y +K +F 1098Sbjct: 58 
KRFNVFKANLMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHPRMFRGT 113 1099 1100Query: 104 LPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 1101 E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I NKLV 1102Sbjct: 114 PHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLV 173 1103 1104Query: 164 SLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ 223 1105 +LSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPY A+ GT 1106Sbjct: 174 ALSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGT- 223 1107 1108Query: 224 CNFNSAN-IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 280 1109 C+ + N + I +P N+ V+ P+++A DA ++QFY GVF C+ 1110Sbjct: 224 CDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCS 283 1111 1112Query: 281 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSN 336 1113 L+HG+ IVGY T+ N YWIV+NSWG +WGE GYI ++R + CG++ 1114Sbjct: 284 -TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAM 338 1115 1116Query: 337 FVSTSI 342 1117 S I 1118Sbjct: 339 LPSYPI 344 1119 1120 1121>sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 1122 Length = 380 1123 1124 Score = 206 bits (520), Expect = 4e-53 1125 Identities = 123/327 (37%), Positives = 176/327 (53%), Gaps = 35/327 (10%) 1126 1127Query: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADT----KFGVN 78 1128 E ++ + + K+ K Y S E+ RFEIFK L I+E H ADT K G+N 1129Sbjct: 37 EVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE-------HNADTNRSYKVGLN 89 1130 1131Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQ 138 1132 +FADL+ +EF++ YL ++ V++ + F +P+ DWR+ GAV +K+QG+ 1133Sbjct: 90 QFADLTDEEFRSTYLGFTSG--SNKTKVSNRYEPRFGQVLPSYVDWRSAGAVVDIKSQGE 147 1134 1135Query: 139 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 198 1136 CG CW+FS VEG + I L+SLSEQ L+DC G GCNGG + 1137Sbjct: 148 CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC--------GRTQNTRGCNGGYITDG 199 1138 1139Query: 199 YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG-AKISNFTMIPKNETVMAGYIVSTGP 257 1140 + +II NGGI TE +YPYTA+ G +CN + N I + +P N V+ P 1141Sbjct: 
200 FQFIINNGGINTEENYPYTAQDG-ECNLDLQNEKYVTIDTYENVPYNNEWALQTAVTYQP 258 1142 1143Query: 258 LAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 315 1144 +++A DA ++ Y G+F PC ++DH + IVGY + I YWIVKNSW 1145Sbjct: 259 VSVALDAAGDAFKHYSSGIFTGPCG-TAIDHAVTIVGYGTEGGI-----DYWIVKNSWDT 312 1146 1147Query: 316 DWGEQGYIYLRR---GKNTCGVSNFVS 339 1148 WGE+GY+ + R G TCG++ S 1149Sbjct: 313 TWGEEGYMRILRNVGGAGTCGIATMPS 339 1150 1151 1152>sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 1153 Length = 360 1154 1155 Score = 203 bits (510), Expect = 6e-52 1156 Identities = 125/304 (41%), Positives = 164/304 (53%), Gaps = 30/304 (9%) 1157 1158Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102 1159 +E RF +FK N+ I E N A K +NKF D+++ EF++ Y +K 1160Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQ---KKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRS 110 1161 1162Query: 103 DLPVADYLDD---EFINSIPTA-FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 158 1163 + E + S+P A DWR +GAVT VK+QGQCGSCW+FST +VEG + I 1164Sbjct: 111 QRGIQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIK 170 1165 1166Query: 159 QNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA 218 1167 +LVSLSEQ LVDCD + +EGCNGGL A+ +I KN GI TE SYPY 1168Sbjct: 171 TGELVSLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAE 220 1169 1170Query: 219 ETGTQCNFNSANIG-AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF 275 1171 + GT C N N I +P N V+ P++++ +A +QFY GVF 1172Sbjct: 221 QDGT-CASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVF 279 1173 1174Query: 276 DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNT 331 1175 C LDHG+ IVGY A R YWIVKNSWG +WGE GYI ++RG + 1176Sbjct: 280 TGRCG-TELDHGVAIVGYGAT----RDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGK 334 1177 1178Query: 332 CGVS 335 1179 CG++ 1180Sbjct: 335 CGIA 338 1181 1182 1183>sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 1184 Length = 471 1185 1186 Score = 203 bits (510), Expect = 6e-52 1187 Identities = 115/303 (37%), Positives = 165/303 (53%), Gaps = 25/303 (8%) 1188 1189Query: 44 
EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD 103 1190 E+ RF +F NL ++ N A + + G+N+FADL+++EF+ +L K A 1191Sbjct: 69 EHERRFLVFWDNLKFVDAHNARA-DEGGGFRLGMNRFADLTNEEFRATFLGAKVA--ERS 125 1192 1193Query: 104 LPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 1194 + + + +P + DWR +GAV PVKNQGQCGSCW+FS VE + + +++ 1195Sbjct: 126 RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMI 185 1196 1197Query: 164 SLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ 223 1198 +LSEQ LV+C + GCNGGL +A+++IIKNGGI TE YPY A G + 1199Sbjct: 186 TLSEQELVEC--------STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDG-K 236 1200 1201Query: 224 CNFNSANIG-AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 280 1202 C+ N N I F +P+N+ V+ P+++A +A E+Q Y GVF C 1203Sbjct: 237 CDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCG 296 1204 1205Query: 281 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVSN 336 1206 SLDHG++ VGY N YWIV+NSWG WGE GY+ + R N CG++ 1207Sbjct: 297 -TSLDHGVVAVGYGTDN-----GKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAM 350 1208 1209Query: 337 FVS 339 1210 S 1211Sbjct: 351 MAS 353 1212 1213 1214>sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 1215 Length = 360 1216 1217 Score = 201 bits (505), Expect = 2e-51 1218 Identities = 119/327 (36%), Positives = 175/327 (53%), Gaps = 36/327 (11%) 1219 1220Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 1221 +F F ++ K Y S E +RF IF +L + N ++++ G+N+FAD+S + 1222Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYR----LGINRFADMSWE 113 1223 1224Query: 87 EFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143 1225 EF+ L N A T ++ ++P DWR G V+PVKNQG CGSCW 1226Sbjct: 114 EFRATRLGAAQNCSATLT-----GNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCW 168 1227 1228Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 1229 +FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI 1230Sbjct: 169 TFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNF--------GCNGGLPSQAFEYIK 220 1231 1232Query: 204 
KNGGIQTESSYPYTAETGTQCNFNSANIGAKI---SNFTMIPKNETVMAGYIVSTGPLAI 260 1233 NGG+ TE SYPY G C F + N+G K+ N T+ ++E A +V P+++ 1234Sbjct: 221 YNGGLDTEESYPYQGVNGI-CKFKNENVGVKVLDSVNITLGAEDELKDAVGLVR--PVSV 277 1235 1236Query: 261 AADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGAD 316 1237 A + + ++ Y GV+ P ++H +L VGY ++ +PYW++KNSWGAD 1238Sbjct: 278 AFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVED-----GVPYWLIKNSWGAD 332 1239 1240Query: 317 WGEQGYIYLRRGKNTCGVSNFVSTSII 343 1241 WG++GY + GKN CGV+ S I+ 1242Sbjct: 333 WGDEGYFKMEMGKNMCGVATCASYPIV 359 1243 1244 1245>sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 1246 Length = 333 1247 1248 Score = 200 bits (504), Expect = 3e-51 1249 Identities = 121/324 (37%), Positives = 172/324 (52%), Gaps = 28/324 (8%) 1250 1251Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 1252 E+ F + + K YS EY R ++F +N KI+ N NH K G+N+F+D+S 1253Sbjct: 29 EKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHN--QRNHTF--KMGLNQFSDMS 84 1254 1255Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRG-AVTPVKNQGQCGSCW 143 1256 E K+ YL ++ ++YL P++ DWR +G V+PVKNQG CGSCW 1257Sbjct: 85 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPSSMDWRKKGNVVSPVKNQGACGSCW 140 1258 1259Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 1260 +FSTTG +E I+ K+++L+EQ LVDC + + GC GGL A+ YI+ 1261Sbjct: 141 TFSTTGALESAVAIASGKMMTLAEQQLVDC--------AQNFNNHGCQGGLPSQAFEYIL 192 1262 1263Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAA 262 1264 N GI E SYPY + G QC FN A + N I N E M + P++ A 1265Sbjct: 193 YNKGIMGEDSYPYIGKNG-QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251 1266 1267Query: 263 DAVE-WQFYIGGVFDI-PCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 1268 + E + Y GV+ C+ P+ ++H +L VGY +N + YWIVKNSWG++WG 1269Sbjct: 252 EVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVKNSWGSNWG 306 1270 1271Query: 319 EQGYIYLRRGKNTCGVSNFVSTSI 342 1272 GY + RGKN CG++ S I 1273Sbjct: 307 NNGYFLIERGKNMCGLAACASYPI 330 1274 1275 1276>sp|O10364|CATV_NPVOP VIRAL 
CATHEPSIN (V-CATH) 1277 Length = 324 1278 1279 Score = 199 bits (500), Expect = 9e-51 1280 Identities = 118/316 (37%), Positives = 170/316 (53%), Gaps = 26/316 (8%) 1281 1282Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 1283 F +F KFNK YS E E L RF+IF+ NL +I N + + ++ +NKF+DLS +E 1284Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKN----QNDSTAQYEINKFSDLSKEE 83 1285 1286Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 1287 + Y T + LD P FDWR VT VKNQG CG+CW+F+T 1288Sbjct: 84 AISKYTGLSLPHQTQNFCEVVILDRPPDRG-PLEFDWRQFNKVTSVKNQGVCGACWAFAT 142 1289 1290Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 1291 G++E Q I N+L++LSEQ +DCD + GC+GGL A+ ++ GG 1292Sbjct: 143 LGSLESQFAIKYNRLINLSEQQFIDCDR----------VNAGCDGGLLHTAFESAMEMGG 192 1293 1294Query: 208 IQTESSYPYTAETGTQCNFNSAN--IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 265 1295 +Q ES YPY G QC N +G + S I E + + + GP+ +A DA 1296Sbjct: 193 VQMESDYPYETANG-QCRINPNRFVVGVR-SCRRYIVMFEEKLKDLLRAVGPIPVAIDAS 250 1297 1298Query: 266 EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 1299 + Y G+ C + L+H +L+VGY+ +N N+PYWI+KN+WG DWGE GY + 1300Sbjct: 251 DIVNYRRGIMR-QCANHGLNHAVLLVGYAVEN-----NIPYWILKNTWGTDWGEDGYFRV 304 1301 1302Query: 326 RRGKNTCGVSNFVSTS 341 1303 ++ N CG+ N + +S 1304Sbjct: 305 QQNINACGIRNELVSS 320 1305 1306 1307>sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 1308 Length = 462 1309 1310 Score = 198 bits (498), Expect = 2e-50 1311 Identities = 119/313 (38%), Positives = 165/313 (52%), Gaps = 35/313 (11%) 1312 1313Query: 35 KFNKKYSHEEYLE---RFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNY 91 1314 K K S +E RFEIFK NL ++E N ++++ G+ +FADL++DE+++ 1315Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYR----LGLTRFADLTNDEYRSK 111 1316 1317Query: 92 YLN---NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTT 148 1318 YL K+ L + DE +P + DWR +GAV VK+QG CGSCW+FST 1319Sbjct: 112 YLGAKMEKKGERRTSLRYEARVGDE----LPESIDWRKKGAVAEVKDQGGCGSCWAFSTI 167 1320 1321Query: 149 
GNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 208 1322 G VEG + I L++LSEQ LVDCD + +EGCNGGL A+ +IIKNGGI 1323Sbjct: 168 GAVEGINQIVTGDLITLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIIKNGGI 218 1324 1325Query: 209 QTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VE 266 1326 T+ YPY GT I ++ +P V+ P++IA +A 1327Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278 1328 1329Query: 267 WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLR 326 1330 +Q Y G+FD C LDHG++ VGY +N YWIV+NSWG WGE GY+ + 1331Sbjct: 279 FQLYDSGIFDGSCG-TQLDHGVVAVGYGTEN-----GKDYWIVRNSWGKSWGESGYLRMA 332 1332 1333Query: 327 R----GKNTCGVS 335 1334 R CG++ 1335Sbjct: 333 RNIASSSGKCGIA 345 1336 1337 1338>sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 1339 Length = 333 1340 1341 Score = 198 bits (497), Expect = 2e-50 1342 Identities = 115/348 (33%), Positives = 184/348 (52%), Gaps = 22/348 (6%) 1343 1344Query: 3 VILLFVLAVFTVFVSSRGIPPEEQS--QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 1345 +I + LA+ + V S P+ ++ E++ K K Y+ E + +++ N IE 1346Sbjct: 1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIE 60 1347 1348Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIP 119 1349 N + + D +N F DL++ EF ++ I + + D +F+ +P 1350Sbjct: 61 LHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHI----FQDHQFLY-VP 115 1351 1352Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 1353 DWR G VTPVKNQG C S W+FS TG++EGQ F +L+ LSEQNL+DC + 1354Sbjct: 116 KRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVT 175 1355 1356Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 1357 + GC+GG A+ Y+ NGG+ TE SYPY + G +C +++ N A + +F 1358Sbjct: 176 H--------GCSGGFMQYAFQYVKDNGGLATEESYPYRGQ-GRECRYHAENSAANVRDFV 226 1359 1360Query: 240 MIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSAK 296 1361 IP +E + + GP+++A DA +QFY G++ P C L+H +L+VGY + 1362Sbjct: 227 QIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFE 286 1363 1364Query: 
297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 1365 N +W+VKNSWG +WG +GY+ L + N CG++ + + I+ 1366Sbjct: 287 GEESDGN-SFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333 1367 1368 1369>sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 1370 Length = 458 1371 1372 Score = 198 bits (497), Expect = 2e-50 1373 Identities = 122/348 (35%), Positives = 182/348 (52%), Gaps = 37/348 (10%) 1374 1375Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ--FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 1376 ++LL LA + + S G EE+++ + E++ + K Y+ E R+ F+ NL I 1377Sbjct: 12 LLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYI 71 1378 1379Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-----NKEAIFTDDLPVADYLDDEF 114 1380 +E N A + G+N+FADL+++E+++ YL +E +D AD 1381Sbjct: 72 DEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADN----- 126 1382 1383Query: 115 INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174 1384 ++P + DWRT+GAV +K+QG CGSCW+FS VE + I L+SLSEQ LVDCD 1385Sbjct: 127 -EALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCD 185 1386 1387Query: 175 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG-A 233 1388 + +EGCNGGL A+++II NGGI TE YPY + +C+ N N 1389Sbjct: 186 ---------TSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGK-DERCDVNRKNAKVV 235 1390 1391Query: 234 KISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIV 291 1392 I ++ + N V P+++A +A +Q Y G+F C +LDHG+ V 1393Sbjct: 236 TIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCG-TALDHGVAAV 294 1394 1395Query: 292 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 335 1396 GY +N YWIV+NSWG WGE GY+ + R CG++ 1397Sbjct: 295 GYGTEN-----GKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIA 337 1398 1399 1400>sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II) (PPII) 1401 Length = 352 1402 1403 Score = 194 bits (488), Expect = 2e-49 1404 Identities = 125/315 (39%), Positives = 167/315 (52%), Gaps = 43/315 (13%) 1405 1406Query: 35 KFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 1407 K NK Y S +E + RFEIF+ NL I+E N K + + G+N FADLS+DEFK 
1408Sbjct: 54 KHNKIYESIDEKIYRFEIFRDNLMYIDETN------KKNNSYWLGLNGFADLSNDEFKKK 107 1409 1410Query: 92 YLNNKEAIFTDDLPVADYLDDE-----FINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 1411 Y+ +D ++ D+E + + P + DWR +GAVTPVKNQG CGSCW+FS 1412Sbjct: 108 YVG----FVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFS 163 1413 1414Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 1415 T VEG + I L+ LSEQ LVDCD GC GG Q + Y + N 1416Sbjct: 164 TIATVEGINKIVTGNLLELSEQELVDCDKH----------SYGCKGGYQTTSLQY-VANN 212 1417 1418Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADA- 264 1419 G+ T YPY A+ + KI+ + +P N ET G + + PL++ +A 1420Sbjct: 213 GVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQ-PLSVLVEAG 271 1421 1422Query: 265 -VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 1423 +Q Y GVFD PC LDH + VGY + KN Y I+KNSWG +WGE+GY+ 1424Sbjct: 272 GKPFQLYKSGVFDGPCG-TKLDHAVTAVGYGTSD---GKN--YIIIKNSWGPNWGEKGYM 325 1425 1426Query: 324 YLRR----GKNTCGV 334 1427 L+R + TCGV 1428Sbjct: 326 RLKRQSGNSQGTCGV 340 1429 1430 1431>sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEPSIN X) (CATHEPSIN O2) 1432 Length = 329 1433 1434 Score = 194 bits (488), Expect = 2e-49 1435 Identities = 121/339 (35%), Positives = 180/339 (52%), Gaps = 25/339 (7%) 1436 1437Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 1438 L V + V S + PEE + + ++ K+Y+++ + + R I++ NL I NL 1439Sbjct: 4 LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63 1440 1441Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 1442 A + +N D++S+E K + Y+ E+ P + D+R 1443Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIP-EWEGRAPDSVDYR 122 1444 1445Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 1446 +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 1447Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--------- 173 1448 1449Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK-N 244 1450 ++GC GG NA+ 
Y+ KN GI +E +YPY + C +N AK + IP+ N 1451Sbjct: 174 -NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE-ESCMYNPTGKAAKCRGYREIPEGN 231 1452 1453Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 1454 E + + GP+++A DA +QFY GV +D CN ++L+H +L VGY + 1455Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-----IQ 286 1456 1457Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 1458 K +WI+KNSWG +WG +GYI + R K N CG++N S 1459Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325 1460 1461 1462>sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 1463 Length = 362 1464 1465 Score = 194 bits (487), Expect = 3e-49 1466 Identities = 114/327 (34%), Positives = 171/327 (51%), Gaps = 37/327 (11%) 1467 1468Query: 28 QFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 1469 +F F + K+Y E RF IF +L + N + ++ G+N+FAD+S + 1470Sbjct: 61 RFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYR----LGINRFADMSWE 116 1471 1472Query: 87 EFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143 1473 EF+ L N A + + D ++P DWR G V+PVK+QG CGSCW 1474Sbjct: 117 EFQASRLGAAQNCSATLAGNHRMRD------APALPETKDWREDGIVSPVKDQGHCGSCW 170 1475 1476Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 1477 FSTTG++E ++ + VSLSEQ L DC + GC+GGL A+ YI 1478Sbjct: 171 PFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNF--------GCSGGLPSQAFEYIK 222 1479 1480Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKI---SNFTMIPKNETVMAGYIVSTGPLAI 260 1481 NGG+ TE +YPYT G C++ N G K+ N T++ ++E A +V P+++ 1482Sbjct: 223 YNGGLDTEEAYPYTGVNGI-CHYKPENAGVKVLDSVNITLVAEDELKNAVGLVR--PVSV 279 1483 1484Query: 261 AADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGAD 316 1485 A + ++ Y GV+ +P ++H +L VGY +N +PYW++KNSWGAD 1486Sbjct: 280 AFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVEN-----GVPYWLIKNSWGAD 334 1487 1488Query: 317 WGEQGYIYLRRGKNTCGVSNFVSTSII 343 1489 WG+ GY + GKN CG++ S I+ 1490Sbjct: 335 WGDNGYFTMEMGKNMCGIATCASYPIV 361 1491 1492 1493>sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 1494 Length = 328 1495 1496 
Score = 194 bits (487), Expect = 3e-49 1497 Identities = 115/300 (38%), Positives = 160/300 (53%), Gaps = 29/300 (9%) 1498 1499Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK----EAIFTD 102 1500 ERF IFK NL I+ N N A K G+ FA+L++DE+++ YL + I 1501Sbjct: 27 ERFNIFKDNLRFIDLHN--ENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITKA 84 1502 1503Query: 103 DLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKL 162 1504 Y ++ +P DWR +GAV +K+QG CGSCW+FST VEG + I +L 1505Sbjct: 85 KNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGEL 144 1506 1507Query: 163 VSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGT 222 1508 VSLSEQ LVDCD ++ ++GCNGGL A+ +I+KNGG+ TE YPY G 1509Sbjct: 145 VSLSEQELVDCD---------KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNG- 194 1510 1511Query: 223 QCNFNSANIG-AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPC 279 1512 +CN N I + +P + VS P+++A DA +Q Y G+F C 1513Sbjct: 195 KCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKC 254 1514 1515Query: 280 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVS 335 1516 N +DH ++ VGY ++N + YWIV+NSWG WGE GYI + R CG++ 1517Sbjct: 255 GTN-MDHAVVAVGYGSEN-----GVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIA 308 1518 1519 1520>sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 1521 Length = 335 1522 1523 Score = 192 bits (482), Expect = 1e-48 1524 Identities = 122/326 (37%), Positives = 168/326 (51%), Gaps = 32/326 (9%) 1525 1526Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 1527 E+ F + K K YS EEY R + F SN KI N N K +N+F+D+S 1528Sbjct: 31 EKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHN----NGNHTFKMALNQFSDMS 86 1529 1530Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA-VTPVKNQGQCGSCW 143 1531 E K+ YL ++ ++YL P + DWR +G V+PVKNQG CGSCW 1532Sbjct: 87 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPPSVDWRKKGNFVSPVKNQGACGSCW 142 1533 1534Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 1535 +FSTTG +E I+ K++SL+EQ LVDC + Y GC GGL A+ YI+ 1536Sbjct: 143 
TFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNY--------GCQGGLPSQAFEYIL 194 1537 1538Query: 204 KNGGIQTESSYPYTAETGTQCNFNSAN-IG--AKISNFTMIPKNETVMAGYIVSTGPLAI 260 1539 N GI E +YPY + G C F IG ++N T+ +E M + P++ 1540Sbjct: 195 YNKGIMGEDTYPYQGKDG-YCKFQPGKAIGFVKDVANITIY--DEEAMVEAVALYNPVSF 251 1541 1542Query: 261 AADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGAD 316 1543 A + ++ Y G++ C+ P+ ++H +L VGY KN I PYWIVKNSWG 1544Sbjct: 252 AFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGI-----PYWIVKNSWGPQ 306 1545 1546Query: 317 WGEQGYIYLRRGKNTCGVSNFVSTSI 342 1547 WG GY + RGKN CG++ S I 1548Sbjct: 307 WGMNGYFLIERGKNMCGLAACASYPI 332 1549 1550 1551>sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 1552 Length = 335 1553 1554 Score = 191 bits (481), Expect = 2e-48 1555 Identities = 122/332 (36%), Positives = 171/332 (50%), Gaps = 28/332 (8%) 1556 1557Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76 1558 S+ + E+ F + + KKYS EEY R ++F SN KI N A NH K G 1559Sbjct: 23 SNLAVSSFEKLHFKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHN--AGNHTF--KLG 78 1560 1561Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA-VTPVKN 135 1562 +N+F+D+S DE ++ YL ++ +YL P + DWR +G V+PVKN 1563Sbjct: 79 LNQFSDMSFDEIRHKYLWSEPQ--NCSATKGNYLRGT--GPYPPSMDWRKKGNFVSPVKN 134 1564 1565Query: 136 QGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQ 195 1566 QG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + + GC GGL 1567Sbjct: 135 QGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDC--------AQNFNNHGCQGGLP 186 1568 1569Query: 196 PNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVS 254 1570 A+ YI N GI E +YPY + C F A + + I N E M + 1571Sbjct: 187 SQAFEYIRYNKGIMGEDTYPYKGQ-DDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVAL 245 1572 1573Query: 255 TGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVK 310 1574 P++ A + ++ Y G++ C+ P+ ++H +L VGY +N I PYWIVK 1575Sbjct: 246 YNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI-----PYWIVK 300 1576 1577Query: 311 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 1578 NSWG WG GY + RGKN CG++ S I 1579Sbjct: 301 
NSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 1580 1581 1582>sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 1583 Length = 362 1584 1585 Score = 191 bits (481), Expect = 2e-48 1586 Identities = 111/322 (34%), Positives = 167/322 (51%), Gaps = 27/322 (8%) 1587 1588Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 1589 +F F ++ K Y S E RF IF +L ++ N + ++ G+N+F+D+S + 1590Sbjct: 60 RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWE 115 1591 1592Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 1593 EF+ L A T +A ++P DWR G V+PVKNQ CGSCW+FS 1594Sbjct: 116 EFQATRLG---AAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFS 172 1595 1596Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 1597 TTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI NG 1598Sbjct: 173 TTGALEAAYTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYIKYNG 224 1599 1600Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADAV 265 1601 GI TE SYPY G C++ + N ++ + I N E + + P+++A + 1602Sbjct: 225 GIDTEESYPYKGVNGV-CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVI 283 1603 1604Query: 266 E-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321 1605 + ++ Y GV+ P+ ++H +L VGY +N +PYW++KNSWGADWG+ G 1606Sbjct: 284 DGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVEN-----GVPYWLIKNSWGADWGDNG 338 1607 1608Query: 322 YIYLRRGKNTCGVSNFVSTSII 343 1609 Y + GKN C ++ S ++ 1610Sbjct: 339 YFKMEMGKNMCAIATCASYPVV 360 1611 1612 1613>sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 1614 Length = 329 1615 1616 Score = 191 bits (480), Expect = 2e-48 1617 Identities = 119/339 (35%), Positives = 179/339 (52%), Gaps = 25/339 (7%) 1618 1619Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 1620 L V + V S + PEE +Q+ ++ ++K+Y+ + + + R I++ NL I NL 1621Sbjct: 4 LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63 1622 1623Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 1624 A + +N D++S+E K Y+ D + P + D+R 1625Sbjct: 64 
ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSRSHSNDTLYIPD-WEGRTPDSIDYR 122 1626 1627Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 1628 +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 1629Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--------- 173 1630 1631Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK-N 244 1632 + GC GG NA+ Y+ +N GI +E +YPY + C +N AK + IP+ N 1633Sbjct: 174 -NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQ-DESCMYNPTGKAAKCRGYREIPEGN 231 1634 1635Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 1636 E + + GP+++A DA +QFY GV +D C+ ++++H +L VGY + 1637Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYG-----IQ 286 1638 1639Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 1640 K +WI+KNSWG WG +GYI + R K N CG++N S 1641Sbjct: 287 KGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLAS 325 1642 1643 1644>sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA) (PAPAYA PROTEINASE 1645 III) (PPIII) (PAPAYA PEPTIDASE A) 1646 Length = 348 1647 1648 Score = 190 bits (479), Expect = 3e-48 1649 Identities = 122/319 (38%), Positives = 166/319 (51%), Gaps = 46/319 (14%) 1650 1651Query: 37 NKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNYYL 93 1652 NK Y + +E L RFEIFK NL I+E N K + + G+N+FADLS+DEF Y+ 1653Sbjct: 56 NKFYENVDEKLYRFEIFKDNLNYIDETN------KKNNSYWLGLNEFADLSNDEFNEKYV 109 1654 1655Query: 94 NNKEAIFTDDLPVADYLDDEFIN----SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 149 1656 + D + D+EFIN ++P DWR +GAVTPV++QG CGSCW+FS 1657Sbjct: 110 GS-----LIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVA 164 1658 1659Query: 150 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 209 1660 VEG + I KLV LSEQ LVDC+ GC GG P A Y+ KN GI 1661Sbjct: 165 TVEGINKIRTGKLVELSEQELVDCERR----------SHGCKGGYPPYALEYVAKN-GIH 213 1662 1663Query: 210 TESSYPYTAETGTQCNFNSANIGAKISNFTMI----PKNETVMAGYIVSTGPLAIAADAV 265 1664 S YPY A+ GT + +G I + + P NE + I P+++ ++ 1665Sbjct: 214 
LRSKYPYKAKQGT---CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQ-PVSVVVESK 269 1666 1667Query: 266 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 1668 +Q Y GG+F+ PC +DH + VGY Y ++KNSWG WGE+GYI 1669Sbjct: 270 GRPFQLYKGGIFEGPCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGTAWGEKGYI 323 1670 1671Query: 324 YLRRGK-NTCGVSNFVSTS 341 1672 ++R N+ GV +S 1673Sbjct: 324 RIKRAPGNSPGVCGLYKSS 342 1674 1675 1676>sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 1677 Length = 333 1678 1679 Score = 189 bits (476), Expect = 6e-48 1680 Identities = 117/324 (36%), Positives = 167/324 (51%), Gaps = 28/324 (8%) 1681 1682Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 1683 E+ F + + K YS EY R ++F +N KI+ N NH K +N+F+D+S 1684Sbjct: 29 EKFHFKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHN--QRNHTF--KMALNQFSDMS 84 1685 1686Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRG-AVTPVKNQGQCGSCW 143 1687 E K+ +L ++ ++YL P++ DWR +G V+PVKNQG C SCW 1688Sbjct: 85 FAEIKHKFLWSEPQ--NCSATKSNYLRGT--GPYPSSMDWRKKGNVVSPVKNQGACASCW 140 1689 1690Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 1691 +FSTTG +E I+ K++SL+EQ LVDC + + GC GGL A+ YI+ 1692Sbjct: 141 TFSTTGALESAVAIASGKMLSLAEQQLVDC--------AQAFNNHGCKGGLPSQAFEYIL 192 1693 1694Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAA 262 1695 N GI E SYPY + + C FN A + N I N E M + P++ A 1696Sbjct: 193 YNKGIMEEDSYPYIGK-DSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251 1697 1698Query: 263 DAVE-WQFYIGGVFDIPC---NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 1699 + E + Y GV+ P+ ++H +L VGY +N + YWIVKNSWG+ WG 1700Sbjct: 252 EVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVKNSWGSQWG 306 1701 1702Query: 319 EQGYIYLRRGKNTCGVSNFVSTSI 342 1703 E GY + RGKN CG++ S I 1704Sbjct: 307 ENGYFLIERGKNMCGLAACASYPI 330 1705 1706 1707>sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 1708 Length = 329 1709 1710 Score = 188 bits (473), Expect = 1e-47 1711 Identities = 117/344 (34%), Positives = 181/344 (52%), Gaps = 35/344 
(10%) 1712 1713Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 1714 L V + + S + PEE +Q+ ++ K+Y+ + + + R I++ NL +I NL 1715Sbjct: 4 LKVLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLE 63 1716 1717Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD-----EFINSIPT 120 1718 A + +N D++S+E + P Y +D E+ +P 1719Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIP------PSRSYSNDTLYTPEWEGRVPD 117 1720 1721Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 1722 + D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 1723Sbjct: 118 SIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE---- 173 1724 1725Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240 1726 + GC GG A+ Y+ +NGGI +E ++PY + C +N+ AK + 1727Sbjct: 174 ------NYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQ-DESCMYNATAKAAKCRGYRE 226 1728 1729Query: 241 IP-KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAK 296 1730 IP NE + + GP++++ DA +QFY GV +D C+ ++++H +L+VGY 1731Sbjct: 227 IPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGT- 285 1732 1733Query: 297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 1734 +K +WI+KNSWG WG +GY L R K N CG++N S 1735Sbjct: 286 ----QKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325 1736 1737 1738>sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 1739 Length = 371 1740 1741 Score = 185 bits (466), Expect = 9e-47 1742 Identities = 110/338 (32%), Positives = 164/338 (47%), Gaps = 32/338 (9%) 1743 1744Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 1745 P E + F FQ +FN+ Y + EY R IF NL + + L + +FG F 1746Sbjct: 33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLG---TAEFGETPF 89 1747 1748Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR-TRGAVTPVKNQGQC 139 1749 +DL+ +EF Y + T ++ + + S+P DWR + ++ VKNQG C 1750Sbjct: 90 SDLTEEEFGQLYGQERSPERTPNM-TKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSC 148 1751 1752Query: 140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 199 1753 CW+ + N++ I + V +S Q L+DC E C 
GCNGG +AY 1754Sbjct: 149 KCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC----------ERCGNGCNGGFVWDAY 198 1755 1756Query: 200 NYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 1757 ++ N G+ +E YP+ + +C A I +FTM+ NE +A Y+ GP+ 1758Sbjct: 199 LTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPI 258 1759 1760Query: 259 AIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSAKN------TIF------RKN 303 1761 + + Q Y GV C+P +DH +L+VG+ K T+ R + 1762Sbjct: 259 TVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHS 318 1763 1764Query: 304 MPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 1765 PYWI+KNSWGA WGE+GY L RG NTCGV+ + T+ 1766Sbjct: 319 SPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTA 356 1767 1768 1769>sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 1770 Length = 373 1771 1772 Score = 184 bits (463), Expect = 2e-46 1773 Identities = 123/345 (35%), Positives = 172/345 (49%), Gaps = 40/345 (11%) 1774 1775Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 1776 VLAV V + S IP E++ E +Q + H E RF FKSN 1777Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 1778 1779Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLP-VADYLDDEF- 114 1780 I + N + D + + N+F D+ EF+ ++ + P V ++ 1781Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALN 130 1782 1783Query: 115 INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174 1784 ++ +P + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L+DCD 1785Sbjct: 131 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 190 1786 1787Query: 175 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF----NSAN 230 1788 A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN ++ 1789Sbjct: 191 ---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNVARAAQNSP 240 1790 1791Query: 231 IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGI 288 1792 + I +P N V+ P+++A +A + FY GVF C LDHG+ 1793Sbjct: 241 VVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECG-TELDHGV 299 1794 1795Query: 289 
LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 333 1796 +VGY + YW VKNSWG WGEQGYI + + G 1797Sbjct: 300 AVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 1798 1799 1800>sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 1801 Length = 371 1802 1803 Score = 184 bits (462), Expect = 3e-46 1804 Identities = 124/349 (35%), Positives = 171/349 (48%), Gaps = 48/349 (13%) 1805 1806Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58 1807 VLAV V + S IP E++ E +Q + H E RF FKSN 1808Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75 1809 1810Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF-- 114 1811 I + N + D + + N+F D+ EF+ ++ + D P F 1812Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRR----DTPAKPPSVPGFMY 126 1813 1814Query: 115 ----INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 170 1815 ++ +P + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L 1816Sbjct: 127 AALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQEL 186 1817 1818Query: 171 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF---- 226 1819 +DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN 1820Sbjct: 187 IDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNVARAA 236 1821 1822Query: 227 NSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 284 1823 ++ + I +P N V+ P+++A +A + FY GVF C L 1824Sbjct: 237 QNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCG-TEL 295 1825 1826Query: 285 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 333 1827 DHG+ +VGY + YW VKNSWG WGEQGYI + + G 1828Sbjct: 296 DHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340 1829 1830 1831>sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPAYA PEPTIDASE B) (GLYCYL 1832 ENDOPEPTIDASE) 1833 Length = 348 1834 1835 Score = 183 bits (461), Expect = 3e-46 1836 Identities = 112/309 (36%), Positives = 164/309 (52%), Gaps = 33/309 (10%) 1837 1838Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYL 93 1839 K NK Y + +E L RFEIFK NL I+E N + + 
G+N+F+DLS+DEFK Y+ 1840Sbjct: 54 KHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYW----LGLNEFSDLSNDEFKEKYV 109 1841 1842Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 153 1843 + +T+ ++++++ ++ +P + DWR +GAVTPVK+QG C SCW+FST VEG 1844Sbjct: 110 GSLPEDYTNQPYDEEFVNEDIVD-LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEG 168 1845 1846Query: 154 QHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESS 213 1847 + I LV LSEQ LVDCD + GCN G Q + Y+ +N GI + 1848Sbjct: 169 INKIKTGNLVELSEQELVDCDKQ----------SYGCNRGYQSTSLQYVAQN-GIHLRAK 217 1849 1850Query: 214 YPYTAETGTQCNFNSANIGAKI--SNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQF 269 1851 YPY A+ T C N G K+ + + N ++ P+++ ++ ++Q 1852Sbjct: 218 YPYIAKQQT-CRANQVG-GPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQN 275 1853 1854Query: 270 YIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK 329 1855 Y GG+F+ C +DH + VGY Y ++KNSWG WGE GYI +RR 1856Sbjct: 276 YKGGIFEGSCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGPGWGENGYIRIRRAS 329 1857 1858Query: 330 ----NTCGV 334 1859 CGV 1860Sbjct: 330 GNSPGVCGV 338 1861 1862 1863>sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 1864 Length = 379 1865 1866 Score = 183 bits (461), Expect = 3e-46 1867 Identities = 108/318 (33%), Positives = 171/318 (52%), Gaps = 38/318 (11%) 1868 1869Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLNNKE 97 1870 ++HEE +R EIFK+N I ++N N K+ + G+NKFAD++ EF YL + 1871Sbjct: 56 HNHEEEAKRLEIFKNNSNYIRDMNA---NRKSPHSHRLGLNKFADITPQEFSKKYLQAPK 112 1872 1873Query: 98 AIFTDDLPVADYL---DDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ 154 1874 + + + +A+ + + P ++DWR +G +T VK QG CG W+FS TG +E 1875Sbjct: 113 DV-SQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRGWAFSATGAIEAA 171 1876 1877Query: 155 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY 214 1878 H I+ LVSLSEQ LVDC E EG G Q ++ +++++GGI T+ Y 1879Sbjct: 172 HAIATGDLVSLSEQELVDCVEE----------SEGSYNGWQYQSFEWVLEHGGIATDDDY 221 1880 1881Query: 215 PYTAETGTQCNFNSANIGAKISNF-TMIPKNETVMAG------YIVSTGPLAIAADAVEW 267 1882 PY A+ G +C N I + T+I 
+E+ + + P++++ DA ++ 1883Sbjct: 222 PYRAKEG-RCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILEQPISVSIDAKDF 280 1884 1885Query: 268 QFYIGGVFDIP--CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 1886 Y GG++D +P ++H +L+VGY + + + YWI KNSWG DWGE GYI++ 1887Sbjct: 281 HLYTGGIYDGENCTSPYGINHFVLLVGYGSAD-----GVDYWIAKNSWGFDWGEDGYIWI 335 1888 1889Query: 326 RRGK----NTCGVSNFVS 339 1890 +R CG++ F S 1891Sbjct: 336 QRNTGNLLGVCGMNYFAS 353 1892 1893 1894>sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 1895 Length = 321 1896 1897 Score = 182 bits (458), Expect = 8e-46 1898 Identities = 98/299 (32%), Positives = 155/299 (51%), Gaps = 28/299 (9%) 1899 1900Query: 52 FKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLD 111 1901 F+ +L + LN + + + +G+N+F+ L +EFK YL +K + F Y 1902Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPR------YSA 97 1903 1904Query: 112 DEFIN----SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSE 167 1905 + ++ S+P FDWR + VT V+NQ CG CW+FS G VE + I L LS 1906Sbjct: 98 EVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSV 157 1907 1908Query: 168 QNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK-NGGIQTESSYPYTAETGTQCNF 226 1909 Q ++DC + + GCNGG NA N++ K + +S YP+ A+ G F 1910Sbjct: 158 QQVIDCSYN----------NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF 207 1911 1912Query: 227 NSANIGAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSL 284 1913 + ++ G I ++ E MA +++ GPL + DAV WQ Y+GG+ C+ 1914Sbjct: 208 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEA 267 1915 1916Query: 285 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 1917 +H +LI G+ + PYWIV+NSWG+ WG GY +++ G N CG+++ VS+ + 1918Sbjct: 268 NHAVLITGFDKTG-----STPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321 1919 1920 1921>sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 1922 Length = 376 1923 1924 Score = 182 bits (457), Expect = 1e-45 1925 Identities = 109/341 (31%), Positives = 170/341 (48%), Gaps = 35/341 (10%) 1926 1927Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 1928 P E 
+ F FQ +FN+ Y S EE+ R +IF NL + + L + +FGV F 1929Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLG---TAEFGVTPF 91 1930 1931Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR-TRGAVTPVKNQGQC 139 1932 +DL+ +EF Y + A + + +E S+P + DWR GA++P+K+Q C 1933Sbjct: 92 SDLTEEEFGQLYGYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNC 150 1934 1935Query: 140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 199 1936 CW+ + GN+E IS V +S L+DC C +GC+GG +A+ 1937Sbjct: 151 NCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCGR----------CGDGCHGGFVWDAF 200 1938 1939Query: 200 NYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 1940 ++ N G+ +E YP+ + +C+ A I +F M+ NE +A Y+ + GP+ 1941Sbjct: 201 ITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260 1942 1943Query: 259 AIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSA--------KNTIFRKNMP-- 305 1944 + + Q Y GV C+P +DH +L+VG+ + T+ ++ P 1945Sbjct: 261 TVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 320 1946 1947Query: 306 -----YWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 1948 YWI+KNSWGA WGE+GY L RG NTCG++ F T+ 1949Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTA 361 1950 1951 1952>sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 1953 Length = 331 1954 1955 Score = 177 bits (444), Expect = 3e-44 1956 Identities = 116/347 (33%), Positives = 176/347 (50%), Gaps = 35/347 (10%) 1957 1958Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYS--HEEYLERFEIFKSNLGKIEEL 62 1959 L+ VL V + V+ P + ++ + K+Y +EE + R I++ NL + 1960Sbjct: 4 LVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRL-IWEKNLKFVMLH 62 1961 1962Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS----- 117 1963 NL G+N D++S+E + T L V 1964Sbjct: 63 NLEHSMGMHSYDLGMNHLGDMTSEEVMS---------LTSSLRVPSQWQRNITYKSNPNR 113 1965 1966Query: 118 -IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 1967 +P + DWR +G VT VK QG CG+CW+FS G +E Q + KLV+LS QNLVDC 1968Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDC--- 170 1969 1970Query: 177 
CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 1971 E+ ++GCNGG A+ YII N GI +++SYPY A +C ++S A S 1972Sbjct: 171 ----STEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKA-MDQKCQYDSKYRAATCS 225 1973 1974Query: 237 NFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGILIVGY 293 1975 +T +P E V+ + + GP+++ DA F++ GV+ P +++HG+L+VGY 1976Sbjct: 226 KYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGY 285 1977 1978Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 1979 N YW+VKNSWG ++GE+GYI + R K N CG+++F S 1980Sbjct: 286 GDLN-----GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPS 327 1981 1982 1983>sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 1984 Length = 345 1985 1986 Score = 176 bits (442), Expect = 6e-44 1987 Identities = 116/315 (36%), Positives = 161/315 (50%), Gaps = 37/315 (11%) 1988 1989Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91 1990 K NK Y + +E + RFEIFK NL I+E N K + + G+N FAD+S+DEFK 1991Sbjct: 54 KHNKIYKNIDEKIYRFEIFKDNLKYIDETN------KKNNSYWLGLNVFADMSNDEFKEK 107 1992 1993Query: 92 YLNNKEAIFTD-DLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 150 1994 Y + +T +L + L+D +N IP DWR +GAVTPVKNQG CGSCW+FS 1995Sbjct: 108 YTGSIAGNYTTTELSYEEVLNDGDVN-IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVT 166 1996 1997Query: 151 VEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT 210 1998 +EG I L SEQ L+DCD GCNGG +A ++ GI 1999Sbjct: 167 IEGIIKIRTGNLNEYSEQELLDCDRR----------SYGCNGGYPWSALQ-LVAQYGIHY 215 2000 2001Query: 211 ESSYPYTAETGTQCNFNSANIGAKISNFTMI-PKNETVMAGYIVSTGPLAIAADAV--EW 267 2002 ++YPY + AK + P NE + Y ++ P+++ +A ++ 2003Sbjct: 216 RNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALL-YSIANQPVSVVLEAAGKDF 274 2004 2005Query: 268 QFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR 327 2006 Q Y GG+F PC N +DH + VGY Y ++KNSWG WGE GYI ++R 2007Sbjct: 275 QLYRGGIFVGPCG-NKVDHAVAAVGYGPN---------YILIKNSWGTGWGENGYIRIKR 324 2008 2009Query: 328 GK-NTCGVSNFVSTS 341 2010 G N+ GV ++S 2011Sbjct: 325 GTGNSYGVCGLYTSS 339 2012 2013 
2014>sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 2015 Length = 176 2016 2017 Score = 175 bits (439), Expect = 1e-43 2018 Identities = 88/179 (49%), Positives = 117/179 (65%), Gaps = 12/179 (6%) 2019 2020Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 178 2021 P + DWR +G VTPVK+QGQCGSCW+FSTTG +EGQHF ++ KLVSLSEQNLVDC 2022Sbjct: 2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP-- 59 2023 2024Query: 179 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNF 238 2025 EG ++GCNGGL A+ Y+ NGGI +E SYPYTA+ C + + A + F 2026Sbjct: 60 --EG----NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGF 113 2027 2028Query: 239 TMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 293 2029 IP+ +E + + S GP+++A DA +QFY G++ P C+ LDHG+L+VGY 2030Sbjct: 114 VDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGY 172 2031 2032 2033>sp|P25326|CATS_BOVIN CATHEPSIN S 2034 Length = 217 2035 2036 Score = 173 bits (434), Expect = 5e-43 2037 Identities = 91/226 (40%), Positives = 131/226 (57%), Gaps = 17/226 (7%) 2038 2039Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 2040 +P + DWR +G VT VK QG CGSCW+FS G +E Q + KLVSLS QNLVDC 2041Sbjct: 1 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC---- 56 2042 2043Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 2044 + ++GCNGG A+ YII N GI +E+SYPY A G +C ++ N A S 2045Sbjct: 57 ---STAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG-KCQYDVKNRAATCSR 112 2046 2047Query: 238 FTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGILIVGYS 294 2048 + +P +E + + + GP+++ DA F++ GV+ P +++HG+L+VGY 2049Sbjct: 113 YIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG 172 2050 2051Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 339 2052 + YW+VKNSWG +G+QGYI + R N CG++N+ S 2053Sbjct: 173 NLD-----GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPS 213 2054 2055 2056>sp|P80884|ANAN_ANACO ANANAIN 2057 Length = 216 2058 2059 Score = 167 bits (419), Expect = 3e-41 2060 Identities = 
93/223 (41%), Positives = 124/223 (54%), Gaps = 22/223 (9%) 2061 2062Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 2063 +P + DWR GAVT VKNQG+CGSCW+F++ VE + I + LVSLSEQ ++DC 2064Sbjct: 1 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC---- 56 2065 2066Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 2067 A GC GG AY++II N G+ + + YPY A GT C N A I+ 2068Sbjct: 57 -------AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGT-CKTNGVPNSAYITR 108 2069 2070Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAK 296 2071 +T + +N Y VS P+A A DA +Q Y GVF PC L+H I+I+GY 2072Sbjct: 109 YTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCG-TRLNHAIVIIGYGQD 167 2073 2074Query: 297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 335 2075 + +WIV+NSWGA WGE GYI L R ++ CG++ 2076Sbjct: 168 SA----GKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGICGIA 206 2077 2078 2079>sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 2080 Length = 330 2081 2082 Score = 167 bits (418), Expect = 4e-41 2083 Identities = 90/228 (39%), Positives = 132/228 (57%), Gaps = 18/228 (7%) 2084 2085Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 2086 ++P + DWR +G VT VK QG CGSCW+FS G +EGQ + KLVSLS QNLVDC E 2087Sbjct: 112 TLPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE 171 2088 2089Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 2090 E+ ++GC GG A+ YII + I +E+SYPY A +C ++ N A S 2091Sbjct: 172 ------EKYGNKGCGGGFMTEAFQYII-DTSIDSEASYPYKA-MDEKCLYDPKNRAATCS 223 2092 2093Query: 237 NFTMIP-KNETVMAGYIVSTGPLAIAADAV---EWQFYIGGVFDIPCNPNSLDHGILIVG 292 2094 + +P +E + + + GP+++ D + Y GV+D P +++HG+L+VG 2095Sbjct: 224 RYIELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVG 283 2096 2097Query: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL-RRGKNTCGVSNFVS 339 2098 Y + YW+VKNSWG +G+QGYI + R KN CG++++ S 2099Sbjct: 284 YGTLD-----GKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCS 326 2100 2101 2102>sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE 
PRECURSOR 2103 Length = 346 2104 2105 Score = 164 bits (410), Expect = 3e-40 2106 Identities = 86/226 (38%), Positives = 127/226 (56%), Gaps = 21/226 (9%) 2107 2108Query: 116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 2109 +S+P + DWR +G + VK+QG CGSCW+FS +E + I L+SLSEQ LVDCD 2110Sbjct: 16 DSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD- 74 2111 2112Query: 176 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI 235 2113 + +EGC+GGL A+ ++IKNGGI TE YPY G + KI 2114Sbjct: 75 --------RSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKI 126 2115 2116Query: 236 SNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGY 293 2117 ++ +P N V+ P++IA +A ++Q Y G+F C ++DHG++I GY 2118Sbjct: 127 DSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCG-TAVDHGVVIAGY 185 2119 2120Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 335 2121 +N M YWIV+NSWGA+ E GY+ ++R ++ CG++ 2122Sbjct: 186 GTEN-----GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLA 226 2123 2124 2125>sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 2126 Length = 308 2127 2128 Score = 162 bits (407), Expect = 7e-40 2129 Identities = 105/312 (33%), Positives = 150/312 (47%), Gaps = 40/312 (12%) 2130 2131Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 2132 F ++ NK +++ EYL RF +F N +E A+ +N FAD++ +E 2133Sbjct: 18 FKQWAATHNKVFANRAEYLYRFAVFLDNKKFVE----------ANANTELNVFADMTHEE 67 2134 2135Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 2136 F +L T ++P + + P + DWR+ + P K+QGQCGSCW+F T 2137Sbjct: 68 FIQTHLG-----MTYEVPETTSNVKAAVKAAPESVDWRS--IMNPAKDQGQCGSCWTFCT 120 2138 2139Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 2140 T +EG+ KL S SEQ LVDCD A D GC GG N+ +I +N G 2141Sbjct: 121 TAVLEGRVNKDLGKLYSFSEQQLVDCD----------ASDNGCEGGHPSNSLKFIQENNG 170 2142 2143Query: 208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--V 265 2144 + ES YPY A GT C N+ + + +ET + I GP+A+ DA 2145Sbjct: 171 
LGLESDYPYKAVAGT-CK-KVKNVATVTGSRRVTDGSETGLQTIIAENGPVAVGMDASRP 228 2146 2147Query: 266 EWQFYIGGVF--DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 2148 +Q Y G D C ++H + VGY + + N YWI++NSWG WG+ GY 2149Sbjct: 229 SFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNS-----NGKYWIIRNSWGTSWGDAGYF 283 2150 2151Query: 324 YLRR-GKNTCGV 334 2152 L R N CG+ 2153Sbjct: 284 LLARDSNNMCGI 295 2154 2155 2156>sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 2157 Length = 395 2158 2159 Score = 159 bits (398), Expect = 8e-39 2160 Identities = 101/323 (31%), Positives = 155/323 (47%), Gaps = 21/323 (6%) 2161 2162Query: 26 QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2163 ++++ ++ K Y +E R IF+SN E +N +N ADL+ 2164Sbjct: 88 ETEWKDYVTALGKHYDQKENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTD 147 2165 2166Query: 86 DEF--KNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143 2167 +EF +N + +++ + +P DWRT+GAVTPV+NQG+CGSC+ 2168Sbjct: 148 EEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQVDWRTKGAVTPVRNQGECGSCY 207 2169 2170Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 2171 +F+T +E H +L+ LS QN+VDC + GC+GG P A+ Y 2172Sbjct: 208 AFATAAALEAYHKQMTGRLLDLSPQNIVDCT--------RNLGNNGCSGGYMPTAFQYAS 259 2173 2174Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMI-PKNETVMAGYIVSTGP--LAI 260 2175 + GI ES YPY T +C + + + F I P +E + + GP + I 2176Sbjct: 260 RY-GIAMESRYPYVG-TEQRCRWQQSIAVVTDNGFNEIQPGDELALKHAVAKRGPVVVGI 317 2177 2178Query: 261 AADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 320 2179 + ++FY GV+ N DH +L VGY + YWIVKNSWG DWG+ 2180Sbjct: 318 SGSKRSFRFYKDGVYS-EGNCGRPDHAVLAVGYGTHPSY----GDYWIVKNSWGTDWGKD 372 2181 2182Query: 321 GYIYLRRGK-NTCGVSNFVSTSI 342 2183 GY+Y+ R + N C +++ S I 2184Sbjct: 373 GYVYMARNRGNMCHIASAASFPI 395 2185 2186 2187>sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 2188 Length = 315 2189 2190 Score = 156 bits (391), Expect = 5e-38 2191 Identities = 108/309 (34%), Positives = 166/309 (52%), Gaps = 39/309 (12%) 2192 2193Query: 37 
NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADT-KFGVN-KFADLSSDEFKNYYLN 94 2194 NK ++ E L R IF N ++A N++ +T K V+ FA ++++E+ N L 2195Sbjct: 24 NKHFTAVESLRRRAIFNMNA------RIVAENNRKETFKLSVDGPFAAMTNEEY-NSLLK 76 2196 2197Query: 95 NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ 154 2198 K + ++ YL+ + P A DWR +G VTP+++QG CGSC++F + +EG+ 2199Sbjct: 77 LKRS--GEEKGEVRYLNIQ----APKAVDWRKKGKVTPIRDQGNCGSCYTFGSIAALEGR 130 2200 2201Query: 155 HFISQ---NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTE 211 2202 I + ++ + LSE+++V C E +G + GCNGGL N YNYI++N GI E 2203Sbjct: 131 LLIEKGGDSETLDLSEEHMVQCTRE----DG----NNGCNGGLGSNVYNYIMEN-GIAKE 181 2204 2205Query: 212 SSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQF 269 2206 S YPYT T C + AKI ++ + +N V +S G + ++ DA V++Q 2207Sbjct: 182 SDYPYTGSDST-CR-SDVKAFAKIKSYNRVARNNEVELKAAISQGLVDVSIDASSVQFQL 239 2208 2209Query: 270 YIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLR 326 2210 Y G + D C N +L+H + VGY + WIV+NSWG WGE+GYI + 2211Sbjct: 240 YKSGAYTDTQCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGEKGYINMV 294 2212 2213Query: 327 RGKNTCGVS 335 2214 NTCGV+ 2215Sbjct: 295 IEGNTCGVA 303 2216 2217 2218>sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE PROTEINASE ACP3) 2219 Length = 308 2220 2221 Score = 153 bits (384), Expect = 4e-37 2222 Identities = 103/308 (33%), Positives = 159/308 (51%), Gaps = 37/308 (12%) 2223 2224Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDEFKNYYLNN 95 2225 NK ++ E L R IF N + E N K K V+ FA ++++E++ L + 2226Sbjct: 17 NKHFTAVEALRRRAIFNMNARFVAEFN-----KKGSFKLSVDGPFAAMTNEEYRTL-LKS 70 2227 2228Query: 96 KEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQH 155 2229 K + ++ YL+ + P + DWR +G VTP+++Q QCGSC++F + +EG+ 2230Sbjct: 71 KRTV--EENGKVTYLNIQ----APESVDWRAQGKVTPIRDQAQCGSCYTFGSLAALEGRL 124 2231 2232Query: 156 FISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTES 212 2233 I + + LSE++LV C + + GCNGGL N Y+YII+N G+ ES 2234Sbjct: 125 
LIEKGGNANTLDLSEEHLVQCT--------RDNGNNGCNGGLGSNVYDYIIQN-GVAKES 175 2235 2236Query: 213 SYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFY 270 2237 YPYT T + C N AKI+ + +P+N +S G + ++ DA ++Q Y 2238Sbjct: 176 DYPYTG-TDSTCKTN-VKAFAKITGYNKVPRNNEAELKAALSQGLVDVSIDASSAKFQLY 233 2239 2240Query: 271 IGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR 327 2241 G + D C N +L+H + VGY + WIV+NSWG WG++GYI + 2242Sbjct: 234 KSGAYSDTKCKNNFFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGDKGYINMVI 288 2243 2244Query: 328 GKNTCGVS 335 2245 NTCGV+ 2246Sbjct: 289 EGNTCGVA 296 2247 2248 2249>sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 2250 Length = 315 2251 2252 Score = 153 bits (383), Expect = 5e-37 2253 Identities = 102/316 (32%), Positives = 161/316 (50%), Gaps = 37/316 (11%) 2254 2255Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87 2256 F + K NK ++ E L R IF N ++ N I K V+ FA ++++E 2257Sbjct: 16 FNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDGPFAAMTNEE 70 2258 2259Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 2260 ++ + + T++ YL+ + P + DWR G VTP+++Q QCGSC++F + 2261Sbjct: 71 YRTLLKSKRT---TEENGQVKYLNIQ----APESVDWRKEGKVTPIRDQAQCGSCYTFGS 123 2262 2263Query: 148 TGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 204 2264 +EG+ I + + LSE+++V C + + GCNGGL N Y+YII+ 2265Sbjct: 124 LAALEGRLLIEKGGDANTLDLSEEHMVQCT--------RDNGNNGCNGGLGSNVYDYIIE 175 2266 2267Query: 205 NGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264 2268 + G+ ES YPYT T C N + AKI+ +T +P+N +S G + ++ DA 2269Sbjct: 176 H-GVAKESDYPYTGSDST-CKTNVKSF-AKITGYTKVPRNNEAELKAALSQGLVDVSIDA 232 2270 2271Query: 265 --VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 2272 ++Q Y G + D C N +L+H + VGY + WIV+NSWG WG+ 2273Sbjct: 233 SSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGD 287 2274 2275Query: 320 QGYIYLRRGKNTCGVS 335 2276 +GYI + NTCGV+ 2277Sbjct: 288 KGYINMVIEGNTCGVA 303 2278 2279 2280>sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE 
PRECURSOR 2281 Length = 506 2282 2283 Score = 153 bits (382), Expect = 6e-37 2284 Identities = 115/358 (32%), Positives = 176/358 (49%), Gaps = 62/358 (17%) 2285 2286Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2287 S+F ++ + NKKY + +E L+RFE FK K ++ N + + VN+++D S 2288Sbjct: 160 SKFFKYMKENNKKYENMDEQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSK 219 2289 2290Query: 86 DEFKNYYLNNKEAIFTDDL------PVADYLDDEFINSI-------PTAFDWRTRGAVTP 132 2291 +EF NY+ K DL P+ +L + + S+ P + D+R++ P 2292Sbjct: 220 EEFDNYF--KKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLP 277 2293 2294Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSEQNLVDCDHECMEYEGEEACDEGCN 191 2295 K+QG CGSCW+F+ GN E + +++++ +S SEQ +VDC E + GC+ 2296Sbjct: 278 PKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE----------NYGCD 327 2297 2298Query: 192 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGY 251 2299 GG A+ Y+I NG + YPY C ++ ++ + NE +MA 2300Sbjct: 328 GGNPFYAFLYMINNG-VCLGDEYPYKGHEDFFCLNYRCSLLGRVHFIGDVKPNELIMALN 386 2301 2302Query: 252 IVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSA--------------- 295 2303 V GP+ IA A E + Y GGVFD CNP L+H +L+VGY 2304Sbjct: 387 YV--GPVTIAVGASEDFVLYSGGVFDGECNPE-LNHSVLLVGYGQVKKSLAFEDSHSNVD 443 2305 2306Query: 296 KNTI--FRKNMP---------YWIVKNSWGADWGEQGYIYLRRGK----NTCGVSNFV 338 2307 N I +++N+ YWIV+NSWG +WGE GYI ++R K CGV + V 2308Sbjct: 444 SNLIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVGSDV 501 2309 2310 2311>sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 2312 Length = 310 2313 2314 Score = 150 bits (374), Expect = 5e-36 2315 Identities = 102/322 (31%), Positives = 160/322 (49%), Gaps = 32/322 (9%) 2316 2317Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN- 78 2318 GI F + K NK ++ E L R IF N ++ N I K V+ 2319Sbjct: 3 GIRIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDG 57 2320 2321Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQ 138 2322 FA ++++E++ + + T++ YL+ + P + DWR G VTP+++Q Q 2323Sbjct: 58 
PFAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQ----APESVDWRKEGKVTPLRDQAQ 110 2324 2325Query: 139 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 198 2326 CGSC++F + +EG+ I + + N +D E M+ + + GCNGGL N 2327Sbjct: 111 CGSCYTFGSLAALEGRLLIEKG-----GDANTLDLSEEHMQCTRDNG-NNGCNGGLGSNV 164 2328 2329Query: 199 YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 2330 Y+YII++ G+ ES YPYT T C N + KI+ +T +P+N +S G L 2331Sbjct: 165 YDYIIEH-GVAKESDYPYTGSDST-CKTNVKSF-RKITGYTKVPRNNEAELKAALSQGLL 221 2332 2333Query: 259 AIAAD--AVEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 313 2334 ++ D + ++Q Y G + D C N +L+H + VGY + WIV+NSW 2335Sbjct: 222 DVSIDVSSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSW 276 2336 2337Query: 314 GADWGEQGYIYLRRGKNTCGVS 335 2338 G WG++GYI + NTCGV+ 2339Sbjct: 277 GTSWGDKGYINMVIEGNTCGVA 298 2340 2341 2342>sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 2343 Length = 441 2344 2345 Score = 146 bits (366), Expect = 5e-35 2346 Identities = 107/339 (31%), Positives = 164/339 (47%), Gaps = 54/339 (15%) 2347 2348Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--NKFADLS 84 2349 +F F +K+ K + S ++ ++RF F+ N ++ HK + + NKF+DLS 2350Sbjct: 119 EFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVK-------THKPTEPYSLDLNKFSDLS 171 2351 2352Query: 85 SDEFKNYY--------------------LNNKEAIFTDDLPVADYLDDEFINSIPTA--F 122 2353 +EFK Y +++K I+ L A +++ S+ T 2354Sbjct: 172 DEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSLITGENL 231 2355 2356Query: 123 DWRTRGAVTPVKNQGQ-CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYE 181 2357 +W AV+P K+QG CGSCW+FS+ +VE + + +NK LSEQ LV+CD M 2358Sbjct: 232 NWARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCDKSSM--- 288 2359 2360Query: 182 GEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMI 241 2361 GC GGL A Y I + G+ ES PYT + C + N I + +++ 2362Sbjct: 289 -------GCAGGLPITALEY-IHSKGVSFESEVPYTGIV-SPCKPSIKN-KVFIDSISIL 338 2363 2364Query: 242 PKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFR 301 2365 N+ V ++S + IA E + Y GG+F C L+H 
+L+VG + 2366Sbjct: 339 KGNDVVNKSLVISPTVVGIAV-TKELKLYSGGIFTGKCG-GELNHAVLLVGEGVDH---E 393 2367 2368Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNF 337 2369 M YWI+KNSWG DWGE G++ L+R G + CG+ F 2370Sbjct: 394 TGMRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTF 432 2371 2372 2373>sp|P14518|BROM_ANACO BROMELAIN, STEM 2374 Length = 212 2375 2376 Score = 146 bits (365), Expect = 6e-35 2377 Identities = 81/224 (36%), Positives = 115/224 (51%), Gaps = 27/224 (12%) 2378 2379Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 2380 ++P + DWR GAVT VKNQ CG+CW+F+ VE + I + L LSEQ ++DC 2381Sbjct: 1 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC--- 57 2382 2383Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 2384 A GC GG + A+ +II N G+ + + YPY A GT C + A I+ 2385Sbjct: 58 --------AKGYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTDGVPNSAYIT 108 2386 2387Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 295 2388 + +P+N Y VS P+ +A DA +Q+Y GVF+ PC SL+H + +GY 2389Sbjct: 109 GYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCG-TSLNHAVTAIGYGQ 167 2390 2391Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 335 2392 + I+ K WGA WGE GYI + R CG++ 2393Sbjct: 168 DSIIYPK---------KWGAKWGEAGYIRMARDVSSSSGICGIA 202 2394 2395 2396>sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 2397 Length = 439 2398 2399 Score = 145 bits (363), Expect = 1e-34 2400 Identities = 104/343 (30%), Positives = 159/343 (46%), Gaps = 64/343 (18%) 2401 2402Query: 24 EEQSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKF 80 2403 E +F EF K+N++++ +E L R F+SN +++E K D + G+N+F 2404Sbjct: 119 EVYREFEEFNSKYNRRHATQQERLNRLVTFRSNYLEVKE-------QKGDEPYVKGINRF 171 2405 2406Query: 81 ADLSSDEF--------------------------KNYYLNNKEAIFTDDLPVADYLDDEF 114 2407 +DL+ EF K Y N K+A+ TD+ D + 2408Sbjct: 172 SDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDE-------DVDL 224 2409 2410Query: 115 INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174 2411 DWR +VT VK+Q CG CW+FST 
G+VEG + +K LS Q L+DCD 2412Sbjct: 225 AKLTGENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCD 284 2413 2414Query: 175 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAK 234 2415 + GC GGL +AY Y+ K G+ + P+ + +C+ A 2416Sbjct: 285 ----------SFSNGCQGGLLESAYEYVRKY-GLVSAKDLPF-VDKARRCSVPKAK-KVS 331 2417 2418Query: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYS 294 2419 + ++ + K + VM + S+ + + E Y GVF C SL+H +++VG 2420Sbjct: 332 VPSYHVF-KGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECG-KSLNHAVVLVGEG 389 2421 2422Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGV 334 2423 ++ YW+V+NSWG DWGE GY+ L R G + CGV 2424Sbjct: 390 YDEVTKKR---YWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 429 2425 2426 2427>sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR (DER F I) 2428 Length = 321 2429 2430 Score = 144 bits (359), Expect = 3e-34 2431 Identities = 116/346 (33%), Positives = 158/346 (45%), Gaps = 48/346 (13%) 2432 2433Query: 7 FVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 2434 FVLA+ ++ V S P F EF+ FNK Y+ +E E+ + N +E L + 2435Sbjct: 3 FVLAIASLLVLSTVYARPASIKTFEEFKKAFNKNYAT---VEEEEVARKNF--LESLKYV 57 2436 2437Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----INSI--P 119 2438 N K +N +DLS DEFKN YL + EA + L L+ E INS+ P 2439Sbjct: 58 EAN-----KGAINHLSDLSLDEFKNRYLMSAEAF--EQLKTQFDLNAETSACRINSVNVP 110 2440 2441Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 2442 + D R+ VTP++ QG CGSCW+FS E + +N + LSEQ LVDC 2443Sbjct: 111 SELDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDC------ 164 2444 2445Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 2446 A GC+G P YI +NG ++ E SYPY A NS + G ISN+ 2447Sbjct: 165 -----ASQHGCHGDTIPRGIEYIQQNGVVE-ERSYPYVAREQRCRRPNSQHYG--ISNYC 216 2448 2449Query: 240 MIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLDHGILIV 291 2450 I + ++ AIA D +Q Y G D PN H + IV 2451Sbjct: 217 QIYPPDVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNY--HAVNIV 274 2452 2453Query: 292 
GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 337 2454 GY + + YWIV+NSW WG+ GY Y + G N + + 2455Sbjct: 275 GYGS-----TQGDDYWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQY 315 2456 2457 2458>sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR (TCP) 2459 Length = 569 2460 2461 Score = 144 bits (359), Expect = 3e-34 2462 Identities = 102/363 (28%), Positives = 167/363 (45%), Gaps = 62/363 (17%) 2463 2464Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 2465 S+F +F + NK Y + +E + +FEIFK N I+ N +N A K VN+F+D S 2466Sbjct: 223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN--KLNKNAMYKKKVNQFSDYSE 280 2467 2468Query: 86 DEFKNY--------------YLNNKEAIFTDDLPVADYL------DDEFINSIPTAFDWR 125 2469 +E K Y Y E D++ ++++ + + + +P D+R 2470Sbjct: 281 EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340 2471 2472Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 2473 +G V K+QG CGSCW+F++ GN+E ++S SEQ +VDC + 2474Sbjct: 341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--------- 391 2475 2476Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNE 245 2477 + GC+GG ++ Y+++N + Y Y A+ C +S+ + +N+ 2478Sbjct: 392 -NFGCDGGHPFYSFLYVLQN-ELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQ 449 2479 2480Query: 246 TVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY----------- 293 2481 ++A + GPL++ ++ Y GV++ C+ L+H +L+VGY 2482Sbjct: 450 LILA--LNEVGPLSVNVGVNNDFVAYSEGVYNGTCS-EELNHSVLLVGYGQVEKTKLNYN 506 2483 2484Query: 294 ---SAKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKN----TCGVSNFVST 340 2485 NT N P YWI+KNSW WGE G++ L R KN CG+ V 2486Sbjct: 507 NKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY 566 2487 2488Query: 341 SII 343 2489 I+ 2490Sbjct: 567 PIL 569 2491 2492 2493>sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 2494 Length = 583 2495 2496 Score = 132 bits (329), Expect = 1e-30 2497 Identities = 102/367 (27%), Positives = 165/367 (44%), Gaps = 86/367 (23%) 2498 2499Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 
2500 S+F F +K+ + Y E +E+++ FK N KI++ N K VN+F+D S 2501Sbjct: 235 SKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHN----ETNQMYKMKVNQFSDYSK 290 2502 2503Query: 86 DEFKNYYLNNKEAIFTDDLPVADYLDDEFI------------------------NSIPTA 121 2504 +F++Y F +P+ D+L +++ +P 2505Sbjct: 291 KDFESY--------FRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEI 342 2506 2507Query: 122 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK-LVSLSEQNLVDCDHECMEY 180 2508 D+R +G V K+QG CGSCW+F++ GNVE + NK +++LSEQ +VDC 2509Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDC------- 395 2510 2511Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQC-NFNSANIGAKISNFT 239 2512 + GC+GG ++ Y I+N GI Y Y A C N+ N +S+ 2513Sbjct: 396 ---SKLNFGCDGGHPFYSFIYAIEN-GICMGDDYKYKAMDNLFCLNYRCKN-KVTLSSVG 450 2514 2515Query: 240 MIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYS--AK 296 2516 + +NE + A + GP+++ ++ FY GG+F+ C L+H +L+VGY 2517Sbjct: 451 GVKENELIRA--LNEVGPVSVNVGVTDDFSFYGGGIFNGTCT-EELNHSVLLVGYGQVQS 507 2518 2519Query: 297 NTIFRKN-------------------------MPYWIVKNSWGADWGEQGYIYLRRGKN- 330 2520 + IF++ YWI+KNSW WGE G++ + R K 2521Sbjct: 508 SKIFQEKNAYDDASGVTKKGALSYPSKADDGIQYYWIIKNSWSKFWGENGFMRISRNKEG 567 2522 2523Query: 331 ---TCGV 334 2524 CG+ 2525Sbjct: 568 DNVFCGI 574 2526 2527 2528>sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR (DER P I) 2529 Length = 320 2530 2531 Score = 123 bits (305), Expect = 7e-28 2532 Identities = 110/338 (32%), Positives = 151/338 (44%), Gaps = 51/338 (15%) 2533 2534Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 2535 MK++L + V +R P F E++ FNK Y+ E E + N +E 2536Sbjct: 1 MKIVLAIASLLALSAVYAR---PSSIKTFEEYKKAFNKSYAT---FEDEEAARKNF--LE 52 2537 2538Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----IN 116 2539 + + N A +N +DLS DEFKN +L + EA + L L+ E IN 2540Sbjct: 53 SVKYVQSNGGA-----INHLSDLSLDEFKNRFLMSAEAF--EHLKTQFDLNAETNACSIN 105 2541 2542Query: 117 -SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 2543 + P D R VTP++ QG CGSCW+FS E + 
+N+ + L+EQ LVDC 2544Sbjct: 106 GNAPAEIDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNQSLDLAEQELVDC-- 163 2545 2546Query: 176 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI 235 2547 A GC+G P YI NG +Q ES Y Y A + N+ G I 2548Sbjct: 164 ---------ASQHGCHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFG--I 211 2549 2550Query: 236 SNFTMI-PKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLDH 286 2551 SN+ I P N + + T AIA D ++ Y G D PN H 2552Sbjct: 212 SNYCQIYPPNVNKIREALAQTHS-AIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNY--H 268 2553 2554Query: 287 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 324 2555 + IVGYS + + YWIV+NSW +WG+ GY Y 2556Sbjct: 269 AVNIVGYSN-----AQGVDYWIVRNSWDTNWGDNGYGY 301 2557 2558 2559>sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 2560 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 2561 Length = 462 2562 2563 Score = 117 bits (291), Expect = 3e-26 2564 Identities = 81/255 (31%), Positives = 128/255 (49%), Gaps = 32/255 (12%) 2565 2566Query: 105 PVADYLDDEFINSIPTAFDWRT-RGA--VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK 161 2567 P+ D + + + S+P ++DWR RG V+PV+NQ CGSC+SF++ G +E + I N 2568Sbjct: 218 PITDEIQQQIL-SLPESWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRILTNN 276 2569 2570Query: 162 LVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 219 2571 + LS Q +V C +GC+GG ++ G+ E+ +PYTA 2572Sbjct: 277 SQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEENCFPYTA- 325 2573 2574Query: 220 TGTQCNFNSANIGAKISNFTMIPK-----NETVMAGYIVSTGPLAIAADAVE-WQFYIGG 273 2575 T C + S + + NE +M +V GP+A+A + + + Y G 2576Sbjct: 326 TDAPCKPKENCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSG 385 2577 2578Query: 274 VF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR 327 2579 ++ P NP L +H +L+VGY K+ + + YWIVKNSWG+ WGE GY +RR 2580Sbjct: 386 IYHHTGLSDPFNPFELTNHAVLLVGYG-KDPV--TGLDYWIVKNSWGSQWGESGYFRIRR 442 2581 2582Query: 328 GKNTCGVSNFVSTSI 342 2583 G + C + + +I 2584Sbjct: 443 GTDECAIESIAMAAI 457 2585 2586 2587>sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 
2588 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 2589 Length = 462 2590 2591 Score = 116 bits (287), Expect = 8e-26 2592 Identities = 90/331 (27%), Positives = 156/331 (46%), Gaps = 42/331 (12%) 2593 2594Query: 34 DKFNKKYSH-----EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF 88 2595 +K N +H E Y ER ++ N ++ +N + K+ T ++ +S + 2596Sbjct: 147 EKVNMNAAHLGGLQERYSER--LYTHNHNFVKAINTV---QKSWTATAYKEYEKMSLRDL 201 2597 2598Query: 89 KNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRT-RGA--VTPVKNQGQCGSCWSF 145 2599 +++ P+ D + + +N +P ++DWR +G V+PV+NQ CGSC+SF 2600Sbjct: 202 IRRSGHSQRIPRPKPAPMTDEIQQQILN-LPESWDWRNVQGVNYVSPVRNQESCGSCYSF 260 2601 2602Query: 146 STTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 2603 ++ G +E + I N + LS Q +V C +GC+GG 2604Sbjct: 261 ASMGMLEARIRILTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYA 310 2605 2606Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK-----NETVMAGYIVSTGPL 258 2607 ++ G+ ES +PYTA+ + C + S++ + NE +M +V GP+ 2608Sbjct: 311 QDFGVVEESCFPYTAKD-SPCKPRENCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPM 369 2609 2610Query: 259 AIAADAVE-WQFYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKN 311 2611 A+A + + + Y G++ P NP L +H +L+VGY + YWI+KN 2612Sbjct: 370 AVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVT---GIEYWIIKN 426 2613 2614Query: 312 SWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 2615 SWG++WGE GY +RRG + C + + +I 2616Sbjct: 427 SWGSNWGESGYFRIRRGTDECAIESIAVAAI 457 2617 2618 2619>sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 2620 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 2621 Length = 463 2622 2623 Score = 113 bits (281), Expect = 4e-25 2624 Identities = 75/236 (31%), Positives = 113/236 (47%), Gaps = 31/236 (13%) 2625 2626Query: 118 IPTAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVD 172 2627 +PT++DWR V+PV+NQ CGSC+SF++ G +E + I N + LS Q +V 2628Sbjct: 231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290 2629 2630Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 232 2631 C +GC GG ++ G+ E+ +PYT T 
+ C 2632Sbjct: 291 CSQYA----------QGCEGGFPYLIAGKYAQDFGLVEEACFPYTG-TDSPCKMKEDCFR 339 2633 2634Query: 233 AKISNFTMIPK-----NETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDI-----PCNP 281 2635 S + + NE +M +V GP+A+A + + + Y G++ P NP 2636Sbjct: 340 YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNP 399 2637 2638Query: 282 NSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 2639 L +H +L+VGY + M YWIVKNSWG WGE GY +RRG + C + + 2640Sbjct: 400 FELTNHAVLLVGYGTDSA---SGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452 2641 2642 2643>sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 2644 Length = 454 2645 2646 Score = 113 bits (279), Expect = 7e-25 2647 Identities = 75/242 (30%), Positives = 113/242 (45%), Gaps = 35/242 (14%) 2648 2649Query: 117 SIPTAFDWRT-----RGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQN 169 2650 ++P FDW + R VTP++NQG CGSC++ + +E + + N + LS Q 2651Sbjct: 217 NLPLEFDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQT 276 2652 2653Query: 170 LVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSA 229 2654 +VDC EGCNGG ++ G+ + PYT E +C + 2655Sbjct: 277 VVDCS----------PYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKN 326 2656 2657Query: 230 NIGAKISNFTMI-----PKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC---- 279 2658 ++++ I NE +M ++S GP + + E +QFY G++ 2659Sbjct: 327 CTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTD 386 2660 2661Query: 280 ----NPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGV 334 2662 NP L +H +L+VGY PYW VKNSWG +WGEQGY + RG + CGV 2663Sbjct: 387 HYNFNPFELTNHAVLLVGYGVDKL---SGEPYWKVKNSWGVEWGEQGYFRILRGTDECGV 443 2664 2665Query: 335 SN 336 2666 + 2667Sbjct: 444 ES 445 2668 2669 2670>sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN) (PDP) 2671 Length = 139 2672 2673 Score = 113 bits (279), Expect = 7e-25 2674 Identities = 55/141 (39%), Positives = 84/141 (59%), Gaps = 5/141 (3%) 2675 2676Query: 192 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGY 251 2677 GGL +A+ Y+ NGG+ +E SYPY A+ G C + N A ++++ IP E + 2678Sbjct: 1 
GGLIDDAFQYVKDNGGLDSEESYPYHAQ-GDSCKYRPENSVANVTDYWDIPSKENELMIT 59 2679 2680Query: 252 IVSTGPLAIAADAV--EWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWI 308 2681 + + GP++ A DA ++FY G++ D C+ +DHG+L+VGY A T +N YWI 2682Sbjct: 60 LAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTE-TENKKYWI 118 2683 2684Query: 309 VKNSWGADWGEQGYIYLRRGK 329 2685 +KNSWG DWG GYI + + + 2686Sbjct: 119 IKNSWGTDWGMDGYIKMAKDR 139 2687 2688 2689>sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 2690 Length = 211 2691 2692 Score = 105 bits (260), Expect = 1e-22 2693 Identities = 73/222 (32%), Positives = 102/222 (45%), Gaps = 29/222 (13%) 2694 2695Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 2696 S+P+ D R+ VTP++ QG CGSCW+FS + E + +N + L+EQ LVDC 2697Sbjct: 10 SLPSELDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAYRNMSLDLAEQELVDC--- 66 2698 2699Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 2700 A GC+G P YI +NG +Q E YPY A + N+ G K 2701Sbjct: 67 --------ASQNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSCHRPNAQRYGLK-- 115 2702 2703Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLDHGI 288 2704 N+ I ++ ++ A+A D ++ Y G D PN H + 2705Sbjct: 116 NYCQISPPDSNKIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPNY--HAV 173 2706 2707Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN 330 2708 IVGY NT + + YWIV+NSW WG+ GY Y N 2709Sbjct: 174 NIVGYG--NT---QGVDYWIVRNSWDTTWGDNGYGYFAANIN 210 2710 2711 2712>sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 2713 Length = 151 2714 2715 Score = 95.6 bits (234), Expect = 1e-19 2716 Identities = 61/157 (38%), Positives = 86/157 (53%), Gaps = 17/157 (10%) 2717 2718Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 2719 +H+E++ R+E FK N+ + N + + T G+N+ ADLS++E++ YL + I 2720Sbjct: 1 THKEFMPRYEEFKKNMDYVHNWN----SKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIK 56 2721 2722Query: 101 TDDLPVADY---LDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 157 2723 + + L+ P DWR + AVTPVK+QGQCGSC STTG+VEG I 2724Sbjct: 57 
LNGYHKRNLGLRLNRPHFKQ-PLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTAI 114 2725 2726Query: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 2727 KLVSLSEQN++ +EGCNGGL 2728Sbjct: 115 KTGKLVSLSEQNILRL--------SSSFGNEGCNGGL 143 2729 2730 2731>sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PRECURSOR 2732 Length = 335 2733 2734 Score = 94.4 bits (231), Expect = 3e-19 2735 Identities = 82/300 (27%), Positives = 129/300 (42%), Gaps = 60/300 (20%) 2736 2737Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPTAFDWRTRG----AVTPVKNQ 136 2738 D++ ++ K + + A T D+ V + +E ++IP FD RT+ ++ +++Q 2739Sbjct: 46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE--DTIPATFDARTQWPNCMSINNIRDQ 103 2740 2741Query: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 2742 CGSCW+F+ + I+ N V+ LS ++++ C C C GC GG 2743Sbjct: 104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSN------CGYGCEGGY 154 2744 2745Query: 195 QPNAYNYIIKNGGIQTESSY-------PYT-AETG------------------------- 221 2746 NA+ Y++K+G T SY PY+ A G 2747Sbjct: 155 PINAWKYLVKSG-FCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKC 213 2748 2749Query: 222 TQCNFNSANIGAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIP 278 2750 T N+N A K T + K + + I++ GP+ A E + Y GV+ 2751Sbjct: 214 TNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHT 273 2752 2753Query: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 338 2754 H I I+G+ N PYW+V NSW +WGE GY + RG N CG+ + V 2755Sbjct: 274 TGQELGGHAIRILGWGTDN-----GTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328 2756 2757 2758>sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 2759 Length = 96 2760 2761 Score = 90.5 bits (221), Expect = 5e-18 2762 Identities = 43/87 (49%), Positives = 55/87 (62%), Gaps = 2/87 (2%) 2763 2764Query: 256 GPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSW 313 2765 GPLA+A +A Q YIGGV L+HG+L+VGY + I K PYW++KNSW 2766Sbjct: 1 GPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGYAPIRLKEKPYWVIKNSW 60 2767 2768Query: 314 GADWGEQGYIYLRRGKNTCGVSNFVST 340 2769 G +WGE GY + RG+N CGV + VST 
2770Sbjct: 61 GENWGENGYYKICRGRNICGVDSMVST 87 2771 2772 2773>sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 2774 Length = 329 2775 2776 Score = 90.5 bits (221), Expect = 5e-18 2777 Identities = 69/288 (23%), Positives = 118/288 (40%), Gaps = 46/288 (15%) 2778 2779Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA----VTPVKNQ 136 2780 +++ +E K ++ K A +D++ + + + S+P FD RT+ + + +++Q 2781Sbjct: 50 EITEEEMKFKLMDGKYAAAHSDEIRATE--QEVVLASVPATFDSRTQWSECKSIKLIRDQ 107 2782 2783Query: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 2784 CGSCW+F + + I +S +L+ C C +C GC GG 2785Sbjct: 108 ATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSC---C-----GSSCGNGCEGGY 159 2786 2787Query: 195 QPNAYNY-----IIKNGGIQTESSYPY--------------TAETGTQCNFNSANIGAKI 235 2788 A + ++ G PY T C + AK 2789Sbjct: 160 PIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKD 219 2790 2791Query: 236 SNFTM----IPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILI 290 2792 +F + +PKN + I + GP+ A E + Y GV+ H I I 2793Sbjct: 220 KHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKI 279 2794 2795Query: 291 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 338 2796 +G+ ++ PYW+V NSWG +WGE G+ + RG + CG+ + V 2797Sbjct: 280 IGWGTES-----GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAV 322 2798 2799 2800>sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PRECURSOR 2801 Length = 344 2802 2803 Score = 89.3 bits (218), Expect = 1e-17 2804 Identities = 69/272 (25%), Positives = 113/272 (41%), Gaps = 55/272 (20%) 2805 2806Query: 108 DYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 2807 D + E ++IP FD W ++ +++Q CGSCW+F+ + + I+ N V 2808Sbjct: 72 DIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAV 131 2809 2810Query: 164 S--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY------- 214 2811 + LS ++L+ C G +C GC GG A+ + +K+G + T SY 2812Sbjct: 132 NTLLSSEDLLSC------CTGMFSCGNGCEGGYPIQAWKWWVKHG-LVTGGSYETQFGCK 184 2813 2814Query: 215 
-------------------PYTAETGTQC--------NFNSANIGAKISNFTM--IPKNE 245 2815 P E +C N+ + + K T + K 2816Sbjct: 185 PYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKV 244 2817 2818Query: 246 TVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNM 304 2819 + I++ GP+ +A E + Y GV+ + H + I+G+ N 2820Sbjct: 245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDN-----GT 299 2821 2822Query: 305 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 2823 PYW+V NSW WGE+GY + RG N CG+ + 2824Sbjct: 300 PYWLVANSWNVAWGEKGYFRIIRGLNECGIEH 331 2825 2826 2827>sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 2828 Length = 339 2829 2830 Score = 87.8 bits (214), Expect = 3e-17 2831 Identities = 68/264 (25%), Positives = 114/264 (42%), Gaps = 51/264 (19%) 2832 2833Query: 117 SIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNL 170 2834 ++P +FD W + +++QG CGSCW+F + + I N V++ S ++L 2835Sbjct: 79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138 2836 2837Query: 171 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK----NGGIQTE--------------- 211 2838 + C C C +GCNGG A+N+ + +GG+ 2839Sbjct: 139 LTC---C-----GIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHH 190 2840 2841Query: 212 ---SSYPYTAETGT-QCN------FNSANIGAKISNFTM--IPKNETVMAGYIVSTGPLA 259 2842 S P T E T +CN ++++ K +T + +E + I GP+ 2843Sbjct: 191 VNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVE 250 2844 2845Query: 260 IAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 2846 A ++ Y GV+ H I I+G+ +N + PYW+V NSW DWG 2847Sbjct: 251 GAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGV-----PYWLVANSWNVDWG 305 2848 2849Query: 319 EQGYIYLRRGKNTCGVSNFVSTSI 342 2850 + G+ + RG+N CG+ + + I 2851Sbjct: 306 DNGFFKILRGENHCGIESEIVAGI 329 2852 2853 2854>sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 2855 Length = 335 2856 2857 Score = 87.0 bits (212), Expect = 5e-17 2858 Identities = 67/259 (25%), Positives = 103/259 (38%), Gaps = 55/259 (21%) 2859 2860Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL---SEQNL 170 2861 +P +FD W + 
+++QG CGSCW+F + + I N V++ +E L 2862Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139 2863 2864Query: 171 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ--------------------- 209 2865 C EC +GCNGG A+N+ K G + 2866Sbjct: 140 TCCGGEC---------GDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHH 190 2867 2868Query: 210 -TESSYPYTAETGT-QCNFNSANIGAKIS---------NFTMIPKNETVMAGYIVSTGPL 258 2869 S P T E T +CN + G S + + NE + I GP+ 2870Sbjct: 191 VNGSRPPCTGEGDTPKCN-KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPV 249 2871 2872Query: 259 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 2873 A ++ Y GV+ H I I+G+ +N PYW+V NSW DW 2874Sbjct: 250 EGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVEN-----GTPYWLVGNSWNTDW 304 2875 2876Query: 318 GEQGYIYLRRGKNTCGVSN 336 2877 G+ G+ + RG++ CG+ + 2878Sbjct: 305 GDNGFFKILRGQDHCGIES 323 2879 2880 2881>sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP SECRETASE) 2882 Length = 339 2883 2884 Score = 85.8 bits (209), Expect = 1e-16 2885 Identities = 70/285 (24%), Positives = 110/285 (38%), Gaps = 63/285 (22%) 2886 2887Query: 96 KEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNV 151 2888 + +FT+DL +P +FD W + +++QG CGSCW+F + 2889Sbjct: 70 QRVMFTEDL------------KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 117 2890 2891Query: 152 EGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 209 2892 + I N VS+ S ++L+ C C C +GCNGG A+N+ + G + 2893Sbjct: 118 SDRICIHTNAHVSVEVSAEDLLTC---C-----GSMCGDGCNGGYPAEAWNFWTRKGLVS 169 2894 2895Query: 210 ----------------------TESSYPYTAETGTQCNFNSANIGAKIS---------NF 238 2896 S P T E T G + N 2897Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229 2898 2899Query: 239 TMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 297 2900 + +E + I GP+ A ++ Y GV+ H I I+G+ +N 2901Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVEN 289 2902 2903Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 2904 PYW+V NSW DWG+ G+ + RG++ CG+ + V I 2905Sbjct: 290 
-----GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 329 2906 2907 2908>sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SJ31) 2909 Length = 342 2910 2911 Score = 84.7 bits (206), Expect = 3e-16 2912 Identities = 66/268 (24%), Positives = 111/268 (40%), Gaps = 59/268 (22%) 2913 2914Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 171 2915 IP+ FD W +++ +++Q +CGSCW+F + + I + LS +L+ 2916Sbjct: 90 IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149 2917 2918Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI---------------------QT 210 2919 C C + C +GC GG A++Y +K G + T 2920Sbjct: 150 SC---CKD------CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHT 200 2921 2922Query: 211 ESSYP-------------YTAETGTQCNFNS-ANIGAKISNFTMIPKNETVMAGYIVSTG 256 2923 + YP T + G + + + G + N + NE V+ I+ G 2924Sbjct: 201 KGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYN---VQNNEKVIQRDIMMYG 257 2925 2926Query: 257 PLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 315 2927 P+ A D E + Y G++ H I I+G+ + K PYW++ NSW 2928Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-----KRTPYWLIANSWNE 312 2929 2930Query: 316 DWGEQGYIYLRRGKNTCGVSNFVSTSII 343 2931 DWGE+G + RG++ C + + V +I 2932Sbjct: 313 DWGEKGLFRMVRGRDECSIESDVVAGLI 340 2933 2934 2935>sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SM31) 2936 Length = 340 2937 2938 Score = 84.7 bits (206), Expect = 3e-16 2939 Identities = 66/260 (25%), Positives = 109/260 (41%), Gaps = 53/260 (20%) 2940 2941Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 171 2942 IP+ FD W ++ +++Q +CGSCWSF + + I + V LS +L+ 2943Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148 2944 2945Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA---ETGTQCNFNS 228 2946 C C E+C GC GG+ A++Y +K G + S +T +C ++ 2947Sbjct: 149 TC---C------ESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHT 199 2948 2949Query: 229 AN----IGAKISN---------------FTM----------IPKNETVMAGYIVSTGPLA 
259 2950 G+KI N +T + +E + I+ GP+ 2951Sbjct: 200 KGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 259 2952 2953Query: 260 IAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 2954 + E + Y G++ H I I+G+ +N PYW++ NSW DWG 2955Sbjct: 260 ASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVEN-----KTPYWLIANSWNEDWG 314 2956 2957Query: 319 EQGYIYLRRGKNTCGVSNFV 338 2958 E GY + RG++ C + + V 2959Sbjct: 315 ENGYFRIVRGRDECSIESEV 334 2960 2961 2962>sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 2963 Length = 339 2964 2965 Score = 83.9 bits (204), Expect = 5e-16 2966 Identities = 69/265 (26%), Positives = 111/265 (41%), Gaps = 55/265 (20%) 2967 2968Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLV 171 2969 +P FD W + +++QG CGSCW+F + + I N V++ S ++L+ 2970Sbjct: 80 LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139 2971 2972Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK----NGGIQTE---------------- 211 2973 C C C +GCNGG A+++ K +GG+ 2974Sbjct: 140 TC---C-----GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHV 191 2975 2976Query: 212 --SSYPYTAETGT-QCNFNSANIGAKIS----------NFTMIPKNETVMAGYIVSTGPL 258 2977 S P T E T +CN S G S ++++ + +MA I GP+ 2978Sbjct: 192 NGSRPPCTGEGDTPRCN-KSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAE-IYKNGPV 249 2979 2980Query: 259 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 2981 A ++ Y GV+ H I I+G+ +N + PYW+ NSW DW 2982Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGV-----PYWLAANSWNLDW 304 2983 2984Query: 318 GEQGYIYLRRGKNTCGVSNFVSTSI 342 2985 G+ G+ + RG+N CG+ + + I 2986Sbjct: 305 GDNGFFKILRGENHCGIESEIVAGI 329 2987 2988 2989>sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PRECURSOR 2990 Length = 379 2991 2992 Score = 83.5 bits (203), Expect = 6e-16 2993 Identities = 72/265 (27%), Positives = 114/265 (42%), Gaps = 61/265 (23%) 2994 2995Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK--LVSLSEQNLV 171 2996 IP +FD W ++ +++Q CGSCW+F + + I+ + V+LS +L+ 2997Sbjct: 105 
IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164 2998 2999Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ------CN 225 3000 C C ++C GCNGG A+ Y +K+G I T S+Y TA G + C 3001Sbjct: 165 SC---C------KSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY--TANNGCKPYPFPPCE 212 3002 3003Query: 226 FNSANIGAK----------------ISNFTMIPKNETVMAGY---------------IVS 254 3004 +S +S++T +E G +++ 3005Sbjct: 213 HHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMT 272 3006 3007Query: 255 TGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 313 3008 GPL IA + E + Y GGV+ H + ++G+ + I PYW V NSW 3009Sbjct: 273 HGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGI-----PYWTVANSW 327 3010 3011Query: 314 GADWGEQGYIYLRRGKNTCGVSNFV 338 3012 DWGE G+ + RG + CG+ + V 3013Sbjct: 328 NTDWGEDGFFRILRGVDECGIESGV 352 3014 3015 3016>sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 3017 Length = 340 3018 3019 Score = 83.5 bits (203), Expect = 6e-16 3020 Identities = 69/278 (24%), Positives = 112/278 (39%), Gaps = 56/278 (20%) 3021 3022Query: 100 FTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 159 3023 F +D+ + D D T W ++ +++QG CGSCW+F + + + 3024Sbjct: 74 FAEDMDLPDTFD--------TRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHT 125 3025 3026Query: 160 NKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ-------- 209 3027 N VS+ S ++L+ C C G E C GCNGG A+ Y + G + 3028Sbjct: 126 NAKVSVEVSAEDLLSC---C----GFE-CGMGCNGGYPSGAWRYWTERGLVSGGLYDSHV 177 3029 3030Query: 210 --------------TESSYPYTAETGT--QCN------FNSANIGAKISNFTM--IPKNE 245 3031 S P T E G +C+ ++ + K T +P++E 3032Sbjct: 178 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237 3033 3034Query: 246 TVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNM 304 3035 + I GP+ A E + Y GV+ H I I+G+ +N 3036Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVEN-----GT 292 3037 3038Query: 305 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 3039 PYW+ NSW DWG G+ + RG++ CG+ + + + 3040Sbjct: 293 PYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGV 330 
3041 3042 3043>sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 3044 Length = 341 3045 3046 Score = 76.1 bits (184), Expect = 1e-13 3047 Identities = 65/270 (24%), Positives = 104/270 (38%), Gaps = 54/270 (20%) 3048 3049Query: 103 DLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 158 3050 D V D +E + IP ++D W ++ + +Q CGSCW+ S+ + + I+ 3051Sbjct: 76 DEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIA 135 3052 3053Query: 159 QN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY-----IIKNGGIQTE 211 3054 K V +S Q++V C C C +GC GG +A+ + ++ G T+ 3055Sbjct: 136 SKGAKQVLISAQDVVSC---CTW------CGDGCEGGWPISAFRFHADEGVVTGGDYNTK 186 3056 3057Query: 212 SSY-PYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGY------------------- 251 3058 S PY + N G + + GY 3059Sbjct: 187 GSCRPYEIHPCGH-HGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNS 245 3060 3061Query: 252 -------IVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKN 303 3062 I+ GP+ E + Y G++ + H + ++G+ + K 3063Sbjct: 246 VKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEE-----KG 300 3064 3065Query: 304 MPYWIVKNSWGADWGEQGYIYLRRGKNTCG 333 3066 PYWIV NSW DWGE G+ + RG N CG 3067Sbjct: 301 TPYWIVANSWHDDWGENGFFRMHRGSNDCG 330 3068 3069 3070>sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PRECURSOR 3071 Length = 342 3072 3073 Score = 75.3 bits (182), Expect = 2e-13 3074 Identities = 55/247 (22%), Positives = 102/247 (41%), Gaps = 50/247 (20%) 3075 3076Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGC 190 3077 +++Q CGSCW+ ST + + I+ K V++S +++ C C C +GC 3078Sbjct: 105 IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTC---C-----RPQCGDGC 156 3079 3080Query: 191 NGGLQPNAYNYIIKNGGIQ------TESSYPYTAET----GTQCNFNSANIGAKI----- 235 3081 GG A+ Y I +G + + PY G + A 3082Sbjct: 157 EGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKR 216 3083 3084Query: 236 -----------------SNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFD 276 3085 + ++ ++ + I+ GP+ +A+ AV +++ Y G++ 3086Sbjct: 217 
KCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPV-VASFAVYEDFRHYKSGIYK 275 3087 3088Query: 277 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 3089 H + ++G+ +N N +W++ NSW DWGE+GY + RG N CG+ 3090Sbjct: 276 HTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEG 330 3091 3092Query: 337 FVSTSII 343 3093 ++ I+ 3094Sbjct: 331 TIAAGIV 337 3095 3096 3097>sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 3098 Length = 342 3099 3100 Score = 74.5 bits (180), Expect = 3e-13 3101 Identities = 55/247 (22%), Positives = 102/247 (41%), Gaps = 50/247 (20%) 3102 3103Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGC 190 3104 +++Q CGSCW+ ST + + I+ K V++S +++ C C C +GC 3105Sbjct: 105 IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTC---C-----RPQCGDGC 156 3106 3107Query: 191 NGGLQPNAYNYIIKNGGIQ------TESSYPYTAET----GTQCNFNSANIGAKI----- 235 3108 GG A+ Y I +G + + PY G + A 3109Sbjct: 157 EGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKR 216 3110 3111Query: 236 -----------------SNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFD 276 3112 + ++ ++ + I+ GP+ +A+ AV +++ Y G++ 3113Sbjct: 217 KCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPV-VASFAVYEDFRHYKSGIYK 275 3114 3115Query: 277 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 3116 H + ++G+ +N N +W++ NSW DWGE+GY + RG N CG+ 3117Sbjct: 276 HTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEG 330 3118 3119Query: 337 FVSTSII 343 3120 ++ I+ 3121Sbjct: 331 TIAAGIV 337 3122 3123 3124>sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PRECURSOR 3125 Length = 370 3126 3127 Score = 73.4 bits (177), Expect = 7e-13 3128 Identities = 61/259 (23%), Positives = 101/259 (38%), Gaps = 49/259 (18%) 3129 3130Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLV 171 3131 +P FD W + ++NQ CGSCW+F + + I N +S ++++ 3132Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151 3133 3134Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------T 210 
3135 C C C GC GG A + +G + 3136Sbjct: 152 SC---C-----GTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCP 203 3137 3138Query: 211 ESSYPYTAETGTQCNFNSA------NIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264 3139 ES+ P + +T Q ++ + + GA T K+ T + I GP+ + 3140Sbjct: 204 ESTTP-SCKTTCQSSYKTEEYKKDKHYGASAYKVTTT-KSVTEIQTEIYHYGPVEASYKV 261 3141 3142Query: 265 VE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 3143 E + Y GV+ H + I+G+ +N + YW++ NSWG +GE+G+ 3144Sbjct: 262 YEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGV-----DYWLIANSWGTSFGEKGFF 316 3145 3146Query: 324 YLRRGKNTCGVSNFVSTSI 342 3147 +RRG N C + V I 3148Sbjct: 317 KIRRGTNECQIEGNVVAGI 335 3149 3150 3151>sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P126) (111 KD ANTIGEN) 3152 Length = 989 3153 3154 Score = 70.6 bits (170), Expect = 4e-12 3155 Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 50/247 (20%) 3156 3157Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE--EACDEGC 190 3158 V++QG C + W F++ ++E + + +S + +C Y+GE + CDEG 3159Sbjct: 579 VEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANC------YKGEHKDRCDEGS 632 3160 3161Query: 191 NGGLQPNAYNYIIKNGG-IQTESSYPYT-AETGTQCN----------------------- 225 3162 + P + II++ G + ES+YPY + G QC 3163Sbjct: 633 S----PMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPN 688 3164 3165Query: 226 ---------FNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFD 276 3166 + S + F I K E + G +++ I A+ V + G 3167Sbjct: 689 SLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAY----IKAENVMGYEFSGKKVQ 744 3168 3169Query: 277 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 3170 C ++ DH + IVGY + YWIV+NSWG WG++GY + T N 3171Sbjct: 745 NLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFN 804 3172 3173Query: 337 FVSTSII 343 3174 F+ + +I 3175Sbjct: 805 FIHSVVI 811 3176 3177 3178>sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 3179 Length = 43 3180 3181 Score = 63.2 bits (151), Expect = 8e-10 3182 Identities = 25/35 (71%), Positives = 28/35 (79%) 3183 3184Query: 119 
PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 153 3185 P + DWR +GAVTPVKNQG CGSCW+FST VEG 3186Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEG 36 3187 3188 3189>sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 3190 Length = 43 3191 3192 Score = 62.1 bits (148), Expect = 2e-09 3193 Identities = 25/35 (71%), Positives = 28/35 (79%) 3194 3195Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 153 3196 P + DWR +GAVTPVKNQG CGSCW+FST VEG 3197Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIVTVEG 36 3198 3199 3200>sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 3201 Length = 174 3202 3203 Score = 59.3 bits (141), Expect = 1e-08 3204 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%) 3205 3206Query: 241 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300 3207 I KN V+AG+IV ++ Y G++ + H + I+G+ + 3208Sbjct: 87 IMKNGPVVAGFIVYE----------DFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---- 132 3209 3210Query: 301 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 3211 K PYW++ NSW DWGE+G+ + RG N C + V I+ 3212Sbjct: 133 -KGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGIV 174 3213 3214 3215>sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 3216 Length = 43 3217 3218 Score = 58.6 bits (139), Expect = 2e-08 3219 Identities = 23/36 (63%), Positives = 28/36 (76%) 3220 3221Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 153 3222 I + DWR +GAVTPV+NQG CGSCW+FS+ VEG 3223Sbjct: 1 IVASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 36 3224 3225 3226>sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 3227 Length = 43 3228 3229 Score = 58.2 bits (138), Expect = 3e-08 3230 Identities = 23/35 (65%), Positives = 27/35 (76%) 3231 3232Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 153 3233 P + DWR +GAVTPVK+Q CGSCW+FST VEG 3234Sbjct: 2 PGSVDWRQKGAVTPVKDQNPCGSCWAFSTVATVEG 36 3235 3236 3237>sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 3238 Length = 42 3239 3240 Score = 51.9 bits (122), Expect = 2e-06 3241 Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%) 3242 3243Query: 306 
YWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 3244 YWIVKNSWG WG++GYIY+ + KN CG++ S ++ 3245Sbjct: 4 YWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 42 3246 3247 3248>sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 3249 Length = 136 3250 3251 Score = 41.8 bits (96), Expect = 0.002 3252 Identities = 31/101 (30%), Positives = 50/101 (48%), Gaps = 4/101 (3%) 3253 3254Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIA 66 3255 L + + + S PP+ +++ E++ KF K Y+ E R +++ N KIE N 3256Sbjct: 17 LLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNADY 76 3257 3258Query: 67 INHKADTKFGVNKFADLSSDEFK-NYYLNN-KEAIFTDDLP 105 3259 K G+N+F+DL+ +EFK N Y N+ DLP 3260Sbjct: 77 EQGKTSFYMGLNQFSDLTPEEFKTNCYGNSLNRGEMAPDLP 117 3261 3262 3263>sp|P21381|THPA_THADA THAUMATOPAIN 3264 Length = 35 3265 3266 Score = 40.6 bits (93), Expect = 0.005 3267 Identities = 16/31 (51%), Positives = 22/31 (70%) 3268 3269Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 3270 ++P + DW +GAV VKNQ CGSC +FS+ 3271Sbjct: 1 NLPNSVDWWKKGAVAAVKNQRXCGSCXAFSS 31 3272 3273 3274>sp|P05689|CATX_BOVIN CATHEPSIN 3275 Length = 73 3276 3277 Score = 40.2 bits (92), Expect = 0.006 3278 Identities = 15/40 (37%), Positives = 24/40 (59%), Gaps = 5/40 (12%) 3279 3280Query: 284 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 3281 ++H + + G+ + M YWIV+NSWG WGE G++ 3282Sbjct: 9 INHIVSVAGWGVSD-----GMEYWIVRNSWGEPWGEHGWM 43 3283 3284 3285>sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 3286 Length = 141 3287 3288 Score = 38.7 bits (88), Expect = 0.018 3289 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%) 3290 3291Query: 6 LFVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 3292 +F+L + +S+ P P +++ E++ F K YS +E R +++ N KIE N 3293Sbjct: 20 VFLLILCLGMMSAAPSPDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHNA 79 3294 3295Query: 65 IAINHKADTKFGVNKFADLSSDEFK 89 3296 K G+N+F+DL+ +EF+ 3297Sbjct: 80 DYERGKTSFYMGLNQFSDLTPEEFR 104 3298 3299 3300>sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTECTIVE ANTIGEN) 
3301 Length = 650 3302 3303 Score = 35.2 bits (79), Expect = 0.21 3304 Identities = 24/81 (29%), Positives = 36/81 (43%), Gaps = 5/81 (6%) 3305 3306Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDC----DHECMEYEGEEACDEGCNGGLQPNAYNYI 202 3307 TT N + KL + + + +C DHEC +++C E NG Q + + 3308Sbjct: 533 TTCNPKEIQECQDKKLECVYKNHKAECECPDDHECYREPAKDSCSEEDNGKCQSSGQRCV 592 3309 3310Query: 203 IKNG-GIQTESSYPYTAETGT 222 3311 I+NG + E S TA T T 3312Sbjct: 593 IENGKAVCKEKSEATTAATTT 613 3313 3314 3315>sp|Q11121|PLB1_TORDE LYSOPHOSPHOLIPASE PRECURSOR (PHOSPHOLIPASE B) 3316 Length = 649 3317 3318 Score = 34.8 bits (78), Expect = 0.27 3319 Identities = 31/144 (21%), Positives = 56/144 (38%), Gaps = 13/144 (9%) 3320 3321Query: 109 YLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 168 3322 Y DDE + I F+ TRG +T + C +C + + K SL+ 3323Sbjct: 519 YTDDERLKMIKNGFEAATRGNLTDDSSFMGCVAC-------------AVMRRKQQSLNAT 565 3324 3325Query: 169 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNS 228 3326 +C Y D+ GL + ++ + ++ Y++ + T N 3327Sbjct: 566 LPEECSTCFTNYCWNGTIDDTPVSGLDNSDFDPTAASSAYSAYNTESYSSSSATGSKKNG 625 3328 3329Query: 229 ANIGAKISNFTMIPKNETVMAGYI 252 3330 A + A ++FT I T +AG++ 3331Sbjct: 626 AGLPATPTSFTSILTLLTAIAGFL 649 3332 3333 3334>sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 INTERGENIC REGION 3335 Length = 396 3336 3337 Score = 33.6 bits (75), Expect = 0.61 3338 Identities = 40/191 (20%), Positives = 76/191 (38%), Gaps = 43/191 (22%) 3339 3340Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF------------------INSI 118 3341 VNKF D++++E + ++ + P+ADYL F +N+ 3342Sbjct: 42 VNKFKDITNNESCTCEVGDRVWFSGKNAPLADYLSVHFRGPLKLKQFAFYTSPGFTVNNS 101 3343 3344Query: 119 PTAFDW----------RTRGAVTPVKNQGQCGSCW-------SFSTTGNVEGQHFISQNK 161 3345 ++ DW +T VT + + G+ C S + TG+ ++ 3346Sbjct: 102 RSSSDWNRLAYYESSSKTADNVTFLNHGGEASPCLGNALSYASSNGTGSASEATVLADGT 161 3347 3348Query: 162 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 221 3349 L+S ++ ++ + C + ++ C +G P Y Y GG T + + E 3350Sbjct: 162 
LISSDQEYIIYSNVSCPKSGYDKGCGVYRSG--IPAYYGY----GG--TTKMFLFEFEMP 213 3351 3352Query: 222 TQCNFNSANIG 232 3353 T+ NS++IG 3354Sbjct: 214 TETEKNSSSIG 224 3355 3356 3357>sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) 3358 Length = 239 3359 3360 Score = 32.1 bits (71), Expect = 1.8 3361 Identities = 24/93 (25%), Positives = 36/93 (37%), Gaps = 7/93 (7%) 3362 3363Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNG 192 3364 ++ G G C G V + + L + + N+V C C + ++ C G N 3365Sbjct: 137 IRPSGGSGDC---KYAGCVSDLNAACPDMLKVMDQNNVVACKSACERFNTDQYCCRGAND 193 3366 3367Query: 193 GLQ---PNAYNYIIKNGGIQTESSYPYTAETGT 222 3368 + P Y+ I KN SY Y ET T 3369Sbjct: 194 KPETCPPTDYSRIFKN-ACPDAYSYAYDDETST 225 3370 3371 3372>sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN 3373 Length = 512 3374 3375 Score = 31.7 bits (70), Expect = 2.4 3376 Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 9/63 (14%) 3377 3378Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 3379 + +NLI + K+D L+ +E KN+ +E I D+PV + DE +N+ 3380Sbjct: 237 KRVNLIPVIAKSDL---------LTKEELKNFKTQVREIIRVQDIPVCFFFGDEVLNATQ 287 3381 3382Query: 120 TAF 122 3383 F 3384Sbjct: 288 DIF 290 3385 3386 3387>sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-DIRECTED RNA POLYMERASE ; 3388 THIOL PROTEASE 3C ; HELICASE (2C LIKE PROTEIN)] 3389 Length = 1699 3390 3391 Score = 31.3 bits (69), Expect = 3.1 3392 Identities = 13/31 (41%), Positives = 21/31 (66%) 3393 3394Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLE 47 3395 SS+G+ EE ++ +++ N KYS EEYL+ 3396Sbjct: 893 SSKGLSDEEYDEYKRIREERNGKYSIEEYLQ 923 3397 3398 3399>sp|P21173|DNAA_MICLU CHROMOSOMAL REPLICATION INITIATOR PROTEIN DNAA 3400 Length = 515 3401 3402 Score = 31.3 bits (69), Expect = 3.1 3403 Identities = 25/104 (24%), Positives = 46/104 (44%), Gaps = 13/104 (12%) 3404 3405Query: 31 EFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKN 90 3406 EF + F H+E +++++ ++ L + I AD + V +F F 3407Sbjct: 247 
EFTNDFINSIRHDEGASFKQVYRN----VDILLIDDIQFLADKEATVEEFFHT----FNT 298 3408 3409Query: 91 YYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVK 134 3410 Y NNK+ + T DLP F + + + F+W G +T ++ 3411Sbjct: 299 LYNNNKQVVITSDLPPKQL--SGFEDRLRSRFEW---GLITDIQ 337 3412 3413 3414>sp|P89263|Y022_GVXN HYPOTHETICAL ORF22 HOMOLOG 3415 Length = 166 3416 3417 Score = 30.5 bits (67), Expect = 5.3 3418 Identities = 27/107 (25%), Positives = 43/107 (39%), Gaps = 9/107 (8%) 3419 3420Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 3421 SF T G H SQ V +S Q+LV Y+ CD+ Y+ ++ 3422Sbjct: 66 SFDTLEGPGGGHCFSQP--VRVSRQDLVT-------YDCASLCDDVRAAYFYVGPYDRLV 116 3423 3424Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAG 250 3425 +G E Y T CN ++ + I+++T I ++ AG 3426Sbjct: 117 VDGNELQEGGYCTTNSVPRNCNRETSILLHSINHWTCIAEDPRYYAG 163 3427 3428 3429>sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 3430 Length = 527 3431 3432 Score = 30.5 bits (67), Expect = 5.3 3433 Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 7/52 (13%) 3434 3435Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNN 95 3436 +YL + I+K K +L L IN K T F LSS FKNYYL + 3437Sbjct: 466 DYLAKNSIYKMKNLKFMDLFLNNINSKGYTLF-------LSSGMFKNYYLKS 510 3438 3439 3440>sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 3441 Length = 1427 3442 3443 Score = 30.1 bits (66), Expect = 7.0 3444 Identities = 22/89 (24%), Positives = 44/89 (48%), Gaps = 10/89 (11%) 3445 3446Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--- 77 3447 +PP + S F++ + Y EE ++ E F NLG + ++ I H+ + K+ + 3448Sbjct: 1314 LPPFQVSSFVKETKLHSGDYGEEEDADQEESFSLNLG----IGIVEIAHENEQKWLIYDK 1369 3449 3450Query: 78 --NKFADLSSDEFKNYYLNNKEAIFTDDL 104 3451 +K+ S E ++++N +TDD+ 3452Sbjct: 1370 KDHKYVCTFSME-PYHFISNYNTKYTDDM 1397 3453 3454 3455>sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 3456 Length = 436 3457 3458 Score = 30.1 bits (66), Expect = 7.0 3459 Identities = 11/20 (55%), Positives = 14/20 (70%) 3460 3461Query: 303 NMPYWIVKNSWGADWGEQGY 
322 3462 N W V+NSWG D G++GY 3463Sbjct: 370 NSTKWKVENSWGKDAGQKGY 389 3464 3465 3466>sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 3467 Length = 455 3468 3469 Score = 29.7 bits (65), Expect = 9.1 3470 Identities = 10/17 (58%), Positives = 13/17 (75%) 3471 3472Query: 307 WIVKNSWGADWGEQGYI 323 3473 W V+NSWG D G +GY+ 3474Sbjct: 392 WRVENSWGEDHGHKGYL 408 3475 3476 3477>sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (AMINOPEPTIDASE H) 3478 Length = 455 3479 3480 Score = 29.7 bits (65), Expect = 9.1 3481 Identities = 10/19 (52%), Positives = 14/19 (73%) 3482 3483Query: 307 WIVKNSWGADWGEQGYIYL 325 3484 W V+NSWG D G +GY+ + 3485Sbjct: 392 WRVENSWGEDRGNKGYLIM 410 3486 3487 3488>sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 3489 Length = 454 3490 3491 Score = 29.7 bits (65), Expect = 9.1 3492 Identities = 10/17 (58%), Positives = 13/17 (75%) 3493 3494Query: 307 WIVKNSWGADWGEQGYI 323 3495 W V+NSWG D G +GY+ 3496Sbjct: 392 WRVENSWGEDHGHKGYL 408 3497 3498 3499Searching..................................................done 3500 3501 3502Results from round 2 3503 3504 3505 Score E 3506Sequences producing significant alignments: (bits) Value 3507Sequences used in model and found again: 3508 3509sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 527 e-149 3510sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 440 e-123 3511sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN... 431 e-121 3512sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 431 e-120 3513sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 430 e-120 3514sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 
421 e-118
sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 417 e-116
sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 412 e-115
sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 407 e-113
sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 403 e-112
sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 402 e-112
sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEP... 401 e-111
sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RES... 399 e-111
sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK C... 398 e-111
sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 395 e-110
sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 394 e-109
sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 393 e-109
sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 392 e-109
sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 392 e-109
sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 391 e-109
sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 391 e-108
sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 389 e-108
sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 387 e-107
sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 384 e-106
sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 383 e-106
sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 382 e-106
sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 380 e-105
sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPS... 377 e-104
sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 376 e-104
sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 375 e-104
sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 374 e-103
sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 374 e-103
sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 373 e-103
sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 373 e-103
sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHE... 372 e-103
sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 370 e-102
sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 368 e-102
sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 367 e-101
sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 365 e-100
sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 365 e-100
sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 363 e-100
sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 363 e-100
sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 361 e-100
sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 361 1e-99
sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 361 2e-99
sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEIN... 356 4e-98
sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 351 2e-96
sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 348 1e-95
sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 348 1e-95
sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE... 347 2e-95
sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE... 347 3e-95
sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPA... 346 3e-95
sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA... 341 1e-93
sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II... 339 6e-93
sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 334 1e-91
sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 331 1e-90
sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 327 2e-89
sp|Q10991|CATL_SHEEP CATHEPSIN L 325 1e-88
sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 323 4e-88
sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 319 4e-87
sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 318 9e-87
sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 318 9e-87
sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 315 8e-86
sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE P... 312 9e-85
sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 309 5e-84
sp|P25326|CATS_BOVIN CATHEPSIN S 307 2e-83
sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 301 1e-81
sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR ... 299 5e-81
sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE... 298 9e-81
sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 296 5e-80
sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 291 1e-78
sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 278 1e-74
sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR... 272 8e-73
sp|P80884|ANAN_ANACO ANANAIN 271 1e-72
sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 264 1e-70
sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 262 9e-70
sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 259 6e-69
sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR... 255 1e-67
sp|P14518|BROM_ANACO BROMELAIN, STEM 253 4e-67
sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 251 2e-66
sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPP... 250 4e-66
sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 243 5e-64
sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP S... 238 1e-62
sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 236 4e-62
sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 233 4e-61
sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PREC... 232 8e-61
sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 232 8e-61
sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 232 1e-60
sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 231 1e-60
sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 231 1e-60
sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PREC... 231 2e-60
sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PREC... 225 8e-59
sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 222 1e-57
sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 216 5e-56
sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 213 4e-55
sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PREC... 212 1e-54
sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 208 2e-53
sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PREC... 204 3e-52
sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 201 1e-51
sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN... 185 2e-46
sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 158 2e-38
sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P1... 157 3e-38
sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 139 6e-33
sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 131 2e-30
sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 85 3e-16
sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 84 4e-16
sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 81 2e-15
sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 77 5e-14
sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 76 1e-13

Sequences not found previously or not previously below threshold:

sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 98 2e-20
sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 96 1e-19
sp|P05689|CATX_BOVIN CATHEPSIN 60 9e-09
sp|P94869|PEPG_LACDL AMINOPEPTIDASE G 46 1e-04
sp|P94870|PEPE_LACHE AMINOPEPTIDASE E 43 0.001
sp|P94868|PEPW_LACDL AMINOPEPTIDASE W 42 0.002
sp|Q10744|PEPC_LACHE AMINOPEPTIDASE C 41 0.003
sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 40 0.009
sp|Q48543|PEPC_LACDL AMINOPEPTIDASE C 38 0.025
sp|P21381|THPA_THADA THAUMATOPAIN 38 0.025
sp|Q56115|PEPC_STRTR AMINOPEPTIDASE C 36 0.13
sp|P09983|HLY1_ECOLI HEMOLYSIN, CHROMOSOMAL 36 0.17
sp|P33403|CYSP_TRIFO CYSTEINE PROTEINASE 35 0.28
sp|P08715|HLYA_ECOLI HEMOLYSIN, PLASMID 35 0.28
sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (... 34 0.37
sp|P54704|PSPB_DICDI PRESPORE PROTEIN B PRECURSOR 34 0.37
sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 34 0.49
sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 34 0.49
sp|P80532|CAT3_FASHE PUTATIVE CATHEPSIN L3 (NEWLY EXCYSTED JUVEN... 34 0.49
sp|P16462|LKTA_ACTAC LEUKOTOXIN 34 0.49
sp|P13438|TSP_MOUSE TROPHOBLAST-SPECIFIC PROTEIN PRECURSOR 32 1.4
sp|Q00951|HLYA_ACTSU HEMOLYSIN (CYTOLYSIN II) (CLY-IIA) (HLY-IIA... 32 1.4
sp|P15377|RT2A_ACTPL RTX-II TOXIN DETERMINANT A (APX-IIA) (HEMOL... 32 1.4
sp|P52181|TGLC_PAGMA PROTEIN-GLUTAMINE GAMMA-GLUTAMYLTRANSFERASE... 32 1.9
sp|P35681|TCTP_ORYSA TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HO... 32 1.9
sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDRO... 32 2.5
sp|P16312|MMAL_DERMI MAJOR MITE FECAL ALLERGEN DER M 1 (DER M I) 31 3.2
sp|O03992|TCTP_FRAAN TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HO... 31 4.2
sp|Q04489|YMJ6_YEAST HYPOTHETICAL 59.5 KD PROTEIN IN VPS9-RAD10 ... 31 4.2
sp|Q03164|HRX_HUMAN ZINC FINGER PROTEIN HRX (ALL-1) (TRITHORAX-L... 31 4.2
sp||CATB_COTJA_1 [Segment 1 of 2] CATHEPSIN B (CATHEPSIN B1) 30 5.6
sp|Q62703|RCN2_RAT RETICULOCALBIN 2 PRECURSOR (CALCIUM-BINDING P... 30 5.6
sp|P48651|PSS1_HUMAN PHOSPHATIDYLSERINE SYNTHASE I (SERINE-EXCHA... 30 5.6
sp|P33404|CYSP_TRIVA CYSTEINE PROTEINASE 30 5.6
sp|Q00576|PSS1_CRILO PHOSPHATIDYLSERINE SYNTHASE I (SERINE-EXCHA... 30 5.6
sp|Q9ZRX0|TCTP_PSEMZ TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HO... 30 7.3
sp|Q94480|V136_DICDI VEG136 PROTEIN 30 7.3
sp|P55131|RT32_ACTPL RTX-III TOXIN DETERMINANT A FROM SEROTYPE 8... 30 7.3
sp|P55130|RT31_ACTPL RTX-III TOXIN DETERMINANT A FROM SEROTYPE 2... 30 7.3
sp|P40101|YE16_YEAST HYPOTHETICAL 35.9 KD PROTEIN IN ISC10 3'REGION 30 7.3
sp|P13388|XMRK_XIPMA MELANOMA RECEPTOR PROTEIN-TYROSINE KINASE P... 30 7.3
sp|Q9ZL75|MOAA_HELPJ MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 30 9.5
sp|P55129|RT12_ACTPL RTX-I TOXIN DETERMINANT A FROM SEROTYPES 5/... 30 9.5
sp|P55128|RT11_ACTPL RTX-I TOXIN DETERMINANT A FROM SEROTYPES 1/... 30 9.5
sp|P35669|GSHB_SCHPO GLUTATHIONE SYNTHETASE LARGE CHAIN (GLUTATH... 30 9.5
sp|P11140|ABRA_ABRPR ABRIN-A PRECURSOR (RRNA N-GLYCOSIDASE) 30 9.5
sp|Q00690|LEM2_MOUSE E-SELECTIN PRECURSOR (ENDOTHELIAL LEUKOCYTE... 
30 9.5 3668sp|P06620|ICEN_PSESY ICE NUCLEATION PROTEIN 30 9.5 3669 3670>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 3671 Length = 343 3672 3673 Score = 527 bits (1343), Expect = e-149 3674 Identities = 343/343 (100%), Positives = 343/343 (100%) 3675 3676Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 3677 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 3678Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 3679 3680Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120 3681 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 3682Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120 3683 3684Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 3685 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 3686Sbjct: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 3687 3688Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240 3689 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 3690Sbjct: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240 3691 3692Query: 241 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300 3693 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 3694Sbjct: 241 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300 3695 3696Query: 301 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 3697 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 3698Sbjct: 301 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 3699 3700 3701>sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 3702 Length = 334 3703 3704 Score = 440 bits (1119), Expect = e-123 3705 Identities = 120/343 (34%), Positives = 176/343 (50%), Gaps = 19/343 (5%) 3706 3707Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 3708 F L V + V+S + P + + +++ + Y E R +++ N I+ N 3709Sbjct: 5 FFLTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQ 64 3710 
3711Query: 65 IAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDW 124 3712 K + +N F D++++EF+ + + +P + DW 3713Sbjct: 65 EYSEGKHAFRMAMNAFGDMTNEEFRQVMNG----FQNQKHKKGKLFHEPLLVDVPKSVDW 120 3714 3715Query: 125 RTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEE 184 3716 +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC 3717Sbjct: 121 TKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA-------- 172 3718 3719Query: 185 ACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN 244 3720 ++GCNGGL NA+ YI NGG+ +E SYPY A CN+ A + F IP+ 3721Sbjct: 173 QGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR 232 3722 3723Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 3724 E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY + T 3725Sbjct: 233 EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSN 292 3726 3727Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 3728 N +WIVKNSWG +WG GY+ + + N CG++ S + 3729Sbjct: 293 NNK-FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334 3730 3731 3732>sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) (CYCLIC 3733 PROTEIN-2) (CP-2) 3734 Length = 334 3735 3736 Score = 431 bits (1097), Expect = e-121 3737 Identities = 123/345 (35%), Positives = 187/345 (53%), Gaps = 19/345 (5%) 3738 3739Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 3740 ++LL VL + T + + +Q+ +++ + Y E R +++ N+ I+ 3741Sbjct: 4 LLLLAVLCLGTALATPKF-DQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLH 62 3742 3743Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAF 122 3744 N N K +N F D++++EF+ + + + IP 3745Sbjct: 63 NGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKH----KKGRLFQEPLMLQIPKTV 118 3746 3747Query: 123 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 182 3748 DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVDC H 3749Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH------- 171 3750 3751Query: 183 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP 242 3752 + 
++GCNGGL A+ YI +NGG+ +E SYPY A+ G C + + A + F IP 3753Sbjct: 172 -DQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG-SCKYRAEYAVANDTGFVDIP 229 3754 3755Query: 243 KNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSAKNTI 299 3756 + E + + + GP+++A DA QFY G++ P C+ LDHG+L+VGY + T 3757Sbjct: 230 QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTD 289 3758 3759Query: 300 FRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 3760 K+ YW+VKNSWG +WG GYI + + + N CG++ S I+ 3761Sbjct: 290 SNKDK-YWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333 3762 3763 3764>sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 3765 Length = 334 3766 3767 Score = 431 bits (1096), Expect = e-120 3768 Identities = 120/347 (34%), Positives = 189/347 (53%), Gaps = 18/347 (5%) 3769 3770Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 3771 M ++LL + +++ +++ +++ + Y E R I++ N+ I+ 3772Sbjct: 1 MNLLLLLAVLCLGTALATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQ 60 3773 3774Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120 3775 N N + +N F D++++EF+ + + + IP 3776Sbjct: 61 LHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKH----KKGRLFQEPLMLKIPK 116 3777 3778Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 3779 + DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVDC H 3780Sbjct: 117 SVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHA---- 172 3781 3782Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240 3783 ++GCNGGL A+ YI +NGG+ +E SYPY A+ G C + + A + F 3784Sbjct: 173 ----QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG-SCKYRAEFAVANDTGFVD 227 3785 3786Query: 241 IPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSAKN 297 3787 IP+ E + + + GP+++A DA QFY G++ P C+ +LDHG+L+VGY + 3788Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287 3789 3790Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 3791 T KN YW+VKNSWG++WG +GYI + + + N CG++ S ++ 3792Sbjct: 288 
TDSNKNK-YWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333 3793 3794 3795>sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 3796 Length = 334 3797 3798 Score = 430 bits (1094), Expect = e-120 3799 Identities = 115/343 (33%), Positives = 171/343 (49%), Gaps = 19/343 (5%) 3800 3801Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 3802 L + ++S + + + +++ + Y E R +++ N+ IE N 3803Sbjct: 5 LFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQ 64 3804 3805Query: 65 IAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDW 124 3806 K +N F D++++EF+ + + +P + DW 3807Sbjct: 65 EYSQGKHGFSMAMNAFGDMTNEEFRQVMNG----FQNQKHKKGKVFHESLVLEVPKSVDW 120 3808 3809Query: 125 RTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEE 184 3810 R +G VT VKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC 3811Sbjct: 121 REKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS--------RP 172 3812 3813Query: 185 ACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN 244 3814 ++GCNGGL NA+ Y+ NGG+ TE SYPY C + A + F IP+ 3815Sbjct: 173 QGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQR 232 3816 3817Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 3818 E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY + T 3819Sbjct: 233 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSN 292 3820 3821Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 3822 + +WIVKNSWG +WG GY+ + + N CG+S S + 3823Sbjct: 293 SSK-FWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334 3824 3825 3826>sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) 3827 Length = 333 3828 3829 Score = 421 bits (1072), Expect = e-118 3830 Identities = 118/343 (34%), Positives = 180/343 (52%), Gaps = 20/343 (5%) 3831 3832Query: 7 FVLAVFTVFVSSRGIPPE--EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64 3833 +LA F + ++S + + ++Q+ +++ N+ Y E R +++ N+ IE N 3834Sbjct: 5 LILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQ 64 3835 3836Query: 65 IAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDW 
124 3837 K +N F D++S+EF+ + P + DW 3838Sbjct: 65 EYREGKHSFTMAMNAFGDMTSEEFRQVMNG----FQNRKPRKGKVFQEPLFYEAPRSVDW 120 3839 3840Query: 125 RTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEE 184 3841 R +G VTPVKNQGQCGSCW+FS TG +EGQ F +L+SLSEQNLVDC 3842Sbjct: 121 REKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS--------GP 172 3843 3844Query: 185 ACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN 244 3845 +EGCNGGL A+ Y+ NGG+ +E SYPY A C +N A + F IPK 3846Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE-ESCKYNPKYSVANDTGFVDIPKQ 231 3847 3848Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFR 301 3849 E + + + GP+++A DA + FY G++ + C+ +DHG+L+VGY ++T 3850Sbjct: 232 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD 291 3851 3852Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 3853 N YW+VKNSWG +WG GY+ + + +N CG+++ S + 3854Sbjct: 292 NNK-YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333 3855 3856 3857>sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 3858 Length = 334 3859 3860 Score = 417 bits (1061), Expect = e-116 3861 Identities = 118/346 (34%), Positives = 179/346 (51%), Gaps = 21/346 (6%) 3862 3863Query: 5 LLFVLAVFTVFVSSRGI--PPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62 3864 L VLA F + ++S +++ +++ + Y E R +++ N+ IE 3865Sbjct: 3 LSLVLAAFCLGIASAVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELH 62 3866 3867Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAF 122 3868 N K +N F D++++EF+ + F + +P + 3869Sbjct: 63 NGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFR----KGKVFREPLFLDLPKSV 118 3870 3871Query: 123 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 182 3872 DWR +G VTPVKNQ QCGSCW+FS TG +EGQ F KLVSLSEQNLVDC 3873Sbjct: 119 DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS-------- 170 3874 3875Query: 183 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMI- 241 3876 ++GCNGG A+ Y+ +NGG+ +E SYPY A C + N A + FT++ 3877Sbjct: 171 RPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVD-EICKYRPENSVANDTGFTVVA 229 3878 
3879Query: 242 PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNT 298 3880 P E + + + GP+++A DA +QFY G++ + C+ +LDHG+L+VGY + 3881Sbjct: 230 PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFE-G 288 3882 3883Query: 299 IFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 3884 N YW+VKNSWG +WG GY+ + + K N CG++ S + 3885Sbjct: 289 ANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334 3886 3887 3888>sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 3889 Length = 368 3890 3891 Score = 412 bits (1048), Expect = e-115 3892 Identities = 148/356 (41%), Positives = 199/356 (55%), Gaps = 28/356 (7%) 3893 3894Query: 6 LFVLAVFTVFVSSRGIP------------------PEEQSQFLEFQDKFNKKY-SHEEYL 46 3895 +FVL+ F V VSS + + F F+ KF K Y S+EE+ 3896Sbjct: 10 VFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHD 69 3897 3898Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPV 106 3899 RF +FK+NL + + GV +F+DL+ EF+ +L + 3900Sbjct: 70 YRFSVFKANLRRARRHQKLDP----SATHGVTQFSDLTRSEFRKKHLGVRSGFKLP--KD 123 3901 3902Query: 107 ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166 3903 A+ ++P FDWR GAVTPVKNQG CGSCWSFS TG +EG +F++ KLVSLS 3904Sbjct: 124 ANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLS 183 3905 3906Query: 167 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF 226 3907 EQ LVDCDHEC + E ++CD GCNGGL +A+ Y +K GG+ E YPYT + G C 3908Sbjct: 184 EQQLVDCDHEC-DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKL 242 3909 3910Query: 227 NSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH 286 3911 + + I A +SNF++I +E +A +V GPLA+A +A Q YIGGV L+H 3912Sbjct: 243 DKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNH 302 3913 3914Query: 287 GILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVST 340 3915 G+L+VGY A K PYWI+KNSWG WGE G+ + +G+N CGV + VST 3916Sbjct: 303 GVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358 3917 3918 3919>sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 3920 Length = 458 3921 3922 Score = 407 bits 
(1036), Expect = e-113 3923 Identities = 121/350 (34%), Positives = 183/350 (51%), Gaps = 27/350 (7%) 3924 3925Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ--FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 3926 ++LL LA + + S G EE+++ + E++ + K Y+ E R+ F+ NL I 3927Sbjct: 12 LLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYI 71 3928 3929Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 3930 +E N A + G+N+FADL+++E+++ YL + + V+D ++P 3931Sbjct: 72 DEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRER-KVSDRYLAADNEALP 130 3932 3933Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 3934 + DWRT+GAV +K+QG CGSCW+FS VE + I L+SLSEQ LVDCD 3935Sbjct: 131 ESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCD----- 185 3936 3937Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI-GAKISNF 238 3938 + +EGCNGGL A+++II NGGI TE YPY + +C+ N N I ++ 3939Sbjct: 186 ----TSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKD-ERCDVNRKNAKVVTIDSY 240 3940 3941Query: 239 TMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAK 296 3942 + N V P+++A +A +Q Y G+F C +LDHG+ VGY + 3943Sbjct: 241 EDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCG-TALDHGVAAVGYGTE 299 3944 3945Query: 297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVSTSI 342 3946 N YWIV+NSWG WGE GY+ + R CG++ S + 3947Sbjct: 300 N-----GKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 344 3948 3949 3950>sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 3951 Length = 323 3952 3953 Score = 403 bits (1024), Expect = e-112 3954 Identities = 132/349 (37%), Positives = 187/349 (52%), Gaps = 32/349 (9%) 3955 3956Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKI 59 3957 MKV +LF+ V S + F+ K+ ++Y EE R IF+ N I 3958Sbjct: 1 MKVAVLFLCGVALAAASPS---------WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYI 51 3959 3960Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 3961 EE N N + +NKF D++ +EF N I PV+ + + 3962Sbjct: 52 EEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGN---IPRRSAPVSVFYPKKETGPQA 108 3963 3964Query: 120 
TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 3965 T DWRT+GAVTPVK+QGQCGSCW+FSTTG++EGQHF+ L+SL+EQ LVDC 3966Sbjct: 109 TEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCS----- 163 3967 3968Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 3969 +GCNGG +A++YI N GI TE++YPY A G C F+S ++ A S T 3970Sbjct: 164 ---RPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDG-SCRFDSNSVAATCSGHT 219 3971 3972Query: 240 MI-PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGYSA 295 3973 I +ET + + GP+++ DA +QFY GV+ P C+P+ LDH +L VGY + 3974Sbjct: 220 NIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGS 279 3975 3976Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 3977 + +W+VKNSW WG+ GYI + R + N CG++ S ++ 3978Sbjct: 280 EG-----GQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323 3979 3980 3981>sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 3982 Length = 313 3983 3984 Score = 402 bits (1021), Expect = e-112 3985 Identities = 141/312 (45%), Positives = 187/312 (59%), Gaps = 10/312 (3%) 3986 3987Query: 32 FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKN 90 3988 F+ KF K Y EE+ RF +FK+NL + + + + GV +F+DL+ EF+ 3989Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARH----GVTQFSDLTRSEFRR 58 3990 3991Query: 91 YYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 150 3992 +L K A+ ++P FDWR RGAVTPVKNQG CGSCWSFSTTG 3993Sbjct: 59 KHLGVKGGFKLP--KDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGA 116 3994 3995Query: 151 VEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT 210 3996 +EG HF++ KLVSLSEQ LVDCDHEC + E E +CD GCNGGL +A+ Y +K GG+ 3997Sbjct: 117 LEGAHFLATGKLVSLSEQQLVDCDHEC-DPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMR 175 3998 3999Query: 211 ESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFY 270 4000 E YPYT G C + + I A +SNF+++ NE +A ++ GPLA+A +A Q Y 4001Sbjct: 176 EKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTY 235 4002 4003Query: 271 IGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRG 328 
4004 IGGV L+HG+L+VGY + K PYWI+KNSWG WGE G+ + +G 4005Sbjct: 236 IGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKG 295 4006 4007Query: 329 KNTCGVSNFVST 340 4008 +N CGV + VST 4009Sbjct: 296 RNICGVDSLVST 307 4010 4011 4012>sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEPSIN X) (CATHEPSIN O2) 4013 Length = 329 4014 4015 Score = 401 bits (1020), Expect = e-111 4016 Identities = 121/341 (35%), Positives = 179/341 (52%), Gaps = 25/341 (7%) 4017 4018Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 4019 L V + V S + PEE + + ++ K+Y+++ + + R I++ NL I NL 4020Sbjct: 4 LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63 4021 4022Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 4023 A + +N D++S+E K + Y+ E+ P + D+R 4024Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIP-EWEGRAPDSVDYR 122 4025 4026Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 4027 +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 4028Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--------- 173 4029 4030Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KN 244 4031 ++GC GG NA+ Y+ KN GI +E +YPY + C +N AK + IP N 4032Sbjct: 174 -NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE-ESCMYNPTGKAAKCRGYREIPEGN 231 4033 4034Query: 245 ETVMAGYIVSTGPLAIAADAV--EWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 4035 E + + GP+++A DA +QFY GV +D CN ++L+H +L VGY + 4036Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-----IQ 286 4037 4038Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTS 341 4039 K +WI+KNSWG +WG +GYI + R KN CG++N S 4040Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327 4041 4042 4043>sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A) 4044 Length = 363 4045 4046 Score = 399 bits (1015), Expect = e-111 4047 Identities = 141/322 (43%), Positives = 196/322 (60%), Gaps = 12/322 (3%) 4048 4049Query: 23 
PEEQSQFLEFQDKFNKKYSHEEYL-ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81 4050 + F F+ KF+K Y+ +E RF +FKSNL K + N + G+ KF+ 4051Sbjct: 42 LNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQ----NRDPTAEHGITKFS 97 4052 4053Query: 82 DLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGS 141 4054 DL++ EF+ +L K+ + A ++P FDWR +GAVTPVK+QG CGS 4055Sbjct: 98 DLTASEFRRQFLGLKKRLRLPAH--AQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGS 155 4056 4057Query: 142 CWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY 201 4058 CW+FSTTG +EG H+++ KLVSLSEQ LVDCDH C + E +CD GCNGGL NA+ Y 4059Sbjct: 156 CWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVC-DPEQAGSCDSGCNGGLMNNAFEY 214 4060 4061Query: 202 IIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIA 261 4062 ++++GG+ E Y YT G C F+ + + A +SNF+++ +E +A +V GPLA+A 4063Sbjct: 215 LLESGGVVQEKDYAYTGRDG-SCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVA 273 4064 4065Query: 262 ADAVEWQFYIGGVFD-IPCNPNSLDHGILIVGY--SAKNTIFRKNMPYWIVKNSWGADWG 318 4066 +A Q Y+ GV C + LDHG+L+VG+ A I K PYWI+KNSWG +WG 4067Sbjct: 274 INAAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWG 333 4068 4069Query: 319 EQGYIYLRRGKNTCGVSNFVST 340 4070 EQGY + RG+N CGV + VST 4071Sbjct: 334 EQGYYKICRGRNVCGVDSMVST 355 4072 4073 4074>sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK CATHEPSIN) 4075 Length = 376 4076 4077 Score = 398 bits (1012), Expect = e-111 4078 Identities = 145/386 (37%), Positives = 211/386 (54%), Gaps = 55/386 (14%) 4079 4080Query: 1 MKVILLFVLAVFTVFVSSRGIP-------PEEQSQFLEFQDKFNKKYSHEEYLERFEIFK 53 4081 M++++ +L +F F + P + ++ F E+ KFN++YS E+ R+ IFK 4082Sbjct: 1 MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFK 60 4083 4084Query: 54 SNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKE-AIFTDDLPVADYLDD 112 4085 SN+ ++ N + T G+N FAD++++E++ YL + A + + L+ 4086Sbjct: 61 SNMDYVDNWNSK---GDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNV 117 4087 4088Query: 113 EFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 172 4089 E + + P + DWRT+ AVTP+K+QGQCGSCWSFSTTG+ EG H + KLVSLSEQNLVD 
4090Sbjct: 118 EDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVD 177 4091 4092Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 232 4093 C + GC+GGL NA++YIIKN GI TESSYPYTAETG+ C FN ++IG 4094Sbjct: 178 CSGPEENF--------GCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIG 229 4095 4096Query: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGIL 289 4097 A I + I + GP+++A DA +Q Y G++ P C+P LDHG+L 4098Sbjct: 230 ATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVL 289 4099 4100Query: 290 IVGYSA--------------------------------KNTIFRKNMPYWIVKNSWGADW 317 4101 +VGY +++ K YWIVKNSWG W 4102Sbjct: 290 VVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIVKNSWGTSW 349 4103 4104Query: 318 GEQGYIYLRRG-KNTCGVSNFVSTSI 342 4105 G +GYI + + KN CG+++ S + 4106Sbjct: 350 GIKGYILMSKDRKNNCGIASVSSYPL 375 4107 4108 4109>sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 4110 Length = 329 4111 4112 Score = 395 bits (1005), Expect = e-110 4113 Identities = 118/341 (34%), Positives = 178/341 (51%), Gaps = 25/341 (7%) 4114 4115Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 4116 L V + V S + PEE +Q+ ++ ++K+Y+ + + + R I++ NL I NL 4117Sbjct: 4 LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63 4118 4119Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 4120 A + +N D++S+E K Y+ ++ P + D+R 4121Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSRSHSNDTLYIP-DWEGRTPDSIDYR 122 4122 4123Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 4124 +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 4125Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--------- 173 4126 4127Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KN 244 4128 + GC GG NA+ Y+ +N GI +E +YPY + C +N AK + IP N 4129Sbjct: 174 -NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQD-ESCMYNPTGKAAKCRGYREIPEGN 231 4130 4131Query: 245 ETVMAGYIVSTGPLAIAADAV--EWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 4132 E + + GP+++A DA 
+QFY GV +D C+ ++++H +L VGY + 4133Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYG-----IQ 286 4134 4135Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTS 341 4136 K +WI+KNSWG WG +GYI + R KN CG++N S 4137Sbjct: 287 KGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLASFP 327 4138 4139 4140>sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 4141 Length = 344 4142 4143 Score = 394 bits (1001), Expect = e-109 4144 Identities = 138/362 (38%), Positives = 200/362 (55%), Gaps = 37/362 (10%) 4145 4146Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59 4147 MKV+ L VL V + + ++ F ++ K Y+ EE+ R+ IF +N+ + 4148Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYV 60 4149 4150Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 4151 ++ N + ++T G+N FAD++++E++N YL K F + + NS 4152Sbjct: 61 QQWN----SKGSETVLGLNNFADITNEEYRNTYLGTK---FDASSLIGTQEEKVHTNSSA 113 4153 4154Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 4155 + DWR+ GAVTPVKNQGQCG CWSFSTTG+ EG HF S+ +LVSLSEQNL+DC E 4156Sbjct: 114 ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--- 170 4157 4158Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 4159 + GC+GGL A+ YII N GI TESSYPY AE G +C + S N GA +S++ 4160Sbjct: 171 -------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG-KCEYKSENSGATLSSYK 222 4161 4162Query: 240 MIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYS-- 294 4163 + V+ P+++A DA +Q Y G++ P C+ +LDHG+L VGY 4164Sbjct: 223 TVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSG 282 4165 4166Query: 295 ------------AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 4167 + N + YWIVKNSWG WG +GYI + R + N CG+++ S 4168Sbjct: 283 SGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFP 342 4169 4170Query: 342 II 343 4171 ++ 4172Sbjct: 343 VV 344 4173 4174 4175>sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 4176 Length = 322 4177 4178 Score = 393 bits (998), Expect = e-109 4179 Identities = 
130/351 (37%), Positives = 186/351 (52%), Gaps = 37/351 (10%) 4180 4181Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59 4182 MKV+ LF+ + + + EF+ KF +KY EE R +F NL I 4183Sbjct: 1 MKVVALFLFGLALAAANPS---------WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYI 51 4184 4185Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 4186 EE N + +N+F+D+++++F K+ + + ++ P 4187Sbjct: 52 EEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKGYKKGPRPAAVFTS-------TDAAP 104 4188 4189Query: 120 TA--FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 4190 + DWRT+GAVTPVK+QGQCGSCW+FSTTG +EGQHF+ +LVSLSEQ LVDC 4191Sbjct: 105 ESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC---- 160 4192 4193Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 4194 G ++GCNGG A Y+ NGG+ TESSYPY A T C FNS IGA + 4195Sbjct: 161 ---AGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNT-CRFNSNTIGATCTG 216 4196 4197Query: 238 FTMIPK-NETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGY 293 4198 + I + +E+ + GP+++A DA +Q Y GV+ P C+ + LDH +L VGY 4199Sbjct: 217 YVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHAVLAVGY 276 4200 4201Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 4202 ++ +W+VKNSW WGE GYI + R + N CG++ + 4203Sbjct: 277 GSEG-----GQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYPTV 322 4204 4205 4206>sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 4207 Length = 321 4208 4209 Score = 392 bits (997), Expect = e-109 4210 Identities = 126/348 (36%), Positives = 185/348 (52%), Gaps = 32/348 (9%) 4211 4212Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59 4213 MKV LF+ + S + F+ ++ +KY +E L R +F+ N I 4214Sbjct: 1 MKVAALFLCGLALATASPS---------WDHFKTQYGRKYGDAKEELYRQRVFQQNEQLI 51 4215 4216Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 4217 E+ N N + K +N+F D++++EF K+ + + 4218Sbjct: 52 EDFNKKFENGEVTFKVAMNQFGDMTNEEFNAVMKGYKKG----SRGEPKAVFTAEAGPMA 107 4219 4220Query: 120 
TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 4221 DWRT+ VTPVK+Q QCGSCW+FS TG +EGQHF+ ++LVSLSEQ LVDC 4222Sbjct: 108 ADVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCST---- 163 4223 4224Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 4225 + ++GC GG +A++YI NGGI TESSYPY AE C F++ +IGA + 4226Sbjct: 164 ----DYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSV 218 4227 4228Query: 240 MIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDI-PCNPNSLDHGILIVGYSAK 296 4229 + E + + GP+++A DA +QFY GV+ C+P LDHG+L VGY + 4230Sbjct: 219 EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTE 278 4231 4232Query: 297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 343 4233 +T YW+VKNSWG+ WG+ GYI + R + N CG+++ S + 4234Sbjct: 279 ST-----KDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321 4235 4236 4237>sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 4238 Length = 331 4239 4240 Score = 392 bits (996), Expect = e-109 4241 Identities = 112/342 (32%), Positives = 172/342 (49%), Gaps = 21/342 (6%) 4242 4243Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELN 63 4244 L+ VL V + V+ P + ++ + K+Y E R I++ NL + N 4245Sbjct: 4 LVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHN 63 4246 4247Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 4248 L G+N D++S+E + + + + +P + D 4249Sbjct: 64 LEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVPSQWQRNITYKSNPNRI---LPDSVD 120 4250 4251Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 4252 WR +G VT VK QG CG+CW+FS G +E Q + KLV+LS QNLVDC E 4253Sbjct: 121 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTE------- 173 4254 4255Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP- 242 4256 + ++GCNGG A+ YII N GI +++SYPY A +C ++S A S +T +P 4257Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMD-QKCQYDSKYRAATCSKYTELPY 232 4258 4259Query: 243 KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 300 4260 E V+ + + GP+++ DA + Y GV+ P +++HG+L+VGY N 
4261Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLN--- 289 4262 4263Query: 301 RKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 4264 YW+VKNSWG ++GE+GYI + R K N CG+++F S 4265Sbjct: 290 --GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329 4266 4267 4268>sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 4269 Length = 333 4270 4271 Score = 391 bits (995), Expect = e-109 4272 Identities = 112/347 (32%), Positives = 175/347 (50%), Gaps = 20/347 (5%) 4273 4274Query: 3 VILLFVLAVFTVFVSSRGI--PPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 4275 +I + LA+ + V S P ++ E++ K K Y+ E + +++ N IE 4276Sbjct: 1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIE 60 4277 4278Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPT 120 4279 N + + D +N F DL++ EF + D +P 4280Sbjct: 61 LHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTG----FQRQKIKKTHIFQDHQFLYVPK 116 4281 4282Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 4283 DWR G VTPVKNQG C S W+FS TG++EGQ F +L+ LSEQNL+DC + + 4284Sbjct: 117 RVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTH 176 4285 4286Query: 181 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM 240 4287 GC+GG A+ Y+ NGG+ TE SYPY + G +C +++ N A + +F 4288Sbjct: 177 --------GCSGGFMQYAFQYVKDNGGLATEESYPYRGQ-GRECRYHAENSAANVRDFVQ 227 4289 4290Query: 241 IPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSAKN 297 4291 IP +E + + GP+++A DA +QFY G++ P C L+H +L+VGY + 4292Sbjct: 228 IPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFE- 286 4293 4294Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 4295 +W+VKNSWG +WG +GY+ L + N CG++ + + I+ 4296Sbjct: 287 GEESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333 4297 4298 4299>sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 4300 Length = 371 4301 4302 Score = 391 bits (993), Expect = e-108 4303 Identities = 141/368 (38%), Positives = 201/368 (54%), Gaps = 34/368 (9%) 4304 4305Query: 1 MKVILLFVLAVFTVFVSSRGIPPEE-------------------QSQFLEFQDKFNKKYS 
41 4306 M +L +L++ + + + E+ +S FL F +F K Y 4307Sbjct: 1 MAHRVLLLLSLASAAAVAAAVDAEDPLIRQVVPGGDDNDLELNAESHFLSFVQRFGKSYK 60 4308 4309Query: 42 H-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN---NKE 97 4310 +E+ R +FK NL + L+ + GV KF+DL+ EF+ YL ++ 4311Sbjct: 61 DADEHAYRLSVFKDNLRRARRHQLLDP----SAEHGVTKFSDLTPAEFRRTYLGLRKSRR 116 4312 4313Query: 98 AIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 157 4314 A+ + A + +P FDWR GAV PVKNQG CGSCWSFS +G +EG H++ 4315Sbjct: 117 ALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYL 176 4316 4317Query: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYT 217 4318 + KL LSEQ VDCDHEC + ++CD GCNGGL A++Y+ K GG+++E YPYT 4319Sbjct: 177 ATGKLEVLSEQQFVDCDHEC-DSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYT 235 4320 4321Query: 218 AETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDI 277 4322 G +C F+ + I A + NF+++ +E ++ ++ GPLAI +A Q YIGGV 4323Sbjct: 236 GSDG-KCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCP 294 4324 4325Query: 278 PCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRG---KNTC 332 4326 LDHG+L+VGY A I K+ PYWI+KNSWG +WGE GY + RG +N C 4327Sbjct: 295 YICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKC 354 4328 4329Query: 333 GVSNFVST 340 4330 GV + VST 4331Sbjct: 355 GVDSMVST 362 4332 4333 4334>sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE) 4335 (SULFHYDRYL-ENDOPEPTIDASE) (SH-EP) 4336 Length = 362 4337 4338 Score = 389 bits (989), Expect = e-108 4339 Identities = 131/360 (36%), Positives = 187/360 (51%), Gaps = 36/360 (10%) 4340 4341Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQ---------FLEFQDKFNKKYSHEEYLERFEI 51 4342 MK +L VL++ V + E+ + ++ S E +RF + 4343Sbjct: 3 MKKLLWVVLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHHTVSRSLGEKHKRFNV 62 4344 4345Query: 52 FKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAI---FTDDLPVAD 108 4346 FK+N+ + N + K +NKFAD+++ EF++ Y +K F + 4347Sbjct: 63 FKANVMHVHNTNKMDKP----YKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSG 118 4348 4349Query: 109 
YLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 168 4350 E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I NKLVSLSEQ 4351Sbjct: 119 TFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQ 178 4352 4353Query: 169 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNS 228 4354 LVDCD E ++GCNGGL +A+ +I + GGI TES+YPYTA+ GT 4355Sbjct: 179 ELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKV 229 4356 4357Query: 229 ANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDH 286 4358 ++ I +P N+ V+ P+++A DA ++QFY GVF C L+H 4359Sbjct: 230 NDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDC-NTDLNH 288 4360 4361Query: 287 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVSTSI 342 4362 G+ IVGY YWIV+NSWG +WGEQGYI ++R + CG++ S I 4363Sbjct: 289 GVAIVGYGTTV----DGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPI 344 4364 4365 4366>sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 4367 Length = 462 4368 4369 Score = 387 bits (984), Expect = e-107 4370 Identities = 120/330 (36%), Positives = 167/330 (50%), Gaps = 29/330 (8%) 4371 4372Query: 22 PPEEQSQFLEFQDKFNKKYSHE---EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 78 4373 E S + + K K S E RFEIFK NL ++E N + + G+ 4374Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNL----SYRLGLT 98 4375 4376Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQ 138 4377 +FADL++DE+++ YL K + + + + +P + DWR +GAV VK+QG 4378Sbjct: 99 RFADLTNDEYRSKYLGAKME-KKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGG 157 4379 4380Query: 139 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 198 4381 CGSCW+FST G VEG + I L++LSEQ LVDCD + +EGCNGGL A 4382Sbjct: 158 CGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCD---------TSYNEGCNGGLMDYA 208 4383 4384Query: 199 YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 4385 + +IIKNGGI T+ YPY GT I ++ +P V+ P+ 4386Sbjct: 209 FEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPI 268 4387 4388Query: 259 AIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGAD 316 4389 +IA +A +Q Y 
G+FD C LDHG++ VGY +N YWIV+NSWG 4390Sbjct: 269 SIAIEAGGRAFQLYDSGIFDGSCG-TQLDHGVVAVGYGTEN-----GKDYWIVRNSWGKS 322 4391 4392Query: 317 WGEQGYIYLRRG----KNTCGVSNFVSTSI 342 4393 WGE GY+ + R CG++ S I 4394Sbjct: 323 WGESGYLRMARNIASSSGKCGIAIEPSYPI 352 4395 4396 4397>sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 4398 Length = 329 4399 4400 Score = 384 bits (976), Expect = e-106 4401 Identities = 115/341 (33%), Positives = 178/341 (51%), Gaps = 25/341 (7%) 4402 4403Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65 4404 L V + + S + PEE +Q+ ++ K+Y+ + + + R I++ NL +I NL 4405Sbjct: 4 LKVLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLE 63 4406 4407Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 4408 A + +N D++S+E + Y E+ +P + D+R 4409Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTP-EWEGRVPDSIDYR 122 4410 4411Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 4412 +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E 4413Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE--------- 173 4414 4415Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KN 244 4416 + GC GG A+ Y+ +NGGI +E ++PY + C +N+ AK + IP N 4417Sbjct: 174 -NYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQD-ESCMYNATAKAAKCRGYREIPVGN 231 4418 4419Query: 245 ETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFR 301 4420 E + + GP++++ DA +QFY GV +D C+ ++++H +L+VGY + 4421Sbjct: 232 EKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-----Q 286 4422 4423Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTS 341 4424 K +WI+KNSWG WG +GY L R KN CG++N S 4425Sbjct: 287 KGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMASFP 327 4426 4427 4428>sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 4429 Length = 335 4430 4431 Score = 383 bits (974), Expect = e-106 4432 Identities = 119/341 (34%), Positives = 167/341 (48%), Gaps = 28/341 (8%) 4433 4434Query: 8 VLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAI 67 4435 +L V + + E+ F + K K YS EEY R + F SN KI 
N 4436Sbjct: 14 LLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHN---- 69 4437 4438Query: 68 NHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTR 127 4439 N K +N+F+D+S E K+ YL ++ ++YL P + DWR + 4440Sbjct: 70 NGNHTFKMALNQFSDMSFAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPPSVDWRKK 125 4441 4442Query: 128 GA-VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEAC 186 4443 G V+PVKNQG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + Y 4444Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNY------ 179 4445 4446Query: 187 DEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KNE 245 4447 GC GGL A+ YI+ N GI E +YPY + G C F + + I +E 4448Sbjct: 180 --GCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGY-CKFQPGKAIGFVKDVANITIYDE 236 4449 4450Query: 246 TVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCN---PNSLDHGILIVGYSAKNTIFR 301 4451 M + P++ A + ++ Y G++ P+ ++H +L VGY KN 4452Sbjct: 237 EAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKN---- 292 4453 4454Query: 302 KNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 4455 +PYWIVKNSWG WG GY + RGKN CG++ S I 4456Sbjct: 293 -GIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 4457 4458 4459>sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 4460 Length = 335 4461 4462 Score = 382 bits (971), Expect = e-106 4463 Identities = 117/336 (34%), Positives = 165/336 (48%), Gaps = 28/336 (8%) 4464 4465Query: 13 TVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKAD 72 4466 S+ + E+ F + + KKYS EEY R ++F SN KI N 4467Sbjct: 19 ACGASNLAVSSFEKLHFKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHNA----GNHT 74 4468 4469Query: 73 TKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA-VT 131 4470 K G+N+F+D+S DE ++ YL ++ +YL P + DWR +G V+ 4471Sbjct: 75 FKLGLNQFSDMSFDEIRHKYLWSEPQ--NCSATKGNYLRGT--GPYPPSMDWRKKGNFVS 130 4472 4473Query: 132 PVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCN 191 4474 PVKNQG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + GC 4475Sbjct: 131 PVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFN--------NHGCQ 182 4476 4477Query: 192 
GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAG 250 4478 GGL A+ YI N GI E +YPY + C F A + + I N E M 4479Sbjct: 183 GGLPSQAFEYIRYNKGIMGEDTYPYKGQDD-HCKFQPDKAIAFVKDVANITMNDEEAMVE 241 4480 4481Query: 251 YIVSTGPLAIAADA-VEWQFYIGGVFDIPCN---PNSLDHGILIVGYSAKNTIFRKNMPY 306 4482 + P++ A + ++ Y G++ P+ ++H +L VGY +N +PY 4483Sbjct: 242 AVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEN-----GIPY 296 4484 4485Query: 307 WIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 4486 WIVKNSWG WG GY + RGKN CG++ S I 4487Sbjct: 297 WIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332 4488 4489 4490>sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE EP-C1) 4491 Length = 362 4492 4493 Score = 380 bits (966), Expect = e-105 4494 Identities = 128/354 (36%), Positives = 182/354 (51%), Gaps = 32/354 (9%) 4495 4496Query: 3 VILLFVLAVFTVFVSSRGIP--PEEQSQF---LEFQDKFNKKYSHEEYLERFEIFKSNLG 57 4497 V+L F L + E+S + ++ S E +RF +FK+NL 4498Sbjct: 9 VVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANLM 68 4499 4500Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDDLPVADYLDDEF 114 4501 + N + K +NKFAD+++ EF++ Y +K +F E 4502Sbjct: 69 HVHNTNKMDKP----YKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEK 124 4503 4504Query: 115 INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174 4505 + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I NKLV+LSEQ LVDCD 4506Sbjct: 125 VVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCD 184 4507 4508Query: 175 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAK 234 4509 E ++GCNGGL +A+ +I + GGI TES+YPY A+ GT ++ 4510Sbjct: 185 KE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVS 235 4511 4512Query: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVG 292 4513 I +P N+ V+ P+++A DA ++QFY GVF C L+HG+ IVG 4514Sbjct: 236 IDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDC-STDLNHGVAIVG 294 4515 4516Query: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVSTSI 342 4517 Y YWIV+NSWG +WGE GYI ++R + CG++ S I 4518Sbjct: 295 
YGTTV----DGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344 4519 4520 4521>sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 4522 Length = 333 4523 4524 Score = 377 bits (958), Expect = e-104 4525 Identities = 117/324 (36%), Positives = 167/324 (51%), Gaps = 28/324 (8%) 4526 4527Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 4528 E+ F + + K YS EY R ++F +N KI+ N K G+N+F+D+S 4529Sbjct: 29 EKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRN----HTFKMGLNQFSDMS 84 4530 4531Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAV-TPVKNQGQCGSCW 143 4532 E K+ YL ++ ++YL P++ DWR +G V +PVKNQG CGSCW 4533Sbjct: 85 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPSSMDWRKKGNVVSPVKNQGACGSCW 140 4534 4535Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 4536 +FSTTG +E I+ K+++L+EQ LVDC + GC GGL A+ YI+ 4537Sbjct: 141 TFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFN--------NHGCQGGLPSQAFEYIL 192 4538 4539Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAA 262 4540 N GI E SYPY + G QC FN A + N I N E M + P++ A 4541Sbjct: 193 YNKGIMGEDSYPYIGKNG-QCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251 4542 4543Query: 263 DAV-EWQFYIGGVFDIPCN---PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 4544 + ++ Y GV+ P+ ++H +L VGY +N + YWIVKNSWG++WG 4545Sbjct: 252 EVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVKNSWGSNWG 306 4546 4547Query: 319 EQGYIYLRRGKNTCGVSNFVSTSI 342 4548 GY + RGKN CG++ S I 4549Sbjct: 307 NNGYFLIERGKNMCGLAACASYPI 330 4550 4551 4552>sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 4553 Length = 328 4554 4555 Score = 376 bits (955), Expect = e-104 4556 Identities = 115/329 (34%), Positives = 167/329 (49%), Gaps = 32/329 (9%) 4557 4558Query: 29 FLEFQDKFNKKYSH-----EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADL 83 4559 +L + + K S+ + ERF IFK NL I+ N N A K G+ FA+L 4560Sbjct: 4 YLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNE--NNKNATYKLGLTIFANL 61 4561 4562Query: 84 SSDEFKNYYLNNK----EAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQC 139 4563 ++DE+++ YL + 
I Y ++ +P DWR +GAV +K+QG C 4564Sbjct: 62 TNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTC 121 4565 4566Query: 140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 199 4567 GSCW+FST VEG + I +LVSLSEQ LVDCD + ++GCNGGL A+ 4568Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDK---------SYNQGCNGGLMDYAF 172 4569 4570Query: 200 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259 4571 +I+KNGG+ TE YPY G + + I + +P + VS P++ 4572Sbjct: 173 QFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232 4573 4574Query: 260 IAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 4575 +A DA +Q Y G+F C ++DH ++ VGY ++N + YWIV+NSWG W 4576Sbjct: 233 VAIDAGGRAFQHYQSGIFTGKCG-TNMDHAVVAVGYGSENGV-----DYWIVRNSWGTRW 286 4577 4578Query: 318 GEQGYIYLRRG----KNTCGVSNFVSTSI 342 4579 GE GYI + R CG++ S + 4580Sbjct: 287 GEDGYIRMERNVASKSGKCGIAIEASYPV 315 4581 4582 4583>sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 4584 Length = 362 4585 4586 Score = 375 bits (954), Expect = e-104 4587 Identities = 119/374 (31%), Positives = 179/374 (47%), Gaps = 54/374 (14%) 4588 4589Query: 3 VILLFVLAVFTVFVSSRGIPPEEQ---------------------------SQFLEFQDK 35 4590 ++ L VLA V V+S + +F F + 4591Sbjct: 8 LLALAVLATAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVR 67 4592 4593Query: 36 FNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN 94 4594 + K Y S E RF IF +L ++ N + ++ G+N+F+D+S +EF+ L 4595Sbjct: 68 YGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWEEFQATRLG 123 4596 4597Query: 95 NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQ 154 4598 A T +A ++P DWR G V+PVKNQ CGSCW+FSTTG +E 4599Sbjct: 124 ---AAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFSTTGALEAA 180 4600 4601Query: 155 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY 214 4602 + + K +SLSEQ LVDC + GCNGGL A+ YI NGGI TE SY 4603Sbjct: 181 YTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYIKYNGGIDTEESY 232 4604 4605Query: 215 PYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADAVE-WQFYIG 
272 4606 PY G C++ + N ++ + I N E + + P+++A ++ ++ Y 4607Sbjct: 233 PYKGVNGV-CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKS 291 4608 4609Query: 273 GVFD-IPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK 329 4610 GV+ C P+ ++H +L VGY +N + PYW++KNSWGADWG+ GY + GK 4611Sbjct: 292 GVYTSDHCGTTPDDVNHAVLAVGYGVENGV-----PYWLIKNSWGADWGDNGYFKMEMGK 346 4612 4613Query: 330 NTCGVSNFVSTSII 343 4614 N C ++ S ++ 4615Sbjct: 347 NMCAIATCASYPVV 360 4616 4617 4618>sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 4619 Length = 360 4620 4621 Score = 374 bits (951), Expect = e-103 4622 Identities = 114/322 (35%), Positives = 168/322 (51%), Gaps = 26/322 (8%) 4623 4624Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 4625 +F F ++ K Y S E +RF IF +L + N + + G+N+FAD+S + 4626Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGL----SYRLGINRFADMSWE 113 4627 4628Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 4629 EF+ L + ++ ++P DWR G V+PVKNQG CGSCW+FS 4630Sbjct: 114 EFRATRLGAAQNCSAT--LTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFS 171 4631 4632Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 4633 TTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI NG 4634Sbjct: 172 TTGALEAAYTQATGKPISLSEQQLVDCGFAFNNF--------GCNGGLPSQAFEYIKYNG 223 4635 4636Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADAV 265 4637 G+ TE SYPY G C F + N+G K+ + I E + + P+++A + + 4638Sbjct: 224 GLDTEESYPYQGVNGI-CKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVI 282 4639 4640Query: 266 E-WQFYIGGVFD-IPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321 4641 ++ Y GV+ C P ++H +L VGY ++ + PYW++KNSWGADWG++G 4642Sbjct: 283 TGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGV-----PYWLIKNSWGADWGDEG 337 4643 4644Query: 322 YIYLRRGKNTCGVSNFVSTSII 343 4645 Y + GKN CGV+ S I+ 4646Sbjct: 338 YFKMEMGKNMCGVATCASYPIV 359 4647 4648 4649>sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 4650 Length = 471 4651 4652 Score = 374 bits (951), Expect = e-103 4653 Identities = 112/304 (36%), 
Positives = 159/304 (51%), Gaps = 23/304 (7%) 4654 4655Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD 103 4656 E+ RF +F NL ++ N A + G+N+FADL+++EF+ +L K A 4657Sbjct: 69 EHERRFLVFWDNLKFVDAHNARADEGGG-FRLGMNRFADLTNEEFRATFLGAKVAER--S 125 4658 4659Query: 104 LPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 163 4660 + + + +P + DWR +GAV PVKNQGQCGSCW+FS VE + + +++ 4661Sbjct: 126 RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMI 185 4662 4663Query: 164 SLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ 223 4664 +LSEQ LV+C + GCNGGL +A+++IIKNGGI TE YPY A G 4665Sbjct: 186 TLSEQELVECST--------NGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKC 237 4666 4667Query: 224 CNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNP 281 4668 I F +P+N+ V+ P+++A +A E+Q Y GVF C 4669Sbjct: 238 DINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCG- 296 4670 4671Query: 282 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN----TCGVSNF 337 4672 SLDHG++ VGY N YWIV+NSWG WGE GY+ + R N CG++ 4673Sbjct: 297 TSLDHGVVAVGYGTDN-----GKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMM 351 4674 4675Query: 338 VSTS 341 4676 S 4677Sbjct: 352 ASYP 355 4678 4679 4680>sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 4681 Length = 356 4682 4683 Score = 373 bits (948), Expect = e-103 4684 Identities = 123/321 (38%), Positives = 169/321 (52%), Gaps = 28/321 (8%) 4685 4686Query: 29 FLEFQDKFNKKYSHEEYL-ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 4687 F F + K+Y E + +RFEIF NL I N + K G+N+F DL+ DE 4688Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGL----SYKLGINEFTDLTWDE 112 4689 4690Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 4691 F+ + L A L + +P DWR G V+PVK QG+CGSCW+FST 4692Sbjct: 113 FRKHKLG---ASQNCSATTKGNLKLTNVV-LPETKDWRKDGIVSPVKAQGKCGSCWTFST 168 4693 4694Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 4695 TG +E + + K +SLSEQ LVDC + GCNGGL A+ YI NGG 4696Sbjct: 169 
TGALEAAYAQAFGKGISLSEQQLVDCAGAFNNF--------GCNGGLPSQAFEYIKFNGG 220 4697 4698Query: 208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAADAVE 266 4699 + TE +YPYT + G C F+ ANIG K+ + I E + + P+++A + V+ 4700Sbjct: 221 LDTEEAYPYTGKNGI-CKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVK 279 4701 4702Query: 267 -WQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGY 322 4703 ++ Y GV+ C P ++H +L VGY +N PYW++KNSWGADWGE GY 4704Sbjct: 280 GFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVEN-----GTPYWLIKNSWGADWGEDGY 334 4705 4706Query: 323 IYLRRGKNTCGVSNFVSTSII 343 4707 + GKN CGV+ S I+ 4708Sbjct: 335 FKMEMGKNMCGVATCASYPIV 355 4709 4710 4711>sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 4712 Length = 360 4713 4714 Score = 373 bits (947), Expect = e-103 4715 Identities = 127/352 (36%), Positives = 181/352 (51%), Gaps = 30/352 (8%) 4716 4717Query: 3 VILLFVLAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 4718 V L F+ ++ + + + E+ + + +++ +E RF +FK N+ I 4719Sbjct: 12 VALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKENVKFIH 71 4720 4721Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADY---LDDEFINS 117 4722 E N A K +NKF D+++ EF++ Y +K + E + S 4723Sbjct: 72 EFNQK---KDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGS 128 4724 4725Query: 118 IPT-AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 4726 +P + DWR +GAVT VK+QGQCGSCW+FST +VEG + I +LVSLSEQ LVDCD 4727Sbjct: 129 LPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCD-- 186 4728 4729Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 4730 + +EGCNGGL A+ +I KN GI TE SYPY + GT + + I 4731Sbjct: 187 -------TSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSID 238 4732 4733Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYS 294 4734 +P N V+ P++++ +A +QFY GVF C LDHG+ IVGY 4735Sbjct: 239 GHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCG-TELDHGVAIVGYG 297 4736 4737Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVSTSI 342 4738 A R YWIVKNSWG +WGE GYI ++RG + CG++ S I 
4739Sbjct: 298 AT----RDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345 4740 4741 4742>sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA) 4743 Length = 333 4744 4745 Score = 372 bits (944), Expect = e-103 4746 Identities = 114/324 (35%), Positives = 164/324 (50%), Gaps = 28/324 (8%) 4747 4748Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 4749 E+ F + + K YS EY R ++F +N KI+ N K +N+F+D+S 4750Sbjct: 29 EKFHFKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHNQRN----HTFKMALNQFSDMS 84 4751 4752Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAV-TPVKNQGQCGSCW 143 4753 E K+ +L ++ ++YL P++ DWR +G V +PVKNQG C SCW 4754Sbjct: 85 FAEIKHKFLWSEPQ--NCSATKSNYLRGT--GPYPSSMDWRKKGNVVSPVKNQGACASCW 140 4755 4756Query: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203 4757 +FSTTG +E I+ K++SL+EQ LVDC + GC GGL A+ YI+ 4758Sbjct: 141 TFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFN--------NHGCKGGLPSQAFEYIL 192 4759 4760Query: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN-ETVMAGYIVSTGPLAIAA 262 4761 N GI E SYPY + + C FN A + N I N E M + P++ A 4762Sbjct: 193 YNKGIMEEDSYPYIGKD-SSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251 4763 4764Query: 263 DAV-EWQFYIGGVFDIPCN---PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 4765 + ++ Y GV+ P+ ++H +L VGY +N + YWIVKNSWG+ WG 4766Sbjct: 252 EVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVKNSWGSQWG 306 4767 4768Query: 319 EQGYIYLRRGKNTCGVSNFVSTSI 342 4769 E GY + RGKN CG++ S I 4770Sbjct: 307 ENGYFLIERGKNMCGLAACASYPI 330 4771 4772 4773>sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 4774 Length = 362 4775 4776 Score = 370 bits (940), Expect = e-102 4777 Identities = 110/322 (34%), Positives = 164/322 (50%), Gaps = 27/322 (8%) 4778 4779Query: 28 QFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 4780 +F F + K+Y E RF IF +L + N + ++ G+N+FAD+S + 4781Sbjct: 61 RFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYR----LGINRFADMSWE 116 4782 4783Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 4784 EF+ 
L A +A ++P DWR G V+PVK+QG CGSCW FS 4785Sbjct: 117 EFQASRLG---AAQNCSATLAGNHRMRDAPALPETKDWREDGIVSPVKDQGHCGSCWPFS 173 4786 4787Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 4788 TTG++E ++ + VSLSEQ L DC + GC+GGL A+ YI NG 4789Sbjct: 174 TTGSLEARYTQATGPPVSLSEQQLADCATRYNNF--------GCSGGLPSQAFEYIKYNG 225 4790 4791Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KNETVMAGYIVSTGPLAIAADAV 265 4792 G+ TE +YPYT G C++ N G K+ + I E + + P+++A + 4793Sbjct: 226 GLDTEEAYPYTGVNGI-CHYKPENAGVKVLDSVNITLVAEDELKNAVGLVRPVSVAFQVI 284 4794 4795Query: 266 E-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321 4796 ++ Y GV+ +P ++H +L VGY +N + PYW++KNSWGADWG+ G 4797Sbjct: 285 NGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGV-----PYWLIKNSWGADWGDNG 339 4798 4799Query: 322 YIYLRRGKNTCGVSNFVSTSII 343 4800 Y + GKN CG++ S I+ 4801Sbjct: 340 YFTMEMGKNMCGIATCASYPIV 361 4802 4803 4804>sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 4805 Length = 380 4806 4807 Score = 368 bits (936), Expect = e-102 4808 Identities = 116/352 (32%), Positives = 180/352 (50%), Gaps = 29/352 (8%) 4809 4810Query: 1 MKVILLFVLAVFTVFVSSRGIPP----EEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSN 55 4811 M ++ L + ++ +++ + E ++ + + K+ K Y+ E+ RFEIFK 4812Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69 4813 4814Query: 56 LGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI 115 4815 L I+E N + K G+N+FADL+ +EF++ YL ++ V++ + F 4816Sbjct: 70 LRFIDEHNA---DTNRSYKVGLNQFADLTDEEFRSTYLGFTSG--SNKTKVSNRYEPRFG 124 4817 4818Query: 116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 4819 +P+ DWR+ GAV +K+QG+CG CW+FS VEG + I L+SLSEQ L+DC 4820Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC-- 182 4821 4822Query: 176 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI 235 4823 G GCNGG + + +II NGGI TE +YPYTA+ G I 4824Sbjct: 183 ------GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTI 236 4825 4826Query: 236 
SNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGY 293 4827 + +P N V+ P+++A DA ++ Y G+F PC ++DH + IVGY 4828Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCG-TAIDHAVTIVGY 295 4829 4830Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK---NTCGVSNFVSTSI 342 4831 + + YWIVKNSW WGE+GY+ + R TCG++ S + 4832Sbjct: 296 GTEG-----GIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342 4833 4834 4835>sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 4836 Length = 323 4837 4838 Score = 367 bits (932), Expect = e-101 4839 Identities = 131/342 (38%), Positives = 180/342 (52%), Gaps = 26/342 (7%) 4840 4841Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 4842 +LF L V+ V S+ P + + F EF +FNK YS E E L RF+IF+ NL +I 4843Sbjct: 4 ILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI---- 59 4844 4845Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 4846 I N K+ +NKF+DLS DE Y T + LD P FD 4847Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPTQTQNFCKVILLDQPPGKG-PLEFD 117 4848 4849Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 4850 WR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ ++DCD 4851Sbjct: 118 WRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCD--------- 168 4852 4853Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN-FTMIP 242 4854 D GCNGGL A+ IIK GG+Q ES YPY A C NS ++ + + I 4855Sbjct: 169 -FVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYII 226 4856 4857Query: 243 KNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 4858 E + + GP+ +A DA + Y G+ C + L+H +L+VGY +N 4859Sbjct: 227 VYEEKLKDLLPLVGPIPMAIDAADIVNYKQGIIKY-CFDSGLNHAVLLVGYGVEN----- 280 4860 4861Query: 303 NMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 343 4862 N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I 4863Sbjct: 281 NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 4864 4865 4866>sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 4867 Length = 330 4868 4869 Score = 365 bits (926), Expect = e-100 4870 Identities = 104/330 (31%), Positives = 161/330 
(48%), Gaps = 22/330 (6%) 4871 4872Query: 18 SRGIPPEEQSQFLEFQD-KFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76 4873 + P + ++ + + E R I++ NL I NL G 4874Sbjct: 15 ATAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVG 74 4875 4876Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQ 136 4877 +N D++ +E Y + + + ++ +P + DWR +G VT VK Q 4878Sbjct: 75 MNHMGDMTPEEVIGYMGSLRIPRPWNRSGTLKSSSNQT---LPDSVDWREKGCVTNVKYQ 131 4879 4880Query: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 196 4881 G CGSCW+FS G +EGQ + KLVSLS QNLVDC E E+ ++GC GG 4882Sbjct: 132 GSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE------EKYGNKGCGGGFMT 185 4883 4884Query: 197 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIP-KNETVMAGYIVST 255 4885 A+ YII + I +E+SYPY A +C ++ N A S + +P +E + + + 4886Sbjct: 186 EAFQYII-DTSIDSEASYPYKAMD-EKCLYDPKNRAATCSRYIELPFGDEEALKEAVATK 243 4887 4888Query: 256 GPLAIAADA---VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 312 4889 GP+++ D + Y GV+D P +++HG+L+VGY YW+VKNS 4890Sbjct: 244 GPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYGT-----LDGKDYWLVKNS 298 4891 4892Query: 313 WGADWGEQGYIYLRR-GKNTCGVSNFVSTS 341 4893 WG +G+QGYI + R KN CG++++ S 4894Sbjct: 299 WGLHFGDQGYIRMARNNKNHCGIASYCSYP 328 4895 4896 4897>sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 4898 Length = 450 4899 4900 Score = 365 bits (926), Expect = e-100 4901 Identities = 136/346 (39%), Positives = 190/346 (54%), Gaps = 26/346 (7%) 4902 4903Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEE 61 4904 V+L + +V + S + + +F F+ K+ K Y +E RF F+ N+ + + 4905Sbjct: 15 VLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK- 73 4906 4907Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTA 121 4908 I FGV F+D++ +EF+ Y N + ++ P A 4909Sbjct: 74 ---IQAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTVNVT-TGRAPAA 129 4910 4911Query: 122 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYE 181 4912 DWR +GAVTPVK QGQCGSCW+FST GN+EGQ ++ N LVSLSEQ LV CD 4913Sbjct: 130 
VDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDT------ 183 4914 4915Query: 182 GEEACDEGCNGGLQPNAYNYIIK-NGG-IQTESSYPYTAETGT--QCNFNSANIGAKISN 237 4916 D GCNGGL NA+N+I+ NGG + TE+SYPY + G QC N IGA I++ 4917Sbjct: 184 ----IDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITD 239 4918 4919Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 297 4920 +P++E +A Y+ GPLAIA DA + Y GG+ C LDHG+L+VGY+ 4921Sbjct: 240 HVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGIL-TSCTSKQLDHGVLLVGYN--- 295 4922 4923Query: 298 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 4924 N PYWI+KNSW WGE GYI + +G N C ++ VS++++ 4925Sbjct: 296 --DNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339 4926 4927 4928>sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 4929 Length = 323 4930 4931 Score = 363 bits (923), Expect = e-100 4932 Identities = 129/342 (37%), Positives = 178/342 (51%), Gaps = 26/342 (7%) 4933 4934Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63 4935 +LF L V+ V S+ + + F EF +FNK Y E E L RF+IF+ NL +I 4936Sbjct: 4 ILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI---- 59 4937 4938Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 4939 I N K+ +NKF+DLS DE Y I T + LD P FD 4940Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQPPGKG-PLEFD 117 4941 4942Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 4943 WR VT VKNQG CG+CW+F+T ++E Q I N+L++LSEQ ++DCD 4944Sbjct: 118 WRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCD--------- 168 4945 4946Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN-FTMIP 242 4947 D GCNGGL A+ IIK GG+Q ES YPY A C NS ++ + + I 4948Sbjct: 169 -FVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIT 226 4949 4950Query: 243 KNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 4951 E + + GP+ +A DA + Y G+ C + L+H +L+VGY +N 4952Sbjct: 227 VYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIKY-CFNSGLNHAVLLVGYGVEN----- 280 4953 4954Query: 303 
NMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 343 4955 N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I 4956Sbjct: 281 NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322 4957 4958 4959>sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 4960 Length = 324 4961 4962 Score = 363 bits (922), Expect = e-100 4963 Identities = 128/343 (37%), Positives = 187/343 (54%), Gaps = 25/343 (7%) 4964 4965Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 4966 M I+L++L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I 4967Sbjct: 1 MNKIVLYLLVYGAVQCAAYDV-LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59 4968 4969Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 4970 N ++ + ++ +NKFADLS DE + Y + T + LD P 4971Sbjct: 60 INKN----HNDSTAQYEINKFADLSKDETISKYTGLSLPLQTQNFCEVVVLDRPPDKG-P 114 4972 4973Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 4974 FDWR VT VKNQG CG+CW+F+T G++E Q I N+ ++LSEQ L+DCD 4975Sbjct: 115 LEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCD----- 169 4976 4977Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN-F 238 4978 D GC+GGL A+ ++ GGIQ ES YPY A G C N+A K+ + 4979Sbjct: 170 -----FVDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNG-DCRANAAKFVVKVKKCY 223 4980 4981Query: 239 TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNT 298 4982 I E + + S GP+ +A DA + Y G+ C + L+H +L+VGY+ +N 4983Sbjct: 224 RYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKRGIMKY-CANHGLNHAVLLVGYAVENG 282 4984 4985Query: 299 IFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 4986 + P+WI+KN+WGADWGEQGY +++ N CG+ N + +S 4987Sbjct: 283 V-----PFWILKNTWGADWGEQGYFRVQQNINACGIQNELPSS 320 4988 4989 4990>sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 4991 Length = 373 4992 4993 Score = 361 bits (918), Expect = e-100 4994 Identities = 121/358 (33%), Positives = 170/358 (46%), Gaps = 38/358 (10%) 4995 4996Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQ---------FLEFQDKFNKKYSHEEYLERFEIFKSN 55 4997 + VLAV V + S IP E++ + +Q + H E RF FKSN 4998Sbjct: 14 
VAAVLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSN 72 4999 5000Query: 56 LGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI 115 5001 I N + +N+F D+ EF+ ++ + P + 5002Sbjct: 73 AHFIHSHNKR---GDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAAL 129 5003 5004Query: 116 N--SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 173 5005 N +P + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L+DC 5006Sbjct: 130 NVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDC 189 5007 5008Query: 174 DHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ---CNFNSAN 230 5009 D A ++GC GGL NA+ YI NGG+ TE++YPY A GT ++ 5010Sbjct: 190 D---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSP 240 5011 5012Query: 231 IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGI 288 5013 + I +P N V+ P+++A +A + FY GVF C LDHG+ 5014Sbjct: 241 VVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECG-TELDHGV 299 5015 5016Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK----NTCGVSNFVSTSI 342 5017 +VGY + YW VKNSWG WGEQGYI + + CG++ S + 5018Sbjct: 300 AVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 353 5019 5020 5021>sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 5022 Length = 371 5023 5024 Score = 361 bits (917), Expect = 1e-99 5025 Identities = 121/358 (33%), Positives = 170/358 (46%), Gaps = 38/358 (10%) 5026 5027Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQ---------FLEFQDKFNKKYSHEEYLERFEIFKSN 55 5028 + VLAV V + S IP E++ + +Q + H E RF FKSN 5029Sbjct: 14 VAAVLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSN 72 5030 5031Query: 56 LGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI 115 5032 I N + +N+F D+ EF+ ++ + P + 5033Sbjct: 73 AHFIHSHNKR---GDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPAKPPSVPGFMYAAL 129 5034 5035Query: 116 N--SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 173 5036 N +P + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L+DC 5037Sbjct: 130 NVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDC 189 5038 5039Query: 174 
DHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ---CNFNSAN 230 5040 D A ++GC GGL NA+ YI NGG+ TE++YPY A GT ++ 5041Sbjct: 190 D---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSP 240 5042 5043Query: 231 IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGI 288 5044 + I +P N V+ P+++A +A + FY GVF C LDHG+ 5045Sbjct: 241 VVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCG-TELDHGV 299 5046 5047Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK----NTCGVSNFVSTSI 342 5048 +VGY + YW VKNSWG WGEQGYI + + CG++ S + 5049Sbjct: 300 AVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 353 5050 5051 5052>sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 5053 Length = 319 5054 5055 Score = 361 bits (916), Expect = 2e-99 5056 Identities = 128/326 (39%), Positives = 190/326 (58%), Gaps = 22/326 (6%) 5057 5058Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 5059 +P ++++F+ K+ K+Y E RF IFKSN+ K + L + + +GV + 5060Sbjct: 12 LPGNVDEKYVQFKLKYRKQYHETEDEIRFNIFKSNILKAQ---LYQVFVRGSAIYGVTPY 68 5061 5062Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140 5063 +DL++DEF +L + + L E +N+IP FDWR +GAVT VKNQG CG 5064Sbjct: 69 SDLTTDEFARTHLTASWVVPSSRSNTPTSLGKE-VNNIPKNFDWREKGAVTEVKNQGMCG 127 5065 5066Query: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 200 5067 SCW+FSTTGNVE Q F KL+SLSEQ LVDCD D+GCNGGL NAY 5068Sbjct: 128 SCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDG----------LDDGCNGGLPSNAYE 177 5069 5070Query: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAI 260 5071 IIK GG+ E +YPY A+ +C+ + + I++ + ++ET +A ++ +++ 5072Sbjct: 178 SIIKMGGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISV 236 5073 5074Query: 261 AADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 5075 +A+ QFY G+ + I C+ LDH +L+VGY + KN P+WIVKNSWG +W 5076Sbjct: 237 GMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG----VSEKNEPFWIVKNSWGVEW 292 5077 5078Query: 318 GEQGYIYLRRGKNTCGVSNFVSTSII 343 5079 GE GY + RG +CG++ ++++I 5080Sbjct: 293 
GENGYFRMYRGDGSCGINTVATSAMI 318 5081 5082 5083>sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEINASE) (CRUZAINE) 5084 Length = 467 5085 5086 Score = 356 bits (904), Expect = 4e-98 5087 Identities = 133/350 (38%), Positives = 184/350 (52%), Gaps = 30/350 (8%) 5088 5089Query: 3 VILLFVLAVFTVFV----SSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLG 57 5090 ++L VL V V +S SQF EF+ K + Y S E R +F+ NL 5091Sbjct: 8 LLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF 67 5092 5093Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117 5094 + FGV F+DL+ +EF++ Y +N A F A + 5095Sbjct: 68 LARLHAAANPH----ATFGVTPFSDLTREEFRSRY-HNGAAHFAAAQERARVPVKVEVVG 122 5096 5097Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 5098 P A DWR RGAVT VK+QGQCGSCW+FS GNVE Q F++ + L +LSEQ LV CD 5099Sbjct: 123 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDK-- 180 5100 5101Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGA 233 5102 D GC+GGL NA+ +I++ NG + TE SYPY + G C + +GA 5103Sbjct: 181 --------TDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGA 232 5104 5105Query: 234 KISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGY 293 5106 I+ +P++E +A ++ GP+A+A DA W Y GGV C LDHG+L+VGY 5107Sbjct: 233 TITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM-TSCVSEQLDHGVLLVGY 291 5108 5109Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5110 + + PYWI+KNSW WGE+GYI + +G N C V S++++ 5111Sbjct: 292 NDSAAV-----PYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336 5112 5113 5114>sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 5115 Length = 324 5116 5117 Score = 351 bits (890), Expect = 2e-96 5118 Identities = 122/343 (35%), Positives = 181/343 (52%), Gaps = 25/343 (7%) 5119 5120Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 5121 M I+L +L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I 5122Sbjct: 1 MNKIMLCLLVCGVVHAATYDL-LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEI 59 5123 5124Query: 60 
EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 5125 N + + ++ +NKF+DLS +E + Y T + LD + P 5126Sbjct: 60 INKNQ----NDSTAQYEINKFSDLSKEEAISKYTGLSLPHQTQNFCEVVILDRPP-DRGP 114 5127 5128Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 5129 FDWR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ +DCD 5130Sbjct: 115 LEFDWRQFNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---- 170 5131 5132Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI-SNF 238 5133 + GC+GGL A+ ++ GG+Q ES YPY G QC N + S 5134Sbjct: 171 ------VNAGCDGGLLHTAFESAMEMGGVQMESDYPYETANG-QCRINPNRFVVGVRSCR 223 5135 5136Query: 239 TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNT 298 5137 I E + + + GP+ +A DA + Y G+ C + L+H +L+VGY+ +N 5138Sbjct: 224 RYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMR-QCANHGLNHAVLLVGYAVEN- 281 5139 5140Query: 299 IFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 5141 N+PYWI+KN+WG DWGE GY +++ N CG+ N + +S 5142Sbjct: 282 ----NIPYWILKNTWGTDWGEDGYFRVQQNINACGIRNELVSS 320 5143 5144 5145>sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 5146 Length = 443 5147 5148 Score = 348 bits (883), Expect = 1e-95 5149 Identities = 123/348 (35%), Positives = 184/348 (52%), Gaps = 28/348 (8%) 5150 5151Query: 4 ILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEEL 62 5152 ++ VLA + + + F EF+ + + Y E +R F+ NL + E 5153Sbjct: 13 VVCVVLAAACAPARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREH 72 5154 5155Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDE--FINSIPT 120 5156 + +FG+ KF DLS EF YLN A + ++++P 5157Sbjct: 73 QARNPH----AQFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD 128 5158 5159Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 5160 A DWR +GAVTPVK+QG CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD 5161Sbjct: 129 AVDWREKGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----- 183 5162 5163Query: 181 EGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETG---TQCNFNSANIGAKI 235 5164 ++GC+GGL A++++++ NG + TE SYPY + G N + +GA+I 5165Sbjct: 184 
-----MNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQI 238 5166 5167Query: 236 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 295 5168 +I +E MA ++ GP+AIA DA + Y GV C L+HG+L+VGY 5169Sbjct: 239 DGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDM 297 5170 5171Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5172 + PYW++KNSWG DWGEQGY+ + G N C +S + ++ + 5173Sbjct: 298 TGEV-----PYWVIKNSWGGDWGEQGYVRVVMGVNACLLSEYPVSAHV 340 5174 5175 5176>sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 5177 Length = 354 5178 5179 Score = 348 bits (883), Expect = 1e-95 5180 Identities = 143/355 (40%), Positives = 190/355 (53%), Gaps = 36/355 (10%) 5181 5182Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 5183 LLF + V +FV G PP + + + F+ + K + + E RF F 5184Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 5185 5186Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 5187 K N+ LN + D KFADL+ EF YLN D D 5188Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYARHLKNHKEDVHVD 123 5189 5190Query: 113 EFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 172 5191 + S + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ LV 5192Sbjct: 124 DSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVS 183 5193 5194Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNS 228 5195 CD DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ C+ + 5196Sbjct: 184 CD----------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-DE 232 5197 5198Query: 229 ANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGI 288 5199 +GAKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV C SL+HG+ 5200Sbjct: 233 GEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS-LCLAWSLNHGV 291 5201 5202Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5203 LIVG++ PYWIVKNSWG+ WGE+GYI L G N C + N+ ++ + 5204Sbjct: 292 LIVGFN-----KNAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSATV 341 5205 5206 5207>sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 
PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE 5208 A-2) 5209 Length = 444 5210 5211 Score = 347 bits (882), Expect = 2e-95 5212 Identities = 124/349 (35%), Positives = 187/349 (53%), Gaps = 29/349 (8%) 5213 5214Query: 4 ILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEEL 62 5215 ++ VLA + + + F EF+ + + Y E +R F+ NL + E 5216Sbjct: 13 VVCVVLAAACAPARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREH 72 5217 5218Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDE--FINSIPT 120 5219 + +FG+ KF DLS EF YLN A + ++++P 5220Sbjct: 73 QARNPH----AQFGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD 128 5221 5222Query: 121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 180 5223 A DWR +GAVTPVK+QG CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD 5224Sbjct: 129 AVDWREKGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----- 183 5225 5226Query: 181 EGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETG--TQCNFNSAN--IGAK 234 5227 ++GC+GGL A++++++ NG + TE SYPY + G +C+ +S +GA+ 5228Sbjct: 184 -----MNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQ 238 5229 5230Query: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYS 294 5231 I +I +E MA ++ GP+AIA DA + Y GV C L+HG+L+VGY 5232Sbjct: 239 IDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYD 297 5233 5234Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5235 + PYW++KNSWG DWGEQGY+ + G N C +S + ++ + 5236Sbjct: 298 MTGEV-----PYWVIKNSWGGDWGEQGYVRVVMGVNACLLSEYPVSAHV 341 5237 5238 5239>sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE 5240 A-1) 5241 Length = 354 5242 5243 Score = 347 bits (880), Expect = 3e-95 5244 Identities = 143/355 (40%), Positives = 190/355 (53%), Gaps = 36/355 (10%) 5245 5246Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52 5247 LLF + V +FV G PP + + + F+ + K + + E RF F 5248Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66 5249 5250Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112 5251 K N+ 
LN + D KFADL+ EF YLN D D 5252Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYARHLKDHKEDVHVD 123 5253 5254Query: 113 EFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 172 5255 + S + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ LV 5256Sbjct: 124 DSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVS 183 5257 5258Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNS 228 5259 CD DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+ C+ + 5260Sbjct: 184 CD----------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCH-DE 232 5261 5262Query: 229 ANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGI 288 5263 +GAKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV C SL+HG+ 5264Sbjct: 233 GEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVS-LCLAWSLNHGV 291 5265 5266Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5267 LIVG++ PYWIVKNSWG+ WGE+GYI L G N C + N+ ++ + 5268Sbjct: 292 LIVGFN-----KNAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSATV 341 5269 5270 5271>sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPAYA PEPTIDASE B) (GLYCYL 5272 ENDOPEPTIDASE) 5273 Length = 348 5274 5275 Score = 346 bits (879), Expect = 3e-95 5276 Identities = 111/321 (34%), Positives = 163/321 (50%), Gaps = 29/321 (9%) 5277 5278Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 5279 F + K NK Y + +E L RFEIFK NL I+E N + + G+N+F+DLS+DE 5280Sbjct: 48 FNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYW----LGLNEFSDLSNDE 103 5281 5282Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 5283 FK Y+ + +T+ ++++++ ++ +P + DWR +GAVTPVK+QG C SCW+FST 5284Sbjct: 104 FKEKYVGSLPEDYTNQPYDEEFVNEDIVD-LPESVDWRAKGAVTPVKHQGYCESCWAFST 162 5285 5286Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 5287 VEG + I LV LSEQ LVDCD + GCN G Q + Y+ +N G 5288Sbjct: 163 VATVEGINKIKTGNLVELSEQELVDCDKQ----------SYGCNRGYQSTSLQYVAQN-G 211 5289 5290Query: 208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV-- 265 5291 I + YPY A+ T K + + N ++ 
P+++ ++ 5292Sbjct: 212 IHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGR 271 5293 5294Query: 266 EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 5295 ++Q Y GG+F+ C +DH + VGY Y ++KNSWG WGE GYI + 5296Sbjct: 272 DFQNYKGGIFEGSCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGPGWGENGYIRI 325 5297 5298Query: 326 RRGKNT----CGVSNFVSTSI 342 5299 RR CGV I 5300Sbjct: 326 RRASGNSPGVCGVYRSSYYPI 346 5301 5302 5303>sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA) (PAPAYA PROTEINASE 5304 III) (PPIII) (PAPAYA PEPTIDASE A) 5305 Length = 348 5306 5307 Score = 341 bits (865), Expect = 1e-93 5308 Identities = 114/317 (35%), Positives = 162/317 (50%), Gaps = 26/317 (8%) 5309 5310Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 5311 F + NK Y + +E L RFEIFK NL I+E N ++ G+N+FADLS+DE 5312Sbjct: 48 FNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYW----LGLNEFADLSNDE 103 5313 5314Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 5315 F Y+ + + ++++++ +N +P DWR +GAVTPV++QG CGSCW+FS 5316Sbjct: 104 FNEKYVGSLIDATIEQSYDEEFINEDTVN-LPENVDWRKKGAVTPVRHQGSCGSCWAFSA 162 5317 5318Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 5319 VEG + I KLV LSEQ LVDC+ GC GG P A Y+ KN G 5320Sbjct: 163 VATVEGINKIRTGKLVELSEQELVDCERR----------SHGCKGGYPPYALEYVAKN-G 211 5321 5322Query: 208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--V 265 5323 I S YPY A+ GT K S + N ++ P+++ ++ 5324Sbjct: 212 IHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGR 271 5325 5326Query: 266 EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 5327 +Q Y GG+F+ PC +DH + VGY Y ++KNSWG WGE+GYI + 5328Sbjct: 272 PFQLYKGGIFEGPCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGTAWGEKGYIRI 325 5329 5330Query: 326 RRG-KNTCGVSNFVSTS 341 5331 +R N+ GV +S 5332Sbjct: 326 KRAPGNSPGVCGLYKSS 342 5333 5334 5335>sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II) (PPII) 5336 Length = 352 5337 5338 Score = 339 bits (860), Expect = 6e-93 
5339 Identities = 119/321 (37%), Positives = 160/321 (49%), Gaps = 29/321 (9%) 5340 5341Query: 29 FLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 5342 F + K NK Y S +E + RFEIF+ NL I+E N ++ G+N FADLS+DE 5343Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYW----LGLNGFADLSNDE 103 5344 5345Query: 88 FKNYYLNNKEAIFTDDLP-VADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFS 146 5346 FK Y+ FT + + + + P + DWR +GAVTPVKNQG CGSCW+FS 5347Sbjct: 104 FKKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFS 163 5348 5349Query: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206 5350 T VEG + I L+ LSEQ LVDCD GC GG Q + Y+ N 5351Sbjct: 164 TIATVEGINKIVTGNLLELSEQELVDCDKH----------SYGCKGGYQTTSLQYVANN- 212 5352 5353Query: 207 GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA-- 264 5354 G+ T YPY A+ + KI+ + +P N ++ PL++ +A 5355Sbjct: 213 GVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGG 272 5356 5357Query: 265 VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 324 5358 +Q Y GVFD PC LDH + VGY + Y I+KNSWG +WGE+GY+ 5359Sbjct: 273 KPFQLYKSGVFDGPCG-TKLDHAVTAVGYGTSD-----GKNYIIIKNSWGPNWGEKGYMR 326 5360 5361Query: 325 LRR----GKNTCGVSNFVSTS 341 5362 L+R + TCGV 5363Sbjct: 327 LKRQSGNSQGTCGVYKSSYYP 347 5364 5365 5366>sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 5367 Length = 345 5368 5369 Score = 334 bits (848), Expect = 1e-91 5370 Identities = 107/321 (33%), Positives = 152/321 (47%), Gaps = 32/321 (9%) 5371 5372Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87 5373 F + K NK Y + +E + RFEIFK NL I+E N ++ G+N FAD+S+DE 5374Sbjct: 48 FESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYW----LGLNVFADMSNDE 103 5375 5376Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 5377 FK Y + +T + + ++ +IP DWR +GAVTPVKNQG CGSCW+FS 5378Sbjct: 104 FKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSA 163 5379 5380Query: 148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 207 5381 +EG I L 
SEQ L+DCD GCNGG +A + + G 5382Sbjct: 164 VVTIEGIIKIRTGNLNEYSEQELLDCDRR----------SYGCNGGYPWSALQLVAQ-YG 212 5383 5384Query: 208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV-- 265 5385 I ++YPY + AK + Y ++ P+++ +A 5386Sbjct: 213 IHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGK 272 5387 5388Query: 266 EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 5389 ++Q Y GG+F PC N +DH + VGY Y ++KNSWG WGE GYI + 5390Sbjct: 273 DFQLYRGGIFVGPCG-NKVDHAVAAVGYG---------PNYILIKNSWGTGWGENGYIRI 322 5391 5392Query: 326 RRGKNT----CGVSNFVSTSI 342 5393 +RG CG+ + 5394Sbjct: 323 KRGTGNSYGVCGLYTSSFYPV 343 5395 5396 5397>sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 5398 Length = 379 5399 5400 Score = 331 bits (840), Expect = 1e-90 5401 Identities = 109/334 (32%), Positives = 169/334 (49%), Gaps = 33/334 (9%) 5402 5403Query: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82 5404 + S F ++ + + Y +HEE +R EIFK+N I ++N + G+NKFAD 5405Sbjct: 39 QVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNKFAD 97 5406 5407Query: 83 LSSDEFKNYYLNNKEAIFTDDLPVADYLDDE--FINSIPTAFDWRTRGAVTPVKNQGQCG 140 5408 ++ EF YL + + + E + P ++DWR +G +T VK QG CG 5409Sbjct: 98 ITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCG 157 5410 5411Query: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 200 5412 W+FS TG +E H I+ LVSLSEQ LVDC E EG G Q ++ 5413Sbjct: 158 RGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE----------SEGSYNGWQYQSFE 207 5414 5415Query: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNF-TMIPKNE------TVMAGYIV 253 5416 +++++GGI T+ YPY A+ G +C N I + T+I +E + 5417Sbjct: 208 WVLEHGGIATDDDYPYRAKEG-RCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAI 266 5418 5419Query: 254 STGPLAIAADAVEWQFYIGGVFDIP--CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKN 311 5420 P++++ DA ++ Y GG++D +P ++H +L+VGY + + + YWI KN 5421Sbjct: 267 LEQPISVSIDAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGV-----DYWIAKN 321 5422 5423Query: 312 SWGADWGEQGYIYLRRGKNT----CGVSNFVSTS 341 5424 SWG DWGE GYI+++R CG++ F S 5425Sbjct: 
322 SWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYP 355 5426 5427 5428>sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 5429 Length = 395 5430 5431 Score = 327 bits (829), Expect = 2e-89 5432 Identities = 99/325 (30%), Positives = 153/325 (46%), Gaps = 23/325 (7%) 5433 5434Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84 5435 ++++ ++ K Y +E R IF+SN E +N +N ADL+ 5436Sbjct: 87 LETEWKDYVTALGKHYDQKENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLT 146 5437 5438Query: 85 SDEF--KNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSC 142 5439 +EF +N + +++ + +P DWRT+GAVTPV+NQG+CGSC 5440Sbjct: 147 DEEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQVDWRTKGAVTPVRNQGECGSC 206 5441 5442Query: 143 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 202 5443 ++F+T +E H +L+ LS QN+VDC + GC+GG P A+ Y 5444Sbjct: 207 YAFATAAALEAYHKQMTGRLLDLSPQNIVDC--------TRNLGNNGCSGGYMPTAFQY- 257 5445 5446Query: 203 IKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMI-PKNETVMAGYIVSTGPLAIA 261 5447 GI ES YPY +C + + + F I P +E + + GP+ + 5448Sbjct: 258 ASRYGIAMESRYPYVGTE-QRCRWQQSIAVVTDNGFNEIQPGDELALKHAVAKRGPVVVG 316 5449 5450Query: 262 A--DAVEWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 318 5451 ++FY GV+ + C DH +L VGY + YWIVKNSWG DWG 5452Sbjct: 317 ISGSKRSFRFYKDGVYSEGNCG--RPDHAVLAVGYGTHPSY----GDYWIVKNSWGTDWG 370 5453 5454Query: 319 EQGYIYLRRGK-NTCGVSNFVSTSI 342 5455 + GY+Y+ R + N C +++ S I 5456Sbjct: 371 KDGYVYMARNRGNMCHIASAASFPI 395 5457 5458 5459>sp|Q10991|CATL_SHEEP CATHEPSIN L 5460 Length = 217 5461 5462 Score = 325 bits (824), Expect = 1e-88 5463 Identities = 104/230 (45%), Positives = 140/230 (60%), Gaps = 17/230 (7%) 5464 5465Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 5466 +P + DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVD 5467Sbjct: 1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD----- 55 5468 5469Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 5470 ++GCNGGL NA+ YI +NGG+ +E SYPY A T T CN+ AK + 5471Sbjct: 56 
---SSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEA-TDTSCNYKPEYSAAKDTG 111 5472 5473Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYS 294 5474 F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY 5475Sbjct: 112 FVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG 171 5476 5477Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 5478 + T N +WIVKNSWG +WG +GY+ + + N CG++ S + 5479Sbjct: 172 FEGT----NNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217 5480 5481 5482>sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 5483 Length = 442 5484 5485 Score = 323 bits (819), Expect = 4e-88 5486 Identities = 115/308 (37%), Positives = 166/308 (53%), Gaps = 23/308 (7%) 5487 5488Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59 5489 M+V+ L +L V + + ++ F + + YS EE+ R++IFKSN+ + 5490Sbjct: 1 MRVLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYV 60 5491 5492Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 5493 + N + +T G+N FAD+++ E++ YL F + + F P 5494Sbjct: 61 HQWN----SKGGETVLGLNVFADITNQEYRTTYLGTP---FDGSALIGTEEEKIFSTPAP 113 5495 5496Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN---KLVSLSEQNLVDCDHE 176 5497 DWR +GAVTP+KNQGQCG CWSFSTTG+ EG HFI+ LVSLSEQNL+DC 5498Sbjct: 114 -TVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSK- 171 5499 5500Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 5501 + GC GGL + YII N GI TESSYPYTAE G +C F ++NIGA+I 5502Sbjct: 172 -------SYGNNGCEGGLMTLGFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIGAQIV 224 5503 5504Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGY 293 5505 ++ + + P+++A DA +Q Y G++ P C P LDHG+L+VGY 5506Sbjct: 225 SYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACTPTQLDHGVLVVGY 284 5507 5508Query: 294 SAKNTIFR 301 5509 + ++ 5510Sbjct: 285 GSGSSSSS 292 5511 5512 5513 Score = 70.2 bits (169), Expect = 6e-12 5514 Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 1/46 (2%) 5515 5516Query: 297 
NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 5517 + + YWIVKNSWG WG GYI++ + + N CG++ S 5518Sbjct: 392 GAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFP 437 5519 5520 5521>sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 5522 Length = 371 5523 5524 Score = 319 bits (810), Expect = 4e-87 5525 Identities = 108/340 (31%), Positives = 162/340 (46%), Gaps = 32/340 (9%) 5526 5527Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80 5528 P E + F FQ +FN+ Y + EY R IF NL + + L + +FG F 5529Sbjct: 33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDL---GTAEFGETPF 89 5530 5531Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRT-RGAVTPVKNQGQC 139 5532 +DL+ +EF Y + T ++ + + S+P DWR + ++ VKNQG C 5533Sbjct: 90 SDLTEEEFGQLYGQERSPERTPNMTK-KVESNTWGESVPRTCDWRKAKNIISSVKNQGSC 148 5534 5535Query: 140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 199 5536 CW+ + N++ I + V +S Q L+DC E C GCNGG +AY 5537Sbjct: 149 KCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC----------ERCGNGCNGGFVWDAY 198 5538 5539Query: 200 NYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 5540 ++ N G+ +E YP+ + +C A I +FTM+ NE +A Y+ GP+ 5541Sbjct: 199 LTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPI 258 5542 5543Query: 259 AIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSAKNTIFRKNM----------- 304 5544 + + Q Y GV C+P +DH +L+VG+ K + 5545Sbjct: 259 TVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHS 318 5546 5547Query: 305 -PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5548 PYWI+KNSWGA WGE+GY L RG NTCGV+ + T+ + 5549Sbjct: 319 SPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358 5550 5551 5552>sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 5553 Length = 376 5554 5555 Score = 318 bits (807), Expect = 9e-87 5556 Identities = 114/368 (30%), Positives = 177/368 (47%), Gaps = 44/368 (11%) 5557 5558Query: 6 LFVLAVFTVFVSSRGI---------PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSN 55 5559 L L V + RG P E + F FQ +FN+ Y S EE+ R +IF N 5560Sbjct: 10 
LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 69 5561 5562Query: 56 LGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI 115 5563 L + + L + +FGV F+DL+ +EF Y + A + + +E 5564Sbjct: 70 LAQAQRLQEEDL---GTAEFGVTPFSDLTEEEFGQLYGYRRAAGGVPSMG-REIRSEEPE 125 5565 5566Query: 116 NSIPTAFDWRT-RGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174 5567 S+P + DWR GA++P+K+Q C CW+ + GN+E IS V +S L+DC 5568Sbjct: 126 ESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCG 185 5569 5570Query: 175 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGA 233 5571 C +GC+GG +A+ ++ N G+ +E YP+ + +C+ A 5572Sbjct: 186 R----------CGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQKVA 235 5573 5574Query: 234 KISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFD---IPCNPNSLDHGILI 290 5575 I +F M+ NE +A Y+ + GP+ + + Q Y GV C+P +DH +L+ 5576Sbjct: 236 WIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLL 295 5577 5578Query: 291 VGYSA--------KNTIFRKN-------MPYWIVKNSWGADWGEQGYIYLRRGKNTCGVS 335 5579 VG+ + T+ ++ PYWI+KNSWGA WGE+GY L RG NTCG++ 5580Sbjct: 296 VGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGIT 355 5581 5582Query: 336 NFVSTSII 343 5583 F T+ + 5584Sbjct: 356 KFPLTARV 363 5585 5586 5587>sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 5588 Length = 315 5589 5590 Score = 318 bits (807), Expect = 9e-87 5591 Identities = 102/330 (30%), Positives = 160/330 (47%), Gaps = 37/330 (11%) 5592 5593Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-K 79 5594 + F + K NK ++ E L R IF N ++ N I K V+ 5595Sbjct: 8 LAIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDGP 62 5596 5597Query: 80 FADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQC 139 5598 FA ++++E++ + + T++ YL+ + P + DWR G VTP+++Q QC 5599Sbjct: 63 FAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQA----PESVDWRKEGKVTPIRDQAQC 115 5600 5601Query: 140 GSCWSFSTTGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 196 5602 GSC++F + +EG+ I + + LSE+++V C + + GCNGGL 5603Sbjct: 116 
GSCYTFGSLAALEGRLLIEKGGDANTLDLSEEHMVQC--------TRDNGNNGCNGGLGS 167 5604 5605Query: 197 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTG 256 5606 N Y+YII++ G+ ES YPYT T C N + AKI+ +T +P+N +S G 5607Sbjct: 168 NVYDYIIEH-GVAKESDYPYTGSDST-CKTNVKSF-AKITGYTKVPRNNEAELKAALSQG 224 5608 5609Query: 257 PLAIAADAVE--WQFYIGGVF-DIPCNPNS--LDHGILIVGYSAKNTIFRKNMPYWIVKN 311 5610 + ++ DA +Q Y G + D C N L+H + VGY + WIV+N 5611Sbjct: 225 LVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRN 279 5612 5613Query: 312 SWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 5614 SWG WG++GYI + NTCGV+ 5615Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVATDPLYP 309 5616 5617 5618>sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 5619 Length = 310 5620 5621 Score = 315 bits (799), Expect = 8e-86 5622 Identities = 102/330 (30%), Positives = 158/330 (46%), Gaps = 32/330 (9%) 5623 5624Query: 18 SRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV 77 5625 + GI F + K NK ++ E L R IF N ++ N I K V 5626Sbjct: 1 AAGIRIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSV 55 5627 5628Query: 78 N-KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQ 136 5629 + FA ++++E++ + + T++ YL+ + P + DWR G VTP+++Q 5630Sbjct: 56 DGPFAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQA----PESVDWRKEGKVTPLRDQ 108 5631 5632Query: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 196 5633 QCGSC++F + +EG+ I + + N +D E M+ + + GCNGGL 5634Sbjct: 109 AQCGSCYTFGSLAALEGRLLIEKG-----GDANTLDLSEEHMQCT-RDNGNNGCNGGLGS 162 5635 5636Query: 197 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTG 256 5637 N Y+YII++ G+ ES YPYT T C N + KI+ +T +P+N +S G 5638Sbjct: 163 NVYDYIIEH-GVAKESDYPYTGSDST-CKTNVKSFR-KITGYTKVPRNNEAELKAALSQG 219 5639 5640Query: 257 PLAIAADAVE--WQFYIGGVF-DIPCNPNS--LDHGILIVGYSAKNTIFRKNMPYWIVKN 311 5641 L ++ D +Q Y G + D C N L+H + VGY + WIV+N 5642Sbjct: 220 LLDVSIDVSSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRN 274 5643 5644Query: 312 SWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 5645 SWG WG++GYI + NTCGV+ 5646Sbjct: 275 
SWGTSWGDKGYINMVIEGNTCGVATDPLYP 304 5647 5648 5649>sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE PROTEINASE ACP3) 5650 Length = 308 5651 5652 Score = 312 bits (790), Expect = 9e-85 5653 Identities = 102/322 (31%), Positives = 157/322 (48%), Gaps = 37/322 (11%) 5654 5655Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87 5656 F + NK ++ E L R IF N + E N K K V+ FA ++++E 5657Sbjct: 9 FNTWAANNNKHFTAVEALRRRAIFNMNARFVAEFNK-----KGSFKLSVDGPFAAMTNEE 63 5658 5659Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 5660 ++ + + ++ YL+ + P + DWR +G VTP+++Q QCGSC++F + 5661Sbjct: 64 YRTLLKSKRTV---EENGKVTYLNIQA----PESVDWRAQGKVTPIRDQAQCGSCYTFGS 116 5662 5663Query: 148 TGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 204 5664 +EG+ I + + LSE++LV C + + GCNGGL N Y+YII+ 5665Sbjct: 117 LAALEGRLLIEKGGNANTLDLSEEHLVQC--------TRDNGNNGCNGGLGSNVYDYIIQ 168 5666 5667Query: 205 NGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264 5668 N G+ ES YPYT T C N AKI+ + +P+N +S G + ++ DA 5669Sbjct: 169 N-GVAKESDYPYTGTDST-CKTN-VKAFAKITGYNKVPRNNEAELKAALSQGLVDVSIDA 225 5670 5671Query: 265 VE--WQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 5672 +Q Y G + D C N +L+H + VGY + WIV+NSWG WG+ 5673Sbjct: 226 SSAKFQLYKSGAYSDTKCKNNFFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGD 280 5674 5675Query: 320 QGYIYLRRGKNTCGVSNFVSTS 341 5676 +GYI + NTCGV+ 5677Sbjct: 281 KGYINMVIEGNTCGVATDPLYP 302 5678 5679 5680>sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 5681 Length = 308 5682 5683 Score = 309 bits (784), Expect = 5e-84 5684 Identities = 109/348 (31%), Positives = 156/348 (44%), Gaps = 53/348 (15%) 5685 5686Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59 5687 M ++LFV V+ F ++ NK +++ EYL RF +F N + 5688Sbjct: 1 MFALILFVSLACANEVA-----------FKQWAATHNKVFANRAEYLYRFAVFLDNKKFV 49 5689 5690Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119 5691 E A+ +N FAD++ +EF +L T ++P + + P 5692Sbjct: 50 
E----------ANANTELNVFADMTHEEFIQTHLG-----MTYEVPETTSNVKAAVKAAP 94 5693 5694Query: 120 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 179 5695 + DWR+ + P K+QGQCGSCW+F TT +EG+ KL S SEQ LVDCD 5696Sbjct: 95 ESVDWRS--IMNPAKDQGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCD----- 147 5697 5698Query: 180 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFT 239 5699 A D GC GG N+ +I +N G+ ES YPY A GT A ++ 5700Sbjct: 148 -----ASDNGCEGGHPSNSLKFIQENNGLGLESDYPYKAVAGTC---KKVKNVATVTGSR 199 5701 5702Query: 240 MIP-KNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVF--DIPCNPNSLDHGILIVGYS 294 5703 + +ET + I GP+A+ DA +Q Y G D C ++H + VGY 5704Sbjct: 200 RVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYG 259 5705 5706Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 5707 N YWI++NSWG WG+ GY L R N CG+ + 5708Sbjct: 260 -----SNSNGKYWIIRNSWGTSWGDAGYFLLARDSNNMCGIGRDSNYP 302 5709 5710 5711>sp|P25326|CATS_BOVIN CATHEPSIN S 5712 Length = 217 5713 5714 Score = 307 bits (778), Expect = 2e-83 5715 Identities = 91/228 (39%), Positives = 129/228 (55%), Gaps = 17/228 (7%) 5716 5717Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 5718 +P + DWR +G VT VK QG CGSCW+FS G +E Q + KLVSLS QNLVDC 5719Sbjct: 1 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST-- 58 5720 5721Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 5722 + ++GCNGG A+ YII N GI +E+SYPY A G +C ++ N A S 5723Sbjct: 59 -----AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG-KCQYDVKNRAATCSR 112 5724 5725Query: 238 FTMIP-KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYS 294 5726 + +P +E + + + GP+++ DA + Y GV+ P +++HG+L+VGY 5727Sbjct: 113 YIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG 172 5728 5729Query: 295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTS 341 5730 YW+VKNSWG +G+QGYI + R N CG++N+ S 5731Sbjct: 173 N-----LDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 215 5732 5733 5734>sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 5735 Length = 315 5736 5737 Score 
= 301 bits (763), Expect = 1e-81 5738 Identities = 103/322 (31%), Positives = 155/322 (47%), Gaps = 37/322 (11%) 5739 5740Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87 5741 F + NK ++ E L R IF N + E N K V+ FA ++++E 5742Sbjct: 16 FNTWVANNNKHFTAVESLRRRAIFNMNARIVAENNRKE-----TFKLSVDGPFAAMTNEE 70 5743 5744Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147 5745 + + + ++ YL+ + P A DWR +G VTP+++QG CGSC++F + 5746Sbjct: 71 YNSLLKLKRSG---EEKGEVRYLNIQA----PKAVDWRKKGKVTPIRDQGNCGSCYTFGS 123 5747 5748Query: 148 TGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 204 5749 +EG+ I + + + LSE+++V C E + GCNGGL N YNYI++ 5750Sbjct: 124 IAALEGRLLIEKGGDSETLDLSEEHMVQCTRE--------DGNNGCNGGLGSNVYNYIME 175 5751 5752Query: 205 NGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264 5753 N GI ES YPYT T C + AKI ++ + +N V +S G + ++ DA 5754Sbjct: 176 N-GIAKESDYPYTGSDST-CRSD-VKAFAKIKSYNRVARNNEVELKAAISQGLVDVSIDA 232 5755 5756Query: 265 VE--WQFYIGGVF-DIPCNPNS--LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 5757 +Q Y G + D C N L+H + VGY + WIV+NSWG WGE 5758Sbjct: 233 SSVQFQLYKSGAYTDTQCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGE 287 5759 5760Query: 320 QGYIYLRRGKNTCGVSNFVSTS 341 5761 +GYI + NTCGV+ 5762Sbjct: 288 KGYINMVIEGNTCGVATDPLYP 309 5763 5764 5765>sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR (TCP) 5766 Length = 569 5767 5768 Score = 299 bits (758), Expect = 5e-81 5769 Identities = 100/363 (27%), Positives = 161/363 (43%), Gaps = 62/363 (17%) 5770 5771Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 5772 S+F +F + NK Y + +E + +FEIFK N I+ N +N A K VN+F+D S 5773Sbjct: 223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK--LNKNAMYKKKVNQFSDYSE 280 5774 5775Query: 86 DEFKN--------------YYLNNKEAIFTDDLPVADYL------DDEFINSIPTAFDWR 125 5776 +E K Y E D++ ++++ + + + +P D+R 5777Sbjct: 281 EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340 5778 5779Query: 126 
TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 185 5780 +G V K+QG CGSCW+F++ GN+E ++S SEQ +VDC + 5781Sbjct: 341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--------- 391 5782 5783Query: 186 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNE 245 5784 + GC+GG ++ Y+++N + Y Y A+ C +S+ + E 5785Sbjct: 392 -NFGCDGGHPFYSFLYVLQN-ELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAV--KE 447 5786 5787Query: 246 TVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYS---------- 294 5788 + + GPL++ ++ Y GV++ C L+H +L+VGY 5789Sbjct: 448 NQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTC-SEELNHSVLLVGYGQVEKTKLNYN 506 5790 5791Query: 295 ----AKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKN----TCGVSNFVST 340 5792 NT N P YWI+KNSW WGE G++ L R KN CG+ V 5793Sbjct: 507 NKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY 566 5794 5795Query: 341 SII 343 5796 I+ 5797Sbjct: 567 PIL 569 5798 5799 5800>sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE PRECURSOR 5801 Length = 346 5802 5803 Score = 298 bits (756), Expect = 9e-81 5804 Identities = 88/243 (36%), Positives = 130/243 (53%), Gaps = 21/243 (8%) 5805 5806Query: 106 VADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL 165 5807 +D + +S+P + DWR +G + VK+QG CGSCW+FS +E + I L+SL 5808Sbjct: 6 KSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISL 65 5809 5810Query: 166 SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCN 225 5811 SEQ LVDCD + +EGC+GGL A+ ++IKNGGI TE YPY G 5812Sbjct: 66 SEQELVDCDR---------SYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQ 116 5813 5814Query: 226 FNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNS 283 5815 + KI ++ +P N V+ P++IA +A ++Q Y G+F C + 5816Sbjct: 117 YRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCG-TA 175 5817 5818Query: 284 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVS 339 5819 +DHG++I GY +N M YWIV+NSWGA+ E GY+ ++R CG++ S 5820Sbjct: 176 VDHGVVIAGYGTEN-----GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPS 230 5821 5822Query: 340 TSI 342 5823 + 5824Sbjct: 231 
YPV 233 5825 5826 5827>sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 5828 Length = 321 5829 5830 Score = 296 bits (750), Expect = 5e-80 5831 Identities = 98/297 (32%), Positives = 151/297 (49%), Gaps = 20/297 (6%) 5832 5833Query: 50 EIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADY 109 5834 F+ +L + LN + + + +G+N+F+ L +EFK YL +K + F A+ 5835Sbjct: 42 AAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPR--YSAEV 99 5836 5837Query: 110 LDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQN 169 5838 S+P FDWR + VT V+NQ CG CW+FS G VE + I L LS Q 5839Sbjct: 100 HMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQ 159 5840 5841Query: 170 LVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG-IQTESSYPYTAETGTQCNFNS 228 5842 ++DC + GCNGG NA N++ K + +S YP+ A+ G F+ 5843Sbjct: 160 VIDCS----------YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 209 5844 5845Query: 229 ANIGAKISNF--TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH 286 5846 ++ G I + E MA +++ GPL + DAV WQ Y+GG+ C+ +H 5847Sbjct: 210 SHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANH 269 5848 5849Query: 287 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 5850 +LI G+ + PYWIV+NSWG+ WG GY +++ G N CG+++ VS+ + 5851Sbjct: 270 AVLITGFDKTG-----STPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321 5852 5853 5854>sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 5855 Length = 506 5856 5857 Score = 291 bits (738), Expect = 1e-78 5858 Identities = 108/360 (30%), Positives = 167/360 (46%), Gaps = 58/360 (16%) 5859 5860Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85 5861 S+F ++ + NKKY + +E L+RFE FK K ++ N + + VN+++D S 5862Sbjct: 160 SKFFKYMKENNKKYENMDEQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSK 219 5863 5864Query: 86 DEFKNYYLNNKEAIFTDDL----PVADYLDDEFINSI-------PTAFDWRTRGAVTPVK 134 5865 +EF NY+ P+ +L + + S+ P + D+R++ P K 5866Sbjct: 220 EEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLPPK 279 5867 5868Query: 135 NQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSEQNLVDCDHECMEYEGEEACDEGCNGG 193 5869 +QG 
CGSCW+F+ GN E + +++++ +S SEQ +VDC E + GC+GG 5870Sbjct: 280 DQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE----------NYGCDGG 329 5871 5872Query: 194 LQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIV 253 5873 A+ Y+I N G+ YPY C ++ ++ + NE +M + 5874Sbjct: 330 NPFYAFLYMINN-GVCLGDEYPYKGHEDFFCLNYRCSLLGRVHFIGDVKPNELIM--ALN 386 5875 5876Query: 254 STGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFR----------- 301 5877 GP+ IA A ++ Y GGVFD CNP L+H +L+VGY 5878Sbjct: 387 YVGPVTIAVGASEDFVLYSGGVFDGECNP-ELNHSVLLVGYGQVKKSLAFEDSHSNVDSN 445 5879 5880Query: 302 ---------KNMP------YWIVKNSWGADWGEQGYIYLRRGK----NTCGVSNFVSTSI 342 5881 K YWIV+NSWG +WGE GYI ++R K CGV + V I 5882Sbjct: 446 LIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVGSDVFFPI 505 5883 5884 5885>sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 5886 Length = 583 5887 5888 Score = 278 bits (704), Expect = 1e-74 5889 Identities = 100/383 (26%), Positives = 164/383 (42%), Gaps = 68/383 (17%) 5890 5891Query: 11 VFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINH 69 5892 V + + + S+F F +K+ + Y E +E+++ FK N KI++ N 5893Sbjct: 219 VSVAQIEGLFVNLKYASKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHNETNQM- 277 5894 5895Query: 70 KADTKFGVNKFADLSSDEFKNYY---------LNNKEAIFTDDLPVADYLDDEFINS--- 117 5896 K VN+F+D S +F++Y+ L K + + + +S 5897Sbjct: 278 ---YKMKVNQFSDYSKKDFESYFRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGAN 334 5898 5899Query: 118 ----IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV-SLSEQNLVD 172 5900 +P D+R +G V K+QG CGSCW+F++ GNVE + NK + +LSEQ +VD 5901Sbjct: 335 LLADVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVD 394 5902 5903Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 232 5904 C + GC+GG ++ Y I+N GI Y Y A C 5905Sbjct: 395 CSK----------LNFGCDGGHPFYSFIYAIEN-GICMGDDYKYKAMDNLFCLNYRCKNK 443 5906 5907Query: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIV 291 5908 +S+ + +NE + + GP+++ ++ FY GG+F+ C L+H +L+V 5909Sbjct: 444 VTLSSVGGVKENE--LIRALNEVGPVSVNVGVTDDFSFYGGGIFNGTC-TEELNHSVLLV 500 5910 
5911Query: 292 GYS---------------AKNTIFRKN------------MPYWIVKNSWGADWGEQGYIY 324 5912 GY + + +K YWI+KNSW WGE G++ 5913Sbjct: 501 GYGQVQSSKIFQEKNAYDDASGVTKKGALSYPSKADDGIQYYWIIKNSWSKFWGENGFMR 560 5914 5915Query: 325 LRRGKN----TCGVSNFVSTSII 343 5916 + R K CG+ V I+ 5917Sbjct: 561 ISRNKEGDNVFCGIGVEVFYPIL 583 5918 5919 5920>sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR (DER F I) 5921 Length = 321 5922 5923 Score = 272 bits (688), Expect = 8e-73 5924 Identities = 110/350 (31%), Positives = 150/350 (42%), Gaps = 46/350 (13%) 5925 5926Query: 7 FVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLE-RFEIFKSNLGKIEELNL 64 5927 FVLA+ ++ V S P F EF+ FNK Y+ E E + F +L +E 5928Sbjct: 3 FVLAIASLLVLSTVYARPASIKTFEEFKKAFNKNYATVEEEEVARKNFLESLKYVEA--- 59 5929 5930Query: 65 IAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN------SI 118 5931 K +N +DLS DEFKN YL + EA + L L+ E ++ 5932Sbjct: 60 --------NKGAINHLSDLSLDEFKNRYLMSAEAF--EQLKTQFDLNAETSACRINSVNV 109 5933 5934Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 178 5935 P+ D R+ VTP++ QG CGSCW+FS E + +N + LSEQ LVDC 5936Sbjct: 110 PSELDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDC----- 164 5937 5938Query: 179 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNF 238 5939 A GC+G P YI +N G+ E SYPY A NS + G ISN+ 5940Sbjct: 165 ------ASQHGCHGDTIPRGIEYIQQN-GVVEERSYPYVAREQRCRRPNSQHYG--ISNY 215 5941 5942Query: 239 TMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGG-VFDIPCNPNSLDHGILIVG 292 5943 I + ++ AIA D +Q Y G + H + IVG 5944Sbjct: 216 CQIYPPDVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNYHAVNIVG 275 5945 5946Query: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 5947 Y + YWIV+NSW WG+ GY Y + G N + + I 5948Sbjct: 276 YG-----STQGDDYWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQYPYVVI 320 5949 5950 5951>sp|P80884|ANAN_ANACO ANANAIN 5952 Length = 216 5953 5954 Score = 271 bits (687), Expect = 1e-72 5955 Identities = 93/229 (40%), Positives = 122/229 (52%), Gaps = 22/229 (9%) 5956 5957Query: 118 
IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177 5958 +P + DWR GAVT VKNQG+CGSCW+F++ VE + I + LVSLSEQ ++DC 5959Sbjct: 1 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC---- 56 5960 5961Query: 178 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 237 5962 A GC GG AY++II N G+ + + YPY A GT C N A I+ 5963Sbjct: 57 -------AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGT-CKTNGVPNSAYITR 108 5964 5965Query: 238 FTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAK 296 5966 +T + +N Y VS P+A A DA +Q Y GVF PC L+H I+I+GY 5967Sbjct: 109 YTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCG-TRLNHAIVIIGYGQD 167 5968 5969Query: 297 NTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK----NTCGVSNFVSTS 341 5970 + +WIV+NSWGA WGE GYI L R CG++ 5971Sbjct: 168 SA----GKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGICGIAMDPLYP 212 5972 5973 5974>sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 5975 Length = 176 5976 5977 Score = 264 bits (669), Expect = 1e-70 5978 Identities = 86/183 (46%), Positives = 115/183 (61%), Gaps = 12/183 (6%) 5979 5980Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 178 5981 P + DWR +G VTPVK+QGQCGSCW+FSTTG +EGQHF ++ KLVSLSEQNLVDC 5982Sbjct: 2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCS---- 57 5983 5984Query: 179 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNF 238 5985 ++GCNGGL A+ Y+ NGGI +E SYPYTA+ C + + A + F 5986Sbjct: 58 ----RPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGF 113 5987 5988Query: 239 TMIPKN-ETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGYS 294 5989 IP+ E + + S GP+++A DA +QFY G++ P C+ LDHG+L+VGY 5990Sbjct: 114 VDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYG 173 5991 5992Query: 295 AKN 297 5993 + 5994Sbjct: 174 FEG 176 5995 5996 5997>sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 5998 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 5999 Length = 462 6000 6001 Score = 262 bits (662), Expect = 9e-70 6002 Identities = 86/317 (27%), Positives = 148/317 (46%), Gaps = 
37/317 (11%) 6003 6004Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102 6005 E Y ER + N ++ +N + K+ T ++ +S + +++ 6006Sbjct: 161 ERYSERL--YTHNHNFVKAINTVQ---KSWTATAYKEYEKMSLRDLIRRSGHSQRIPRPK 215 6007 6008Query: 103 DLPVADYLDDEFINSIPTAFDWRTR---GAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 159 6009 P+ D + + +N +P ++DWR V+PV+NQ CGSC+SF++ G +E + I 6010Sbjct: 216 PAPMTDEIQQQILN-LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILT 274 6011 6012Query: 160 NKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYT 217 6013 N + LS Q +V C +GC+GG ++ G+ ES +PYT 6014Sbjct: 275 NNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEESCFPYT 324 6015 6016Query: 218 AETGTQCNFNSANIGAKISNFTMIPK-----NETVMAGYIVSTGPLAIAADAVE-WQFYI 271 6017 A+ + C + S++ + NE +M +V GP+A+A + + + Y 6018Sbjct: 325 AKD-SPCKPRENCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYH 383 6019 6020Query: 272 GGVFDI-----PCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6021 G++ P NP L +H +L+VGY + YWI+KNSWG++WGE GY + 6022Sbjct: 384 SGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVT---GIEYWIIKNSWGSNWGESGYFRI 440 6023 6024Query: 326 RRGKNTCGVSNFVSTSI 342 6025 RRG + C + + +I 6026Sbjct: 441 RRGTDECAIESIAVAAI 457 6027 6028 6029>sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 6030 Length = 441 6031 6032 Score = 259 bits (655), Expect = 6e-69 6033 Identities = 105/343 (30%), Positives = 162/343 (46%), Gaps = 50/343 (14%) 6034 6035Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 6036 +F F +K+ K + S ++ ++RF F+ N ++ +NKF+DLS + 6037Sbjct: 119 EFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVKTHKPTEP-----YSLDLNKFSDLSDE 173 6038 6039Query: 87 EFKNYY--------------------LNNKEAIFTDDLPVADYLDDEFINSIP--TAFDW 124 6040 EFK Y +++K I+ L A +++ S+ +W 6041Sbjct: 174 EFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSLITGENLNW 233 6042 6043Query: 125 RTRGAVTPVKNQG-QCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 6044 AV+P K+QG CGSCW+FS+ +VE + + +NK LSEQ LV+CD M 6045Sbjct: 234 ARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCDKSSM----- 288 
6046 6047Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK 243 6048 GC GGL A YI + G+ ES PYT + C + N I + +++ 6049Sbjct: 289 -----GCAGGLPITALEYI-HSKGVSFESEVPYTGIV-SPCKPSIKN-KVFIDSISILKG 340 6050 6051Query: 244 NETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKN 303 6052 N+ V ++S + IA E + Y GG+F C L+H +L+VG + 6053Sbjct: 341 NDVVNKSLVISPTVVGIAVT-KELKLYSGGIFTGKCG-GELNHAVLLVGEGVDH---ETG 395 6054 6055Query: 304 MPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNFVSTSII 343 6056 M YWI+KNSWG DWGE G++ L+R G + CG+ F I+ 6057Sbjct: 396 MRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTFGLNPIL 438 6058 6059 6060>sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR (DER P I) 6061 Length = 320 6062 6063 Score = 255 bits (644), Expect = 1e-67 6064 Identities = 105/356 (29%), Positives = 153/356 (42%), Gaps = 49/356 (13%) 6065 6066Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 6067 MK++L + V +R P F E++ FNK Y+ E E + N +E 6068Sbjct: 1 MKIVLAIASLLALSAVYAR---PSSIKTFEEYKKAFNKSYATFEDE---EAARKN--FLE 52 6069 6070Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI----- 115 6071 + + N +N +DLS DEFKN +L + EA + L L+ E 6072Sbjct: 53 SVKYVQSNGG-----AINHLSDLSLDEFKNRFLMSAEAF--EHLKTQFDLNAETNACSIN 105 6073 6074Query: 116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 6075 + P D R VTP++ QG CGSCW+FS E + +N+ + L+EQ LVDC 6076Sbjct: 106 GNAPAEIDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNQSLDLAEQELVDC-- 163 6077 6078Query: 176 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKI 235 6079 A GC+G P YI NG +Q ES Y Y A + N+ G I 6080Sbjct: 164 ---------ASQHGCHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFG--I 211 6081 6082Query: 236 SNFTMI-PKNETVMAGYIV-STGPLAIAA---DAVEWQFYIGGVF---DIPCNPNSLDHG 287 6083 SN+ I P N + + + +A+ D ++ Y G D PN H 6084Sbjct: 212 SNYCQIYPPNVNKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNY--HA 269 6085 6086Query: 288 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 6087 + IVGYS + + YWIV+NSW +WG+ GY Y + + + I+ 
6088Sbjct: 270 VNIVGYSN-----AQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPYVVIL 320 6089 6090 6091>sp|P14518|BROM_ANACO BROMELAIN, STEM 6092 Length = 212 6093 6094 Score = 253 bits (640), Expect = 4e-67 6095 Identities = 80/230 (34%), Positives = 114/230 (48%), Gaps = 27/230 (11%) 6096 6097Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 6098 ++P + DWR GAVT VKNQ CG+CW+F+ VE + I + L LSEQ ++DC 6099Sbjct: 1 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAK- 59 6100 6101Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 6102 GC GG + A+ +II N G+ + + YPY A GT C + A I+ 6103Sbjct: 60 ----------GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTDGVPNSAYIT 108 6104 6105Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 295 6106 + +P+N Y VS P+ +A DA +Q+Y GVF+ PC SL+H + +GY 6107Sbjct: 109 GYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCG-TSLNHAVTAIGYGQ 167 6108 6109Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVSTS 341 6110 + I+ K WGA WGE GYI + R CG++ 6111Sbjct: 168 DSIIYPK---------KWGAKWGEAGYIRMARDVSSSSGICGIAIDPLYP 208 6112 6113 6114>sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 6115 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 6116 Length = 463 6117 6118 Score = 251 bits (634), Expect = 2e-66 6119 Identities = 86/318 (27%), Positives = 137/318 (43%), Gaps = 36/318 (11%) 6120 6121Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 6122 S E+Y R +K + ++ +N I + A T + L+ + + I 6123Sbjct: 159 SQEKYSNRL--YKYDHNFVKAINAIQKSWTATTYME---YETLTLGDMIRRSGGHSRKIP 213 6124 6125Query: 101 TDDLPVADYLDDEFINSIPTAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFI 157 6126 + I +PT++DWR V+PV+NQ CGSC+SF++ G +E + I 6127Sbjct: 214 RPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRI 273 6128 6129Query: 158 SQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYP 215 6130 N + LS Q +V C +GC GG ++ G+ E+ +P 6131Sbjct: 274 LTNNSQTPILSPQEVVSCSQYA----------QGCEGGFPYLIAGKYAQDFGLVEEACFP 
323 6132 6133Query: 216 YTAETGTQCNFNSANIGAKISNFTMIPK-----NETVMAGYIVSTGPLAIAADAVE-WQF 269 6134 YT + C S + + NE +M +V GP+A+A + + + 6135Sbjct: 324 YTGTD-SPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLH 382 6136 6137Query: 270 YIGGVFDI-----PCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 323 6138 Y G++ P NP L +H +L+VGY + M YWIVKNSWG WGE GY 6139Sbjct: 383 YKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSAS---GMDYWIVKNSWGTGWGENGYF 439 6140 6141Query: 324 YLRRGKNTCGVSNFVSTS 341 6142 +RRG + C + + + 6143Sbjct: 440 RIRRGTDECAIESIAVAA 457 6144 6145 6146>sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C) 6147 (CATHEPSIN J) (DIPEPTIDYL TRANSFERASE) 6148 Length = 462 6149 6150 Score = 250 bits (631), Expect = 4e-66 6151 Identities = 86/317 (27%), Positives = 145/317 (45%), Gaps = 37/317 (11%) 6152 6153Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102 6154 E+Y ER + + ++ +N + + A T ++ LS + ++ + 6155Sbjct: 161 EKYSERL--YSHHHNFVKAINSVQKSWTATT---YRRYEKLSIRDLIRRSGHSGRILRPK 215 6156 6157Query: 103 DLPVADYLDDEFINSIPTAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 159 6158 P+ D + + + S+P ++DWR V+PV+NQ CGSC+SF++ G +E + I 6159Sbjct: 216 PAPITDEIQQQIL-SLPESWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRILT 274 6160 6161Query: 160 NKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYT 217 6162 N + LS Q +V C +GC+GG ++ G+ E+ +PYT 6163Sbjct: 275 NNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEENCFPYT 324 6164 6165Query: 218 AETGTQCNFNSANIGAKISNFTMIPK-----NETVMAGYIVSTGPLAIAADAVE-WQFYI 271 6166 A C + S + + NE +M +V GP+A+A + + + Y 6167Sbjct: 325 ATDAP-CKPKENCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYH 383 6168 6169Query: 272 GGVFDI-----PCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6170 G++ P NP L +H +L+VGY + YWIVKNSWG+ WGE GY + 6171Sbjct: 384 SGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVT---GLDYWIVKNSWGSQWGESGYFRI 440 6172 6173Query: 326 RRGKNTCGVSNFVSTSI 342 6174 RRG + C + + +I 6175Sbjct: 441 RRGTDECAIESIAMAAI 457 6176 6177 6178>sp|P22497|CYSP_THEPA 
CYSTEINE PROTEINASE PRECURSOR 6179 Length = 439 6180 6181 Score = 243 bits (613), Expect = 5e-64 6182 Identities = 98/342 (28%), Positives = 155/342 (44%), Gaps = 48/342 (14%) 6183 6184Query: 24 EEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82 6185 E +F EF K+N++++ + E L R F+SN +++E G+N+F+D 6186Sbjct: 119 EVYREFEEFNSKYNRRHATQQERLNRLVTFRSNYLEVKE-----QKGDEPYVKGINRFSD 173 6187 6188Query: 83 LSSDEFKN---------------YYLNNKEAIFTDDLPVADYLDDEFINSIP----TAFD 123 6189 L+ EF YYL + A T + L+ + + D 6190Sbjct: 174 LTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVDLAKLTGENLD 233 6191 6192Query: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE 183 6193 WR +VT VK+Q CG CW+FST G+VEG + +K LS Q L+DCD 6194Sbjct: 234 WRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCD--------- 284 6195 6196Query: 184 EACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK 243 6197 + GC GGL +AY Y+ K G+ + P+ + +C+ A + ++ + 6198Sbjct: 285 -SFSNGCQGGLLESAYEYVRK-YGLVSAKDLPFV-DKARRCSVPKAK-KVSVPSYHVFKG 340 6199 6200Query: 244 NETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 6201 E + +++ P ++ E Y GVF C SL+H +++VG 6202Sbjct: 341 KE--VMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECG-KSLNHAVVLVGEGYDEV---T 394 6203 6204Query: 303 NMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNFVSTS 341 6205 YW+V+NSWG DWGE GY+ L R G + CGV + ++ 6206Sbjct: 395 KKRYWVVQNSWGTDWGENGYMRLERTNMGTDKCGVLDTSMSA 436 6207 6208 6209>sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP SECRETASE) 6210 Length = 339 6211 6212 Score = 238 bits (602), Expect = 1e-62 6213 Identities = 71/299 (23%), Positives = 114/299 (37%), Gaps = 58/299 (19%) 6214 6215Query: 82 DLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQG 137 6216 D+S YL F + +P +FD W + +++QG 6217Sbjct: 51 DMS-------YLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQG 103 6218 6219Query: 138 QCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQ 195 6220 CGSCW+F + + I N VS+ S ++L+ C C C +GCNGG 6221Sbjct: 104 
SCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTC---C-----GSMCGDGCNGGYP 155 6222 6223Query: 196 PNAYNYIIKNGGIQ----------------------TESSYPYTAETGT-QCN------F 226 6224 A+N+ + G + S P T E T +C+ + 6225Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGY 215 6226 6227Query: 227 NSANIGAKISNF--TMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNS 283 6228 + K + + +E + I GP+ A ++ Y GV+ 6229Sbjct: 216 SPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMM 275 6230 6231Query: 284 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 6232 H I I+G+ +N PYW+V NSW DWG+ G+ + RG++ CG+ + V I 6233Sbjct: 276 GGHAIRILGWGVEN-----GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 329 6234 6235 6236>sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 6237 Length = 339 6238 6239 Score = 236 bits (597), Expect = 4e-62 6240 Identities = 69/289 (23%), Positives = 117/289 (39%), Gaps = 51/289 (17%) 6241 6242Query: 92 YLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFST 147 6243 YL + + + ++P +FD W + +++QG CGSCW+F 6244Sbjct: 54 YLKKLCGTVLGGPNLPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGA 113 6245 6246Query: 148 TGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKN 205 6247 + + I N V++ S ++L+ C C C +GCNGG A+N+ + 6248Sbjct: 114 VEAMSDRICIHTNGRVNVEVSAEDLLTC---C-----GIQCGDGCNGGYPSGAWNFWTRK 165 6249 6250Query: 206 GGIQ----------------------TESSYPYTAETGT-QCN------FNSANIGAKIS 236 6251 G + S P T E T +CN ++++ K 6252Sbjct: 166 GLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHY 225 6253 6254Query: 237 NFTM--IPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY 293 6255 +T + +E + I GP+ A ++ Y GV+ H I I+G+ 6256Sbjct: 226 GYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGW 285 6257 6258Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 6259 +N + PYW+V NSW DWG+ G+ + RG+N CG+ + + I 6260Sbjct: 286 GIENGV-----PYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGI 329 6261 6262 6263>sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 6264 Length = 339 6265 
6266 Score = 233 bits (588), Expect = 4e-61 6267 Identities = 67/289 (23%), Positives = 111/289 (38%), Gaps = 51/289 (17%) 6268 6269Query: 92 YLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFST 147 6270 YL + + +P FD W + +++QG CGSCW+F 6271Sbjct: 54 YLKKLCGTVLGGPKLPGRVAFGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGA 113 6272 6273Query: 148 TGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKN 205 6274 + + I N V++ S ++L+ C C C +GCNGG A+++ K 6275Sbjct: 114 VEAISDRTCIHTNGRVNVEVSAEDLLTC---C-----GIQCGDGCNGGYPSGAWSFWTKK 165 6276 6277Query: 206 GGIQ----------------------TESSYPYTAETGT-QCN------FNSANIGAKIS 236 6278 G + S P T E T +CN ++ + K 6279Sbjct: 166 GLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHF 225 6280 6281Query: 237 NFTM--IPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY 293 6282 +T + + + I GP+ A ++ Y GV+ H I I+G+ 6283Sbjct: 226 GYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGW 285 6284 6285Query: 294 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 6286 +N + PYW+ NSW DWG+ G+ + RG+N CG+ + + I 6287Sbjct: 286 GVENGV-----PYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGI 329 6288 6289 6290>sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PRECURSOR 6291 Length = 344 6292 6293 Score = 232 bits (586), Expect = 8e-61 6294 Identities = 68/280 (24%), Positives = 114/280 (40%), Gaps = 53/280 (18%) 6295 6296Query: 105 PVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN 160 6297 D + E ++IP FD W ++ +++Q CGSCW+F+ + + I+ N 6298Sbjct: 69 KDEDIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASN 128 6299 6300Query: 161 KLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTES------ 212 6301 V+ LS ++L+ C G +C GC GG A+ + +K+G + S 6302Sbjct: 129 GAVNTLLSSEDLLSC------CTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFG 182 6303 6304Query: 213 SYPYTA-------------------ETGTQC--------NFNSANIGAKISNFTM--IPK 243 6305 PY+ E +C N+ + + K T + K 6306Sbjct: 183 CKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGK 242 6307 6308Query: 244 
NETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRK 302 6309 + I++ GP+ +A ++ Y GV+ + H + I+G+ N 6310Sbjct: 243 KVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDN----- 297 6311 6312Query: 303 NMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 6313 PYW+V NSW WGE+GY + RG N CG+ + I 6314Sbjct: 298 GTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGI 337 6315 6316 6317>sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 6318 Length = 335 6319 6320 Score = 232 bits (586), Expect = 8e-61 6321 Identities = 66/263 (25%), Positives = 108/263 (40%), Gaps = 51/263 (19%) 6322 6323Query: 118 IPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLV 171 6324 +P +FD W + +++QG CGSCW+F + + I N V++ S ++++ 6325Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139 6326 6327Query: 172 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------- 209 6328 C GE C +GCNGG A+N+ K G + 6329Sbjct: 140 TC------CGGE--CGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHV 191 6330 6331Query: 210 TESSYPYTAETGT-QCN------FNSANIGAKISN--FTMIPKNETVMAGYIVSTGPLAI 260 6332 S P T E T +CN ++ + K + NE + I GP+ 6333Sbjct: 192 NGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEG 251 6334 6335Query: 261 AADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 6336 A ++ Y GV+ H I I+G+ +N PYW+V NSW DWG+ 6337Sbjct: 252 AFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVEN-----GTPYWLVGNSWNTDWGD 306 6338 6339Query: 320 QGYIYLRRGKNTCGVSNFVSTSI 342 6340 G+ + RG++ CG+ + + + 6341Sbjct: 307 NGFFKILRGQDHCGIESEIVAGM 329 6342 6343 6344>sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 6345 Length = 340 6346 6347 Score = 232 bits (585), Expect = 1e-60 6348 Identities = 72/323 (22%), Positives = 122/323 (37%), Gaps = 62/323 (19%) 6349 6350Query: 59 IEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI 118 6351 + +N + +A F D+S Y+ F + +D + 6352Sbjct: 31 VNHINKLNTTGRAGHNF---HNTDMS-------YVKKLCGTFLGGPKAPERVDFAEDMDL 80 6353 6354Query: 119 PTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVD 172 6355 P FD W 
++ +++QG CGSCW+F + + + N VS +S ++L+ 6356Sbjct: 81 PDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLS 140 6357 6358Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI----------------------QT 210 6359 C C C GCNGG A+ Y + G + 6360Sbjct: 141 C---C-----GFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVN 192 6361 6362Query: 211 ESSYPYTAETGT--QCN------FNSANIGAKISNFTM--IPKNETVMAGYIVSTGPLAI 260 6363 S P T E G +C+ ++ + K T +P++E + I GP+ 6364Sbjct: 193 GSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEG 252 6365 6366Query: 261 AADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319 6367 A ++ Y GV+ H I I+G+ +N PYW+ NSW DWG 6368Sbjct: 253 AFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVEN-----GTPYWLAANSWNTDWGI 307 6369 6370Query: 320 QGYIYLRRGKNTCGVSNFVSTSI 342 6371 G+ + RG++ CG+ + + + 6372Sbjct: 308 TGFFKILRGEDHCGIESEIVAGV 330 6373 6374 6375>sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 6376 Length = 454 6377 6378 Score = 231 bits (584), Expect = 1e-60 6379 Identities = 80/281 (28%), Positives = 124/281 (43%), Gaps = 38/281 (13%) 6380 6381Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI---NSIPTAFDWRT-----RGAVTP 132 6382 + + DE +N K + + E I ++P FDW + R VTP 6383Sbjct: 178 SKYTIDELRNRAGGVKSMVTRPSVLNRKTPSKELISLTGNLPLEFDWTSPPDGSRSPVTP 237 6384 6385Query: 133 VKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGC 190 6386 ++NQG CGSC++ + +E + + N + LS Q +VDC EGC 6387Sbjct: 238 IRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCSPY----------SEGC 287 6388 6389Query: 191 NGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPK-----NE 245 6390 NGG ++ G+ + PYT E +C + ++++ I NE 6391Sbjct: 288 NGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCTRYYTTDYSYIGGYYGATNE 347 6392 6393Query: 246 TVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPC--------NPNSL-DHGILIVGYSA 295 6394 +M ++S GP + + ++QFY G++ NP L +H +L+VGY 6395Sbjct: 348 KLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNFNPFELTNHAVLLVGYGV 407 6396 6397Query: 296 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 336 6398 PYW VKNSWG +WGEQGY + RG + CGV + 6399Sbjct: 408 
DKLS---GEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445 6400 6401 6402>sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 6403 Length = 329 6404 6405 Score = 231 bits (584), Expect = 1e-60 6406 Identities = 69/290 (23%), Positives = 114/290 (38%), Gaps = 46/290 (15%) 6407 6408Query: 83 LSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQ 138 6409 ++ +E K ++ K A D + + + S+P FD W ++ +++Q 6410Sbjct: 51 ITEEEMKFKLMDGKYAAAHSD-EIRATEQEVVLASVPATFDSRTQWSECKSIKLIRDQAT 109 6411 6412Query: 139 CGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 196 6413 CGSCW+F + + I +S +L+ C C +C GC GG 6414Sbjct: 110 CGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSC---C-----GSSCGNGCEGGYPI 161 6415 6416Query: 197 NAYNYIIKNGGIQTESSY------PYTAETGTQ--------------CNFNSANIGAKIS 236 6417 A + + G+ T Y PY T C + AK 6418Sbjct: 162 QALRWW-DSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKDK 220 6419 6420Query: 237 NFTM----IPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIV 291 6421 +F + +PKN + I + GP+ A ++ Y GV+ H I I+ 6422Sbjct: 221 HFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKII 280 6423 6424Query: 292 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 6425 G+ ++ PYW+V NSWG +WGE G+ + RG + CG+ + V 6426Sbjct: 281 GWGTES-----GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAG 325 6427 6428 6429>sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PRECURSOR 6430 Length = 335 6431 6432 Score = 231 bits (583), Expect = 2e-60 6433 Identities = 78/304 (25%), Positives = 126/304 (40%), Gaps = 60/304 (19%) 6434 6435Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQ 136 6436 D++ ++ K + + A T D+ V + +E ++IP FD W ++ +++Q 6437Sbjct: 46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE--DTIPATFDARTQWPNCMSINNIRDQ 103 6438 6439Query: 137 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 6440 CGSCW+F+ + I+ N V+ LS ++++ C C C GC GG 6441Sbjct: 104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSN------CGYGCEGGY 154 6442 6443Query: 195 
QPNAYNYIIKNGGIQTESSY-------PYT-------------------AETGTQC---- 224 6444 NA+ Y++K+G T SY PY+ C 6445Sbjct: 155 PINAWKYLVKSG-FCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKC 213 6446 6447Query: 225 ---NFNSANIGAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIP 278 6448 N+N A K T + K + + I++ GP+ A ++ Y GV+ 6449Sbjct: 214 TNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHT 273 6450 6451Query: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 338 6452 H I I+G+ N PYW+V NSW +WGE GY + RG N CG+ + V 6453Sbjct: 274 TGQELGGHAIRILGWGTDN-----GTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328 6454 6455Query: 339 STSI 342 6456 + 6457Sbjct: 329 VGGV 332 6458 6459 6460>sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PRECURSOR 6461 Length = 379 6462 6463 Score = 225 bits (569), Expect = 8e-59 6464 Identities = 83/388 (21%), Positives = 143/388 (36%), Gaps = 78/388 (20%) 6465 6466Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60 6467 MK +L V + + DK+ + E E L + 6468Sbjct: 1 MKTLLFLSCIVVAAYCAC-------NDNLESVLDKYRNREIDSEAAE--------LDGDD 45 6469 6470Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI-- 118 6471 ++ + N T +F+ + + K + + +L + 6472Sbjct: 46 LIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDI 105 6473 6474Query: 119 PTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK--LVSLSEQNLVD 172 6475 P +FD W ++ +++Q CGSCW+F + + I+ + V+LS +L+ 6476Sbjct: 106 PESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLS 165 6477 6478Query: 173 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ------CNF 226 6479 C +C GCNGG A+ Y +K+G I T S+Y TA G + C 6480Sbjct: 166 CCK---------SCGFGCNGGDPLAAWRYWVKDG-IVTGSNY--TANNGCKPYPFPPCEH 213 6481 6482Query: 227 NSANIGA----------------KISNFTMIPKNE---------------TVMAGYIVST 255 6483 +S +S++T +E + +++ 6484Sbjct: 214 HSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH 273 6485 6486Query: 256 GPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 314 6487 GPL IA + ++ Y GGV+ H + ++G+ + I 
PYW V NSW 6488Sbjct: 274 GPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGI-----PYWTVANSWN 328 6489 6490Query: 315 ADWGEQGYIYLRRGKNTCGVSNFVSTSI 342 6491 DWGE G+ + RG + CG+ + V I 6492Sbjct: 329 TDWGEDGFFRILRGVDECGIESGVVGGI 356 6493 6494 6495>sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SJ31) 6496 Length = 342 6497 6498 Score = 222 bits (559), Expect = 1e-57 6499 Identities = 69/299 (23%), Positives = 114/299 (38%), Gaps = 53/299 (17%) 6500 6501Query: 84 SSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQC 139 6502 S D+ + KE + IP+ FD W +++ +++Q +C 6503Sbjct: 56 SLDDARILMGARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRC 115 6504 6505Query: 140 GSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPN 197 6506 GSCW+F + + I S LS +L+ C C + C +GC GG 6507Sbjct: 116 GSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISC---CKD------CGDGCQGGFPGV 166 6508 6509Query: 198 AYNYIIKNGGIQTESS--------YPY----------------TAETGTQCN------FN 227 6510 A++Y +K G + S YP+ QC + 6511Sbjct: 167 AWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYK 226 6512 6513Query: 228 SANIGAKISN--FTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSL 284 6514 + K + NE V+ I+ GP+ A D ++ Y G++ 6515Sbjct: 227 TPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 286 6516 6517Query: 285 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 6518 H I I+G+ + K PYW++ NSW DWGE+G + RG++ C + + V +I 6519Sbjct: 287 GHAIRIIGWGVE-----KRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGLI 340 6520 6521 6522>sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SM31) 6523 Length = 340 6524 6525 Score = 216 bits (545), Expect = 5e-56 6526 Identities = 67/303 (22%), Positives = 117/303 (38%), Gaps = 55/303 (18%) 6527 6528Query: 78 NKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPV 133 6529 N+F L + + + P D+ +++ IP+ FD W ++ + 6530Sbjct: 51 NRFHSLDDARIQMGARREEPDLRRKRRPTVDH--NDWNVEIPSNFDSRKKWPGCKSIATI 108 6531 6532Query: 134 
KNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCN 191 6533 ++Q +CGSCWSF + + I + V LS +L+ C C E+C GC 6534Sbjct: 109 RDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTC---C------ESCGLGCE 159 6535 6536Query: 192 GGLQPNAYNYIIKNGGIQTESS--------YPY----------------------TAETG 221 6537 GG+ A++Y +K G + S YP+ + 6538Sbjct: 160 GGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQT 219 6539 6540Query: 222 TQCNFNSANIGAKISN--FTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIP 278 6541 Q + + K + +E + I+ GP+ + ++ Y G++ 6542Sbjct: 220 CQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHI 279 6543 6544Query: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 338 6545 H I I+G+ +N PYW++ NSW DWGE GY + RG++ C + + V 6546Sbjct: 280 TGEALGGHAIRIIGWGVEN-----KTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEV 334 6547 6548Query: 339 STS 341 6549 6550Sbjct: 335 IAG 337 6551 6552 6553>sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 6554 Length = 341 6555 6556 Score = 213 bits (537), Expect = 4e-55 6557 Identities = 64/297 (21%), Positives = 110/297 (36%), Gaps = 56/297 (18%) 6558 6559Query: 88 FKNYYLNNKEAIFTD--DLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGS 141 6560 FK ++ K + D V D +E + IP ++D W ++ + +Q CGS 6561Sbjct: 59 FKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANCGS 118 6562 6563Query: 142 CWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 199 6564 CW+ S+ + + I+ K V +S Q++V C C C +GC GG +A+ 6565Sbjct: 119 CWAVSSAAAMSDRICIASKGAKQVLISAQDVVSC---CTW------CGDGCEGGWPISAF 169 6566 6567Query: 200 NYIIKNGGIQ------TESSYPYTAETGTQCNFNSANIGAKI------------------ 235 6568 + G + S PY + N G + 6569Sbjct: 170 RFHADEGVVTGGDYNTKGSCRPYEIHPCGH-HGNETYYGECVGMADTPRCKRRCLLGYPK 228 6570 6571Query: 236 --------SNFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDH 286 6572 + + + I+ GP+ ++ Y G++ + H 6573Sbjct: 229 SYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLH 288 6574 6575Query: 287 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 6576 + ++G+ + K 
PYWIV NSW DWGE G+ + RG N CG ++ + 6577Sbjct: 289 AVKVIGWGEE-----KGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAGSV 340 6578 6579 6580>sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PRECURSOR 6581 Length = 342 6582 6583 Score = 212 bits (534), Expect = 1e-54 6584 Identities = 64/302 (21%), Positives = 119/302 (39%), Gaps = 56/302 (18%) 6585 6586Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQ 136 6587 +D + D F+ ++ K +L V + D E IP ++D W+ +++Q 6588Sbjct: 53 SDPTPD-FEQKIMSIKYKHQKLNLMVKEDPDPEVD--IPPSYDPRDVWKNCTTFY-IRDQ 108 6589 6590Query: 137 GQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 194 6591 CGSCW+ ST + + I+ K V++S +++ C C C +GC GG 6592Sbjct: 109 ANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTC---C-----RPQCGDGCEGGW 160 6593 6594Query: 195 QPNAYNYIIKNGGIQTES------SYPYTAET----GTQCNFNSANIGAKI--------- 235 6595 A+ Y I +G + PY G + A 6596Sbjct: 161 PIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRP 220 6597 6598Query: 236 -------------SNFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNP 281 6599 + ++ ++ + I+ GP+ + +++ Y G++ 6600Sbjct: 221 GVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGE 280 6601 6602Query: 282 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 341 6603 H + ++G+ +N N +W++ NSW DWGE+GY + RG N CG+ ++ 6604Sbjct: 281 LRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAG 335 6605 6606Query: 342 II 343 6607 I+ 6608Sbjct: 336 IV 337 6609 6610 6611>sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR 6612 Length = 342 6613 6614 Score = 208 bits (523), Expect = 2e-53 6615 Identities = 62/295 (21%), Positives = 115/295 (38%), Gaps = 55/295 (18%) 6616 6617Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCW 143 6618 F+ ++ K +L V + D E IP ++D W+ +++Q CGSCW 6619Sbjct: 59 FEQKIMDIKYKHQKLNLMVKEDPDPEVD--IPPSYDPRDVWKNCTTFY-IRDQANCGSCW 115 6620 6621Query: 144 SFSTTGNVEGQHFISQN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY 201 6622 + ST + + I+ K V++S +++ C C C +GC GG A+ Y 
6623Sbjct: 116 AVSTAAAISDRICIASKAEKQVNISATDIMTC---C-----RPQCGDGCEGGWPIEAWKY 167 6624 6625Query: 202 IIKNGGIQTES------SYPYTAET----GTQCNFNSANIGAKI---------------- 235 6626 I +G + PY G + A 6627Sbjct: 168 FIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYR 227 6628 6629Query: 236 ------SNFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGI 288 6630 + ++ ++ + I+ GP+ + +++ Y G++ H + 6631Sbjct: 228 IDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAV 287 6632 6633Query: 289 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343 6634 ++G+ +N N +W++ NSW DWGE+GY + RG N CG+ ++ I+ 6635Sbjct: 288 KMIGWGNEN-----NTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIV 337 6636 6637 6638>sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PRECURSOR 6639 Length = 370 6640 6641 Score = 204 bits (513), Expect = 3e-52 6642 Identities = 61/265 (23%), Positives = 99/265 (37%), Gaps = 49/265 (18%) 6643 6644Query: 112 DEFINSIPTAFD----WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--L 165 6645 + +P FD W + ++NQ CGSCW+F + + I N + 6646Sbjct: 86 EIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVI 145 6647 6648Query: 166 SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY------PYTAE 219 6649 S ++++ C C C GC GG A + +G + T Y PY+ 6650Sbjct: 146 SVEDILSC---C-----GTTCGYGCKGGYSIEALRFWASSGAV-TGGDYGGHGCMPYSFA 196 6651 6652Query: 220 TGTQ---------CNF------------NSANIGAKISNFTMIPKNETVMAGYIVSTGPL 258 6653 T+ C + GA T K+ T + I GP+ 6654Sbjct: 197 PCTKNCPESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTT-TKSVTEIQTEIYHYGPV 255 6655 6656Query: 259 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 317 6657 + ++ Y GV+ H + I+G+ +N + YW++ NSWG + 6658Sbjct: 256 EASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGV-----DYWLIANSWGTSF 310 6659 6660Query: 318 GEQGYIYLRRGKNTCGVSNFVSTSI 342 6661 GE+G+ +RRG N C + V I 6662Sbjct: 311 GEKGFFKIRRGTNECQIEGNVVAGI 335 6663 6664 6665>sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 6666 Length = 211 6667 6668 Score = 201 bits (507), Expect = 1e-51 6669 Identities = 
69/220 (31%), Positives = 99/220 (44%), Gaps = 25/220 (11%) 6670 6671Query: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176 6672 S+P+ D R+ VTP++ QG CGSCW+FS + E + +N + L+EQ LVDC 6673Sbjct: 10 SLPSELDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAYRNMSLDLAEQELVDC--- 66 6674 6675Query: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKIS 236 6676 A GC+G P YI +NG +Q E YPY A + N+ G K 6677Sbjct: 67 --------ASQNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSCHRPNAQRYGLK-- 115 6678 6679Query: 237 NFTMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVFDIPCNPNSLD-HGILI 290 6680 N+ I ++ ++ A+A D ++ Y G N + H + I 6681Sbjct: 116 NYCQISPPDSNKIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPNYHAVNI 175 6682 6683Query: 291 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN 330 6684 VGY + + YWIV+NSW WG+ GY Y N 6685Sbjct: 176 VGYGN-----TQGVDYWIVRNSWDTTWGDNGYGYFAANIN 210 6686 6687 6688>sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN) (PDP) 6689 Length = 139 6690 6691 Score = 185 bits (464), Expect = 2e-46 6692 Identities = 55/141 (39%), Positives = 84/141 (59%), Gaps = 5/141 (3%) 6693 6694Query: 192 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGY 251 6695 GGL +A+ Y+ NGG+ +E SYPY A+ G C + N A ++++ IP E + 6696Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQ-GDSCKYRPENSVANVTDYWDIPSKENELMIT 59 6697 6698Query: 252 IVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGILIVGYSAKNTIFRKNMPYWI 308 6699 + + GP++ A DA ++FY G++ P C+ +DHG+L+VGY A T +N YWI 6700Sbjct: 60 LAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGT-ETENKKYWI 118 6701 6702Query: 309 VKNSWGADWGEQGYIYLRRGK 329 6703 +KNSWG DWG GYI + + + 6704Sbjct: 119 IKNSWGTDWGMDGYIKMAKDR 139 6705 6706 6707>sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 6708 Length = 151 6709 6710 Score = 158 bits (395), Expect = 2e-38 6711 Identities = 61/158 (38%), Positives = 86/158 (53%), Gaps = 17/158 (10%) 6712 6713Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 6714 +H+E++ R+E FK N+ + N + + T G+N+ ADLS++E++ YL + I 6715Sbjct: 
1 THKEFMPRYEEFKKNMDYVHNWN----SKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIK 56 6716 6717Query: 101 TDDLPVAD---YLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 157 6718 + + L+ P DWR + AVTPVK+QGQCGSC STTG+VEG I 6719Sbjct: 57 LNGYHKRNLGLRLNRPHFKQ-PLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTAI 114 6720 6721Query: 158 SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQ 195 6722 KLVSLSEQN++ +EGCNGGL 6723Sbjct: 115 KTGKLVSLSEQNILRLSSSF--------GNEGCNGGLM 144 6724 6725 6726>sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P126) (111 KD ANTIGEN) 6727 Length = 989 6728 6729 Score = 157 bits (393), Expect = 3e-38 6730 Identities = 60/255 (23%), Positives = 101/255 (39%), Gaps = 46/255 (18%) 6731 6732Query: 123 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 182 6733 D + V++QG C + W F++ ++E + + +S + +C Y+G 6734Sbjct: 569 DENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANC------YKG 622 6735 6736Query: 183 EEACDEGCNGGLQPNAYNYIIKNGG-IQTESSYPYT-AETGTQCN--------------- 225 6737 E + C+ G P + II++ G + ES+YPY + G QC 6738Sbjct: 623 EHK--DRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKI 680 6739 6740Query: 226 -----------------FNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQ 268 6741 + S + F I K E + G +++ I A+ V 6742Sbjct: 681 LHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAY----IKAENVMGY 736 6743 6744Query: 269 FYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG 328 6745 + G C ++ DH + IVGY + YWIV+NSWG WG++GY + 6746Sbjct: 737 EFSGKKVQNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMY 796 6747 6748Query: 329 KNTCGVSNFVSTSII 343 6749 T NF+ + +I 6750Sbjct: 797 GPTHCHFNFIHSVVI 811 6751 6752 6753>sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 6754 Length = 174 6755 6756 Score = 139 bits (348), Expect = 6e-33 6757 Identities = 31/130 (23%), Positives = 53/130 (39%), Gaps = 8/130 (6%) 6758 6759Query: 217 TAETGTQCNFNSANIGAKISN--FTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGG 273 6760 + Q + A K +P N + I+ GP+ ++ Y G 6761Sbjct: 50 KCQKTCQRGYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSG 
109 6762 6763Query: 274 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 333 6764 ++ + H + I+G+ + K PYW++ NSW DWGE+G+ + RG N C 6765Sbjct: 110 IYKHTAGRMTGGHAVKIIGWGKE-----KGTPYWLIANSWHDDWGEKGFYRMIRGINNCR 164 6766 6767Query: 334 VSNFVSTSII 343 6768 + V I+ 6769Sbjct: 165 IEEMVFAGIV 174 6770 6771 6772>sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 6773 Length = 96 6774 6775 Score = 131 bits (327), Expect = 2e-30 6776 Identities = 43/87 (49%), Positives = 55/87 (62%), Gaps = 2/87 (2%) 6777 6778Query: 256 GPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSW 313 6779 GPLA+A +A Q YIGGV L+HG+L+VGY + I K PYW++KNSW 6780Sbjct: 1 GPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGYAPIRLKEKPYWVIKNSW 60 6781 6782Query: 314 GADWGEQGYIYLRRGKNTCGVSNFVST 340 6783 G +WGE GY + RG+N CGV + VST 6784Sbjct: 61 GENWGENGYYKICRGRNICGVDSMVST 87 6785 6786 6787>sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 6788 Length = 136 6789 6790 Score = 98.2 bits (241), Expect = 2e-20 6791 Identities = 32/131 (24%), Positives = 53/131 (40%), Gaps = 14/131 (10%) 6792 6793Query: 8 VLAVFTVFVSSRGIPPE--EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65 6794 L + + + S PP+ +++ E++ KF K Y+ E R +++ N KIE N 6795Sbjct: 16 FLLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNAD 75 6796 6797Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWR 125 6798 K G+N+F+DL+ +EFK N E +P D 6799Sbjct: 76 YEQGKTSFYMGLNQFSDLTPEEFKTNCYGNSL------------NRGEMAPDLPEYEDLG 123 6800 6801Query: 126 TRGAVTPVKNQ 136 6802 +TP + Q 6803Sbjct: 124 KNSYLTPGRAQ 134 6804 6805 6806>sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 6807 Length = 141 6808 6809 Score = 95.9 bits (235), Expect = 1e-19 6810 Identities = 29/133 (21%), Positives = 54/133 (39%), Gaps = 13/133 (9%) 6811 6812Query: 4 ILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELN 63 6813 + L +L + + + P +++ E++ F K YS +E R +++ N KIE N 6814Sbjct: 20 VFLLILCLGMMSAAPS-PDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHN 78 6815 6816Query: 64 
LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD 123 6817 K G+N+F+DL+ +EF+ + + E +P D 6818Sbjct: 79 ADYERGKTSFYMGLNQFSDLTPEEFR------------TNCCGSSMCRGEMAPDLPEYED 126 6819 6820Query: 124 WRTRGAVTPVKNQ 136 6821 +TP + Q 6822Sbjct: 127 LGKNSYLTPGRAQ 139 6823 6824 6825>sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 6826 Length = 43 6827 6828 Score = 84.6 bits (206), Expect = 3e-16 6829 Identities = 26/42 (61%), Positives = 30/42 (70%) 6830 6831Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN 160 6832 P + DWR +GAVTPVKNQG CGSCW+FST VEG + I 6833Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIVTVEGINKIRTG 43 6834 6835 6836>sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 6837 Length = 43 6838 6839 Score = 84.2 bits (205), Expect = 4e-16 6840 Identities = 26/42 (61%), Positives = 30/42 (70%) 6841 6842Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN 160 6843 P + DWR +GAVTPVKNQG CGSCW+FST VEG + I 6844Sbjct: 2 PESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEGINKIVHG 43 6845 6846 6847>sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 6848 Length = 43 6849 6850 Score = 81.5 bits (198), Expect = 2e-15 6851 Identities = 24/42 (57%), Positives = 29/42 (68%) 6852 6853Query: 119 PTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN 160 6854 P + DWR +GAVTPVK+Q CGSCW+FST VEG + I 6855Sbjct: 2 PGSVDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTG 43 6856 6857 6858>sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 6859 Length = 42 6860 6861 Score = 77.2 bits (187), Expect = 5e-14 6862 Identities = 20/42 (47%), Positives = 28/42 (66%), Gaps = 1/42 (2%) 6863 6864Query: 303 NMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 343 6865 YWIVKNSWG WG++GYIY+ + KN CG++ S ++ 6866Sbjct: 1 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 42 6867 6868 6869>sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 6870 Length = 43 6871 6872 Score = 76.0 bits (184), Expect = 1e-13 6873 Identities = 24/40 (60%), Positives = 29/40 (72%) 6874 6875Query: 118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 157 6876 I + DWR +GAVTPV+NQG CGSCW+FS+ VEG I 
6877Sbjct: 1 IVASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEGIIKI 40 6878 6879 6880>sp|P05689|CATX_BOVIN CATHEPSIN 6881 Length = 73 6882 6883 Score = 59.7 bits (142), Expect = 9e-09 6884 Identities = 15/41 (36%), Positives = 24/41 (57%), Gaps = 5/41 (12%) 6885 6886Query: 285 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6887 +H + + G+ + M YWIV+NSWG WGE G++ + 6888Sbjct: 10 NHIVSVAGWGVSD-----GMEYWIVRNSWGEPWGEHGWMRI 45 6889 6890 6891>sp|P94869|PEPG_LACDL AMINOPEPTIDASE G 6892 Length = 437 6893 6894 Score = 46.0 bits (107), Expect = 1e-04 6895 Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 4/49 (8%) 6896 6897Query: 280 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG 328 6898 + H + +VG R+ W V+NSWG GE+G+ + 6899Sbjct: 355 GAGEVSHAMTLVGVDEDKGDIRQ----WKVENSWGDKSGEKGFFVMSHN 399 6900 6901 6902>sp|P94870|PEPE_LACHE AMINOPEPTIDASE E 6903 Length = 438 6904 6905 Score = 42.9 bits (99), Expect = 0.001 6906 Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 4/48 (8%) 6907 6908Query: 278 PCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6909 + H + +VG N R+ W V+NSWG G +GY + 6910Sbjct: 354 KTGVGEVSHAMTLVGVDEDNGEVRQ----WKVENSWGDKSGAKGYYVM 397 6911 6912 6913>sp|P94868|PEPW_LACDL AMINOPEPTIDASE W 6914 Length = 437 6915 6916 Score = 42.1 bits (97), Expect = 0.002 6917 Identities = 14/43 (32%), Positives = 20/43 (45%), Gaps = 4/43 (9%) 6918 6919Query: 286 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG 328 6920 H + +VG R+ W V+NSWG GE+GY + 6921Sbjct: 361 HDMALVGVDVDGGQVRQ----WKVENSWGDKSGEKGYFTMSAD 399 6922 6923 6924>sp|Q10744|PEPC_LACHE AMINOPEPTIDASE C 6925 Length = 449 6926 6927 Score = 41.4 bits (95), Expect = 0.003 6928 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 4/46 (8%) 6929 6930Query: 280 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6931 + +DH ++I G + K W ++NSWG G +GY + 6932Sbjct: 358 GESMMDHAMVITGVDIVDGKPTK----WKIENSWGEKPGFKGYFVM 399 6933 6934 6935>sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 6936 Length = 436 6937 6938 Score = 39.8 bits (91), Expect = 0.009 
6939 Identities = 15/68 (22%), Positives = 29/68 (42%), Gaps = 9/68 (13%) 6940 6941Query: 262 ADAVEWQ------FYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 315 6942 D+ +++ F + + + H +++ G + N W V+NSWG 6943Sbjct: 326 MDSYDFKSSLDIEFTQSKAGRLDYGESLMTHAMVLAG---VDLDADGNSTKWKVENSWGK 382 6944 6945Query: 316 DWGEQGYI 323 6946 D G++GY 6947Sbjct: 383 DAGQKGYF 390 6948 6949 6950 Score = 31.6 bits (70), Expect = 2.5 6951 Identities = 15/77 (19%), Positives = 28/77 (35%), Gaps = 10/77 (12%) 6952 6953Query: 77 VNKFADLSS---DEFKN--YYLNNKEAIFTDDLPVADYLDDEFINSIPT-AFDWRTRGAV 130 6954 + +D + + F + A+ + L + + ++P + D 6955Sbjct: 1 MTVTSDFTQKLYENFAENTKLRAVENAVTKNGLLSSLEVRGSHAANLPEFSLDLTKD--- 57 6956 6957Query: 131 TPVKNQGQCGSCWSFST 147 6958 PV NQ Q G CW F+ 6959Sbjct: 58 -PVTNQKQSGRCWMFAA 73 6960 6961 6962>sp|Q48543|PEPC_LACDL AMINOPEPTIDASE C 6963 Length = 449 6964 6965 Score = 38.2 bits (87), Expect = 0.025 6966 Identities = 12/46 (26%), Positives = 23/46 (49%), Gaps = 4/46 (8%) 6967 6968Query: 280 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325 6969 + ++H ++I A + + K W ++NSWG G +GY + 6970Sbjct: 358 GESMMNHAMVIT---AVDLVDDKPTK-WKIENSWGDKSGFKGYFVM 399 6971 6972 6973>sp|P21381|THPA_THADA THAUMATOPAIN 6974 Length = 35 6975 6976 Score = 38.2 bits (87), Expect = 0.025 6977 Identities = 14/27 (51%), Positives = 19/27 (69%), Gaps = 2/27 (7%) 6978 6979Query: 117 SIPTAFDWRTRGAVTPVKNQGQ-CGSC 142 6980 ++P + DW +GAV VKNQ + CGSC 6981Sbjct: 1 NLPNSVDWWKKGAVAAVKNQ-RXCGSC 26 6982 6983 6984>sp|Q56115|PEPC_STRTR AMINOPEPTIDASE C 6985 Length = 445 6986 6987 Score = 35.9 bits (81), Expect = 0.13 6988 Identities = 11/46 (23%), Positives = 21/46 (44%), Gaps = 5/46 (10%) 6989 6990Query: 279 CNPNSLDHGILIVGYSAKNTIFRKNMPY-WIVKNSWGADWGEQGYI 323 6991 + + + H +++ G P W ++NSWG G++GY 6992Sbjct: 356 YSESLMTHAMVLTGVDLD----ADGKPIKWKIENSWGDKVGQKGYF 397 6993 6994 6995>sp|P09983|HLY1_ECOLI HEMOLYSIN, CHROMOSOMAL 6996 Length = 1023 6997 6998 Score = 35.5 bits (80), Expect = 0.17 6999 Identities = 21/119 (17%), Positives = 
35/119 (28%), Gaps = 20/119 (16%) 7000 7001Query: 31 EFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAI-------NHKADTKF-----GV 77 7002 E++ K K Y Y R F + N + + N + GV 7003Sbjct: 430 EWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHWDTLIGELAGV 489 7004 7005Query: 78 NKFAD--LSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVK 134 7006 + D LS + +YY K D + + + D + T +K 7007Sbjct: 490 TRNGDKTLSGKSYIDYYEEGKR-----LEKKPDEFQKQVFDPLKGNIDLSDSKSSTLLK 543 7008 7009 7010>sp|P33403|CYSP_TRIFO CYSTEINE PROTEINASE 7011 Length = 23 7012 7013 Score = 34.7 bits (78), Expect = 0.28 7014 Identities = 8/19 (42%), Positives = 12/19 (63%) 7015 7016Query: 120 TAFDWRTRGAVTPVKNQGQ 138 7017 + DWR +G V +K+Q Q 7018Sbjct: 2 DSLDWREKGVVNSIKDQAQ 20 7019 7020 7021>sp|P08715|HLYA_ECOLI HEMOLYSIN, PLASMID 7022 Length = 1024 7023 7024 Score = 34.7 bits (78), Expect = 0.28 7025 Identities = 9/38 (23%), Positives = 14/38 (36%), Gaps = 1/38 (2%) 7026 7027Query: 31 EFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAI 67 7028 E++ K K Y Y R F + N + + N 7029Sbjct: 431 EWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYS 468 7030 7031 7032>sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (AMINOPEPTIDASE H) 7033 Length = 455 7034 7035 Score = 34.3 bits (77), Expect = 0.37 7036 Identities = 10/19 (52%), Positives = 14/19 (73%) 7037 7038Query: 307 WIVKNSWGADWGEQGYIYL 325 7039 W V+NSWG D G +GY+ + 7040Sbjct: 392 WRVENSWGEDRGNKGYLIM 410 7041 7042 7043>sp|P54704|PSPB_DICDI PRESPORE PROTEIN B PRECURSOR 7044 Length = 379 7045 7046 Score = 34.3 bits (77), Expect = 0.37 7047 Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 15/109 (13%) 7048 7049Query: 210 TESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQF 269 7050 T YP T G +N + K + ++PK E + Y+ GP D W++ 7051Sbjct: 78 TGKIYP-TVNMGDCHRYNVDLVFKKDKSGNVMPKKELRESAYVP-HGP----IDPATWKY 131 7052 7053Query: 270 YI--GGVFDI-PCNPNSL------DHGILIVGYSAKNTIFRKNMPYWIV 309 7054 Y G + C+P ++ L +GY A + W++ 7055Sbjct: 132 YTFVQGKWTGFGCDPQNVVFSGAEGGMPLQLGYGANGKNGDNGISVWLI 180 7056 7057 
7058>sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 7059 Length = 455 7060 7061 Score = 34.0 bits (76), Expect = 0.49 7062 Identities = 10/19 (52%), Positives = 14/19 (73%) 7063 7064Query: 307 WIVKNSWGADWGEQGYIYL 325 7065 W V+NSWG D G +GY+ + 7066Sbjct: 392 WRVENSWGEDHGHKGYLCM 410 7067 7068 7069>sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 7070 Length = 454 7071 7072 Score = 34.0 bits (76), Expect = 0.49 7073 Identities = 10/19 (52%), Positives = 14/19 (73%) 7074 7075Query: 307 WIVKNSWGADWGEQGYIYL 325 7076 W V+NSWG D G +GY+ + 7077Sbjct: 392 WRVENSWGEDHGHKGYLCM 410 7078 7079 7080>sp|P80532|CAT3_FASHE PUTATIVE CATHEPSIN L3 (NEWLY EXCYSTED JUVENILE PROTEIN 8) 7081 Length = 19 7082 7083 Score = 34.0 bits (76), Expect = 0.49 7084 Identities = 9/18 (50%), Positives = 12/18 (66%) 7085 7086Query: 118 IPTAFDWRTRGAVTPVKN 135 7087 +P + DWR G VT VK+ 7088Sbjct: 2 VPASIDWREYGYVTEVKD 19 7089 7090 7091>sp|P16462|LKTA_ACTAC LEUKOTOXIN 7092 Length = 1050 7093 7094 Score = 34.0 bits (76), Expect = 0.49 7095 Identities = 11/68 (16%), Positives = 25/68 (36%), Gaps = 2/68 (2%) 7096 7097Query: 12 FTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKA 71 7098 + S I + + +++K+ K YS Y R F ++ N + +K 7099Sbjct: 410 ASKQAVSEHIANQLADKIKAWENKYGKNYSENGYDARHSAFLE--DSLKLFNELREKYKT 467 7100 7101Query: 72 DTKFGVNK 79 7102 + + + 7103Sbjct: 468 ENILSITQ 475 7104 7105 7106>sp|P13438|TSP_MOUSE TROPHOBLAST-SPECIFIC PROTEIN PRECURSOR 7107 Length = 124 7108 7109 Score = 32.4 bits (72), Expect = 1.4 7110 Identities = 19/129 (14%), Positives = 41/129 (31%), Gaps = 12/129 (9%) 7111 7112Query: 8 VLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAI 67 7113 L + + V+S I PE Q + K +E L + ++ + + + 7114Sbjct: 6 FLVILCLGVASAVIVPEAQLDAELQEQK------DKEVLIK-AVWSKFMKTNKLHSSEND 58 7115 7116Query: 68 NHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTR 127 7117 + ++ L+ +E +F ++ + + P D+ 7118Sbjct: 59 QETEGSNIEMSASGQLTDEELMKIMTTVLHPMFEEEENKP-----QPVVDDPEFEDYTES 113 7119 
7120Query: 128 GAVTPVKNQ 136 7121 G V NQ 7122Sbjct: 114 GDGFFVPNQ 122 7123 7124 7125>sp|Q00951|HLYA_ACTSU HEMOLYSIN (CYTOLYSIN II) (CLY-IIA) (HLY-IIA) (CYTC) (APPA) 7126 Length = 956 7127 7128 Score = 32.4 bits (72), Expect = 1.4 7129 Identities = 10/36 (27%), Positives = 16/36 (43%), Gaps = 1/36 (2%) 7130 7131Query: 31 EFQDKFNKKYSHEEYLER-FEIFKSNLGKIEELNLI 65 7132 E++ K NK Y + Y R + N+ + LN 7133Sbjct: 427 EWEKKHNKNYFEQGYDSRHLADLQDNMKFLINLNKE 462 7134 7135 7136>sp|P15377|RT2A_ACTPL RTX-II TOXIN DETERMINANT A (APX-IIA) (HEMOLYSIN IIA) (HLY-IIA) 7137 (CYTOLYSIN IIA) (CLY-IIA) 7138 Length = 956 7139 7140 Score = 32.4 bits (72), Expect = 1.4 7141 Identities = 10/36 (27%), Positives = 16/36 (43%), Gaps = 1/36 (2%) 7142 7143Query: 31 EFQDKFNKKYSHEEYLER-FEIFKSNLGKIEELNLI 65 7144 E++ K NK Y + Y R + N+ + LN 7145Sbjct: 427 EWEKKHNKNYFEQGYDSRHLADLQDNMKFLINLNKE 462 7146 7147 7148>sp|P52181|TGLC_PAGMA PROTEIN-GLUTAMINE GAMMA-GLUTAMYLTRANSFERASE (TISSUE 7149 TRANSGLUTAMINASE) (TGASE C) (TGC) 7150 Length = 695 7151 7152 Score = 32.0 bits (71), Expect = 1.9 7153 Identities = 12/48 (25%), Positives = 17/48 (35%), Gaps = 5/48 (10%) 7154 7155Query: 102 DDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 149 7156 ++ + S+P W G V PVK G CW F+ 7157Sbjct: 237 EEPYTDGVAPYRWTGSVPILQQWSKAG-VRPVKY----GQCWVFAAVA 279 7158 7159 7160>sp|P35681|TCTP_ORYSA TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HOMOLOG (TCTP) 7161 Length = 168 7162 7163 Score = 32.0 bits (71), Expect = 1.9 7164 Identities = 12/34 (35%), Positives = 19/34 (55%) 7165 7166Query: 22 PPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSN 55 7167 PP ++ QF+ F ++ K S + E+ E FK N 7168Sbjct: 80 PPFDKKQFVTFMKRYIKNLSAKLDAEKQEEFKKN 113 7169 7170 7171>sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDROLASE) (BLM HYDROLASE) 7172 Length = 454 7173 7174 Score = 31.6 bits (70), Expect = 2.5 7175 Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 9/40 (22%) 7176 7177Query: 131 TPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 170 7178 TPV NQ G CW F+ T + +L LSE 
NL 7179Sbjct: 62 TPVTNQKSSGRCWLFAATNQL---------RLNVLSELNL 92 7180 7181 7182>sp|P16312|MMAL_DERMI MAJOR MITE FECAL ALLERGEN DER M 1 (DER M I) 7183 Length = 30 7184 7185 Score = 31.2 bits (69), Expect = 3.2 7186 Identities = 8/24 (33%), Positives = 14/24 (58%) 7187 7188Query: 114 FINSIPTAFDWRTRGAVTPVKNQG 137 7189 ++P+ D R+ VTP++ QG 7190Sbjct: 7 NSGNVPSELDLRSLRTVTPIRMQG 30 7191 7192 7193>sp|O03992|TCTP_FRAAN TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HOMOLOG (TCTP) 7194 Length = 170 7195 7196 Score = 30.8 bits (68), Expect = 4.2 7197 Identities = 16/62 (25%), Positives = 30/62 (47%), Gaps = 13/62 (20%) 7198 7199Query: 22 PPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81 7200 PP ++ QF+ + ++ K + + E+ E FK N+ + TKF ++K + 7201Sbjct: 82 PPFDKKQFVTWVKRYIKLLTPKLEGEQQETFKKNI-------------EGATKFLLSKLS 128 7202 7203Query: 82 DL 83 7204 DL 7205Sbjct: 129 DL 130 7206 7207 7208>sp|Q04489|YMJ6_YEAST HYPOTHETICAL 59.5 KD PROTEIN IN VPS9-RAD10 INTERGENIC REGION 7209 Length = 525 7210 7211 Score = 30.8 bits (68), Expect = 4.2 7212 Identities = 26/122 (21%), Positives = 43/122 (34%), Gaps = 20/122 (16%) 7213 7214Query: 174 DHECMEYEGEEACDEGCNGGLQPNAYNYIIKN-----GGIQ----TESSYPYTAET--GT 222 7215 D +++ GE E G +++N G I E Y YT + 7216Sbjct: 87 DRYFLQFNGELYNKEISQGDNDSLYIASMLQNLKEGMGVIDVIKSLEGEYAYTIYDVNSS 146 7217 7218Query: 223 QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282 7219 + F IG + ++++ P NE +A ++ A +Q IGGV 7220Sbjct: 147 KLYFGRDPIGRRSLSYSVTPDNELYVA---------SVTGSAGSFQDCIGGVIYEYDTRT 197 7221 7222Query: 283 SL 284 7223 L 7224Sbjct: 198 KL 199 7225 7226 7227>sp|Q03164|HRX_HUMAN ZINC FINGER PROTEIN HRX (ALL-1) (TRITHORAX-LIKE PROTEIN) 7228 Length = 3969 7229 7230 Score = 30.8 bits (68), Expect = 4.2 7231 Identities = 17/81 (20%), Positives = 38/81 (45%), Gaps = 7/81 (8%) 7232 7233Query: 126 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC-DHECMEYEGEE 184 7234 +V+ VK QGQ S + ++E + + S++NL+D + E ++ + + 7235Sbjct: 2782 
NCHSVSRVKTQGQ-DSLEA--QLSSLESSRRVHTSTP---SDKNLLDTYNTELLKSDSDN 2835 7236 7237Query: 185 ACDEGCNGGLQPNAYNYIIKN 205 7238 + C L + ++++KN 7239Sbjct: 2836 NNSDDCGNILPSDIMDFVLKN 2856 7240 7241 7242>sp||CATB_COTJA_1 [Segment 1 of 2] CATHEPSIN B (CATHEPSIN B1) 7243 Length = 25 7244 7245 Score = 30.4 bits (67), Expect = 5.6 7246 Identities = 6/25 (24%), Positives = 12/25 (48%), Gaps = 4/25 (16%) 7247 7248Query: 118 IPTAFD----WRTRGAVTPVKNQGQ 138 7249 +P FD W ++ +++QG 7250Sbjct: 1 LPDTFDSRKQWPNCPTISEIRDQGS 25 7251 7252 7253>sp|Q62703|RCN2_RAT RETICULOCALBIN 2 PRECURSOR (CALCIUM-BINDING PROTEIN ERC-55) 7254 (TAIPOXIN-ASSOCIATED CALCIUM-BINDING PROTEIN-49) 7255 (TCBP-49) 7256 Length = 318 7257 7258 Score = 30.4 bits (67), Expect = 5.6 7259 Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 4/109 (3%) 7260 7261Query: 26 QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIA-INHKADTKFGVNKFADLS 84 7262 +++ ++ K Y+ +E ++F + N + + F N D + 7263Sbjct: 84 ENELSQWIQMSFKHYAMQEAKQQFVEYDKNSDGTVTWDEYNVQMYDRVIDFDENTALDDT 143 7264 7265Query: 85 SDE-FKNYYLNNKEAIFTDDLPVADYLDDEFINSI--PTAFDWRTRGAV 130 7266 +E F+ +L +K+ + L+ E + P D+ T + 7267Sbjct: 144 EEESFRQLHLKDKKRFEKANQDSGPGLNLEEFIAFEHPEEVDYMTEFVI 192 7268 7269 7270>sp|P48651|PSS1_HUMAN PHOSPHATIDYLSERINE SYNTHASE I (SERINE-EXCHANGE ENZYME I) (KIAA0024) 7271 Length = 473 7272 7273 Score = 30.4 bits (67), Expect = 5.6 7274 Identities = 6/20 (30%), Positives = 8/20 (40%) 7275 7276Query: 142 CWSFSTTGNVEGQHFISQNK 161 7277 CW F G +E I + 7278Sbjct: 354 CWVFGVIGFLEAIVCIKFGQ 373 7279 7280 7281>sp|P33404|CYSP_TRIVA CYSTEINE PROTEINASE 7282 Length = 22 7283 7284 Score = 30.4 bits (67), Expect = 5.6 7285 Identities = 10/17 (58%), Positives = 13/17 (75%), Gaps = 1/17 (5%) 7286 7287Query: 123 DWRTRGAVTPV-KNQGQ 138 7288 DWR +GAV + K+QGQ 7289Sbjct: 6 DWRKKGAVNVIXKDQGQ 22 7290 7291 7292>sp|Q00576|PSS1_CRILO PHOSPHATIDYLSERINE SYNTHASE I (SERINE-EXCHANGE ENZYME I) 7293 Length = 471 7294 7295 Score = 30.4 bits (67), Expect = 5.6 7296 Identities = 6/20 (30%), 
Positives = 8/20 (40%) 7297 7298Query: 142 CWSFSTTGNVEGQHFISQNK 161 7299 CW F G +E I + 7300Sbjct: 354 CWVFGVIGFLEAIVCIKFGQ 373 7301 7302 7303>sp|Q9ZRX0|TCTP_PSEMZ TRANSLATIONALLY CONTROLLED TUMOR PROTEIN HOMOLOG (TCTP) 7304 Length = 167 7305 7306 Score = 30.1 bits (66), Expect = 7.3 7307 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 13/62 (20%) 7308 7309Query: 22 PPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81 7310 PP ++ QFL F ++ K + + ER FK N+ + K V+K + 7311Sbjct: 79 PPFDKKQFLGFIKRYIKNLATKLSEERQAEFKKNV-------------EGAAKMLVSKLS 125 7312 7313Query: 82 DL 83 7314 DL 7315Sbjct: 126 DL 127 7316 7317 7318>sp|Q94480|V136_DICDI VEG136 PROTEIN 7319 Length = 357 7320 7321 Score = 30.1 bits (66), Expect = 7.3 7322 Identities = 21/95 (22%), Positives = 34/95 (35%), Gaps = 3/95 (3%) 7323 7324Query: 70 KADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA 129 7325 +G++ F LS DE Y ++ + Y I S ++W 7326Sbjct: 79 NKKYDYGLDLFL-LSIDEGITGYRDDSLETVKRNQEQ--YPIPSQILSYKELYNWTMDDI 135 7327 7328Query: 130 VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS 164 7329 V + +G C SC F G + NK+V+ 7330Sbjct: 136 VKEIGLKGNCTSCGVFRRQALDRGAVMLKANKIVT 170 7331 7332 7333>sp|P55131|RT32_ACTPL RTX-III TOXIN DETERMINANT A FROM SEROTYPE 8 (APX-IIIA) (CYTOLYSIN 7334 IIIA) (CLY-IIIA) 7335 Length = 1052 7336 7337 Score = 30.1 bits (66), Expect = 7.3 7338 Identities = 9/53 (16%), Positives = 21/53 (38%), Gaps = 1/53 (1%) 7339 7340Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAINHKA 71 7341 + + ++ E++ K+ K Y Y R + F + + + N +A 7342Sbjct: 427 HVASKIGNKIDEWEKKYGKNYFENGYDARHKAFLEDSFSLLSSFNKQYETERA 479 7343 7344 7345>sp|P55130|RT31_ACTPL RTX-III TOXIN DETERMINANT A FROM SEROTYPE 2 (APX-IIIA) (CYTOLYSIN 7346 IIIA) (CLY-IIIA) 7347 Length = 1049 7348 7349 Score = 30.1 bits (66), Expect = 7.3 7350 Identities = 9/53 (16%), Positives = 21/53 (38%), Gaps = 1/53 (1%) 7351 7352Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAINHKA 71 7353 + + ++ E++ K+ K Y Y R + F + + + N +A 7354Sbjct: 427 
HVASKIGNKIDEWEKKYGKNYFENGYDARHKAFLEDSFSLLSSFNKQYETERA 479 7355 7356 7357>sp|P40101|YE16_YEAST HYPOTHETICAL 35.9 KD PROTEIN IN ISC10 3'REGION 7358 Length = 306 7359 7360 Score = 30.1 bits (66), Expect = 7.3 7361 Identities = 13/48 (27%), Positives = 16/48 (33%), Gaps = 1/48 (2%) 7362 7363Query: 199 YNY-IIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNE 245 7364 Y Y I+KNG ES Y E F K+ + E 7365Sbjct: 103 YQYEILKNGDFPEESDYEVKGECDGFTLFKVLFCTVKVKKTSYYRNKE 150 7366 7367 7368>sp|P13388|XMRK_XIPMA MELANOMA RECEPTOR PROTEIN-TYROSINE KINASE PRECURSOR 7369 Length = 1166 7370 7371 Score = 30.1 bits (66), Expect = 7.3 7372 Identities = 16/54 (29%), Positives = 23/54 (41%), Gaps = 11/54 (20%) 7373 7374Query: 140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGG 193 7375 GSCW+ H KL+ +EQ C+ C + + C+E C GG 7376Sbjct: 202 GSCWAPGPG------HCQKFTKLL-CAEQ----CNRRCRGPKPIDCCNEHCAGG 244 7377 7378 7379>sp|Q9ZL75|MOAA_HELPJ MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 7380 Length = 321 7381 7382 Score = 29.7 bits (65), Expect = 9.5 7383 Identities = 23/162 (14%), Positives = 51/162 (31%), Gaps = 9/162 (5%) 7384 7385Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100 7386 + +E LE E K+ +I + + H G+ + L K + ++ 7387Sbjct: 165 NDDEILELLEYAKNRSIQIRYIEFMENTHAKSLVKGLKEKEILDLIAQKYKIMGMEKPKQ 224 7388 7389Query: 101 TDDLPVADYLDDEFINSIPTAFDW-RTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 159 7390 +F P + D+ ++ + + C + E 7391Sbjct: 225 GSSKIYTLENGYQFGIIAPHSDDFCQSCNRIRLASDGKICPCLYYQDAIDAKEAIINKDT 284 7392 7393Query: 160 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY 201 7394 + L +Q++++ + M + NGG A+ Y 7395Sbjct: 285 KMMKRLLKQSIINKPEKNMWNDK--------NGGTPTRAFYY 318 7396 7397 7398>sp|P55129|RT12_ACTPL RTX-I TOXIN DETERMINANT A FROM SEROTYPES 5/10 (APX-IA) (HEMOLYSIN 7399 IA) (HLY-IA) (CYTOLYSIN IA) (CLY-IA) 7400 Length = 1023 7401 7402 Score = 29.7 bits (65), Expect = 9.5 7403 Identities = 8/42 (19%), Positives = 15/42 (35%), Gaps = 1/42 (2%) 7404 7405Query: 27 SQFLEFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAI 67 
7406 ++ E++ K K Y Y R F + + + N 7407Sbjct: 423 NKIDEWEKKHGKNYFENGYDARHSAFLEDTFELLSQYNKEYS 464 7408 7409 7410>sp|P55128|RT11_ACTPL RTX-I TOXIN DETERMINANT A FROM SEROTYPES 1/9 (APX-IA) (HEMOLYSIN 7411 IA) (HLY-IA) (CYTOLYSIN IA) (CLY-IA) 7412 Length = 1023 7413 7414 Score = 29.7 bits (65), Expect = 9.5 7415 Identities = 8/42 (19%), Positives = 15/42 (35%), Gaps = 1/42 (2%) 7416 7417Query: 27 SQFLEFQDKFNKKYSHEEYLERFEIF-KSNLGKIEELNLIAI 67 7418 ++ E++ K K Y Y R F + + + N 7419Sbjct: 423 NKIDEWEKKHGKNYFENGYDARHSAFLEDTFELLSQYNKEYS 464 7420 7421 7422>sp|P35669|GSHB_SCHPO GLUTATHIONE SYNTHETASE LARGE CHAIN (GLUTATHIONE SYNTHASE LARGE 7423 CHAIN) (GSH SYNTHETASE LARGE CHAIN) (GSH-S) 7424 (PHYTOCHELATIN SYNTHETASE) 7425 Length = 498 7426 7427 Score = 29.7 bits (65), Expect = 9.5 7428 Identities = 16/67 (23%), Positives = 26/67 (37%), Gaps = 6/67 (8%) 7429 7430Query: 33 QDKFNKKYSHEEYLERFE------IFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86 7431 Q +NK Y+ F I K + + NL + +A N+F LS 7432Sbjct: 66 QKAYNKLYAKIANDYEFLRLHLQSITKYDEFMNKLWNLYQKHREAVAHLKENQFQPLSLG 125 7433 7434Query: 87 EFKNYYL 93 7435 F++ Y+ 7436Sbjct: 126 VFRSDYM 132 7437 7438 7439>sp|P11140|ABRA_ABRPR ABRIN-A PRECURSOR (RRNA N-GLYCOSIDASE) 7440 Length = 528 7441 7442 Score = 29.7 bits (65), Expect = 9.5 7443 Identities = 12/59 (20%), Positives = 22/59 (36%), Gaps = 2/59 (3%) 7444 7445Query: 152 EGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQT 210 7446 E Q + + + S QN +C +G GC+ G + + +G I + 7447Sbjct: 436 EQQWALYTDGSIR-SVQNTNNCLTSKDHKQGSTILLMGCSNGWASQRWVF-KNDGSIYS 492 7448 7449 7450>sp|Q00690|LEM2_MOUSE E-SELECTIN PRECURSOR (ENDOTHELIAL LEUKOCYTE ADHESION MOLECULE 1) 7451 (ELAM-1) (LEUKOCYTE-ENDOTHELIAL CELL ADHESION MOLECULE 7452 2) (LECAM2) (CD62E) 7453 Length = 612 7454 7455 Score = 29.7 bits (65), Expect = 9.5 7456 Identities = 7/30 (23%), Positives = 13/30 (43%) 7457 7458Query: 171 VDCDHECMEYEGEEACDEGCNGGLQPNAYN 200 7459 ++C H + +C GC G P++ 7460Sbjct: 191 LNCSHPFGPFSYNSSCSFGCKRGYLPSSME 220 7461 7462 
>sp|P06620|ICEN_PSESY ICE NUCLEATION PROTEIN
          Length = 1200

 Score = 29.7 bits (65), Expect = 9.5
 Identities = 9/33 (27%), Positives = 13/33 (39%), Gaps = 3/33 (9%)

Query: 277 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIV 309
              N  + +H  LI GY +  T        W+V
Sbjct: 194 YGSNETAGNHSDLIAGYGSTGTA---GSDSWLV 223


  Database: /home/peter/blast/data/swissprot.pr
    Posted date:  Oct 10, 2000 10:43 AM
  Number of letters in database: 31,984,247
  Number of sequences in database:  88,780

Lambda     K      H
   0.318    0.140    0.482

Lambda     K      H
   0.270   0.0491    0.230


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 48087047
Number of Sequences: 88780
Number of extensions: 2165712
Number of successful extensions: 7103
Number of sequences better than 10.0: 284
Number of HSP's better than 10.0 without gapping: 253
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 5661
Number of HSP's gapped (non-prelim): 339
length of query: 343
length of database: 31,984,247
effective HSP length: 49
effective length of query: 294
effective length of database: 27,634,027
effective search space: 8124403938
effective search space used: 8124403938
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.6 bits)
S2: 65 (29.7 bits)