| Name | Accession | Description | Interval | E-value |
| SEA | pfam01390 | SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ... | 2344-2426 | 6.70e-09 |
|
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.
Pssm-ID: 460188 Cd Length: 100 Bit Score: 55.32 E-value: 6.70e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRL--FNGSVVVehDVVME-----TNYTS 2415
Cdd:pfam01390 5 GSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSSLrKQYIKSHVLRLrpDGGSVVV--DVVLVfrfpsTEPAL 82
|
90
....*....|.
gi 1953082137 2416 DFQKLFENLIE 2426
Cdd:pfam01390 83 DREKLIEEILR 93
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 820-1245 | 3.46e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 59.41 E-value: 3.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 820 VPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATES 899
Cdd:PHA03307 43 LVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPP 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 900 TFS---TIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSAS 976
Cdd:PHA03307 123 PASpppSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAA 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 977 PSGPGQLSTTVSVSAqtttglvDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPftthaDGGHTTTSLAagsT 1056
Cdd:PHA03307 203 SPRPPRRSSPISASA-------SSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP-----LPRPAPITLP---T 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1057 IYTATAPSELSTPSFSTTASHSTDSE-----IPTSTSSPSELSTHTVVTGQAGSTPTGettiiptvpaSSEPTASTHVSH 1131
Cdd:PHA03307 268 RIWEASGWNGPSSRPGPASSSSSPRErspspSPSSPGSGPAPSSPRASSSSSSSRESS----------SSSTSSSSESSR 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1132 TTDAGRSTVPSRPGDLSTSPAVSGPTATgvPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSptESLATSPGSGPS 1211
Cdd:PHA03307 338 GAAVSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR--DATGRFPAGRPR 413
|
410 420 430
....*....|....*....|....*....|....
gi 1953082137 1212 ASPSATESTFSTIVSESSEYtvasYTTGSPSPSS 1245
Cdd:PHA03307 414 PSPLDAGAASGAFYARYPLL----TPSGEPWPGS 443
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 2045-2300 | 3.44e-07 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 56.17 E-value: 3.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2045 ALPSVFTT---VSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHS---------SE 2112
Cdd:NF033849 226 SLPMMYAAnlgQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESestgqsssvGT 305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2113 PTGIPHSTTSGEdAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSS--- 2189
Cdd:NF033849 306 SESQSHGTTEGT-STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSsrs 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2190 --------------PSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT--ASTHVSHTTDAGHSTvpsrpgDLSTSPAV 2253
Cdd:NF033849 385 sssgvsggfsggiaGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSssTGTSSGHSDSSSHST------SSGQADSV 458
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1953082137 2254 SGPTATGVPQESTDHSTMSHSSAVTHSFSSTFTEVDKSHIPTSSSRQ 2300
Cdd:NF033849 459 SQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQG 505
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 175-360 | 1.64e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.60 E-value: 1.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 175 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 254
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 255 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVV 332
Cdd:COG3469 103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
|
170 180
....*....|....*....|....*...
gi 1953082137 333 TGQAGSTPTGETTIIPTVPASSEPTAST 360
Cdd:COG3469 183 TTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 1819-2024 | 1.38e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 1.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1819 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 1898
Cdd:COG3469 6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 1978
Cdd:COG3469 86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1953082137 1979 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 2024
Cdd:COG3469 166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 272-477 | 1.38e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 1.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 272 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 351
Cdd:COG3469 6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 352 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 431
Cdd:COG3469 86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1953082137 432 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 477
Cdd:COG3469 166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1506-1948 | 1.55e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1506 QPHLLLQGPRSTPPLRPQELSTPsfsttASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAs 1585
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRA-----AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG- 2724
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1586 thvshtTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS--DSPQPPDSSATTftkgdASPMSTSSPTESLAT 1663
Cdd:PHA03247 2725 ------PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAG-----PPRRLTRPAVASLSE 2793
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1664 SPGSGPSAsPSATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLFHHHPDHWTQHYRPFGVALRLHNCVcfdRKRPLLQ 1743
Cdd:PHA03247 2794 SRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV---RRRPPSR 2869
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1744 KPPIrwVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTP--TTAGPFTTH 1821
Cdd:PHA03247 2870 SPAA--KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPLAPTT 2947
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1822 ADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTV---- 1897
Cdd:PHA03247 2948 DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR-VSSWASSLALHEETDPPPVslkq 3026
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1898 ---PASSEPTASTHVSHTTDAGRSTV------PSGPGDLSTSPAVSGPTATGVPQESTDH 1948
Cdd:PHA03247 3027 tlwPPDDTEDSDADSLFDSDSERSDLealdplPPEPHDPFAHEPDPATPEAGARESPSSQ 3086
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 948-1217 | 2.51e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 50.00 E-value: 2.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 948 SVFTTVSALTETTVTSETSYTVGDGSSASPSGpGQlSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAV 1027
Cdd:NF033849 266 SVGTSQSQSHTTGHGSTRGWSHTQSTSESEST-GQ-SSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSH 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1028 SASTPTTAGPFTTHADGGHTTTSLAAGSTiyTATAPSELSTPSFST--------------TASHSTDSEIPTSTSSPSEL 1093
Cdd:NF033849 344 SDGTSQSTSISHSESSSESTGTSVGHSTS--SSVSSSESSSRSSSSgvsggfsggiagggVTSEGLGASQGGSEGWGSGD 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1094 STHTVVTGQAGSTPTGETTIIPTVPASSEPT-ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS 1172
Cdd:NF033849 422 SVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSgQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSES 501
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1953082137 1173 DSPQPPDSSATTFTKGDASPMSTSSpTESLATSPGSGPSASPSAT 1217
Cdd:NF033849 502 VSQGDGRSTGRSESQGTSLGTSGGR-TSGAGGSMGLGPSISLGKS 545
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 2042-2243 | 2.57e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.75 E-value: 2.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2042 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 2121
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2122 SGEDAVSASTPTTAGPftthaDGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSelsthTVVTG 2201
Cdd:COG3469 103 SGANTGTSTVTTTSTG-----AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS-----TTTTT 172
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1953082137 2202 QAGSTPTGETTIIPTVPASSEPTASTHVSHTT-DAGHSTVPSR 2243
Cdd:COG3469 173 TSASTTPSATTTATATTASGATTPSATTTATTtGPPTPGLPKH 215
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 942-1127 | 3.02e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 3.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 942 ALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTT 1021
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1022 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVV 1099
Cdd:COG3469 103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
|
170 180
....*....|....*....|....*...
gi 1953082137 1100 TGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469 183 TTATATTASGATTPSATTTATTTGPPTP 210
|
|
| SEA | smart00200 | Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ... | 2344-2416 | 9.77e-05 |
|
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.
Pssm-ID: 214554 Cd Length: 121 Bit Score: 44.32 E-value: 9.77e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRLFNGSVVVEHDVVMETNYTSD 2416
Cdd:smart00200 12 LSVEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLkPDFVGTEVIEFRNGSVVVDLGLLFNEGVTNG 85
|
|
| COG4625 | COG4625 | Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... | 1761-2206 | 1.03e-04 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 47.85 E-value: 1.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:COG4625 59 TGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAG 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRSTDSAIPTSTSSpselSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTV 1920
Cdd:COG4625 139 GGGGGGGGGGAGGGGGGGAGGAGGGGGGG----GGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGA 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1921 PSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST 2000
Cdd:COG4625 215 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGG 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2001 FSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGS 2080
Cdd:COG4625 295 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGG 374
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2081 GQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 2160
Cdd:COG4625 375 GGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGG 454
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1953082137 2161 TAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTV---VTGQAGST 2206
Cdd:COG4625 455 GAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGggnYTQSAGST 503
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1761-2019 | 2.37e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 2.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSsepTGISHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTS 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRST 1919
Cdd:NF033849 307 ESQSHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSS 385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1920 VPSGPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSAS 1993
Cdd:NF033849 386 SSGVSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWS 465
|
250 260
....*....|....*....|....*.
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTG 2019
Cdd:NF033849 466 EGTGTSQGQSVGTSESWSTSQSETDS 491
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 859-1160 | 2.97e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.54 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 859 DSSATTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTg 937
Cdd:NF033849 237 QSAGTGYGESVGHSTSQGqSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE- 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 938 hsttalsalpSVFTTVSALTETTVTSETSYTVGDGSSASPSGpgqlSTTVSVSAQTTTGLVDGSSVYPGTPHS-SEPTGI 1016
Cdd:NF033849 316 ----------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGT----SQSTSISHSESSSESTGTSVGHSTSSSvSSSESS 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDA-VSASTPttAGPFTTHADGGHTTTSLAAG-STIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:NF033849 382 SRSSSSGVSGgFSGGIA--GGGVTSEGLGASQGGSEGWGsGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVS 459
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 ---THTVVTGQAGSTPTGETTIIPTVPASSEPTASTH-VSHTTDAGRSTVPSRPGDLSTSPAVSGPTATG 1160
Cdd:NF033849 460 qgtSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTgTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| Chi1 | COG3469 | Chitinase [Carbohydrate transport and metabolism]; | 1194-1384 | 5.22e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 5.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1194 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQedSSTTTLTTGHSTTALSALPSVFTTV 1273
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTT--STTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1274 SALTETTVTSETSYTVGDGSSVSPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1353
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|.
gi 1953082137 1354 TAGPFTTHADGGHTTTSLAAGSTIYTATAPL 1384
Cdd:COG3469 182 TTTATATTASGATTPSATTTATTTGPPTPGL 212
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1148-1376 | 6.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 45.38 E-value: 6.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1148 STSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTS-----SPTESLATSPGSGPSASPSATESTFS 1222
Cdd:NF033849 240 GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTqstseSESTGQSSSVGTSESQSHGTTEGTST 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1223 TI---VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSVSPSG 1299
Cdd:NF033849 320 TDsssHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1300 PGQLSTTVSVSAQTTTG-----------LVDGSSVYPGTPHSSEpTGISHSTTSGEdAVSASTPTTAGPFTTHADGGHTT 1368
Cdd:NF033849 400 GGVTSEGLGASQGGSEGwgsgdsvqsvsQSYGSSSSTGTSSGHS-DSSSHSTSSGQ-ADSVSQGTSWSEGTGTSQGQSVG 477
|
....*...
gi 1953082137 1369 TSLAAGST 1376
Cdd:NF033849 478 TSESWSTS 485
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1963-2209 | 7.09e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 45.38 E-value: 7.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1963 TTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFstiVSESSEYTVASYTTGSPSPS--SQEDSSTTTLTTGHS 2039
Cdd:NF033849 285 WSHTQSTSESESTGqSSSVGTSESQSHGTTEGTSTTDSSS---HSQSSSYNVSSGTGVSSSHSdgTSQSTSISHSESSSE 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2040 TTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHS 2119
Cdd:NF033849 362 STGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSG 441
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2120 TTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRST---DSVIPTSTSSPSELST 2195
Cdd:NF033849 442 HSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTgtsESVSQGDGRSTGRSES 515
|
250
....*....|....
gi 1953082137 2196 HTVVTGQAGSTPTG 2209
Cdd:NF033849 516 QGTSLGTSGGRTSG 529
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1547-1889 | 1.28e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 44.23 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGETTIIptvpaSSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTD 1626
Cdd:NF033849 241 TGYGESVGHSTSQGQSHSVGTSESHSV-----GTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 HGTISDSPQPPDSSATTFTKGDASPMSTSSpteslATSPGSGPSASPSATESTfSTIVSESSEYTVASYTTGSPSPSEpg 1706
Cdd:NF033849 316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSD-----GTSQSTSISHSESSSEST-GTSVGHSTSSSVSSSESSSRSSSS-- 387
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1707 rlfhHHPDHWTQHYRPFGVAlrlhncvcfdrkrpllqkppirwvteaprplqvraSLSTTVSVSAQTTTGLVDGSTVYPG 1786
Cdd:NF033849 388 ----GVSGGFSGGIAGGGVT-----------------------------------SEGLGASQGGSEGWGSGDSVQSVSQ 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1787 TPHSSEPTGISHSTTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRSTDSAIPT 1865
Cdd:NF033849 429 SYGSSSSTGTSSGHSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESV 502
|
330 340
....*....|....*....|....*..
gi 1953082137 1866 S---TSSPSELSTPTVVTGQAGSTPTG 1889
Cdd:NF033849 503 SqgdGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 763-1139 | 1.38e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.14 E-value: 1.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 763 IPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT----ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTA 838
Cdd:pfam05109 404 IITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTtglpSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTS 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 839 TGVPqestdhgtISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTiVSESSEYTVASYT 917
Cdd:pfam05109 484 GASP--------VTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAvTTPTPNATSPTLGK-TSPTSAVTTPTPN 554
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 918 TGSPSPSSQEDSSTTTLTTGHSTTALSAL--PSVFTTVSALTETTVTSETSYTVGDGSSASP--SGPGQLSTTVSVSAQ- 992
Cdd:pfam05109 555 ATSPTPAVTTPTPNATIPTLGKTSPTSAVttPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvTSPPKNATSAVTTGQh 634
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 993 -TTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSF 1071
Cdd:pfam05109 635 nITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN 714
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1072 STTASHSTDSEIPTSTSSPSELSTHtvvtgqagsTPTGETTIIPTVPAS-SEPTASTHVSHTTDAGRST 1139
Cdd:pfam05109 715 SSTSTKPGEVNVTKGTPPKNATSPQ---------APSGQKTAVPTVTSTgGKANSTTGGKHTTGHGART 774
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 1868-2251 | 2.08e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1868 SSPSELSTPTVVTGqAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVP-SGPGDLSTSPAVSGPTAT------- 1939
Cdd:PHA03307 45 SDSAELAAVTVVAG-AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPppptppp 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1940 GVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTIVSESSEYTVASYTT 2018
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2019 GSPSP-----SSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSET------SYTVGDGSSASPSGSGQPSTTV 2087
Cdd:PHA03307 204 PRPPRrsspiSASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECplprpaPITLPTRIWEASGWNGPSSRPG 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVyPGTPHSSE-PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGStiyTATAPSEL 2166
Cdd:PHA03307 284 PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP---SPSRPPPP 359
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2167 STPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVShttdaGHSTVPSRPGD 2246
Cdd:PHA03307 360 ADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS-----GAFYARYPLLT 434
|
....*
gi 1953082137 2247 LSTSP 2251
Cdd:PHA03307 435 PSGEP 439
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 217-472 | 2.68e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.46 E-value: 2.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 217 STTVSVSAQTTTGLVDGSTVYPGTPHSsepTGIPHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTATAP 296
Cdd:NF033849 248 GHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTSESQ 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 297 SELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRSTVPS 375
Cdd:NF033849 310 SHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSSSSG 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 376 GPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSASPSA 449
Cdd:NF033849 389 VSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWSEGT 468
|
250 260
....*....|....*....|...
gi 1953082137 450 TESTFSTIVSESSEYTVASYTTG 472
Cdd:NF033849 469 GTSQGQSVGTSESWSTSQSETDS 491
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
238-478 |
2.86e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 2.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 238 PGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATA--PSELSTPSFSTTASRSTDSA 315
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPerPRDDPAPGRVSRPRRARRLG 2671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 316 IPTSTSSPSELSTPTVVTGQAGST----------PTGETTIIPTVPASSEPTAsthvshtTDAGRSTVPSGPGDLSTSPA 385
Cdd:PHA03247 2672 RAAQASSPPQRPRRRAARPTVGSLtsladpppppPTPEPAPHALVSATPLPPG-------PAAARQASPALPAAPAPPAV 2744
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 386 vsgPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAsPSATESTFSTIVSESSEYT 465
Cdd:PHA03247 2745 ---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPP 2820
|
250
....*....|...
gi 1953082137 466 VASYTTGSPSPSS 478
Cdd:PHA03247 2821 AASPAGPLPPPTS 2833
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
17-395 |
6.88e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 6.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 17 ATATQGETTIIPTVPASSEPTAS--THVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPP--- 91
Cdd:PHA03307 57 AGAAACDRFEPPTGPPPGPGTEApaNESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDlse 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 92 ---DSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSA-TESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTL 167
Cdd:PHA03307 137 mlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSsPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 168 TTGHSTTAL--SALPSVFTTVSALtgttvTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGstvyPGTPHSSE 245
Cdd:PHA03307 217 ASSPAPAPGrsAADDAGASSSDSS-----SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS----SRPGPASS 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 246 PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSE 325
Cdd:PHA03307 288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1953082137 326 LSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLSTSPAVSGP--TATGVP 395
Cdd:PHA03307 368 RPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASGAFYARYPllTPSGEP 439
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1522-1707 |
7.22e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 41.66 E-value: 7.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1522 PQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPS 1601
Cdd:COG3469 29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1602 RPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFS 1681
Cdd:COG3469 109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
|
170 180
....*....|....*....|....*..
gi 1953082137 1682 TIVSESS-EYTVASYTTGSPSPSEPGR 1707
Cdd:COG3469 189 TASGATTpSATTTATTTGPPTPGLPKH 215
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SEA |
pfam01390 |
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ... |
2344-2426 |
6.70e-09 |
|
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.
Pssm-ID: 460188 Cd Length: 100 Bit Score: 55.32 E-value: 6.70e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRL--FNGSVVVehDVVME-----TNYTS 2415
Cdd:pfam01390 5 GSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSSLrKQYIKSHVLRLrpDGGSVVV--DVVLVfrfpsTEPAL 82
|
90
....*....|.
gi 1953082137 2416 DFQKLFENLIE 2426
Cdd:pfam01390 83 DREKLIEEILR 93
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
820-1245 |
3.46e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 59.41 E-value: 3.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 820 VPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATES 899
Cdd:PHA03307 43 LVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPP 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 900 TFS---TIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSAS 976
Cdd:PHA03307 123 PASpppSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAA 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 977 PSGPGQLSTTVSVSAqtttglvDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPftthaDGGHTTTSLAagsT 1056
Cdd:PHA03307 203 SPRPPRRSSPISASA-------SSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP-----LPRPAPITLP---T 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1057 IYTATAPSELSTPSFSTTASHSTDSE-----IPTSTSSPSELSTHTVVTGQAGSTPTGettiiptvpaSSEPTASTHVSH 1131
Cdd:PHA03307 268 RIWEASGWNGPSSRPGPASSSSSPRErspspSPSSPGSGPAPSSPRASSSSSSSRESS----------SSSTSSSSESSR 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1132 TTDAGRSTVPSRPGDLSTSPAVSGPTATgvPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSptESLATSPGSGPS 1211
Cdd:PHA03307 338 GAAVSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR--DATGRFPAGRPR 413
|
410 420 430
....*....|....*....|....*....|....
gi 1953082137 1212 ASPSATESTFSTIVSESSEYtvasYTTGSPSPSS 1245
Cdd:PHA03307 414 PSPLDAGAASGAFYARYPLL----TPSGEPWPGS 443
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
2045-2300 |
3.44e-07 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 56.17 E-value: 3.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2045 ALPSVFTT---VSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHS---------SE 2112
Cdd:NF033849 226 SLPMMYAAnlgQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESestgqsssvGT 305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2113 PTGIPHSTTSGEdAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSS--- 2189
Cdd:NF033849 306 SESQSHGTTEGT-STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSsrs 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2190 --------------PSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT--ASTHVSHTTDAGHSTvpsrpgDLSTSPAV 2253
Cdd:NF033849 385 sssgvsggfsggiaGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSssTGTSSGHSDSSSHST------SSGQADSV 458
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1953082137 2254 SGPTATGVPQESTDHSTMSHSSAVTHSFSSTFTEVDKSHIPTSSSRQ 2300
Cdd:NF033849 459 SQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQG 505
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
795-1268 |
1.41e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 795 PTVPASSEPTAsthvshttdAGRSTVPSRPGDLSTSPAV-SGPTATGVPQESTD-----------HGTISDSPQPPDSSA 862
Cdd:PHA03247 2553 PPLPPAAPPAA---------PDRSVPPPRPAPRPSEPAVtSRARRPDAPPQSARprapvddrgdpRGPAPPSPLPPDTHA 2623
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 863 TTFTKGDASPMSTSSPT-ESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSsqedsstttlttghstt 941
Cdd:PHA03247 2624 PDPPPPSPSPAANEPDPhPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR----------------- 2686
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 942 alsALPSVFTTVSALTETTvtsetsytvgdgssASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHST- 1020
Cdd:PHA03247 2687 ---AARPTVGSLTSLADPP--------------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPa 2749
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1021 TSGEDAVSASTPTTAGPFTTHADGG------HTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:PHA03247 2750 TPGGPARPARPPTTAGPPAPAPPAApaagppRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 THTVVTGQAGSTPTG--------ETTIIPTVPASSEPTASThvshttDAGRSTVPSRPGDLSTS-PAVSGPTAT-GVPQE 1164
Cdd:PHA03247 2830 PPTSAQPTAPPPPPGppppslplGGSVAPGGDVRRRPPSRS------PAAKPAAPARPPVRRLArPAVSRSTESfALPPD 2903
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1165 STDHGTISDSPQPPDSSATTFTKGDASPM--STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPS 1242
Cdd:PHA03247 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPppPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
|
490 500
....*....|....*....|....*.
gi 1953082137 1243 PSSQEDSSTTTLTTGHSTTALSALPS 1268
Cdd:PHA03247 2984 PSREAPASSTPPLTGHSLSRVSSWAS 3009
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
175-360 |
1.64e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.60 E-value: 1.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 175 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 254
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 255 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVV 332
Cdd:COG3469 103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
|
170 180
....*....|....*....|....*...
gi 1953082137 333 TGQAGSTPTGETTIIPTVPASSEPTAST 360
Cdd:COG3469 183 TTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1819-2024 |
1.38e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 1.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1819 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 1898
Cdd:COG3469 6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 1978
Cdd:COG3469 86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1953082137 1979 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 2024
Cdd:COG3469 166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
272-477 |
1.38e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 1.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 272 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 351
Cdd:COG3469 6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 352 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 431
Cdd:COG3469 86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1953082137 432 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 477
Cdd:COG3469 166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1506-1948 |
1.55e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1506 QPHLLLQGPRSTPPLRPQELSTPsfsttASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAs 1585
Cdd:PHA03247 2651 RPRDDPAPGRVSRPRRARRLGRA-----AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG- 2724
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1586 thvshtTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS--DSPQPPDSSATTftkgdASPMSTSSPTESLAT 1663
Cdd:PHA03247 2725 ------PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAG-----PPRRLTRPAVASLSE 2793
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1664 SPGSGPSAsPSATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLFHHHPDHWTQHYRPFGVALRLHNCVcfdRKRPLLQ 1743
Cdd:PHA03247 2794 SRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV---RRRPPSR 2869
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1744 KPPIrwVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTP--TTAGPFTTH 1821
Cdd:PHA03247 2870 SPAA--KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPLAPTT 2947
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1822 ADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTV---- 1897
Cdd:PHA03247 2948 DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR-VSSWASSLALHEETDPPPVslkq 3026
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1898 ---PASSEPTASTHVSHTTDAGRSTV------PSGPGDLSTSPAVSGPTATGVPQESTDH 1948
Cdd:PHA03247 3027 tlwPPDDTEDSDADSLFDSDSERSDLealdplPPEPHDPFAHEPDPATPEAGARESPSSQ 3086
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1748-1942 |
1.97e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.14 E-value: 1.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1748 RWVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHT 1827
Cdd:COG3469 6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1828 TTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1907
Cdd:COG3469 86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190
....*....|....*....|....*....|....*
gi 1953082137 1908 HVSHTTDAGRSTVPSGPGDlSTSPAVSGPTATGVP 1942
Cdd:COG3469 166 TSTTTTTTSASTTPSATTT-ATATTASGATTPSAT 199
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
948-1217 |
2.51e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 50.00 E-value: 2.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 948 SVFTTVSALTETTVTSETSYTVGDGSSASPSGpGQlSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAV 1027
Cdd:NF033849 266 SVGTSQSQSHTTGHGSTRGWSHTQSTSESEST-GQ-SSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSH 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1028 SASTPTTAGPFTTHADGGHTTTSLAAGSTiyTATAPSELSTPSFST--------------TASHSTDSEIPTSTSSPSEL 1093
Cdd:NF033849 344 SDGTSQSTSISHSESSSESTGTSVGHSTS--SSVSSSESSSRSSSSgvsggfsggiagggVTSEGLGASQGGSEGWGSGD 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1094 STHTVVTGQAGSTPTGETTIIPTVPASSEPT-ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS 1172
Cdd:NF033849 422 SVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSgQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSES 501
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1953082137 1173 DSPQPPDSSATTFTKGDASPMSTSSpTESLATSPGSGPSASPSAT 1217
Cdd:NF033849 502 VSQGDGRSTGRSESQGTSLGTSGGR-TSGAGGSMGLGPSISLGKS 545
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2042-2243 |
2.57e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.75 E-value: 2.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2042 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 2121
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2122 SGEDAVSASTPTTAGPftthaDGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSelsthTVVTG 2201
Cdd:COG3469 103 SGANTGTSTVTTTSTG-----AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS-----TTTTT 172
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1953082137 2202 QAGSTPTGETTIIPTVPASSEPTASTHVSHTT-DAGHSTVPSR 2243
Cdd:COG3469 173 TSASTTPSATTTATATTASGATTPSATTTATTtGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
942-1127 |
3.02e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 3.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 942 ALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTT 1021
Cdd:COG3469 23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1022 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVV 1099
Cdd:COG3469 103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
|
170 180
....*....|....*....|....*...
gi 1953082137 1100 TGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469 183 TTATATTASGATTPSATTTATTTGPPTP 210
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1566-2048 |
3.32e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 3.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1566 PTGETTIIPTVPAS--SEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQestdhgTISDSPQPPDSSATT 1643
Cdd:PHA03247 2562 AAPDRSVPPPRPAPrpSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPD------THAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1644 FTKGDASPmstssptesLATSPGSGPSASPSATEstfstiVSESSEYTVASYTTGSPSPsepgrlfhhhPDHWTQHYRPF 1723
Cdd:PHA03247 2636 NEPDPHPP---------PTVPPPERPRDDPAPGR------VSRPRRARRLGRAAQASSP----------PQRPRRRAARP 2690
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1724 GVAlRLHNCVCFDRKRPLLQKPPIRWVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHS-SEPTGISHSTTS 1802
Cdd:PHA03247 2691 TVG-SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARpARPPTTAGPPAP 2769
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1803 GEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRST-----DSAIPTSTSSPSELSTPT 1877
Cdd:PHA03247 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplpppTSAQPTAPPPPPGPPPPS 2849
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1878 VVTGqAGSTPTGETT-IIPTVPASSEPTASTHVSHTTDAgRSTVPSGPGDLSTSPAVSGPTATGVPQEstdhgtisdsPP 1956
Cdd:PHA03247 2850 LPLG-GSVAPGGDVRrRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTESFALPPDQPERPPQPQAPP----------PP 2917
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1957 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTT 2036
Cdd:PHA03247 2918 QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLT 2997
|
490
....*....|..
gi 1953082137 2037 GHSTTALSALPS 2048
Cdd:PHA03247 2998 GHSLSRVSSWAS 3009
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
187-395 |
3.46e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 3.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVypgtphSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVV------VAASGSAGSGTGTTAASSTAATSS 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 267 TAGPFTTHADGGHTTTSlaaGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTI 346
Cdd:COG3469 75 TTSTTATATAAAAAATS---TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTT 151
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 1953082137 347 IPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDlSTSPAVSGPTATGVP 395
Cdd:COG3469 152 TVSGTETATGGTTTTSTTTTTTSASTTPSATTT-ATATTASGATTPSAT 199
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1514-1998 |
6.39e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 6.39e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1514 PRSTPPLRPQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASthVSHTTD 1593
Cdd:PHA03247 2623 APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS--LTSLAD 2700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1594 AGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPqPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASP 1673
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1674 SATestfstiVSESSEYTVASYTTGSPSPSEPGrlfhhhpdhwtqhyrPFGVALRLHNCVCFDRKRPLLQKPPIRWVTEA 1753
Cdd:PHA03247 2780 PRR-------LTRPAVASLSESRESLPSPWDPA---------------DPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1754 PRPLQvraslsttvsvSAQTTTGLVDGSTVYPGTPHS---------SEPTGISHSTTSGEDAVSASTPTTAGPFTTHADG 1824
Cdd:PHA03247 2838 APPPP-----------PGPPPPSLPLGGSVAPGGDVRrrppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1825 GHTTTSLAAGSTIYTATAPSELSTPSFSTTAsRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPT 1904
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1905 ASTHVSHTTDAGRSTVpsgpgdlstsPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLAT 1984
Cdd:PHA03247 2986 REAPASSTPPLTGHSL----------SRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEAL 3055
|
490
....*....|....
gi 1953082137 1985 SPGSGPSASPSATE 1998
Cdd:PHA03247 3056 DPLPPEPHDPFAHE 3069
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2005-2212 |
7.46e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.21 E-value: 7.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2005 VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPS 2084
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2085 TTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEdAVSASTPTTAGPFT---THADGGHTTTSLAAGSTIYTAT 2161
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGS-VTSTTSSTAGSTTTsgaSATSSAGSTTTTTTVSGTETAT 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1953082137 2162 APSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETT 2212
Cdd:COG3469 161 GGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
968-1268 |
8.15e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 8.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 968 TVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADgght 1047
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML---- 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1048 TTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSElsthtvvtgQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:PHA03307 139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA---------RAPSSPPAEPPPSTPPAAASPRPPRR 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1128 HVSHTTDAGRSTvPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPG 1207
Cdd:PHA03307 210 SSPISASASSPA-PAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1208 SG-----PSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPS 1268
Cdd:PHA03307 289 SSprersPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1026-1244 |
9.00e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.83 E-value: 9.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1026 AVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGS 1105
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1106 TPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDspqpPDSSATTF 1185
Cdd:COG3469 83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTT----TTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1186 TKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVsesseyTVASYTTGSPSPS 1244
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSA------TTTATTTGPPTPG 211
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
107-319 |
9.00e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.83 E-value: 9.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSepTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS--ATSSAGSTTTTTTVSGTETA 159
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 267 TAGPFTTHADGGHTTTS-LAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTS 319
Cdd:COG3469 160 TGGTTTTSTTTTTTSAStTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| SEA |
smart00200 |
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ... |
2344-2416 |
9.77e-05 |
|
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.
Pssm-ID: 214554 Cd Length: 121 Bit Score: 44.32 E-value: 9.77e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRLFNGSVVVEHDVVMETNYTSD 2416
Cdd:smart00200 12 LSVEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLkPDFVGTEVIEFRNGSVVVDLGLLFNEGVTNG 85
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1974-2199 |
9.89e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.83 E-value: 9.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1974 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 2053
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2054 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSepTGIPHSTTSGEDAVSASTPT 2133
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS--ATSSAGSTTTTTTVSGTETA 159
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1953082137 2134 TAGPFTTHADGGHTTTS-LAAGSTIYTATAPSELSTPSFSTTASrstdsvipTSTSSPSELSTHTVV 2199
Cdd:COG3469 160 TGGTTTTSTTTTTTSAStTPSATTTATATTASGATTPSATTTAT--------TTGPPTPGLPKHVLV 218
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1761-2206 |
1.03e-04 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 47.85 E-value: 1.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:COG4625 59 TGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAG 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRSTDSAIPTSTSSpselSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTV 1920
Cdd:COG4625 139 GGGGGGGGGGAGGGGGGGAGGAGGGGGGG----GGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGA 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1921 PSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST 2000
Cdd:COG4625 215 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGG 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2001 FSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGS 2080
Cdd:COG4625 295 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGG 374
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2081 GQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 2160
Cdd:COG4625 375 GGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGG 454
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1953082137 2161 TAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTV---VTGQAGST 2206
Cdd:COG4625 455 GAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGggnYTQSAGST 503
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
874-1099 |
1.13e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.44 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 874 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 953
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 954 SALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1033
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1034 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTAShstdseipTSTSSPSELSTHTVV 1099
Cdd:COG3469 161 GGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT--------TTGPPTPGLPKHVLV 218
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1914-2123 |
1.44e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1914 DAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTdhgTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAS 1993
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVT---LTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTS 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGS 2073
Cdd:COG3469 78 TTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTE 157
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2074 SASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSG 2123
Cdd:COG3469 158 TATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1926-2137 |
1.46e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1926 DLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 2005
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2006 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQP 2083
Cdd:COG3469 81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 2084 STTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGP 2137
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
59-270 |
1.46e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 59 DLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 138
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 139 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQP 216
Cdd:COG3469 81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 217 STTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGP 270
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
107-298 |
2.32e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 2.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTAS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 187 SALTGTTVTSETSYTVGDGSSASPSGSGQPsTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469 104 GANTGTSTVTTTSTGAGSVTSTTSSTAGST-TTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
|
170 180 190
....*....|....*....|....*....|..
gi 1953082137 267 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSE 298
Cdd:COG3469 183 TTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1761-2019 |
2.37e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 2.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSsepTGISHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTS 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRST 1919
Cdd:NF033849 307 ESQSHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSS 385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1920 VPSGPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSAS 1993
Cdd:NF033849 386 SSGVSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWS 465
|
250 260
....*....|....*....|....*.
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTG 2019
Cdd:NF033849 466 EGTGTSQGQSVGTSESWSTSQSETDS 491
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
859-1160 |
2.97e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.54 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 859 DSSATTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTg 937
Cdd:NF033849 237 QSAGTGYGESVGHSTSQGqSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE- 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 938 hsttalsalpSVFTTVSALTETTVTSETSYTVGDGSSASPSGpgqlSTTVSVSAQTTTGLVDGSSVYPGTPHS-SEPTGI 1016
Cdd:NF033849 316 ----------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGT----SQSTSISHSESSSESTGTSVGHSTSSSvSSSESS 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDA-VSASTPttAGPFTTHADGGHTTTSLAAG-STIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:NF033849 382 SRSSSSGVSGgFSGGIA--GGGVTSEGLGASQGGSEGWGsGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVS 459
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 ---THTVVTGQAGSTPTGETTIIPTVPASSEPTASTH-VSHTTDAGRSTVPSRPGDLSTSPAVSGPTATG 1160
Cdd:NF033849 460 qgtSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTgTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
968-1166 |
3.98e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.90 E-value: 3.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 968 TVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHT 1047
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTAS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1048 TTSLAAGSTIYTATAPSELSTPSFSTTAshsTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469 104 GANTGTSTVTTTSTGAGSVTSTTSSTAG---STTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*....
gi 1953082137 1128 HVSHTTDAGRSTVpsrpgdlsTSPAVSGPTATGVPQEST 1166
Cdd:COG3469 181 ATTTATATTASGA--------TTPSATTTATTTGPPTPG 211
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1194-1384 |
5.22e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 5.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1194 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQedSSTTTLTTGHSTTALSALPSVFTTV 1273
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTT--STTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1274 SALTETTVTSETSYTVGDGSSVSPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1353
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|.
gi 1953082137 1354 TAGPFTTHADGGHTTTSLAAGSTIYTATAPL 1384
Cdd:COG3469 182 TTTATATTASGATTPSATTTATTTGPPTPGL 212
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1512-1923 |
5.32e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.55 E-value: 5.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1512 QGPRSTPPLRPQELSTPSFSTTASHSTDSEIPTSTSSPSelsthtvvtgqAGSTPTGETTIIPTVPASSEPTASTHVSHT 1591
Cdd:PHA03307 69 TGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT-----------PPGPSSPDPPPPTPPPASPPPSPAPDLSEM 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1592 TDAGRSTVPSRPGDLSTSPAVSGPTATGVP---------QESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLA 1662
Cdd:PHA03307 138 LRPVGSPGPPPAASPPAAGASPAAVASDAAssrqaalplSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1663 TSPGSGPSASP--SATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLfhhhpdhwtqhyrpfgvalrlhncvcfdrkrP 1740
Cdd:PHA03307 218 SSPAPAPGRSAadDAGASSSDSSSSESSGCGWGPENECPLPRPAPITL-------------------------------P 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1741 LLQKPPIRWVTEAPRPLqvraslsttvSVSAQTTTGLVDGSTVyPGTPHSSE-PTGISHSTTSGEDAVSASTPTTAGPFT 1819
Cdd:PHA03307 267 TRIWEASGWNGPSSRPG----------PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSES 335
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1820 THADGGHTTTSLAAGStiyTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIP-TVP 1898
Cdd:PHA03307 336 SRGAAVSPGPSPSRSP---SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPaGRP 412
|
410 420
....*....|....*....|....*
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSG 1923
Cdd:PHA03307 413 RPSPLDAGAASGAFYARYPLLTPSG 437
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1547-1942 |
5.60e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.55 E-value: 5.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGettiIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQEstd 1626
Cdd:PHA03307 45 SDSAELAAVTVVAGAAACDRFE----PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--- 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 hGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST-----FSTIVSESSEytvASYTTGSPS 1701
Cdd:PHA03307 118 -PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSrqaalPLSSPEETAR---APSSPPAEP 193
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1702 PSEPGRLFHHHPDH----WTQHYRPFGVALRLHNCVcFDRKRPLLQKPPIRWV-------TEAPRPlqvRASLSTTVSVS 1770
Cdd:PHA03307 194 PPSTPPAAASPRPPrrssPISASASSPAPAPGRSAA-DDAGASSSDSSSSESSgcgwgpeNECPLP---RPAPITLPTRI 269
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1771 AQTTTGLVDGstvyPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPS 1850
Cdd:PHA03307 270 WEASGWNGPS----SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1851 FSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLST 1929
Cdd:PHA03307 346 SPSRSPSPSRPPPPADPSSPRKRPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASG 424
|
410
....*....|....*
gi 1953082137 1930 SPAVSGP--TATGVP 1942
Cdd:PHA03307 425 AFYARYPllTPSGEP 439
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1148-1376 |
6.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 45.38 E-value: 6.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1148 STSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTS-----SPTESLATSPGSGPSASPSATESTFS 1222
Cdd:NF033849 240 GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTqstseSESTGQSSSVGTSESQSHGTTEGTST 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1223 TI---VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSVSPSG 1299
Cdd:NF033849 320 TDsssHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1300 PGQLSTTVSVSAQTTTG-----------LVDGSSVYPGTPHSSEpTGISHSTTSGEdAVSASTPTTAGPFTTHADGGHTT 1368
Cdd:NF033849 400 GGVTSEGLGASQGGSEGwgsgdsvqsvsQSYGSSSSTGTSSGHS-DSSSHSTSSGQ-ADSVSQGTSWSEGTGTSQGQSVG 477
|
....*...
gi 1953082137 1369 TSLAAGST 1376
Cdd:NF033849 478 TSESWSTS 485
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1146-1357 |
6.92e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.13 E-value: 6.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1146 DLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 1225
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1226 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTETTVTSETSYTVGDGSSVSPSGPGQL 1303
Cdd:COG3469 81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 1304 STTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGP 1357
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1963-2209 |
7.09e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 45.38 E-value: 7.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1963 TTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFstiVSESSEYTVASYTTGSPSPS--SQEDSSTTTLTTGHS 2039
Cdd:NF033849 285 WSHTQSTSESESTGqSSSVGTSESQSHGTTEGTSTTDSSS---HSQSSSYNVSSGTGVSSSHSdgTSQSTSISHSESSSE 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2040 TTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHS 2119
Cdd:NF033849 362 STGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSG 441
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2120 TTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRST---DSVIPTSTSSPSELST 2195
Cdd:NF033849 442 HSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTgtsESVSQGDGRSTGRSES 515
|
250
....*....|....
gi 1953082137 2196 HTVVTGQAGSTPTG 2209
Cdd:NF033849 516 QGTSLGTSGGRTSG 529
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1827-2233 |
7.21e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 45.29 E-value: 7.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1827 TTTSLAAGSTIYTATAPSELSTPSFSTTASrstdSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIptvpasSEPTAS 1906
Cdd:pfam05109 410 TNATTTTHKVIFSKAPESTTTSPTLNTTGF----AAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADV------TSPTPA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1907 THVSHTTDAGRSTVPSGPGDLSTSPAVSGPTaTGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSP 1986
Cdd:pfam05109 480 GTTSGASPVTPSPSPRDNGTESKAPDMTSPT-SAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSP 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1987 gsGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEdSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETS 2066
Cdd:pfam05109 559 --TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGE-TSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHN 635
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2067 YTvgdgSSASPSGSGQPSttvSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHST-----TSGEDAVSASTPTTAGPFTTH 2141
Cdd:pfam05109 636 IT----SSSTSSMSLRPS---SISETLSPSTSDNSTSHMPLLTSAHPTGGENITqvtpaSTSTHHVSTSSPAPRPGTTSQ 708
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2142 ADG-GHTTTSLAAGSTIYTATAPSELSTpsfSTTASRSTDSVIPTSTSSpselsthtvvTGQAGSTPTGETTIIPTVPAS 2220
Cdd:pfam05109 709 ASGpGNSSTSTKPGEVNVTKGTPPKNAT---SPQAPSGQKTAVPTVTST----------GGKANSTTGGKHTTGHGARTS 775
|
410
....*....|...
gi 1953082137 2221 SEPTASTHVSHTT 2233
Cdd:pfam05109 776 TEPTTDYGGDSTT 788
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
826-1037 |
8.07e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 8.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 826 DLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 905
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 906 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQL 983
Cdd:COG3469 81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 984 STTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGP 1037
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1758-1907 |
8.21e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 8.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1758 QVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTI 1837
Cdd:COG3469 64 TAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSS 143
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1838 YTATAPSELSTPSFSTTASRSTDsaiPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1907
Cdd:COG3469 144 AGSTTTTTTVSGTETATGGTTTT---STTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
902-1110 |
1.23e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 1.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 902 STIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPG 981
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 982 QLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTAT 1061
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 1062 APSELS-----TPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGE 1110
Cdd:COG3469 161 GGTTTTsttttTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1547-1889 |
1.28e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 44.23 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGETTIIptvpaSSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTD 1626
Cdd:NF033849 241 TGYGESVGHSTSQGQSHSVGTSESHSV-----GTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 HGTISDSPQPPDSSATTFTKGDASPMSTSSpteslATSPGSGPSASPSATESTfSTIVSESSEYTVASYTTGSPSPSEpg 1706
Cdd:NF033849 316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSD-----GTSQSTSISHSESSSEST-GTSVGHSTSSSVSSSESSSRSSSS-- 387
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1707 rlfhHHPDHWTQHYRPFGVAlrlhncvcfdrkrpllqkppirwvteaprplqvraSLSTTVSVSAQTTTGLVDGSTVYPG 1786
Cdd:NF033849 388 ----GVSGGFSGGIAGGGVT-----------------------------------SEGLGASQGGSEGWGSGDSVQSVSQ 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1787 TPHSSEPTGISHSTTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRSTDSAIPT 1865
Cdd:NF033849 429 SYGSSSSTGTSSGHSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESV 502
|
330 340
....*....|....*....|....*..
gi 1953082137 1866 S---TSSPSELSTPTVVTGQAGSTPTG 1889
Cdd:NF033849 503 SqgdGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
763-1139 |
1.38e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.14 E-value: 1.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 763 IPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT----ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTA 838
Cdd:pfam05109 404 IITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTtglpSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTS 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 839 TGVPqestdhgtISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTiVSESSEYTVASYT 917
Cdd:pfam05109 484 GASP--------VTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAvTTPTPNATSPTLGK-TSPTSAVTTPTPN 554
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 918 TGSPSPSSQEDSSTTTLTTGHSTTALSAL--PSVFTTVSALTETTVTSETSYTVGDGSSASP--SGPGQLSTTVSVSAQ- 992
Cdd:pfam05109 555 ATSPTPAVTTPTPNATIPTLGKTSPTSAVttPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvTSPPKNATSAVTTGQh 634
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 993 -TTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSF 1071
Cdd:pfam05109 635 nITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN 714
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1072 STTASHSTDSEIPTSTSSPSELSTHtvvtgqagsTPTGETTIIPTVPAS-SEPTASTHVSHTTDAGRST 1139
Cdd:pfam05109 715 SSTSTKPGEVNVTKGTPPKNATSPQ---------APSGQKTAVPTVTSTgGKANSTTGGKHTTGHGART 774
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
17-471 |
1.54e-03 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 44.00 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 17 ATATQGETTIIPTVPASSEPTASTHVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSAT 96
Cdd:COG4625 51 GGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAG 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 97 TFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTAL 176
Cdd:COG4625 131 GGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGG 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 177 SALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTphSSEPTGIPHSTTSG 256
Cdd:COG4625 211 GGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG--SGGGGGGGGGGGSG 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 257 EDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQA 336
Cdd:COG4625 289 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGG 368
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 337 GSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSAT 416
Cdd:COG4625 369 GGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGG 448
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1953082137 417 TFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTT 471
Cdd:COG4625 449 GGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGST 503
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1868-2251 |
2.08e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1868 SSPSELSTPTVVTGqAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVP-SGPGDLSTSPAVSGPTAT------- 1939
Cdd:PHA03307 45 SDSAELAAVTVVAG-AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPppptppp 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1940 GVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTIVSESSEYTVASYTT 2018
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2019 GSPSP-----SSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSET------SYTVGDGSSASPSGSGQPSTTV 2087
Cdd:PHA03307 204 PRPPRrsspiSASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECplprpaPITLPTRIWEASGWNGPSSRPG 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVyPGTPHSSE-PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGStiyTATAPSEL 2166
Cdd:PHA03307 284 PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP---SPSRPPPP 359
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2167 STPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVShttdaGHSTVPSRPGD 2246
Cdd:PHA03307 360 ADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS-----GAFYARYPLLT 434
|
....*
gi 1953082137 2247 LSTSP 2251
Cdd:PHA03307 435 PSGEP 439
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
217-472 |
2.68e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.46 E-value: 2.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 217 STTVSVSAQTTTGLVDGSTVYPGTPHSsepTGIPHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTATAP 296
Cdd:NF033849 248 GHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTSESQ 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 297 SELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRSTVPS 375
Cdd:NF033849 310 SHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSSSSG 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 376 GPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSASPSA 449
Cdd:NF033849 389 VSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWSEGT 468
|
250 260
....*....|....*....|...
gi 1953082137 450 TESTFSTIVSESSEYTVASYTTG 472
Cdd:NF033849 469 GTSQGQSVGTSESWSTSQSETDS 491
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
238-478 |
2.86e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 2.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 238 PGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATA--PSELSTPSFSTTASRSTDSA 315
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPerPRDDPAPGRVSRPRRARRLG 2671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 316 IPTSTSSPSELSTPTVVTGQAGST----------PTGETTIIPTVPASSEPTAsthvshtTDAGRSTVPSGPGDLSTSPA 385
Cdd:PHA03247 2672 RAAQASSPPQRPRRRAARPTVGSLtsladpppppPTPEPAPHALVSATPLPPG-------PAAARQASPALPAAPAPPAV 2744
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 386 vsgPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAsPSATESTFSTIVSESSEYT 465
Cdd:PHA03247 2745 ---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPP 2820
|
250
....*....|...
gi 1953082137 466 VASYTTGSPSPSS 478
Cdd:PHA03247 2821 AASPAGPLPPPTS 2833
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
754-927 |
3.04e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.24 E-value: 3.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 754 TASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGEttiiPTVPASSEPTASTHVSHTTDAGRSTVPSrPGDLSTSPAV 833
Cdd:PHA03307 235 SSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWE----ASGWNGPSSRPGPASSSSSPRERSPSPS-PSSPGSGPAP 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 834 SGPTATG--VPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSP---GSGPSASPSATESTFSTIVSES 908
Cdd:PHA03307 310 SSPRASSssSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSprkRPRPSRAPSSPAASAGRPTRRR 389
|
170
....*....|....*....
gi 1953082137 909 SEYTVASYTTGSPSPSSQE 927
Cdd:PHA03307 390 ARAAVAGRARRRDATGRFP 408
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1848-2063 |
3.47e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 3.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1848 TPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTiiPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDL 1927
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVST--TGSVVVAASGSAGSGTGTTAASSTAATSSTTST 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1928 STSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSE 2007
Cdd:COG3469 79 TATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 2008 SSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTS 2063
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
777-1238 |
3.93e-03 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 42.84 E-value: 3.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 777 TVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQ 856
Cdd:COG4625 44 GGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGG 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 857 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTT 936
Cdd:COG4625 124 GGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGG 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 937 GHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTphSSEPTGI 1016
Cdd:COG4625 204 GGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG--SGGGGGG 281
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTH 1096
Cdd:COG4625 282 GGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGG 361
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1097 TVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQ 1176
Cdd:COG4625 362 GTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAG 441
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1953082137 1177 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTT 1238
Cdd:COG4625 442 GGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGST 503
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
752-948 |
6.60e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 6.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 752 STTASHSTDSEIPTSTSSPSELSthtVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTD-----AGRSTVPSRPGD 826
Cdd:PHA03307 151 SPPAAGASPAAVASDAASSRQAA---LPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSspisaSASSPAPAPGRS 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 827 LSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-----PSASPSATESTF 901
Cdd:PHA03307 228 AADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSprersPSPSPSSPGSGP 307
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1953082137 902 STIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPS 948
Cdd:PHA03307 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
17-395 |
6.88e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 6.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 17 ATATQGETTIIPTVPASSEPTAS--THVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPP--- 91
Cdd:PHA03307 57 AGAAACDRFEPPTGPPPGPGTEApaNESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDlse 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 92 ---DSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSA-TESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTL 167
Cdd:PHA03307 137 mlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSsPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 168 TTGHSTTAL--SALPSVFTTVSALtgttvTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGstvyPGTPHSSE 245
Cdd:PHA03307 217 ASSPAPAPGrsAADDAGASSSDSS-----SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS----SRPGPASS 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 246 PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSE 325
Cdd:PHA03307 288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1953082137 326 LSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLSTSPAVSGP--TATGVP 395
Cdd:PHA03307 368 RPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASGAFYARYPllTPSGEP 439
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1522-1707 |
7.22e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 41.66 E-value: 7.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1522 PQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPS 1601
Cdd:COG3469 29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1602 RPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFS 1681
Cdd:COG3469 109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
|
170 180
....*....|....*....|....*..
gi 1953082137 1682 TIVSESS-EYTVASYTTGSPSPSEPGR 1707
Cdd:COG3469 189 TASGATTpSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2011-2207 |
7.54e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 41.66 E-value: 7.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2011 YTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSE---TSYTVGDGSSASPSGSGQPSTTV 2087
Cdd:COG3469 15 ASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASStaaTSSTTSTTATATAAAAAATSTSA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELS 2167
Cdd:COG3469 95 TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTS 174
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1953082137 2168 TPSFSTTASRSTDSViPTSTSSPSELSTHTVVTGQAGSTP 2207
Cdd:COG3469 175 ASTTPSATTTATATT-ASGATTPSATTTATTTGPPTPGLP 213
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1837-2239 |
8.72e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.44 E-value: 8.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1837 IYTATAPSELSTPSFSTTASRSTDSA---IPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTT 1913
Cdd:pfam05109 304 VFSDEIPASQDMPTNTTDITYVGDNAtysVPMVTSEDANSPNVTVTAFWAWPNNTETDFKCKWTLTSGTPSGCENISGAF 383
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1914 DAGRSTVPSGPGdLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTF-TKGDASPMSTSSPTESLATSPGSGPSA 1992
Cdd:pfam05109 384 ASNRTFDITVSG-LGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLnTTGFAAPNTTTGLPSSTHVPTNLTAPA 462
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1993 SPSATESTfSTIVSESSEYTVASYTTGSPSPSSQEDSSTttlttghsttalSALPSVFTTVSALTGTT--VTSETSYTVG 2070
Cdd:pfam05109 463 STGPTVST-ADVTSPTPAGTTSGASPVTPSPSPRDNGTE------------SKAPDMTSPTSAVTTPTpnATSPTPAVTT 529
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2071 DGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTgipHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTS 2150
Cdd:pfam05109 530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPT---LGKTSPTSAVTTPTPNATSPTVGETSPQANTTN 606
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2151 LAAGStiyTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVpASSEPTASTHVS 2230
Cdd:pfam05109 607 HTLGG---TSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLL-TSAHPTGGENIT 682
|
....*....
gi 1953082137 2231 HTTDAGHST 2239
Cdd:pfam05109 683 QVTPASTST 691
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
107-596 |
9.00e-03 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 41.69 E-value: 9.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG4625 2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG4625 82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 267 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTI 346
Cdd:COG4625 162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 347 IPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPM 426
Cdd:COG4625 242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 427 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 506
Cdd:COG4625 322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 507 SALTGNDRYFRNLLYELSTPSFSTTASHSTDSEIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTESTDHGTISDSPQ 586
Cdd:COG4625 402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGN 481
|
490
....*....|
gi 1953082137 587 PPDSSATTFT 596
Cdd:COG4625 482 NTYTGTTTVN 491
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
254-463 |
9.13e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 41.57 E-value: 9.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 254 TSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGST---IYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELS--- 327
Cdd:COG5665 245 TPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTtsnTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSVLInsd 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 328 TPTVVTGQAGSTPTGETTIIPTVPASsepTASTHVSHTTDAGRSTVPSGPGDLSTSpaVSGPTATGVPQESTDHGTISDS 407
Cdd:COG5665 325 SPTSEDPATASVPTTEETTAFTTPSS---VPSTPAEKDTPATDLATPVSPTPPETS--VDKKVSPDSATSSTKSEKEGGT 399
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 408 PPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSE 463
Cdd:COG5665 400 ASSPMPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAG 455
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
1801-2010 |
9.13e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 41.57 E-value: 9.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1801 TSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGST---IYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELS--- 1874
Cdd:COG5665 245 TPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTtsnTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSVLInsd 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1875 TPTVVTGQAGSTPTGETTIIPTVPASsepTASTHVSHTTDAGRSTVPSGPGDLSTSpaVSGPTATGVPQESTDHGTISDS 1954
Cdd:COG5665 325 SPTSEDPATASVPTTEETTAFTTPSS---VPSTPAEKDTPATDLATPVSPTPPETS--VDKKVSPDSATSSTKSEKEGGT 399
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1955 PPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSE 2010
Cdd:COG5665 400 ASSPMPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAG 455
|
|
|