Conserved domains on  [gi|1953082137|ref|XP_038397638|]
mucin-12-like isoform X4 [Canis lupus familiaris]

List of domain hits

Name Accession Description Interval E-value
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
2344-2426 6.70e-09

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.



Pssm-ID: 460188  Cd Length: 100  Bit Score: 55.32  E-value: 6.70e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRL--FNGSVVVehDVVME-----TNYTS 2415
Cdd:pfam01390    5 GSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSSLrKQYIKSHVLRLrpDGGSVVV--DVVLVfrfpsTEPAL 82
                           90
                   ....*....|.
gi 1953082137 2416 DFQKLFENLIE 2426
Cdd:pfam01390   83 DREKLIEEILR 93
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
820-1245 3.46e-08

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 59.41  E-value: 3.46e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  820 VPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATES 899
Cdd:PHA03307    43 LVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPP 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  900 TFS---TIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSAS 976
Cdd:PHA03307   123 PASpppSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAA 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  977 PSGPGQLSTTVSVSAqtttglvDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPftthaDGGHTTTSLAagsT 1056
Cdd:PHA03307   203 SPRPPRRSSPISASA-------SSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP-----LPRPAPITLP---T 267
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1057 IYTATAPSELSTPSFSTTASHSTDSE-----IPTSTSSPSELSTHTVVTGQAGSTPTGettiiptvpaSSEPTASTHVSH 1131
Cdd:PHA03307   268 RIWEASGWNGPSSRPGPASSSSSPRErspspSPSSPGSGPAPSSPRASSSSSSSRESS----------SSSTSSSSESSR 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1132 TTDAGRSTVPSRPGDLSTSPAVSGPTATgvPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSptESLATSPGSGPS 1211
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR--DATGRFPAGRPR 413
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1953082137 1212 ASPSATESTFSTIVSESSEYtvasYTTGSPSPSS 1245
Cdd:PHA03307   414 PSPLDAGAASGAFYARYPLL----TPSGEPWPGS 443
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2045-2300 3.44e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2045 ALPSVFTT---VSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHS---------SE 2112
Cdd:NF033849   226 SLPMMYAAnlgQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESestgqsssvGT 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2113 PTGIPHSTTSGEdAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSS--- 2189
Cdd:NF033849   306 SESQSHGTTEGT-STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSsrs 384
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2190 --------------PSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT--ASTHVSHTTDAGHSTvpsrpgDLSTSPAV 2253
Cdd:NF033849   385 sssgvsggfsggiaGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSssTGTSSGHSDSSSHST------SSGQADSV 458
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1953082137 2254 SGPTATGVPQESTDHSTMSHSSAVTHSFSSTFTEVDKSHIPTSSSRQ 2300
Cdd:NF033849   459 SQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQG 505
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
175-360 1.64e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.60  E-value: 1.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  175 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 254
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  255 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVV 332
Cdd:COG3469    103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
                          170       180
                   ....*....|....*....|....*...
gi 1953082137  333 TGQAGSTPTGETTIIPTVPASSEPTAST 360
Cdd:COG3469    183 TTATATTASGATTPSATTTATTTGPPTP 210
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
1819-2024 1.38e-05

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1819 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 1898
Cdd:COG3469      6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 1978
Cdd:COG3469     86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1953082137 1979 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 2024
Cdd:COG3469    166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
272-477 1.38e-05

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  272 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 351
Cdd:COG3469      6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  352 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 431
Cdd:COG3469     86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1953082137  432 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 477
Cdd:COG3469    166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1506-1948 1.55e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1506 QPHLLLQGPRSTPPLRPQELSTPsfsttASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAs 1585
Cdd:PHA03247  2651 RPRDDPAPGRVSRPRRARRLGRA-----AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG- 2724
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1586 thvshtTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS--DSPQPPDSSATTftkgdASPMSTSSPTESLAT 1663
Cdd:PHA03247  2725 ------PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAG-----PPRRLTRPAVASLSE 2793
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1664 SPGSGPSAsPSATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLFHHHPDHWTQHYRPFGVALRLHNCVcfdRKRPLLQ 1743
Cdd:PHA03247  2794 SRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV---RRRPPSR 2869
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1744 KPPIrwVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTP--TTAGPFTTH 1821
Cdd:PHA03247  2870 SPAA--KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPLAPTT 2947
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1822 ADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTV---- 1897
Cdd:PHA03247  2948 DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR-VSSWASSLALHEETDPPPVslkq 3026
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1898 ---PASSEPTASTHVSHTTDAGRSTV------PSGPGDLSTSPAVSGPTATGVPQESTDH 1948
Cdd:PHA03247  3027 tlwPPDDTEDSDADSLFDSDSERSDLealdplPPEPHDPFAHEPDPATPEAGARESPSSQ 3086
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
1194-1384 5.22e-04

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 5.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1194 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQedSSTTTLTTGHSTTALSALPSVFTTV 1273
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTT--STTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1274 SALTETTVTSETSYTVGDGSSVSPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1353
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1953082137 1354 TAGPFTTHADGGHTTTSLAAGSTIYTATAPL 1384
Cdd:COG3469    182 TTTATATTASGATTPSATTTATTTGPPTPGL 212
 
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
948-1217 2.51e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.00  E-value: 2.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  948 SVFTTVSALTETTVTSETSYTVGDGSSASPSGpGQlSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAV 1027
Cdd:NF033849   266 SVGTSQSQSHTTGHGSTRGWSHTQSTSESEST-GQ-SSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSH 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1028 SASTPTTAGPFTTHADGGHTTTSLAAGSTiyTATAPSELSTPSFST--------------TASHSTDSEIPTSTSSPSEL 1093
Cdd:NF033849   344 SDGTSQSTSISHSESSSESTGTSVGHSTS--SSVSSSESSSRSSSSgvsggfsggiagggVTSEGLGASQGGSEGWGSGD 421
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1094 STHTVVTGQAGSTPTGETTIIPTVPASSEPT-ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS 1172
Cdd:NF033849   422 SVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSgQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSES 501
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1953082137 1173 DSPQPPDSSATTFTKGDASPMSTSSpTESLATSPGSGPSASPSAT 1217
Cdd:NF033849   502 VSQGDGRSTGRSESQGTSLGTSGGR-TSGAGGSMGLGPSISLGKS 545
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2042-2243 2.57e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 2.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2042 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 2121
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2122 SGEDAVSASTPTTAGPftthaDGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSelsthTVVTG 2201
Cdd:COG3469    103 SGANTGTSTVTTTSTG-----AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS-----TTTTT 172
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1953082137 2202 QAGSTPTGETTIIPTVPASSEPTASTHVSHTT-DAGHSTVPSR 2243
Cdd:COG3469    173 TSASTTPSATTTATATTASGATTPSATTTATTtGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
942-1127 3.02e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 3.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  942 ALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTT 1021
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1022 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVV 1099
Cdd:COG3469    103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
                          170       180
                   ....*....|....*....|....*...
gi 1953082137 1100 TGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469    183 TTATATTASGATTPSATTTATTTGPPTP 210
SEA smart00200
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ...
2344-2416 9.77e-05

Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.


Pssm-ID: 214554  Cd Length: 121  Bit Score: 44.32  E-value: 9.77e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1953082137  2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRLFNGSVVVEHDVVMETNYTSD 2416
Cdd:smart00200   12 LSVEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLkPDFVGTEVIEFRNGSVVVDLGLLFNEGVTNG 85
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1761-2206 1.03e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 47.85  E-value: 1.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:COG4625     59 TGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAG 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRSTDSAIPTSTSSpselSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTV 1920
Cdd:COG4625    139 GGGGGGGGGGAGGGGGGGAGGAGGGGGGG----GGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGA 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1921 PSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST 2000
Cdd:COG4625    215 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGG 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2001 FSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGS 2080
Cdd:COG4625    295 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGG 374
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2081 GQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 2160
Cdd:COG4625    375 GGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGG 454
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1953082137 2161 TAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTV---VTGQAGST 2206
Cdd:COG4625    455 GAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGggnYTQSAGST 503
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1761-2019 2.37e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSsepTGISHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:NF033849   245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRST 1919
Cdd:NF033849   307 ESQSHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSS 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1920 VPSGPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSAS 1993
Cdd:NF033849   386 SSGVSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWS 465
                          250       260
                   ....*....|....*....|....*.
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTG 2019
Cdd:NF033849   466 EGTGTSQGQSVGTSESWSTSQSETDS 491
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
859-1160 2.97e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.54  E-value: 2.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  859 DSSATTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTg 937
Cdd:NF033849   237 QSAGTGYGESVGHSTSQGqSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE- 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  938 hsttalsalpSVFTTVSALTETTVTSETSYTVGDGSSASPSGpgqlSTTVSVSAQTTTGLVDGSSVYPGTPHS-SEPTGI 1016
Cdd:NF033849   316 ----------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGT----SQSTSISHSESSSESTGTSVGHSTSSSvSSSESS 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDA-VSASTPttAGPFTTHADGGHTTTSLAAG-STIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:NF033849   382 SRSSSSGVSGgFSGGIA--GGGVTSEGLGASQGGSEGWGsGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVS 459
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 ---THTVVTGQAGSTPTGETTIIPTVPASSEPTASTH-VSHTTDAGRSTVPSRPGDLSTSPAVSGPTATG 1160
Cdd:NF033849   460 qgtSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTgTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1194-1384 5.22e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 5.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1194 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQedSSTTTLTTGHSTTALSALPSVFTTV 1273
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTT--STTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1274 SALTETTVTSETSYTVGDGSSVSPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1353
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1953082137 1354 TAGPFTTHADGGHTTTSLAAGSTIYTATAPL 1384
Cdd:COG3469    182 TTTATATTASGATTPSATTTATTTGPPTPGL 212
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1148-1376 6.79e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 45.38  E-value: 6.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1148 STSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTS-----SPTESLATSPGSGPSASPSATESTFS 1222
Cdd:NF033849   240 GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTqstseSESTGQSSSVGTSESQSHGTTEGTST 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1223 TI---VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSVSPSG 1299
Cdd:NF033849   320 TDsssHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1300 PGQLSTTVSVSAQTTTG-----------LVDGSSVYPGTPHSSEpTGISHSTTSGEdAVSASTPTTAGPFTTHADGGHTT 1368
Cdd:NF033849   400 GGVTSEGLGASQGGSEGwgsgdsvqsvsQSYGSSSSTGTSSGHS-DSSSHSTSSGQ-ADSVSQGTSWSEGTGTSQGQSVG 477

                   ....*...
gi 1953082137 1369 TSLAAGST 1376
Cdd:NF033849   478 TSESWSTS 485
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1963-2209 7.09e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 45.38  E-value: 7.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1963 TTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFstiVSESSEYTVASYTTGSPSPS--SQEDSSTTTLTTGHS 2039
Cdd:NF033849   285 WSHTQSTSESESTGqSSSVGTSESQSHGTTEGTSTTDSSS---HSQSSSYNVSSGTGVSSSHSdgTSQSTSISHSESSSE 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2040 TTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHS 2119
Cdd:NF033849   362 STGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSG 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2120 TTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRST---DSVIPTSTSSPSELST 2195
Cdd:NF033849   442 HSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTgtsESVSQGDGRSTGRSES 515
                          250
                   ....*....|....
gi 1953082137 2196 HTVVTGQAGSTPTG 2209
Cdd:NF033849   516 QGTSLGTSGGRTSG 529
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1547-1889 1.28e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.23  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGETTIIptvpaSSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTD 1626
Cdd:NF033849   241 TGYGESVGHSTSQGQSHSVGTSESHSV-----GTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 HGTISDSPQPPDSSATTFTKGDASPMSTSSpteslATSPGSGPSASPSATESTfSTIVSESSEYTVASYTTGSPSPSEpg 1706
Cdd:NF033849   316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSD-----GTSQSTSISHSESSSEST-GTSVGHSTSSSVSSSESSSRSSSS-- 387
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1707 rlfhHHPDHWTQHYRPFGVAlrlhncvcfdrkrpllqkppirwvteaprplqvraSLSTTVSVSAQTTTGLVDGSTVYPG 1786
Cdd:NF033849   388 ----GVSGGFSGGIAGGGVT-----------------------------------SEGLGASQGGSEGWGSGDSVQSVSQ 428
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1787 TPHSSEPTGISHSTTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRSTDSAIPT 1865
Cdd:NF033849   429 SYGSSSSTGTSSGHSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESV 502
                          330       340
                   ....*....|....*....|....*..
gi 1953082137 1866 S---TSSPSELSTPTVVTGQAGSTPTG 1889
Cdd:NF033849   503 SqgdGRSTGRSESQGTSLGTSGGRTSG 529
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
763-1139 1.38e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  763 IPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT----ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTA 838
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTtglpSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTS 483
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  839 TGVPqestdhgtISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTiVSESSEYTVASYT 917
Cdd:pfam05109  484 GASP--------VTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAvTTPTPNATSPTLGK-TSPTSAVTTPTPN 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  918 TGSPSPSSQEDSSTTTLTTGHSTTALSAL--PSVFTTVSALTETTVTSETSYTVGDGSSASP--SGPGQLSTTVSVSAQ- 992
Cdd:pfam05109  555 ATSPTPAVTTPTPNATIPTLGKTSPTSAVttPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvTSPPKNATSAVTTGQh 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  993 -TTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSF 1071
Cdd:pfam05109  635 nITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN 714
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1072 STTASHSTDSEIPTSTSSPSELSTHtvvtgqagsTPTGETTIIPTVPAS-SEPTASTHVSHTTDAGRST 1139
Cdd:pfam05109  715 SSTSTKPGEVNVTKGTPPKNATSPQ---------APSGQKTAVPTVTSTgGKANSTTGGKHTTGHGART 774
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1868-2251 2.08e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 2.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1868 SSPSELSTPTVVTGqAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVP-SGPGDLSTSPAVSGPTAT------- 1939
Cdd:PHA03307    45 SDSAELAAVTVVAG-AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPppptppp 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1940 GVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTIVSESSEYTVASYTT 2018
Cdd:PHA03307   124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2019 GSPSP-----SSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSET------SYTVGDGSSASPSGSGQPSTTV 2087
Cdd:PHA03307   204 PRPPRrsspiSASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECplprpaPITLPTRIWEASGWNGPSSRPG 283
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVyPGTPHSSE-PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGStiyTATAPSEL 2166
Cdd:PHA03307   284 PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP---SPSRPPPP 359
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2167 STPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVShttdaGHSTVPSRPGD 2246
Cdd:PHA03307   360 ADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS-----GAFYARYPLLT 434

                   ....*
gi 1953082137 2247 LSTSP 2251
Cdd:PHA03307   435 PSGEP 439
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
217-472 2.68e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  217 STTVSVSAQTTTGLVDGSTVYPGTPHSsepTGIPHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTATAP 296
Cdd:NF033849   248 GHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTSESQ 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  297 SELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRSTVPS 375
Cdd:NF033849   310 SHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSSSSG 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  376 GPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSASPSA 449
Cdd:NF033849   389 VSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWSEGT 468
                          250       260
                   ....*....|....*....|...
gi 1953082137  450 TESTFSTIVSESSEYTVASYTTG 472
Cdd:NF033849   469 GTSQGQSVGTSESWSTSQSETDS 491
PHA03247 PHA03247
large tegument protein UL36; Provisional
238-478 2.86e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  238 PGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATA--PSELSTPSFSTTASRSTDSA 315
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPerPRDDPAPGRVSRPRRARRLG 2671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  316 IPTSTSSPSELSTPTVVTGQAGST----------PTGETTIIPTVPASSEPTAsthvshtTDAGRSTVPSGPGDLSTSPA 385
Cdd:PHA03247  2672 RAAQASSPPQRPRRRAARPTVGSLtsladpppppPTPEPAPHALVSATPLPPG-------PAAARQASPALPAAPAPPAV 2744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  386 vsgPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAsPSATESTFSTIVSESSEYT 465
Cdd:PHA03247  2745 ---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPP 2820
                          250
                   ....*....|...
gi 1953082137  466 VASYTTGSPSPSS 478
Cdd:PHA03247  2821 AASPAGPLPPPTS 2833
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
17-395 6.88e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 6.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   17 ATATQGETTIIPTVPASSEPTAS--THVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPP--- 91
Cdd:PHA03307    57 AGAAACDRFEPPTGPPPGPGTEApaNESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDlse 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   92 ---DSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSA-TESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTL 167
Cdd:PHA03307   137 mlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSsPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  168 TTGHSTTAL--SALPSVFTTVSALtgttvTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGstvyPGTPHSSE 245
Cdd:PHA03307   217 ASSPAPAPGrsAADDAGASSSDSS-----SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS----SRPGPASS 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  246 PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSE 325
Cdd:PHA03307   288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1953082137  326 LSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLSTSPAVSGP--TATGVP 395
Cdd:PHA03307   368 RPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASGAFYARYPllTPSGEP 439
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1522-1707 7.22e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 7.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1522 PQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPS 1601
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1602 RPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFS 1681
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170       180
                   ....*....|....*....|....*..
gi 1953082137 1682 TIVSESS-EYTVASYTTGSPSPSEPGR 1707
Cdd:COG3469    189 TASGATTpSATTTATTTGPPTPGLPKH 215
 
Name Accession Description Interval E-value
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
2344-2426 6.70e-09

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


Pssm-ID: 460188  Cd Length: 100  Bit Score: 55.32  E-value: 6.70e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRL--FNGSVVVehDVVME-----TNYTS 2415
Cdd:pfam01390    5 GSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSSLrKQYIKSHVLRLrpDGGSVVV--DVVLVfrfpsTEPAL 82
                           90
                   ....*....|.
gi 1953082137 2416 DFQKLFENLIE 2426
Cdd:pfam01390   83 DREKLIEEILR 93
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
820-1245 3.46e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 59.41  E-value: 3.46e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  820 VPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATES 899
Cdd:PHA03307    43 LVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPP 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  900 TFS---TIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSAS 976
Cdd:PHA03307   123 PASpppSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAA 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  977 PSGPGQLSTTVSVSAqtttglvDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPftthaDGGHTTTSLAagsT 1056
Cdd:PHA03307   203 SPRPPRRSSPISASA-------SSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP-----LPRPAPITLP---T 267
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1057 IYTATAPSELSTPSFSTTASHSTDSE-----IPTSTSSPSELSTHTVVTGQAGSTPTGettiiptvpaSSEPTASTHVSH 1131
Cdd:PHA03307   268 RIWEASGWNGPSSRPGPASSSSSPRErspspSPSSPGSGPAPSSPRASSSSSSSRESS----------SSSTSSSSESSR 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1132 TTDAGRSTVPSRPGDLSTSPAVSGPTATgvPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSptESLATSPGSGPS 1211
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR--DATGRFPAGRPR 413
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1953082137 1212 ASPSATESTFSTIVSESSEYtvasYTTGSPSPSS 1245
Cdd:PHA03307   414 PSPLDAGAASGAFYARYPLL----TPSGEPWPGS 443
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2045-2300 3.44e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2045 ALPSVFTT---VSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHS---------SE 2112
Cdd:NF033849   226 SLPMMYAAnlgQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESestgqsssvGT 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2113 PTGIPHSTTSGEdAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSS--- 2189
Cdd:NF033849   306 SESQSHGTTEGT-STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSsrs 384
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2190 --------------PSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT--ASTHVSHTTDAGHSTvpsrpgDLSTSPAV 2253
Cdd:NF033849   385 sssgvsggfsggiaGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSssTGTSSGHSDSSSHST------SSGQADSV 458
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1953082137 2254 SGPTATGVPQESTDHSTMSHSSAVTHSFSSTFTEVDKSHIPTSSSRQ 2300
Cdd:NF033849   459 SQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQG 505
PHA03247 PHA03247
large tegument protein UL36; Provisional
795-1268 1.41e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 1.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  795 PTVPASSEPTAsthvshttdAGRSTVPSRPGDLSTSPAV-SGPTATGVPQESTD-----------HGTISDSPQPPDSSA 862
Cdd:PHA03247  2553 PPLPPAAPPAA---------PDRSVPPPRPAPRPSEPAVtSRARRPDAPPQSARprapvddrgdpRGPAPPSPLPPDTHA 2623
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  863 TTFTKGDASPMSTSSPT-ESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSsqedsstttlttghstt 941
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPhPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR----------------- 2686
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  942 alsALPSVFTTVSALTETTvtsetsytvgdgssASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHST- 1020
Cdd:PHA03247  2687 ---AARPTVGSLTSLADPP--------------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPa 2749
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1021 TSGEDAVSASTPTTAGPFTTHADGG------HTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:PHA03247  2750 TPGGPARPARPPTTAGPPAPAPPAApaagppRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 THTVVTGQAGSTPTG--------ETTIIPTVPASSEPTASThvshttDAGRSTVPSRPGDLSTS-PAVSGPTAT-GVPQE 1164
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGppppslplGGSVAPGGDVRRRPPSRS------PAAKPAAPARPPVRRLArPAVSRSTESfALPPD 2903
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1165 STDHGTISDSPQPPDSSATTFTKGDASPM--STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPS 1242
Cdd:PHA03247  2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPppPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
                          490       500
                   ....*....|....*....|....*.
gi 1953082137 1243 PSSQEDSSTTTLTTGHSTTALSALPS 1268
Cdd:PHA03247  2984 PSREAPASSTPPLTGHSLSRVSSWAS 3009
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
175-360 1.64e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.60  E-value: 1.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  175 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 254
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  255 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVV 332
Cdd:COG3469    103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
                          170       180
                   ....*....|....*....|....*...
gi 1953082137  333 TGQAGSTPTGETTIIPTVPASSEPTAST 360
Cdd:COG3469    183 TTATATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1819-2024 1.38e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1819 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 1898
Cdd:COG3469      6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 1978
Cdd:COG3469     86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1953082137 1979 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 2024
Cdd:COG3469    166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
272-477 1.38e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  272 TTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVP 351
Cdd:COG3469      6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  352 ASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSP 431
Cdd:COG3469     86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1953082137  432 TESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPS 477
Cdd:COG3469    166 TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
PHA03247 PHA03247
large tegument protein UL36; Provisional
1506-1948 1.55e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1506 QPHLLLQGPRSTPPLRPQELSTPsfsttASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAs 1585
Cdd:PHA03247  2651 RPRDDPAPGRVSRPRRARRLGRA-----AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG- 2724
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1586 thvshtTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS--DSPQPPDSSATTftkgdASPMSTSSPTESLAT 1663
Cdd:PHA03247  2725 ------PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAG-----PPRRLTRPAVASLSE 2793
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1664 SPGSGPSAsPSATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLFHHHPDHWTQHYRPFGVALRLHNCVcfdRKRPLLQ 1743
Cdd:PHA03247  2794 SRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV---RRRPPSR 2869
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1744 KPPIrwVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTP--TTAGPFTTH 1821
Cdd:PHA03247  2870 SPAA--KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPLAPTT 2947
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1822 ADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTV---- 1897
Cdd:PHA03247  2948 DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR-VSSWASSLALHEETDPPPVslkq 3026
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1898 ---PASSEPTASTHVSHTTDAGRSTV------PSGPGDLSTSPAVSGPTATGVPQESTDH 1948
Cdd:PHA03247  3027 tlwPPDDTEDSDADSLFDSDSERSDLealdplPPEPHDPFAHEPDPATPEAGARESPSSQ 3086
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1748-1942 1.97e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 1.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1748 RWVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHT 1827
Cdd:COG3469      6 TAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAA 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1828 TTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1907
Cdd:COG3469     86 AAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1953082137 1908 HVSHTTDAGRSTVPSGPGDlSTSPAVSGPTATGVP 1942
Cdd:COG3469    166 TSTTTTTTSASTTPSATTT-ATATTASGATTPSAT 199
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
948-1217 2.51e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.00  E-value: 2.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  948 SVFTTVSALTETTVTSETSYTVGDGSSASPSGpGQlSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAV 1027
Cdd:NF033849   266 SVGTSQSQSHTTGHGSTRGWSHTQSTSESEST-GQ-SSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSH 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1028 SASTPTTAGPFTTHADGGHTTTSLAAGSTiyTATAPSELSTPSFST--------------TASHSTDSEIPTSTSSPSEL 1093
Cdd:NF033849   344 SDGTSQSTSISHSESSSESTGTSVGHSTS--SSVSSSESSSRSSSSgvsggfsggiagggVTSEGLGASQGGSEGWGSGD 421
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1094 STHTVVTGQAGSTPTGETTIIPTVPASSEPT-ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTIS 1172
Cdd:NF033849   422 SVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSgQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSES 501
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1953082137 1173 DSPQPPDSSATTFTKGDASPMSTSSpTESLATSPGSGPSASPSAT 1217
Cdd:NF033849   502 VSQGDGRSTGRSESQGTSLGTSGGR-TSGAGGSMGLGPSISLGKS 545
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2042-2243 2.57e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 2.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2042 ALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTT 2121
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2122 SGEDAVSASTPTTAGPftthaDGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSelsthTVVTG 2201
Cdd:COG3469    103 SGANTGTSTVTTTSTG-----AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS-----TTTTT 172
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1953082137 2202 QAGSTPTGETTIIPTVPASSEPTASTHVSHTT-DAGHSTVPSR 2243
Cdd:COG3469    173 TSASTTPSATTTATATTASGATTPSATTTATTtGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
942-1127 3.02e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 3.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  942 ALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTT 1021
Cdd:COG3469     23 LGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTA 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1022 SGEDAVSASTPTTAGPFTTHADG--GHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVV 1099
Cdd:COG3469    103 SGANTGTSTVTTTSTGAGSVTSTtsSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
                          170       180
                   ....*....|....*....|....*...
gi 1953082137 1100 TGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469    183 TTATATTASGATTPSATTTATTTGPPTP 210
PHA03247 PHA03247
large tegument protein UL36; Provisional
1566-2048 3.32e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 3.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1566 PTGETTIIPTVPAS--SEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQestdhgTISDSPQPPDSSATT 1643
Cdd:PHA03247  2562 AAPDRSVPPPRPAPrpSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPD------THAPDPPPPSPSPAA 2635
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1644 FTKGDASPmstssptesLATSPGSGPSASPSATEstfstiVSESSEYTVASYTTGSPSPsepgrlfhhhPDHWTQHYRPF 1723
Cdd:PHA03247  2636 NEPDPHPP---------PTVPPPERPRDDPAPGR------VSRPRRARRLGRAAQASSP----------PQRPRRRAARP 2690
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1724 GVAlRLHNCVCFDRKRPLLQKPPIRWVTEAPRPLQVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHS-SEPTGISHSTTS 1802
Cdd:PHA03247  2691 TVG-SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARpARPPTTAGPPAP 2769
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1803 GEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRST-----DSAIPTSTSSPSELSTPT 1877
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplpppTSAQPTAPPPPPGPPPPS 2849
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1878 VVTGqAGSTPTGETT-IIPTVPASSEPTASTHVSHTTDAgRSTVPSGPGDLSTSPAVSGPTATGVPQEstdhgtisdsPP 1956
Cdd:PHA03247  2850 LPLG-GSVAPGGDVRrRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTESFALPPDQPERPPQPQAPP----------PP 2917
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1957 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTT 2036
Cdd:PHA03247  2918 QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLT 2997
                          490
                   ....*....|..
gi 1953082137 2037 GHSTTALSALPS 2048
Cdd:PHA03247  2998 GHSLSRVSSWAS 3009
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
187-395 3.46e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 3.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVypgtphSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVV------VAASGSAGSGTGTTAASSTAATSS 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  267 TAGPFTTHADGGHTTTSlaaGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTI 346
Cdd:COG3469     75 TTSTTATATAAAAAATS---TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTT 151
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1953082137  347 IPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDlSTSPAVSGPTATGVP 395
Cdd:COG3469    152 TVSGTETATGGTTTTSTTTTTTSASTTPSATTT-ATATTASGATTPSAT 199
PHA03247 PHA03247
large tegument protein UL36; Provisional
1514-1998 6.39e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 6.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1514 PRSTPPLRPQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASthVSHTTD 1593
Cdd:PHA03247  2623 APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS--LTSLAD 2700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1594 AGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPqPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASP 1673
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1674 SATestfstiVSESSEYTVASYTTGSPSPSEPGrlfhhhpdhwtqhyrPFGVALRLHNCVCFDRKRPLLQKPPIRWVTEA 1753
Cdd:PHA03247  2780 PRR-------LTRPAVASLSESRESLPSPWDPA---------------DPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1754 PRPLQvraslsttvsvSAQTTTGLVDGSTVYPGTPHS---------SEPTGISHSTTSGEDAVSASTPTTAGPFTTHADG 1824
Cdd:PHA03247  2838 APPPP-----------PGPPPPSLPLGGSVAPGGDVRrrppsrspaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1825 GHTTTSLAAGSTIYTATAPSELSTPSFSTTAsRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPT 1904
Cdd:PHA03247  2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1905 ASTHVSHTTDAGRSTVpsgpgdlstsPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLAT 1984
Cdd:PHA03247  2986 REAPASSTPPLTGHSL----------SRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEAL 3055
                          490
                   ....*....|....
gi 1953082137 1985 SPGSGPSASPSATE 1998
Cdd:PHA03247  3056 DPLPPEPHDPFAHE 3069
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2005-2212 7.46e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.21  E-value: 7.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2005 VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPS 2084
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2085 TTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEdAVSASTPTTAGPFT---THADGGHTTTSLAAGSTIYTAT 2161
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGS-VTSTTSSTAGSTTTsgaSATSSAGSTTTTTTVSGTETAT 160
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1953082137 2162 APSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETT 2212
Cdd:COG3469    161 GGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
968-1268 8.15e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 8.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  968 TVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADgght 1047
Cdd:PHA03307    63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML---- 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1048 TTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSElsthtvvtgQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:PHA03307   139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA---------RAPSSPPAEPPPSTPPAAASPRPPRR 209
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1128 HVSHTTDAGRSTvPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPG 1207
Cdd:PHA03307   210 SSPISASASSPA-PAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1208 SG-----PSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPS 1268
Cdd:PHA03307   289 SSprersPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1026-1244 9.00e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.83  E-value: 9.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1026 AVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGS 1105
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1106 TPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDspqpPDSSATTF 1185
Cdd:COG3469     83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTT----TTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1186 TKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVsesseyTVASYTTGSPSPS 1244
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSA------TTTATTTGPPTPG 211
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
107-319 9.00e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.83  E-value: 9.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSepTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS--ATSSAGSTTTTTTVSGTETA 159
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137  267 TAGPFTTHADGGHTTTS-LAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTS 319
Cdd:COG3469    160 TGGTTTTSTTTTTTSAStTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
SEA smart00200
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ...
2344-2416 9.77e-05

Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.


Pssm-ID: 214554  Cd Length: 121  Bit Score: 44.32  E-value: 9.77e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1953082137  2344 VTVKVTNRNFTKDLNNISSSVYQNFTQLFKSQMDKAYMGKDF-PQYRGVIIRRLFNGSVVVEHDVVMETNYTSD 2416
Cdd:smart00200   12 LSVEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKTDLkPDFVGTEVIEFRNGSVVVDLGLLFNEGVTNG 85
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1974-2199 9.89e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.83  E-value: 9.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1974 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 2053
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2054 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSepTGIPHSTTSGEDAVSASTPT 2133
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS--ATSSAGSTTTTTTVSGTETA 159
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1953082137 2134 TAGPFTTHADGGHTTTS-LAAGSTIYTATAPSELSTPSFSTTASrstdsvipTSTSSPSELSTHTVV 2199
Cdd:COG3469    160 TGGTTTTSTTTTTTSAStTPSATTTATATTASGATTPSATTTAT--------TTGPPTPGLPKHVLV 218
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1761-2206 1.03e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 47.85  E-value: 1.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:COG4625     59 TGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAG 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRSTDSAIPTSTSSpselSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTV 1920
Cdd:COG4625    139 GGGGGGGGGGAGGGGGGGAGGAGGGGGGG----GGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGA 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1921 PSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST 2000
Cdd:COG4625    215 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGG 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2001 FSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGS 2080
Cdd:COG4625    295 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGG 374
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2081 GQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTA 2160
Cdd:COG4625    375 GGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGG 454
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1953082137 2161 TAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTV---VTGQAGST 2206
Cdd:COG4625    455 GAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGggnYTQSAGST 503
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
874-1099 1.13e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.44  E-value: 1.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  874 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 953
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  954 SALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1033
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1034 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTAShstdseipTSTSSPSELSTHTVV 1099
Cdd:COG3469    161 GGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT--------TTGPPTPGLPKHVLV 218
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1914-2123 1.44e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 1.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1914 DAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTdhgTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAS 1993
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVT---LTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTS 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETSYTVGDGS 2073
Cdd:COG3469     78 TTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTE 157
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2074 SASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSG 2123
Cdd:COG3469    158 TATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1926-2137 1.46e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1926 DLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 2005
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2006 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQP 2083
Cdd:COG3469     81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 2084 STTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGP 2137
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
59-270 1.46e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   59 DLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 138
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  139 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQP 216
Cdd:COG3469     81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137  217 STTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGP 270
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
107-298 2.32e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.67  E-value: 2.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTAS 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  187 SALTGTTVTSETSYTVGDGSSASPSGSGQPsTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG3469    104 GANTGTSTVTTTSTGAGSVTSTTSSTAGST-TTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSAT 182
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1953082137  267 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSE 298
Cdd:COG3469    183 TTATATTASGATTPSATTTATTTGPPTPGLPK 214
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1761-2019 2.37e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1761 ASLSTTVSVSAQTTTGLVDGSTVYPGTPHSsepTGISHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTA 1840
Cdd:NF033849   245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1841 TAPSELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRST 1919
Cdd:NF033849   307 ESQSHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSS 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1920 VPSGPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSAS 1993
Cdd:NF033849   386 SSGVSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWS 465
                          250       260
                   ....*....|....*....|....*.
gi 1953082137 1994 PSATESTFSTIVSESSEYTVASYTTG 2019
Cdd:NF033849   466 EGTGTSQGQSVGTSESWSTSQSETDS 491
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
859-1160 2.97e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.54  E-value: 2.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  859 DSSATTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTg 937
Cdd:NF033849   237 QSAGTGYGESVGHSTSQGqSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE- 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  938 hsttalsalpSVFTTVSALTETTVTSETSYTVGDGSSASPSGpgqlSTTVSVSAQTTTGLVDGSSVYPGTPHS-SEPTGI 1016
Cdd:NF033849   316 ----------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGT----SQSTSISHSESSSESTGTSVGHSTSSSvSSSESS 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDA-VSASTPttAGPFTTHADGGHTTTSLAAG-STIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELS 1094
Cdd:NF033849   382 SRSSSSGVSGgFSGGIA--GGGVTSEGLGASQGGSEGWGsGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVS 459
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1095 ---THTVVTGQAGSTPTGETTIIPTVPASSEPTASTH-VSHTTDAGRSTVPSRPGDLSTSPAVSGPTATG 1160
Cdd:NF033849   460 qgtSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTgTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
968-1166 3.98e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.90  E-value: 3.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  968 TVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHT 1047
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTAS 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1048 TTSLAAGSTIYTATAPSELSTPSFSTTAshsTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1127
Cdd:COG3469    104 GANTGTSTVTTTSTGAGSVTSTTSSTAG---STTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1953082137 1128 HVSHTTDAGRSTVpsrpgdlsTSPAVSGPTATGVPQEST 1166
Cdd:COG3469    181 ATTTATATTASGA--------TTPSATTTATTTGPPTPG 211
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1194-1384 5.22e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 5.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1194 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQedSSTTTLTTGHSTTALSALPSVFTTV 1273
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTT--STTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1274 SALTETTVTSETSYTVGDGSSVSPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPT 1353
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1953082137 1354 TAGPFTTHADGGHTTTSLAAGSTIYTATAPL 1384
Cdd:COG3469    182 TTTATATTASGATTPSATTTATTTGPPTPGL 212
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1512-1923 5.32e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 5.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1512 QGPRSTPPLRPQELSTPSFSTTASHSTDSEIPTSTSSPSelsthtvvtgqAGSTPTGETTIIPTVPASSEPTASTHVSHT 1591
Cdd:PHA03307    69 TGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT-----------PPGPSSPDPPPPTPPPASPPPSPAPDLSEM 137
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1592 TDAGRSTVPSRPGDLSTSPAVSGPTATGVP---------QESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLA 1662
Cdd:PHA03307   138 LRPVGSPGPPPAASPPAAGASPAAVASDAAssrqaalplSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1663 TSPGSGPSASP--SATESTFSTIVSESSEYTVASYTTGSPSPSEPGRLfhhhpdhwtqhyrpfgvalrlhncvcfdrkrP 1740
Cdd:PHA03307   218 SSPAPAPGRSAadDAGASSSDSSSSESSGCGWGPENECPLPRPAPITL-------------------------------P 266
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1741 LLQKPPIRWVTEAPRPLqvraslsttvSVSAQTTTGLVDGSTVyPGTPHSSE-PTGISHSTTSGEDAVSASTPTTAGPFT 1819
Cdd:PHA03307   267 TRIWEASGWNGPSSRPG----------PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSES 335
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1820 THADGGHTTTSLAAGStiyTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIP-TVP 1898
Cdd:PHA03307   336 SRGAAVSPGPSPSRSP---SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPaGRP 412
                          410       420
                   ....*....|....*....|....*
gi 1953082137 1899 ASSEPTASTHVSHTTDAGRSTVPSG 1923
Cdd:PHA03307   413 RPSPLDAGAASGAFYARYPLLTPSG 437
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1547-1942 5.60e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 5.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGettiIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQEstd 1626
Cdd:PHA03307    45 SDSAELAAVTVVAGAAACDRFE----PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--- 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 hGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATEST-----FSTIVSESSEytvASYTTGSPS 1701
Cdd:PHA03307   118 -PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSrqaalPLSSPEETAR---APSSPPAEP 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1702 PSEPGRLFHHHPDH----WTQHYRPFGVALRLHNCVcFDRKRPLLQKPPIRWV-------TEAPRPlqvRASLSTTVSVS 1770
Cdd:PHA03307   194 PPSTPPAAASPRPPrrssPISASASSPAPAPGRSAA-DDAGASSSDSSSSESSgcgwgpeNECPLP---RPAPITLPTRI 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1771 AQTTTGLVDGstvyPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPS 1850
Cdd:PHA03307   270 WEASGWNGPS----SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1851 FSTTASRSTDSAIPTSTSSPSELSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLST 1929
Cdd:PHA03307   346 SPSRSPSPSRPPPPADPSSPRKRPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASG 424
                          410
                   ....*....|....*
gi 1953082137 1930 SPAVSGP--TATGVP 1942
Cdd:PHA03307   425 AFYARYPllTPSGEP 439
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1148-1376 6.79e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 45.38  E-value: 6.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1148 STSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTS-----SPTESLATSPGSGPSASPSATESTFS 1222
Cdd:NF033849   240 GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTqstseSESTGQSSSVGTSESQSHGTTEGTST 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1223 TI---VSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSVSPSG 1299
Cdd:NF033849   320 TDsssHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1300 PGQLSTTVSVSAQTTTG-----------LVDGSSVYPGTPHSSEpTGISHSTTSGEdAVSASTPTTAGPFTTHADGGHTT 1368
Cdd:NF033849   400 GGVTSEGLGASQGGSEGwgsgdsvqsvsQSYGSSSSTGTSSGHS-DSSSHSTSSGQ-ADSVSQGTSWSEGTGTSQGQSVG 477

                   ....*...
gi 1953082137 1369 TSLAAGST 1376
Cdd:NF033849   478 TSESWSTS 485
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1146-1357 6.92e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.13  E-value: 6.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1146 DLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 1225
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1226 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTETTVTSETSYTVGDGSSVSPSGPGQL 1303
Cdd:COG3469     81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 1304 STTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGP 1357
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1963-2209 7.09e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 45.38  E-value: 7.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1963 TTFTKGDASPMSTS-SPTESLATSPGSGPSASPSATESTFstiVSESSEYTVASYTTGSPSPS--SQEDSSTTTLTTGHS 2039
Cdd:NF033849   285 WSHTQSTSESESTGqSSSVGTSESQSHGTTEGTSTTDSSS---HSQSSSYNVSSGTGVSSSHSdgTSQSTSISHSESSSE 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2040 TTALSALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHS 2119
Cdd:NF033849   362 STGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSG 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2120 TTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRST---DSVIPTSTSSPSELST 2195
Cdd:NF033849   442 HSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTgtsESVSQGDGRSTGRSES 515
                          250
                   ....*....|....
gi 1953082137 2196 HTVVTGQAGSTPTG 2209
Cdd:NF033849   516 QGTSLGTSGGRTSG 529
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1827-2233 7.21e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 45.29  E-value: 7.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1827 TTTSLAAGSTIYTATAPSELSTPSFSTTASrstdSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIptvpasSEPTAS 1906
Cdd:pfam05109  410 TNATTTTHKVIFSKAPESTTTSPTLNTTGF----AAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADV------TSPTPA 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1907 THVSHTTDAGRSTVPSGPGDLSTSPAVSGPTaTGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSP 1986
Cdd:pfam05109  480 GTTSGASPVTPSPSPRDNGTESKAPDMTSPT-SAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1987 gsGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEdSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSETS 2066
Cdd:pfam05109  559 --TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGE-TSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHN 635
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2067 YTvgdgSSASPSGSGQPSttvSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHST-----TSGEDAVSASTPTTAGPFTTH 2141
Cdd:pfam05109  636 IT----SSSTSSMSLRPS---SISETLSPSTSDNSTSHMPLLTSAHPTGGENITqvtpaSTSTHHVSTSSPAPRPGTTSQ 708
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2142 ADG-GHTTTSLAAGSTIYTATAPSELSTpsfSTTASRSTDSVIPTSTSSpselsthtvvTGQAGSTPTGETTIIPTVPAS 2220
Cdd:pfam05109  709 ASGpGNSSTSTKPGEVNVTKGTPPKNAT---SPQAPSGQKTAVPTVTST----------GGKANSTTGGKHTTGHGARTS 775
                          410
                   ....*....|...
gi 1953082137 2221 SEPTASTHVSHTT 2233
Cdd:pfam05109  776 TEPTTDYGGDSTT 788
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
826-1037 8.07e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 8.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  826 DLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIV 905
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  906 SesseYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALP--SVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQL 983
Cdd:COG3469     81 T----ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGagSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137  984 STTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGP 1037
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1758-1907 8.21e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 8.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1758 QVRASLSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTI 1837
Cdd:COG3469     64 TAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSS 143
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1838 YTATAPSELSTPSFSTTASRSTDsaiPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTAST 1907
Cdd:COG3469    144 AGSTTTTTTVSGTETATGGTTTT---STTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
902-1110 1.23e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 1.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  902 STIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPG 981
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  982 QLSTTVSVSAQTTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTAT 1061
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1953082137 1062 APSELS-----TPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGE 1110
Cdd:COG3469    161 GGTTTTsttttTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1547-1889 1.28e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.23  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1547 SSPSELSTHTVVTGQAGSTPTGETTIIptvpaSSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTD 1626
Cdd:NF033849   241 TGYGESVGHSTSQGQSHSVGTSESHSV-----GTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTE 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1627 HGTISDSPQPPDSSATTFTKGDASPMSTSSpteslATSPGSGPSASPSATESTfSTIVSESSEYTVASYTTGSPSPSEpg 1706
Cdd:NF033849   316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSD-----GTSQSTSISHSESSSEST-GTSVGHSTSSSVSSSESSSRSSSS-- 387
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1707 rlfhHHPDHWTQHYRPFGVAlrlhncvcfdrkrpllqkppirwvteaprplqvraSLSTTVSVSAQTTTGLVDGSTVYPG 1786
Cdd:NF033849   388 ----GVSGGFSGGIAGGGVT-----------------------------------SEGLGASQGGSEGWGSGDSVQSVSQ 428
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1787 TPHSSEPTGISHSTTSG-EDAVSASTPTTAGPFTTHADGGHTTTSLAAGstiytaTAPSELSTPSFSTTASRSTDSAIPT 1865
Cdd:NF033849   429 SYGSSSSTGTSSGHSDSsSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESV 502
                          330       340
                   ....*....|....*....|....*..
gi 1953082137 1866 S---TSSPSELSTPTVVTGQAGSTPTG 1889
Cdd:NF033849   503 SqgdGRSTGRSESQGTSLGTSGGRTSG 529
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
763-1139 1.38e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  763 IPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPT----ASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTA 838
Cdd:pfam05109  404 IITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTtglpSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTS 483
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  839 TGVPqestdhgtISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTiVSESSEYTVASYT 917
Cdd:pfam05109  484 GASP--------VTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAvTTPTPNATSPTLGK-TSPTSAVTTPTPN 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  918 TGSPSPSSQEDSSTTTLTTGHSTTALSAL--PSVFTTVSALTETTVTSETSYTVGDGSSASP--SGPGQLSTTVSVSAQ- 992
Cdd:pfam05109  555 ATSPTPAVTTPTPNATIPTLGKTSPTSAVttPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvTSPPKNATSAVTTGQh 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  993 -TTTGLVDGSSVYPGTPHSSEPTGISHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSF 1071
Cdd:pfam05109  635 nITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN 714
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1953082137 1072 STTASHSTDSEIPTSTSSPSELSTHtvvtgqagsTPTGETTIIPTVPAS-SEPTASTHVSHTTDAGRST 1139
Cdd:pfam05109  715 SSTSTKPGEVNVTKGTPPKNATSPQ---------APSGQKTAVPTVTSTgGKANSTTGGKHTTGHGART 774
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
17-471 1.54e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 44.00  E-value: 1.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   17 ATATQGETTIIPTVPASSEPTASTHVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSAT 96
Cdd:COG4625     51 GGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAG 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   97 TFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTAL 176
Cdd:COG4625    131 GGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGG 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  177 SALPSVFTTVSALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTphSSEPTGIPHSTTSG 256
Cdd:COG4625    211 GGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG--SGGGGGGGGGGGSG 288
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  257 EDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQA 336
Cdd:COG4625    289 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGG 368
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  337 GSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSAT 416
Cdd:COG4625    369 GGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGG 448
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1953082137  417 TFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTT 471
Cdd:COG4625    449 GGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGST 503
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1868-2251 2.08e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 2.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1868 SSPSELSTPTVVTGqAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVP-SGPGDLSTSPAVSGPTAT------- 1939
Cdd:PHA03307    45 SDSAELAAVTVVAG-AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPppptppp 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1940 GVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-PSASPSATESTFSTIVSESSEYTVASYTT 2018
Cdd:PHA03307   124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2019 GSPSP-----SSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSET------SYTVGDGSSASPSGSGQPSTTV 2087
Cdd:PHA03307   204 PRPPRrsspiSASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECplprpaPITLPTRIWEASGWNGPSSRPG 283
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVyPGTPHSSE-PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGStiyTATAPSEL 2166
Cdd:PHA03307   284 PASSSSSPRERSPSPS-PSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP---SPSRPPPP 359
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2167 STPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVShttdaGHSTVPSRPGD 2246
Cdd:PHA03307   360 ADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAAS-----GAFYARYPLLT 434

                   ....*
gi 1953082137 2247 LSTSP 2251
Cdd:PHA03307   435 PSGEP 439
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
217-472 2.68e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  217 STTVSVSAQTTTGLVDGSTVYPGTPHSsepTGIPHSTTSGedavsastpttagpfTTHADGGHTTTSLAAGSTIYTATAP 296
Cdd:NF033849   248 GHSTSQGQSHSVGTSESHSVGTSQSQS---HTTGHGSTRG---------------WSHTQSTSESESTGQSSSVGTSESQ 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  297 SELSTPSFSTTASRS-TDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHvSHTTDAGRSTVPS 375
Cdd:NF033849   310 SHGTTEGTSTTDSSShSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV-SSSESSSRSSSSG 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  376 GPGDLSTSPAVSGPTATGVPQ---ESTDHGTISDSPPPPDSSATTFTKGDASPMSTS---SPTESLATSPGSGPSASPSA 449
Cdd:NF033849   389 VSGGFSGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSsshSTSSGQADSVSQGTSWSEGT 468
                          250       260
                   ....*....|....*....|...
gi 1953082137  450 TESTFSTIVSESSEYTVASYTTG 472
Cdd:NF033849   469 GTSQGQSVGTSESWSTSQSETDS 491
PHA03247 PHA03247
large tegument protein UL36; Provisional
238-478 2.86e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  238 PGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATA--PSELSTPSFSTTASRSTDSA 315
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPerPRDDPAPGRVSRPRRARRLG 2671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  316 IPTSTSSPSELSTPTVVTGQAGST----------PTGETTIIPTVPASSEPTAsthvshtTDAGRSTVPSGPGDLSTSPA 385
Cdd:PHA03247  2672 RAAQASSPPQRPRRRAARPTVGSLtsladpppppPTPEPAPHALVSATPLPPG-------PAAARQASPALPAAPAPPAV 2744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  386 vsgPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSAsPSATESTFSTIVSESSEYT 465
Cdd:PHA03247  2745 ---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPP 2820
                          250
                   ....*....|...
gi 1953082137  466 VASYTTGSPSPSS 478
Cdd:PHA03247  2821 AASPAGPLPPPTS 2833
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
754-927 3.04e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 3.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  754 TASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGEttiiPTVPASSEPTASTHVSHTTDAGRSTVPSrPGDLSTSPAV 833
Cdd:PHA03307   235 SSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWE----ASGWNGPSSRPGPASSSSSPRERSPSPS-PSSPGSGPAP 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  834 SGPTATG--VPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSP---GSGPSASPSATESTFSTIVSES 908
Cdd:PHA03307   310 SSPRASSssSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSprkRPRPSRAPSSPAASAGRPTRRR 389
                          170
                   ....*....|....*....
gi 1953082137  909 SEYTVASYTTGSPSPSSQE 927
Cdd:PHA03307   390 ARAAVAGRARRRDATGRFP 408
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1848-2063 3.47e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1848 TPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTiiPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDL 1927
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVST--TGSVVVAASGSAGSGTGTTAASSTAATSSTTST 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1928 STSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSE 2007
Cdd:COG3469     79 TATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 2008 SSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTS 2063
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
777-1238 3.93e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 42.84  E-value: 3.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  777 TVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQ 856
Cdd:COG4625     44 GGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGG 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  857 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTT 936
Cdd:COG4625    124 GGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGG 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  937 GHSTTALSALPSVFTTVSALTETTVTSETSYTVGDGSSASPSGPGQLSTTVSVSAQTTTGLVDGSSVYPGTphSSEPTGI 1016
Cdd:COG4625    204 GGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG--SGGGGGG 281
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1017 SHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASHSTDSEIPTSTSSPSELSTH 1096
Cdd:COG4625    282 GGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGG 361
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1097 TVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQ 1176
Cdd:COG4625    362 GTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAG 441
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1953082137 1177 PPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTT 1238
Cdd:COG4625    442 GGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGST 503
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
752-948 6.60e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 6.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  752 STTASHSTDSEIPTSTSSPSELSthtVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTD-----AGRSTVPSRPGD 826
Cdd:PHA03307   151 SPPAAGASPAAVASDAASSRQAA---LPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSspisaSASSPAPAPGRS 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  827 LSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSG-----PSASPSATESTF 901
Cdd:PHA03307   228 AADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSprersPSPSPSSPGSGP 307
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1953082137  902 STIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPS 948
Cdd:PHA03307   308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
17-395 6.88e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 6.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   17 ATATQGETTIIPTVPASSEPTAS--THVSHTTDAGHSTVRSRPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPP--- 91
Cdd:PHA03307    57 AGAAACDRFEPPTGPPPGPGTEApaNESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDlse 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137   92 ---DSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSA-TESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTL 167
Cdd:PHA03307   137 mlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSsPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  168 TTGHSTTAL--SALPSVFTTVSALtgttvTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGstvyPGTPHSSE 245
Cdd:PHA03307   217 ASSPAPAPGrsAADDAGASSSDSS-----SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS----SRPGPASS 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  246 PTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSE 325
Cdd:PHA03307   288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1953082137  326 LSTPTvVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRS-TVPSGPGDLSTSPAVSGP--TATGVP 395
Cdd:PHA03307   368 RPRPS-RAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPrPSPLDAGAASGAFYARYPllTPSGEP 439
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1522-1707 7.22e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 7.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1522 PQELSTPSFSTTASHSTDSEIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTTDAGRSTVPS 1601
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1602 RPGDLSTSPAVSGPTATGVPQESTDHGTISDSPQPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFS 1681
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170       180
                   ....*....|....*....|....*..
gi 1953082137 1682 TIVSESS-EYTVASYTTGSPSPSEPGR 1707
Cdd:COG3469    189 TASGATTpSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2011-2207 7.54e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 7.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2011 YTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTVSALTGTTVTSE---TSYTVGDGSSASPSGSGQPSTTV 2087
Cdd:COG3469     15 ASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASStaaTSSTTSTTATATAAAAAATSTSA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2088 SVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGSTIYTATAPSELS 2167
Cdd:COG3469     95 TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTS 174
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1953082137 2168 TPSFSTTASRSTDSViPTSTSSPSELSTHTVVTGQAGSTP 2207
Cdd:COG3469    175 ASTTPSATTTATATT-ASGATTPSATTTATTTGPPTPGLP 213
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1837-2239 8.72e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 8.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1837 IYTATAPSELSTPSFSTTASRSTDSA---IPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTVPASSEPTASTHVSHTT 1913
Cdd:pfam05109  304 VFSDEIPASQDMPTNTTDITYVGDNAtysVPMVTSEDANSPNVTVTAFWAWPNNTETDFKCKWTLTSGTPSGCENISGAF 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1914 DAGRSTVPSGPGdLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTF-TKGDASPMSTSSPTESLATSPGSGPSA 1992
Cdd:pfam05109  384 ASNRTFDITVSG-LGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLnTTGFAAPNTTTGLPSSTHVPTNLTAPA 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1993 SPSATESTfSTIVSESSEYTVASYTTGSPSPSSQEDSSTttlttghsttalSALPSVFTTVSALTGTT--VTSETSYTVG 2070
Cdd:pfam05109  463 STGPTVST-ADVTSPTPAGTTSGASPVTPSPSPRDNGTE------------SKAPDMTSPTSAVTTPTpnATSPTPAVTT 529
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2071 DGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTgipHSTTSGEDAVSASTPTTAGPFTTHADGGHTTTS 2150
Cdd:pfam05109  530 PTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPT---LGKTSPTSAVTTPTPNATSPTVGETSPQANTTN 606
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 2151 LAAGStiyTATAPSELSTPSFSTTASRSTDSVIPTSTSSPSELSTHTVVTGQAGSTPTGETTIIPTVpASSEPTASTHVS 2230
Cdd:pfam05109  607 HTLGG---TSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLL-TSAHPTGGENIT 682

                   ....*....
gi 1953082137 2231 HTTDAGHST 2239
Cdd:pfam05109  683 QVTPASTST 691
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
107-596 9.00e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 41.69  E-value: 9.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  107 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 186
Cdd:COG4625      2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  187 SALTGTTVTSETSYTVGDGSSASPSGSGQPSTTVSVSAQTTTGLVDGSTVYPGTPHSSEPTGIPHSTTSGEDAVSASTPT 266
Cdd:COG4625     82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  267 TAGPFTTHADGGHTTTSLAAGSTIYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELSTPTVVTGQAGSTPTGETTI 346
Cdd:COG4625    162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  347 IPTVPASSEPTASTHVSHTTDAGRSTVPSGPGDLSTSPAVSGPTATGVPQESTDHGTISDSPPPPDSSATTFTKGDASPM 426
Cdd:COG4625    242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  427 STSSPTESLATSPGSGPSASPSATESTFSTIVSESSEYTVASYTTGSPSPSSQEDSSTTTLTTGHSTTALSALPSVFTTV 506
Cdd:COG4625    322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  507 SALTGNDRYFRNLLYELSTPSFSTTASHSTDSEIPTSTSSPSELSTPTVVTGQAGSTPTGETTIIPTESTDHGTISDSPQ 586
Cdd:COG4625    402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGN 481
                          490
                   ....*....|
gi 1953082137  587 PPDSSATTFT 596
Cdd:COG4625    482 NTYTGTTTVN 491
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
254-463 9.13e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.57  E-value: 9.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  254 TSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGST---IYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELS--- 327
Cdd:COG5665    245 TPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTtsnTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSVLInsd 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137  328 TPTVVTGQAGSTPTGETTIIPTVPASsepTASTHVSHTTDAGRSTVPSGPGDLSTSpaVSGPTATGVPQESTDHGTISDS 407
Cdd:COG5665    325 SPTSEDPATASVPTTEETTAFTTPSS---VPSTPAEKDTPATDLATPVSPTPPETS--VDKKVSPDSATSSTKSEKEGGT 399
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137  408 PPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSE 463
Cdd:COG5665    400 ASSPMPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAG 455
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
1801-2010 9.13e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.57  E-value: 9.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1801 TSGEDAVSASTPTTAGPFTTHADGGHTTTSLAAGST---IYTATAPSELSTPSFSTTASRSTDSAIPTSTSSPSELS--- 1874
Cdd:COG5665    245 TPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTtsnTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSVLInsd 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1953082137 1875 TPTVVTGQAGSTPTGETTIIPTVPASsepTASTHVSHTTDAGRSTVPSGPGDLSTSpaVSGPTATGVPQESTDHGTISDS 1954
Cdd:COG5665    325 SPTSEDPATASVPTTEETTAFTTPSS---VPSTPAEKDTPATDLATPVSPTPPETS--VDKKVSPDSATSSTKSEKEGGT 399
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1953082137 1955 PPPPDSSATTFTKGDASPMSTSSPTESLATSPGSGPSASPSATESTFSTIVSESSE 2010
Cdd:COG5665    400 ASSPMPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAG 455
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
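
Each hit in the list above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and the preset E-value threshold is 0.01. The following is a minimal Python sketch of how such a summary line could be parsed and checked against that threshold. The names `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool; treating the threshold as inclusive (`<=`) is also an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line shown in this report.
# The "[Multi-domain]" tag is optional, since not every hit carries it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = ("Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  "
            "Bit Score: 44.23  E-value: 1.28e-03")
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Applied line by line to a saved copy of this report, the parser would yield one record per hit, which could then be sorted by bit score or grouped by Pssm-ID.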
