Conserved domains on  [gi|1958666954|ref|XP_038944762|]
target of Nesh-SH3 isoform X2 [Rattus norvegicus]

Protein Classification

fibronectin type III domain-containing protein (domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
358-897 1.65e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.65e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  358 SEKTPETAQSVL-IPESELLLSSLAPKGSP--------EFPEAKTAFPSEKPGGS-------LASSEEPWVVPGAKTSEd 421
Cdd:PHA03247  2449 ADGDPFFARTILgAPFSLSLLLGELFPGAPvyrrpaeaRFPFAAGAAPDPGGGGPpdpdappAPSRLAPAILPDEPVGE- 2527
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  422 srVVQPRTATYDVVSSSATSDETEVEPHTATSDPIldsvPPKTSRTAEQPRATLAPSEASFDPRTveifTSPEVRPTTAA 501
Cdd:PHA03247  2528 --PVHPRMLTWIRGLEELASDDAGDPPPPLPPAAP----PAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSAR 2597
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  502 PQQTTSIPSTPKR--QSTPKPPRVKPAPEPETRPSAQSTKAP------------PHKTKKPGR-RRPKTTRSPEVPKS-- 564
Cdd:PHA03247  2598 PRAPVDDRGDPRGpaPPSPLPPDTHAPDPPPPSPSPAANEPDphppptvppperPRDDPAPGRvSRPRRARRLGRAAQas 2677
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  565 -------KPALEPATVP------PEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPVTFGSEAPALAIVTTTDIAPVISR 631
Cdd:PHA03247  2678 sppqrprRRAARPTVGSltsladPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  632 TKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPkpkTTPHPEVPQTILVPATSL 711
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSA 2834
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  712 EPVIRTETPG---TTLVPKLSQQPDFPHPKPKTTRSPAAPPTElvsttvfEPVIPLKEDPVTTIVPFTDLEPATDLETPV 788
Cdd:PHA03247  2835 QPTAPPPPPGpppPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA-------PARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  789 AFRTEAPRTTLASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTLAPKKTQIKHRPRPKPKPIPS 868
Cdd:PHA03247  2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
                          570       580
                   ....*....|....*....|....*....
gi 1958666954  869 pevAESKPVPTKEREPVTlRTESWVTTKA 897
Cdd:PHA03247  2988 ---APASSTPPLTGHSLS-RVSSWASSLA 3012
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1359-1450 3.57e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1359 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1436
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1958666954 1437 LGEGPASNTVAFST 1450
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
707-1321 7.63e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 7.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  707 PATSLEPVIRTETPGttlvPKLSQQPDFPHPKPKTTRSPAAPPTElvstTVFEPVIP--------LKE-------DPvTT 771
Cdd:PHA03247  2483 PAEARFPFAAGAAPD----PGGGGPPDPDAPPAPSRLAPAILPDE----PVGEPVHPrmltwirgLEElasddagDP-PP 2553
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  772 IVPFTDLEPATDLETPVAfrTEAPRTT---LASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTL 848
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPP--RPAPRPSepaVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  849 APKKTQikhRPRPKPKPIPSPEVAESKPVPTKEREPvtLRTESWVTTKAPKTPKRTHRVRPKPKTTTPEAPLTKPvaaTD 928
Cdd:PHA03247  2632 SPAANE---PDPHPPPTVPPPERPRDDPAPGRVSRP--RRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP---PP 2703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  929 LESSALSTEVPTTVVLTTALVPATLRTKSPKTTLAPSVQRTRRPRPRPKTTARTdvsesksvsddlelvafstESPQKTI 1008
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------------------ARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1009 APRQTTSP--PPKLKPPHSRRPAkeqVPKGSLHTTSKPKMP-PSPEVVDITSVPKDEQLSHKPDPEVSQSETVLPPVTFR 1085
Cdd:PHA03247  2765 GPPAPAPPaaPAAGPPRRLTRPA---VASLSESRESLPSPWdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1086 VEPPKTTIVPLETRDIPLIPVISPRPSEEELQTTMEQTDQSTQELFTTKIPRTTE---LAKTTQAPHRLHTTPVRPRIPE 1162
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfaLPPDQPERPPQPQAPPPPQPQP 2921
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1163 RPHGRPALNKTTTRPDRTKSRgmshkngVGPGTKQTPKPSSTGRNTSVDSHATrKPGLIPGTRHRHTSPRP-VPPQRKPL 1241
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPP-------LAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPsREAPASST 2993
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1242 PPNNVTGKPGSAGIISS----SRATSPP--LKATLKPTGTATerpgaEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGK 1315
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSlalhEETDPPPvsLKQTLWPPDDTE-----DSDADSLFDSDSERSDLEALDPLPPEPHDPFAH 3068

                   ....*.
gi 1958666954 1316 PRFIGP 1321
Cdd:PHA03247  3069 EPDPAT 3074
fn3 pfam00041
Fibronectin type III domain;
116-195 1.49e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 42.02  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCSSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1958666954  193 GVK 195
Cdd:pfam00041   72 RVQ 74
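
Each hit above carries an interval such as 358-897 and a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out of the text. The names `parse_summary` and `parse_interval` are illustrative and not part of any NCBI tool, the regular expression assumes the exact field order shown on this page, and the interval is assumed to be 1-based and inclusive.

```python
import re

# Field labels and their order are assumed to match the summary lines on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one 'Pssm-ID: ...' line as a dict, or None if absent."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_interval(text):
    """Split an interval such as '358-897' into (start, end) integers."""
    start, end = text.strip().split("-")
    return int(start), int(end)


line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.65e-14"
hit = parse_summary(line)
start, end = parse_interval("358-897")
# Interval length assumes both endpoints are included.
print(hit["pssm_id"], hit["bit_score"], hit["e_value"], end - start + 1)
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance once all summary lines have been parsed.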
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
358-897 1.65e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.65e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  358 SEKTPETAQSVL-IPESELLLSSLAPKGSP--------EFPEAKTAFPSEKPGGS-------LASSEEPWVVPGAKTSEd 421
Cdd:PHA03247  2449 ADGDPFFARTILgAPFSLSLLLGELFPGAPvyrrpaeaRFPFAAGAAPDPGGGGPpdpdappAPSRLAPAILPDEPVGE- 2527
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  422 srVVQPRTATYDVVSSSATSDETEVEPHTATSDPIldsvPPKTSRTAEQPRATLAPSEASFDPRTveifTSPEVRPTTAA 501
Cdd:PHA03247  2528 --PVHPRMLTWIRGLEELASDDAGDPPPPLPPAAP----PAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSAR 2597
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  502 PQQTTSIPSTPKR--QSTPKPPRVKPAPEPETRPSAQSTKAP------------PHKTKKPGR-RRPKTTRSPEVPKS-- 564
Cdd:PHA03247  2598 PRAPVDDRGDPRGpaPPSPLPPDTHAPDPPPPSPSPAANEPDphppptvppperPRDDPAPGRvSRPRRARRLGRAAQas 2677
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  565 -------KPALEPATVP------PEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPVTFGSEAPALAIVTTTDIAPVISR 631
Cdd:PHA03247  2678 sppqrprRRAARPTVGSltsladPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  632 TKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPkpkTTPHPEVPQTILVPATSL 711
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSA 2834
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  712 EPVIRTETPG---TTLVPKLSQQPDFPHPKPKTTRSPAAPPTElvsttvfEPVIPLKEDPVTTIVPFTDLEPATDLETPV 788
Cdd:PHA03247  2835 QPTAPPPPPGpppPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA-------PARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  789 AFRTEAPRTTLASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTLAPKKTQIKHRPRPKPKPIPS 868
Cdd:PHA03247  2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
                          570       580
                   ....*....|....*....|....*....
gi 1958666954  869 pevAESKPVPTKEREPVTlRTESWVTTKA 897
Cdd:PHA03247  2988 ---APASSTPPLTGHSLS-RVSSWASSLA 3012
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1359-1450 3.57e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1359 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1436
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1958666954 1437 LGEGPASNTVAFST 1450
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
707-1321 7.63e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 7.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  707 PATSLEPVIRTETPGttlvPKLSQQPDFPHPKPKTTRSPAAPPTElvstTVFEPVIP--------LKE-------DPvTT 771
Cdd:PHA03247  2483 PAEARFPFAAGAAPD----PGGGGPPDPDAPPAPSRLAPAILPDE----PVGEPVHPrmltwirgLEElasddagDP-PP 2553
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  772 IVPFTDLEPATDLETPVAfrTEAPRTT---LASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTL 848
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPP--RPAPRPSepaVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  849 APKKTQikhRPRPKPKPIPSPEVAESKPVPTKEREPvtLRTESWVTTKAPKTPKRTHRVRPKPKTTTPEAPLTKPvaaTD 928
Cdd:PHA03247  2632 SPAANE---PDPHPPPTVPPPERPRDDPAPGRVSRP--RRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP---PP 2703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  929 LESSALSTEVPTTVVLTTALVPATLRTKSPKTTLAPSVQRTRRPRPRPKTTARTdvsesksvsddlelvafstESPQKTI 1008
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------------------ARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1009 APRQTTSP--PPKLKPPHSRRPAkeqVPKGSLHTTSKPKMP-PSPEVVDITSVPKDEQLSHKPDPEVSQSETVLPPVTFR 1085
Cdd:PHA03247  2765 GPPAPAPPaaPAAGPPRRLTRPA---VASLSESRESLPSPWdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1086 VEPPKTTIVPLETRDIPLIPVISPRPSEEELQTTMEQTDQSTQELFTTKIPRTTE---LAKTTQAPHRLHTTPVRPRIPE 1162
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfaLPPDQPERPPQPQAPPPPQPQP 2921
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1163 RPHGRPALNKTTTRPDRTKSRgmshkngVGPGTKQTPKPSSTGRNTSVDSHATrKPGLIPGTRHRHTSPRP-VPPQRKPL 1241
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPP-------LAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPsREAPASST 2993
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1242 PPNNVTGKPGSAGIISS----SRATSPP--LKATLKPTGTATerpgaEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGK 1315
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSlalhEETDPPPvsLKQTLWPPDDTE-----DSDADSLFDSDSERSDLEALDPLPPEPHDPFAH 3068

                   ....*.
gi 1958666954 1316 PRFIGP 1321
Cdd:PHA03247  3069 EPDPAT 3074
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1360-1440 1.06e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.06e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  1360 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1437
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1958666954  1438 GEG 1440
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1360-1443 1.40e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1360 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1436
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1958666954 1437 LGEGPAS 1443
Cdd:pfam00041   79 GGEGPPS 85
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
493-702 8.05e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.07  E-value: 8.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  493 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPAT 572
Cdd:NF033839   292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEK 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  573 VPPEILVPTIVPKPPQRPKatrrPEAPQIQPAHEPVTFGSEapalaivtttdIAPVISRTKASVTTLAPKSSRPRTRQRP 652
Cdd:NF033839   372 PKPEVKPQPETPKPEVKPQ----PEKPKPEVKPQPEKPKPE-----------VKPQPEKPKPEVKPQPEKPKPEVKPQPE 436
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958666954  653 KYKATPSPKIPQTKPDLGPitaEPSLASTTKKVRRPRPKPKTTPHPEVPQ 702
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKPK 483
fn3 pfam00041
Fibronectin type III domain;
116-195 1.49e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 42.02  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCSSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1958666954  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
462-684 1.54e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  462 PKTSRTAEQPRATLAPSEASFDPrtvEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPpRVKPAPE-PETRPSAQSTKA 540
Cdd:NF033839   308 KEVKPEPETPKPEVKPQLEKPKP---EVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKP-EVKPQPEkPKPEVKPQPETP 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  541 PPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKatrrPEAPQIQPAHEPVTFGSEAPALAIV 620
Cdd:NF033839   384 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ----PEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958666954  621 TTTDIAPVISRTKASVttlapkSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTKK 684
Cdd:NF033839   460 PKPEVKPQPEKPKPEV------KPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEK 517
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1201-1455 2.30e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 45.76  E-value: 2.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1201 PSSTGRNTSVDSHATRKPGLIPGTRHRHTSPRPVPPQRKPLPPNNVTGKPGSAGIISSSRATSPPLKATLKPTGTATERP 1280
Cdd:COG3401     73 AGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDG 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1281 GAEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPDN----KPCSITDSVRRFPTEEATEGNATSP 1356
Cdd:COG3401    153 ANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTtyyyRVAATDTGGESAPSNEVSVTTPTTP 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1357 PqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1436
Cdd:COG3401    233 P-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDA 307
                          250       260
                   ....*....|....*....|
gi 1958666954 1437 LG-EGPASNTVAFSTESADP 1455
Cdd:COG3401    308 AGnESAPSNVVSVTTDLTPP 327
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
496-750 2.40e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 2.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  496 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKP--APEPET----RPSAQSTKAPPHKTKKPGRRRPKTTRSPevpkskPALE 569
Cdd:pfam03154  324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPhiKPPPTTpipqLPNPQSHKHPPHLSGPSPFQMNSNLPPP------PALK 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  570 PATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPahepvtfgseapalAIVTTTDIAPVISRTKASVTTLAPKSSRPRTR 649
Cdd:pfam03154  398 PLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP--------------PVLTQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  650 QRPKYKATPSPKIPqtkpdlgPITAEPSLASTTKKVRRPRPKPKTTPHPeVPQTilvPATSLEPVIRTETPgttlvPKLS 729
Cdd:pfam03154  464 QHPFVPGGPPPITP-------PSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEA 527
                          250       260
                   ....*....|....*....|.
gi 1958666954  730 QQPDFPHPKPkttRSPAAPPT 750
Cdd:pfam03154  528 EEPESPPPPP---RSPSPEPT 545
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1344-1498 4.59e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.99  E-value: 4.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1344 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1421
Cdd:COG3401    314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958666954 1422 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1498
Cdd:COG3401    388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
116-195 1.44e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.40  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcssdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYEFG 193
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1958666954  194 VK 195
Cdd:cd00063     74 VR 75
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
117-195 2.55e-03

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 38.36  E-value: 2.55e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954   117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCSSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTVYEF 192
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1958666954   193 GVK 195
Cdd:smart00060   73 RVR 75
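
Several hits in the list above cover overlapping stretches of the query, for example the two PHA03247 hits at 358-897 and 707-1321, or the cd00063 and smart00060 FN3 hits near residue 1360. The sketch below shows one way to quantify that overlap. It assumes the intervals are 1-based and inclusive, and `overlap` is a hypothetical helper written for this illustration rather than anything provided by NCBI.

```python
def overlap(a, b):
    """Number of residues shared by two 1-based, inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Intervals copied from the hit list above; the labels are only for display.
hits = [
    ("PHA03247 (first hit)", (358, 897)),
    ("PHA03247 (second hit)", (707, 1321)),
    ("FN3 cd00063", (1359, 1450)),
    ("FN3 smart00060", (1360, 1440)),
]

# Compare every pair of hits once and report those that share residues.
for i, (name_a, iv_a) in enumerate(hits):
    for name_b, iv_b in hits[i + 1:]:
        shared = overlap(iv_a, iv_b)
        if shared:
            print(f"{name_a} / {name_b}: {shared} residues shared")
```

Overlapping hits from different source databases (cd, smart, pfam) usually describe the same domain, whereas overlapping hits to the same low-complexity model may simply reflect repetitive, proline-rich sequence.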
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
358-897 1.65e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.65e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  358 SEKTPETAQSVL-IPESELLLSSLAPKGSP--------EFPEAKTAFPSEKPGGS-------LASSEEPWVVPGAKTSEd 421
Cdd:PHA03247  2449 ADGDPFFARTILgAPFSLSLLLGELFPGAPvyrrpaeaRFPFAAGAAPDPGGGGPpdpdappAPSRLAPAILPDEPVGE- 2527
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  422 srVVQPRTATYDVVSSSATSDETEVEPHTATSDPIldsvPPKTSRTAEQPRATLAPSEASFDPRTveifTSPEVRPTTAA 501
Cdd:PHA03247  2528 --PVHPRMLTWIRGLEELASDDAGDPPPPLPPAAP----PAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSAR 2597
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  502 PQQTTSIPSTPKR--QSTPKPPRVKPAPEPETRPSAQSTKAP------------PHKTKKPGR-RRPKTTRSPEVPKS-- 564
Cdd:PHA03247  2598 PRAPVDDRGDPRGpaPPSPLPPDTHAPDPPPPSPSPAANEPDphppptvppperPRDDPAPGRvSRPRRARRLGRAAQas 2677
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  565 -------KPALEPATVP------PEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPVTFGSEAPALAIVTTTDIAPVISR 631
Cdd:PHA03247  2678 sppqrprRRAARPTVGSltsladPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  632 TKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPkpkTTPHPEVPQTILVPATSL 711
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSA 2834
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  712 EPVIRTETPG---TTLVPKLSQQPDFPHPKPKTTRSPAAPPTElvsttvfEPVIPLKEDPVTTIVPFTDLEPATDLETPV 788
Cdd:PHA03247  2835 QPTAPPPPPGpppPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA-------PARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  789 AFRTEAPRTTLASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTLAPKKTQIKHRPRPKPKPIPS 868
Cdd:PHA03247  2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
                          570       580
                   ....*....|....*....|....*....
gi 1958666954  869 pevAESKPVPTKEREPVTlRTESWVTTKA 897
Cdd:PHA03247  2988 ---APASSTPPLTGHSLS-RVSSWASSLA 3012
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-1102 9.76e-12

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 70.35  E-value: 9.76e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  548 PGRRRPKTTRSPEVPKSKPalEPATVPPEilvPTIVPKPPQRPKATRRPEAPQIQPAHEPV-TFGSEAPALAIVTTTDIA 626
Cdd:PHA03247  2478 PVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMlTWIRGLEELASDDAGDPP 2552
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  627 PVI------SRTKASVTT--LAPKSSRPRTRQRPKYKATP----SPKIPQTKPDLGPITAEPSLASTTKKVRRPrPKPKT 694
Cdd:PHA03247  2553 PPLppaappAAPDRSVPPprPAPRPSEPAVTSRARRPDAPpqsaRPRAPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSP 2631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  695 TPHP-EVPQTILVPATSLEPVIRTETPGTTLVPKLSQ---QPDFPHPKPKTTRSPAAPPTELVSTTVFEPVIPLK--EDP 768
Cdd:PHA03247  2632 SPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARrlgRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPtpEPA 2711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  769 VTTIVPFTDLEPAtdletPVAFRTEAPRTTLAskksqrtrRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTL 848
Cdd:PHA03247  2712 PHALVSATPLPPG-----PAAARQASPALPAA--------PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  849 APKKTqikhrprPKPKPIPSPEVAESKPVPtkeREPvtlrteswvttkAPKTPKRTHRVRPKPKTTTPEAPLTKPVAATD 928
Cdd:PHA03247  2779 PPRRL-------TRPAVASLSESRESLPSP---WDP------------ADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  929 LESSALSTEVPTTVVLTTALVPATLRTKSPkTTLAPSVQRTRRPRPRPKTTARTDVSESKsvsddlELVAFSTESPQKti 1008
Cdd:PHA03247  2837 TAPPPPPGPPPPSLPLGGSVAPGGDVRRRP-PSRSPAAKPAAPARPPVRRLARPAVSRST------ESFALPPDQPER-- 2907
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1009 aPRQTTSPPPKLKPPHSRRPAKEQVPkgsLHTTSKPKMPPSPE-----VVDITSVPKDEQLSHKPDPEVSQSETVLPPVT 1083
Cdd:PHA03247  2908 -PPQPQAPPPPQPQPQPPPPPQPQPP---PPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
                          570
                   ....*....|....*....
gi 1958666954 1084 FRVEPPKTTIVPLETRDIP 1102
Cdd:PHA03247  2984 PSREAPASSTPPLTGHSLS 3002
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1359-1450 3.57e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1359 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1436
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1958666954 1437 LGEGPASNTVAFST 1450
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
313-628 4.09e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 4.09e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  313 PAESKTPEVEKVAGQPVTVTPETVSRSTKPTLASALDTAETALVLSEKTPETAQSVLIPESELLLSSLAPKGSPEFPEAK 392
Cdd:PHA03247  2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  393 TAFPSekPGGSLASSEEPWVVPGAkTSEDSRVVQPRTATYDVVSSSATSdetevEPHTATSDPILDSVPPKTSRTAEQPR 472
Cdd:PHA03247  2782 RLTRP--AVASLSESRESLPSPWD-PADPPAAVLAPAAALPPAASPAGP-----LPPPTSAQPTAPPPPPGPPPPSLPLG 2853
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  473 ATLAP-------SEASFDPRTVEIFTSPEVR--PTTAAPQQTTSI---PSTPKRQSTPK-PPRVKPAPEPETRPSAQSTK 539
Cdd:PHA03247  2854 GSVAPggdvrrrPPSRSPAAKPAAPARPPVRrlARPAVSRSTESFalpPDQPERPPQPQaPPPPQPQPQPPPPPQPQPPP 2933
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  540 APPHKTKKPGRRRPKTTRSPEVPKSKPALE-PATVPPEILVPTIVPKPPQRPKATrrPEAPQIQPAHEPVT-FGSEAPAL 617
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWlGALVPGRVAVPRFRVPQPAPSREA--PASSTPPLTGHSLSrVSSWASSL 3011
                          330
                   ....*....|.
gi 1958666954  618 AIVTTTDIAPV 628
Cdd:PHA03247  3012 ALHEETDPPPV 3022
PHA03247 PHA03247
large tegument protein UL36; Provisional
309-575 4.93e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 4.93e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  309 TLALPAESKTPEVEKVAGQPVTVTPetvSRSTKPTLASALDTAETALvlSEKTPETAQSVLIPESELLLSSLAPKGSPEF 388
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPP---RRLTRPAVASLSESRESLP--SPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  389 PEAKTAFPSEKPGGSLASSEEP--WVVPGAKtsedsrvvqprtatydvVSSSATSDETEVEPHTATSDPILDSVPPKTSR 466
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVSR 2893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  467 TAE---QPRATLAPseasfdPRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAqstKAPPH 543
Cdd:PHA03247  2894 STEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV---PQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1958666954  544 KTKKPGRRRPKTTRSPEVPKSKPALEPATVPP 575
Cdd:PHA03247  2965 GALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996
PHA03247 PHA03247
large tegument protein UL36; Provisional
707-1321 7.63e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 7.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  707 PATSLEPVIRTETPGttlvPKLSQQPDFPHPKPKTTRSPAAPPTElvstTVFEPVIP--------LKE-------DPvTT 771
Cdd:PHA03247  2483 PAEARFPFAAGAAPD----PGGGGPPDPDAPPAPSRLAPAILPDE----PVGEPVHPrmltwirgLEElasddagDP-PP 2553
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  772 IVPFTDLEPATDLETPVAfrTEAPRTT---LASKKSQRTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLRPEVQVTTL 848
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPP--RPAPRPSepaVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  849 APKKTQikhRPRPKPKPIPSPEVAESKPVPTKEREPvtLRTESWVTTKAPKTPKRTHRVRPKPKTTTPEAPLTKPvaaTD 928
Cdd:PHA03247  2632 SPAANE---PDPHPPPTVPPPERPRDDPAPGRVSRP--RRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP---PP 2703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  929 LESSALSTEVPTTVVLTTALVPATLRTKSPKTTLAPSVQRTRRPRPRPKTTARTdvsesksvsddlelvafstESPQKTI 1008
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-------------------ARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1009 APRQTTSP--PPKLKPPHSRRPAkeqVPKGSLHTTSKPKMP-PSPEVVDITSVPKDEQLSHKPDPEVSQSETVLPPVTFR 1085
Cdd:PHA03247  2765 GPPAPAPPaaPAAGPPRRLTRPA---VASLSESRESLPSPWdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1086 VEPPKTTIVPLETRDIPLIPVISPRPSEEELQTTMEQTDQSTQELFTTKIPRTTE---LAKTTQAPHRLHTTPVRPRIPE 1162
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfaLPPDQPERPPQPQAPPPPQPQP 2921
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1163 RPHGRPALNKTTTRPDRTKSRgmshkngVGPGTKQTPKPSSTGRNTSVDSHATrKPGLIPGTRHRHTSPRP-VPPQRKPL 1241
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPP-------LAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPsREAPASST 2993
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1242 PPNNVTGKPGSAGIISS----SRATSPP--LKATLKPTGTATerpgaEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGK 1315
Cdd:PHA03247  2994 PPLTGHSLSRVSSWASSlalhEETDPPPvsLKQTLWPPDDTE-----DSDADSLFDSDSERSDLEALDPLPPEPHDPFAH 3068

                   ....*.
gi 1958666954 1316 PRFIGP 1321
Cdd:PHA03247  3069 EPDPAT 3074
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
447-769 9.78e-08

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 57.17  E-value: 9.78e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  447 EPHTATSDPILDSVPPKTSRTAEQPRATLAPSEASFDprtveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPA 526
Cdd:PRK07003   359 EPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASA--------VPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAP 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  527 PEPETRPSAQSTKAPPHKTKKPGrrrpkttrSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHE 606
Cdd:PRK07003   431 APPATADRGDDAADGDAPVPAKA--------NARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAA 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  607 PVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTR------------------QRPKYKATPSPKIPQTKPD 668
Cdd:PRK07003   503 TPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARaggaaaaldvlrnagmrvSSDRGARAAAAAKPAAAPA 582
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  669 LGPITAEPSLASTTKKVRRPRPKPKTTPHP------------------EVPQTILVPATSLEPVIrteTPGTTLVPKLSQ 730
Cdd:PRK07003   583 AAPKPAAPRVAVQVPTPRARAATGDAPPNGaaraeqaaesrgapppweDIPPDDYVPLSADEGFG---GPDDGFVPVFDS 659
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1958666954  731 QPDFPHPKPKTTRSPAAPptelVSTTVFEPVIPLkeDPV 769
Cdd:PRK07003   660 GPDDVRVAPKPADAPAPP----VDTRPLPPAIPL--DAI 692
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1360-1440 1.06e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.69  E-value: 1.06e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  1360 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1437
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1958666954  1438 GEG 1440
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
518-1058 3.57e-07

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.08  E-value: 3.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  518 PKPPRVKPAPEPETRPSAQSTKAPPHKtkkpgrrRPKTTRSPEVPKsKPALEPATVPPEILVPTIVPKPPQRPKATRRPE 597
Cdd:PTZ00449   521 PKAPGDKEGEEGEHEDSKESDEPKEGG-------KPGETKEGEVGK-KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE 592
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  598 APQiqpahepvtfgseapalaivtttdiapvisrtkasvttlapKSSRPRTRQRPKYKatPSPKIPQtkpdLGPITAEPS 677
Cdd:PTZ00449   593 EPK-----------------------------------------KPKRPRSAQRPTRP--KSPKLPE----LLDIPKSPK 625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  678 LASTTKKVRRPrPKPKTTPHPEVPQTILVPATSLEPVirtetpgttlVPKLSQQPDFPHPKPKTTRSPAAPPTELVSTTV 757
Cdd:PTZ00449   626 RPESPKSPKRP-PPPQRPSSPERPEGPKIIKSPKPPK----------SPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVV 694
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  758 fepvipLKEDPVTTIVPFTDLEPATDLETPVAFRTEAPRTtlaskksqrtrrprprpPKATLSPqaPKTKTVPAVVLEPV 837
Cdd:PTZ00449   695 ------LDESFESILKETLPETPGTPFTTPRPLPPKLPRD-----------------EEFPFEP--IGDPDAEQPDDIEF 749
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  838 TLRPEvqvttlaPKKTQIKHRPRPKPKPIPSPEVAESKPVPTKEREPvtlrteswvtTKAPKTPKRTHRVRPKPKTTTPE 917
Cdd:PTZ00449   750 FTPPE-------EERTFFHETPADTPLPDILAEEFKEEDIHAETGEP----------DEAMKRPDSPSEHEDKPPGDHPS 812
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  918 APLTKpvaaTDLESSALSTEVPTTVVLTTALVPATLRTKSPKTTLAPSVQRTRRPRPRPKTTARTDVSESKSVSDDLElv 997
Cdd:PTZ00449   813 LPKKR----HRLDGLALSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKIVVDDDGTEADDED-- 886
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958666954  998 AFSTESPQKTIAPRQTtsppPKLKPPHSRRPAKeqvpkgslhtTSKPKMPPSPEVVDITSV 1058
Cdd:PTZ00449   887 THPPEEKHKSEVRRRR----PPKKPSKPKKPSK----------PKKPKKPDSAFIPSIIAI 933
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
384-702 3.89e-07

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.08  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  384 GSPEFPEAkTAFPSEKPGGSLASSEEPWVVPGAKTSEDSRvvQPRTATYDVVsssaTSDETEVEPHTATSDPILDSVP-- 461
Cdd:PTZ00449   509 EPPEGPEA-SGLPPKAPGDKEGEEGEHEDSKESDEPKEGG--KPGETKEGEV----GKKPGPAKEHKPSKIPTLSKKPef 581
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  462 ---PKTSRTAEQPRATLAPSEASfdprtveiftSPEVRPTTAAPQqTTSIPSTPKRQSTPKPPRVKPAPE-------PET 531
Cdd:PTZ00449   582 pkdPKHPKDPEEPKKPKRPRSAQ----------RPTRPKSPKLPE-LLDIPKSPKRPESPKSPKRPPPPQrpssperPEG 650
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  532 RPSAQSTKaPPHKTKKP----------------GRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPK--PPQRPkat 593
Cdd:PTZ00449   651 PKIIKSPK-PPKSPKPPfdpkfkekfyddyldaAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRplPPKLP--- 726
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  594 RRPEAPqiqpaHEPVTfgseapalaivtttdiAPVISRTKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTK-PDLGPI 672
Cdd:PTZ00449   727 RDEEFP-----FEPIG----------------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKeEDIHAE 785
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1958666954  673 TAEPSLAstTKKVRRPRPKPKTTP--HPEVPQ 702
Cdd:PTZ00449   786 TGEPDEA--MKRPDSPSEHEDKPPgdHPSLPK 815
PRK10263 PRK10263
DNA translocase FtsK; Provisional
340-666 5.91e-07

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.71  E-value: 5.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  340 TKPTLASALDTAETALVLSEKTPETaQSVLIPESELLLSSLAPKGSPeFPEAKTAFPSEKPGGSLASSEEPWVVPGAKTS 419
Cdd:PRK10263   317 TEPVAVAAAATTATQSWAAPVEPVT-QTPPVASVDVPPAQPTVAWQP-VPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYN 394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  420 EDSRVVQPRTATYDVVSSSATSDETEVEPHTATSDPILDSVPPKTSRTAEQPRATlAPSEASFDPRTVEIFTSPEVRPTT 499
Cdd:PRK10263   395 EPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQA-EEQQSTFAPQSTYQTEQTYQQPAA 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  500 AAPQQTtsipstpKRQSTPKPPRVKPAPE-PETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEvPKSKPALEPATVPPEIL 578
Cdd:PRK10263   474 QEPLYQ-------QPQPVEQQPVVEPEPVvEETKPARPPLYYFEEVEEKRAREREQLAAWYQ-PIPEPVKEPEPIKSSLK 545
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  579 VPTIVPKPPQRPKATRRPEAPQIQPA--HEPVTFGSEAPALAIVTTTDIAPvisRTKASVTTLAPKSSRPRTRQRPKYKA 656
Cdd:PRK10263   546 APSVAAVPPVEAAAAVSPLASGVKKAtlATGAAATVAAPVFSLANSGGPRP---QVKEGIGPQLPRPKRIRVPTRRELAS 622
                          330
                   ....*....|....
gi 1958666954  657 ----TPSPKIPQTK 666
Cdd:PRK10263   623 ygikLPSQRAAEEK 636
fn3 pfam00041
Fibronectin type III domain;
1360-1443 1.40e-06

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1360 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1436
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1958666954 1437 LGEGPAS 1443
Cdd:pfam00041   79 GGEGPPS 85
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
484-670 1.96e-06

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.57  E-value: 1.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  484 PRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQS--TKAPPHKTKKPGRRRPKTTRSPEV 561
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSpaPEALAAARQASARGPGGAPAPAPA 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  562 PKSKP--ALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHE--PVTFGSEAP-----ALAIVTTTDIA-PVISR 631
Cdd:PRK12323   454 PAAAPaaAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEelPPEFASPAPaqpdaAPAGWVAESIPdPATAD 533
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1958666954  632 TKASVTTLAPKSSRPRTrqrPKYKATPSPKIPQTKPDLG 670
Cdd:PRK12323   534 PDDAFETLAPAPAAAPA---PRAAAATEPVVAPRPPRAS 569
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
284-658 7.89e-06

Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 50.88  E-value: 7.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  284 STVKLP---SSIMLEISDALKAQLAKNETLALPAESKTPEVEKVAGQPVTVTPETVSRS-TKPTLASALDTAETALVLSE 359
Cdd:PRK14949   372 AEISLPegqTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAeADASAEPADTVEQALDDESE 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  360 KTPEtaqsvLIPESELLLSSLAPKGSpEFPEAKTAFPSEKPGGSLASSEEPWVVP-GAKTSEDSRVVQPRTATyDVVSSS 438
Cdd:PRK14949   452 LLAA-----LNAEQAVILSQAQSQGF-EASSSLDADNSAVPEQIDSTAEQSVVNPsVTDTQVDDTSASNNSAA-DNTVDD 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  439 ATSDETEVEPHTATSDPILDSVPPKTSrtAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTP 518
Cdd:PRK14949   525 NYSAEDTLESNGLDEGDYAQDSAPLDA--YQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTA 602
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  519 KPPRV------------------------------KPAPE--PETRPSAQSTKAPPHKTKKPgrRRPKTTRSPEVPKSKP 566
Cdd:PRK14949   603 AASLAdddildavlaardsllsdldalspkegdgkKSSADrkPKTPPSRAPPASLSKPASSP--DASQTSASFDLDPDFE 680
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  567 ALEPATVPPEILVPTIVPKPPQRPKATRRP---EAPQIQPAHEPVTFGSEAPALAIVTTTDIAPvisrTKASVTTLAPKS 643
Cdd:PRK14949   681 LATHQSVPEAALASGSAPAPPPVPDPYDRPpweEAPEVASANDGPNNAAEGNLSESVEDASNSE----LQAVEQQATHQP 756
                          410
                   ....*....|....*
gi 1958666954  644 SRPRTRQRPKYKATP 658
Cdd:PRK14949   757 QVQAEAQSPASTTAL 771
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
448-648 1.16e-05

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  448 PHTATSDPILDSVPPKTSRTAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAAPQ----QTTSIPSTPKRQSTPKP-PR 522
Cdd:PRK12323   374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApealAAARQASARGPGGAPAPaPA 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  523 VKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALE--PATVPPEILVPTIVPKPPQRPKATRRPEAPQ 600
Cdd:PRK12323   454 PAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEelPPEFASPAPAQPDAAPAGWVAESIPDPATAD 533
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1958666954  601 IQPAHEPVTfgsEAPALAIVTTTDIAPvisrtKASVTTLAPKSSRPRT 648
Cdd:PRK12323   534 PDDAFETLA---PAPAAAPAPRAAAAT-----EPVVAPRPPRASASGL 573
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
438-749 1.42e-05

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 1.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  438 SATSDETEVEPHT------ATSDPILDSVPPKTSRTAEQPR--ATLAPSEASFDPRTVEIFTSPEVRPTTAAPQQTTSIP 509
Cdd:PHA03307    25 PATPGDAADDLLSgsqgqlVSDSAELAAVTVVAGAAACDRFepPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  510 STPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKtTRSPEVPKSKPALEPATVPP----------EILV 579
Cdd:PHA03307   105 SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA-ASPPAAGASPAAVASDAASSrqaalplsspEETA 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  580 PTIVPKPPQRPKATRRPEA-PQIQPAHEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRT---RQRPKYK 655
Cdd:PHA03307   184 RAPSSPPAEPPPSTPPAAAsPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENecpLPRPAPI 263
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  656 ATPSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPKP-------KTTPHPEVPQTILVPATSLE-PVIRTETPGTTLVPk 727
Cdd:PHA03307   264 TLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPsspgsgpAPSSPRASSSSSSSRESSSSsTSSSSESSRGAAVS- 342
                          330       340
                   ....*....|....*....|..
gi 1958666954  728 lSQQPDFPHPKPKTTRSPAAPP 749
Cdd:PHA03307   343 -PGPSPSRSPSPSRPPPPADPS 363
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
500-731 1.48e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.87  E-value: 1.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  500 AAPQQTTSIPSTPKRQSTPKPPRVKPAP--EPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEI 577
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAAPAPaaPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  578 LVPTIVPKPPQRPKATrrpeAPQIQPAHEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPKYkAT 657
Cdd:PRK12323   452 PAPAAAPAAAARPAAA----GPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE-SI 526
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958666954  658 PSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPKPKTTPhPEVPQTILVPATSLE-PVIRTETPGTTLVPKLSQQ 731
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP-PRASASGLPDMFDGDwPALAARLPVRGLAQQLARQ 600
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
462-599 2.39e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 48.94  E-value: 2.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  462 PKTSRTAEQPRATLAPSEASFdprtveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAP 541
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEA--------AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAA 437
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958666954  542 PHKTKKPGRRRPKttrSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAP 599
Cdd:PRK14951   438 PAAAPAAVALAPA---PPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
PHA03247 PHA03247
large tegument protein UL36; Provisional
1003-1381 6.04e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 6.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1003 SPQKTIAPRQTTSPPPklkPPHSRRPAKEQVPKGSlhttskPKMPPSPEVVDITSVPKDEQLSHKPDPEVSQSETVLPPV 1082
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDP---PPPSPSPAANEPDPHP------PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1083 TFRVEPPKTTIVPLETRDIPLIPVISPRP------SEEELQTTMEQTDQSTQELFTTKIPRTTELAKTTQAPHRLHTTPV 1156
Cdd:PHA03247  2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPaphalvSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1157 RPRIPER---PHGRPALNKTTTRPDRTKSRGMSHKNGVGPGTKQTPKPSSTGRNTSVDSHATRKPGLIPGTRHRHTSPRP 1233
Cdd:PHA03247  2762 TTAGPPApapPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1234 VP-----------------PQRKPLPPNNVTGKPGSAGIISSSRATSPPLKATLK--------------PTGTATERPGA 1282
Cdd:PHA03247  2842 PPgppppslplggsvapggDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfalppdqperppqPQAPPPPQPQP 2921
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1283 EKKQPTAPASEEEFGNTTDFSSSPTKETDPLGKPRFIGPHvryiPKPDNKPCSITDSVRRFPTEEATEGNATSPPQNPPT 1362
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----PWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLT 2997
                          410
                   ....*....|....*....
gi 1958666954 1363 NLTVVTVEGCPSFVILDWE 1381
Cdd:PHA03247  2998 GHSLSRVSSWASSLALHEE 3016
PHA03378 PHA03378
EBNA-3B; Provisional
396-670 7.39e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 7.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  396 PSEKPGGSLASSEEP-------WVVPGAKTSEDSRVVQPRTATYDVVSSSATSDETEVEPHTAT-SDPILDSVPPKTSRT 467
Cdd:PHA03378   602 PSQTPEPPTTQSHIPetsaprqWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTwTQIGHIPYQPSPTGA 681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  468 AEQPRATLAPSEASFDPRTveiftspevrPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKK 547
Cdd:PHA03378   682 NTMLPIQWAPGTMQPPPRA----------PTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAP 751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  548 PGRRRPKTTRSPEVPKSKPALEPATVPPeilvPTIVPKPPQRPKATRRPEaPQIQPAHEPVTFGSEAPALAIVTTTDIAp 627
Cdd:PHA03378   752 GRARPPAAAPGRARPPAAAPGAPTPQPP----PQAPPAPQQRPRGAPTPQ-PPPQAGPTSMQLMPRAAPGQQGPTKQIL- 825
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 1958666954  628 visrtkASVTTLAPKSSRPRTRqRPKYKATPSPKIPQTKPDLG 670
Cdd:PHA03378   826 ------RQLLTGGVKRGRPSLK-KPAALERQAAAGPTPSPGSG 861
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
493-702 8.05e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.07  E-value: 8.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  493 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPAT 572
Cdd:NF033839   292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEK 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  573 VPPEILVPTIVPKPPQRPKatrrPEAPQIQPAHEPVTFGSEapalaivtttdIAPVISRTKASVTTLAPKSSRPRTRQRP 652
Cdd:NF033839   372 PKPEVKPQPETPKPEVKPQ----PEKPKPEVKPQPEKPKPE-----------VKPQPEKPKPEVKPQPEKPKPEVKPQPE 436
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958666954  653 KYKATPSPKIPQTKPDLGPitaEPSLASTTKKVRRPRPKPKTTPHPEVPQ 702
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKP---QPETPKPEVKPQPEKPKPEVKPQPEKPK 483
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
496-657 1.07e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 47.02  E-value: 1.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  496 RPTTAAPQQTTSIPSTPKRqstpkPPRVKPAPEPETRPSAQSTKAPPhktkkPGRRRPKTTRSPEVPKSKPALEPATVPP 575
Cdd:PRK14951   365 KPAAAAEAAAPAEKKTPAR-----PEAAAPAAAPVAQAAAAPAPAAA-----PAAAASAPAAPPAAAPPAPVAAPAAAAP 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  576 eilvptiVPKPPQRPKATRRPEAPQIQPAHEPVtfgseapalAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPKYK 655
Cdd:PRK14951   435 -------AAAPAAAPAAVALAPAPPAQAAPETV---------AIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDVWH 498

                   ..
gi 1958666954  656 AT 657
Cdd:PRK14951   499 AT 500
fn3 pfam00041
Fibronectin type III domain;
116-195 1.49e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 42.02  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCSSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1958666954  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
462-684 1.54e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  462 PKTSRTAEQPRATLAPSEASFDPrtvEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPpRVKPAPE-PETRPSAQSTKA 540
Cdd:NF033839   308 KEVKPEPETPKPEVKPQLEKPKP---EVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKP-EVKPQPEkPKPEVKPQPETP 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  541 PPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKatrrPEAPQIQPAHEPVTFGSEAPALAIV 620
Cdd:NF033839   384 KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQ----PEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958666954  621 TTTDIAPVISRTKASVttlapkSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTKK 684
Cdd:NF033839   460 PKPEVKPQPEKPKPEV------KPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEK 517
PRK10263 PRK10263
DNA translocase FtsK; Provisional
454-745 2.12e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 2.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  454 DPILD--SVPPKTSRTAEQPRAT---LAPSEASFDPRTVEIFTSPEVRPTTAapQQTTSIPSTPKRQSTPKPPRVKPAP- 527
Cdd:PRK10263   308 DPLLNgaPITEPVAVAAAATTATqswAAPVEPVTQTPPVASVDVPPAQPTVA--WQPVPGPQTGEPVIAPAPEGYPQQSq 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  528 ---------EPETRPsAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIV--PKPPQRPKATRRP 596
Cdd:PRK10263   386 yaqpavqynEPLQQP-VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAeeQQSTFAPQSTYQT 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  597 EAPQIQPAHEPVTFgSEAPALAIVTTTDIAPVISRTKASVTTL----APKSSRPRTRQR--------PKYKATPSPKIPQ 664
Cdd:PRK10263   465 EQTYQQPAAQEPLY-QQPQPVEQQPVVEPEPVVEETKPARPPLyyfeEVEEKRAREREQlaawyqpiPEPVKEPEPIKSS 543
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  665 TKPD----LGPITAEPSLASTTKKVRrprpkpKTTPHPEVPQTILVPATSLepvirteTPGTTLVPKLSQQPDFPHPKPK 740
Cdd:PRK10263   544 LKAPsvaaVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL-------ANSGGPRPQVKEGIGPQLPRPK 610

                   ....*
gi 1958666954  741 TTRSP 745
Cdd:PRK10263   611 RIRVP 615
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1201-1455 2.30e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 45.76  E-value: 2.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1201 PSSTGRNTSVDSHATRKPGLIPGTRHRHTSPRPVPPQRKPLPPNNVTGKPGSAGIISSSRATSPPLKATLKPTGTATERP 1280
Cdd:COG3401     73 AGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDG 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1281 GAEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGKPRFIGPHVRYIPKPDN----KPCSITDSVRRFPTEEATEGNATSP 1356
Cdd:COG3401    153 ANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTtyyyRVAATDTGGESAPSNEVSVTTPTTP 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1357 PqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1436
Cdd:COG3401    233 P-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDA 307
                          250       260
                   ....*....|....*....|
gi 1958666954 1437 LG-EGPASNTVAFSTESADP 1455
Cdd:COG3401    308 AGnESAPSNVVSVTTDLTPP 327
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
496-750 2.40e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 2.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  496 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKP--APEPET----RPSAQSTKAPPHKTKKPGRRRPKTTRSPevpkskPALE 569
Cdd:pfam03154  324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPhiKPPPTTpipqLPNPQSHKHPPHLSGPSPFQMNSNLPPP------PALK 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  570 PATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPahepvtfgseapalAIVTTTDIAPVISRTKASVTTLAPKSSRPRTR 649
Cdd:pfam03154  398 PLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP--------------PVLTQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  650 QRPKYKATPSPKIPqtkpdlgPITAEPSLASTTKKVRRPRPKPKTTPHPeVPQTilvPATSLEPVIRTETPgttlvPKLS 729
Cdd:pfam03154  464 QHPFVPGGPPPITP-------PSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEA 527
                          250       260
                   ....*....|....*....|.
gi 1958666954  730 QQPDFPHPKPkttRSPAAPPT 750
Cdd:pfam03154  528 EEPESPPPPP---RSPSPEPT 545
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1344-1498 4.59e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.99  E-value: 4.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1344 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1421
Cdd:COG3401    314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958666954 1422 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1498
Cdd:COG3401    388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
427-618 5.63e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 5.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  427 PRTATYDVVSSSATSDETEVEPHTATSDPILDSVPPKTSRTAEQPRAtlAPSEASFDPRTVEIFTSPEVRPTTAAPQQTT 506
Cdd:PRK12323   375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA--APARRSPAPEALAAARQASARGPGGAPAPAP 452
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  507 SIPSTPKRQSTP--KPPRVKP--APEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPAL---------EPATV 573
Cdd:PRK12323   453 APAAAPAAAARPaaAGPRPVAaaAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPagwvaesipDPATA 532
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958666954  574 PPEILVPTIVPKPPQ----RPKATRRPEAPQIQPAHE----PVTFGSEAPALA 618
Cdd:PRK12323   533 DPDDAFETLAPAPAAapapRAAAATEPVVAPRPPRASasglPDMFDGDWPALA 585
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
532-726 7.51e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.10  E-value: 7.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  532 RPSAQSTKA-PPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTivPKPPQRPKATRRPEAPQIQPAHEPVTF 610
Cdd:PRK12323   364 RPGQSGGGAgPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAA--AARAVAAAPARRSPAPEALAAARQASA 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  611 GSEAPALAIVTTTDIAPV--ISRTKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTTK----K 684
Cdd:PRK12323   442 RGPGGAPAPAPAPAAAPAaaARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAapagW 521
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1958666954  685 VRRPRPKPKTTPhPEVPQTILVPATSLEPVIRTETPGTTLVP 726
Cdd:PRK12323   522 VAESIPDPATAD-PDDAFETLAPAPAAAPAPRAAAATEPVVA 562
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
311-671 7.53e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 7.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  311 ALPAESKTPEVEKVAGQPVTVTPETVSRSTKPTL-----ASALDTAETALVLSEKTPETAQSVlipeselllsslapkgs 385
Cdd:PHA03307   124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAspaavASDAASSRQAALPLSSPEETARAP----------------- 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  386 pefPEAKTAFPSEKPGGSLASSEEPWVVPGAKTSEDSRVVQPRTATYDVVSSSATSDETEvephtatsdpildSVPPKTS 465
Cdd:PHA03307   187 ---SSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE-------------SSGCGWG 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  466 RTAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAApqqttsiPSTPKRQSTPKPPRVKP-APEPETRPSAQSTKAPPHK 544
Cdd:PHA03307   251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPAS-------SSSSPRERSPSPSPSSPgSGPAPSSPRASSSSSSSRE 323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  545 TKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPVTFGSEAPAlaivtttd 624
Cdd:PHA03307   324 SSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA-------- 395
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1958666954  625 iAPVISRTKASVTTlapkssRPRTRQRPKYKATPSPKIPQTKPDLGP 671
Cdd:PHA03307   396 -GRARRRDATGRFP------AGRPRPSPLDAGAASGAFYARYPLLTP 435
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
444-676 8.80e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.76  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  444 TEVEPHTATSDPIlDSVPPKTSRTAEQPratLAPSEASFD--PRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKP- 520
Cdd:PLN03209   344 TKPVTPEAPSPPI-EEEPPQPKAVVPRP---LSPYTAYEDlkPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSa 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  521 ---PRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPkttrSPEVPKSKPALEPATVP--PEILVPTIVPKPPQRPKATRR 595
Cdd:PLN03209   420 snvPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSP----TAPTGVSPSVSSTSSVPavPDTAPATAATDAAAPPPANMR 495
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  596 PEAPQIQPA--HEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPkykATPSPKIPQTKPDLGPIT 673
Cdd:PLN03209   496 PLSPYAVYDdlKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRP---LSPYTMYEDLKPPTSPTP 572

                   ...
gi 1958666954  674 AEP 676
Cdd:PLN03209   573 SPV 575
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
471-607 1.15e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  471 PRATLAPSEAsfdprtVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGR 550
Cdd:PRK07764   371 ERGLLARLER------LERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958666954  551 RRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHEP 607
Cdd:PRK07764   445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
481-576 1.27e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.73  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  481 SFDPRTVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPhktkKPGRRRPKTTRSPE 560
Cdd:PRK12270    24 SVDPSWREFFADYGPGSTAAPTAAAA-AAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP----KPAAAAAAAAAPAA 98
                           90
                   ....*....|....*.
gi 1958666954  561 VPKSKPALEPATVPPE 576
Cdd:PRK12270    99 PPAAAAAAAPAAAAVE 114
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
482-634 1.38e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.32  E-value: 1.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  482 FDPRTVEifTSPEVRPTTAAPQQTTSIPSTPkRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEV 561
Cdd:PRK07994   359 FHPAAPL--PEPEVPPQSAAPAASAQATAAP-TAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958666954  562 PKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPV-TFGSEAPALAIVTTTDIAPVISRTKA 634
Cdd:PRK07994   436 TKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKaTNPVEVKKEPVATPKALKKALEHEKT 509
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
116-195 1.44e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.40  E-value: 1.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  116 KPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcssdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYEFG 193
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYEFR 73

                   ..
gi 1958666954  194 VK 195
Cdd:cd00063     74 VR 75
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
448-599 1.84e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  448 PHTATSDPildSVPPKTSRTAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAAPQQTTSIPST--------PKRQSTPK 519
Cdd:PRK07994   361 PAAPLPEP---EVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQllaarqqlQRAQGATK 437
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  520 PPRVKPAPEPETRPSAQSTKAPPHKtkkpgrrRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAP 599
Cdd:PRK07994   438 AKKSEPAAASRARPVNSALERLASV-------RPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTP 510
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
394-714 2.01e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.91  E-value: 2.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  394 AFPSEKPGGSLASSEEPWVVPGA--------KTSEDSRVVQPRTATYDVVSSSATSDETEVEPHTATSDPILDSVPPKTS 465
Cdd:PRK07003   357 AFEPAVTGGGAPGGGVPARVAGAvpapgaraAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATA 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  466 RTAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKT 545
Cdd:PRK07003   437 DRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAA 516
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  546 KKPGRRRPKTTRSPEVPKSKPAlepATVPP----------EILVPTIVPKPPQRpkaTRRPEAPQIQPAHEPVTFGSEAP 615
Cdd:PRK07003   517 SREDAPAAAAPPAPEARPPTPA---AAAPAaraggaaaalDVLRNAGMRVSSDR---GARAAAAAKPAAAPAAAPKPAAP 590
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  616 ALAIVTTTDIAPViSRTKASVTTLAPKSSRPRTRQRPKykatPSPKIPQTkpDLGPITAE-----------PSLASTTKK 684
Cdd:PRK07003   591 RVAVQVPTPRARA-ATGDAPPNGAARAEQAAESRGAPP----PWEDIPPD--DYVPLSADegfggpddgfvPVFDSGPDD 663
                          330       340       350
                   ....*....|....*....|....*....|
gi 1958666954  685 VRRPrPKPKTTPHPEVPQTILVPATSLEPV 714
Cdd:PRK07003   664 VRVA-PKPADAPAPPVDTRPLPPAIPLDAI 692
PHA03379 PHA03379
EBNA-3A; Provisional
496-748 2.38e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 42.74  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  496 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKK----PGRRRPKT--------TRSPEVPK 563
Cdd:PHA03379   410 EPTYGTPRPPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSmapcPVAQLPPGplqdlepgDQLPGVVQ 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  564 S-KPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAhEPVTfgseAPALAIVTTTDIAPvisrTKASVTTLAPK 642
Cdd:PHA03379   490 DgRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPV-EPVP----VPTVALERPVCPAP----PLIAMQGPGET 560
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  643 SSRPRTRQRpkYKATPSPKIPQTKPDLGPITAEPSLASTTKKVR------RPRPKPKTTP-----HPEVPQTILVPATSL 711
Cdd:PHA03379   561 SGIVRVRER--WRPAPWTPNPPRSPSQMSVRDRLARLRAEAQPYqasvevQPPQLTQVSPqqpmeYPLEPEQQMFPGSPF 638
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1958666954  712 EPVIRTETPGTtlVPKLSQQP-DFPHPKPKTTRSPAAP 748
Cdd:PHA03379   639 SQVADVMRAGG--VPAMQPQYfDLPLQQPISQGAPLAP 674
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
117-195 2.55e-03

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 38.36  E-value: 2.55e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954   117 PLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCSSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTVYEF 192
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTEYEF 72

                    ...
gi 1958666954   193 GVK 195
Cdd:smart00060   73 RVR 75
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
311-649 2.71e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 42.25  E-value: 2.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  311 ALPAESKTPEVEKVAGQPVTV--TPETVSRSTKPTLASALDTAETALVLSEKTPETAQSVLIPESELLLSSLAPKGS--P 386
Cdd:pfam17823  129 SLPAAIAALPSEAFSAPRAAAcrANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATltP 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  387 EFPEAKTAFPSEKPGGSLASSEEPWVVPGAKTSEDSRVVQPRTATYDVVSSSATSDETEVEPHTatSDPILDSVPP-KTS 465
Cdd:pfam17823  209 ARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINM--GDPHARRLSPaKHM 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  466 RTAEQPRATLAPSEASFDPRTVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAP 541
Cdd:pfam17823  287 PSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTagepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  542 PhktkkpgrrrpkTTRSPEVPKSKPALEPATVPPeilvptivpkppqrpkaTRRPEAPQIQPAHEPVtfGSEAPAlaivt 621
Cdd:pfam17823  367 H------------TSMIPEVEATSPTTQPSPLLP-----------------TQGAAGPGILLAPEQV--ATEATA----- 410
                          330       340
                   ....*....|....*....|....*...
gi 1958666954  622 TTDIAPVISRTKASVTTLAPKSSRPRTR 649
Cdd:pfam17823  411 GTASAGPTPRSSGDPKTLAMASCQLSTQ 438
NESP55 pfam06390
Neuroendocrine-specific golgi protein P55 (NESP55); This family consists of several mammalian ...
436-568 2.97e-03

Neuroendocrine-specific golgi protein P55 (NESP55); This family consists of several mammalian neuroendocrine-specific golgi protein P55 (NESP55) sequences. NESP55 is a novel member of the chromogranin family and is a soluble, acidic, heat-stable secretory protein that is expressed exclusively in endocrine and nervous tissues, although less widely than chromogranins.


Pssm-ID: 115071 [Multi-domain]  Cd Length: 261  Bit Score: 41.39  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  436 SSSATSDETEVEPHTATSDPILDSVPPKTSrtAEQPRATLAPSEASFDPRTVEIFTSPEVRPTTAAP---QQTTSIPSTP 512
Cdd:pfam06390  122 PESDIESETEFETEPETEPDTAPTTEPETE--PEDEPGPVVPKGATFHQSLTERLHALKLQSADASPrraPPSTQEPESA 199
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  513 KRQSTPKPPRVKPAP-EPETRPSA-QSTKAPPH--KTKKPGRRRPKttrSPEVPKSKPAL 568
Cdd:pfam06390  200 REGEEPERGPLDKDPrDPEEEEEEkEEEKQQPHrcKPKKPARRRDP---SPESPPKKGAI 256
PHA03377 PHA03377
EBNA-3C; Provisional
519-710 3.04e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.35  E-value: 3.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  519 KPPRVKPAPEPETRPSAQSTKAPPHKTKKPGR-----RRPKTTRSPEVPKSKPALEPATVPpeilvPTIVPKPPQRPKAT 593
Cdd:PHA03377   414 RKPRTLPWPTPKTHPVKRTLVKTSGRSDEAEQaqstpERPGPSDQPSVPVEPAHLTPVEHT-----TVILHQPPQSPPTV 488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  594 rrpeapQIQPAHEPVTFGSEApalAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQ--------RPKYKATPSPKIPQ- 664
Cdd:PHA03377   489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVqdgfqrsgRRQKRATPPKVSPSd 559
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958666954  665 ------TKPDLGPITAEPSLASTTKKVRRPRPKPKTTPHPEVPQTILVPATS 710
Cdd:PHA03377   560 rgppkaSPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASG 611
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
414-768 3.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 3.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  414 PGAKTSEDSRVVQPRTATYDVVSSSATSDETEVEPHTATSDPILDSVPPKTSRTAEQPRATLAPSEASFDPRTVEIFTSP 493
Cdd:pfam03154  188 PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLP 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  494 EVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpaPEPETRPSAQSTKAPPHKTKKPG--RRRPKTTRS-PEVPKSKPALEP 570
Cdd:pfam03154  268 QPSLHGQMPPMPHSLQTGPSHMQHPVPPQ----PFPLTPQSSQSQVPPGPSPAAPGqsQQRIHTPPSqSQLQSQQPPREQ 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  571 ATVPPEILVPTIVPKP----PQRPKATRRPEAPQIQpAHEPVTFGSEAPalaivtttdiAPVISRTKASVTTLAPKSSRP 646
Cdd:pfam03154  344 PLPPAPLSMPHIKPPPttpiPQLPNPQSHKHPPHLS-GPSPFQMNSNLP----------PPPALKPLSSLSTHHPPSAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  647 RTRQ-RPKYKATPSPkiPQTKPDLGPITAEPSLASTTKKVRRPRPKPKTTPHPEVPQTILVPATSLEPvirtetpgttLV 725
Cdd:pfam03154  413 PPLQlMPQSQQLPPP--PAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP----------SG 480
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1958666954  726 PKLSQQPDFPHPKPKTTRSPAAPPTELVSTTVFEPVIPLKEDP 768
Cdd:pfam03154  481 PPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEA 523
PHA03247 PHA03247
large tegument protein UL36; Provisional
492-604 3.39e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  492 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPA 571
Cdd:PHA03247   379 SLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPA 458
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1958666954  572 TVPPEILVPTIVPKPPQRPKATRRPEAPQIQPA 604
Cdd:PHA03247   459 TEPAPDDPDDATRKALDALRERRPPEPPGADLA 491
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
308-782 3.66e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 3.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  308 ETLALPAESKTPEVEKVAGQpvtvTPETVSRSTKPTLASaldtaeTALVLSEKTPETAQSVLIPESELLLSSLAPKGS-- 385
Cdd:COG5665    102 EPLGRLVASTGLNASGVSAN----SAATIAPGANATLTS------SAGADSLQASSEMALWGPRRVALVVRDGASNPVav 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  386 -PEFPEAKTAFPSEKPGGSLASseepwvVPGAKTSEDSrvvQPRTATYDVVSSSATSDETEVEP-------HTATSDPIL 457
Cdd:COG5665    172 vVTTMIAVPSAPAAPPNAVDYS------VLVPIAAQDP---AASVSTPQAFNASATSGRSQHIVqaakrvgVEWWGDPSL 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  458 DSVPPKTSRTAEQPRATLAPSEASFDPRTVEIFTSPEVR--------------PTTAAPQQTTSIPSTPKRQSTPKPPRV 523
Cdd:COG5665    243 LATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntptstakaqpqpPTKKQPAKEPPSDTASGNPSAPSVLIN 322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  524 KPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEIlvpTIVPKPPQRPKATRRPEAPQIQP 603
Cdd:COG5665    323 SDSPTSEDPATASVPTTEETTAFTTPSSVPSTPAEKDTPATDLATPVSPTPPET---SVDKKVSPDSATSSTKSEKEGGT 399
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  604 AHEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPkyKATPSPKIPqtkpdlgpitaePSLASTTK 683
Cdd:COG5665    400 ASSPMPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRT--EASPSAGSD------------LEPENTTL 465
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  684 KVRRPRPKPKTTPHPEVPQTILVPATSLEPVIRTETPGTTLVPKLSQQ--PDFPHPKPKTTRSPAAPPTELVSttvfepv 761
Cdd:COG5665    466 RDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQSivGVLAFGLDQRTQAEISVEAASRS------- 538
                          490       500
                   ....*....|....*....|.
gi 1958666954  762 IPLKEDPVTTIVPFTDLEPAT 782
Cdd:COG5665    539 NPLLNSQVKSFPLGKRSEGAK 559
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
604-847 3.79e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  604 AHEP-VTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPKYKATPSPKIPQTKPDLGPITAEPSLASTT 682
Cdd:PRK07003   357 AFEPaVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATA 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  683 KKVRRPRPKPKTTPHPEVPQTilvpATSLEPVIRTETPGTTLVPKLSQQPDFPHPKPKTTRSPAAPPTELVSTTVFEPVI 762
Cdd:PRK07003   437 DRGDDAADGDAPVPAKANARA----SADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARA 512
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  763 PLKEDPVTTIVPFTDLEPATDLETPVAfRTEAPRTTLASKKSQ--RTRRPRPRPPKATLSPQAPKTKTVPAVVLEPVTLR 840
Cdd:PRK07003   513 PAAASREDAPAAAAPPAPEARPPTPAA-AAPAARAGGAAAALDvlRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPR 591

                   ....*..
gi 1958666954  841 PEVQVTT 847
Cdd:PRK07003   592 VAVQVPT 598
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
462-618 4.09e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.77  E-value: 4.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  462 PKTSRTA-EQPRATLAPSEASfdPRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQSTKA 540
Cdd:PRK07994   361 PAAPLPEpEVPPQSAAPAASA--QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKA 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  541 PPHKTKKPGRRRPKTT---RSPEVPKSKPALEPATVPPEILVPTIVPKPPQRPKATRRPEAPQIQPAHEPvtfgseAPAL 617
Cdd:PRK07994   439 KKSEPAAASRARPVNSaleRLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEK------TPEL 512

                   .
gi 1958666954  618 A 618
Cdd:PRK07994   513 A 513
PHA03247 PHA03247
large tegument protein UL36; Provisional
1023-1362 4.28e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1023 PHSRRPAKEQVPkgslhttSKPKMPPSPEVvditsvpkdeqlSHKPDPEVSQSETVLPPVTFRVEPPKTTIVP------- 1095
Cdd:PHA03247  2478 PVYRRPAEARFP-------FAAGAAPDPGG------------GGPPDPDAPPAPSRLAPAILPDEPVGEPVHPrmltwir 2538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1096 ----LETRDI----PLIPVISPRPSeeelqttmeqTDQSTQElfTTKIPRTTELAKTTQAphrlhttpVRPRIPERPhGR 1167
Cdd:PHA03247  2539 gleeLASDDAgdppPPLPPAAPPAA----------PDRSVPP--PRPAPRPSEPAVTSRA--------RRPDAPPQS-AR 2597
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1168 PalnkTTTRPDRTKSRGMSHKNGVGPGTKQTPKPSSTGRntsvdSHATRKPGlipgtrhRHTSPRPVPPQRKPLPPNNVT 1247
Cdd:PHA03247  2598 P----RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-----PAANEPDP-------HPPPTVPPPERPRDDPAPGRV 2661
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1248 GKPGSAGIISSSRATSPPLKATLKPTGTATERPGAEKKQPTAPASEEEFGNTTDFSSSPTKETDPLGKPRFIGPHVRYIP 1327
Cdd:PHA03247  2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1958666954 1328 KPDNKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1362
Cdd:PHA03247  2742 PAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
897-1111 4.66e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  897 APKTPKRTHRVRPKPKTTTPEAPLTKPVAAtdlesSALSTEVPTTVVLTTALVPATLRTKSPKTTLAPSVQRTRRPRPRP 976
Cdd:PRK12323   373 GPATAAAAPVAQPAPAAAAPAAAAPAPAAP-----PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  977 KTTARTDVSESKSvsddlelvAFSTESPQKTIAPRQTTSPPPKLKPPHSRRPAKEQVPKGSlhttSKPKMPPSPEVVDIT 1056
Cdd:PRK12323   448 PAPAPAPAAAPAA--------AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE----ELPPEFASPAPAQPD 515
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1958666954 1057 SVPKDEQLSHKPDPEVSQsetvlPPVTFRVEPPKTTIVPLETRDIPLIPVISPRP 1111
Cdd:PRK12323   516 AAPAGWVAESIPDPATAD-----PDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565
PHA03377 PHA03377
EBNA-3C; Provisional
416-647 5.26e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 41.58  E-value: 5.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  416 AKTSEDSRVvQPRTATYDVVSSSATSDETEVEPHTATSDPILDSVPPKTS--RTAEQPRATLAPSEASFDPRTVEI---- 489
Cdd:PHA03377   698 AQPSEESHL-SSMSPTQPISHEEQPRYEDPDDPLDLSLHPDQAPPPSHQApySGHEEPQAQQAPYPGYWEPRPPQApylg 776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  490 FTSPEVRPTTAAPQQTTSIPSTPKRQS-----------------------TPKPPRVKPAPEPETRPSAQSTKAPPHKTK 546
Cdd:PHA03377   777 YQEPQAQGVQVSSYPGYAGPWGLRAQHpryrhswaywsqypghghpqgpwAPRPPHLPPQWDGSAGHGQDQVSQFPHLQS 856
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  547 KPGRRRPKTTRSPEVPKSKPaLEPATVPPEILVPtivPKPPQRPKATRRPEAPqiQPAHEPVTFGSEAPALAIVTTTDIA 626
Cdd:PHA03377   857 ETGPPRLQLSQVPQLPYSQT-LVSSSAPSWSSPQ---PRAPIRPIPTRFPPPP--MPLQDSMAVGCDSSGTACPSMPFAS 930
                          250       260
                   ....*....|....*....|.
gi 1958666954  627 PVISRTKASVTTLAPKSSRPR 647
Cdd:PHA03377   931 DYSQGAFTPLDINAQTPKRPR 951
dnaA PRK14086
chromosomal replication initiator protein DnaA;
451-654 5.46e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 5.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  451 ATSDPILDSVPPKTsrtaEQPRATLAPSEASFDPRTVEIFTSPE---VRPTTA-APQQTTSIPSTPKRQSTPKP---PRv 523
Cdd:PRK14086    86 ITVDPSAGEPAPPP----PHARRTSEPELPRPGRRPYEGYGGPRaddRPPGLPrQDQLPTARPAYPAYQQRPEPgawPR- 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  524 KPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEvPKSKPALEPATVPPEilvptiVPKPPQRPKATRRPEAPQIQP 603
Cdd:PRK14086   161 AADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPY-DAGRPEYDQRRRDYD------HPRPDWDRPRRDRTDRPEPPP 233
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958666954  604 -AHEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPKY 654
Cdd:PRK14086   234 gAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKY 285
PRK11633 PRK11633
cell division protein DedD; Provisional
458-541 6.19e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.99  E-value: 6.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  458 DSVPPKTSRTAEQP--------RATLAPSEASFDPRTVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 529
Cdd:PRK11633    54 DMMPAATQALPTQPpegaaeavRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKP 133
                           90
                   ....*....|..
gi 1958666954  530 ETRPSAQSTKAP 541
Cdd:PRK11633   134 EPKPVVEEKAAP 145
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1002-1231 6.61e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.21  E-value: 6.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1002 ESPQKTIAPRQTTSPPPKLKPPHSRRPAKEQVPKgslhttsKPKMPPSPEvvditsVPKDEQLSHK-----PDPEVSQSE 1076
Cdd:PTZ00449   622 KSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK-------SPKPPKSPK------PPFDPKFKEKfyddyLDAAAKSKE 688
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954 1077 TVlppVTFRVEPPKTTIVPLETRDIPLIPVISPRPSEEELQTTMEQTDQSTQELFTTKIPRTTELAKTTQAPHRLHTTPV 1156
Cdd:PTZ00449   689 TK---TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPA 765
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958666954 1157 RPRIPErphgrpALNKTTTRPDRTKSRGMSHKNGVGPGTKQTPKPSSTGRNTSVDSHATRKPGLIPGTRHRHTSP 1231
Cdd:PTZ00449   766 DTPLPD------ILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLPKKRHRLDGLALSTTDLESDA 834
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
500-695 7.43e-03

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 40.91  E-value: 7.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  500 AAPQQTTSIPSTPkrQSTPKPPRVKPAPEPETRPSAQSTKAPP------HKTKKPGRRRPKTTRS--PEVPKSKPALEPA 571
Cdd:pfam15685  377 GAPRRRAAALSGP--WGSPPPPPGKAHPIPGPRRPAPALLAPPmfifpaPTNGEPVRPGPPAPQAllPRPPPPTPPATPP 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  572 TVPPEIlvpTIVPKPPQRPKATRRPEAPQIQPAH-EPVTFGSEAPAL--AIVTTTDIAPVISRTKASVTTLAPKSSRPrt 648
Cdd:pfam15685  455 PVPPPI---PQLPALQPMPLAAARPPTPRPCPGHgESALAPAPTAPLppALAADQAPAPALAAAPAPSPAPAPATADP-- 529
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1958666954  649 rqrpkYKATPSPKIPQTKPDLGPITAEPSLASTTKKVRRPRPKPKTT 695
Cdd:pfam15685  530 -----LPPAPAPIKARTRKNKGPRAARGATREDGAPGDGPREKTAAT 571
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
509-702 9.07e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 9.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  509 PSTPKRQSTPKPPRVKPAPEPETRPSAQSTKAPPHKTKKPGRRRPKTTRSPEVPKSKPALEPATVPPEILVPTIVPKPPQ 588
Cdd:PRK07764   591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958666954  589 RPKATRRPEAPQIQPAHEPVTFGSEAPALAIVTTTDIAPVISRTKASVTTLAPKSSRPRTRQRPKYKATPSPkipqTKPD 668
Cdd:PRK07764   671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP----PEPD 746
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1958666954  669 LGPITAEPSLASTTKKVRRPRPKPKTTPHPEVPQ 702
Cdd:PRK07764   747 DPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.