Conserved domains on [gi|334183349|ref|NP_001117499|]

Nuclear pore complex protein [Arabidopsis thaliana]

List of domain hits

Name Accession Description Interval E-value
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
1102-1472 5.81e-07

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 5.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1102 SPISASKPSFNWSGNKSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEkkagefkfSEAKANAFVET 1181
Cdd:pfam17823   68 APVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQ--------SLPAAIAALPS 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1182 AAGSVQRLSTTSSGSDFESSKGFGAQFSTmSSGAPASSFSSKSLFGfNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFT 1261
Cdd:pfam17823  140 EAFSAPRAAACRANASAAPRAAIAAASAP-HAASPAPRTAASSTTA-ASSTTAASSAPTTAASSAPATLTPARGISTAAT 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1262 ASSAPVSSSSQDPVPasipiSSAPVPQTFS-VTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPStpspspgptagftfnl 1340
Cdd:pfam17823  218 ATGHPAAGTALAAVG-----NSSPAAGTVTaAVGTVTPAALATLAAAAGTVASAAGTINMGDPH---------------- 276
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1341 pALSPSSPEMVSSSTGQSSLFPPSAPTSQVSSDQASatsslTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVpIT 1420
Cdd:pfam17823  277 -ARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVS-----TDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAV-VT 349
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 334183349  1421 EPVSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTP 1472
Cdd:pfam17823  350 TTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
PPE super family cl35037
PPE-repeat protein [Function unknown];
1637-1731 3.39e-05

PPE-repeat protein [Function unknown];


The actual alignment was detected with superfamily member COG5651:

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.35  E-value: 3.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1637 FGQPSQIGG--GQQALGSVLGSFGQSRQIGAGLPGATFGSPTGFGGSNPGSG---LPNAPASGGFAAAGSSATGGFAAMA 1711
Cdd:COG5651   167 FTQPPPTITnpGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGpigLNSGPGNTGFAGTGAAAGAAAAAAA 246
                          90       100
                  ....*....|....*....|
gi 334183349 1712 SAGRGFAGASSTPTGGFAAL 1731
Cdd:COG5651   247 AAAAAGAGASAALASLAATL 266
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
1580-1673 3.77e-03

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.



Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 38.37  E-value: 3.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1580 GPFGNATTTTSNPFNMTvPSGELFKPASFNFQNPQPSQPAGFGSFSVTPSQTPAQSGFGQPSQiGGGQQALGSVLGSFGQ 1659
Cdd:pfam13634    1 GLFGAATSTSGGLFGNT-STTAASGGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNT-TTQTATGGGLFGNNAA 78
                           90
                   ....*....|....
gi 334183349  1660 SRQIGAGlpGATFG 1673
Cdd:pfam13634   79 TTTSTTG--GGLFG 90
 
Name Accession Description Interval E-value
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
1102-1472 5.81e-07

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 5.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1102 SPISASKPSFNWSGNKSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEkkagefkfSEAKANAFVET 1181
Cdd:pfam17823   68 APVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQ--------SLPAAIAALPS 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1182 AAGSVQRLSTTSSGSDFESSKGFGAQFSTmSSGAPASSFSSKSLFGfNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFT 1261
Cdd:pfam17823  140 EAFSAPRAAACRANASAAPRAAIAAASAP-HAASPAPRTAASSTTA-ASSTTAASSAPTTAASSAPATLTPARGISTAAT 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1262 ASSAPVSSSSQDPVPasipiSSAPVPQTFS-VTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPStpspspgptagftfnl 1340
Cdd:pfam17823  218 ATGHPAAGTALAAVG-----NSSPAAGTVTaAVGTVTPAALATLAAAAGTVASAAGTINMGDPH---------------- 276
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1341 pALSPSSPEMVSSSTGQSSLFPPSAPTSQVSSDQASatsslTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVpIT 1420
Cdd:pfam17823  277 -ARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVS-----TDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAV-VT 349
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 334183349  1421 EPVSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTP 1472
Cdd:pfam17823  350 TTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1210-1651 1.21e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 53.77  E-value: 1.21e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1210 TMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFtassapvssssqdpvpaSIPISSAPVpqt 1289
Cdd:pfam05109  367 TLTSGTPSGCENISGAFASNRTFDITVSGLGTAPKTLIITRTATNATTTTH-----------------KVIFSKAPE--- 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1290 fSVTSTSTVSATGFNVPfgkpltSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSpsSPEMVSSSTGQSSLFPPSAPTSQ 1369
Cdd:pfam05109  427 -STTTSPTLNTTGFAAP------NTTTGLPSSTHVPTNLTAPASTGPTVSTADVT--SPTPAGTTSGASPVTPSPSPRDN 497
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1370 VSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDA-FQSPQV--STPSSAVPITEPVSEPKKPEAQS----SSILSTQST 1442
Cdd:pfam05109  498 GTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPnATSPTLgkTSPTSAVTTPTPNATSPTPAVTTptpnATIPTLGKT 577
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1443 VDSVANATKTQNEPLPVKSEISNP---------GTTVTPVSSSGFLSGFSSGT--QSSLASMAAPSFSWpgssQPQQLSS 1511
Cdd:pfam05109  578 SPTSAVTTPTPNATSPTVGETSPQanttnhtlgGTSSTPVVTSPPKNATSAVTtgQHNITSSSTSSMSL----RPSSISE 653
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1512 TPAPFPASSPTSASPFGEKKDivDTQEDEMDEEAPEASQTTELSmgsfggfglGSTPNPGAPKTNPFGGPfGNATTTTsN 1591
Cdd:pfam05109  654 TLSPSTSDNSTSHMPLLTSAH--PTGGENITQVTPASTSTHHVS---------TSSPAPRPGTTSQASGP-GNSSTST-K 720
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1592 PFNMTVPSGELFKPASfnfqnpQPSQPAgfGSFSVTPSQTpaqSGFGQPSQIGGGQQALG 1651
Cdd:pfam05109  721 PGEVNVTKGTPPKNAT------SPQAPS--GQKTAVPTVT---STGGKANSTTGGKHTTG 769
PPE COG5651
PPE-repeat protein [Function unknown];
1637-1731 3.39e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.35  E-value: 3.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1637 FGQPSQIGG--GQQALGSVLGSFGQSRQIGAGLPGATFGSPTGFGGSNPGSG---LPNAPASGGFAAAGSSATGGFAAMA 1711
Cdd:COG5651   167 FTQPPPTITnpGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGpigLNSGPGNTGFAGTGAAAGAAAAAAA 246
                          90       100
                  ....*....|....*....|
gi 334183349 1712 SAGRGFAGASSTPTGGFAAL 1731
Cdd:COG5651   247 AAAAAGAGASAALASLAATL 266
PHA03247 PHA03247
large tegument protein UL36; Provisional
1341-1725 3.60e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 3.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1341 PALSPSSPEMVSSSTGQSSLfPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITP-PDAFQSPQVSTPSSAVPI 1419
Cdd:PHA03247 2624 PDPPPPSPSPAANEPDPHPP-PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPrRRAARPTVGSLTSLADPP 2702
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1420 TEPVSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSeisnPGTTVTPVSSSGFLSGFSSGTQSSLASMAAPsfs 1499
Cdd:PHA03247 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAV----PAGPATPGGPARPARPPTTAGPPAPAPPAAP--- 2775
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1500 wpgSSQPQQLSSTPAPFPASSPTSASPFGEkkdivDTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPF- 1578
Cdd:PHA03247 2776 ---AAGPPRRLTRPAVASLSESRESLPSPW-----DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPp 2847
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1579 -----------GGPFGNATTTTSNPFNMTVPSgelfKPASFNFQNPQPSQPAgfGSFSVTPSQTPAQSGFGQPSQIGGGQ 1647
Cdd:PHA03247 2848 pslplggsvapGGDVRRRPPSRSPAAKPAAPA----RPPVRRLARPAVSRST--ESFALPPDQPERPPQPQAPPPPQPQP 2921
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 334183349 1648 QALGSVLGSFGQSRQigaGLPGATFGSPTGFGGSNPGSGLPNAPASGGFaAAGSSATGGFAAMASAGRGFAGASSTPT 1725
Cdd:PHA03247 2922 QPPPPPQPQPPPPPP---PRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
1273-1631 4.60e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 4.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1273 DPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPfgkpltsvkvdlnqAAPSTPSPSPGPTAGFTFNLPAL-----SPSS 1347
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASP--------------ALPAAPAPPAVPAGPATPGGPARparppTTAG 2765
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1348 PEMVSSSTGQSSLFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVPITEPVSEPK 1427
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1428 KPEaqsssilsTQSTVDSVANATKTQNEPlPVKSEISNPGTTVTPvsssgflsgfssgtqsSLASMAAPSFSWPGSSQPQ 1507
Cdd:PHA03247 2846 PPP--------SLPLGGSVAPGGDVRRRP-PSRSPAAKPAAPARP----------------PVRRLARPAVSRSTESFAL 2900
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1508 qlsstPAPFPASSPTSASPfgekkdivdtQEDEMDEEAPEASQTTelsmgsFGGFGLGSTPNPGAPKTNPfgGPFGNATT 1587
Cdd:PHA03247 2901 -----PPDQPERPPQPQAP----------PPPQPQPQPPPPPQPQ------PPPPPPPRPQPPLAPTTDP--AGAGEPSG 2957
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 334183349 1588 TTSNPFNMTVPSGELFKPaSFNFQNPQPSQPAgfgSFSVTPSQT 1631
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAVP-RFRVPQPAPSREA---PASSTPPLT 2997
PPE COG5651
PPE-repeat protein [Function unknown];
1570-1737 3.17e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.80  E-value: 3.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1570 PGAPKTNPFGGPFGNATTTTSNPFNMTVPSGELFKPASFNFQNPQPSQPAGFGSFSVTPSQTPAQSGFGQPSQIGGGqqA 1649
Cdd:COG5651   212 GLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGL--A 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1650 LGSVLGSFGQSRQIGAGLPGATFGSPTGFGGSNPGSGLPNAPASGGFAAAGSSATGGFAAMASAGRGFAGASSTPTGGFA 1729
Cdd:COG5651   290 GSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369

                  ....*...
gi 334183349 1730 AlasgsGG 1737
Cdd:COG5651   370 S-----AG 372
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
1580-1673 3.77e-03

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 38.37  E-value: 3.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1580 GPFGNATTTTSNPFNMTvPSGELFKPASFNFQNPQPSQPAGFGSFSVTPSQTPAQSGFGQPSQiGGGQQALGSVLGSFGQ 1659
Cdd:pfam13634    1 GLFGAATSTSGGLFGNT-STTAASGGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNT-TTQTATGGGLFGNNAA 78
                           90
                   ....*....|....
gi 334183349  1660 SRQIGAGlpGATFG 1673
Cdd:pfam13634   79 TTTSTTG--GGLFG 90
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
1237-1521 3.97e-03

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 3.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1237 DKFTFPAVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPvpasipisSAPVPQTFSVTSTSTVSATGFNVPFGKPLTSVKV 1316
Cdd:pfam17823  108 DGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAP--------RAAACRANASAAPRAAIAAASAPHAASPAPRTAA 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1317 DLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSS-STGQSSLFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSS 1395
Cdd:pfam17823  180 SSTTAASSTTAASSAPTTAASSAPATLTPARGISTAAtATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAA 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1396 TPPIT----------PPDAFQSPQVSTPSSAVPITePVSePKKPEAQSSSIlstQSTVDSVANATKtqNEPLPVKSEISN 1465
Cdd:pfam17823  260 AGTVAsaagtinmgdPHARRLSPAKHMPSDTMARN-PAA-PMGAQAQGPII---QVSTDQPVHNTA--GEPTPSPSNTTL 332
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 334183349  1466 PGTTVTPVSSSGFLSGFSSGTQSSLASmAAPSFSWPGSSQPQQLSSTPAPFPASSP 1521
Cdd:pfam17823  333 EPNTPKSVASTNLAVVTTTKAQAKEPS-ASPVPVLHTSMIPEVEATSPTTQPSPLL 387
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1039-1506 7.05e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 7.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1039 SERLRSANNTQDRSLLHvkDHASPVVSSNKGIMESFQQDTSEAQSTPFKTRPPMPqsnSPFTISPISASKPSFNWSgnkS 1118
Cdd:PHA03307   21 FPRPPATPGDAADDLLS--GSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPP---GPGTEAPANESRSTPTWS---L 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1119 SNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSvqrlstTSSGSDF 1198
Cdd:PHA03307   93 STLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP------AAVASDA 166
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1199 ESSkgfgaqfstmSSGAPASSFSSKSlfgfnsSSSIPGDKFTFPAVTAPLSGTPLDSTstlfTASSAPVSSSSQDPVPAS 1278
Cdd:PHA03307  167 ASS----------RQAALPLSSPEET------ARAPSSPPAEPPPSTPPAAASPRPPR----RSSPISASASSPAPAPGR 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1279 IPISSAPVPQTFSVTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQS 1358
Cdd:PHA03307  227 SAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSG 306
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349 1359 SLFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSAVPITEPVSEPKKPEAQSSSILS 1438
Cdd:PHA03307  307 PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 334183349 1439 TQSTVDSVANATKTQNEPLPVkseisnPGTTVTPVSSSGFLSGFSSGTQSSLASmaaPSFS-WPGSSQP 1506
Cdd:PHA03307  387 RRRARAAVAGRARRRDATGRF------PAGRPRPSPLDAGAASGAFYARYPLLT---PSGEpWPGSPPP 446
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1189-1367 8.97e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 40.81  E-value: 8.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1189 LSTTSSGSDFESSKGFGAQFSTmsSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTPL-DSTSTLFTASSapv 1267
Cdd:pfam15967    1 MSGFSFGGGPGSTATAGGGFSF--GAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGLFgQKPATGFTFGT--- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183349  1268 ssssqdpvPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGKPL-----------TSVKVDLNQAAPSTPSPSPGPTAGF 1336
Cdd:pfam15967   76 --------PASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAasatpfslpasSTSGGGLSLGSVLTSTAAQQGATGF 147
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 334183349  1337 TFNL---PALS-PSSPEMVSSSTGQS---SLFPPSAPT 1367
Cdd:pfam15967  148 TLNLggtPATTtAVSTGLSLGSTLTSlggSLFQNTNST 185
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
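
Each hit in the list above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters give an E-value threshold of 0.01. As a minimal sketch of how such lines could be read and checked against that threshold, the Python below parses one statistics line with a regular expression. The function names `parse_stats` and `passes_threshold` are hypothetical, and treating a hit as reportable when its E-value is at or below the threshold is an assumption, not documented CD-Search behavior.

```python
import re

# Pattern for the per-hit statistics line as it appears in this report.
# The field labels are copied from the report; the pattern is an assumption
# about its layout, not an official format specification.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the numeric fields of a statistics line, or None if absent."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit whose E-value is at or below the threshold."""
    return hit["e_value"] <= threshold
```

For example, `parse_stats` applied to the DUF5585 statistics line yields a dictionary whose `e_value` is 5.81e-07, which `passes_threshold` accepts at the default threshold of 0.01.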
