Conserved domains on  [gi|1958750089|ref|XP_038958034|]
protein transport protein Sec24B isoform X1 [Rattus norvegicus]

Protein Classification

SEC24 family transport protein (domain architecture ID 1001573)

SEC24 family transport protein is a component of the coat protein complex II (COPII), which promotes the formation of transport vesicles from the endoplasmic reticulum (ER).

List of domain hits

Name Accession Description Interval E-value
Name: COG5028 super family
Accession: cl34873
Description: Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion]
Interval: 448-1285
E-value: 2.43e-167


The actual alignment was detected with superfamily member COG5028:

Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 520.50  E-value: 2.43e-167
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  448 GSYPDMHS-------SSASSPVPDRAPEPNST----LVPTPTAAQPAKV--AKPFGYgYPTLQPAYQNAAapPTTAHPSG 514
Cdd:COG5028      7 GVYPQAQSqvhtgaaSSKKSARPHRAYANFSAgqmgMPPYTTPPLQQQSrrQIDQAA-TAMHNTGANNPA--PSVMSPAF 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  515 PAYSGYPQQYPGvhqlssglgGLSLQSSPQPES-LRPVNL--TQEKNI----LPATPIWAPVPNLSAELSKLNCSPDSFR 587
Cdd:COG5028     84 QSQQKFSSPYGG---------SMADGTAPKPTNpLVPVDLfeDQPPPIsdlfLPPPPIVPPLTTNFVGSEQSNCSPKYVR 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  588 CTLTSIPQTQALLNKAKLPLGLLLHPFRDLT----QLPVITSNTIVRCRSCRTYINPFVSFIDQ-RRWKCNLCYRVNDVP 662
Cdd:COG5028    155 STMYAIPETNDLLKKSKIPFGLVIRPFLELYpeedPVPLVEDGSIVRCRRCRSYINPFVQFIEQgRKWRCNICRSKNDVP 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  663 EEFlYNPLTRS--YGEPHKRPEVQNSTVEFIASSDYMLRPPQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPG-D 739
Cdd:COG5028    235 EGF-DNPSGPNdpRSDRYSRPELKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNfD 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  740 SRTRIGFMTFDSTIHFYNLQEGLSQpQMLIVSDIDDVFLPTP-DSLLVNLYESKELIKDLLNALPNMFINTRETHSALGP 818
Cdd:COG5028    314 PRTKIAIICFDSSLHFFKLSPDLDE-QMLIVSDLDEPFLPFPsGLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  819 ALQAAFKLMSPTGGRVSVFQTQLPSLGAGLLQSREDPNQRsstkvvhHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYS 898
Cdd:COG5028    393 ALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQLREDKESS-------LLSCKDSFYKEFAIECSKVGISVDLFLTSEDYI 465
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  899 DLASLACMSKYSAGCIFYYPSFHSThNPSQAEKLQKDLKRYLTRKIGFEAVMRIRCTKGLSMHTFHGNFFVRSTDLLSLA 978
Cdd:COG5028    466 DVATLSHLCRYTGGQTYFYPNFSAT-RPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFS 544
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  979 NINPDAGFAVQLSIEESLAdTSLVCFQTALLYTSSKGERRIRVHTLCLPVVSSLADVYAGVDVQAAICLLANMAVDRSVS 1058
Cdd:COG5028    545 TMPRDTSLLVEFSIDEKLM-TSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALN 623
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1059 SSLSDARDALVNAVVDPLSAYSSAVSSVPRSTLTA-PSSLKLLPLYVLALLKQKAFRTGtSTRLDDRVYAMCQMKSQPLV 1137
Cdd:COG5028    624 SSLKEARVLINKSMVDILKAYKKELVKSNTSTQLPlPANLKLLPLLMLALLKSSAFRSG-STPSDIRISALNRLTSLPLK 702
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1138 HLMKMIHPNLYRIDRLTDEGAIHVNDRVVPQPPLqKLSAEKLTREGAFLMDCGSVFYIWIGKGCDSNFIENVLGYPDFGS 1217
Cdd:COG5028    703 QLMRNIYPTLYALHDMPIEAGLPDEGLLVLPSPI-NATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSD 781
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958750089 1218 IPQKMTHLPELDTLPSERTRSFVTWLRDSRP-LSPVLHVVKD--ESPARTDFFQHLVEDRTEAALSYYEFL 1285
Cdd:COG5028    782 IPSGKFTLPPTGNEFNERVRNIIGELRSVNDdSTLPLVLVRGggDPSLRLWFFSTLVEDKTLNIPSYLDYL 852
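
The statistics line and the residue numbering of the COG5028 alignment above can be combined to see how much of the domain model the hit spans. The following Python sketch is illustrative only: `parse_stats` and `model_coverage` are hypothetical helper names, and the regular expression assumes the field order shown on this page.

```python
import re

# Statistics line copied from the COG5028 hit on this page.
STATS_LINE = (
    "Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  "
    "Bit Score: 520.50  E-value: 2.43e-167"
)

# Assumption: the four fields always appear in this order.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_stats(line):
    """Extract the numeric fields of a hit statistics line into a dict."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognized statistics line: {line!r}")
    return {
        "pssm_id": int(match.group(1)),
        "cd_length": int(match.group(2)),
        "bit_score": float(match.group(3)),
        "e_value": float(match.group(4)),
    }


def model_coverage(cdd_from, cdd_to, cd_length):
    """Fraction of the domain model spanned (1-based, inclusive positions)."""
    return (cdd_to - cdd_from + 1) / cd_length


stats = parse_stats(STATS_LINE)
# The Cdd:COG5028 rows of the alignment run from model position 7 to 852.
coverage = model_coverage(7, 852, stats["cd_length"])
print(f"{coverage:.1%} of the {stats['cd_length']}-residue model is spanned")
```

Under these assumptions the alignment spans 846 of the 861 model positions, so the hit covers nearly the entire COG5028 model rather than a fragment of it.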
Name: PHA03247 super family
Accession: cl33720
Description: large tegument protein UL36; Provisional
Interval: 2-483
E-value: 6.54e-05


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 6.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089    2 SAPAGSPHPAAGARMPPKLGGAVSGLAPPQQNGPAQSQmqvpsgyglPHQNYMAPSGHYSQGPGKMTSLPLDNQCENYYS 81
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP---------PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089   82 RPY-TAPTQNVGTPSSANQPGAQlmyGRGPSAPHMGASMPGPFQGAPASASHSYPSASQPYSSlgnrYSSPATYSATASV 160
Cdd:PHA03247  2682 RPRrRAARPTVGSLTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP----PAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  161 ASQGYPSTCSHyPISTVSNVVYPNVSYPSLPASEPYGQMFTSQSAPPPARPLKESYSGPSTAVAYPSRPPPPPSQHQQQQ 240
Cdd:PHA03247  2755 ARPARPPTTAG-PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  241 QQQQQQQQQSHSGYSSLPWSGpGLPPAQDSLIRNqmgslaiPNSHPAINVADSLSCPITENVQPPKPSSVVATVLPGPSS 320
Cdd:PHA03247  2834 AQPTAPPPPPGPPPPSLPLGG-SVAPGGDVRRRP-------PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  321 TRMPPAPSHPVGPVPSAPPPPEQMQTKVR---------FPVPSHAGCGKWRPVNRDPGLWGTYIGRM---RERPPekfan 388
Cdd:PHA03247  2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPppprpqpplAPTTDPAGAGEPSGAVPQPWLGALVPGRVavpRFRVP----- 2980
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  389 kgmQYGDYVNNQASSTPTPLSSASDDEEEEDEDEEAGVDSSSTTSSASPLPNSYDALEGGS-----YPDMHSSSASSPVP 463
Cdd:PHA03247  2981 ---QPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDadslfDSDSERSDLEALDP 3057
                          490       500
                   ....*....|....*....|
gi 1958750089  464 DrAPEPNSTLVPTPTAAQPA 483
Cdd:PHA03247  3058 L-PPEPHDPFAHEPDPATPE 3076
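
The two superfamily hits above are reported on intervals 448-1285 (COG5028) and 2-483 (PHA03247) of the query, which overlap slightly. A minimal Python sketch of that interval arithmetic follows; `overlap_length` is a hypothetical helper, not part of any NCBI tool.

```python
def overlap_length(a, b):
    """Number of residues shared by two 1-based, inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    # A negative span means the intervals do not touch at all.
    return max(0, end - start + 1)


# Query intervals copied from the hit list on this page.
cog5028 = (448, 1285)
pha03247 = (2, 483)

shared = overlap_length(cog5028, pha03247)
print(f"The two hits share {shared} query residues")
```

Here the shared region is residues 448-483 of the query, so the weaker PHA03247 hit lies almost entirely N-terminal to the COG5028 hit.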
 
Name: Sec24-like
Accession: cd01479
Description: Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of a coat built from several cytosolic components: the GTPase Sar1 and the Sec23-Sec24 and Sec13-Sec31 complexes. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step, leading to membrane deformation and budding of COPII-coated vesicles, is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region and a carboxy-terminal gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
Interval: 701-944
E-value: 1.36e-130


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 400.50  E-value: 1.36e-130
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  701 PQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPGD-SRTRIGFMTFDSTIHFYNLQEGLSQPQMLIVSDIDDVFLP 779
Cdd:cd01479      1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDdPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  780 TPDSLLVNLYESKELIKDLLNALPNMFINTRETHSALGPALQAAFKLMSPTGGRVSVFQTQLPSLGAGLLQSREDPNQRS 859
Cdd:cd01479     81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPKLLS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  860 STKVVHHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHsTHNPSQAEKLQKDLKRY 939
Cdd:cd01479    161 TDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFN-FSAPNDVEKLVNELARY 239

                   ....*
gi 1958750089  940 LTRKI 944
Cdd:cd01479    240 LTRKI 244
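
The paired rows of an alignment block can be compared column by column to estimate sequence identity. The Python sketch below is a rough illustration; `aligned_identity` is a hypothetical helper, and it assumes that `-` marks a gap and that lowercase letters mark unaligned residues, both of which are skipped.

```python
def aligned_identity(query, subject):
    """Fraction of identical residues among aligned (uppercase) columns."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        # Skip gap characters and lowercase (assumed unaligned) residues.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns to compare")
    return identical / aligned


# First ten columns of the cd01479 alignment on this page
# (query residues 701-710 against model positions 1-10).
query_row = "PQPAVYLFVL"
model_row = "PQPAVYVFLI"
print(f"{aligned_identity(query_row, model_row):.0%} identical")
```

The same function can be applied to each full pair of `gi` and `Cdd` rows; summing the identical and aligned counts over all blocks, rather than averaging per-block fractions, would give the identity over the whole hit.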
Name: Sec23_trunk
Accession: pfam04811
Description: Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
Interval: 701-940
E-value: 6.40e-107


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 337.30  E-value: 6.40e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  701 PQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPGDSRTRIGFMTFDSTIHFYNLQEGLSQPQMLIVSDIDDVFLPT 780
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQDMFLPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  781 PDSLLVNLYESKELIKDLLNALPNMFINTRETHSALGPALQAAFKLM--SPTGGRVSVFQTQLPSLGA-GLLQSREDPNQ 857
Cdd:pfam04811   81 PDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLkaAFTGGKIMVFQGGLPTVGPgGKLKSRLDESH 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  858 RSSTKVVHHLGPATD-FYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHSTHnpsQAEKLQKDL 936
Cdd:pfam04811  161 HGTDKEKAKLVKKADkFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADV---DGSKFKQDL 237

                   ....
gi 1958750089  937 KRYL 940
Cdd:pfam04811  238 QRYF 241
Name: PTZ00395
Accession: PTZ00395
Description: Sec24-related protein; Provisional
Interval: 668-1294
E-value: 9.97e-34


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 142.14  E-value: 9.97e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  668 NPLTRSYGEPHKRPEVQNStvefiassdYMLRPPQ-----PAVYLFVLDVSHNAVEAGYLTVLCQSL---LENLdKLPgd 739
Cdd:PTZ00395   921 NLICEKNGEPDSAKIRRNS---------FLAKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEGIryaVQNV-KCP-- 988
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  740 sRTRIGFMTFDSTIHFYNLQEGLSQP-------------QMLIVSDIDDVFLPTP-DSLLVNLYESKELIKDLLNALPNM 805
Cdd:PTZ00395   989 -QTKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLPlEDLFFGCVEEIDKINTLIDTIKSV 1067
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  806 FINTRETHSALGPALQAAFKLMSPTGG--RVSVFQTQLPSLGAGLLQSREDPNQRSSTKVVHHLgpatdFYKKLALDCSG 883
Cdd:PTZ00395  1068 STTMQSYGSCGNSALKIAMDMLKERNGlgSICMFYTTTPNCGIGAIKELKKDLQENFLEVKQKI-----FYDSLLLDLYA 1142
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  884 QQTAVDLFLLSSQYSDLA--SLACMSKYSAGCIFYYPSFhsthnpsqaeKLQKDLKR-YL-------TRKIGFEAVMRIR 953
Cdd:PTZ00395  1143 FNISVDIFIISSNNVRVCvpSLQYVAQNTGGKILFVENF----------LWQKDYKEiYMnimdtltSEDIAYCCELKLR 1212
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  954 CTKGLSMHTF---HGNF-FVRSTDLLSLANINPDAGFAVQLSIEESLADTSLVCFQTALLYTSSKGERRIRVHTLCLPVV 1029
Cdd:PTZ00395  1213 YSHHMSVKKLfccNNNFnSIISVDTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLT 1292
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1030 SSLADVYAGVDVQAaiclLANMAVDRSVSSSLSDarDALVNAVVDPLSAYSSAVSSVPRST-----LTAPSSLKLLPLYV 1104
Cdd:PTZ00395  1293 SSLSTVFRYTDAEA----LMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSahsgqLILPDTLKLLPLFT 1366
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1105 LALLKQKAfrTGTSTRLDDRVYAMCQMKSQPLVHLMKMIHPNLY--RIDRLTDE-GAIHVNDRVVpQPPLQKLSAEKLTR 1181
Cdd:PTZ00395  1367 SSLLKHNV--TKKEILHDLKVYSLIKLLSMPIISSLLYVYPVMYviHIKGKTNEiDSMDVDDDLF-IPKTIPSSAEKIYS 1443
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1182 EGAFLMDCGSVFYIWIGKGCDSNFIENVLgypdfGSIP-QKMTHLPELDTLPS----ERTRSFVTWLRDSRPLSPVLhVV 1256
Cdd:PTZ00395  1444 NGIYLLDACTHFYLYFGFHSDANFAKEIV-----GDIPtEKNAHELNLTDTPNaqkvQRIIKNLSRIHHFNKYVPLV-MV 1517
                          650       660       670
                   ....*....|....*....|....*....|....*...
gi 1958750089 1257 KDESPARTDFFQHLVEDRTEAALSYYEFLIHVQQQVCK 1294
Cdd:PTZ00395  1518 APKSNEEEHLISLCVEDKADKEYSYVNFLCFIHKLVHK 1555
Name: Atrophin-1
Accession: pfam03154
Description: Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Interval: 110-495
E-value: 8.75e-03


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 8.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  110 PSAPHMGASMPGPFQGAPASASHSYPSASQPYSSLGNRYSSPATYSATASVASQGYPStcSHYPISTVSNVVYP-NVSYP 188
Cdd:pfam03154  187 PPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPS--PHPPLQPMTQPPPPsQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  189 SLPASEPYGQMftsqsaPPPARPLKesySGPSTaVAYPSRPPPPPSQHQQQQQQQQQQQQQSHSGYSSlpwSGPGLPPAQ 268
Cdd:pfam03154  265 PLPQPSLHGQM------PPMPHSLQ---TGPSH-MQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ---QRIHTPPSQ 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  269 DSLIRNQ-------------MGSLAIPNSHPAINVADSLSCPITENVQPPKPSSVVATVLPGPS-------STRMPPA-- 326
Cdd:pfam03154  332 SQLQSQQppreqplppaplsMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslSTHHPPSah 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  327 PSHPVGPVPSAPPPPEQMQTKVRFPVPSHAGCGKWRPVNrdpglwgtyiGRMRERPPEK-FANKGMQYGDYVNNQASSTP 405
Cdd:pfam03154  412 PPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPT----------SGLHQVPSQSpFPQHPFVPGGPPPITPPSGP 481
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  406 TPLSSASDDEEEEDEDEEAGVDSSSTTSSASPLPNSYDALEGgsyPDMHSSSASSPVPDRAPEPNSTLVPTPT-AAQPAK 484
Cdd:pfam03154  482 PTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEA---LDEAEEPESPPPPPRSPSPEPTVVNTPShASQSAR 558
                          410
                   ....*....|.
gi 1958750089  485 VAKPFGYGYPT 495
Cdd:pfam03154  559 FYKHLDRGYNS 569
Name: GEL
Accession: smart00262
Description: Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.
Interval: 1160-1201
E-value: 9.43e-03


Pssm-ID: 214590 [Multi-domain]  Cd Length: 90  Bit Score: 36.89  E-value: 9.43e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 1958750089  1160 HVNDRVVPQPPLQKLSAEKLTREGAFLMDCGSVFYIWIGKGC 1201
Cdd:smart00262    4 RVKGKRNVRVPEVPFSQGSLNSGDCYILDTGSEIYVWVGKKS 45
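
The hits listed on this page range from very strong (E-value 2.43e-167) to marginal (E-values near 1e-02). The Python sketch below shows one way to separate them with an E-value cutoff. The accessions, intervals, and E-values are copied from the page, while `significant_hits` and the `1e-5` cutoff are hypothetical choices made for illustration only.

```python
# (accession, query interval, E-value) for each distinct hit on this page.
HITS = [
    ("COG5028", "448-1285", 2.43e-167),
    ("cd01479", "701-944", 1.36e-130),
    ("pfam04811", "701-940", 6.40e-107),
    ("PTZ00395", "668-1294", 9.97e-34),
    ("PHA03247", "2-483", 6.54e-05),
    ("pfam03154", "110-495", 8.75e-03),
    ("smart00262", "1160-1201", 9.43e-03),
]


def significant_hits(hits, max_evalue=1e-5):
    """Return hits at or below the cutoff, strongest (lowest E-value) first."""
    kept = [hit for hit in hits if hit[2] <= max_evalue]
    return sorted(kept, key=lambda hit: hit[2])


for accession, interval, evalue in significant_hits(HITS):
    print(f"{accession:<12}{interval:<12}{evalue:.2e}")
```

With this illustrative cutoff, the four Sec24-related models are retained and the PHA03247, Atrophin-1, and GEL hits are set aside as weaker matches that need closer inspection.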
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
448-1285 2.43e-167

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 520.50  E-value: 2.43e-167
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  448 GSYPDMHS-------SSASSPVPDRAPEPNST----LVPTPTAAQPAKV--AKPFGYgYPTLQPAYQNAAapPTTAHPSG 514
Cdd:COG5028      7 GVYPQAQSqvhtgaaSSKKSARPHRAYANFSAgqmgMPPYTTPPLQQQSrrQIDQAA-TAMHNTGANNPA--PSVMSPAF 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  515 PAYSGYPQQYPGvhqlssglgGLSLQSSPQPES-LRPVNL--TQEKNI----LPATPIWAPVPNLSAELSKLNCSPDSFR 587
Cdd:COG5028     84 QSQQKFSSPYGG---------SMADGTAPKPTNpLVPVDLfeDQPPPIsdlfLPPPPIVPPLTTNFVGSEQSNCSPKYVR 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  588 CTLTSIPQTQALLNKAKLPLGLLLHPFRDLT----QLPVITSNTIVRCRSCRTYINPFVSFIDQ-RRWKCNLCYRVNDVP 662
Cdd:COG5028    155 STMYAIPETNDLLKKSKIPFGLVIRPFLELYpeedPVPLVEDGSIVRCRRCRSYINPFVQFIEQgRKWRCNICRSKNDVP 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  663 EEFlYNPLTRS--YGEPHKRPEVQNSTVEFIASSDYMLRPPQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPG-D 739
Cdd:COG5028    235 EGF-DNPSGPNdpRSDRYSRPELKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNfD 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  740 SRTRIGFMTFDSTIHFYNLQEGLSQpQMLIVSDIDDVFLPTP-DSLLVNLYESKELIKDLLNALPNMFINTRETHSALGP 818
Cdd:COG5028    314 PRTKIAIICFDSSLHFFKLSPDLDE-QMLIVSDLDEPFLPFPsGLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  819 ALQAAFKLMSPTGGRVSVFQTQLPSLGAGLLQSREDPNQRsstkvvhHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYS 898
Cdd:COG5028    393 ALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQLREDKESS-------LLSCKDSFYKEFAIECSKVGISVDLFLTSEDYI 465
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  899 DLASLACMSKYSAGCIFYYPSFHSThNPSQAEKLQKDLKRYLTRKIGFEAVMRIRCTKGLSMHTFHGNFFVRSTDLLSLA 978
Cdd:COG5028    466 DVATLSHLCRYTGGQTYFYPNFSAT-RPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFS 544
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  979 NINPDAGFAVQLSIEESLAdTSLVCFQTALLYTSSKGERRIRVHTLCLPVVSSLADVYAGVDVQAAICLLANMAVDRSVS 1058
Cdd:COG5028    545 TMPRDTSLLVEFSIDEKLM-TSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALN 623
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1059 SSLSDARDALVNAVVDPLSAYSSAVSSVPRSTLTA-PSSLKLLPLYVLALLKQKAFRTGtSTRLDDRVYAMCQMKSQPLV 1137
Cdd:COG5028    624 SSLKEARVLINKSMVDILKAYKKELVKSNTSTQLPlPANLKLLPLLMLALLKSSAFRSG-STPSDIRISALNRLTSLPLK 702
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1138 HLMKMIHPNLYRIDRLTDEGAIHVNDRVVPQPPLqKLSAEKLTREGAFLMDCGSVFYIWIGKGCDSNFIENVLGYPDFGS 1217
Cdd:COG5028    703 QLMRNIYPTLYALHDMPIEAGLPDEGLLVLPSPI-NATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSD 781
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958750089 1218 IPQKMTHLPELDTLPSERTRSFVTWLRDSRP-LSPVLHVVKD--ESPARTDFFQHLVEDRTEAALSYYEFL 1285
Cdd:COG5028    782 IPSGKFTLPPTGNEFNERVRNIIGELRSVNDdSTLPLVLVRGggDPSLRLWFFSTLVEDKTLNIPSYLDYL 852
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the ...
701-944 1.36e-130

Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat, which is built from several cytosolic components: the GTPase Sar1 and the Sec23-Sec24 and Sec13-Sec31 complexes. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step, leading to membrane deformation and budding of COPII-coated vesicles, is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region, and a carboxy-terminal Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 400.50  E-value: 1.36e-130
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  701 PQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPGD-SRTRIGFMTFDSTIHFYNLQEGLSQPQMLIVSDIDDVFLP 779
Cdd:cd01479      1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDdPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  780 TPDSLLVNLYESKELIKDLLNALPNMFINTRETHSALGPALQAAFKLMSPTGGRVSVFQTQLPSLGAGLLQSREDPNQRS 859
Cdd:cd01479     81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPKLLS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  860 STKVVHHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHsTHNPSQAEKLQKDLKRY 939
Cdd:cd01479    161 TDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFN-FSAPNDVEKLVNELARY 239

                   ....*
gi 1958750089  940 LTRKI 944
Cdd:cd01479    240 LTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
701-940 6.40e-107

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 337.30  E-value: 6.40e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  701 PQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPGDSRTRIGFMTFDSTIHFYNLQEGLSQPQMLIVSDIDDVFLPT 780
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQDMFLPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  781 PDSLLVNLYESKELIKDLLNALPNMFINTRETHSALGPALQAAFKLM--SPTGGRVSVFQTQLPSLGA-GLLQSREDPNQ 857
Cdd:pfam04811   81 PDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLkaAFTGGKIMVFQGGLPTVGPgGKLKSRLDESH 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  858 RSSTKVVHHLGPATD-FYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHSTHnpsQAEKLQKDL 936
Cdd:pfam04811  161 HGTDKEKAKLVKKADkFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADV---DGSKFKQDL 237

                   ....
gi 1958750089  937 KRYL 940
Cdd:pfam04811  238 QRYF 241
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
701-938 9.54e-104

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 328.44  E-value: 9.54e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  701 PQPAVYLFVLDVSHNAVEAGYLTVLCQSLLENLDKLPGDSRTRIGFMTFDSTIHFYNLQEGLSQPQMLIVSDIDDVFLPT 780
Cdd:cd01468      1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKDVFLPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  781 PDSLLVNLYESKELIKDLLNALPNMF--INTRETHSALGPALQAAFKLMSPT--GGRVSVFQTQLPSLGAGLLQSREDPN 856
Cdd:cd01468     81 PDRFLVPLSECKKVIHDLLEQLPPMFwpVPTHRPERCLGPALQAAFLLLKGTfaGGRIIVFQGGLPTVGPGKLKSREDKE 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  857 QRSSTKVVHHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHSthnPSQAEKLQKDL 936
Cdd:cd01468    161 PIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQA---PNDGSKFKQDL 237

                   ..
gi 1958750089  937 KR 938
Cdd:cd01468    238 QR 239
PTZ00395 PTZ00395
Sec24-related protein; Provisional
668-1294 9.97e-34

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 142.14  E-value: 9.97e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  668 NPLTRSYGEPHKRPEVQNStvefiassdYMLRPPQ-----PAVYLFVLDVSHNAVEAGYLTVLCQSL---LENLdKLPgd 739
Cdd:PTZ00395   921 NLICEKNGEPDSAKIRRNS---------FLAKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEGIryaVQNV-KCP-- 988
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  740 sRTRIGFMTFDSTIHFYNLQEGLSQP-------------QMLIVSDIDDVFLPTP-DSLLVNLYESKELIKDLLNALPNM 805
Cdd:PTZ00395   989 -QTKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLPlEDLFFGCVEEIDKINTLIDTIKSV 1067
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  806 FINTRETHSALGPALQAAFKLMSPTGG--RVSVFQTQLPSLGAGLLQSREDPNQRSSTKVVHHLgpatdFYKKLALDCSG 883
Cdd:PTZ00395  1068 STTMQSYGSCGNSALKIAMDMLKERNGlgSICMFYTTTPNCGIGAIKELKKDLQENFLEVKQKI-----FYDSLLLDLYA 1142
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  884 QQTAVDLFLLSSQYSDLA--SLACMSKYSAGCIFYYPSFhsthnpsqaeKLQKDLKR-YL-------TRKIGFEAVMRIR 953
Cdd:PTZ00395  1143 FNISVDIFIISSNNVRVCvpSLQYVAQNTGGKILFVENF----------LWQKDYKEiYMnimdtltSEDIAYCCELKLR 1212
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  954 CTKGLSMHTF---HGNF-FVRSTDLLSLANINPDAGFAVQLSIEESLADTSLVCFQTALLYTSSKGERRIRVHTLCLPVV 1029
Cdd:PTZ00395  1213 YSHHMSVKKLfccNNNFnSIISVDTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLT 1292
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1030 SSLADVYAGVDVQAaiclLANMAVDRSVSSSLSDarDALVNAVVDPLSAYSSAVSSVPRST-----LTAPSSLKLLPLYV 1104
Cdd:PTZ00395  1293 SSLSTVFRYTDAEA----LMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSahsgqLILPDTLKLLPLFT 1366
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1105 LALLKQKAfrTGTSTRLDDRVYAMCQMKSQPLVHLMKMIHPNLY--RIDRLTDE-GAIHVNDRVVpQPPLQKLSAEKLTR 1181
Cdd:PTZ00395  1367 SSLLKHNV--TKKEILHDLKVYSLIKLLSMPIISSLLYVYPVMYviHIKGKTNEiDSMDVDDDLF-IPKTIPSSAEKIYS 1443
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1182 EGAFLMDCGSVFYIWIGKGCDSNFIENVLgypdfGSIP-QKMTHLPELDTLPS----ERTRSFVTWLRDSRPLSPVLhVV 1256
Cdd:PTZ00395  1444 NGIYLLDACTHFYLYFGFHSDANFAKEIV-----GDIPtEKNAHELNLTDTPNaqkvQRIIKNLSRIHHFNKYVPLV-MV 1517
                          650       660       670
                   ....*....|....*....|....*....|....*...
gi 1958750089 1257 KDESPARTDFFQHLVEDRTEAALSYYEFLIHVQQQVCK 1294
Cdd:PTZ00395  1518 APKSNEEEHLISLCVEDKADKEYSYVNFLCFIHKLVHK 1555
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
945-1029 3.00e-33

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 123.42  E-value: 3.00e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  945 GFEAVMRIRCTKGLSMHTFHGNFFVRS-TDLLSLANINPDAGFAVQLSIEESLADTSLVCFQTALLYTSSKGERRIRVHT 1023
Cdd:pfam08033    1 GFNAVLRVRTSKGLKVSGFIGNFVSRSsGDTWKLPSLDPDTSYAFEFDIDEPLPNGSNAYIQFALLYTHSSGERRIRVTT 80

                   ....*.
gi 1958750089 1024 LCLPVV 1029
Cdd:pfam08033   81 VALPVT 86
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
584-1063 2.11e-22

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 104.19  E-value: 2.11e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  584 DSFRCTLTSIPQTQALLNKAKLPLGLLLHPFRDLTQLPVITSNTIVRCRSCRTYINPFVSfIDQRR--WKCNLCYRVNDV 661
Cdd:COG5047     10 DGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTAPCKAVLNPYCH-IDERNqsWICPFCNQRNTL 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  662 PEEflYNPLTRSYGEPhkRPEVQNSTVEFIASsdymlRPPQ-PAVYLFVLDVshnAVEAGYLTVLCQSLLENLDKLPGDS 740
Cdd:COG5047     89 PPQ--YRDISNANLPL--ELLPQSSTIEYTLS-----KPVIlPPVFFFVVDA---CCDEEELTALKDSLIVSLSLLPPEA 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  741 RtrIGFMTFDSTIHF----------------------YNLQE--GLSQPQMLIV--SDIDDVFLPTPDSLLVNLYESKEL 794
Cdd:COG5047    157 L--VGLITYGTSIQVhelnaenhrrsyvfsgnkeytkENLQEllALSKPTKSGGfeSKISGIGQFASSRFLLPTQQCEFK 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  795 IKDLLNAL-PNMF--INTRETHSALGPALQAAFKLMSPT----GGRVSVFQTQLPSLGAGLLQSRE--DP---NQRSSTK 862
Cdd:COG5047    235 LLNILEQLqPDPWpvPAGKRPLRCTGSALNIASSLLEQCfpnaGCHIVLFAGGPCTVGPGTVVSTElkEPmrsHHDIESD 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  863 VVHHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHSTHNPSQAEK-LQKDLKRYLt 941
Cdd:COG5047    315 SAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIFKQSFQRiFNRDSEGYL- 393
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  942 rKIGFEAVMRIRCTKGLSMHTFHGN---------------FFVRSTDLLSLANINPDAGFAVQLSIEESLADTS------ 1000
Cdd:COG5047    394 -KMGFNANMEVKTSKNLKIKGLIGHavsvkkkannisdseIGIGATNSWKMASLSPKSNYALYFEIALGAASGSaqrpae 472
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958750089 1001 -LVCFQTalLYTSSKGERRIRVHTLCLPVVSSLADV-YAGVDVQAAICLLANMAVDRSVSSSLSD 1063
Cdd:COG5047    473 aYIQFIT--TYQHSSGTYRIRVTTVARMFTDGGLPKiNRSFDQEAAAVFMARIAAFKAETEDIID 535
Sec23_helical pfam04815
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ...
1040-1141 1.82e-18

Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.


Pssm-ID: 461441 [Multi-domain]  Cd Length: 103  Bit Score: 81.78  E-value: 1.82e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089 1040 DVQAAICLLANMAVDRSVSSSLSDARDALVNAVVDPLSA-YSSAVSSVPRSTLTAPSSLKLLPLYVLALLKQKAFRTGTS 1118
Cdd:pfam04815    1 DQEAIAVLLAKKAVEKALSSSLSDAREALDNKLVDILAAyRKYCASSSSPGQLILPESLKLLPLYMLALLKSPALRGGNS 80
                           90       100
                   ....*....|....*....|...
gi 1958750089 1119 TRLDDRVYAMCQMKSQPLVHLMK 1141
Cdd:pfam04815   81 SPSDERAYARHLLLSLPVEELLL 103
zf-Sec23_Sec24 pfam04810
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
628-664 1.12e-15

Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is a zinc-binding domain.


Pssm-ID: 461437 [Multi-domain]  Cd Length: 38  Bit Score: 71.71  E-value: 1.12e-15
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1958750089  628 IVRCRSCRTYINPFVSFIDQ-RRWKCNLCYRVNDVPEE 664
Cdd:pfam04810    1 PVRCRRCRAYLNPFCQFDFGgKKWTCNFCGTRNPVPPE 38
PLN00162 PLN00162
transport protein sec23; Provisional
584-1068 4.57e-14

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 77.29  E-value: 4.57e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  584 DSFRCTLTSIPQTQALLNKAKLPLGLLLHPFRDLTQLPVITSNTIvRCRSCRTYINPF--VSFiDQRRWKCNLCYRVNDV 661
Cdd:PLN00162    10 DGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPL-RCRTCRAVLNPYcrVDF-QAKIWICPFCFQRNHF 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  662 PeeflynpltRSY---GEPHKRPEV--QNSTVEFiASSDYMLRPPQPAVYLFVLDVSHNAVEAGYLTvlcQSLLENLDKL 736
Cdd:PLN00162    88 P---------PHYssiSETNLPAELfpQYTTVEY-TLPPGSGGAPSPPVFVFVVDTCMIEEELGALK---SALLQAIALL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  737 PGDSrtRIGFMTFDSTIHFYNL------------------------QEGLS----QPQMLIVSDIDDVFLPTP-DSLLVN 787
Cdd:PLN00162   155 PENA--LVGLITFGTHVHVHELgfsecsksyvfrgnkevskdqileQLGLGgkkrRPAGGGIAGARDGLSSSGvNRFLLP 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  788 LYESKELIKDLLNALPNMFINTRETHSAL---GPALQAAFKLM---SP-TGGRVSVFqTQLPS-LGAGLLQSREDPNQRS 859
Cdd:PLN00162   233 ASECEFTLNSALEELQKDPWPVPPGHRPArctGAALSVAAGLLgacVPgTGARIMAF-VGGPCtEGPGAIVSKDLSEPIR 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  860 STK-----VVHHLGPATDFYKKLALDCSGQQTAVDLFLLSSQYSDLASLACMSKYSAGCIFYYPSFHSthnpsqaEKLQK 934
Cdd:PLN00162   312 SHKdldkdAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESFGH-------SVFKD 384
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  935 DLKRYLTR------KIGFEAVMRIRCTKGL----------SMH----------TFHGNffvrsTDLLSLANINPDAGFAV 988
Cdd:PLN00162   385 SLRRVFERdgegslGLSFNGTFEVNCSKDVkvqgaigpcaSLEkkgpsvsdteIGEGG-----TTAWKLCGLDKKTSLAV 459
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  989 --QLSIEESLADTSL---VCFQTALLYTSSKGERRIRVHTLCLPVV--SSLADVYAGVDVQAAICLLANMAVDRSVSssl 1061
Cdd:PLN00162   460 ffEVANSGQSNPQPPgqqFFLQFLTRYQHSNGQTRLRVTTVTRRWVegSSSEELVAGFDQEAAAVVMARLASHKMET--- 536

                   ....*..
gi 1958750089 1062 SDARDAL 1068
Cdd:PLN00162   537 EEEFDAT 543
Gelsolin pfam00626
Gelsolin repeat;
1165-1240 1.26e-09

Gelsolin repeat;


Pssm-ID: 395501 [Multi-domain]  Cd Length: 76  Bit Score: 55.78  E-value: 1.26e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958750089 1165 VVPQPPLQKLSAEKLTREGAFLMDCGSVFYIWIGKGcdSNFIENVLGYPDFGSIP-QKMTHLPELDTLP-SERTRSFV 1240
Cdd:pfam00626    1 KFVLPPPVPLSQESLNSGDCYLLDNGFTIFLWVGKG--SSLLEKLFAALLAAQLDdDERFPLPEVIRVPqGKEPARFL 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-483 6.54e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 6.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089    2 SAPAGSPHPAAGARMPPKLGGAVSGLAPPQQNGPAQSQmqvpsgyglPHQNYMAPSGHYSQGPGKMTSLPLDNQCENYYS 81
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP---------PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089   82 RPY-TAPTQNVGTPSSANQPGAQlmyGRGPSAPHMGASMPGPFQGAPASASHSYPSASQPYSSlgnrYSSPATYSATASV 160
Cdd:PHA03247  2682 RPRrRAARPTVGSLTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP----PAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  161 ASQGYPSTCSHyPISTVSNVVYPNVSYPSLPASEPYGQMFTSQSAPPPARPLKESYSGPSTAVAYPSRPPPPPSQHQQQQ 240
Cdd:PHA03247  2755 ARPARPPTTAG-PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  241 QQQQQQQQQSHSGYSSLPWSGpGLPPAQDSLIRNqmgslaiPNSHPAINVADSLSCPITENVQPPKPSSVVATVLPGPSS 320
Cdd:PHA03247  2834 AQPTAPPPPPGPPPPSLPLGG-SVAPGGDVRRRP-------PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  321 TRMPPAPSHPVGPVPSAPPPPEQMQTKVR---------FPVPSHAGCGKWRPVNRDPGLWGTYIGRM---RERPPekfan 388
Cdd:PHA03247  2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPppprpqpplAPTTDPAGAGEPSGAVPQPWLGALVPGRVavpRFRVP----- 2980
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  389 kgmQYGDYVNNQASSTPTPLSSASDDEEEEDEDEEAGVDSSSTTSSASPLPNSYDALEGGS-----YPDMHSSSASSPVP 463
Cdd:PHA03247  2981 ---QPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDadslfDSDSERSDLEALDP 3057
                          490       500
                   ....*....|....*....|
gi 1958750089  464 DrAPEPNSTLVPTPTAAQPA 483
Cdd:PHA03247  3058 L-PPEPHDPFAHEPDPATPE 3076
PHA03247 PHA03247
large tegument protein UL36; Provisional
121-570 3.36e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  121 GPFQGAPASASHSYPSASQPYSSLGNRYSSPAtysATASVASQGYPstcshyPISTVSNV-VYPNVSYPSLPASEPYGQM 199
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPA---VTSRARRPDAP------PQSARPRApVDDRGDPRGPAPPSPLPPD 2620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  200 FTSQSAPPPA-RPLKESYSGPSTAVAYPSRPPPPPSQHQQQQQQQQQQQQQSHSGYSSLP--WSGPGLPPAQDSLirnqm 276
Cdd:PHA03247  2621 THAPDPPPPSpSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRAARPTVGSL----- 2695
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  277 GSLAIPNSHPainvadslscPITENVQPPKPSSVVATVLPGPSSTRMPPAPSHpvgpvpsappppeqmqtkvrfPVPSHA 356
Cdd:PHA03247  2696 TSLADPPPPP----------PTPEPAPHALVSATPLPPGPAAARQASPALPAA---------------------PAPPAV 2744
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  357 GCGKWRPVNRDPglwgtyigrmRERPPekfankgmqygdyvnnqasSTPTPLSSASDDEEEededeeAGVDSSSTTSSAS 436
Cdd:PHA03247  2745 PAGPATPGGPAR----------PARPP-------------------TTAGPPAPAPPAAPA------AGPPRRLTRPAVA 2789
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  437 PLPNSYDALEGGSYPDMHSSSASSPVPDRAPEPN-STLVPTPTAAQPAKVAKPFGYGYPTLQPAYQNA------------ 503
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrppsr 2869
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958750089  504 AAPPTTAHPSGPAYSGYPQQYPGVHQLSSGLGGLSLQSSPQPESLRPVnLTQEKNILPATPIWAPVP 570
Cdd:PHA03247  2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPP-QPQPQPPPPPQPQPPPPP 2935
PHA03247 PHA03247
large tegument protein UL36; Provisional
448-608 5.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 5.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  448 GSYPDMHSSSASSPVPDRAPEPNSTLVPTPTAAQPAKVAKPfgygYPTLQPAyqNAAAPPTTAHPSGPAYSGYPQQYPGv 527
Cdd:PHA03247  2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP----ALPAAPA--PPAVPAGPATPGGPARPARPPTTAG- 2765
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  528 hqLSSGLGGLSLQSSPQPESLRPVNLTQEKNILPATPIWAPVPNLSAELSKLNCSPDSFRCTLTSIPQTQALLNKAKLPL 607
Cdd:PHA03247  2766 --PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843

                   .
gi 1958750089  608 G 608
Cdd:PHA03247  2844 G 2844
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3-211 7.00e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 7.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089    3 APAGSPHPAAGARMPPKLGGAVSGLAPPQQNGPAQSQMQVPSGyglPHQNYMAPSGHYSQGPGKMTSLPLDNQCENYYSR 82
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG---AAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089   83 PYTAPTQNVGTPSSANQPGAQLMYGRGPSAPHMGASMPGPFQGAPASASHSYPSASQpysslgnrySSPATYSATASVAS 162
Cdd:PRK07764   666 GDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---------QAAQGASAPSPAAD 736
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1958750089  163 QGYPstcshyPISTVSNVVYPNVSYPSLPASEPYGQMFTSQSAPPPARP 211
Cdd:PRK07764   737 DPVP------LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
110-495 8.75e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 8.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  110 PSAPHMGASMPGPFQGAPASASHSYPSASQPYSSLGNRYSSPATYSATASVASQGYPStcSHYPISTVSNVVYP-NVSYP 188
Cdd:pfam03154  187 PPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPS--PHPPLQPMTQPPPPsQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  189 SLPASEPYGQMftsqsaPPPARPLKesySGPSTaVAYPSRPPPPPSQHQQQQQQQQQQQQQSHSGYSSlpwSGPGLPPAQ 268
Cdd:pfam03154  265 PLPQPSLHGQM------PPMPHSLQ---TGPSH-MQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ---QRIHTPPSQ 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  269 DSLIRNQ-------------MGSLAIPNSHPAINVADSLSCPITENVQPPKPSSVVATVLPGPS-------STRMPPA-- 326
Cdd:pfam03154  332 SQLQSQQppreqplppaplsMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslSTHHPPSah 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  327 PSHPVGPVPSAPPPPEQMQTKVRFPVPSHAGCGKWRPVNrdpglwgtyiGRMRERPPEK-FANKGMQYGDYVNNQASSTP 405
Cdd:pfam03154  412 PPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPT----------SGLHQVPSQSpFPQHPFVPGGPPPITPPSGP 481
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958750089  406 TPLSSASDDEEEEDEDEEAGVDSSSTTSSASPLPNSYDALEGgsyPDMHSSSASSPVPDRAPEPNSTLVPTPT-AAQPAK 484
Cdd:pfam03154  482 PTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEA---LDEAEEPESPPPPPRSPSPEPTVVNTPShASQSAR 558
                          410
                   ....*....|.
gi 1958750089  485 VAKPFGYGYPT 495
Cdd:pfam03154  559 FYKHLDRGYNS 569
GEL smart00262
Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and ...
1160-1201 9.43e-03

Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.


Pssm-ID: 214590 [Multi-domain]  Cd Length: 90  Bit Score: 36.89  E-value: 9.43e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 1958750089  1160 HVNDRVVPQPPLQKLSAEKLTREGAFLMDCGSVFYIWIGKGC 1201
Cdd:smart00262    4 RVKGKRNVRVPEVPFSQGSLNSGDCYILDTGSEIYVWVGKKS 45
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
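
The E-value threshold above determines which domain hits are listed. As a rough illustration only, the following Python sketch parses a few of the interval/E-value summary lines copied from this page and checks them against the 0.01 cutoff. The names `DomainHit` and `parse_hit` are hypothetical helpers, not part of any NCBI tool, and treating the cutoff as inclusive is an assumption.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One domain hit: name, query interval (1-based, inclusive), E-value."""
    name: str
    start: int
    end: int
    evalue: float


def parse_hit(name: str, summary: str) -> DomainHit:
    """Parse a summary line of the form '701-944 1.36e-130' (hypothetical helper)."""
    interval, evalue = summary.split()
    start, end = interval.split("-")
    return DomainHit(name, int(start), int(end), float(evalue))


# Value taken from "E-value threshold: 0.01" on this page.
# Assumption: a hit is kept when its E-value is <= the threshold.
E_VALUE_THRESHOLD = 0.01

# A subset of the hits listed on this page (interval and E-value as shown).
hits = [
    parse_hit("Sec24-like", "701-944 1.36e-130"),
    parse_hit("Sec23_trunk", "701-940 6.40e-107"),
    parse_hit("Sec23_BS", "945-1029 3.00e-33"),
    parse_hit("Gelsolin", "1165-1240 1.26e-09"),
    parse_hit("GEL", "1160-1201 9.43e-03"),
]

reported = [h for h in hits if h.evalue <= E_VALUE_THRESHOLD]

for h in reported:
    span = h.end - h.start + 1  # number of query residues covered
    print(f"{h.name}: residues {h.start}-{h.end} ({span} aa), E-value {h.evalue:.2e}")
```

All five hits pass the cutoff, which is consistent with their appearing in the list; the weakest one here, GEL at 9.43e-03, sits just under the threshold.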
