Conserved domains on  [gi|767964335|ref|XP_011538683|]

protein transport protein Sec24C isoform X1 [Homo sapiens]

Protein Classification

SEC24 family transport protein (domain architecture ID 1001573)

The SEC24 family transport protein is a component of the coat protein complex II (COPII), which promotes the formation of transport vesicles from the endoplasmic reticulum (ER).


List of domain hits

Name Accession Description Interval E-value
COG5028 super family cl34873
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
323-1113 1.32e-168

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


The actual alignment was detected with superfamily member COG5028:

Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 518.96  E-value: 1.32e-168
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  323 PDAIPSPQLSELPPQQKTRHRIDPDAIPS-PIQVIEDdrnnrgTEPFVTGVRG----QVPPLvTTNFLVKDQGNASPRYI 397
Cdd:COG5028    81 PAFQSQQKFSSPYGGSMADGTAPKPTNPLvPVDLFED------QPPPISDLFLppppIVPPL-TTNFVGSEQSNCSPKYV 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  398 RCTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPFMQFIEGGRRFQCCFCSCIN 477
Cdd:COG5028   154 RSTMYAIPETNDLLKKSKIPFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKN 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  478 DVPPQYFQHLDHTGKRVDAYDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLV----RLLCEELKSL 553
Cdd:COG5028   232 DVPEGFDNPSGPNDPRSDRYSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVkaaiRAILENLDQI 309
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  554 LDFLPReggaeesaIRVGFVTYNKVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVNESRAVITSLLDQIPEMF 632
Cdd:COG5028   310 PNFDPR--------TKIAIICFDSSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIF 380
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  633 ADTRETETVFVPviqagmeALKAA-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGAYQTLA 707
Cdd:COG5028   381 QDNKSPKNALGP-------ALKAAksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFA 445
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  708 KECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIR 785
Cdd:COG5028   446 IECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLR 525
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  786 AVDFFGAFYMSNTTDVELAGLDGDKTVTVEFKHDDRLNEeSGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNC 865
Cdd:COG5028   526 VSSFYGNFFNRSSDLCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASA 604
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  866 ETDTLINYMAKFAYRGVLNSPVKAVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAe 945
Cdd:COG5028   605 DQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS- 683
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  946 VTTDDRAYVRQLVTSMDVTETNVFFYPRLLPLTKSPVES-------TTEPPAVRASEERLSNGDIYLLENGLNLFLWVGA 1018
Cdd:COG5028   684 TPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAglpdeglLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGK 763
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335 1019 SVQQGVVQSLFSVSSFSQITSGLSVLPVLDNPLSKKVRGLIDSLRaQRSRYMKLTVVKQED----KMEMLFKHFLVEDKS 1094
Cdd:COG5028   764 DAVPSLLQDLFGVDSLSDIPSGKFTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKT 842
                         810
                  ....*....|....*....
gi 767964335 1095 LsGGASYVDFLCHMHKEIR 1113
Cdd:COG5028   843 L-NIPSYLDYLQILHEKIK 860
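
Each alignment on this page is preceded by a summary line giving the Pssm-ID, Cd Length, Bit Score, and E-value. The following is a minimal sketch of how such a line could be parsed, and how the fraction of the domain model covered by an alignment could be estimated from the first and last Cdd positions shown in the block. The names `HitSummary`, `parse_summary`, and `model_coverage` are hypothetical helpers introduced for illustration; they are not part of any NCBI tool.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Regular expression for the per-hit summary line as it appears on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


@dataclass
class HitSummary:
    """Hypothetical container for one summary line."""
    pssm_id: int
    kind: Optional[str]   # e.g. "Multi-domain"; None if no bracketed tag
    cd_length: int        # length of the domain model
    bit_score: float
    e_value: float


def parse_summary(line: str) -> HitSummary:
    """Parse a 'Pssm-ID: ... E-value: ...' line into a HitSummary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm_id"]),
        kind=m["kind"],
        cd_length=int(m["cd_length"]),
        bit_score=float(m["bit_score"]),
        e_value=float(m["e_value"]),
    )


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the model spanned by an alignment (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  "
        "Bit Score: 518.96  E-value: 1.32e-168"
    )
    # Model positions 81 and 860 are read from the first and last Cdd rows.
    print(hit, round(model_coverage(81, 860, hit.cd_length), 3))
```

The coverage figure counts the span between the first and last aligned model positions, so gaps inside the alignment are not subtracted.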
PHA03247 super family cl33720
large tegument protein UL36; Provisional
4-378 4.20e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 4.20e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    4 NQSVPPVPPfgQPQPIYPGY----HQSSYGGQSGS-TAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPC 78
Cdd:PHA03247 2565 DRSVPPPRP--APRPSEPAVtsraRRPDAPPQSARpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   79 GQAAYGQFGQGDV-----------------QNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLS 141
Cdd:PHA03247 2643 PPTVPPPERPRDDpapgrvsrprrarrlgrAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP 2722
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  142 -GMQISGAVAPAPPSSGLGFGPPTSLASASG-SFPNSGLYGSYPQGQAPPLSQAQGHPGIQTP---------QRSAPSQA 210
Cdd:PHA03247 2723 pGPAAARQASPALPAAPAPPAVPAGPATPGGpARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpavaslsesRESLPSPW 2802
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  211 SSFTPPASGGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSfgPAR 290
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PAR 2880
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  291 GPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQLSELPPQQKTRH-------------RIDPDAIPSPIQVIE 357
Cdd:PHA03247 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPppppprpqpplapTTDPAGAGEPSGAVP 2960
                         410       420
                  ....*....|....*....|.
gi 767964335  358 DDRNNRGTEPFVTGVRGQVPP 378
Cdd:PHA03247 2961 QPWLGALVPGRVAVPRFRVPQ 2981
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
323-1113 1.32e-168

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the ...
522-781 7.53e-124

Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat, which is built from several cytosolic components: the GTPase Sar1 and the Sec23-Sec24 and Sec13-Sec31 complexes. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step, leading to membrane deformation and budding of COPII-coated vesicles, is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region, and a carboxy-terminal gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 379.31  E-value: 7.53e-124
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  522 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDFLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 601
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  602 MFVPLLDGFLVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 681
Cdd:cd01479    77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  682 DDRKLINTDKEKTLFQPQTGAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQvendqerflSD 761
Cdd:cd01479   154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFN---------FS 224
                         250       260
                  ....*....|....*....|
gi 767964335  762 LRRDVQKVVGFDAVMRVRTS 781
Cdd:cd01479   225 APNDVEKLVNELARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
522-766 4.48e-116

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 358.49  E-value: 4.48e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   522 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDFLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 601
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   602 MFVPLLDGFLVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 681
Cdd:pfam04811   76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   682 DDRKLINTDKEKTLFQPQT-GAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVENDQERFLS 760
Cdd:pfam04811  156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235

                   ....*.
gi 767964335   761 DLRRDV 766
Cdd:pfam04811  236 DLQRYF 241
PTZ00395 PTZ00395
Sec24-related protein; Provisional
19-1114 4.02e-48

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 187.97  E-value: 4.02e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   19 IYPGYHqssyGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGapPASTAQAPCgqAAYGQFGQgdvQNGPSST 98
Cdd:PTZ00395  338 IYGGFH----DGSPNAASAGAPFNGLGNQADGGHINQVHPDARGAWAGG--PHSNASYNC--AAYSNAAQ---SNAAQSN 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   99 VQMQRLPGSQPfGSPLAPVGNQPPVLQPYGPPPTSAqvaTQLSGmqisgavapaPPSSGlgfgPPTSlasasgSFPNSGL 178
Cdd:PTZ00395  407 AGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSN---TPYSN----------PPNSN----PPYS------NLPYSNT 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  179 -YGSYPQGQAPPLSQAQGHPGIQTP-QRSAPSQASSFTPPASGGPRLPSMTGPllpGQSFGGPSVSQPnhvssppqalpp 256
Cdd:PTZ00395  463 pYSNAPLSNAPPSSAKDHHSAYHAAyQHRAANQPAANLPTANQPAANNFHGAA---GNSVGNPFASRP------------ 527
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  257 gtqmtgplgplppmhspqqpgyqpqqngsFG--PARGPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQlSEL 334
Cdd:PTZ00395  528 -----------------------------FGsaPYGGNAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSS-SEN 577
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  335 PPQ----------------QKTRHRIDPDAIPSPIQVIEDDRNNRGTEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIR 398
Cdd:PTZ00395  578 SSEnenevtdkgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLK 656
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  399 CTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYM-CPFMQFIEGGrrFQC 470
Cdd:PTZ00395  657 STLYQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYLhATILEDISSS--VQC 734
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  471 CFCSC---IND--------------------------------------VPPQYFQHLD-------HTGKRV-------- 494
Cdd:PTZ00395  735 VFCDTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmit 814
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  495 --------------------------------DAYDRPELSLGSY----------------------------------- 507
Cdd:PTZ00395  815 nkimsftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnh 894
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  508 ---------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLL 546
Cdd:PTZ00395  895 lygkdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTI 974
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  547 CEELKSLLDFL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGF 610
Cdd:PTZ00395  975 LEGIRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDL 1046
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  611 LVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTD 690
Cdd:PTZ00395 1047 FFGCVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDL 1120
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  691 KEKTLFQPQTGAYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYASFQVEND-QERFLSDLRRDVQ 767
Cdd:PTZ00395 1121 QENFLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTS 1200
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  768 KVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEESGALLQCALLYTSCAGQR 843
Cdd:PTZ00395 1201 EDIAYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDR 1280
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  844 RLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRGVLNSpvKAVRDTLITQCAQILACYRKNCASPSSAGQLILPEC 923
Cdd:PTZ00395 1281 FVRLHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDT 1358
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  924 MKLLPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVTSMDVTETNVFFYPRLLPL----TKSPVESTTE------PPAVRA 993
Cdd:PTZ00395 1359 LKLLPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhikgKTNEIDSMDVdddlfiPKTIPS 1436
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  994 SEERLSNGDIYLLENGLNLFLWVG----ASVQQGVVQSLFSVSSFSQitsglsvLPVLDNPLSKKVRGLIDSLRA--QRS 1067
Cdd:PTZ00395 1437 SAEKIYSNGIYLLDACTHFYLYFGfhsdANFAKEIVGDIPTEKNAHE-------LNLTDTPNAQKVQRIIKNLSRihHFN 1509
                        1290      1300      1310      1320
                  ....*....|....*....|....*....|....*....|....*..
gi 767964335 1068 RYMKLTVVKQEDKMEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1114
Cdd:PTZ00395 1510 KYVPLVMVAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-378 4.20e-15

large tegument protein UL36; Provisional


Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
6-271 3.12e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 67.87  E-value: 3.12e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335     6 SVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPgyqQTPPQGMSRAPPSSGAPPASTAQA--------- 76
Cdd:pfam03154  202 SAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSP---HPPLQPMTQPPPPSQVSPQPLPQPslhgqmppm 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    77 ----------------PCGQAAYGQFGQGDVQNGPSSTV-----QMQRLPGSQPFGSPLAPVGNQP----PVLQPY-GPP 130
Cdd:pfam03154  279 phslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAApgqsqQRIHTPPSQSQLQSQQPPREQPlppaPLSMPHiKPP 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   131 PTS--AQVATQLSGMQISGAVAPAPPSSGLGFGPPTS---LASASGSFPNSG------LYGSYPQGQAPP-----LSQAQ 194
Cdd:pfam03154  359 PTTpiPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkpLSSLSTHHPPSAhppplqLMPQSQQLPPPPaqppvLTQSQ 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   195 GHP--GIQTPQRSAPSQASSfTPPASGGPRLPSMTGPLLPGQsfgGPSVSQPNHVSS--PPQALPPGTQMTGPLG---PL 267
Cdd:pfam03154  439 SLPppAASHPPTSGLHQVPS-QSPFPQHPFVPGGPPPITPPS---GPPTSTSSAMPGiqPPSSASVSSSGPVPAAvscPL 514

                   ....
gi 767964335   268 PPMH 271
Cdd:pfam03154  515 PPVQ 518
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
30-269 4.43e-08

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.96  E-value: 4.43e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   30 GQSGSTAPAIPYGAyngpvpgyqQTPPQgmsraPPSSGAPPAST-AQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPGSQ 108
Cdd:COG5164    22 GSQGSTKPAQNQGS---------TRPAG-----NTGGTRPAQNQgSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  109 pfGSPLAPvGNQPPVLQPYGPPPTSAQVATQLSGMQISGAvAPAPPSSGlgfgpPTSLASASGSFPNSGLYGSYPQGQAP 188
Cdd:COG5164    88 --GGTRPA-GNTGGTTPAGDGGATGPPDDGGATGPPDDGG-STTPPSGG-----STTPPGDGGSTPPGPGSTGPGGSTTP 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  189 PLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPqalPPGTQMTGPLGPLP 268
Cdd:COG5164   159 PGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPP---DDRGGKTGPKDQRP 235

                  .
gi 767964335  269 P 269
Cdd:COG5164   236 K 236
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
5-242 4.81e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate or, in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9.
The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.
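
The degenerate-base notation in the description above (N is any nucleotide, R is A/G, Y is C/T) can be expanded mechanically into a regular expression. The following is a minimal sketch, not part of the CDD record: the function name `motif_to_regex` and both test sequences are hypothetical, and only the three ambiguity codes defined in the text are handled.

```python
import re

# Ambiguity codes exactly as defined in the description (N, R, Y only).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]", "R": "[AG]", "Y": "[CT]",
}

def motif_to_regex(motif):
    """Translate a degenerate DNA motif into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))

pattern = motif_to_regex("NNRCRCCYY")

# Hypothetical test sequences, not taken from any real promoter.
print(bool(pattern.fullmatch("AAGCGCCCT")))  # every position satisfies the motif
print(bool(pattern.fullmatch("AAGCGCCAA")))  # the last two bases are not C/T
```

The sketch scans one strand only; a real motif search would normally also test the reverse complement.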


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 44.15  E-value: 4.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPFGQPQPIYPGYHQSSYGG-------------QSGST-APAIPY--------GAYNGPVPGYQQTPPQGMSRA 62
Cdd:cd22540   159 QVLQQPQQAHKPVPIKPAPLQTSNTNsaslqvpgnviklQSGGNvALTLPVnnlvgtqdGATQLQLAAAPSKPSKKIRKK 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   63 PPSSGAPPASTAQAPcgQAAYGQFGQGD-VQNGPSSTVQMQrlpgsqpfgsplaPVGNQPPVLQPYG--PPPTSAQVAT- 138
Cdd:cd22540   239 SAQAAQPAVTVAEQV--ETVLIETTADNiIQAGNNLLIVQS-------------PGTGQPAVLQQVQvlQPKQEQQVVQi 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  139 ---QLSGMQISGAVAPAPPSSglgfgPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTP 215
Cdd:cd22540   304 pqqALRVVQAASATLPTVPQK-----PLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTA 378
                         250       260       270
                  ....*....|....*....|....*....|
gi 767964335  216 P-ASGGPRLPSMT--GPLLPGQSFGGPSVS 242
Cdd:cd22540   379 NnGTGTSKPNYNVrkERTLPKIAPAGGIIS 408
hnRNP-R-Q TIGR01648
heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the ...
12-193 7.34e-04

heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q, and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.


Pssm-ID: 273732 [Multi-domain]  Cd Length: 578  Bit Score: 43.45  E-value: 7.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    12 PFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYngpVPGYQQTPPQGMSRAPPSSGAPPASTAQapcgqaaYGQFGQGdv 91
Cdd:TIGR01648  383 GRGYPPYGYEAYYGDYYGYHDYRGKYEDKYYGY---DPGMELTPMNPVRGKPGGRGGRPAIPPP-------RGRKNGA-- 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    92 qnGPSSTVQMQRLPGSQPFGSPLApvGNQPPVLQPYGPPPTSAQVatqlsgmqiSGAVAPAPPSSGLGFGPPTSlaSASG 171
Cdd:TIGR01648  451 --PPPAIGQDGRQLFLYKITIPAG--YSQRPAPHPLGPPRGSAFV---------RGARGGPAQYQQRGRGSRTS--RGNG 515
                          170       180
                   ....*....|....*....|..
gi 767964335   172 SFPNSGLYGSYPQGQAPPLSQA 193
Cdd:TIGR01648  516 RGGTAGGKRKAFDGYAQPDATA 537
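
Each hit in this listing carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. When the page is saved as plain text, those fields can be pulled out with a regular expression. This is a minimal sketch that assumes the line layout shown on this page; the names `STATS_RE` and `parse_stats` are hypothetical and do not belong to any NCBI tool.

```python
import re

# Layout assumed from the statistics lines on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_stats(line):
    """Return the statistics fields as a dict, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "hit_type": m["hit_type"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }

line = "Pssm-ID: 273732 [Multi-domain]  Cd Length: 578  Bit Score: 43.45  E-value: 7.34e-04"
print(parse_stats(line))
```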
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
323-1113 1.32e-168

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 518.96  E-value: 1.32e-168
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  323 PDAIPSPQLSELPPQQKTRHRIDPDAIPS-PIQVIEDdrnnrgTEPFVTGVRG----QVPPLvTTNFLVKDQGNASPRYI 397
Cdd:COG5028    81 PAFQSQQKFSSPYGGSMADGTAPKPTNPLvPVDLFED------QPPPISDLFLppppIVPPL-TTNFVGSEQSNCSPKYV 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  398 RCTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPFMQFIEGGRRFQCCFCSCIN 477
Cdd:COG5028   154 RSTMYAIPETNDLLKKSKIPFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKN 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  478 DVPPQYFQHLDHTGKRVDAYDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLV----RLLCEELKSL 553
Cdd:COG5028   232 DVPEGFDNPSGPNDPRSDRYSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVkaaiRAILENLDQI 309
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  554 LDFLPReggaeesaIRVGFVTYNKVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVNESRAVITSLLDQIPEMF 632
Cdd:COG5028   310 PNFDPR--------TKIAIICFDSSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIF 380
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  633 ADTRETETVFVPviqagmeALKAA-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGAYQTLA 707
Cdd:COG5028   381 QDNKSPKNALGP-------ALKAAksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFA 445
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  708 KECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIR 785
Cdd:COG5028   446 IECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLR 525
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  786 AVDFFGAFYMSNTTDVELAGLDGDKTVTVEFKHDDRLNEeSGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNC 865
Cdd:COG5028   526 VSSFYGNFFNRSSDLCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASA 604
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  866 ETDTLINYMAKFAYRGVLNSPVKAVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAe 945
Cdd:COG5028   605 DQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS- 683
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  946 VTTDDRAYVRQLVTSMDVTETNVFFYPRLLPLTKSPVES-------TTEPPAVRASEERLSNGDIYLLENGLNLFLWVGA 1018
Cdd:COG5028   684 TPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAglpdeglLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGK 763
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335 1019 SVQQGVVQSLFSVSSFSQITSGLSVLPVLDNPLSKKVRGLIDSLRaQRSRYMKLTVVKQED----KMEMLFKHFLVEDKS 1094
Cdd:COG5028   764 DAVPSLLQDLFGVDSLSDIPSGKFTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKT 842
                         810
                  ....*....|....*....
gi 767964335 1095 LsGGASYVDFLCHMHKEIR 1113
Cdd:COG5028   843 L-NIPSYLDYLQILHEKIK 860
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the ...
522-781 7.53e-124

Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat, which is built from several cytosolic components: the GTPase Sar1 and the Sec23-Sec24 and Sec13-Sec31 complexes. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region, and a carboxy-terminal gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 379.31  E-value: 7.53e-124
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  522 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDFLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 601
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  602 MFVPLLDGFLVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 681
Cdd:cd01479    77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  682 DDRKLINTDKEKTLFQPQTGAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQvendqerflSD 761
Cdd:cd01479   154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFN---------FS 224
                         250       260
                  ....*....|....*....|
gi 767964335  762 LRRDVQKVVGFDAVMRVRTS 781
Cdd:cd01479   225 APNDVEKLVNELARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
522-766 4.48e-116

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 358.49  E-value: 4.48e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   522 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDFLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 601
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   602 MFVPLLDGFLVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 681
Cdd:pfam04811   76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   682 DDRKLINTDKEKTLFQPQT-GAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVENDQERFLS 760
Cdd:pfam04811  156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235

                   ....*.
gi 767964335   761 DLRRDV 766
Cdd:pfam04811  236 DLQRYF 241
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
522-764 4.53e-104

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 326.51  E-value: 4.53e-104
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  522 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDFLPREGGAeesaiRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 601
Cdd:cd01468     1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRA-----RVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKD 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  602 MFVPLLDGFLVNVNESRAVITSLLDQIPEMFAD--TRETETVFVPVIQAGMEALKAAECAGKLFLFHTSLPIAEaPGKLK 679
Cdd:cd01468    76 VFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPvpTHRPERCLGPALQAAFLLLKGTFAGGRIIVFQGGLPTVG-PGKLK 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  680 NRDDRKLINTDKEKTLFQPQTGAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVENDQERFL 759
Cdd:cd01468   155 SREDKEPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFK 234

                  ....*
gi 767964335  760 SDLRR 764
Cdd:cd01468   235 QDLQR 239
PTZ00395 PTZ00395
Sec24-related protein; Provisional
19-1114 4.02e-48

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 187.97  E-value: 4.02e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   19 IYPGYHqssyGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGapPASTAQAPCgqAAYGQFGQgdvQNGPSST 98
Cdd:PTZ00395  338 IYGGFH----DGSPNAASAGAPFNGLGNQADGGHINQVHPDARGAWAGG--PHSNASYNC--AAYSNAAQ---SNAAQSN 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   99 VQMQRLPGSQPfGSPLAPVGNQPPVLQPYGPPPTSAqvaTQLSGmqisgavapaPPSSGlgfgPPTSlasasgSFPNSGL 178
Cdd:PTZ00395  407 AGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSN---TPYSN----------PPNSN----PPYS------NLPYSNT 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  179 -YGSYPQGQAPPLSQAQGHPGIQTP-QRSAPSQASSFTPPASGGPRLPSMTGPllpGQSFGGPSVSQPnhvssppqalpp 256
Cdd:PTZ00395  463 pYSNAPLSNAPPSSAKDHHSAYHAAyQHRAANQPAANLPTANQPAANNFHGAA---GNSVGNPFASRP------------ 527
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  257 gtqmtgplgplppmhspqqpgyqpqqngsFG--PARGPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQlSEL 334
Cdd:PTZ00395  528 -----------------------------FGsaPYGGNAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSS-SEN 577
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  335 PPQ----------------QKTRHRIDPDAIPSPIQVIEDDRNNRGTEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIR 398
Cdd:PTZ00395  578 SSEnenevtdkgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLK 656
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  399 CTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYM-CPFMQFIEGGrrFQC 470
Cdd:PTZ00395  657 STLYQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYLhATILEDISSS--VQC 734
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  471 CFCSC---IND--------------------------------------VPPQYFQHLD-------HTGKRV-------- 494
Cdd:PTZ00395  735 VFCDTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmit 814
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  495 --------------------------------DAYDRPELSLGSY----------------------------------- 507
Cdd:PTZ00395  815 nkimsftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnh 894
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  508 ---------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLL 546
Cdd:PTZ00395  895 lygkdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTI 974
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  547 CEELKSLLDFL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGF 610
Cdd:PTZ00395  975 LEGIRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDL 1046
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  611 LVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTD 690
Cdd:PTZ00395 1047 FFGCVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDL 1120
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  691 KEKTLFQPQTGAYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYASFQVEND-QERFLSDLRRDVQ 767
Cdd:PTZ00395 1121 QENFLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTS 1200
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  768 KVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEESGALLQCALLYTSCAGQR 843
Cdd:PTZ00395 1201 EDIAYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDR 1280
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  844 RLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRGVLNSpvKAVRDTLITQCAQILACYRKNCASPSSAGQLILPEC 923
Cdd:PTZ00395 1281 FVRLHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDT 1358
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  924 MKLLPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVTSMDVTETNVFFYPRLLPL----TKSPVESTTE------PPAVRA 993
Cdd:PTZ00395 1359 LKLLPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhikgKTNEIDSMDVdddlfiPKTIPS 1436
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  994 SEERLSNGDIYLLENGLNLFLWVG----ASVQQGVVQSLFSVSSFSQitsglsvLPVLDNPLSKKVRGLIDSLRA--QRS 1067
Cdd:PTZ00395 1437 SAEKIYSNGIYLLDACTHFYLYFGfhsdANFAKEIVGDIPTEKNAHE-------LNLTDTPNAQKVQRIIKNLSRihHFN 1509
                        1290      1300      1310      1320
                  ....*....|....*....|....*....|....*....|....*..
gi 767964335 1068 RYMKLTVVKQEDKMEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1114
Cdd:PTZ00395 1510 KYVPLVMVAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
Sec23_helical pfam04815
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ...
868-966 1.58e-34

Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.


Pssm-ID: 461441 [Multi-domain]  Cd Length: 103  Bit Score: 127.62  E-value: 1.58e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   868 DTLINYMAKFAYRGVLNSPVKAVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAEVT 947
Cdd:pfam04815    3 EAIAVLLAKKAVEKALSSSLSDAREALDNKLVDILAAYRKYCASSSSPGQLILPESLKLLPLYMLALLKSPALRGGNSSP 82
                           90
                   ....*....|....*....
gi 767964335   948 TDDRAYVRQLVTSMDVTET 966
Cdd:pfam04815   83 SDERAYARHLLLSLPVEEL 101
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
771-854 3.70e-28

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 108.78  E-value: 3.70e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   771 GFDAVMRVRTSTGIRAVDFFGAFYMSNTTD-VELAGLDGDKTVTVEFKHDDRLNEESGALLQCALLYTSCAGQRRLRIHN 849
Cdd:pfam08033    1 GFNAVLRVRTSKGLKVSGFIGNFVSRSSGDtWKLPSLDPDTSYAFEFDIDEPLPNGSNAYIQFALLYTHSSGERRIRVTT 80

                   ....*
gi 767964335   850 LALNC 854
Cdd:pfam08033   81 VALPV 85
PLN00162 PLN00162
transport protein sec23; Provisional
401-847 1.07e-18

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 91.93  E-value: 1.07e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  401 SYNI-PCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNRCKAYMCPFMQFIEGGRRFQCCFCSCINDV 479
Cdd:PLN00162   15 SWNVwPSSKIEASKCVIPLAALYTPLKPLPELPVLPY-------DPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNHF 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  480 PPQYF----QHLDhtgkrvdaydrPELslgsYEFLATVDY---CKNNKFPSPPAFIFMIDVSynAIRTGLvRLLCEELKS 552
Cdd:PLN00162   88 PPHYSsiseTNLP-----------AEL----FPQYTTVEYtlpPGSGGAPSPPVFVFVVDTC--MIEEEL-GALKSALLQ 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  553 LLDFLPreggaeESAiRVGFVTY----------------------------NKVLHFYNVKSSLAQPQMMVVSDVADMFV 604
Cdd:PLN00162  150 AIALLP------ENA-LVGLITFgthvhvhelgfsecsksyvfrgnkevskDQILEQLGLGGKKRRPAGGGIAGARDGLS 222
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  605 PL-LDGFLVNVNESRAVITSLLDQI-PEMF---ADTRETETVFVPV-IQAGMEALKAAECAGKLFLFhTSLPIAEAPGKL 678
Cdd:PLN00162  223 SSgVNRFLLPASECEFTLNSALEELqKDPWpvpPGHRPARCTGAALsVAAGLLGACVPGTGARIMAF-VGGPCTEGPGAI 301
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  679 KNRDDRKLINTDKE-----KTLFQPQTGAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFqven 753
Cdd:PLN00162  302 VSKDLSEPIRSHKDldkdaAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESF---- 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  754 DQERFLSDLRRDVQKV------VGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTV 812
Cdd:PLN00162  378 GHSVFKDSLRRVFERDgegslgLSFNGTFEVNCSKDVKVQGAIGpcaslekkgpsvsdtEIGEGGTTAWKLCGLDKKTSL 457
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 767964335  813 TVEF----KHDDRLNEESGAL-LQCALLYTSCAGQRRLRI 847
Cdd:PLN00162  458 AVFFevanSGQSNPQPPGQQFfLQFLTRYQHSNGQTRLRV 497
zf-Sec23_Sec24 pfam04810
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
445-482 3.51e-17

Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is a zinc-binding domain.


Pssm-ID: 461437 [Multi-domain]  Cd Length: 38  Bit Score: 75.95  E-value: 3.51e-17
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 767964335   445 PLRCNRCKAYMCPFMQFIEGGRRFQCCFCSCINDVPPQ 482
Cdd:pfam04810    1 PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRNPVPPE 38
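
A pair of alignment rows such as the zinc finger block above can be summarized as a residue identity count. The sketch below is illustrative only: the function name `identity` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned region, which is how the alignment blocks on this page appear to be laid out.

```python
def identity(query_row, subject_row):
    """Count identical residues over columns where both rows hold an aligned residue."""
    if len(query_row) != len(subject_row):
        raise ValueError("rows must have equal length")
    aligned = identical = 0
    for q, s in zip(query_row, subject_row):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned

# Rows copied from the pfam04810 alignment above (query residues 445-482).
query = "PLRCNRCKAYMCPFMQFIEGGRRFQCCFCSCINDVPPQ"
subject = "PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRNPVPPE"
identical, aligned = identity(query, subject)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For multi-block alignments, the rows of each block would need to be concatenated before calling the function.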
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
397-974 1.15e-16

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 85.32  E-value: 1.15e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  397 IRCTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNR-CKAYMCPFMQFIEGGRRFQCCFCSC 475
Cdd:COG5047    12 IRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYY-------EPVKCTApCKAVLNPYCHIDERNQSWICPFCNQ 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  476 INDVPPQYfqhldhtgkrvDAYDRPELSLGSYEFLATVDYCKNNKFPSPPAFIFMIDVSYNAIRtglVRLLCEELKSLLD 555
Cdd:COG5047    85 RNTLPPQY-----------RDISNANLPLELLPQSSTIEYTLSKPVILPPVFFFVVDACCDEEE---LTALKDSLIVSLS 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  556 FLPREggaeesAIrVGFVTYNKVLHFYNVkSSLAQPQMMVVSDVADMFVPLLD--------------------------- 608
Cdd:COG5047   151 LLPPE------AL-VGLITYGTSIQVHEL-NAENHRRSYVFSGNKEYTKENLQellalskptksggfeskisgigqfass 222
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  609 GFLVNVNESRAVITSLLDQI-------PEMFADTRETET-VFVPVIQAGMEALKaaeCAGKLFLFhTSLPIAEAPGKLKN 680
Cdd:COG5047   223 RFLLPTQQCEFKLLNILEQLqpdpwpvPAGKRPLRCTGSaLNIASSLLEQCFPN---AGCHIVLF-AGGPCTVGPGTVVS 298
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  681 RDDRK------LINTDKEKtLFQPQTGAYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVEND 754
Cdd:COG5047   299 TELKEpmrshhDIESDSAQ-HSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIF 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  755 QERFLSDLRRDVQK--VVGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTVTVEFK 817
Cdd:COG5047   378 KQSFQRIFNRDSEGylKMGFNANMEVKTSKNLKIKGLIGhavsvkkkannisdsEIGIGATNSWKMASLSPKSNYALYFE 457
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  818 HDDRLNEESG-----ALLQCALLYTSCAGQRRLRIHNLALNCCTQLADL-YRNCETDTLINYMAKFAyrgVLNSPVKAVR 891
Cdd:COG5047   458 IALGAASGSAqrpaeAYIQFITTYQHSSGTYRIRVTTVARMFTDGGLPKiNRSFDQEAAAVFMARIA---AFKAETEDII 534
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  892 D-------TLITQCaQILACYRKNcaSPSSAGqliLPECMKLLPVYLNCVLKSDVLQPGAEvTTDDRAYVRQLVTSMDVT 964
Cdd:COG5047   535 DvfrwidrNLIRLC-QKFADYRKD--DPSSFR---LDPNFTLYPQFMYHLRRSPFLSVFNN-SPDETAFYRHMLNNADVN 607
                         650
                  ....*....|
gi 767964335  965 ETNVFFYPRL 974
Cdd:COG5047   608 DSLIMIQPTL 617
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-378 4.20e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 4.20e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    4 NQSVPPVPPfgQPQPIYPGY----HQSSYGGQSGS-TAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPC 78
Cdd:PHA03247 2565 DRSVPPPRP--APRPSEPAVtsraRRPDAPPQSARpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   79 GQAAYGQFGQGDV-----------------QNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLS 141
Cdd:PHA03247 2643 PPTVPPPERPRDDpapgrvsrprrarrlgrAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP 2722
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  142 -GMQISGAVAPAPPSSGLGFGPPTSLASASG-SFPNSGLYGSYPQGQAPPLSQAQGHPGIQTP---------QRSAPSQA 210
Cdd:PHA03247 2723 pGPAAARQASPALPAAPAPPAVPAGPATPGGpARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpavaslsesRESLPSPW 2802
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  211 SSFTPPASGGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSfgPAR 290
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PAR 2880
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  291 GPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQLSELPPQQKTRH-------------RIDPDAIPSPIQVIE 357
Cdd:PHA03247 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPppppprpqpplapTTDPAGAGEPSGAVP 2960
                         410       420
                  ....*....|....*....|.
gi 767964335  358 DDRNNRGTEPFVTGVRGQVPP 378
Cdd:PHA03247 2961 QPWLGALVPGRVAVPRFRVPQ 2981
Gelsolin pfam00626
Gelsolin repeat;
984-1059 2.80e-12

Gelsolin repeat;


Pssm-ID: 395501 [Multi-domain]  Cd Length: 76  Bit Score: 63.10  E-value: 2.80e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767964335   984 STTEPPAVRASEERLSNGDIYLLENGLNLFLWVGASVQQgvVQSLFSVSSFSQI-TSGLSVLPVLDN-PLSKKVRGLI 1059
Cdd:pfam00626    1 KFVLPPPVPLSQESLNSGDCYLLDNGFTIFLWVGKGSSL--LEKLFAALLAAQLdDDERFPLPEVIRvPQGKEPARFL 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
8-304 8.42e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 69.97  E-value: 8.42e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    8 PPVPPFGQPQPIYPGYHQSSYGGQSGSTAPA-----IPYGAYNGPV----------PGYQQTPPQGMSRAPPSSGAPPAS 72
Cdd:PHA03247 2704 PPPTPEPAPHALVSATPLPPGPAAARQASPAlpaapAPPAVPAGPAtpggparparPPTTAGPPAPAPPAAPAAGPPRRL 2783
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   73 TAQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPGSQPFGSPLAPVgnqppvlqpygPPPTSAQvatqlsgmqisgavaPA 152
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPL-----------PPPTSAQ---------------PT 2837
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  153 PPSSGLGFgPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQAS-SFTPPASGGPRLPSmtgPLL 231
Cdd:PHA03247 2838 APPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTeSFALPPDQPERPPQ---PQA 2913
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767964335  232 PGQSFGGPSVSQPNHVSSPPQAlPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSFGPARGPQSNYGGPYPAAP 304
Cdd:PHA03247 2914 PPPPQPQPQPPPPPQPQPPPPP-PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
6-271 3.12e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 67.87  E-value: 3.12e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335     6 SVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPgyqQTPPQGMSRAPPSSGAPPASTAQA--------- 76
Cdd:pfam03154  202 SAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSP---HPPLQPMTQPPPPSQVSPQPLPQPslhgqmppm 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    77 ----------------PCGQAAYGQFGQGDVQNGPSSTV-----QMQRLPGSQPFGSPLAPVGNQP----PVLQPY-GPP 130
Cdd:pfam03154  279 phslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAApgqsqQRIHTPPSQSQLQSQQPPREQPlppaPLSMPHiKPP 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   131 PTS--AQVATQLSGMQISGAVAPAPPSSGLGFGPPTS---LASASGSFPNSG------LYGSYPQGQAPP-----LSQAQ 194
Cdd:pfam03154  359 PTTpiPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkpLSSLSTHHPPSAhppplqLMPQSQQLPPPPaqppvLTQSQ 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   195 GHP--GIQTPQRSAPSQASSfTPPASGGPRLPSMTGPLLPGQsfgGPSVSQPNHVSS--PPQALPPGTQMTGPLG---PL 267
Cdd:pfam03154  439 SLPppAASHPPTSGLHQVPS-QSPFPQHPFVPGGPPPITPPS---GPPTSTSSAMPGiqPPSSASVSSSGPVPAAvscPL 514

                   ....
gi 767964335   268 PPMH 271
Cdd:pfam03154  515 PPVQ 518
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
8-352 1.31e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 65.94  E-value: 1.31e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335     8 PPVPPFGQPQPIYPGYHQSSyggqsgstAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGA--PPASTAQAPcgqaaygq 85
Cdd:pfam03154  255 PPPPSQVSPQPLPQPSLHGQ--------MPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSqvPPGPSPAAP-------- 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    86 fgqgdvqnGPSStvQMQRLPGSQPFGSPLAPVGNQP----PVLQPY-GPPPTS--AQVATQLSGMQISGAVAPAPPSSGL 158
Cdd:pfam03154  319 --------GQSQ--QRIHTPPSQSQLQSQQPPREQPlppaPLSMPHiKPPPTTpiPQLPNPQSHKHPPHLSGPSPFQMNS 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   159 GFGPPTSLAsasgsfPNSGLYGSYPQGQAPPLSQ--AQGHPgIQTPQRSAP--SQASSFTPPASGGPrlPSMTGPLLPGQ 234
Cdd:pfam03154  389 NLPPPPALK------PLSSLSTHHPPSAHPPPLQlmPQSQQ-LPPPPAQPPvlTQSQSLPPPAASHP--PTSGLHQVPSQ 459
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   235 SfggPSVSQPNHVSSPPQALPPGTqmtgplgplPPMHSPQQPGyqpqqngSFGPARGPQSNYGGPYPAAPTfgsqpgppQ 314
Cdd:pfam03154  460 S---PFPQHPFVPGGPPPITPPSG---------PPTSTSSAMP-------GIQPPSSASVSSSGPVPAAVS--------C 512
                          330       340       350
                   ....*....|....*....|....*....|....*...
gi 767964335   315 PLPPKRLDPDAIPSPQLSELPPQQKTRHRIDPDAIPSP 352
Cdd:pfam03154  513 PLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-254 5.86e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 5.86e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    6 SVPPVPPFGQPQPIYPGYHQSSYGGQSGS----TAPAIPYGAYNGPVPGYQQT--------PPQGMSRAPPSSGAPPAST 73
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESlpspWDPADPPAAVLAPAAALPPAaspagplpPPTSAQPTAPPPPPGPPPP 2848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   74 AQAPCGQAAYGqfgqGDVQNGPSStvqmqRLPGSQPFGSPLAPVGN--------------QPPVLQPYGPPPTSAQVATQ 139
Cdd:PHA03247 2849 SLPLGGSVAPG----GDVRRRPPS-----RSPAAKPAAPARPPVRRlarpavsrstesfaLPPDQPERPPQPQAPPPPQP 2919
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  140 LSGMQISGAVAPAPPSSGLgfgPPTSLASASGSFPNSGlygsyPQGQAPPLSQAQGHPG-IQTPQRSAPSQASSFTPPAs 218
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPR---PQPPLAPTTDPAGAGE-----PSGAVPQPWLGALVPGrVAVPRFRVPQPAPSREAPA- 2990
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 767964335  219 ggPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQAL 254
Cdd:PHA03247 2991 --SSTPPLTGHSLSRVSSWASSLALHEETDPPPVSL 3024
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
29-304 4.33e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 57.87  E-value: 4.33e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   29 GGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRA--PPSSGAPPASTAQAPCGQAAYGQFGQ-GDVQNGPSSTVQmqrlP 105
Cdd:PHA03307  110 GPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSpgPPPAASPPAAGASPAAVASDAASSRQaALPLSSPEETAR----A 185
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  106 GSQPfgsPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPQG 185
Cdd:PHA03307  186 PSSP---PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP 262
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  186 QAPPLSQAQGHPGIQTPQRsAPSQASSFTPPASGGPRLPSM--TGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGP 263
Cdd:PHA03307  263 ITLPTRIWEASGWNGPSSR-PGPASSSSSPRERSPSPSPSSpgSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAV 341
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 767964335  264 lGPLPPMHSPQQPGYQPQQNGSFGPARGPQSNYGGPYPAAP 304
Cdd:PHA03307  342 -SPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAAS 381
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
30-269 4.43e-08

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.96  E-value: 4.43e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   30 GQSGSTAPAIPYGAyngpvpgyqQTPPQgmsraPPSSGAPPAST-AQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPGSQ 108
Cdd:COG5164    22 GSQGSTKPAQNQGS---------TRPAG-----NTGGTRPAQNQgSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  109 pfGSPLAPvGNQPPVLQPYGPPPTSAQVATQLSGMQISGAvAPAPPSSGlgfgpPTSLASASGSFPNSGLYGSYPQGQAP 188
Cdd:COG5164    88 --GGTRPA-GNTGGTTPAGDGGATGPPDDGGATGPPDDGG-STTPPSGG-----STTPPGDGGSTPPGPGSTGPGGSTTP 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  189 PLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPqalPPGTQMTGPLGPLP 268
Cdd:COG5164   159 PGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPP---DDRGGKTGPKDQRP 235

                  .
gi 767964335  269 P 269
Cdd:COG5164   236 K 236
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
30-368 1.04e-07

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 56.17  E-value: 1.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    30 GQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAP--PASTAQAPCGQAAYGQFGQGDVQNGPsstvqMQRLPGS 107
Cdd:pfam09606   91 GQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPqmPMGGAGFPSQMSRVGRMQPGGQAGGM-----MQPSSGQ 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   108 QPFGSPLAPVGNQPPVlQPYGPPPTSAQVATQLSGM--QISGAVAPAPPSSGLGFGpptSLASASGSFPNSGLYGSYPQ- 184
Cdd:pfam09606  166 PGSGTPNQMGPNGGPG-QGQAGGMNGGQQGPMGGQMppQMGVPGMPGPADAGAQMG---QQAQANGGMNPQQMGGAPNQv 241
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   185 --GQAPP---LSQAQGHPGIQTPQRSAPS--QASSFTPPASGGPRLPSMTGPLLPGQSFGGPSVSQ-------------- 243
Cdd:pfam09606  242 amQQQQPqqqGQQSQLGMGINQMQQMPQGvgGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYQqqqtrqqqqqqggn 321
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   244 --PNHVSSPPQALPPGTQMT---------------------GPLGPLPPMHSPQQPGYQPQQNGSFGPARGPQSN--YGG 298
Cdd:pfam09606  322 hpAAHQQQMNQSVGQGGQVValgglnhletwnpgnfgglgaNPMQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSpqPSV 401
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767964335   299 PYPAAPTFGSQPGPPqplppkrldPDAIPSPQLSELPPQQKTRHRIDPDAIP--SPIQVIEDDRNNRGTEPF 368
Cdd:pfam09606  402 PSPQGPGSQPPQSHP---------GGMIPSPALIPSPSPQMSQQPAQQRTIGqdSPGGSLNTPGQSAVNSPL 464
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
71-299 1.72e-07

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 55.40  E-value: 1.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    71 ASTAQAPCGQAAYGQFGQGdvQNG---PSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLS--GMQI 145
Cdd:pfam09606   57 AAQQQQPQGGQGNGGMGGG--QQGmpdPINALQNLAGQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGrpQMPM 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   146 SGAVAPAPPSSGLGFGPPtslASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPS 225
Cdd:pfam09606  135 GGAGFPSQMSRVGRMQPG---GQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPG 211
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   226 MT-------------GPLLPGQSFGGPsvsqPNHVSSPPQALPPGTQMTGPLGPLPPMHspqqpgyQPQQNGSFGPARGP 292
Cdd:pfam09606  212 PAdagaqmgqqaqanGGMNPQQMGGAP----NQVAMQQQQPQQQGQQSQLGMGINQMQQ-------MPQGVGGGAGQGGP 280

                   ....*..
gi 767964335   293 QSNYGGP 299
Cdd:pfam09606  281 GQPMGPP 287
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
120-355 2.32e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 55.16  E-value: 2.32e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   120 QPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLAsasgsfPNSglygsyPQGQAPPLSQAQGHPGI 199
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQP------PNQ------TQSTAAPHTLIQQTPTL 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   200 QTPQRSAP----SQASSFTPPASGGPR---LPSMTGPLLPGqsfGGPSVSQPNHVSSP--PQALPPGTQMTGPLGPLPPM 270
Cdd:pfam03154  238 HPQRLPSPhpplQPMTQPPPPSQVSPQplpQPSLHGQMPPM---PHSLQTGPSHMQHPvpPQPFPLTPQSSQSQVPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   271 HSPQQPGYQPQQN-GSFGPARGPQSNYGGPYPAAPtfgsqpgppqpLPPKRLDPDaiPSPQLSELPPQQKTRHRIDPDAi 349
Cdd:pfam03154  315 PAAPGQSQQRIHTpPSQSQLQSQQPPREQPLPPAP-----------LSMPHIKPP--PTTPIPQLPNPQSHKHPPHLSG- 380

                   ....*.
gi 767964335   350 PSPIQV 355
Cdd:pfam03154  381 PSPFQM 386
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-269 3.02e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 3.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335     5 QSVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMS----RAPPSSGAPPASTAQA---P 77
Cdd:pfam03154  296 QPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSmphiKPPPTTPIPQLPNPQShkhP 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    78 CGQAAYGQFGQGDVQNGPSSTVQMQRLPGSQP------------FGSPLAPVGNQPPVL-QPYGPPPTSAQVATQlSGMQ 144
Cdd:pfam03154  376 PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPpsahppplqlmpQSQQLPPPPAQPPVLtQSQSLPPPAASHPPT-SGLH 454
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   145 ISGAVAPAPPSSGLGFGPPTSLasasgsfPNSGlygsypqgqaPPLSQAQGHPGIQTPqrsapsqasSFTPPASGGPrLP 224
Cdd:pfam03154  455 QVPSQSPFPQHPFVPGGPPPIT-------PPSG----------PPTSTSSAMPGIQPP---------SSASVSSSGP-VP 507
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 767964335   225 SMTGPLLPGQSFGGPSVSQPNHVSSPPqalPPgtqmtgPLGPLPP 269
Cdd:pfam03154  508 AAVSCPLPPVQIKEEALDEAEEPESPP---PP------PRSPSPE 543
PPE COG5651
PPE-repeat protein [Function unknown];
5-241 3.18e-07

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 54.13  E-value: 3.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPfgqPQPIYpgyhqsSYGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPAStaqAPCGQAAYG 84
Cdd:COG5651   163 ALTPFTQP---PPTIT------NPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIG---LNSGPGNTG 230
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   85 QFGQGDVQNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPT 164
Cdd:COG5651   231 FAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGL 310
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767964335  165 SLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPqrSAPSQASSFTPPASGGPRLPSMTGPLLPGQSFGGPSV 241
Cdd:COG5651   311 GAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAA--AAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
29-233 6.07e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 6.07e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   29 GGQSGSTAPAIPYGAyngpvPGYQQTPPQGMSR-APPSSGAPPASTAQAPCGQAAYGQFGQGDVQNGPS-STVQMQRLPG 106
Cdd:PRK12323  366 GQSGGGAGPATAAAA-----PVAQPAPAAAAPAaAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApEALAAARQAS 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  107 SQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPtSLASASGSFPNSGLYGSYPqgq 186
Cdd:PRK12323  441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP-PWEELPPEFASPAPAQPDA--- 516
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 767964335  187 APPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPG 233
Cdd:PRK12323  517 APAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PHA03377 PHA03377
EBNA-3C; Provisional
8-197 3.48e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.21  E-value: 3.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    8 PPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAP--PSSGAPPASTAQAPC-GQAAYG 84
Cdd:PHA03377  770 PQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPwaPRPPHLPPQWDGSAGhGQDQVS 849
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   85 QFGQGDVQNGPSS--TVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAqvatqlsgmqisgavapAPPSSGLGFGP 162
Cdd:PHA03377  850 QFPHLQSETGPPRlqLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPTRF-----------------PPPPMPLQDSM 912
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 767964335  163 PTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHP 197
Cdd:PHA03377  913 AVGCDSSGTACPSMPFASDYSQGAFTPLDINAQTP 947
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
40-256 4.28e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.14  E-value: 4.28e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   40 PYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQAAygqfgqgdvqngPSSTVQMQRLPGSQPFGSPLAPVGN 119
Cdd:PRK07764  592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAA------------APAEASAAPAPGVAAPEHHPKHVAV 659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  120 QPPVLQPYGPPPTSAQVAtqlsgmQISGAVAPAPPSSGLGFGPPTSLASASGSfpnsglygsyPQGQAPPLSQAQGHPGI 199
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAA------PAAPPPAPAPAAPAAPAGAAPAQPAPAPA----------ATPPAGQADDPAAQPPQ 723
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767964335  200 QTPQRSAPSQASSFTPPASGGPRLPSMTGPlLPGQSFGGPSVSQPNHVSSPPQALPP 256
Cdd:PRK07764  724 AAQGASAPSPAADDPVPLPPEPDDPPDPAG-APAQPPPPPAPAPAAAPAAAPPPSPP 779
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
102-304 5.17e-06

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 49.21  E-value: 5.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   102 QRLPGSQPFGSPLAPVG-------NQPPVLQPYGPPPTSAQVATQLSGMQiSGAVAPAPPsSGLGFGPPtslasaSGSFP 174
Cdd:pfam15822   28 QGWPGSNPWNNPSAPPAvpsglppSTAPSTVPFGPAPTGMYPSIPLTGPS-PGPPAPFPP-SGPSCPPP------GGPYP 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   175 NSGLYGSYPQGQAPPlsqaqghPGIQTPQrsAPSQASSFTPPASGGPRLP--SM-TGPLLPGQSFGGPSVSQPNHVSSPP 251
Cdd:pfam15822  100 APTVPGPGPIGPYPT-------PNMPFPE--LPRPYGAPTDPAAAAPSGPwgSMsSGPWAPGMGGQYPAPNMPYPSPGPY 170
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767964335   252 QALPP----GTQMTGPLGPLPPmhspqqpgyqpQQNGSFGPARGPQSNYG--GPYPAAP 304
Cdd:pfam15822  171 PAVPPpqspGAAPPVPWGTVPP-----------GPWGPPAPYPDPTGSYPmpGLYPTPN 218
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
8-139 5.58e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 5.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    8 PPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAiPYGAYNGPVPGYQQTPPQGmsrAPPSSGAPPASTAQAPCGQAAYGQFG 87
Cdd:PRK07764  652 HHPKHVAVPDASDGGDGWPAKAGGAAPAAPP-PAPAPAAPAAPAGAAPAQP---APAPAATPPAGQADDPAAQPPQAAQG 727
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767964335   88 QGDVQNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQ 139
Cdd:PRK07764  728 ASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
5-304 1.70e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 1.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGM---------SRAPPSSGAPPASTAQ 75
Cdd:PHA03307  119 PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTP 198
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   76 ------------------APCGQAAYGQFGQGDVQNGPSSTVQMQR------------LPGSQPFGSPLAPVGNQPPVLQ 125
Cdd:PHA03307  199 paaasprpprrsspisasASSPAPAPGRSAADDAGASSSDSSSSESsgcgwgpenecpLPRPAPITLPTRIWEASGWNGP 278
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  126 PYGPPPTSAQVATQLSgmqiSGAVAPAPPSSGLGFGPPTSLASASGSfPNSGLYGSYPQGQAPplSQAQGHPGiQTPQRS 205
Cdd:PHA03307  279 SSRPGPASSSSSPRER----SPSPSPSSPGSGPAPSSPRASSSSSSS-RESSSSSTSSSSESS--RGAAVSPG-PSPSRS 350
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  206 APSQASSftPPASGGPrlPSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPlGPLPPMHSPQQPGYQPQQNGS 285
Cdd:PHA03307  351 PSPSRPP--PPADPSS--PRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT-GRFPAGRPRPSPLDAGAASGA 425
                         330       340
                  ....*....|....*....|
gi 767964335  286 FgPARGPQ-SNYGGPYPAAP 304
Cdd:PHA03307  426 F-YARYPLlTPSGEPWPGSP 444
PHA03247 PHA03247
large tegument protein UL36; Provisional
40-304 1.92e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 1.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   40 PYGAYNGPVPGYQQTPPqgmSRAPPSSGAP-PASTAQAPCGQAAYGQFGQ---------GDVQNGPSSTvqmqrLPGSQP 109
Cdd:PHA03247 2489 PFAAGAAPDPGGGGPPD---PDAPPAPSRLaPAILPDEPVGEPVHPRMLTwirgleelaSDDAGDPPPP-----LPPAAP 2560
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  110 FGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSL----------ASASGSFPNSGLY 179
Cdd:PHA03247 2561 PAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLppdthapdppPPSPSPAANEPDP 2640
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  180 GSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPAS-GGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQA----- 253
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPpQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALvsatp 2720
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767964335  254 LPPGTQMTGPLGPLPPMH-SPQQPGYQPQQNGSFGPARGPQSNYGGPYPAAP 304
Cdd:PHA03247 2721 LPPGPAAARQASPALPAApAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP 2772
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
29-266 2.43e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 2.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   29 GGQSGSTAPAIPY-GAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPGS 107
Cdd:PRK07003  370 GGVPARVAGAVPApGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPV 449
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  108 QPFGSPLAPVGNQPPVLQPyGPPPTSAQVATQLSGMQISGAVAPAPPSSGLgfGPPTSLASASGSFPNSGLYGSYPQGQA 187
Cdd:PRK07003  450 PAKANARASADSRCDERDA-QPPADSGSASAPASDAPPDAAFEPAPRAAAP--SAATPAAVPDARAPAAASREDAPAAAA 526
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  188 PPLSQA-QGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQSfgGPSVSQPnhVSSPPQALPPGTQMTGPLGP 266
Cdd:PRK07003  527 PPAPEArPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAA--KPAAAPA--AAPKPAAPRVAVQVPTPRAR 602
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
107-330 2.71e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 2.71e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  107 SQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPQGQ 186
Cdd:PRK07764  601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAA 680
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  187 APPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRlpSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGP 266
Cdd:PRK07764  681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD--PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQP 758
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767964335  267 LPPMHSPQqpgyqpqqngsfGPARGPQSNYGGPYPAAPTFgsqpgppqplppkrldPDAIPSPQ 330
Cdd:PRK07764  759 PPPPAPAP------------AAAPAAAPPPSPPSEEEEMA----------------EDDAPSMD 794
PHA03378 PHA03378
EBNA-3B; Provisional
7-352 3.09e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.14  E-value: 3.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    7 VPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQ---------TPPQGMSRAPPSSGAP-----PAS 72
Cdd:PHA03378  525 LPPSPPQPRAGRRAPCVYTEDLDIESDEPASTEPVHDQLLPAPGLGPlqiqpltspTTSQLASSAPSYAQTPwpvphPSQ 604
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   73 TAQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPGSQ-PFGSPLAPVGNQPPVLQ--------------PYGPPPTSAQVA 137
Cdd:PHA03378  605 TPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPiTFNVLVFPTPHQPPQVEitpykptwtqighiPYQPSPTGANTM 684
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  138 TQLSG----MQI-SGAVAPAPPSSGlgfgPPTSL----ASASGSFPNSGLYGSY--PQGQAPPLSQAQGHPGIQTPQRSA 206
Cdd:PHA03378  685 LPIQWapgtMQPpPRAPTPMRPPAA----PPGRAqrpaAATGRARPPAAAPGRArpPAAAPGRARPPAAAPGRARPPAAA 760
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  207 PSQASS-FTPPASGGPRLPSMTGPLLPGQSFGGPSVSQPnhvsspPQALPPGTQMTGPLGPL---PPMHSPQQPGYQPQQ 282
Cdd:PHA03378  761 PGRARPpAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPP------PQAGPTSMQLMPRAAPGqqgPTKQILRQLLTGGVK 834
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  283 NG--------------SFGPARGPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQLSELPPQQKTRHR----I 344
Cdd:PHA03378  835 RGrpslkkpaalerqaAAGPTPSPGSGTSDKIVQAPVFYPPVLQPIQVMRQLGSVRAAAASTVTQAPTEYTGERRgvgpM 914

                  ....*...
gi 767964335  345 DPDAIPSP 352
Cdd:PHA03378  915 HPTDIPPS 922
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
29-224 3.81e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 3.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   29 GGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPA--STAQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPG 106
Cdd:PRK07764  595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAeaSAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  107 S-QPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAvAPAPPSSGLGFGPPTSLASASGSFPNsglYGSYPQG 185
Cdd:PRK07764  675 GaAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDP-AAQPPQAAQGASAPSPAADDPVPLPP---EPDDPPD 750
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 767964335  186 QAPPLSQAQGHPGiqTPQRSAPSQASSFTPPASGGPRLP 224
Cdd:PRK07764  751 PAGAPAQPPPPPA--PAPAAAPAAAPPPSPPSEEEEMAE 787
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
5-222 6.23e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 6.23e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPFGQPQPiypgyhqssyggqSGSTAPAIPygayNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQAAYG 84
Cdd:PRK07764  602 APASSGPPEEAARP-------------AAPAAPAAP----AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   85 QFGQGDVQNGPSSTVQMqrlPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPT 164
Cdd:PRK07764  665 GGDGWPAKAGGAAPAAP---PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPL 741
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 767964335  165 SLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPR 222
Cdd:PRK07764  742 PPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRR 799
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
32-263 1.76e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.34  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    32 SGSTAPAIPYGAYNGPVPGYQQT-----PPQGMSRAPPSSGAPPASTAQAPCG---------QAAYGQFGQGDVQNGPSS 97
Cdd:pfam17823  180 SSTTAASSTTAASSAPTTAASSApatltPARGISTAATATGHPAAGTALAAVGnsspaagtvTAAVGTVTPAALATLAAA 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    98 TVQMQRLPGSQPFGSP----LAPVGNQPPVLQPYGPPPTS--------AQVATQLSGMQISGAVAPAPPSSGLGFGPPTS 165
Cdd:pfam17823  260 AGTVASAAGTINMGDPharrLSPAKHMPSDTMARNPAAPMgaqaqgpiIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKS 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   166 LASASGSFPNSglygSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSftppasggprlPSmtgPLLPGQSFGGPSVSQ-P 244
Cdd:pfam17823  340 VASTNLAVVTT----TKAQAKEPSASPVPVLHTSMIPEVEATSPTTQ-----------PS---PLLPTQGAAGPGILLaP 401
                          250
                   ....*....|....*....
gi 767964335   245 NHVSSPPQalpPGTQMTGP 263
Cdd:pfam17823  402 EQVATEAT---AGTASAGP 417
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
7-268 2.79e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 44.92  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    7 VPPVPP-FGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNgpvpgyQQTPPQGMSRAPPSSGAPPASTAQApcgqaaygq 85
Cdd:PLN03209  339 PKPVPTkPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYE------DLKPPTSPIPTPPSSSPASSKSVDA--------- 403
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   86 FGQGDVQNGPSSTVQMQRLPGSQPFGsplAPVGNQPPvLQPYG-----PPPTSAqvatqlsgmqisgavAPAPPSsglGF 160
Cdd:PLN03209  404 VAKPAEPDVVPSPGSASNVPEVEPAQ---VEAKKTRP-LSPYAryedlKPPTSP---------------SPTAPT---GV 461
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  161 GPPTSLASASGSFPNSGLYGSYPQGQAPPlsQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQSFGGPS 240
Cdd:PLN03209  462 SPSVSSTSSVPAVPDTAPATAATDAAAPP--PANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPT 539
                         250       260       270
                  ....*....|....*....|....*....|...
gi 767964335  241 --VSQPNHVSSPPQALPPGT---QMTGPLGPLP 268
Cdd:PLN03209  540 alADEQHHAQPKPRPLSPYTmyeDLKPPTSPTP 572
PHA03379 PHA03379
EBNA-3A; Provisional
9-350 4.12e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 44.66  E-value: 4.12e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    9 PVPPFGQPQPIYP-GYHQSSYGGQSGSTAPAIPYGAYNGPV-------PG-YQQTPPQGMSR--------APPSSGAPPA 71
Cdd:PHA03379  416 PRPPVEKPRPEVPqSLETATSHGSAQVPEPPPVHDLEPGPLhdqhsmaPCpVAQLPPGPLQDlepgdqlpGVVQDGRPAC 495
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   72 STAQAPCG------QAAYGQFGQgdVQNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQI 145
Cdd:PHA03379  496 APVPAPAGpivrpwEASLSQVPG--VAFAPVMPQPMPVEPVPVPTVALERPVCPAPPLIAMQGPGETSGIVRVRERWRPA 573
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  146 SGAVAPAPPSSGLGF--GP-----------------PTSLASASGSFPNSG-------LYGSYPQGQAPPLSQAQGHPGI 199
Cdd:PHA03379  574 PWTPNPPRSPSQMSVrdRLarlraeaqpyqasvevqPPQLTQVSPQQPMEYplepeqqMFPGSPFSQVADVMRAGGVPAM 653
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  200 QTPQRSAPSQassfTPPASGGPRLP--SMTGPLLP-----GQSFGGPSVSQPNHVSSPPQALPPgTQMTGPLGPLPPMHS 272
Cdd:PHA03379  654 QPQYFDLPLQ----QPISQGAPLAPlrASMGPVPPvpatqPQYFDIPLTEPINQGASAAHFLPQ-QPMEGPLVPERWMFQ 728
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  273 PQQPGYqpqqngSFGPARGPQSNYGGPY--------PAAPTFGSQPGPPQPLPPKRLDPDAIPSPQLSELPPQ----QKT 340
Cdd:PHA03379  729 GATLSQ------SVRPGVAQSQYFDLPLtqpinhgaPAAHFLHQPPMEGPWVPEQWMFQGAPPSQGTDVVQHQldalGYV 802
                         410
                  ....*....|
gi 767964335  341 RHRIDPDAIP 350
Cdd:PHA03379  803 LHVLNHPGVP 812
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
5-242 4.81e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate or, in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains of the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 44.15  E-value: 4.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPFGQPQPIYPGYHQSSYGG-------------QSGST-APAIPY--------GAYNGPVPGYQQTPPQGMSRA 62
Cdd:cd22540   159 QVLQQPQQAHKPVPIKPAPLQTSNTNsaslqvpgnviklQSGGNvALTLPVnnlvgtqdGATQLQLAAAPSKPSKKIRKK 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   63 PPSSGAPPASTAQAPcgQAAYGQFGQGD-VQNGPSSTVQMQrlpgsqpfgsplaPVGNQPPVLQPYG--PPPTSAQVAT- 138
Cdd:cd22540   239 SAQAAQPAVTVAEQV--ETVLIETTADNiIQAGNNLLIVQS-------------PGTGQPAVLQQVQvlQPKQEQQVVQi 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  139 ---QLSGMQISGAVAPAPPSSglgfgPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTP 215
Cdd:cd22540   304 pqqALRVVQAASATLPTVPQK-----PLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTA 378
                         250       260       270
                  ....*....|....*....|....*....|
gi 767964335  216 P-ASGGPRLPSMT--GPLLPGQSFGGPSVS 242
Cdd:cd22540   379 NnGTGTSKPNYNVrkERTLPKIAPAGGIIS 408
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
47-218 5.12e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.07  E-value: 5.12e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   47 PVPGYQQTPPQGMSRAPPssGAPPASTAQAPCGQAAYGQFGQGDVQNGPSSTVqmqrlpgSQPFGSPLAPVGNQPPVLQP 126
Cdd:PRK07003  360 PAVTGGGAPGGGVPARVA--GAVPAPGARAAAAVGASAVPAVTAVTGAAGAAL-------APKAAAAAAATRAEAPPAAP 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  127 ygPPPTSAQVATQLSGMQISGAVAPAPPSSglgfgPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSA 206
Cdd:PRK07003  431 --APPATADRGDDAADGDAPVPAKANARAS-----ADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAAT 503
                         170
                  ....*....|..
gi 767964335  207 PSQASSFTPPAS 218
Cdd:PRK07003  504 PAAVPDARAPAA 515
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
27-262 5.67e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 43.83  E-value: 5.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   27 SYGGQSGSTApaipYGAYNGPvpGYQQTPPQGMSRAPPSSGAPPASTAQApcgqAAYGQFGQGDVQNGPSSTvqmqrlPG 106
Cdd:cd21118   120 SWQGSGGHGA----YGSQGGP--GVQGHGIPGGTGGPWASGGNYGTNSLG----GSVGQGGNGGPLNYGTNS------QG 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  107 SQPFGSPLAPVGNQppvlQPYG---PPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLASASGS-------FPNS 176
Cdd:cd21118   184 AVAQPGYGTVRGNN----QNSGctnPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNggnngssSSNS 259
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  177 GLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSqASSFTPPASGGPRLPSMTGPllpgqsfgGPSVSQPNHVSSPPQALPP 256
Cdd:cd21118   260 GNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGS-SSSGGSGGSGGGNKPECNNP--------GNDVRMAGGGGSQGSKESS 330

                  ....*.
gi 767964335  257 GTQMTG 262
Cdd:cd21118   331 GSHGSN 336
PRK10263 PRK10263
DNA translocase FtsK; Provisional
5-250 5.90e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.31  E-value: 5.90e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    5 QSVPPVPPFGQPQPIY------PGYHQSSYGGQSGSTAPAIPYGAYNGPVPGyQQTPPQGMSRAPPSSGAPPASTAQAPC 78
Cdd:PRK10263  378 EGYPQQSQYAQPAVQYneplqqPVQPQQPYYAPAAEQPAQQPYYAPAPEQPA-QQPYYAPAPEQPVAGNAWQAEEQQSTF 456
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   79 GQAAYGQFGQGDVQNGPSSTVQMQRLPGSQPFGSPLAPVGNQppvLQPYGPP--------PTSAQVATQLSGMQisgAVA 150
Cdd:PRK10263  457 APQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEE---TKPARPPlyyfeeveEKRAREREQLAAWY---QPI 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  151 PAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPqgQAPPLSQAQGHPGIqtpqrSAPSQASSFTPPASGGPRLPSMTGPl 230
Cdd:PRK10263  531 PEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSP--LASGVKKATLATGA-----AATVAAPVFSLANSGGPRPQVKEGI- 602
                         250       260
                  ....*....|....*....|
gi 767964335  231 lpgqsfgGPSVSQPNHVSSP 250
Cdd:PRK10263  603 -------GPQLPRPKRIRVP 615
PHA03378 PHA03378
EBNA-3B; Provisional
59-355 6.77e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 6.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   59 MSRAPPSSGAPPASTAQAPCgqAAYGQFGQGDVQNGPSSTVQMQRLPGSQPfgsplAPVGNQPPVLQPYGPPPTSAQVAT 138
Cdd:PHA03378  521 MATLLPPSPPQPRAGRRAPC--VYTEDLDIESDEPASTEPVHDQLLPAPGL-----GPLQIQPLTSPTTSQLASSAPSYA 593
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  139 QLSGMQISGAVAPAPPSSGLGfgPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPAS 218
Cdd:PHA03378  594 QTPWPVPHPSQTPEPPTTQSH--IPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGH 671
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  219 GgPRLPSMTGP--LLPGQSfgGPSVSQPNHVS---SPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSFGPARGPQ 293
Cdd:PHA03378  672 I-PYQPSPTGAntMLPIQW--APGTMQPPPRAptpMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPA 748
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767964335  294 SN-------YGGPYPAAPTFGSQPGPpQPLPPKRLDPDAIPSPQLSELPPQQktrhridPDAIPSPIQV 355
Cdd:PHA03378  749 AApgrarppAAAPGRARPPAAAPGAP-TPQPPPQAPPAPQQRPRGAPTPQPP-------PQAGPTSMQL 809
hnRNP-R-Q TIGR01648
heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the ...
12-193 7.34e-04

heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q, and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.


Pssm-ID: 273732 [Multi-domain]  Cd Length: 578  Bit Score: 43.45  E-value: 7.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    12 PFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYngpVPGYQQTPPQGMSRAPPSSGAPPASTAQapcgqaaYGQFGQGdv 91
Cdd:TIGR01648  383 GRGYPPYGYEAYYGDYYGYHDYRGKYEDKYYGY---DPGMELTPMNPVRGKPGGRGGRPAIPPP-------RGRKNGA-- 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    92 qnGPSSTVQMQRLPGSQPFGSPLApvGNQPPVLQPYGPPPTSAQVatqlsgmqiSGAVAPAPPSSGLGFGPPTSlaSASG 171
Cdd:TIGR01648  451 --PPPAIGQDGRQLFLYKITIPAG--YSQRPAPHPLGPPRGSAFV---------RGARGGPAQYQQRGRGSRTS--RGNG 515
                          170       180
                   ....*....|....*....|..
gi 767964335   172 SFPNSGLYGSYPQGQAPPLSQA 193
Cdd:TIGR01648  516 RGGTAGGKRKAFDGYAQPDATA 537
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
14-298 9.20e-04

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.
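
The three repeated motifs named above (PGQGQQ, GYYPTS[P/L]QQ, GQQ) can be counted in a sequence with ordinary regular expressions. The following is a minimal sketch, not part of the CDD record: the function names `clean_sequence` and `count_motifs` are hypothetical, the bracket notation [P/L] is assumed to mean "P or L" at that position, and matches are counted without overlap within each motif, so a GQQ that lies inside a longer motif is still counted under GQQ.

```python
import re

# Motifs quoted in the family description; [P/L] is assumed to mean P or L.
MOTIFS = {
    "PGQGQQ": re.compile(r"PGQGQQ"),
    "GYYPTS[P/L]QQ": re.compile(r"GYYPTS[PL]QQ"),
    "GQQ": re.compile(r"GQQ"),
}


def clean_sequence(row: str) -> str:
    """Drop alignment gaps ('-') and whitespace, and uppercase the residues.

    Alignment rows on this page print unaligned residues in lowercase,
    so uppercasing puts all residues on the same footing before matching.
    """
    return "".join(ch for ch in row if ch.isalpha()).upper()


def count_motifs(row: str) -> dict:
    """Return the number of non-overlapping matches for each motif."""
    seq = clean_sequence(row)
    return {name: len(rx.findall(seq)) for name, rx in MOTIFS.items()}


if __name__ == "__main__":
    # A short synthetic string built from one copy of each motif.
    print(count_motifs("PGQGQQGYYPTSLQQGQQ"))
```

Counts obtained this way only indicate how repeat-rich a region is. They say nothing about homology, which is worth keeping in mind for a low-complexity, proline- and glutamine-rich query region like the one aligned here.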


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.40  E-value: 9.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    14 GQPQP-IYPGYHQSSYGGQSGSTaPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQA--PCGQAAYGQFGQGD 90
Cdd:pfam03157  276 GQGQQgYYPTSLQQPGQGQSGYY-PTSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPGQGQQGqqPAQGQQPGQGQPGY 354
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    91 VQNGPSSTVQMQrlPGSQPfGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAP-----APPSSGLG---FGP 162
Cdd:pfam03157  355 YPTSPQQPGQGQ--PGYYP-TSQQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQPgyyptSPQQSGQGqpgYYP 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   163 PTSLASASGSFPNSGLY---GSYPQGQAPPLSQAQGHPGiQTPQRSAPSQASSFTPPASggprlPSMTGPLLPGQSFGGP 239
Cdd:pfam03157  432 TSPQQSGQGQQPGQGQQpgqEQPGQGQQPGQGQQGQQPG-QPEQGQQPGQGQPGYYPTS-----PQQSGQGQQLGQWQQQ 505
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767964335   240 SVSQPNHVSSPPQALPPGTQMTGPLGPLPPmhspqqpGYQPQQNGSFGPARGPQSNYGG 298
Cdd:pfam03157  506 GQGQPGYYPTSPLQPGQGQPGYYPTSPQQP-------GQGQQLGQLQQPTQGQQGQQSG 557
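The pfam03157 description above names three repeat motifs for the central elastomeric domain: PGQGQQ, GYYPTS[P/L]QQ and GQQ. As a rough illustration only (not part of the CD-Search output), the sketch below counts non-overlapping occurrences of each motif in a sequence string copied from an alignment row. The function name count_motifs and the motif labels are hypothetical, and the sketch assumes that "[P/L]" means either P or L at that position.

```python
import re

# Repeat motifs quoted in the pfam03157 description.
# Assumption: "[P/L]" in the description means P or L at that position.
MOTIFS = {
    "hexapeptide": "PGQGQQ",
    "nonapeptide": "GYYPTS[PL]QQ",
    "tripeptide": "GQQ",
}


def count_motifs(seq):
    """Count non-overlapping matches of each motif in an alignment row.

    Gap characters ('-') are removed and lower-case (unaligned) residues
    are upper-cased before matching. Each motif is counted independently,
    so a GQQ inside PGQGQQ is also counted as a tripeptide.
    """
    clean = seq.replace("-", "").upper()
    return {name: len(re.findall(pattern, clean)) for name, pattern in MOTIFS.items()}
```

A low motif count in the query row next to a high count in the Cdd row would be one informal hint that a weak hit reflects shared amino acid composition rather than shared repeat structure.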
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
4-352 1.00e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 1.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    4 NQSVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPygayngPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQAAY 83
Cdd:PRK07764  429 PQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ------PAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAA 502
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   84 GQFGQGDVQ------------NGPSSTVQMQRLPGSQPfgsplapVGNQPPVLQPYGPPPTSAQ--------------VA 137
Cdd:PRK07764  503 PAGADDAATlrerwpeilaavPKRSRKTWAILLPEATV-------LGVRGDTLVLGFSTGGLARrfaspgnaevlvtaLA 575
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  138 TQLSG-MQISGAVAPAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPP 216
Cdd:PRK07764  576 EELGGdWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPK 655
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  217 ASGGPRLPSMTGP---LLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSFGPaRGPQ 293
Cdd:PRK07764  656 HVAVPDASDGGDGwpaKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA-PSPA 734
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 767964335  294 SNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPQLSELPPQQKTRHRIDPDAIPSP 352
Cdd:PRK07764  735 ADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSM 793
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
8-269 1.20e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    8 PPVPPFGQPQPIYPGYHQSSYGGQSGStaPAIPYGAYNGPVPGyqqtppqGMSRAPPSSGAPPASTAQAP-CGQAAYGQF 86
Cdd:PHA03307  185 APSSPPAEPPPSTPPAAASPRPPRRSS--PISASASSPAPAPG-------RSAADDAGASSSDSSSSESSgCGWGPENEC 255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   87 GQGDVQNGPSSTVQMQRLPG---------SQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPA---PP 154
Cdd:PHA03307  256 PLPRPAPITLPTRIWEASGWngpssrpgpASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSsssES 335
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  155 SSGLGFGPPTSLASASGSFPNSGlygsyPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQ 234
Cdd:PHA03307  336 SRGAAVSPGPSPSRSPSPSRPPP-----PADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAG 410
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 767964335  235 SFGGPSVSQPNHVSSPPQALPPGTQMTGPL---GPLPP 269
Cdd:PHA03307  411 RPRPSPLDAGAASGAFYARYPLLTPSGEPWpgsPPPPP 448
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
28-336 1.27e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 1.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   28 YGGQSGSTAPAIPYGAYNGPVPgyqqTPPQGMSRAPPSSGAPPASTAQAPCGQAAygqfgqgdvqnGPSSTVQMQRLPGS 107
Cdd:PRK07764  387 VAGGAGAPAAAAPSAAAAAPAA----APAPAAAAPAAAAAPAPAAAPQPAPAPAP-----------APAPPSPAGNAPAG 451
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  108 QPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPT------------------SLASA 169
Cdd:PRK07764  452 GAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrSRKTW 531
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  170 SGSFPNSGLYGsyPQGQA-------PPLSQA---QGHPGI-------QTPQRSAPSQASSFTPPASGGPRLPSMTGPLLP 232
Cdd:PRK07764  532 AILLPEATVLG--VRGDTlvlgfstGGLARRfasPGNAEVlvtalaeELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPP 609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  233 GQSFGGPSVSQPnhvSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSFGPARGPQSNYGGPYPAAPTFGSQPGP 312
Cdd:PRK07764  610 EEAARPAAPAAP---AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPA 686
                         330       340
                  ....*....|....*....|....
gi 767964335  313 PQPLPPKRLDPDAIPSPQLSELPP 336
Cdd:PRK07764  687 PAAPAAPAGAAPAQPAPAPAATPP 710
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
87-418 1.30e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.71  E-value: 1.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   87 GQGDVQNGPSSTVQMQRLPGSQPFGSPLAPVGNQPPvlqpygppptsAQVATQLSGMQISGAVAPAPPSSGlgfGPPTSL 166
Cdd:COG5164     3 LYGPGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRP-----------AGNTGGTRPAQNQGSTTPAGNTGG---TRPAGN 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  167 ASASGSFPNSGlygsypqGQAPPlsqaqGHPGIQTPqrsaPSQASSFTPPASGGPrlpsmTGPLLPGQSFGGP----SVS 242
Cdd:COG5164    69 QGATGPAQNQG-------GTTPA-----QNQGGTRP----AGNTGGTTPAGDGGA-----TGPPDDGGATGPPddggSTT 127
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  243 QPNHVSSPPQ----ALPPGTQMTGPLGPLPPmHSPQQPGYQPQQNGSFGPAR-------------GPQSNYGGPYPAAPT 305
Cdd:COG5164   128 PPSGGSTTPPgdggSTPPGPGSTGPGGSTTP-PGDGGSTTPPGPGGSTTPPDdggsttppnkgetGTDIPTGGTPRQGPD 206
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  306 FGSQPGPPQPLPPKRLDPDAI--PSPQLSELPPQQKTRhRIDPDAIPSPIQVIeddrNNRGTEPFVTGVR-GQVPPLVTT 382
Cdd:COG5164   207 GPVKKDDKNGKGNPPDDRGGKtgPKDQRPKTNPIERRG-PERPEAAALPAELT----ALEAENRAANPEPaTKTIPETTT 281
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 767964335  383 nflVKDQGNASPRYIRCTSYNIPCTSDMAKQAQVPL 418
Cdd:COG5164   282 ---VKDLATVLGKKGSDLVTNLMKKGKGTNINAALD 314
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
146-255 1.47e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 42.74  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  146 SGAVAPAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAqghPGIQTPQRSAPSqassfTPPASGGPRLPS 225
Cdd:PRK14959  382 SGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAA---PSPRVPWDDAPP-----APPRSGIPPRPA 453
                          90       100       110
                  ....*....|....*....|....*....|
gi 767964335  226 mtgPLLPGQSfggPSVSQPNHVSSPPQALP 255
Cdd:PRK14959  454 ---PRMPEAS---PVPGAPDSVASASDAPP 477
SP6_N cd22544
N-terminal domain of transcription factor Specificity Protein (SP) 6; Specificity Proteins ...
105-266 2.78e-03

N-terminal domain of transcription factor Specificity Protein (SP) 6; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP6, also known as epiprofin, shows a specific expression pattern in hair follicles and the apical ectodermal ridge (AER) of the developing limbs. SP6 null mice are nude and show defects in skin, teeth, limbs (syndactyly and oligodactyly), and lung alveoli. SP6 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains of the three groups are not homologous to one another. This model represents the N-terminal domain of SP6.


Pssm-ID: 411693 [Multi-domain]  Cd Length: 245  Bit Score: 40.67  E-value: 2.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  105 PGSQPFGSPlAPVGNQPpvLQPYGPPPTSAQVATQLSGMQISGAVAPAPP----SSGLGFGPPTSLASASGSFPNSGLyG 180
Cdd:cd22544    13 HSETPRASP-PTLDLQP--LQPYQIHSSPEAGDYPSPLQPTELQSLPLGPgvdfSARESYEPHSSRRTCLDLESDLPL-G 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  181 SYPQGQAPPLSQAQ--------GHPGIQTPQRSAPS-----QASSFT--PPASGGPRLPSMTGPLLPGQsfgGPSVSQPN 245
Cdd:cd22544    89 PFPKLLHPPPDMAHpyeswfrpPHPGGSGEEGGVPSwwdlhAGSSWMdlQHGQGGLQSPGPPGGLQPPL---GGYGSEHQ 165
                         170       180
                  ....*....|....*....|.
gi 767964335  246 HVSSPPQALPPGTQMTGPLGP 266
Cdd:cd22544   166 LCGPPHHLLPPAQHLMGQEGP 186
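The cd22544 description above writes the SP/KLF binding consensus as NNRCRCCYY, with N any nucleotide, R = A/G and Y = C/T. As an illustration only, the sketch below expands such a degenerate consensus into a regular expression. The names IUPAC, consensus_to_regex and SP_KLF_SITE are hypothetical, and the table covers only the codes used in that description. Exact matching is stricter than the "loose" binding the description mentions, so this is not a binding-site predictor.

```python
import re

# Degenerate nucleotide codes used in the cd22544 description
# (N = any nucleotide, R = A/G, Y = C/T), plus the four plain bases.
IUPAC = {
    "A": "A",
    "C": "C",
    "G": "G",
    "T": "T",
    "R": "[AG]",
    "Y": "[CT]",
    "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Expand a degenerate consensus string into a compiled regex."""
    return re.compile("".join(IUPAC[base] for base in consensus))


# Consensus motif quoted in the description.
SP_KLF_SITE = consensus_to_regex("NNRCRCCYY")
```

For example, SP_KLF_SITE.search(dna) returns a match object for the first exact occurrence of the consensus in an upper-case DNA string, or None if there is none.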
PPE COG5651
PPE-repeat protein [Function unknown];
129-308 2.78e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.42  E-value: 2.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  129 PPPTSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPS 208
Cdd:COG5651   166 PFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAA 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  209 QASSF-TPPASGGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQQNGSFG 287
Cdd:COG5651   246 AAAAAaGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAG 325
                         170       180
                  ....*....|....*....|.
gi 767964335  288 PARGPQSNYGGPYPAAPTFGS 308
Cdd:COG5651   326 AALGAGAAAAAAGAAAGAGAA 346
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
3-220 3.76e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 3.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    3 VNQSVPPV--PPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQ 80
Cdd:PRK12323  382 VAQPAPAAaaPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAA 461
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   81 ---AAYGQFGQGDVQNGPSSTVQMQRLPGSQPFGSPlaPVGNQPPVLQPYGPPPTsaqvatqlsgmqisgavAPAPPSSG 157
Cdd:PRK12323  462 arpAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP--PWEELPPEFASPAPAQP-----------------DAAPAGWV 522
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767964335  158 LGFGPPTSLASASGSFPNSGlygsyPQGQAPPLSQAQGHPGIQTPQRsAPSQASSFTPPASGG 220
Cdd:PRK12323  523 AESIPDPATADPDDAFETLA-----PAPAAAPAPRAAAATEPVVAPR-PPRASASGLPDMFDG 579
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
67-269 3.96e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 3.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   67 GAPPASTAQAPCGQAAYGQFGQGDVQNGPSStvqmqrlPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQIS 146
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAA-------PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  147 GAVAPAPPSSglgfgPPTSLASAsgsfpnsglygsypqgQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGP---RL 223
Cdd:PRK12323  444 PGGAPAPAPA-----PAAAPAAA----------------ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPpweEL 502
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 767964335  224 PSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPP 269
Cdd:PRK12323  503 PPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA 548
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
29-360 4.25e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   29 GGQSGSTAPAIPYGAYNGPVPGYQQTPPQGM--SRAPPSSGAPPASTAQAPCGQAAYGQFGQGDVQNGPSSTVQMQRLPG 106
Cdd:PRK07764  390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAaaPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  107 SQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQ----------------------ISGAVAPAP----------- 153
Cdd:PRK07764  470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADdaatlrerwpeilaavpkrsrkTWAILLPEAtvlgvrgdtlv 549
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  154 ---PSSGLG-----------------------------FGPPTSLASASGSfPNSGLYGSYPQGQAPplsQAQGHPGiQT 201
Cdd:PRK07764  550 lgfSTGGLArrfaspgnaevlvtalaeelggdwqveavVGPAPGAAGGEGP-PAPASSGPPEEAARP---AAPAAPA-AP 624
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  202 PQRSAPSQASSFTPPASGGPRLPSMTGPLLPGQSFGGPSvSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQPGYQPQ 281
Cdd:PRK07764  625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAS-DGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  282 QNGSFGPARGPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDA--IPSPQLSELPPQQKTRHRIDPDAIPSPIQVIEDD 359
Cdd:PRK07764  704 APAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDppDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783

                  .
gi 767964335  360 R 360
Cdd:PRK07764  784 E 784
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
52-433 4.32e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 41.21  E-value: 4.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    52 QQTPPQGMSRAPPSSGAPPASTAQAPcgqAAYGQFGQGDVQNGPSSTVQMQRLPGSQPFGSPLA-PVGNQPPVLQPYGPp 130
Cdd:pfam03546  129 QVRPASTVGKGPSGKGANPAPPGKAG---SAAPLVQVGKKEEDSESSSEESDSEGEAPPAATQAkPSGKILQVRPASGP- 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   131 ptsaqvatqlsgmqiSGAVAPAPPSSGlgfGPPTSLASASGSFPNSGLY--GSYPQGQAPP-LSQAQGHPGIQTPQRSA- 206
Cdd:pfam03546  205 ---------------AKGAAPAPPQKA---GPVATQVKAERSKEDSESSeeSSDSEEEAPAaATPAQAKPALKTPQTKAs 266
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   207 PSQASSFTP-PASGGPRLPSMTGPLLPGqsfggpSVSQPNHVSSPpqALPPGTQMtgplgplPPMHSPQQPGYQPQQNGS 285
Cdd:pfam03546  267 PRKGTPITPtSAKVPPVRVGTPAPWKAG------TVTSPACASSP--AVARGAQR-------PEEDSSSSEESESEEETA 331
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   286 FGPARGPQSNYGGPYPAAPTfgsqpgppqplppkrLDPDAIPSPQLSELPPQQKTRhridPDAIPSPIQVIEDDRNNR-- 363
Cdd:pfam03546  332 PAAAVGQAKSVGKGLQGKAA---------------SAPTKGPSGQGTAPVPPGKTG----PAVAQVKAEAQEDSESSEee 392
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767964335   364 -GTEPfVTGVRGQVPPLVTTNflvkdQGNASPRYIRCTSYNIPCTS-----DMAKQAQVPLAAVIKPLARLPPEEA 433
Cdd:pfam03546  393 sDSEE-AAATPAQVKASGKTP-----QAKANPAPTKASSAKGAASApgkvvAAAAQAKQGSPAKVKPPARTPQNSA 462
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
8-307 4.32e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 41.09  E-value: 4.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335     8 PPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAYNGPVPGYQQT---PPQGMSRAPPSSGAP---PASTAQAPCGQ- 80
Cdd:pfam03157  419 PQQSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQPGQGQQPGQGQQgqqPGQPEQGQQPGQGQPgyyPTSPQQSGQGQq 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335    81 -AAYGQFGQGDVQNGPSSTVQM-QRLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGL 158
Cdd:pfam03157  499 lGQWQQQGQGQPGYYPTSPLQPgQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQ 578
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   159 GFGPPtslasASGSFPNSGLYGSYP-------QGQAPPLSQ--AQGHPGIQTPQRSAPSQASSFTPPAS----GGPRLPS 225
Cdd:pfam03157  579 QGQQP-----GQGQQPGQGQPGYYPtspqqsgQGQQPGQWQqpGQGQPGYYPTSSLQLGQGQQGYYPTSpqqpGQGQQPG 653
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   226 MTGPLLPGQSFGGP-------SVSQPNHVSSPPQALPPGTQMTG--PLGPLPPmhspqqpGYQPQQNGSFGPARGPQsny 296
Cdd:pfam03157  654 QWQQSGQGQQGYYPtspqqsgQAQQPGQGQQPGQWLQPGQGQQGyyPTSPQQP-------GQGQQLGQGQQSGQGQQ--- 723
                          330
                   ....*....|.
gi 767964335   297 gGPYPAAPTFG 307
Cdd:pfam03157  724 -GYYPTSPGQG 733
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
103-239 4.82e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 4.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  103 RLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLgfGPPTSLASASGSFPNSGLYGSY 182
Cdd:PRK07764  380 RLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP--APAPAPPSPAGNAPAGGAPSPP 457
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767964335  183 PQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPASggPRLPSMTGPLLPGQSFGGP 239
Cdd:PRK07764  458 PAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA--PAAPAAPAAPAGADDAATL 512
PHA03379 PHA03379
EBNA-3A; Provisional
118-352 7.17e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 7.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  118 GNQPPVLQPYGPPPTSAQVATQLSGMQisgaVAPAPPSSGLGFGPptsLASASGSFPnsGLYGSYPQGQAPPLSQAQGHP 197
Cdd:PHA03379  415 TPRPPVEKPRPEVPQSLETATSHGSAQ----VPEPPPVHDLEPGP---LHDQHSMAP--CPVAQLPPGPLQDLEPGDQLP 485
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  198 GIQTPQRSAPSQAssftpPASGGPRL-PSMTGPL-LPGQSFGgPSVSQPNHVSSPPQalpPGTQMTGPLGPLPP---MHS 272
Cdd:PHA03379  486 GVVQDGRPACAPV-----PAPAGPIVrPWEASLSqVPGVAFA-PVMPQPMPVEPVPV---PTVALERPVCPAPPliaMQG 556
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  273 PQQPGYQPQQNGSFGPA----RGPQSNYGGPYPAAPTFGSQPGPPqplppkRLDPDAIPSPQLSELPPQQKTRHRIDPDA 348
Cdd:PHA03379  557 PGETSGIVRVRERWRPApwtpNPPRSPSQMSVRDRLARLRAEAQP------YQASVEVQPPQLTQVSPQQPMEYPLEPEQ 630

                  ....
gi 767964335  349 IPSP 352
Cdd:PHA03379  631 QMFP 634
PHA02682 PHA02682
ORF080 virion core protein; Provisional
132-268 7.29e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 39.84  E-value: 7.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335  132 TSAQVATQLSGMQISGAVAPAPPSSGLGFGPPTSLASAsGSFPNSGLYGSY-----PQGQAP-PLSQAQGHPGIQTPQRS 205
Cdd:PHA02682   21 TSSSLFTKCPQATIPAPAAPCPPDADVDPLDKYSVKEA-GRYYQSRLKANSacmqrPSGQSPlAPSPACAAPAPACPACA 99
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767964335  206 APSQASSFTPPASgGPRLPSMTGPLLPgqsfggPSVSQPNHvSSPPQALPPGTQMTGPLGPLP 268
Cdd:PHA02682  100 PAAPAPAVTCPAP-APACPPATAPTCP------PPAVCPAP-ARPAPACPPSTRQCPPAPPLP 154
DUF4645 pfam15488
Domain of unknown function (DUF4645); This family of proteins is found in eukaryotes. Proteins ...
116-305 8.04e-03

Domain of unknown function (DUF4645); This family of proteins is found in eukaryotes. Proteins in this family are typically between 200 and 298 amino acids in length.


Pssm-ID: 406050 [Multi-domain]  Cd Length: 294  Bit Score: 39.46  E-value: 8.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   116 PVGNQPPVLQPYGPPPTSAQ--VATQ-------LSG--------MQISGAVAPAPPSSGLGfGPPTSLASASGSFPNSGL 178
Cdd:pfam15488   82 PVDSSRALRHPYGPPPAVAEesLATAevnssegLAGwrqkgqdsINVSQEFSGSPPALMVG-GTRVSNGGTERGGNNAKL 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767964335   179 YGSYPQGQA---PPLSQAQGHPGIQTPQRSAPSQASSFTPPASGGPRLPSMTGPLlpgqsfGGPSvsqpNHVSSPPQALP 255
Cdd:pfam15488  161 YSALPRGQGffpPRGPQVRGPPHIPTLRSGIMMEVPPGNTRMAGKERLAHVSFPL------GGPR----HPMDNWPRPIP 230
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 767964335   256 PGTQMTGpLGPLPPMHspqqpgyqpqqngSFGPARGPQSNyggPYPAAPT 305
Cdd:pfam15488  231 LSSSTPG-LPSCSTAH-------------CFIPPRPPSFN---PFLAMPI 263
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
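
Every hit on this page has a header line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters above list an E-value threshold of 0.01. As a minimal sketch (not an NCBI tool or API), the code below parses such a header line from a text copy of the page and checks it against that threshold. The names HEADER, parse_hit_header and passes_threshold are hypothetical, and the field layout is assumed from the header lines shown on this page.

```python
import re

# Layout assumed from the hit header lines on this page, e.g.
# "Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.40  E-value: 9.20e-04"
HEADER = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+\[(?P<kind>[^\]]+)\]\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    m = HEADER.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (0.01 on this page)."""
    return hit["e_value"] <= threshold
```

Lowering the threshold argument, for instance to 1e-3, is one simple way to set aside the weakest hits, which on this page mostly fall in the low-complexity N-terminal region of the query.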
