NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720408468|ref|XP_030109331|]
View 

eukaryotic translation initiation factor 4 gamma 3 isoform X4 [Mus musculus]

Protein Classification

eukaryotic translation initiation factor 4 gamma 3( domain architecture ID 10501431)

eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) is component of the protein complex eIF4F, which is involved in the recognition of the mRNA cap, ATP-dependent unwinding of 5'-terminal secondary structure and recruitment of mRNA to the ribosome

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
916-1144 7.56e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.56e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  916 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 995
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  996 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1075
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408468 1076 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1144
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1584-1716 4.70e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.93  E-value: 4.70e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1584 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1663
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408468 1664 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1716
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1382-1494 2.39e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.86  E-value: 2.39e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1382 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1461
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408468 1462 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1494
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
57-475 5.80e-11

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.04  E-value: 5.80e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  137 YPSGQNAGPATlvyPQAPQTMNSQPQARSPFAAGPRPAhhqffqrPQIQPPRAAIPNSSPSIRPGVQTPTAVYQANQHim 216
Cdd:PHA03247  2740 APPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAA-------PAAGPPRRLTRPAVASLSESRESLPSPWDPADP-- 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  217 mvnhlpmPYPVTqGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQ 296
Cdd:PHA03247  2808 -------PAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  297 QQPPPAKREKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQLPSQvPEHSPVVYGTVESAhlAASTPV 376
Cdd:PHA03247  2878 PARPPVRRLARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPP--LAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  377 TAASDQKQEEKPKPDpvfqspstvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQ 456
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW---------LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETD 3018
                          410
                   ....*....|....*....
gi 1720408468  457 PLFTAEDKCELPSSKEEDA 475
Cdd:PHA03247  3019 PPPVSLKQTLWPPDDTEDS 3037
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1695-1741 8.85e-05

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.85e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408468 1695 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1741
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
916-1144 7.56e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.56e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  916 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 995
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  996 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1075
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408468 1076 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1144
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
917-1141 3.99e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 179.48  E-value: 3.99e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   917 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 996
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   997 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 1076
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408468  1077 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1141
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1584-1716 4.70e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.93  E-value: 4.70e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1584 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1663
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408468 1664 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1716
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1382-1494 2.39e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.86  E-value: 2.39e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1382 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1461
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408468 1462 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1494
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1382-1494 1.07e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 123.12  E-value: 1.07e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  1382 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1461
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1720408468  1462 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1494
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1654-1738 1.23e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 110.46  E-value: 1.23e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  1654 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1733
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1720408468  1734 WLREA 1738
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1667-1743 2.87e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 97.60  E-value: 2.87e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408468 1667 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1743
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-475 5.80e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.04  E-value: 5.80e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  137 YPSGQNAGPATlvyPQAPQTMNSQPQARSPFAAGPRPAhhqffqrPQIQPPRAAIPNSSPSIRPGVQTPTAVYQANQHim 216
Cdd:PHA03247  2740 APPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAA-------PAAGPPRRLTRPAVASLSESRESLPSPWDPADP-- 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  217 mvnhlpmPYPVTqGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQ 296
Cdd:PHA03247  2808 -------PAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  297 QQPPPAKREKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQLPSQvPEHSPVVYGTVESAhlAASTPV 376
Cdd:PHA03247  2878 PARPPVRRLARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPP--LAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  377 TAASDQKQEEKPKPDpvfqspstvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQ 456
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW---------LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETD 3018
                          410
                   ....*....|....*....
gi 1720408468  457 PLFTAEDKCELPSSKEEDA 475
Cdd:PHA03247  3019 PPPVSLKQTLWPPDDTEDS 3037
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
99-517 1.90e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 56.31  E-value: 1.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   99 PGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAGPATLVypqapqtmNSQPQARSPFAAGPRPAHHQF 178
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLI--------QQTPTLHPQRLPSPHPPLQPM 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  179 fqrPQIQPPRAAIPNSSPSirPGVQTPTAvyqanqhimmvnhlPMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPG 258
Cdd:pfam03154  253 ---TQPPPPSQVSPQPLPQ--PSLHGQMP--------------PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQV 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  259 PGPFYPGPGPGDFANAYgTPfyPSQPVYQSAPiivPTQQQP-PPAKRekktirirdpnqggkditeeimsgggsrnPTPP 337
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIH-TP--PSQSQLQSQQ---PPREQPlPPAPL-----------------------------SMPH 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  338 IGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMP 417
Cdd:pfam03154  355 IKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  418 ETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdns 496
Cdd:pfam03154  420 QSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI---- 492
                          410       420
                   ....*....|....*....|.
gi 1720408468  497 dicKKPCSVAPHDSQLISSTI 517
Cdd:pfam03154  493 ---QPPSSASVSSSGPVPAAV 510
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1695-1741 8.85e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.85e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408468 1695 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1741
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
916-1144 7.56e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.56e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  916 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 995
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  996 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1075
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408468 1076 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1144
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
917-1141 3.99e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 179.48  E-value: 3.99e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   917 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 996
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   997 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 1076
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408468  1077 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1141
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1584-1716 4.70e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.93  E-value: 4.70e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1584 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1663
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408468 1664 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1716
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1382-1494 2.39e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.86  E-value: 2.39e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1382 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1461
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408468 1462 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1494
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1382-1494 1.07e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 123.12  E-value: 1.07e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  1382 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1461
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1720408468  1462 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1494
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1654-1738 1.23e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 110.46  E-value: 1.23e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  1654 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1733
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1720408468  1734 WLREA 1738
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1667-1743 2.87e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 97.60  E-value: 2.87e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408468 1667 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1743
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1584-1710 4.48e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.22  E-value: 4.48e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1584 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1660
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720408468 1661 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1710
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1624-1743 2.36e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.37  E-value: 2.36e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1624 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1699
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720408468 1700 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1743
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-475 5.80e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.04  E-value: 5.80e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   57 RALQTPAPQQIPRGPVqqpledrlFPPTVSavysTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTP 136
Cdd:PHA03247  2672 RAAQASSPPQRPRRRA--------ARPTVG----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  137 YPSGQNAGPATlvyPQAPQTMNSQPQARSPFAAGPRPAhhqffqrPQIQPPRAAIPNSSPSIRPGVQTPTAVYQANQHim 216
Cdd:PHA03247  2740 APPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAA-------PAAGPPRRLTRPAVASLSESRESLPSPWDPADP-- 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  217 mvnhlpmPYPVTqGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPVYQSAPiiVPTQ 296
Cdd:PHA03247  2808 -------PAAVL-APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAA 2877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  297 QQPPPAKREKKTIRIRDPNqggkdiTEEIMSGGGSRNPTPPIGRPASTPTPPQLPSQvPEHSPVVYGTVESAhlAASTPV 376
Cdd:PHA03247  2878 PARPPVRRLARPAVSRSTE------SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPP--LAPTTD 2948
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  377 TAASDQKQEEKPKPDpvfqspstvLRLVLSGEKKEQAGQMPETAAGEPTPEPPrTSSPTSLPPLARSSLPSPMSAALSSQ 456
Cdd:PHA03247  2949 PAGAGEPSGAVPQPW---------LGALVPGRVAVPRFRVPQPAPSREAPASS-TPPLTGHSLSRVSSWASSLALHEETD 3018
                          410
                   ....*....|....*....
gi 1720408468  457 PLFTAEDKCELPSSKEEDA 475
Cdd:PHA03247  3019 PPPVSLKQTLWPPDDTEDS 3037
PHA03247 PHA03247
large tegument protein UL36; Provisional
62-497 1.76e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.42  E-value: 1.76e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   62 PAPQQIPRGPVQQPledrlFPPTVSAVYSTVTQVARQPGPPTPAPySAHEISKGLPSLAATPPghasSPGLSQTPYPsGQ 141
Cdd:PHA03247  2592 PPQSARPRAPVDDR-----GDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEPDPHPPPTVPP----PERPRDDPAP-GR 2660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  142 NAGPATLVYPQAPQTMNSQPQARSPFAAGPRPAHHQFFQRPqiqPPRAAIPNSSP-SIRPGVQTPTAVYQANQHIMMVNH 220
Cdd:PHA03247  2661 VSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP---PPPPPTPEPAPhALVSATPLPPGPAAARQASPALPA 2737
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  221 LPMPYPVTQGHQYCIPQYRHSGPPyvgppqqypvqppgpgpfypgpgpgdfanaygTPFYPSQPVYQSAPIIVPTQQQPP 300
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPP--------------------------------TTAGPPAPAPPAAPAAGPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  301 PAKREKKTIR-----IRDPNQGGKDITEEIMSGGGSRNPTPPIGRP-ASTPTPPQLPSQVPEHSPVVYGTV--------- 365
Cdd:PHA03247  2786 PAVASLSESReslpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPtSAQPTAPPPPPGPPPPSLPLGGSVapggdvrrr 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  366 -----ESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPL 440
Cdd:PHA03247  2866 ppsrsPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720408468  441 ARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPS-PTSctAASGPSLTDNSD 497
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAP--ASSTPPLTGHSL 3001
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1596-1743 5.78e-09

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 56.86  E-value: 5.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1596 EEKADD--ERIFDWVEANLDESQMS-------SPTFLRALMTAVCkaAIIADCsTFRVDTA-VIKQRVPILLKYLDSDte 1665
Cdd:cd11561      1 EEEEDErvDELGEFLKKNKDESGLSelkeilkEAERLDVVKDKAV--LVLAEV-LFDENIVkEIKKRKALLLKLVTDE-- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468 1666 kelQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKWeSSKDPAEQAGKGVA---LKSVTAFFTWLRE 1737
Cdd:cd11561     76 ---KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKW-YEKVSKKYVSKEKSkkvRKAAEPFVEWLEE 151

                   ....*.
gi 1720408468 1738 AEEESE 1743
Cdd:cd11561    152 AEEEEE 157
PHA03378 PHA03378
EBNA-3B; Provisional
60-487 1.62e-08

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 59.70  E-value: 1.62e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   60 QTPAPQQIPRGPVQQPLEDRLFPPTVsavystvtQVARQPGPPTPAPYSAHEISKGLPSLAATPPGhaSSPGLSQTPYPS 139
Cdd:PHA03378   446 HSQAPTVVLHRPPTQPLEGPTGPLSV--------QAPLEPWQPLPHPQVTPVILHQPPAQGVQAHG--SMLDLLEKDDED 515
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  140 GQNAGPATLVYPQAPQTMNSQpqaRSPFA------------AGPRPAHHQFFQRP-----QIQPPRAAIPNSSPSIRPGv 202
Cdd:PHA03378   516 MEQRVMATLLPPSPPQPRAGR---RAPCVytedldiesdepASTEPVHDQLLPAPglgplQIQPLTSPTTSQLASSAPS- 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  203 qtptavyqanqhimmvnHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYG-TPFYP 281
Cdd:PHA03378   592 -----------------YAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFpTPHQP 654
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  282 SQ---PVYQSAPIIVP-TQQQPPPA------KREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPIGR---PASTPTPP 348
Cdd:PHA03378   655 PQveiTPYKPTWTQIGhIPYQPSPTgantmlPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRarpPAAAPGRA 734
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  349 QLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTvlrlvlsgekkeqAGQMPEtaaGEPTPEP 428
Cdd:PHA03378   735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA-------------PQQRPR---GAPTPQP 798
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408468  429 PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 487
Cdd:PHA03378   799 PPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
99-517 1.90e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 56.31  E-value: 1.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   99 PGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQNAGPATLVypqapqtmNSQPQARSPFAAGPRPAHHQF 178
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLI--------QQTPTLHPQRLPSPHPPLQPM 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  179 fqrPQIQPPRAAIPNSSPSirPGVQTPTAvyqanqhimmvnhlPMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPG 258
Cdd:pfam03154  253 ---TQPPPPSQVSPQPLPQ--PSLHGQMP--------------PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQV 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  259 PGPFYPGPGPGDFANAYgTPfyPSQPVYQSAPiivPTQQQP-PPAKRekktirirdpnqggkditeeimsgggsrnPTPP 337
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIH-TP--PSQSQLQSQQ---PPREQPlPPAPL-----------------------------SMPH 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  338 IGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMP 417
Cdd:pfam03154  355 IKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMP 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  418 ETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdns 496
Cdd:pfam03154  420 QSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI---- 492
                          410       420
                   ....*....|....*....|.
gi 1720408468  497 dicKKPCSVAPHDSQLISSTI 517
Cdd:pfam03154  493 ---QPPSSASVSSSGPVPAAV 510
PHA03247 PHA03247
large tegument protein UL36; Provisional
58-444 7.34e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 7.34e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   58 ALQTPAPQQIPRGPVQQPLEDRLFP--PTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHAS-SPGLSQ 134
Cdd:PHA03247  2718 ATPLPPGPAAARQASPALPAAPAPPavPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlSESRES 2797
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  135 TPYPSGQNAGPATLVYPQAPQTMNSQPQARSPFAAGPRPAHHQF---FQRPQIQPPRAAIPNSSPSIRPGVQTPTAVYQA 211
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPppgPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA 2877
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  212 NQHImMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGdfanAYGTPFYPSQPVYQSAPI 291
Cdd:PHA03247  2878 PARP-PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP----PPPRPQPPLAPTTDPAGA 2952
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  292 IVPTQQQPPPAKREKKTIRIRDPNqggkditeeimsgggSRNPTPPIGRPASTPTPPQlpsqvPEHSPVVYGTVESAHLA 371
Cdd:PHA03247  2953 GEPSGAVPQPWLGALVPGRVAVPR---------------FRVPQPAPSREAPASSTPP-----LTGHSLSRVSSWASSLA 3012
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720408468  372 ASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAgqmpetaagePTPEPPRTSSPTSLPPLARSS 444
Cdd:PHA03247  3013 LHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEA----------LDPLPPEPHDPFAHEPDPATP 3075
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
120-244 4.02e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 51.96  E-value: 4.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  120 AATPPGHASSPGLSQTPYPSGQNAGPATLVYPQAPQTMNSQPQArspfaaGPRPAHH-QFFQRPQIQPPRAAIPNSSPSI 198
Cdd:pfam09770  212 QQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQ------HPGQGHPvTILQRPQSPQPDPAQPSIQPQA 285
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720408468  199 RPGVQTPTAVYQANQHIM------MVNHLPMPYPVTQGHQYCIPQYRHSGPP 244
Cdd:pfam09770  286 QQFHQQPPPVPVQPTQILqnpnrlSAARVGYPQNPQPGVQPAPAHQAHRQQG 337
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-434 1.21e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.54  E-value: 1.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468    5 QKPALKSGSAAAAGTGPGTGAAAAAAVPPPHPAAAAAAAAVAAAAApphpniralQTPAPQQIPRGP---VQQPLEdrLF 81
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATS---------QPPNQTQSTAAPhtlIQQTPT--LH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   82 PPTVSAVYSTVTQvARQPGPPTPAPYSAHEiskglpslaaTPPGHASSPglsqtPYPSGQNAGPATLVYPQAPQTMNSQP 161
Cdd:pfam03154  239 PQRLPSPHPPLQP-MTQPPPPSQVSPQPLP----------QPSLHGQMP-----PMPHSLQTGPSHMQHPVPPQPFPLTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  162 Q-ARSPFAAGPRP-AHHQFFQRPQIQPPRAAIPNSSPsirPGVQTPTAVYQANQHImmvnhlpMPYPVTQGHQYCIPQyR 239
Cdd:pfam03154  303 QsSQSQVPPGPSPaAPGQSQQRIHTPPSQSQLQSQQP---PREQPLPPAPLSMPHI-------KPPPTTPIPQLPNPQ-S 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  240 HSGPPYVGPPQQypvqppgpgpfypgpgpgdFANAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGK 319
Cdd:pfam03154  372 HKHPPHLSGPSP-------------------FQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPP 432
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  320 DITE-EIMSGGGSRNPTPpigrPASTPTPPQLPsqVPEHsPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQS-- 396
Cdd:pfam03154  433 VLTQsQSLPPPAASHPPT----SGLHQVPSQSP--FPQH-PFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSgp 505
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1720408468  397 -PSTVLRLVLSGEKKEQAgqMPETAAGEPTPEPPRTSSP 434
Cdd:pfam03154  506 vPAAVSCPLPPVQIKEEA--LDEAEEPESPPPPPRSPSP 542
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
328-491 1.23e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 50.23  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  328 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 405
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  406 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 479
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1720408468  480 SPTSCTAASGPS 491
Cdd:PRK07003   527 PPAPEARPPTPA 538
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1695-1741 8.85e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.85e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408468 1695 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1741
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
339-494 4.81e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 4.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  339 GRPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 409
Cdd:PHA03307    19 EFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  410 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 488
Cdd:PHA03307    99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                   ....*.
gi 1720408468  489 GPSLTD 494
Cdd:PHA03307   179 PEETAR 184
PHA03378 PHA03378
EBNA-3B; Provisional
57-207 6.83e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 44.67  E-value: 6.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   57 RALQTPAPQQIPRGPVQQP--LEDRLFPPTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPglsq 134
Cdd:PHA03378   699 RAPTPMRPPAAPPGRAQRPaaATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAP---- 774
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720408468  135 TPYPSGQnAGPATLVYPQAPQTMNSQPQA-RSPFAAGPRPAHHQffQRPQIQPPRAAIPNSSPSIRPGVQTPTA 207
Cdd:PHA03378   775 TPQPPPQ-APPAPQQRPRGAPTPQPPPQAgPTSMQLMPRAAPGQ--QGPTKQILRQLLTGGVKRGRPSLKKPAA 845
PRK10263 PRK10263
DNA translocase FtsK; Provisional
187-481 7.24e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 7.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  187 PRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPPQQYPVQPPGP 259
Cdd:PRK10263   347 ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQPYYAPAAEQP 415
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  260 gpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsggGSRNPTPPIG 339
Cdd:PRK10263   416 ---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ-----STYQTEQTYQ 469
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  340 RPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTP------VTAASDQKQEE-----KPKPDPVfqspstvlrlvlsge 408
Cdd:PRK10263   470 QPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPlyyfeeVEEKRAREREQlaawyQPIPEPV--------------- 534
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408468  409 kKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEEDAPPVPSP 481
Cdd:PRK10263   535 -KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEGIGPQLPRP 609
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
101-484 7.54e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.18  E-value: 7.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  101 PPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPypsgqnAGPATlVYPQAPQTMNSQPQARSPfaaGPRPAHhqffq 180
Cdd:pfam17823  115 LAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAAC------RANAS-AAPRAAIAAASAPHAASP---APRTAA----- 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  181 rpqiQPPRAAIPNSSPSIRPGVQTPTAVYQAnqhimmvnhLPMPYPVTQGHQycipqyrhSGPPYVGPPQQYPVQPPGPG 260
Cdd:pfam17823  180 ----SSTTAASSTTAASSAPTTAASSAPATL---------TPARGISTAATA--------TGHPAAGTALAAVGNSSPAA 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  261 PFYPGPGPGDFANAYGTPFYPSQPVYQSAPII---VPTQQQPPPAKrekktirirdpnqggkditeEIMSGGGSRNPTPP 337
Cdd:pfam17823  239 GTVTAAVGTVTPAALATLAAAAGTVASAAGTInmgDPHARRLSPAK--------------------HMPSDTMARNPAAP 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  338 IGRPASTP------------TPPQlPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVfqsPSTvlrlvl 405
Cdd:pfam17823  299 MGAQAQGPiiqvstdqpvhnTAGE-PTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPV---LHT------ 368
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408468  406 sgekkeqaGQMPETAAGEPTPEPprtsSPtsLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSC 484
Cdd:pfam17823  369 --------SMIPEVEATSPTTQP----SP--LLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDPKTLAMASC 433
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
328-497 7.82e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 7.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  328 GGGSRNPTPPIGRPASTPTPPQLPSQV--PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 405
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAaaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  406 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 485
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170
                   ....*....|..
gi 1720408468  486 AASGPSLTDNSD 497
Cdd:PRK12323   525 SIPDPATADPDD 536
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
327-491 2.00e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  327 SGGGSRNPTPPIGRPASTPTPPQLPSQVPEHSPVVyGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLS 406
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAA-PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  407 GEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAP 476
Cdd:PRK07764   680 APPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                          170
                   ....*....|....*
gi 1720408468  477 PVPSPTSCTAASGPS 491
Cdd:PRK07764   760 PPPAPAPAAAPAAAP 774
PRK11901 PRK11901
hypothetical protein; Reviewed
281-407 2.64e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 41.98  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  281 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPP--------IGRPAS 343
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPatvapskgAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408468  344 TPTPPQLPSQVPEHSPVVygtveSAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 407
Cdd:PRK11901   193 AETHPTPPQKPATKKPAV-----NHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
62-174 4.25e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 4.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468   62 PAPQQIPRGPVQQPLEDRLFPPTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHASSPGLSQTPYPSGQ 141
Cdd:PRK14951   375 PAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQ 454
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408468  142 NAgPATLVYPQAPQTMNSQPQARSPFAAGPRPA 174
Cdd:PRK14951   455 AA-PETVAIPVRVAPEPAVASAAPAPAAAPAAA 486
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
340-507 4.97e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 41.31  E-value: 4.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  340 RPASTPTPPQLPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPET 419
Cdd:pfam13254  165 KPKAQPSQPAQPAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADT 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  420 AAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLTD 494
Cdd:pfam13254  244 LSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSSP 323
                          170
                   ....*....|....
gi 1720408468  495 NSD-ICKKPCSVAP 507
Cdd:pfam13254  324 DRDpLSPKPKPQSP 337
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
281-429 8.09e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.99  E-value: 8.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  281 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPigrPASTPTPPQLPSQVPEHSPV 360
Cdd:PRK07003   467 DAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPA---AAAPPAPEARPPTPAAAAPA 543
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  361 VYGTVESAHL-----------------AASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPEtAAGE 423
Cdd:PRK07003   544 ARAGGAAAALdvlrnagmrvssdrgarAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ-AAES 622

                   ....*.
gi 1720408468  424 PTPEPP 429
Cdd:PRK07003   623 RGAPPP 628
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
277-462 8.81e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.42  E-value: 8.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  277 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQL-PSQVP 355
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRgPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408468  356 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 431
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1720408468  432 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 462
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH