NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568931090|ref|XP_006538858|]
View 

eukaryotic translation initiation factor 4 gamma 3 isoform X27 [Mus musculus]

Protein Classification

eukaryotic translation initiation factor 4 gamma 3( domain architecture ID 10501431)

eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) is component of the protein complex eIF4F, which is involved in the recognition of the mRNA cap, ATP-dependent unwinding of 5'-terminal secondary structure and recruitment of mRNA to the ribosome

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
756-984 1.67e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.92  E-value: 1.67e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   756 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 835
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   836 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 915
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568931090   916 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 984
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1424-1556 1.67e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.67e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1424 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1503
Cdd:cd11559     4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 568931090 1504 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1556
Cdd:cd11559    82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1222-1334 8.51e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 8.51e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  1222 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1301
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 568931090  1302 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1334
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PRK07003 super family cl35530
DNA polymerase III subunit gamma/tau;
168-331 4.57e-06

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK07003:

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.57e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  168 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 245
Cdd:PRK07003  368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  246 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 319
Cdd:PRK07003  448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                         170
                  ....*....|..
gi 568931090  320 SPTSCTAASGPS 331
Cdd:PRK07003  527 PPAPEARPPTPA 538
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1535-1581 1.13e-04

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 568931090 1535 LYDEEVISEDAFYKWEssKDPAEQAGKGVALKSVTAFFTWLREAEEE 1581
Cdd:cd11560   150 LYKADVLSEDAILKWY--KKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03247 super family cl33720
large tegument protein UL36; Provisional
6-337 1.45e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    6 QARSPGGFRPIQFFQRPQIQPPRAAI--------PNSSPSIRP-----GVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQ 72
Cdd:PHA03247 2670 LGRAAQASSPPQRPRRRAARPTVGSLtsladpppPPPTPEPAPhalvsATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   73 YCIPQYRHSGPPyvgppqqypvqppgpgpfypgpgpgdfanaygTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIR-- 150
Cdd:PHA03247 2750 TPGGPARPARPP--------------------------------TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRes 2797
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  151 ---IRDPNQGGKDITEEIMSGGGSRNPTPPIGRP-ASTPTPPQLPSQVPEHSPVVYGTVESA------HLAASTPVTAAS 220
Cdd:PHA03247 2798 lpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPtSAQPTAPPPPPGPPPPSLPLGGSVAPGgdvrrrPPSRSPAAKPAA 2877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  221 DQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTPEPP-RTSSPTSLPPLARSSLPSPMSAALSSQPLF 299
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPpQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*...
gi 568931090  300 TAED---------KCELPSSK-EEDAPPVPSPtsctAASGPSLTDNSD 337
Cdd:PHA03247 2958 AVPQpwlgalvpgRVAVPRFRvPQPAPSREAP----ASSTPPLTGHSL 3001
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
756-984 1.67e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.92  E-value: 1.67e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   756 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 835
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   836 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 915
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568931090   916 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 984
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
757-981 7.72e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.32  E-value: 7.72e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    757 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 836
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    837 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 916
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568931090    917 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 981
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1424-1556 1.67e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.67e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1424 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1503
Cdd:cd11559     4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 568931090 1504 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1556
Cdd:cd11559    82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1222-1334 8.51e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 8.51e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  1222 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1301
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 568931090  1302 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1334
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1222-1334 2.99e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.97  E-value: 2.99e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   1222 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1301
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 568931090   1302 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1334
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1494-1578 2.41e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.69  E-value: 2.41e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   1494 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1573
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 568931090   1574 WLREA 1578
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1507-1583 9.60e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 9.60e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568931090  1507 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1583
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
168-331 4.57e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.57e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  168 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 245
Cdd:PRK07003  368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  246 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 319
Cdd:PRK07003  448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                         170
                  ....*....|..
gi 568931090  320 SPTSCTAASGPS 331
Cdd:PRK07003  527 PPAPEARPPTPA 538
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1535-1581 1.13e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 568931090 1535 LYDEEVISEDAFYKWEssKDPAEQAGKGVALKSVTAFFTWLREAEEE 1581
Cdd:cd11560   150 LYKADVLSEDAILKWY--KKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-337 1.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    6 QARSPGGFRPIQFFQRPQIQPPRAAI--------PNSSPSIRP-----GVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQ 72
Cdd:PHA03247 2670 LGRAAQASSPPQRPRRRAARPTVGSLtsladpppPPPTPEPAPhalvsATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   73 YCIPQYRHSGPPyvgppqqypvqppgpgpfypgpgpgdfanaygTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIR-- 150
Cdd:PHA03247 2750 TPGGPARPARPP--------------------------------TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRes 2797
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  151 ---IRDPNQGGKDITEEIMSGGGSRNPTPPIGRP-ASTPTPPQLPSQVPEHSPVVYGTVESA------HLAASTPVTAAS 220
Cdd:PHA03247 2798 lpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPtSAQPTAPPPPPGPPPPSLPLGGSVAPGgdvrrrPPSRSPAAKPAA 2877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  221 DQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTPEPP-RTSSPTSLPPLARSSLPSPMSAALSSQPLF 299
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPpQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*...
gi 568931090  300 TAED---------KCELPSSK-EEDAPPVPSPtsctAASGPSLTDNSD 337
Cdd:PHA03247 2958 AVPQpwlgalvpgRVAVPRFRvPQPAPSREAP----ASSTPPLTGHSL 3001
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3-357 1.83e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090     3 SQPQARSPGGFRPIQFFQR-PQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHS 81
Cdd:pfam03154  215 SQPPNQTQSTAAPHTLIQQtPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQT 284
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    82 GPPYVGPPQQYPVqppgpgpfypGPGPGDFANAYGTPFYPSQPVYQSA--PIIVPTQQQPPPAKREKKTIRIRDPnqggk 159
Cdd:pfam03154  285 GPSHMQHPVPPQP----------FPLTPQSSQSQVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLPPAP----- 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   160 diteeimsgggsrNPTPPIGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPST 239
Cdd:pfam03154  350 -------------LSMPHIKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSS 401
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   240 VLRLVLSGEKKEQAGQMPETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPV 318
Cdd:pfam03154  402 LSTHHPPSAHPPPLQLMPQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPS 479
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 568931090   319 PSPTScTAASGPSLtdnsdicKKPCSVAPHDSQLISSTI 357
Cdd:pfam03154  480 GPPTS-TSSAMPGI-------QPPSSASVSSSGPVPAAV 510
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
180-347 2.58e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 42.08  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   180 RPASTPTPPQLPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPET 259
Cdd:pfam13254  165 KPKAQPSQPAQPAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADT 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   260 AAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLTD 334
Cdd:pfam13254  244 LSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSSP 323
                          170
                   ....*....|....
gi 568931090   335 NSD-ICKKPCSVAP 347
Cdd:pfam13254  324 DRDpLSPKPKPQSP 337
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
756-984 1.67e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.92  E-value: 1.67e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   756 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 835
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   836 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 915
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568931090   916 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 984
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
757-981 7.72e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.32  E-value: 7.72e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    757 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 836
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    837 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 916
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568931090    917 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 981
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1424-1556 1.67e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.67e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1424 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1503
Cdd:cd11559     4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 568931090 1504 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1556
Cdd:cd11559    82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1222-1334 8.51e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 8.51e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  1222 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1301
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 568931090  1302 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1334
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1222-1334 2.99e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.97  E-value: 2.99e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   1222 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1301
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 568931090   1302 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1334
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1494-1578 2.41e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.69  E-value: 2.41e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   1494 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1573
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 568931090   1574 WLREA 1578
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1507-1583 9.60e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 9.60e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568931090  1507 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1583
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1424-1550 1.56e-18

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 83.29  E-value: 1.56e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1424 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1500
Cdd:cd11473     4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568931090 1501 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1550
Cdd:cd11473    84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1464-1583 6.15e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 74.22  E-value: 6.15e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1464 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1539
Cdd:cd11558    47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 568931090 1540 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1583
Cdd:cd11558   126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1437-1583 1.57e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 55.31  E-value: 1.57e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1437 EKADDERI---FDWVEANLDESQMS-------SPTFLRALMTAVCkaAIIADCSTFRVDTAVIKQRVPILLKYLDSDtek 1506
Cdd:cd11561     1 EEEEDERVdelGEFLKKNKDESGLSelkeilkEAERLDVVKDKAV--LVLAEVLFDENIVKEIKKRKALLLKLVTDE--- 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090 1507 elQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKW--ESSKDPAEQAGKGVALKSVTAFFTWLREAE 1579
Cdd:cd11561    76 --KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKWyeKVSKKYVSKEKSKKVRKAAEPFVEWLEEAE 153

                  ....
gi 568931090 1580 EESE 1583
Cdd:cd11561   154 EEEE 157
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
168-331 4.57e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.57e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  168 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 245
Cdd:PRK07003  368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  246 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 319
Cdd:PRK07003  448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                         170
                  ....*....|..
gi 568931090  320 SPTSCTAASGPS 331
Cdd:PRK07003  527 PPAPEARPPTPA 538
PHA03378 PHA03378
EBNA-3B; Provisional
42-327 4.53e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.14  E-value: 4.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   42 VQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYG-TPFY 120
Cdd:PHA03378  574 IQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFpTPHQ 653
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  121 PSQ---PVYQSAPIIVP-TQQQPPPA------KREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPIGR---PASTPTP 187
Cdd:PHA03378  654 PPQveiTPYKPTWTQIGhIPYQPSPTgantmlPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRarpPAAAPGR 733
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  188 PQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTvlrlvlsgekkeqAGQMPEtaaGEPTPE 267
Cdd:PHA03378  734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA-------------PQQRPR---GAPTPQ 797
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568931090  268 PPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 327
Cdd:PHA03378  798 PPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1535-1581 1.13e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 568931090 1535 LYDEEVISEDAFYKWEssKDPAEQAGKGVALKSVTAFFTWLREAEEE 1581
Cdd:cd11560   150 LYKADVLSEDAILKWY--KKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-337 1.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    6 QARSPGGFRPIQFFQRPQIQPPRAAI--------PNSSPSIRP-----GVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQ 72
Cdd:PHA03247 2670 LGRAAQASSPPQRPRRRAARPTVGSLtsladpppPPPTPEPAPhalvsATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   73 YCIPQYRHSGPPyvgppqqypvqppgpgpfypgpgpgdfanaygTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIR-- 150
Cdd:PHA03247 2750 TPGGPARPARPP--------------------------------TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRes 2797
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  151 ---IRDPNQGGKDITEEIMSGGGSRNPTPPIGRP-ASTPTPPQLPSQVPEHSPVVYGTVESA------HLAASTPVTAAS 220
Cdd:PHA03247 2798 lpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPtSAQPTAPPPPPGPPPPSLPLGGSVAPGgdvrrrPPSRSPAAKPAA 2877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  221 DQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTPEPP-RTSSPTSLPPLARSSLPSPMSAALSSQPLF 299
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPpQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*...
gi 568931090  300 TAED---------KCELPSSK-EEDAPPVPSPtsctAASGPSLTDNSD 337
Cdd:PHA03247 2958 AVPQpwlgalvpgRVAVPRFRvPQPAPSREAP----ASSTPPLTGHSL 3001
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
179-334 1.62e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.70  E-value: 1.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  179 GRPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 249
Cdd:PHA03307   19 EFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  250 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 328
Cdd:PHA03307   99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                  ....*.
gi 568931090  329 GPSLTD 334
Cdd:PHA03307  179 PEETAR 184
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3-357 1.83e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090     3 SQPQARSPGGFRPIQFFQR-PQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHS 81
Cdd:pfam03154  215 SQPPNQTQSTAAPHTLIQQtPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQT 284
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    82 GPPYVGPPQQYPVqppgpgpfypGPGPGDFANAYGTPFYPSQPVYQSA--PIIVPTQQQPPPAKREKKTIRIRDPnqggk 159
Cdd:pfam03154  285 GPSHMQHPVPPQP----------FPLTPQSSQSQVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLPPAP----- 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   160 diteeimsgggsrNPTPPIGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPST 239
Cdd:pfam03154  350 -------------LSMPHIKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSS 401
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   240 VLRLVLSGEKKEQAGQMPETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPV 318
Cdd:pfam03154  402 LSTHHPPSAHPPPLQLMPQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPS 479
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 568931090   319 PSPTScTAASGPSLtdnsdicKKPCSVAPHDSQLISSTI 357
Cdd:pfam03154  480 GPPTS-TSSAMPGI-------QPPSSASVSSSGPVPAAV 510
PRK10263 PRK10263
DNA translocase FtsK; Provisional
27-321 1.96e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 1.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   27 PRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPPQQYPVQPPGP 99
Cdd:PRK10263  347 ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQPYYAPAAEQP 415
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  100 gpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsggGSRNPTPPIG 179
Cdd:PRK10263  416 ---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ-----STYQTEQTYQ 469
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  180 RPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTP------VTAASDQKQEE-----KPKPDPVfqspstvlrlvlsge 248
Cdd:PRK10263  470 QPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPlyyfeeVEEKRAREREQlaawyQPIPEPV--------------- 534
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568931090  249 kKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEEDAPPVPSP 321
Cdd:PRK10263  535 -KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEGIGPQLPRP 609
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
168-337 4.08e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 4.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  168 GGGSRNPTPPIGRPASTPTPPQLPSQV--PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 245
Cdd:PRK12323  368 SGGGAGPATAAAAPVAQPAPAAAAPAAaaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  246 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 325
Cdd:PRK12323  445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                         170
                  ....*....|..
gi 568931090  326 AASGPSLTDNSD 337
Cdd:PRK12323  525 SIPDPATADPDD 536
PHA03378 PHA03378
EBNA-3B; Provisional
1-268 4.23e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.06  E-value: 4.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090    1 MNSQPQARSPGGFRPIQFFQRPQIQPPRAAIPNSSPSIRPGVQTPTAVYQAnqhimmvnhlPMPYPVTQGHQYCIPQyrh 80
Cdd:PHA03378  672 IPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATG----------RARPPAAAPGRARPPA--- 738
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   81 SGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQ--PVYQSAPIIVPTQQQPPPAKREKKTIRIRD-PNQG 157
Cdd:PHA03378  739 AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQapPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAaPGQQ 818
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  158 G--KDITEEIMSGGGSRnptppiGRPAS--------------TPTPPQLPSQVPEHSPVVY----------GTVESAHLA 211
Cdd:PHA03378  819 GptKQILRQLLTGGVKR------GRPSLkkpaalerqaaagpTPSPGSGTSDKIVQAPVFYppvlqpiqvmRQLGSVRAA 892
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568931090  212 ASTPVTAASDQKQEEKPKPDPVfqSPSTVLRlvlSGEKKEQAGQMPETAAGEPTPEP 268
Cdd:PHA03378  893 AASTVTQAPTEYTGERRGVGPM--HPTDIPP---SKRAKTDAYVESQPPHGGQSHSF 944
PRK11901 PRK11901
hypothetical protein; Reviewed
121-247 9.10e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 43.13  E-value: 9.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  121 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPP--------IGRPAS 183
Cdd:PRK11901  113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPatvapskgAKVPAT 192
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568931090  184 TPTPPQLPSQVPEHSPVVygtveSAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 247
Cdd:PRK11901  193 AETHPTPPQKPATKKPAV-----NHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
167-331 1.01e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  167 SGGGSRNPTPPIGRPASTPTPPQLPSQVPEHSPVVyGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLS 246
Cdd:PRK07764  601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAA-PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  247 GEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAP 316
Cdd:PRK07764  680 APPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                         170
                  ....*....|....*
gi 568931090  317 PVPSPTSCTAASGPS 331
Cdd:PRK07764  760 PPPAPAPAAAPAAAP 774
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
180-347 2.58e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 42.08  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   180 RPASTPTPPQLPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPET 259
Cdd:pfam13254  165 KPKAQPSQPAQPAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADT 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   260 AAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLTD 334
Cdd:pfam13254  244 LSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSSP 323
                          170
                   ....*....|....
gi 568931090   335 NSD-ICKKPCSVAP 347
Cdd:pfam13254  324 DRDpLSPKPKPQSP 337
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
117-302 3.21e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 41.96  E-value: 3.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   117 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQL-PSQVP 195
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRgPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   196 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 271
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 568931090   272 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 302
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
121-269 3.37e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 3.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  121 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPigrPASTPTPPQLPSQVPEHSPV 200
Cdd:PRK07003  467 DAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPA---AAAPPAPEARPPTPAAAAPA 543
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  201 VYGTVESAHL-----------------AASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPEtAAGE 263
Cdd:PRK07003  544 ARAGGAAAALdvlrnagmrvssdrgarAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ-AAES 622

                  ....*.
gi 568931090  264 PTPEPP 269
Cdd:PRK07003  623 RGAPPP 628
PHA03247 PHA03247
large tegument protein UL36; Provisional
24-347 5.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090   24 IQPPRAAIPNSSPSIRPGVQTPTAVYQANQhimmvnhlpmpyPVTQGHQYCIPqyRHSGPPYVGPPQQYPVQPPGPGPFY 103
Cdd:PHA03247 2568 VPPPRPAPRPSEPAVTSRARRPDAPPQSAR------------PRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSP 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  104 PGPGPGDFANAYGTPfyPSQPVYQSAPiivPTQQQPPPAKREKKTIRIRDPNQGgkditeeimsgggsrnPTPPIGRPAS 183
Cdd:PHA03247 2634 AANEPDPHPPPTVPP--PERPRDDPAP---GRVSRPRRARRLGRAAQASSPPQR----------------PRRRAARPTV 2692
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  184 TPT------PPQLPSQVPEHSPVVYGT----VESAHLAASTPVTAAsdqkqeekPKPDPVFQSPSTVLRLVLSGEKKEQA 253
Cdd:PHA03247 2693 GSLtsladpPPPPPTPEPAPHALVSATplppGPAAARQASPALPAA--------PAPPAVPAGPATPGGPARPARPPTTA 2764
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  254 GQMPETAAGEPTPEPPRTSSPTSLPPL--ARSSLPSPMSAALSSQPLfTAEDKCELPSSKEedAPPVPSPTSCTAASGPS 331
Cdd:PHA03247 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLseSRESLPSPWDPADPPAAV-LAPAAALPPAASP--AGPLPPPTSAQPTAPPP 2841
                         330
                  ....*....|....*..
gi 568931090  332 LTDNSDICKKPC-SVAP 347
Cdd:PHA03247 2842 PPGPPPPSLPLGgSVAP 2858
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
121-323 5.85e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.07  E-value: 5.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  121 PSQPVYQSAPIiVPTQQQPPPAKREKKtiRIRDPNQGGKDITEEImsgggSRNPTPPIGRPAS--------------TPT 186
Cdd:PLN03209  343 PTKPVTPEAPS-PPIEEEPPQPKAVVP--RPLSPYTAYEDLKPPT-----SPIPTPPSSSPASsksvdavakpaepdVVP 414
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931090  187 PPQLPSQVPEHSPvvyGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP 266
Cdd:PLN03209  415 SPGSASNVPEVEP---AQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPP 491
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568931090  267 EPPRTSS----------PTSLPPLARSSLPSPMSAALSSQPLFTAEDKCelPSSKEEDAPPVPSPTS 323
Cdd:PLN03209  492 ANMRPLSpyavyddlkpPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTA--LADEQHHAQPKPRPLS 556
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH