Conserved domains on  [gi|1907155774|ref|XP_036019961|]

eukaryotic translation initiation factor 4 gamma 3 isoform X2 [Mus musculus]

Protein Classification

eukaryotic translation initiation factor 4 gamma 3 (domain architecture ID 10501431)

Eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) is a component of the protein complex eIF4F, which is involved in recognition of the mRNA cap, ATP-dependent unwinding of 5'-terminal secondary structure, and recruitment of mRNA to the ribosome.

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
937-1165 7.66e-64

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.66e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  937 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 1016
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1017 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 1096
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155774 1097 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1165
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1605-1737 4.63e-49

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 170.93  E-value: 4.63e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1605 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1684
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907155774 1685 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1737
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1403-1515 2.80e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.47  E-value: 2.80e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1403 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1482
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907155774 1483 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1515
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
57-518 1.37e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.89  E-value: 1.37e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   57 RALQTPAPQQIPRGPVQQPLEDRLFPPTVSAvySTVTQVA-----RQPGPPTPAPYSAHeiskglpslAATPPGHASSPG 131
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSA--RPRAPVDdrgdpRGPAPPSPLPPDTH---------APDPPPPSPSPA 2634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  132 LSQNAGPATLVYPQAPQtmnsqPQARSPPGRtVPIHCTDTRKRRKVLEQSPV--YRSLAGRGWIKYYIFFQRPqiqPPRA 209
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPER-----PRDDPAPGR-VSRPRRARRLGRAAQASSPPqrPRRRAARPTVGSLTSLADP---PPPP 2705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  210 AIPNSSP-SIRPGVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPpgpgpfypgpgp 288
Cdd:PHA03247  2706 PTPEPAPhALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA------------ 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  289 gdfANAYGTPfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQggkdiTEEIMSGGGSRNPTPPIGRPASTPTPP 368
Cdd:PHA03247  2774 ---APAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA-----ALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  369 QQLPSQVPEHSPVVYG---------TVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPE 439
Cdd:PHA03247  2844 GPPPPSLPLGGSVAPGgdvrrrppsRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  440 TAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPS-PTSctAASGPSLTDNSD 518
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAP--ASSTPPLTGHSL 3001
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1716-1762 8.96e-05

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.96e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155774 1716 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1762
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
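
Each hit above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value), and the `Cdd:` rows of its alignment give the first and last model positions that were matched. As a rough reading aid, the sketch below parses one statistics line and computes what fraction of the domain model the alignment spans. The function names `parse_stats` and `model_coverage` are hypothetical helpers written for this illustration, not part of any NCBI tool, and the regular expression assumes the exact line layout shown on this page.

```python
import re

# Pattern for lines such as:
#   Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.96e-05
# The bracketed flag is optional. This layout is an assumption based on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one statistics line into a dict (hypothetical helper)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not a recognised statistics line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(cdd_start, cdd_end, cd_length):
    """Fraction of the domain model spanned by the aligned Cdd positions."""
    if not 1 <= cdd_start <= cdd_end <= cd_length:
        raise ValueError("model positions must satisfy 1 <= start <= end <= length")
    return (cdd_end - cdd_start + 1) / cd_length


if __name__ == "__main__":
    mif4g = parse_stats(
        "Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.66e-64"
    )
    # The pfam02854 rows run from model position 1 to 203.
    print(mif4g["pssm_id"], model_coverage(1, 203, mif4g["cd_length"]))

    w2 = parse_stats(
        "Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.96e-05"
    )
    # The cd11560 rows run from model position 150 to 194.
    print(w2["pssm_id"], round(model_coverage(150, 194, w2["cd_length"]), 2))
```

Read this way, the MIF4G hit spans the whole 203-position model, while the cd11560 alignment covers only the last 45 of 194 model positions (about 0.23), which is consistent with its much weaker E-value.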
 
Name Accession Description Interval E-value
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
938-1162 4.04e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 179.48  E-value: 4.04e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   938 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 1017
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  1018 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 1097
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907155774  1098 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1162
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1403-1515 1.16e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 123.12  E-value: 1.16e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  1403 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1482
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1907155774  1483 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1515
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1675-1759 1.25e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 110.46  E-value: 1.25e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  1675 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1754
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1907155774  1755 WLREA 1759
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1688-1764 2.99e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 97.60  E-value: 2.99e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907155774 1688 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1764
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-518 1.37e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.89  E-value: 1.37e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   57 RALQTPAPQQIPRGPVQQPLEDRLFPPTVSAvySTVTQVA-----RQPGPPTPAPYSAHeiskglpslAATPPGHASSPG 131
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSA--RPRAPVDdrgdpRGPAPPSPLPPDTH---------APDPPPPSPSPA 2634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  132 LSQNAGPATLVYPQAPQtmnsqPQARSPPGRtVPIHCTDTRKRRKVLEQSPV--YRSLAGRGWIKYYIFFQRPqiqPPRA 209
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPER-----PRDDPAPGR-VSRPRRARRLGRAAQASSPPqrPRRRAARPTVGSLTSLADP---PPPP 2705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  210 AIPNSSP-SIRPGVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPpgpgpfypgpgp 288
Cdd:PHA03247  2706 PTPEPAPhALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA------------ 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  289 gdfANAYGTPfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQggkdiTEEIMSGGGSRNPTPPIGRPASTPTPP 368
Cdd:PHA03247  2774 ---APAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA-----ALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  369 QQLPSQVPEHSPVVYG---------TVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPE 439
Cdd:PHA03247  2844 GPPPPSLPLGGSVAPGgdvrrrppsRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  440 TAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPS-PTSctAASGPSLTDNSD 518
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAP--ASSTPPLTGHSL 3001
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
92-538 2.81e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 2.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   92 VTQVARQPGPPTPAPY---------SAHEISKGLPSLAATPPGHASSPGLSQnAGPATLVYPQAPQTMNSQPQARSPPGR 162
Cdd:pfam03154  137 IDQDNRSTSPSIPSPQdnesdsdssAQQQILQTQPPVLQAQSGAASPPSPPP-PGTTQAATAGPTPSAPSVPPQGSPATS 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  163 TVPIHCTDTRKRRKVLEQSPVYRSlagrgwikyyiffQR-PQIQPPRAAIPNSSPsirPGVQTPTAVYQANQHIMMVnhl 241
Cdd:pfam03154  216 QPPNQTQSTAAPHTLIQQTPTLHP-------------QRlPSPHPPLQPMTQPPP---PSQVSPQPLPQPSLHGQMP--- 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  242 PMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYgTPfyPSQPVYQSAPiivPTQQQP-P 320
Cdd:pfam03154  277 PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIH-TP--PSQSQLQSQQ---PPREQPlP 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  321 PAKRekktirirdpnqggkditeeimsgggsrnPTPPIGRPASTPTPPQQLPsQVPEHSPvvygtvesaHLAASTPVTAA 400
Cdd:pfam03154  347 PAPL-----------------------------SMPHIKPPPTTPIPQLPNP-QSHKHPP---------HLSGPSPFQMN 387
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  401 SDQKqeekpkPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPL 479
Cdd:pfam03154  388 SNLP------PPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSP 461
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155774  480 FTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdnsdicKKPCSVAPHDSQLISSTI 538
Cdd:pfam03154  462 FPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI-------QPPSSASVSSSGPVPAAV 510
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1716-1762 8.96e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.96e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155774 1716 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1762
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
 
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1605-1731 4.53e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.22  E-value: 4.53e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1605 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1681
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907155774 1682 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1731
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1645-1764 2.39e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are similar in sequence and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.37  E-value: 2.39e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1645 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1720
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1907155774 1721 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1764
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-518 1.37e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.89  E-value: 1.37e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   57 RALQTPAPQQIPRGPVQQPLEDRLFPPTVSAvySTVTQVA-----RQPGPPTPAPYSAHeiskglpslAATPPGHASSPG 131
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSA--RPRAPVDdrgdpRGPAPPSPLPPDTH---------APDPPPPSPSPA 2634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  132 LSQNAGPATLVYPQAPQtmnsqPQARSPPGRtVPIHCTDTRKRRKVLEQSPV--YRSLAGRGWIKYYIFFQRPqiqPPRA 209
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPER-----PRDDPAPGR-VSRPRRARRLGRAAQASSPPqrPRRRAARPTVGSLTSLADP---PPPP 2705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  210 AIPNSSP-SIRPGVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPpgpgpfypgpgp 288
Cdd:PHA03247  2706 PTPEPAPhALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA------------ 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  289 gdfANAYGTPfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQggkdiTEEIMSGGGSRNPTPPIGRPASTPTPP 368
Cdd:PHA03247  2774 ---APAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA-----ALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  369 QQLPSQVPEHSPVVYG---------TVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPE 439
Cdd:PHA03247  2844 GPPPPSLPLGGSVAPGgdvrrrppsRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  440 TAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPS-PTSctAASGPSLTDNSD 518
Cdd:PHA03247  2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAP--ASSTPPLTGHSL 3001
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1617-1764 5.64e-09

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase-activating protein (GAP), as well as a GDP dissociation inhibitor (GDI), during translation initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 56.86  E-value: 5.64e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1617 EEKADD--ERIFDWVEANLDESQMS-------SPTFLRALMTAVCkaAIIADCsTFRVDTA-VIKQRVPILLKYLDSDte 1686
Cdd:cd11561      1 EEEEDErvDELGEFLKKNKDESGLSelkeilkEAERLDVVKDKAV--LVLAEV-LFDENIVkEIKKRKALLLKLVTDE-- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774 1687 kelQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKWeSSKDPAEQAGKGVA---LKSVTAFFTWLRE 1758
Cdd:cd11561     76 ---KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKW-YEKVSKKYVSKEKSkkvRKAAEPFVEWLEE 151

                   ....*.
gi 1907155774 1759 AEEESE 1764
Cdd:cd11561    152 AEEEEE 157
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
348-512 1.37e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 50.23  E-value: 1.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  348 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 426
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAvPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  427 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 500
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1907155774  501 SPTSCTAASGPS 512
Cdd:PRK07003   527 PPAPEARPPTPA 538
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
92-538 2.81e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 2.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   92 VTQVARQPGPPTPAPY---------SAHEISKGLPSLAATPPGHASSPGLSQnAGPATLVYPQAPQTMNSQPQARSPPGR 162
Cdd:pfam03154  137 IDQDNRSTSPSIPSPQdnesdsdssAQQQILQTQPPVLQAQSGAASPPSPPP-PGTTQAATAGPTPSAPSVPPQGSPATS 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  163 TVPIHCTDTRKRRKVLEQSPVYRSlagrgwikyyiffQR-PQIQPPRAAIPNSSPsirPGVQTPTAVYQANQHIMMVnhl 241
Cdd:pfam03154  216 QPPNQTQSTAAPHTLIQQTPTLHP-------------QRlPSPHPPLQPMTQPPP---PSQVSPQPLPQPSLHGQMP--- 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  242 PMPYPVTQGHqyciPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYgTPfyPSQPVYQSAPiivPTQQQP-P 320
Cdd:pfam03154  277 PMPHSLQTGP----SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIH-TP--PSQSQLQSQQ---PPREQPlP 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  321 PAKRekktirirdpnqggkditeeimsgggsrnPTPPIGRPASTPTPPQQLPsQVPEHSPvvygtvesaHLAASTPVTAA 400
Cdd:pfam03154  347 PAPL-----------------------------SMPHIKPPPTTPIPQLPNP-QSHKHPP---------HLSGPSPFQMN 387
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  401 SDQKqeekpkPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPL 479
Cdd:pfam03154  388 SNLP------PPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSP 461
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155774  480 FTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdnsdicKKPCSVAPHDSQLISSTI 538
Cdd:pfam03154  462 FPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI-------QPPSSASVSSSGPVPAAV 510
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1716-1762 8.96e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.96e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907155774 1716 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1762
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
348-518 1.95e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 1.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  348 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 426
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  427 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 506
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170
                   ....*....|..
gi 1907155774  507 AASGPSLTDNSD 518
Cdd:PRK12323   525 SIPDPATADPDD 536
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
297-483 3.23e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 45.04  E-value: 3.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  297 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQQLPSQVP 376
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRGPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  377 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 452
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907155774  453 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 483
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
PHA03378 PHA03378
EBNA-3B; Provisional
60-508 3.23e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 3.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   60 QTPAPQQIPRGPVQQPLEDRLFPPTVsavystvtQVARQPGPPTPAPYSAHEISKGLPSLAATPPGH-----ASSPGLSQ 134
Cdd:PHA03378   446 HSQAPTVVLHRPPTQPLEGPTGPLSV--------QAPLEPWQPLPHPQVTPVILHQPPAQGVQAHGSmldllEKDDEDME 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  135 NAGPATLVYPQAPQTMNsqpqarsppGRTVPIhctdtrkrrkvleqspVYRSLAGrgwikyyIFFQRPQIQPPRAAIPNS 214
Cdd:PHA03378   518 QRVMATLLPPSPPQPRA---------GRRAPC----------------VYTEDLD-------IESDEPASTEPVHDQLLP 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  215 SPSIRP-GVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFaN 293
Cdd:PHA03378   566 APGLGPlQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITF-N 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  294 AYGTPfYPSQPVYQSAPIIVPTQQQPPPAKREkktirirdPNQGGKDITEEIMSGGGSRNP---TPPIGRPASTPTPPQQ 370
Cdd:PHA03378   645 VLVFP-TPHQPPQVEITPYKPTWTQIGHIPYQ--------PSPTGANTMLPIQWAPGTMQPpprAPTPMRPPAAPPGRAQ 715
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  371 LPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAA----GEPT 446
Cdd:PHA03378   716 RPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQqrprGAPT 795
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907155774  447 PEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 508
Cdd:PHA03378   796 PQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
PRK10263 PRK10263
DNA translocase FtsK; Provisional
207-502 7.59e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 7.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  207 PRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPPQQYPVQPPGP 279
Cdd:PRK10263   347 ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQPYYAPAAEQP 415
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  280 gpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsggGSRNPTPPIG 359
Cdd:PRK10263   416 ---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ-----STYQTEQTYQ 469
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  360 RPAstPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEK------------PKPDPVfqspstvlrlvls 427
Cdd:PRK10263   470 QPA--AQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRarereqlaawyqPIPEPV------------- 534
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907155774  428 gekKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEEDAPPVPSP 502
Cdd:PRK10263   535 ---KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEGIGPQLPRP 609
PHA03247 PHA03247
large tegument protein UL36; Provisional
58-465 1.07e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   58 ALQTPAPQQIPRGPVQQPLEDRLFP--PTVSAVYSTVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPGHAS------- 128
Cdd:PHA03247  2718 ATPLPPGPAAARQASPALPAAPAPPavPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesres 2797
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  129 SPGLSQNAGPATLVYPQAP-QTMNSQPQARSPPGrtvpihcTDTRKRRKVLEQSPVYRSLAGRGWIkyyiffqrpqiqpp 207
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAaLPPAASPAGPLPPP-------TSAQPTAPPPPPGPPPPSLPLGGSV-------------- 2856
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  208 raaIPNSSPSIRPGVQTPTAVYQANQHImMVNHLPMPYPVTQGHQYCIPQYRHSGPPyvgppqqypvqppgpgpfypgpg 287
Cdd:PHA03247  2857 ---APGGDVRRRPPSRSPAAKPAAPARP-PVRRLARPAVSRSTESFALPPDQPERPP----------------------- 2909
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  288 pgdfanaygTPFYPSQPVYQSAPIIVPTQQQPPPAKRekktiRIRDPNQGGKDITEEIMSGGGSRNPTPPIGRPASTPTP 367
Cdd:PHA03247  2910 ---------QPQAPPPPQPQPQPPPPPQPQPPPPPPP-----RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  368 PQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVfqspsTVLRLVLSGEKKEQA-------GQMPET 440
Cdd:PHA03247  2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPV-----SLKQTLWPPDDTEDSdadslfdSDSERS 3050
                          410       420
                   ....*....|....*....|....*
gi 1907155774  441 AAGEPTPEPPRTSSPTSLPPLARSS 465
Cdd:PHA03247  3051 DLEALDPLPPEPHDPFAHEPDPATP 3075
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
360-515 1.79e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 1.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  360 RPASTPTPPQQLPSQvpEHSPVVygTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 430
Cdd:PHA03307    23 RPPATPGDAADDLLS--GSQGQL--VSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  431 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 509
Cdd:PHA03307    99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                   ....*.
gi 1907155774  510 GPSLTD 515
Cdd:PHA03307   179 PEETAR 184
PHA03378 PHA03378
EBNA-3B; Provisional
65-382 2.02e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.13  E-value: 2.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   65 QQIPRGPVQQPLEDRlfPPTVSAVY--STVTQVARQPGPPTPAPYSAHEISKGLPSLAATPPghasspglsqnAGPATLV 142
Cdd:PHA03378   639 QPITFNVLVFPTPHQ--PPQVEITPykPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPP-----------RAPTPMR 705
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  143 YPQAPQTMNSQPQArsPPGRTVPIHCTDTRKRRKVLEQSPVYRSLAGRGWIkyyiffQRPQIQPPRAAIPNSSpsirPGV 222
Cdd:PHA03378   706 PPAAPPGRAQRPAA--ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRA------RPPAAAPGRARPPAAA----PGA 773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  223 QTPTAVYQANqhimmvnhlpmPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPF--Y 300
Cdd:PHA03378   774 PTPQPPPQAP-----------PAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSlkK 842
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  301 PS----------QPVYQS--------APIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEI--MSGGGSRNPT--PPI 358
Cdd:PHA03378   843 PAalerqaaagpTPSPGSgtsdkivqAPVFYPPVLQPIQVMRQLGSVRAAAASTVTQAPTEYTgeRRGVGPMHPTdiPPS 922
                          330       340
                   ....*....|....*....|....
gi 1907155774  359 GRPASTPTPPQQLPSQVPEHSPVV 382
Cdd:PHA03378   923 KRAKTDAYVESQPPHGGQSHSFSV 946
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
347-512 2.05e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  347 SGGGSRNPTPPIGRPASTPTPPQQLPSQVPehSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 426
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAP 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  427 SGEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDA 496
Cdd:PRK07764   679 AAPPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQP 758
                          170
                   ....*....|....*.
gi 1907155774  497 PPVPSPTSCTAASGPS 512
Cdd:PRK07764   759 PPPPAPAPAAAPAAAP 774
PRK11901 PRK11901
hypothetical protein; Reviewed
301-428 2.13e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.36  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  301 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPPIGRPASTPTPPQQL 371
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  372 PSQVPEHSPVVYGT--VESAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 428
Cdd:PRK11901   193 AETHPTPPQKPATKkpAVNHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
82-264 3.79e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 42.33  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   82 PPTVSAVYSTVTQVAR--------------QPGPPTPAPYSAHeiskglpslAATPPGHASSPGLSQNAGPATLVYPQAP 147
Cdd:pfam09770  176 APQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQPAPAP---------AQPPAAPPAQQAQQQQQFPPQIQQQQQP 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  148 QTMNSQPQARSPPGRTVPIhctdtrkrrkvleqspvyrslagrgwikyyifFQRPQIQPPRAAIPNSSPSIRPGVQTPTA 227
Cdd:pfam09770  247 QQQPQQPQQHPGQGHPVTI--------------------------------LQRPQSPQPDPAQPSIQPQAQQFHQQPPP 294
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1907155774  228 VYQANQHIM------MVNHLPMPYPVTQGHQYCIPQYRHSGPP 264
Cdd:pfam09770  295 VPVQPTQILqnpnrlSAARVGYPQNPQPGVQPAPAHQAHRQQG 337
PRK10263 PRK10263
DNA translocase FtsK; Provisional
372-513 4.93e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 4.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  372 PSQVPEHSPVVYGTVESAHLAASTPVTAASdqkQEEKPKPDPVFQSPStvlrlVLSGEKKEQAGQMP-ETAAGEPTPEPP 450
Cdd:PRK10263   301 QPEYDEYDPLLNGAPITEPVAVAAAATTAT---QSWAAPVEPVTQTPP-----VASVDVPPAQPTVAwQPVPGPQTGEPV 372
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907155774  451 RTSSPTSLPPLARSSLPSPMSAALSSQPlFTAEDKCELPSSKEEDAPPVPSPTSCTAASGPSL 513
Cdd:PRK10263   373 IAPAPEGYPQQSQYAQPAVQYNEPLQQP-VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-418 5.30e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774    5 QKPALKSGSAAAAGTGPGTGAAAAAAVPPPHPAAAAAAAAVAAAAApphpniralQTPAPQQIPRGP---VQQPLEdrLF 81
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATS---------QPPNQTQSTAAPhtlIQQTPT--LH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774   82 PPTVSAVYSTVTQvARQPGPPTPAPYSAHEiskglpslaaTPPGHASSPGL--SQNAGPATLVYPQAPQ-----TMNSQP 154
Cdd:pfam03154  239 PQRLPSPHPPLQP-MTQPPPPSQVSPQPLP----------QPSLHGQMPPMphSLQTGPSHMQHPVPPQpfpltPQSSQS 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  155 QARSPPGRTVPIHCTDTrkrrkvlEQSPVYRSlagrgwikyyiffQRPQIQPPR----AAIPNSSPSIRPGVQTPTAVYQ 230
Cdd:pfam03154  308 QVPPGPSPAAPGQSQQR-------IHTPPSQS-------------QLQSQQPPReqplPPAPLSMPHIKPPPTTPIPQLP 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  231 ANQHIMMVNHLPMPYPVTQghqycipqyrHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYGTPFYPSQPvyqsaP 310
Cdd:pfam03154  368 NPQSHKHPPHLSGPSPFQM----------NSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----P 432
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  311 IIVPTQQQPPPAKREKKTIRIRD-PNQggKDITEEIMSGGGSRNPTPPIGRPASTPT--PPQQLPSQVPEHSPVVYGTVE 387
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQvPSQ--SPFPQHPFVPGGPPPITPPSGPPTSTSSamPGIQPPSSASVSSSGPVPAAV 510
                          410       420       430
                   ....*....|....*....|....*....|..
gi 1907155774  388 SAHLAASTPVTAASDQKQE-EKPKPDPVFQSP 418
Cdd:pfam03154  511 SCPLPPVQIKEEALDEAEEpESPPPPPRSPSP 542
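The aligned rows above can be summarized numerically. The following is a minimal sketch, not part of the CDD output, that counts aligned and identical columns for the final alignment block of the pfam03154 hit. It assumes that lowercase letters mark residues left unaligned to the domain model and that "-" marks a gap; the function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query: str, subject: str) -> dict:
    """Count gap, aligned, and identical columns in one pairwise alignment block.

    Assumption: uppercase letters are aligned residues, lowercase letters are
    unaligned residues, and '-' is a gap character.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")

    aligned = identical = gaps = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            gaps += 1                  # a gap in either row
        elif q.isupper() and s.isupper():
            aligned += 1               # both residues aligned to the model
            if q == s:
                identical += 1
        # columns with a lowercase residue and no gap are skipped as unaligned

    return {"aligned": aligned, "identical": identical, "gaps": gaps}


# Rows copied from the last block of the pfam03154 alignment (query 388-418).
QUERY = "SAHLAASTPVTAASDQKQE-EKPKPDPVFQSP"
SUBJECT = "SCPLPPVQIKEEALDEAEEpESPPPPPRSPSP"

stats = alignment_stats(QUERY, SUBJECT)
print(stats, round(stats["identical"] / stats["aligned"], 3))
```

Identity computed this way covers only one block; a per-hit figure would need the same count accumulated over every block of the alignment.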
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
314-497 7.42e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 40.92  E-value: 7.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  314 PTQQQPPPAKREKKTIRirdPNQGGKDiteeiMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPVVYGTVESAHLAA 393
Cdd:pfam13254  170 PSQPAQPAWMKELNKIR---QSRASVD-----LGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEA 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907155774  394 STPVTaasdqKQEEKPKPDPVFQSPSTVLRLvlsgEKKEQAGQMPETAAGEPT--PEPPRTSSPTSLPPLARSSLPSPMS 471
Cdd:pfam13254  242 DTLST-----DKEQSPAPTSASEPPPKTKEL----PKDSEEPAAPSKSAEASTekKEPDTESSPETSSEKSAPSLLSPVS 312
                          170       180
                   ....*....|....*....|....*.
gi 1907155774  472 AALSSQPLFTAEDKCELPSSKEEDAP 497
Cdd:pfam13254  313 KASIDKPLSSPDRDPLSPKPKPQSPP 338
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
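
To illustrate how the E-value threshold relates to the listed hits, here is a minimal sketch that parses the "Pssm-ID" summary lines shown on this page and keeps those at or below 0.01. The regular expression, the helper name `parse_summary`, and the choice of `<=` for the comparison are assumptions made for illustration; they do not describe how CD-Search itself applies the threshold.

```python
import re

# Pattern for summary lines such as:
#   Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.30e-03
# The bracketed tag is optional (the MIF4G hit has none).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Turn one summary line into a dict with typed fields (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "tag": match["tag"],  # None when no [tag] is present
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


E_VALUE_THRESHOLD = 0.01  # from "Preset Options" on this page

# Summary lines copied from this page.
lines = [
    "Pssm-ID: 397130  Cd Length: 203  Bit Score: 216.08  E-value: 7.66e-64",
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 5.30e-03",
    "Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 40.92  E-value: 7.42e-03",
]

hits = [parse_summary(line) for line in lines]
reported = sorted(
    (h for h in hits if h["e_value"] <= E_VALUE_THRESHOLD),
    key=lambda h: h["e_value"],
)
for hit in reported:
    print(hit["pssm_id"], hit["tag"], hit["e_value"])
```

The two multi-domain hits sit close to the cutoff (5.30e-03 and 7.42e-03), so a stricter threshold such as 0.001 would leave only the MIF4G hit among the three lines parsed here.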

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.