Conserved domains on  [gi|1958759467|ref|XP_038960031|]

nuclear receptor coactivator 6 isoform X2 [Rattus norvegicus]

Protein Classification

SANT/Myb-like DNA-binding domain-containing protein; Nup50 family Ran-binding domain-containing protein (domain architecture ID 13845479)

SANT (SWI3, ADA2, N-CoR and TFIIIB)/Myb-like DNA-binding domain-containing protein binds DNA and may function as a transcription factor; also contains a Med15 domain, a critical transducer of gene activation signals that control early metazoan development.

Nup50 family Ran-binding domain (RanBD)-containing protein similar to RanBD domain region of Homo sapiens nuclear pore complex protein Nup50, a component of the nuclear pore complex that has a direct role in nuclear protein import.


List of domain hits

Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.61e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.



Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.61e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759467  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
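Each hit in this listing carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, optionally with a `[Multi-domain]` flag. The following is a minimal sketch of how such a line could be parsed from this text dump. The function name `parse_summary` and the returned field names are illustrative assumptions, not part of any NCBI tool or API, and the pattern is fitted only to the layout seen on this page.

```python
import re

# Pattern fitted to the summary lines as they appear in this text dump.
# The "[Multi-domain]" flag is optional and sits between the ID and Cd Length.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.61e-60"
    print(parse_summary(line))
```

Parsing the E-value as a float keeps hits sortable by significance; a line that does not follow this layout yields `None` rather than raising.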
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-799 2.20e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 69.27  E-value: 2.20e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQvps 531
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAG--- 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  532 ttaatpgnSGALQLQANQSVQHAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQ 611
Cdd:pfam09606  217 --------AQMGQQAQANGGMNPQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMG 285
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  612 GPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPh 691
Cdd:pfam09606  286 PPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP- 364
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  692 nqmMGPQGQVLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQ 765
Cdd:pfam09606  365 ---MQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQ 440
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1958759467  766 QGPVSNSPSQVMGIQGQVLRPPGPSPHMAQQHTD 799
Cdd:pfam09606  441 RTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYRE 474
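The alignment above spans query residues 377-799 and model positions 60-474 of a model whose Cd Length is 732, so the hit covers only part of the pfam09606 model. The arithmetic can be sketched as follows; the helper names `span_length` and `model_coverage` are hypothetical and introduced here only for illustration, with coordinates treated as 1-based and inclusive as in the alignment listing.

```python
def span_length(start, end):
    """Number of positions in a 1-based, inclusive interval."""
    return end - start + 1


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model covered by the aligned model positions."""
    return span_length(model_start, model_end) / cd_length


if __name__ == "__main__":
    # Values read from the Med15 (pfam09606) hit on this page.
    query_len = span_length(377, 799)
    coverage = model_coverage(60, 474, 732)
    print(f"query residues aligned: {query_len}")
    print(f"model coverage: {coverage:.3f}")
```

For comparison, the Nucleic_acid_bd hit aligns model positions 1-143 of a 143-position model, which is full coverage under the same calculation.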
PHA03247 super family cl33720
large tegument protein UL36; Provisional
151-482 7.91e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 7.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  151 GPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQggnaSSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQS 230
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR----RARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  231 HPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGIRPQFTAPTQVPV 310
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  311 PPGWNQLPSGALQP-------PPAQGSLGPMTTNQGWKKAPLPSPMQAQLQARPSLAT--VQTPSHPPPPYPFGSQQASQ 381
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgpPPPSLPLGGSVAPGGDVRRR 2865
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  382 AHTNFPqMSNPGQFTAPQMKSL-QGGPSRVPTPLQQPHLTNKSPAS--SPSSFQQGSPASSPTVNQTQQQMGPRP----- 453
Cdd:PHA03247  2866 PPSRSP-AAKPAAPARPPVRRLaRPAVSRSTESFALPPDQPERPPQpqAPPPPQPQPQPPPPPQPQPPPPPPPRPqppla 2944
                          330       340
                   ....*....|....*....|....*....
gi 1958759467  454 PQNNPLSQGFQQPVSSPGRNPMVQQGNVP 482
Cdd:PHA03247  2945 PTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1052-1272 7.36e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 7.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1052 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1130
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1131 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1199
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759467 1200 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1272
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1670-1937 1.58e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1670 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1746
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1747 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1820
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1821 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllkmTSSPMGPSSTSTGPIL 1900
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT----TDPAGAGEPSGAVPQP 2962
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759467 1901 PGGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1937
Cdd:PHA03247  2963 WLGALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1494-1813 2.00e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1494 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1566
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1567 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1644
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1645 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1718
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1719 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1795
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759467 1796 PGSSANRRSPVSSSKGKG 1813
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
 
PHA03247 PHA03247
large tegument protein UL36; Provisional
166-539 3.19e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLSQGFQQPVSSPGR 472
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPPQPQPQP 2923
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958759467  473 NPMVQQGNVPPnfmvmqqqPPSQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTAATPGN 539
Cdd:PHA03247  2924 PPPPQPQPPPP--------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
615-786 1.61e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  615 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 694
Cdd:TIGR01628  369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  695 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 774
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
                          170
                   ....*....|..
gi 1958759467  775 QVMGIQGQVLRP 786
Cdd:TIGR01628  504 MQKQVLGERLFP 515
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1103-1409 2.69e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1103 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1178
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1179 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1254
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1255 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1331
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759467 1332 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1409
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
PHA03247 PHA03247
large tegument protein UL36; Provisional
199-579 2.75e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGWNQL--PSGALQPPPAQGSlGPMTTNQGWKK 342
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRAARPTvgSLTSLADPPPPPP-TPEPAPHALVS 2717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  343 A-PLPSPMQAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTnfPQMSNPGQFTAPQMKSlqGGPSRVPTPLQQPHLTN 421
Cdd:PHA03247  2718 AtPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPA--AGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  422 KSPA----SSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPLsQGFQQPVSSPGRNPMVQQGNVPPNFMVM----QQQPP 493
Cdd:PHA03247  2794 SRESlpspWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPA 2872
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  494 SQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTAATPGNSGAlQLQANQSVQHAgGQGAGPPQNQMQVSHGPP 573
Cdd:PHA03247  2873 AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQP-QPPPPPQPQPP-PPPPPRPQPPLAPTTDPA 2950

                   ....*.
gi 1958759467  574 NMMQPS 579
Cdd:PHA03247  2951 GAGEPS 2956
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1528-1898 5.07e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1528 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1607
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1608 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1685
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1686 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1762
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1763 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1839
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759467 1840 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1898
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
 
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-799 2.20e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 69.27  E-value: 2.20e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQvps 531
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAG--- 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  532 ttaatpgnSGALQLQANQSVQHAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQ 611
Cdd:pfam09606  217 --------AQMGQQAQANGGMNPQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMG 285
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  612 GPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPh 691
Cdd:pfam09606  286 PPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP- 364
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  692 nqmMGPQGQVLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQ 765
Cdd:pfam09606  365 ---MQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQ 440
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1958759467  766 QGPVSNSPSQVMGIQGQVLRPPGPSPHMAQQHTD 799
Cdd:pfam09606  441 RTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYRE 474
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
268-744 3.59e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 58.87  E-value: 3.59e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  268 QQQQQQQQQQQQQQLQTRPLQQHQQQQPQGIRPQFTAptqvPVPPGwNQLPSGALQPPPAQGSLGPMTTNQGWKK---AP 344
Cdd:pfam09606   64 QGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMG----PMGPG-PGGPMGQQMGGPGTASNLLASLGRPQMPmggAG 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  345 LPSPMQAQLQARPSLATvqtpsHPPPPYPFGSQQASQAHTNFPQMSnPGQFTAPQMKSLQGGPSRVPTPLQ--------- 415
Cdd:pfam09606  139 FPSQMSRVGRMQPGGQA-----GGMMQPSSGQPGSGTPNQMGPNGG-PGQGQAGGMNGGQQGPMGGQMPPQmgvpgmpgp 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  416 ---QPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMgpRPPQNNPLSQGFQQPVSSPGRNPMVQQGNVPPNFMvmqqqp 492
Cdd:pfam09606  213 adaGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQ--QQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPM------ 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  493 psqgppslhPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPQNQMQVshgp 572
Cdd:pfam09606  285 ---------GPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLET---- 351
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  573 pnmmqpslmGIHGNINNQQAGSSGVPQVTLGSM------QGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMIL 646
Cdd:pfam09606  352 ---------WNPGNFGGLGANPMQRGQPGMMSSpspvpgQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPS 422
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  647 SRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKAQFNS 726
Cdd:pfam09606  423 PALIPSPSPQMSQQPAQQRTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYREKYRQLTKYIEPLKRMIAKMENDPGDIDK 502
                          490
                   ....*....|....*...
gi 1958759467  727 QNQsnvMPGPAQIMRGPT 744
Cdd:pfam09606  503 MNK---MKRLLEILSNPS 517
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
430-709 2.49e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.20  E-value: 2.49e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  430 SFQQGSPASSPTVNQTQQQMGPRPPQNNPLSQGFQQPVSSpGRNPMVQQGNVPpnfmvmqqqppsqgppSLHP--GLGGM 507
Cdd:pfam09770  103 NRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRT-GYEKYKEPEPIP----------------DLQVdaSLWGV 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  508 PkrlPPGFSAGQANPnfmqgQVPSTTAATPGNS----------GALQLQANQSVQHAGGQGAGPPQNQMQVSHGPPNMMQ 577
Cdd:pfam09770  166 A---PKKAAAPAPAP-----QPAAQPASLPAPSrkmmsleeveAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFP 237
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  578 PSlmgihgninnqqagssgvpqvtlgSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQM 657
Cdd:pfam09770  238 PQ------------------------IQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPP 293
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958759467  658 MVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAP-HNQMMGPQGQVLLQQNPMI 709
Cdd:pfam09770  294 PVPVQPTQILQNPNRLSAARVGYPQNPQPGVQPaPAHQAHRQQGSFGRQAPII 346
PHA03247 PHA03247
large tegument protein UL36; Provisional
166-539 3.19e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLSQGFQQPVSSPGR 472
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPPQPQPQP 2923
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958759467  473 NPMVQQGNVPPnfmvmqqqPPSQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTAATPGN 539
Cdd:PHA03247  2924 PPPPQPQPPPP--------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
PHA03247 PHA03247
large tegument protein UL36; Provisional
151-482 7.91e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 7.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  151 GPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQggnaSSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQS 230
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR----RARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  231 HPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGIRPQFTAPTQVPV 310
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  311 PPGWNQLPSGALQP-------PPAQGSLGPMTTNQGWKKAPLPSPMQAQLQARPSLAT--VQTPSHPPPPYPFGSQQASQ 381
Cdd:PHA03247  2786 PAVASLSESRESLPspwdpadPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgpPPPSLPLGGSVAPGGDVRRR 2865
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  382 AHTNFPqMSNPGQFTAPQMKSL-QGGPSRVPTPLQQPHLTNKSPAS--SPSSFQQGSPASSPTVNQTQQQMGPRP----- 453
Cdd:PHA03247  2866 PPSRSP-AAKPAAPARPPVRRLaRPAVSRSTESFALPPDQPERPPQpqAPPPPQPQPQPPPPPQPQPPPPPPPRPqppla 2944
                          330       340
                   ....*....|....*....|....*....
gi 1958759467  454 PQNNPLSQGFQQPVSSPGRNPMVQQGNVP 482
Cdd:PHA03247  2945 PTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
615-786 1.61e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  615 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 694
Cdd:TIGR01628  369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  695 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 774
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
                          170
                   ....*....|..
gi 1958759467  775 QVMGIQGQVLRP 786
Cdd:TIGR01628  504 MQKQVLGERLFP 515
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
608-751 2.16e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.42  E-value: 2.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  608 QPQQGPPSQ-LMGMHQQIVPSQGQMAQQQGTLNPQNPMilsraqLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQ 686
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958759467  687 MMAPHNQMMGPQGQVLLQQNPmieqimtnQMQGNKAQFNSqnqsnvMPGPAQIMRGPTPNMQGNM 751
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------QPQSTASQGGQ------NKKLAQVLASATPQMQKQV 508
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
604-805 3.39e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 3.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  604 SMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmilsrAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQq 683
Cdd:pfam09770  202 AMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQ-----QQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPD- 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  684 gpqmmaPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkaqfnsqnqsNVMPGPAQIMrgPTPNMQGNMVQftgQMSGQML 763
Cdd:pfam09770  276 ------PAQPSIQPQAQQFHQQPP-----------------------PVPVQPTQIL--QNPNRLSAARV---GYPQNPQ 321
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1958759467  764 PQQGPVSNSPSQvmgiqgqvlRPPGPSPHMAQQHTDPATTAN 805
Cdd:pfam09770  322 PGVQPAPAHQAH---------RQQGSFGRQAPIITHPQQLAQ 354
PHA03247 PHA03247
large tegument protein UL36; Provisional
1052-1272 7.36e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 7.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1052 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1130
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1131 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1199
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759467 1200 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1272
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 PHA03247
large tegument protein UL36; Provisional
1081-1564 9.08e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 9.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1081 PASVPPSPDK-----QRMPMPVNTPLGSNSRKMVYQESPQNSSSPLGEMPSLP-EAGGSEVPSVSGGPSNMPSHLVVSQN 1154
Cdd:PHA03247  2557 PAAPPAAPDRsvpppRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1155 QLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHfPNVAAPTQTSRPktpnRASPRPYYPQTPNNRPPSTEPSEISL 1234
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA-AQASSPPQRPRR----RAARPTVGSLTSLADPPPPPPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1235 SPERLNAsiaglfppqinIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTVSnPPNFAAPQAHKLDSVVVSSGKQSNP 1314
Cdd:PHA03247  2712 PHALVSA-----------TPLPPGPAAAR----QASPALPAAPAPPAVPAGPAT-PGGPARPARPPTTAGPPAPAPPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1315 GTTKRASPSNSRRSSPGSSRKTTPSPgRQNSKAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNnnglns 1394
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP------ 2848
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1395 qnptvPAPAVGTVVEDNKESLNVPQDSDCQNSQGRKEQVNTELKAVPIQEAKMVVPEdqskkdgQPLDPNKLPSVEENKT 1474
Cdd:PHA03247  2849 -----SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-------PPDQPERPPQPQAPPP 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1475 LMSPAMREAPTSLSQLLDNSGAPNVTIKPPGLTDLEVTPPAVSGEDLKKASVIPTLQDPSSKEPSNSLNLPHSNEPCSTL 1554
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996
                          490
                   ....*....|
gi 1958759467 1555 AHPELSEVSS 1564
Cdd:PHA03247  2997 TGHSLSRVSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
1018-1408 9.47e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 9.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1018 SQPQSQQQQQQMMMMLMMQQDPKSIRLPVSQNVHPPR----GPLNPDSQRVPMQQSGNVPVMVSLQG--PASVPPSPDKQ 1091
Cdd:PHA03247  2574 APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGpappSPLPPDTHAPDPPPPSPSPAANEPDPhpPPTVPPPERPR 2653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1092 RMPMPvntPLGSNSRKMVYQESPQNSSSPLG--EMPSLPEAGGS-----EVPSVSGGPSNMPSHLVVSQNQLMMTGPKPG 1164
Cdd:PHA03247  2654 DDPAP---GRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSltslaDPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1165 PSPLSATQGATPQQPPVNSLPsshghhfpnvAAPTQTSRPKTPNrASPRPYYPQTPNNRPP--STEPSEISLSPERLNAS 1242
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATP----------GGPARPARPPTTA-GPPAPAPPAAPAAGPPrrLTRPAVASLSESRESLP 2799
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1243 IAGLFPPQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSnltvsnPPNFAAPQAHKLDSVVVSSGKQSNPGTTKRASP 1322
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP------PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1323 SNSRRSSPGSSRKTTPSPGRQNS---------KAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNNNGLN 1393
Cdd:PHA03247  2874 KPAAPARPPVRRLARPAVSRSTEsfalppdqpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
                          410
                   ....*....|....*
gi 1958759467 1394 SQNPTVPAPAVGTVV 1408
Cdd:PHA03247  2954 EPSGAVPQPWLGALV 2968
PHA03247 PHA03247
large tegument protein UL36; Provisional
1670-1937 1.58e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1670 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1746
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1747 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1820
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1821 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllkmTSSPMGPSSTSTGPIL 1900
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT----TDPAGAGEPSGAVPQP 2962
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759467 1901 PGGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1937
Cdd:PHA03247  2963 WLGALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1494-1813 2.00e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1494 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1566
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1567 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1644
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1645 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1718
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1719 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1795
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759467 1796 PGSSANRRSPVSSSKGKG 1813
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1103-1409 2.69e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1103 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1178
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1179 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1254
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1255 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1331
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759467 1332 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1409
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
PHA03247 PHA03247
large tegument protein UL36; Provisional
199-579 2.75e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGWNQL--PSGALQPPPAQGSlGPMTTNQGWKK 342
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRAARPTvgSLTSLADPPPPPP-TPEPAPHALVS 2717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  343 A-PLPSPMQAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTnfPQMSNPGQFTAPQMKSlqGGPSRVPTPLQQPHLTN 421
Cdd:PHA03247  2718 AtPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPA--AGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  422 KSPA----SSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPLsQGFQQPVSSPGRNPMVQQGNVPPNFMVM----QQQPP 493
Cdd:PHA03247  2794 SRESlpspWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPA 2872
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  494 SQGPPSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTAATPGNSGAlQLQANQSVQHAgGQGAGPPQNQMQVSHGPP 573
Cdd:PHA03247  2873 AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQP-QPPPPPQPQPP-PPPPPRPQPPLAPTTDPA 2950

                   ....*.
gi 1958759467  574 NMMQPS 579
Cdd:PHA03247  2951 GAGEPS 2956
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
643-772 3.28e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  643 PMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMM---APHnQMMGPQGQVLLQQNPMIEQIMTnqMQG 719
Cdd:TIGR01628  355 PLYVALAQRKEQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYygqGPQ-QQFNGQPLGWPRMSMMPTPMGP--GGP 431
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958759467  720 NKAQFNSQNQSNVMPGPAQIMRGPTPNMQGnmVQFTGQMSGQMLPQQGPVSNS 772
Cdd:TIGR01628  432 LRPNGLAPMNAVRAPSRNAQNAAQKPPMQP--VMYPPNYQSLPLSQDLPQPQS 482
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
1159-1316 4.24e-03

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 42.12  E-value: 4.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1159 TGPKPGPSPlSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRP---------YYPQTPNNRPPSTEP 1229
Cdd:pfam08580  511 TATSETPTP-ALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPsrssrssstLPPVSPLSRDKSRSP 589
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1230 SEISLSPERLNASIAGLFPPQINIPLPPRPNLNrgfdqqglNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVVSSG 1309
Cdd:pfam08580  590 APTCRSVSRASRRRASRKPTRIGSPNSRTSLLD--------EPPYPKLTLSKGLPRTPRNRQSYAGTSPSRSVSVSSGLG 661

                   ....*..
gi 1958759467 1310 KQSNPGT 1316
Cdd:pfam08580  662 PQTRPGT 668
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1528-1898 5.07e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1528 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1607
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1608 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1685
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1686 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1762
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467 1763 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1839
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759467 1840 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1898
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
711-823 5.80e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 5.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  711 QIMTNQMQGNKAQFNSQNQSNVMPG--PAQIMRGPTPNMQGNMVQFTGQMSGQMLPQQGPvsNSPSQVMGIQ--GQVLRP 786
Cdd:TIGR01628  369 AHLQDQFMQLQPRMRQLPMGSPMGGamGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP--GGPLRPNGLApmNAVRAP 446
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1958759467  787 PGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQQTS 823
Cdd:TIGR01628  447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST 483
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
671-837 6.05e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 6.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  671 QRMTPPKQMLPQQGPQMMAPH--NQMMGPQGQVLL-QQNPmieqimtnQMQGNKAQFNSQNQSNvMPGPAQIMRGPTPNM 747
Cdd:TIGR01628  366 QRRAHLQDQFMQLQPRMRQLPmgSPMGGAMGQPPYyGQGP--------QQQFNGQPLGWPRMSM-MPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759467  748 QGNMVQFTGQMSGQmlpQQGPVSNSPsqvMGIQGQVLRPPGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQ-QTSMVP 826
Cdd:TIGR01628  437 LAPMNAVRAPSRNA---QNAAQKPPM---QPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQmQKQVLG 510
                          170
                   ....*....|....*
gi 1958759467  827 ----PHVQSMQGNSA 837
Cdd:TIGR01628  511 erlfPLVEAIEPALA 525
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
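
Every hit listed above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search reports an E-value threshold of 0.01. The Python sketch below shows one way to parse those summary lines from a saved plain-text copy of this page and keep the hits under a chosen threshold. The names `SUMMARY_RE`, `parse_summary`, and `hits_below_threshold` are hypothetical helpers for illustration and are not part of any NCBI tool or API. Treating the threshold as a strict upper bound is also an assumption.

```python
import re

# Matches the per-hit summary line printed above each alignment block.
# The "[Multi-domain]" flag is optional, so it sits in an optional group.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is strictly below the threshold (assumed)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] < threshold]


if __name__ == "__main__":
    # Two summary lines copied from the hit list above.
    sample = [
        "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 2.00e-03",
        "Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 6.05e-03",
    ]
    for hit in hits_below_threshold(sample):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Lines that do not look like a summary line (alignment rows, descriptions, blank lines) are skipped, so the whole page text can be passed in line by line.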

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.