Conserved domains on [gi|968021628|ref|XP_015012543|]

uncharacterized protein LOC6547211 isoform X2 [Drosophila erecta]


List of domain hits

Name | Accession | Description | Interval | E-value
DUF4739 | pfam15893 | Domain of unknown function (DUF4739); This presumed domain is functionally uncharacterized. ... | 516-744 | 1.78e-123

Domain of unknown function (DUF4739); This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is typically between 138 and 167 amino acids in length.



Pssm-ID: 406347  Cd Length: 235  Bit Score: 381.01  E-value: 1.78e-123
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   516 SGNDVTNLLSKSSRGNDTFQSTTLVREQQNKVETSTHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP 595
Cdd:pfam15893    1 SGNDVTNLLSKSSRGNDTFQSTTLVREQQFKVETSSHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   596 LYYEEKQRKTVEKSRSFRNYEDEP---VDSTAVHNNMPSLPDLSLNFRVPAFYKQA-----PQSPCSPISPNAKCVSLGF 667
Cdd:pfam15893   81 LYYEEKQRKTVEKSRSFRNYQEEReatVDSISVHNNMPSLPDLSLSFRVPSYYKQQqqqqhPQSPCSPMSPNAKCVSLGF 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 968021628   668 EINDNKLLQARAESTGQLPSAAKISPRKaaSTKLIVSSPGSADISQIEQNIDLIVKSPMVNVLRKSGNVAEKLPKEQ 744
Cdd:pfam15893  161 EINDNKLLQSRAESTGQLSTKTKMSPQH--STKLLVTSPGPANISQIEQNIDMIVKSPLVSVLRKSATVGEHLSPEQ 235
PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 737-1102 | 4.64e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 4.64e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  737 AEKLPKEQTTPTKQRPKLLDLGAKMTAPAANPTPSPASPLTKSAKLSNSPPTTPAKGQGSSNGSRRNSSNVenTGEPEFM 816
Cdd:PHA03247 2574 APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP--TVPPPER 2651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  817 KIQLNRVDQARLHNKTNHLVLAKNFKSPTERSQsnddlsfRRNSGENLAGIEIVEHPQPKPLSPGVRPTILEPSLSRQL- 895
Cdd:PHA03247 2652 PRDDPAPGRVSRPRRARRLGRAAQASSPPQRPR-------RRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPg 2724
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  896 -----KSVSATSMRNSYGSSSEGLATPVTPAT--TPNTPKTPKSNGFAKIPPSAPknvkevkdakEPVSPKEPVKSNRLS 968
Cdd:PHA03247 2725 paaarQASPALPAAPAPPAVPAGPATPGGPARpaRPPTTAGPPAPAPPAAPAAGP----------PRRLTRPAVASLSES 2794
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  969 LEERKRLFLSEERLKTVEQRRKSITKAENTA---PPTPLVIPATPAIPVTPVTPTSPEI--------FEVNGNTSPTSNP 1037
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAgplPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdVRRRPPSRSPAAK 2874
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 968021628 1038 VVLRKKSFASCNGNPSPSK---------DDPTPELMKVFARRSLKVKDDEVVTLPQVQPPAPVSNSKKLSPSGG 1102
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRstesfalppDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
DUF4592 super family | cl21119 | Domain of unknown function (DUF4592); This protein family is a domain of unknown function, ... | 127-257 | 2.99e-03

Domain of unknown function (DUF4592); This family represents a domain of unknown function located toward the N-terminus of the protein. It is found in eukaryotes and is typically between 114 and 130 amino acids in length. There are two completely conserved residues (L and A) that may be functionally important. In humans, the gene encoding this protein is annotated as chromosome 2 open reading frame 55.


The actual alignment was detected with superfamily member pfam15262:

Pssm-ID: 464602 [Multi-domain]  Cd Length: 127  Bit Score: 38.94  E-value: 2.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   127 KRRNRPDASDEDLGLPRSP--ASPQRRAAGGTSSRNTSGVGHqsrghqseVSSLSLHSVNSGEVETEDSQSSHQYRHHSR 204
Cdd:pfam15262   15 KRADDAGASSEDDGLPRSPpeISLLHEILLSTESKSSDPPQH--------LSSLSLAGTGSEEEEQSTPVKSSRPKSPFS 86
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 968021628   205 VSTSSSISmgsdilrshhedDVDAVGGERHRLSHAAAKHKMAIRPVKKKGPTR 257
Cdd:pfam15262   87 PSGTIEPI------------NFDAPPQALACLDNSAAKHKLSVKPKNQRASRK 127
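The pairwise rows above can be reduced to a percent-identity figure. The following is a minimal Python sketch and is not part of the CD-Search output. It assumes that lowercase letters mark unaligned residues and that `-` marks a gap, so it counts only columns where both rows carry an uppercase residue. The function name `percent_identity` is hypothetical. The example strings are the first row of the DUF4739 (pfam15893) alignment, query residues 516-595 against model positions 1-80.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical, aligned, percent) for two equal-length alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# First row of the DUF4739 alignment, copied from the listing above.
QUERY_ROW = "SGNDVTNLLSKSSRGNDTFQSTTLVREQQNKVETSTHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP"
MODEL_ROW = "SGNDVTNLLSKSSRGNDTFQSTTLVREQQFKVETSSHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP"

if __name__ == "__main__":
    ident, aligned, pct = percent_identity(QUERY_ROW, MODEL_ROW)
    print(f"{ident}/{aligned} identical ({pct:.1f}%)")
```

To score a whole hit rather than one row, concatenate the query rows and the model rows of every block in the same order before calling the function.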
 
 
Name | Accession | Description | Interval | E-value
DUF4739 | pfam15893 | Domain of unknown function (DUF4739); This presumed domain is functionally uncharacterized. ... | 516-744 | 1.78e-123

Domain of unknown function (DUF4739); This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is typically between 138 and 167 amino acids in length.


Pssm-ID: 406347  Cd Length: 235  Bit Score: 381.01  E-value: 1.78e-123
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   516 SGNDVTNLLSKSSRGNDTFQSTTLVREQQNKVETSTHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP 595
Cdd:pfam15893    1 SGNDVTNLLSKSSRGNDTFQSTTLVREQQFKVETSSHFERKPRIVGLSAFQQKLSRSSDSVGQHSSSSNSLETSTDEPSP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   596 LYYEEKQRKTVEKSRSFRNYEDEP---VDSTAVHNNMPSLPDLSLNFRVPAFYKQA-----PQSPCSPISPNAKCVSLGF 667
Cdd:pfam15893   81 LYYEEKQRKTVEKSRSFRNYQEEReatVDSISVHNNMPSLPDLSLSFRVPSYYKQQqqqqhPQSPCSPMSPNAKCVSLGF 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 968021628   668 EINDNKLLQARAESTGQLPSAAKISPRKaaSTKLIVSSPGSADISQIEQNIDLIVKSPMVNVLRKSGNVAEKLPKEQ 744
Cdd:pfam15893  161 EINDNKLLQSRAESTGQLSTKTKMSPQH--STKLLVTSPGPANISQIEQNIDMIVKSPLVSVLRKSATVGEHLSPEQ 235
PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 737-1102 | 4.64e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 4.64e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  737 AEKLPKEQTTPTKQRPKLLDLGAKMTAPAANPTPSPASPLTKSAKLSNSPPTTPAKGQGSSNGSRRNSSNVenTGEPEFM 816
Cdd:PHA03247 2574 APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP--TVPPPER 2651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  817 KIQLNRVDQARLHNKTNHLVLAKNFKSPTERSQsnddlsfRRNSGENLAGIEIVEHPQPKPLSPGVRPTILEPSLSRQL- 895
Cdd:PHA03247 2652 PRDDPAPGRVSRPRRARRLGRAAQASSPPQRPR-------RRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPg 2724
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  896 -----KSVSATSMRNSYGSSSEGLATPVTPAT--TPNTPKTPKSNGFAKIPPSAPknvkevkdakEPVSPKEPVKSNRLS 968
Cdd:PHA03247 2725 paaarQASPALPAAPAPPAVPAGPATPGGPARpaRPPTTAGPPAPAPPAAPAAGP----------PRRLTRPAVASLSES 2794
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  969 LEERKRLFLSEERLKTVEQRRKSITKAENTA---PPTPLVIPATPAIPVTPVTPTSPEI--------FEVNGNTSPTSNP 1037
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAgplPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdVRRRPPSRSPAAK 2874
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 968021628 1038 VVLRKKSFASCNGNPSPSK---------DDPTPELMKVFARRSLKVKDDEVVTLPQVQPPAPVSNSKKLSPSGG 1102
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRstesfalppDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 909-1120 | 2.44e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.00  E-value: 2.44e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  909 SSSEGLATPVTPATTPNTPKTPKSNGF---------------AKIP-----PSAPKNVKEVKDAKEPVSPKEPVKSNR-L 967
Cdd:PTZ00449  528 EGEEGEHEDSKESDEPKEGGKPGETKEgevgkkpgpakehkpSKIPtlskkPEFPKDPKHPKDPEEPKKPKRPRSAQRpT 607
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628  968 SLEERKRLFLSEerLKTVEQRRKSITKAENTAPPTPLVIPATPAIPVTPVTPTSPEifevngNTSPTSNPvVLRKKSFAS 1047
Cdd:PTZ00449  608 RPKSPKLPELLD--IPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPK------SPKPPFDP-KFKEKFYDD 678
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 968021628 1048 CNGNPSPSKDDPTPELMKVFARRSLKVKDDEV----VTLPQVQPPA-PVSNSKKLSPSGGGGQSVDSDKENQSNSEEK 1120
Cdd:PTZ00449  679 YLDAAAKSKETKTTVVLDESFESILKETLPETpgtpFTTPRPLPPKlPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEE 756
DUF4592 | pfam15262 | Domain of unknown function (DUF4592); This protein family is a domain of unknown function, ... | 127-257 | 2.99e-03

Domain of unknown function (DUF4592); This family represents a domain of unknown function located toward the N-terminus of the protein. It is found in eukaryotes and is typically between 114 and 130 amino acids in length. There are two completely conserved residues (L and A) that may be functionally important. In humans, the gene encoding this protein is annotated as chromosome 2 open reading frame 55.


Pssm-ID: 464602 [Multi-domain]  Cd Length: 127  Bit Score: 38.94  E-value: 2.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 968021628   127 KRRNRPDASDEDLGLPRSP--ASPQRRAAGGTSSRNTSGVGHqsrghqseVSSLSLHSVNSGEVETEDSQSSHQYRHHSR 204
Cdd:pfam15262   15 KRADDAGASSEDDGLPRSPpeISLLHEILLSTESKSSDPPQH--------LSSLSLAGTGSEEEEQSTPVKSSRPKSPFS 86
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 968021628   205 VSTSSSISmgsdilrshhedDVDAVGGERHRLSHAAAKHKMAIRPVKKKGPTR 257
Cdd:pfam15262   87 PSGTIEPI------------NFDAPPQALACLDNSAAKHKLSVKPKNQRASRK 127
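Several hits fall on neighbouring stretches of the query, so it can help to check where their intervals overlap. The sketch below hard-codes the four intervals and E-values from the hit list above and reports pairwise overlaps in residues. The `Hit` class and the `overlap` and `overlapping_pairs` functions are illustrative names, not part of any NCBI tool.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Hit:
    name: str
    start: int      # first query residue, 1-based inclusive
    end: int        # last query residue, inclusive
    evalue: float


# Intervals and E-values transcribed from the hit list above.
HITS = [
    Hit("DUF4739", 516, 744, 1.78e-123),
    Hit("PHA03247", 737, 1102, 4.64e-08),
    Hit("PTZ00449", 909, 1120, 2.44e-06),
    Hit("DUF4592", 127, 257, 2.99e-03),
]


def overlap(a: Hit, b: Hit) -> int:
    """Number of query residues shared by two inclusive intervals."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def overlapping_pairs(hits):
    """Return (name, name, shared_residues) for every overlapping pair."""
    return [
        (a.name, b.name, overlap(a, b))
        for a, b in combinations(hits, 2)
        if overlap(a, b) > 0
    ]


if __name__ == "__main__":
    for first, second, shared in overlapping_pairs(HITS):
        print(f"{first} / {second}: {shared} shared residues")
```

With these values the only overlapping pairs are DUF4739 / PHA03247 (8 residues, 737-744) and PHA03247 / PTZ00449 (194 residues, 909-1102).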
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
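The per-hit summary lines (`Pssm-ID: ... E-value: ...`) have a regular shape, so they can be pulled into records and checked against the E-value threshold listed here. This is a rough sketch: the regular expression is inferred only from the four summary lines on this page, and `parse_summary` is a hypothetical helper rather than an NCBI API.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not a documented format).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["tag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "evalue": float(m["evalue"]),
    }


# Summary lines copied from the hits reported on this page.
LINES = [
    "Pssm-ID: 406347  Cd Length: 235  Bit Score: 381.01  E-value: 1.78e-123",
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 4.64e-08",
    "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.00  E-value: 2.44e-06",
    "Pssm-ID: 464602 [Multi-domain]  Cd Length: 127  Bit Score: 38.94  E-value: 2.99e-03",
]

E_VALUE_THRESHOLD = 0.01  # from the preset options above

if __name__ == "__main__":
    for line in LINES:
        record = parse_summary(line)
        passed = record["evalue"] <= E_VALUE_THRESHOLD
        print(record["pssm_id"], record["evalue"], "passes" if passed else "fails")
```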
