Conserved domains on [gi|970949411|ref|NP_001305168|]

zinc finger protein 821 isoform 4 [Homo sapiens]


List of domain hits

Name Accession Description Interval E-value
COG2433 super family cl43687
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 8.66e-06


The actual alignment was detected with superfamily member COG2433:

Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.39  E-value: 8.66e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 970949411 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
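The interval and the alignment rows above can be cross-checked: the number of residues in each row, ignoring gap characters, should equal the length of the inclusive coordinate range printed beside it. The following is a minimal Python sketch of that check, with both rows copied from the COG2433 alignment above. The helper names `residue_count` and `span` are illustrative and are not part of any NCBI tool.

```python
# Rows copied from the COG2433 alignment above
# (query residues 110-184, profile positions 432-503).
QUERY = "LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE"
SUBJECT = "LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK"


def residue_count(row: str) -> int:
    """Count residues in an alignment row, skipping '-' gap characters."""
    return sum(1 for ch in row if ch != "-")


def span(start: int, end: int) -> int:
    """Number of positions covered by an inclusive interval."""
    return end - start + 1


# Both rows must contain the same number of alignment columns.
assert len(QUERY) == len(SUBJECT)

# Each row's residue count should match its printed coordinate range.
query_ok = residue_count(QUERY) == span(110, 184)
subject_ok = residue_count(SUBJECT) == span(432, 503)
print(query_ok, subject_ok)
```

The same check applies to any other alignment block in this report once its rows are joined across the wrapped lines.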
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 8.66e-06


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.39  E-value: 8.66e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 970949411 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins, which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates the insertion of multi-pass membrane proteins into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is a component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER and is required for proper protein degradation via the ERAD (ER-associated degradation) pathway.
143-204 1.80e-04


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.17  E-value: 1.80e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 970949411  143 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 204
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
125-237 4.45e-04


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 41.03  E-value: 4.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  125 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 192
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 970949411  193 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 237
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
PTZ00121 PTZ00121
MAEBL; Provisional
122-208 1.04e-03


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 40.51  E-value: 1.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  122 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 196
Cdd:PTZ00121 1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                          90
                  ....*....|..
gi 970949411  197 REAKRLKRRLEK 208
Cdd:PTZ00121 1286 AEEKKKADEAKK 1297
 
Name Accession Description Interval E-value
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
110-184 8.66e-06


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.39  E-value: 8.66e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 970949411 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmRDREAKRLQRMQETDEQRARRLQRDREamRLKRANE 184
Cdd:COG2433  432 LEAELEEKDERIERLERELSEARSEERREIRKDREISR-LDREIERLERELEEERERIEELKRKLE--RLKELWK 503
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
92-216 8.78e-05


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.60  E-value: 8.78e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  92 AAYRKLLEAQTPSVRKWALRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQ 171
Cdd:COG4717  119 EKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLA 198
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 970949411 172 RDREAMRLKRANETPEKRQARlireREAKRLKRRLEKMDMMLRAQ 216
Cdd:COG4717  199 EELEELQQRLAELEEELEEAQ----EELEELEEELEQLENELEAA 239
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
109-214 1.45e-04


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 42.92  E-value: 1.45e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411 109 ALRRQNEPLEVRlqRLERERTAKKSRRDNETPEEREVRRMRDR------EAKRLQRMQETDEQRARRLQRDREAMRLKRA 182
Cdd:COG2433  381 ALEELIEKELPE--EEPEAEREKEHEERELTEEEEEIRRLEEQverleaEVEELEAELEEKDERIERLERELSEARSEER 458
                         90       100       110
                 ....*....|....*....|....*....|..
gi 970949411 183 NETPEKRQARlIREREAKRLKRRLEKMDMMLR 214
Cdd:COG2433  459 REIRKDREIS-RLDREIERLERELEEERERIE 489
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins, which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates the insertion of multi-pass membrane proteins into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is a component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER and is required for proper protein degradation via the ERAD (ER-associated degradation) pathway.
143-204 1.80e-04


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.17  E-value: 1.80e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 970949411  143 REVRRMRDREAKRLQRMQETDEQRARRLQRD---REAMRLKRANETPEKRQARLIREREaKRLKR 204
Cdd:pfam07946 260 KKAKKTREEEIEKIKKAAEEERAEEAQEKKEeakKKEREEKLAKLSPEEQRKYEEKERK-KEQRK 323
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
125-237 4.45e-04


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 41.03  E-value: 4.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  125 ERERTAKKSR-RDNETPEEREVRRMRDREAKR-------LQRMQETDEQRARRLQRDREAMRLK----RANETPEKRQAR 192
Cdd:TIGR01642   4 EPDREREKSRgRDRDRSSERPRRRSRDRSRFRdrhrrsrERSYREDSRPRDRRRYDSRSPRSLRyssvRRSRDRPRRRSR 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 970949411  193 LI------REREAKRLKRRLEKMDMMLRAQFGQDPSAM----AALAAEMNFFQLP 237
Cdd:TIGR01642  84 SVrsieqhRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYelvtADQAKASQVFSVP 138
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
99-220 4.82e-04


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 41.00  E-value: 4.82e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  99 EAQTPSVRKWALRRQNEPLEVRLQRLERErtAKKSRRDNETpEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMR 178
Cdd:COG2433  393 EEPEAEREKEHEERELTEEEEEIRRLEEQ--VERLEAEVEE-LEAELEEKDERIERLERELSEARSEERREIRKDREISR 469
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 970949411 179 LKRANETPEKRQARLirEREAKRLKRRLEKMDMMLRAQFGQD 220
Cdd:COG2433  470 LDREIERLERELEEE--RERIEELKRKLERLKELWKLEHSGE 509
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
113-208 9.37e-04


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 40.49  E-value: 9.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  113 QNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRM----QETDEQRARRLQRDREAMRLKRA------ 182
Cdd:pfam17380 416 QQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVerlrQQEEERKRKKLELEKEKRDRKRAeeqrrk 495
                          90       100
                  ....*....|....*....|....*....
gi 970949411  183 ---NETPEKRQARLIREREAKRLKRRLEK 208
Cdd:pfam17380 496 ileKELEERKQAMIEEERKRKLLEKEMEE 524
PTZ00121 PTZ00121
MAEBL; Provisional
122-208 1.04e-03


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 40.51  E-value: 1.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  122 QRLERERTAKKSRRDNETPEEREVRRMRD-----REAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIRE 196
Cdd:PTZ00121 1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKK 1285
                          90
                  ....*....|..
gi 970949411  197 REAKRLKRRLEK 208
Cdd:PTZ00121 1286 AEEKKKADEAKK 1297
PTZ00121 PTZ00121
MAEBL; Provisional
112-208 3.24e-03


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 38.97  E-value: 3.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411  112 RQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRmrdreAKRLQRMQETDEQRARRLQRD-----REAMRLKRANETP 186
Cdd:PTZ00121 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK-----AEELKKAEEENKIKAAEEAKKaeedkKKAEEAKKAEEDE 1687
                          90       100
                  ....*....|....*....|..
gi 970949411  187 EKRQARLIREREAKRLKRRLEK 208
Cdd:PTZ00121 1688 KKAAEALKKEAEEAKKAEELKK 1709
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
110-216 3.94e-03


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 38.38  E-value: 3.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411 110 LRRQNEPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETpEKR 189
Cdd:COG1196  307 LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELE-ELA 385
                         90       100
                 ....*....|....*....|....*..
gi 970949411 190 QARLIREREAKRLKRRLEKMDMMLRAQ 216
Cdd:COG1196  386 EELLEALRAAAELAAQLEELEEAEEAL 412
PTZ00121 PTZ00121
MAEBL; Provisional
99-208 5.04e-03


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 38.20  E-value: 5.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411   99 EAQTPSVRKWALRRQNEplevrLQRLERERTAKKSRRDNETPEEREVRRM----RDREAKRLQRMQETDEqrARRLQRDR 174
Cdd:PTZ00121 1110 KAEEARKAEEAKKKAED-----ARKAEEARKAEDARKAEEARKAEDAKRVeiarKAEDARKAEEARKAED--AKKAEAAR 1182
                          90       100       110
                  ....*....|....*....|....*....|....
gi 970949411  175 EAMRLKRANETPEKRQARLIREREAKRLKRRLEK 208
Cdd:PTZ00121 1183 KAEEVRKAEELRKAEDARKAEAARKAEEERKAEE 1216
PRK00247 PRK00247
putative inner membrane protein translocase component YidC; Validated
106-191 8.35e-03


Pssm-ID: 178945 [Multi-domain]  Cd Length: 429  Bit Score: 37.14  E-value: 8.35e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411 106 RKWALRRQN----------EPLEVRLQRLERERTAKKSRRDNETPEEREVRRMRDREA---KRLQRMQETDEQRARRLQR 172
Cdd:PRK00247 304 FLWTLRRNRlrmiitpwraPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREInreARQERAAAMARARARRAAV 383
                         90
                 ....*....|....*....
gi 970949411 173 DREAMRLKRANETPEKRQA 191
Cdd:PRK00247 384 KAKKKGLIDASPNEDTPSE 402
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
120-233 8.46e-03


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 37.22  E-value: 8.46e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411 120 RLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLIREREA 199
Cdd:COG1196  285 EAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAE 364
                         90       100       110
                 ....*....|....*....|....*....|....
gi 970949411 200 KRLKRRLEKMDMMLRAQFGQDPSAMAALAAEMNF 233
Cdd:COG1196  365 EALLEAEAELAEAEEELEELAEELLEALRAAAEL 398
PRK12705 PRK12705
hypothetical protein; Provisional
120-216 8.74e-03


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 37.00  E-value: 8.74e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 970949411 120 RLQRLERERTAKKSRRDNETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDREAMRLKRANETPEKRQARLirerea 199
Cdd:PRK12705  27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKL------ 100
                         90
                 ....*....|....*..
gi 970949411 200 KRLKRRLEKMDMMLRAQ 216
Cdd:PRK12705 101 DNLENQLEEREKALSAR 117
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
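The E-value threshold of 0.01 explains why every hit listed in this report has an E-value below that cutoff. As a rough illustration, the Python sketch below applies the same kind of cutoff to the hits of the longest list above (name, accession, interval, E-value, copied from this report). The tuple layout and the function names `filter_hits` and `covered_span` are assumptions made for this example; they do not reflect how CD-Search itself is implemented.

```python
# Hits copied from the longest hit list in this report:
# (name, accession, start, end, e_value)
HITS = [
    ("COG2433", "COG2433", 110, 184, 8.66e-06),
    ("YhaN", "COG4717", 92, 216, 8.78e-05),
    ("COG2433", "COG2433", 109, 214, 1.45e-04),
    ("CCDC47", "pfam07946", 143, 204, 1.80e-04),
    ("U2AF_lg", "TIGR01642", 125, 237, 4.45e-04),
    ("COG2433", "COG2433", 99, 220, 4.82e-04),
    ("DUF5401", "pfam17380", 113, 208, 9.37e-04),
    ("PTZ00121", "PTZ00121", 122, 208, 1.04e-03),
    ("PTZ00121", "PTZ00121", 112, 208, 3.24e-03),
    ("Smc", "COG1196", 110, 216, 3.94e-03),
    ("PTZ00121", "PTZ00121", 99, 208, 5.04e-03),
    ("PRK00247", "PRK00247", 106, 191, 8.35e-03),
    ("Smc", "COG1196", 120, 233, 8.46e-03),
    ("PRK12705", "PRK12705", 120, 216, 8.74e-03),
]


def filter_hits(hits, threshold):
    """Keep hits whose E-value does not exceed the threshold."""
    return [hit for hit in hits if hit[4] <= threshold]


def covered_span(hits):
    """Return the smallest start and largest end over all hit intervals."""
    return min(hit[2] for hit in hits), max(hit[3] for hit in hits)


kept = filter_hits(HITS, 0.01)       # cutoff used in this report
strict = filter_hits(HITS, 1e-04)    # a stricter, hypothetical cutoff
print(len(kept), len(strict), covered_span(kept))
```

With the report's cutoff all listed hits are retained. A stricter cutoff such as 1e-04 keeps only the two lowest-E-value hits, which is one way to see how weak most of these matches are.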

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.