Conserved domains on  [gi|1832484274|ref|NP_001368797|]

BRCA2-interacting transcriptional repressor EMSY isoform 2 [Mus musculus]


List of domain hits

Name: ENT super family  Accession: cl04239  Interval: 17-59  E-value: 1.41e-13

ENT domain; This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown function. This domain has a completely conserved histidine residue that may be functionally important.


The actual alignment was detected with superfamily member pfam03735:

Pssm-ID: 461032  Cd Length: 71  Bit Score: 66.82  E-value: 1.41e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1832484274   17 KRILRKLELEAYAGVISALRAQGD-LTKEKKDLLGELSKVLSMS 59
Cdd:pfam03735    1 KRQLRKLELEAYSSVLRAFRAQGDaLSWEKEKLLTELRKELNIS 44
Name: PABP-1234 super family  Accession: cl31127  Interval: 861-1004  E-value: 1.66e-04

Polyadenylate binding protein, human types 1, 2, 3, 4 family. These eukaryotic proteins recognize the poly-A tail of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


The actual alignment was detected with superfamily member TIGR01628:

Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.95  E-value: 1.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  861 KQQKLSQPQLEQTQLQVKTLQCFQTKQKQTIHLQ-ADQLQHKLTQMPQLSIRHQKLNPL---QQEQAQPKPDAQHTQHTV 936
Cdd:TIGR01628  364 KEQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQpPYYGQGPQQQFNGQPLGWPRMSMMptpMGPGGPLRPNGLAPMNAV 443
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1832484274  937 VAKDRQLPTLMAQPPQTVVQvlAVKTTQQLPKLQQAPNQPKIYVQpqtpQSQMALPSSEKQPASQVEQ 1004
Cdd:TIGR01628  444 RAPSRNAQNAAQKPPMQPVM--YPPNYQSLPLSQDLPQPQSTASQ----GGQNKKLAQVLASATPQMQ 505
Name: COG3889 super family  Accession: cl28569  Interval: 394-495  E-value: 2.98e-04

Extracellular solute-binding protein, contains Ig-fold domain [General function prediction only];


The actual alignment was detected with superfamily member COG3889:

Pssm-ID: 443097 [Multi-domain]  Cd Length: 878  Bit Score: 45.02  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  394 LPKPVTATLPTSSNSPIMVVSSNGAIM----TTKLVTTPTGTQATYTRPTVSPSLGRVATT-----------PGAATYVK 458
Cdd:COG3889    758 LPAEVTAKLSPGSYTLVVIAYSELVALpaiyTTSFIVTPAVTIAVSTTIQTLTDTDTTTTTsakttpslttaATTTTTST 837
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1832484274  459 TTSGSIITVVPKSLATLGGKIISSNIVSGTTTKITTI 495
Cdd:COG3889    838 TTSTTSVTTTAAALASAATAAIVVGVVAVAVAIATAI 874
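The pairwise alignments above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues in the ENT alignment (query residues 17-59 against pfam03735 positions 1-44). The function name `identity_counts` is hypothetical, and the sketch assumes that columns containing a `-` gap are excluded and that letter case is ignored when comparing residues.

```python
def identity_counts(query: str, subject: str) -> tuple[int, int]:
    """Return (identical_columns, aligned_columns) for two aligned rows.

    Assumption: a column counts as aligned only if neither row has a
    '-' gap there; residues are compared case-insensitively.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    pairs = [
        (q, s)
        for q, s in zip(query.upper(), subject.upper())
        if q != "-" and s != "-"
    ]
    identical = sum(1 for q, s in pairs if q == s)
    return identical, len(pairs)


# Alignment rows copied from the ENT hit listed above.
ent_query = "KRILRKLELEAYAGVISALRAQGD-LTKEKKDLLGELSKVLSMS"
ent_subject = "KRQLRKLELEAYSSVLRAFRAQGDaLSWEKEKLLTELRKELNIS"

identical, aligned = identity_counts(ent_query, ent_subject)
percent_identity = 100.0 * identical / aligned
print(f"{identical}/{aligned} identical columns ({percent_identity:.1f}%)")
```

Percent identity is only a rough descriptor here; the bit score and E-value reported for each hit remain the statistics that CD-Search itself uses.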
 
Name: ENT  Accession: pfam03735  Interval: 17-59  E-value: 1.41e-13

ENT domain; This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown function. This domain has a completely conserved histidine residue that may be functionally important.


Pssm-ID: 461032  Cd Length: 71  Bit Score: 66.82  E-value: 1.41e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1832484274   17 KRILRKLELEAYAGVISALRAQGD-LTKEKKDLLGELSKVLSMS 59
Cdd:pfam03735    1 KRQLRKLELEAYSSVLRAFRAQGDaLSWEKEKLLTELRKELNIS 44
Name: PABP-1234  Accession: TIGR01628  Interval: 861-1004  E-value: 1.66e-04

Polyadenylate binding protein, human types 1, 2, 3, 4 family. These eukaryotic proteins recognize the poly-A tail of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.95  E-value: 1.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  861 KQQKLSQPQLEQTQLQVKTLQCFQTKQKQTIHLQ-ADQLQHKLTQMPQLSIRHQKLNPL---QQEQAQPKPDAQHTQHTV 936
Cdd:TIGR01628  364 KEQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQpPYYGQGPQQQFNGQPLGWPRMSMMptpMGPGGPLRPNGLAPMNAV 443
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1832484274  937 VAKDRQLPTLMAQPPQTVVQvlAVKTTQQLPKLQQAPNQPKIYVQpqtpQSQMALPSSEKQPASQVEQ 1004
Cdd:TIGR01628  444 RAPSRNAQNAAQKPPMQPVM--YPPNYQSLPLSQDLPQPQSTASQ----GGQNKKLAQVLASATPQMQ 505
Name: COG3889  Accession: COG3889  Interval: 394-495  E-value: 2.98e-04

Extracellular solute-binding protein, contains Ig-fold domain [General function prediction only];


Pssm-ID: 443097 [Multi-domain]  Cd Length: 878  Bit Score: 45.02  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  394 LPKPVTATLPTSSNSPIMVVSSNGAIM----TTKLVTTPTGTQATYTRPTVSPSLGRVATT-----------PGAATYVK 458
Cdd:COG3889    758 LPAEVTAKLSPGSYTLVVIAYSELVALpaiyTTSFIVTPAVTIAVSTTIQTLTDTDTTTTTsakttpslttaATTTTTST 837
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1832484274  459 TTSGSIITVVPKSLATLGGKIISSNIVSGTTTKITTI 495
Cdd:COG3889    838 TTSTTSVTTTAAALASAATAAIVVGVVAVAVAIATAI 874
Name: PRK10263  Accession: PRK10263  Interval: 897-1073  E-value: 6.94e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 6.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  897 QLQHKLTQMPQLSIRHQKLNPLQQEQAQPKPDAQHTQHTvvakdrqlptlmaQPPQTVVQVLAVKTTQQLPKLQQAPNQP 976
Cdd:PRK10263   754 QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ-------------QPQQPVAPQPQYQQPQQPVAPQPQYQQP 820
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1832484274  977 KIYVQPQtPQSQMALPSSEKQPASQVEQPIITQgssvtkitfEGRQPPTVTKITGGSSVPKLTSPVTSISPIQASEKTAV 1056
Cdd:PRK10263   821 QQPVAPQ-PQYQQPQQPVAPQPQDTLLHPLLMR---------NGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQM 890
                          170
                   ....*....|....*..
gi 1832484274 1057 SDILQMSLMEAQIDTNV 1073
Cdd:PRK10263   891 ARLVEARLADFRIKADV 907
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd  Low complexity filter: no  Composition Based Adjustment: yes  E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.