Conserved domains on  [gi|1039732645|ref|XP_017169366|]

androglobin isoform X6 [Mus musculus]


List of domain hits

Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
796-1208 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is a 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled A-H according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket, including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.



Pssm-ID: 412094  Cd Length: 416  Bit Score: 593.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  796 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 875
Cdd:cd22307      1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  876 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 954
Cdd:cd22307     81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  955 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 1034
Cdd:cd22307    161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1035 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 1114
Cdd:cd22307    241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1115 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 1193
Cdd:cd22307    321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                          410
                   ....*....|....*
gi 1039732645 1194 TVGKGQAVIPAFYFL 1208
Cdd:cd22307    401 ETGGGHVVIPVFRLL 415
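
The summary line and interval reported for each hit can be read programmatically. The following is a minimal sketch, assuming the line layout shown for this hit ("Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ..."); the function names parse_summary and span_length are illustrative and are not part of any NCBI tool. It also computes how much of the 416-position model is covered by the aligned model positions 1-415.

```python
import re

# Summary line copied from the cd22307 hit above.
SUMMARY = "Pssm-ID: 412094  Cd Length: 416  Bit Score: 593.38  E-value: 0e+00"

def parse_summary(line):
    """Split a hit summary line into typed fields (layout assumed from this page)."""
    pattern = (r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
               r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)")
    m = re.search(pattern, line)
    if m is None:
        raise ValueError("unrecognized summary line: " + line)
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }

def span_length(start, end):
    """Number of positions in an inclusive interval such as 796-1208."""
    return end - start + 1

hit = parse_summary(SUMMARY)
query_span = span_length(796, 1208)                      # residues of the query in the hit
model_coverage = span_length(1, 415) / hit["cd_length"]  # aligned fraction of the model
print(hit, query_span, round(model_coverage, 4))
```

The lazy `.*?` in the pattern skips the optional "[Multi-domain]" tag that appears on some summary lines of this page.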
CysPc super family cl00051
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. ...
200-307 1.47e-12

Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.


The actual alignment was detected with superfamily member cd00044:

Pssm-ID: 469591 [Multi-domain]  Cd Length: 315  Bit Score: 70.44  E-value: 1.47e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  200 NSYGKYVVKLYWMGCWRKITVDDFLPFDeeNNLLLPATSYEF-ELWPMLLSKAIIKLanvdvH--VAHRRELGELTVIHA 276
Cdd:cd00044    106 NYAGIYHFRFWKNGEWVEVVIDDRLPTS--NGGLLFMHSRDRnELWVALLEKAYAKL-----HgsYEALVGGNTAEALED 178
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1039732645  277 LTGWLPEVIPLHPAYVDRVWELLKEILPEFK 307
Cdd:cd00044    179 LTGGPTERIDLKSADASSGDNDLFALLLSFL 209
PTZ00121 super family cl31754
MAEBL; Provisional
1332-1640 1.82e-05

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 49.75  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1332 KDMEKMDIKAE--KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKE-KQAPRFEPQQVQMPTAVHSQQ 1408
Cdd:PTZ00121  1444 KKADEAKKKAEeaKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEaKKAAEAKKKADEAKKAEEAKK 1523
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1409 EDPnkpywiLRLVSEHTDSDYVDVKKDTERADEIRA---MKQAWETTEPGRAIKAAQARLKYLtqfiKKPVTTDTTTSAP 1485
Cdd:PTZ00121  1524 ADE------AKKAEEAKKADEAKKAEEKKKADELKKaeeLKKAEEKKKAEEAKKAEEDKNMAL----RKAEEAKKAEEAR 1593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1486 SPETLSVSQSQTKSSEEVVRQRSPTilETSPQQIRKALEFLDFSHYVRKTAAEAVLQTEELnkqqamQKAEEIHQFRQHR 1565
Cdd:PTZ00121  1594 IEEVMKLYEEEKKMKAEEAKKAEEA--KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEEL------KKAEEENKIKAAE 1665
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732645 1566 SRILSIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1640
Cdd:PTZ00121  1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
Peptidase_C2 super family cl47577
Calpain family cysteine protease;
90-255 1.15e-03

Calpain family cysteine protease;


The actual alignment was detected with superfamily member pfam00648:

Pssm-ID: 459889 [Multi-domain]  Cd Length: 295  Bit Score: 42.87  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645   90 FEDPE--------GKIELPQSLKVFSWKRPQDfifsrtpvvvkneitfdlfspnehlLCSElmrwiiseiyavWKIFNGG 161
Cdd:pfam00648    1 FEDPEfpaddsslGYPPSPPPPRGVEWKRPKE-------------------------ICSN------------PQFIVDG 43
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  162 ILSN-YHKGNLGELpilpwkpW-----------EHIysLCKAVKGHVPLFNSY-GKYVVKLYWMGCWRKITVDDFLPFde 228
Cdd:pfam00648   44 ASRFdICQGELGDC-------WllaaiasltlnPKL--LERVVPPDQSFEENYaGIFHFRFWRFGEWVDVVIDDRLPT-- 112
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039732645  229 ENNLLL---PATSYEFelWPMLLSKAIIKL 255
Cdd:pfam00648  113 RNGKLLfvhSRDKNEF--WSALLEKAYAKL 140
 
Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
796-1208 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is a 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled A-H according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket, including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.


Pssm-ID: 412094  Cd Length: 416  Bit Score: 593.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  796 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 875
Cdd:cd22307      1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  876 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 954
Cdd:cd22307     81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  955 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 1034
Cdd:cd22307    161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1035 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 1114
Cdd:cd22307    241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1115 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 1193
Cdd:cd22307    321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                          410
                   ....*....|....*
gi 1039732645 1194 TVGKGQAVIPAFYFL 1208
Cdd:cd22307    401 ETGGGHVVIPVFRLL 415
CysPc cd00044
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. ...
200-307 1.47e-12

Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.


Pssm-ID: 238004 [Multi-domain]  Cd Length: 315  Bit Score: 70.44  E-value: 1.47e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  200 NSYGKYVVKLYWMGCWRKITVDDFLPFDeeNNLLLPATSYEF-ELWPMLLSKAIIKLanvdvH--VAHRRELGELTVIHA 276
Cdd:cd00044    106 NYAGIYHFRFWKNGEWVEVVIDDRLPTS--NGGLLFMHSRDRnELWVALLEKAYAKL-----HgsYEALVGGNTAEALED 178
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1039732645  277 LTGWLPEVIPLHPAYVDRVWELLKEILPEFK 307
Cdd:cd00044    179 LTGGPTERIDLKSADASSGDNDLFALLLSFL 209
CysPc smart00230
Calpain-like thiol protease family; Calpain-like thiol protease family (peptidase family C2). ...
200-300 1.67e-10

Calpain-like thiol protease family; Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).


Pssm-ID: 128526  Cd Length: 318  Bit Score: 64.27  E-value: 1.67e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645   200 NSYGKYVVKLYWMGCWRKITVDDFLPFDeENNLLLPATSYEFELWPMLLSKAIIKLANVDVHvahrreLGELTVIHAL-- 277
Cdd:smart00230   98 NYAGIFHFRFWRFGKWVDVVIDDRLPTY-NGELVFMHSNSRNEFWSALLEKAYAKLNGCYEA------LKGGSTTEALed 170
                            90       100
                    ....*....|....*....|....*.
gi 1039732645   278 -TGWLPEVIPLHPAYVDR--VWELLK 300
Cdd:smart00230  171 lTGGVAESIDLKEASKDPdnLFEDLF 196
PTZ00121 PTZ00121
MAEBL; Provisional
1332-1640 1.82e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 49.75  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1332 KDMEKMDIKAE--KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKE-KQAPRFEPQQVQMPTAVHSQQ 1408
Cdd:PTZ00121  1444 KKADEAKKKAEeaKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEaKKAAEAKKKADEAKKAEEAKK 1523
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1409 EDPnkpywiLRLVSEHTDSDYVDVKKDTERADEIRA---MKQAWETTEPGRAIKAAQARLKYLtqfiKKPVTTDTTTSAP 1485
Cdd:PTZ00121  1524 ADE------AKKAEEAKKADEAKKAEEKKKADELKKaeeLKKAEEKKKAEEAKKAEEDKNMAL----RKAEEAKKAEEAR 1593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1486 SPETLSVSQSQTKSSEEVVRQRSPTilETSPQQIRKALEFLDFSHYVRKTAAEAVLQTEELnkqqamQKAEEIHQFRQHR 1565
Cdd:PTZ00121  1594 IEEVMKLYEEEKKMKAEEAKKAEEA--KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEEL------KKAEEENKIKAAE 1665
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732645 1566 SRILSIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1640
Cdd:PTZ00121  1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
Caldesmon pfam02029
Caldesmon;
1344-1640 5.51e-05

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 47.55  E-value: 5.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1344 HEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAPRFEPQQVQMPTAVHSQQE-DPNKPYWILRLVS 1422
Cdd:pfam02029   23 KEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEALERQKEfDPTIADEKESVAE 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1423 ----------------EHTDSDYVDVKKDTERADEIRAMKQAWETTEPGRAIKAAQARLKyltQFIKKPVTTDTTTSAPS 1486
Cdd:pfam02029  103 rkenneeeensswekeEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQAEEEGEEEEDK---SEEAEEVPTENFAKEEV 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1487 PETLSVSQSQTKSSEEVVRQRSPTILETSPQQIRKALEFLDFSHYVRKTAAEAVLQTEELNKQQ--AMQKAEEIHQFRQH 1564
Cdd:pfam02029  180 KDEKIKKEKKVKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFleAEQKLEELRRRRQE 259
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039732645 1565 RSRilsirdidQE-ERFKQKDEvlemygEMRDSVDEARQKildiREvYRNKLLEAERLRMEAlAAQEAAVKIEIEKK 1640
Cdd:pfam02029  260 KES--------EEfEKLRQKQQ------EAELELEELKKK----RE-ERRKLLEEEEQRRKQ-EEAERKLREEEEKR 316
Peptidase_C2 pfam00648
Calpain family cysteine protease;
90-255 1.15e-03

Calpain family cysteine protease;


Pssm-ID: 459889 [Multi-domain]  Cd Length: 295  Bit Score: 42.87  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645   90 FEDPE--------GKIELPQSLKVFSWKRPQDfifsrtpvvvkneitfdlfspnehlLCSElmrwiiseiyavWKIFNGG 161
Cdd:pfam00648    1 FEDPEfpaddsslGYPPSPPPPRGVEWKRPKE-------------------------ICSN------------PQFIVDG 43
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  162 ILSN-YHKGNLGELpilpwkpW-----------EHIysLCKAVKGHVPLFNSY-GKYVVKLYWMGCWRKITVDDFLPFde 228
Cdd:pfam00648   44 ASRFdICQGELGDC-------WllaaiasltlnPKL--LERVVPPDQSFEENYaGIFHFRFWRFGEWVDVVIDDRLPT-- 112
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039732645  229 ENNLLL---PATSYEFelWPMLLSKAIIKL 255
Cdd:pfam00648  113 RNGKLLfvhSRDKNEF--WSALLEKAYAKL 140
 
Name Accession Description Interval E-value
Adgb_C_mid-like cd22307
C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted ...
796-1208 0e+00

C-terminal middle region of Androglobins (Adgbs) and related proteins; including permuted globin domain and IQ motif; Androglobin (Adgb, also known as Calpain-7-like protein, CAPN7L) is a large multidomain protein consisting of an N-terminal peptidase C2 family calpain-like domain, an IQ calmodulin-binding motif, and an internal, circularly permuted globin domain. The canonical secondary structure of hemoglobins is a 3-over-3 alpha-helical sandwich structure, where the eight alpha-helical segments are conventionally labeled A-H according to their sequential order; Adgbs differ from this in having helices C-H followed by A-B. Adgbs and other phylogenetically ancient globins, such as neuroglobins and globin X, form hexacoordinated heme iron complexes. Globins contain various highly conserved residues of the heme pocket, including a Phe in the interhelical position CD1 (Phe CD1, first position in the loop between helices C and D) that is packed against the heme, a His at the 7th position of the E-helix (His E7) that binds the heme iron distally, and a His at the 8th position of the F-helix (His F8) that binds the heme iron proximally. Unlike other hexacoordinated globins, Adgbs have an E7 Gln; their hexacoordination scheme is [Gln]-Fe-[His]. In mammals, Adgb is mainly expressed in the testes and may play an important role in spermatogenesis. Arthropod Adgbs have degenerate globin domains (DOI:10.3389/fgene.2020.00858). This model spans the permuted globin domain, the IQ motif, and a conserved region of about 200 amino acid residues located C-terminal to the globin domain; it does not include the N-terminal protease domain or the large uncharacterized C-terminal domain of approximately 500 residues.


Pssm-ID: 412094  Cd Length: 416  Bit Score: 593.38  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  796 IHVCSMTTFVIGDEDIVLPNFEPESYRFTEQSIIIMKAIGNVIANFKDKGKLPAALRDLQAAHYPIPLNNKELTAQHFRV 875
Cdd:cd22307      1 LHLCSDTPFVFGDEETVMPLLTKESVRFTEQASSILKALGNAIQSFGDEEYLPAALKELYRSYCPPLLWSKEDKKEHHKV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  876 FHISLWRLMKKSQVAKPPSNFKFAFRAMVFDTDLLDSFSEDVSLAEWVDLKYSTPINEK-EYTSEEIAAAVKIQSMWKGC 954
Cdd:cd22307     81 FNEALYHLLKKALGRKETPDELFALRALFLDPDIGLEYKESPSSSLREIVEPDECDCRTrEPTIEEHEAATKIQAFFRGT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  955 YVRLLMKARKPETKENVTVADTLQKIWAVLEMNLEQYALSLLRLMFKSKCKSMESYPCYQDEETKLAFADHTVNYADQPP 1034
Cdd:cd22307    161 LVRKLLKAHKPGTKENLKVAETLKKIWEKIESNLESLAASLLRYMFKNNPKLKELYPCYEDEWTVISFQDYSGTYPDQPP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1035 NSWFIVFREIFLVPQDMIILPKVYTTLPICILHVINNDTLEQVPKVFQKVVPFLYTKNKKGYTFVAEAYTGDTFVSGARW 1114
Cdd:cd22307    241 NSWFPVFREVFNVPEEMLVVPKLYSPLPRCLLRVFNNDTGEELPRVFNKVAPFVYKPNKKGYTFVAEAYTGDQPPKEGKW 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1115 KLRLIGSYNPLPFLARDSPCNTFSIKE-IRDYYIPNDRKILFRYSIKVTVAQSITIQVRTSKPDTFIKLQVLESEEVITS 1193
Cdd:cd22307    321 RLRLIGSKEPLPKLSRETPLSTFSVKEeIKDYYIPNKKNIICRYIVKVTKDHLVTIRLQTSKPDVEIKLQVLDEEEEVAS 400
                          410
                   ....*....|....*
gi 1039732645 1194 TVGKGQAVIPAFYFL 1208
Cdd:cd22307    401 ETGGGHVVIPVFRLL 415
CysPc cd00044
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. ...
200-307 1.47e-12

Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.


Pssm-ID: 238004 [Multi-domain]  Cd Length: 315  Bit Score: 70.44  E-value: 1.47e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  200 NSYGKYVVKLYWMGCWRKITVDDFLPFDeeNNLLLPATSYEF-ELWPMLLSKAIIKLanvdvH--VAHRRELGELTVIHA 276
Cdd:cd00044    106 NYAGIYHFRFWKNGEWVEVVIDDRLPTS--NGGLLFMHSRDRnELWVALLEKAYAKL-----HgsYEALVGGNTAEALED 178
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1039732645  277 LTGWLPEVIPLHPAYVDRVWELLKEILPEFK 307
Cdd:cd00044    179 LTGGPTERIDLKSADASSGDNDLFALLLSFL 209
CysPc smart00230
Calpain-like thiol protease family; Calpain-like thiol protease family (peptidase family C2). ...
200-300 1.67e-10

Calpain-like thiol protease family; Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).


Pssm-ID: 128526  Cd Length: 318  Bit Score: 64.27  E-value: 1.67e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645   200 NSYGKYVVKLYWMGCWRKITVDDFLPFDeENNLLLPATSYEFELWPMLLSKAIIKLANVDVHvahrreLGELTVIHAL-- 277
Cdd:smart00230   98 NYAGIFHFRFWRFGKWVDVVIDDRLPTY-NGELVFMHSNSRNEFWSALLEKAYAKLNGCYEA------LKGGSTTEALed 170
                            90       100
                    ....*....|....*....|....*.
gi 1039732645   278 -TGWLPEVIPLHPAYVDR--VWELLK 300
Cdd:smart00230  171 lTGGVAESIDLKEASKDPdnLFEDLF 196
PTZ00121 PTZ00121
MAEBL; Provisional
1332-1640 1.82e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 49.75  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1332 KDMEKMDIKAE--KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKE-KQAPRFEPQQVQMPTAVHSQQ 1408
Cdd:PTZ00121  1444 KKADEAKKKAEeaKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEaKKAAEAKKKADEAKKAEEAKK 1523
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1409 EDPnkpywiLRLVSEHTDSDYVDVKKDTERADEIRA---MKQAWETTEPGRAIKAAQARLKYLtqfiKKPVTTDTTTSAP 1485
Cdd:PTZ00121  1524 ADE------AKKAEEAKKADEAKKAEEKKKADELKKaeeLKKAEEKKKAEEAKKAEEDKNMAL----RKAEEAKKAEEAR 1593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1486 SPETLSVSQSQTKSSEEVVRQRSPTilETSPQQIRKALEFLDFSHYVRKTAAEAVLQTEELnkqqamQKAEEIHQFRQHR 1565
Cdd:PTZ00121  1594 IEEVMKLYEEEKKMKAEEAKKAEEA--KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEEL------KKAEEENKIKAAE 1665
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732645 1566 SRILSIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIREVYRNKLLEAERLRME----ALAAQEAAVKIEIEKK 1640
Cdd:PTZ00121  1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAeeenKIKAEEAKKEAEEDKK 1744
PTZ00121 PTZ00121
MAEBL; Provisional
1321-1651 3.48e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.98  E-value: 3.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1321 NETQLTFVQALKDMEKMDIKAEKHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAPRFEPQQVQM 1400
Cdd:PTZ00121  1056 HEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARK 1135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1401 PTAVHSQQE----DPNKPYWILRLVSEHTDSDYVDVKKDTERADEIR---AMKQAWETTEPGRAIKAAQARlKYltQFIK 1473
Cdd:PTZ00121  1136 AEDARKAEEarkaEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARkaeEVRKAEELRKAEDARKAEAAR-KA--EEER 1212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1474 KPVTTDTTTSAPSPETLSVSQSQTKSSEEVVRQRSptilETSPQQIRKaLEFLDFSHYVRKTAAeavLQTEELNKQQAMQ 1553
Cdd:PTZ00121  1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE----ERNNEEIRK-FEEARMAHFARRQAA---IKAEEARKADELK 1284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1554 KAEEIHQFRQHRSrilsirdidQEErfKQKDEVLEMYGEMRDSVDEARQKIldirEVYRNKLLEAERLRMEALAAQEAAV 1633
Cdd:PTZ00121  1285 KAEEKKKADEAKK---------AEE--KKKADEAKKKAEEAKKADEAKKKA----EEAKKKADAAKKKAEEAKKAAEAAK 1349
                          330
                   ....*....|....*...
gi 1039732645 1634 KIEIEKKSPASDSQKKKK 1651
Cdd:PTZ00121  1350 AEAEAAADEAEAAEEKAE 1367
Caldesmon pfam02029
Caldesmon;
1344-1640 5.51e-05

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 47.55  E-value: 5.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1344 HEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAPRFEPQQVQMPTAVHSQQE-DPNKPYWILRLVS 1422
Cdd:pfam02029   23 KEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEALERQKEfDPTIADEKESVAE 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1423 ----------------EHTDSDYVDVKKDTERADEIRAMKQAWETTEPGRAIKAAQARLKyltQFIKKPVTTDTTTSAPS 1486
Cdd:pfam02029  103 rkenneeeensswekeEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQAEEEGEEEEDK---SEEAEEVPTENFAKEEV 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1487 PETLSVSQSQTKSSEEVVRQRSPTILETSPQQIRKALEFLDFSHYVRKTAAEAVLQTEELNKQQ--AMQKAEEIHQFRQH 1564
Cdd:pfam02029  180 KDEKIKKEKKVKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFleAEQKLEELRRRRQE 259
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039732645 1565 RSRilsirdidQE-ERFKQKDEvlemygEMRDSVDEARQKildiREvYRNKLLEAERLRMEAlAAQEAAVKIEIEKK 1640
Cdd:pfam02029  260 KES--------EEfEKLRQKQQ------EAELELEELKKK----RE-ERRKLLEEEEQRRKQ-EEAERKLREEEEKR 316
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1494-1641 2.30e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.88  E-value: 2.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1494 QSQTKSSEEVVRQ-----RSPTILETSPQqiRKALEFLDFSHYVRKTAAEA---VLQTEELNKQQAMQKAEEIHQFRQHR 1565
Cdd:pfam17380  384 QMERQQKNERVRQeleaaRKVKILEEERQ--RKIQQQKVEMEQIRAEQEEArqrEVRRLEEERAREMERVRLEEQERQQQ 461
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1566 SRILSirdiDQEERFKQKDEVLEMYGEMRDSVDEARQKILD----------IREVYRNKLLEAE-RLRMEALAAQEAAVK 1634
Cdd:pfam17380  462 VERLR----QQEEERKRKKLELEKEKRDRKRAEEQRRKILEkeleerkqamIEEERKRKLLEKEmEERQKAIYEEERRRE 537

                   ....*..
gi 1039732645 1635 IEIEKKS 1641
Cdd:pfam17380  538 AEEERRK 544
IQCD cd23767
IQ (isoleucine-glutamine) motif containing D (IQCD); IQCD, also called dynein regulatory ...
933-968 2.72e-04

IQ (isoleucine-glutamine) motif containing D (IQCD); IQCD, also called dynein regulatory complex protein 10 (DRC10), belongs to the IQ motif-containing protein family which contains a C-terminal conserved IQ motif domain and two coiled-coil domains. The IQ motif ([ILV]QxxxRxxxx[RK]), where x stands for any amino-acid residue, interacts with calmodulin (CaM) in a calcium-independent manner and is present in proteins with a wide diversity of biological functions. The IQCD protein was found to primarily accumulate in the acrosome area of round and elongating spermatids of the testis during late stage of spermiogenesis and was then localized to the acrosome and tail regions of mature spermatozoa. The expression of IQCD follows the trajectory of acrosome development during spermatogenesis. IQCD is associated with neuroblastoma and neurodegenerative diseases, and is reported to interact with the nuclear retinoid X receptor in the presence of 9-cis-retinoic acid, thereby activating the transcriptional activity of the receptor.


Pssm-ID: 467745 [Multi-domain]  Cd Length: 37  Bit Score: 39.83  E-value: 2.72e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1039732645  933 EKEYTSEEIAAAVKIQSMWKGCYVRLLMKARKPETK 968
Cdd:cd23767      1 EEEELQRMNRAATLIQALWRGYKVRKELKKKKKKGK 36
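The IQ motif consensus quoted in the description above, [ILV]QxxxRxxxx[RK], can be written as a regular expression and checked against the two aligned rows. This is a minimal sketch, not part of the CD-Search output: the helper name `find_iq_motifs` is hypothetical, and the two sequences are copied from the alignment rows above.

```python
import re

# Strict IQ motif consensus from the IQCD description: [ILV]QxxxRxxxx[RK],
# where x is any residue ('.' in the regular expression).
IQ_MOTIF = re.compile(r"[ILV]Q.{3}R.{4}[RK]")


def find_iq_motifs(seq):
    """Return (start, matched_text) for each non-overlapping strict match.

    `start` is a 0-based offset into `seq` (hypothetical helper, for illustration).
    """
    return [(m.start(), m.group()) for m in IQ_MOTIF.finditer(seq.upper())]


# Rows copied from the cd23767 alignment above.
query = "EKEYTSEEIAAAVKIQSMWKGCYVRLLMKARKPETK"  # gi 1039732645, residues 933-968
model = "EEEELQRMNRAATLIQALWRGYKVRKELKKKKKKGK"  # Cdd:cd23767, positions 1-36

print(find_iq_motifs(model))  # [(14, 'IQALWRGYKVR')]
print(find_iq_motifs(query))  # []
```

The model row contains one strict match, while the query row does not: in the aligned segment IQSMWKGCYVR the sixth motif position holds K where the consensus expects R. The strict pattern is therefore a stricter test than the profile-based hit reported above, which tolerates such substitutions.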
PTZ00121 PTZ00121
MAEBL; Provisional
1329-1647 3.93e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.52  E-value: 3.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1329 QALKDMEKMDIKAE----KHEEPAPMGSPDSHAVSEGQKSVGVPKTTRKGKEKSAEKEKLAKEKQAP---RFEPQQVQMP 1401
Cdd:PTZ00121  1354 AAADEAEAAEEKAEaaekKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAdeaKKKAEEKKKA 1433
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1402 TAVHSQQEDPNKPYWILRLVSEHTDSDYVDVKKDTER-ADEIRamKQAWETTEPGRAIKAAQARLKYLTQFIKKPVTTDT 1480
Cdd:PTZ00121  1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKkADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1481 TTSAPSPETLSVSQSQTKSSEevvrQRSPTILETSpQQIRKALEfldfshyVRKtaAEAVLQTEELNKQQAMQKAEEIHQ 1560
Cdd:PTZ00121  1512 ADEAKKAEEAKKADEAKKAEE----AKKADEAKKA-EEKKKADE-------LKK--AEELKKAEEKKKAEEAKKAEEDKN 1577
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1561 FRQHRSRIL-SIRDIDQEERFKQKDEVLEMYGEMRDSVDEARQKILDIRE-------VYRNKLLEAERLR-MEAL--AAQ 1629
Cdd:PTZ00121  1578 MALRKAEEAkKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKaeeekkkVEQLKKKEAEEKKkAEELkkAEE 1657
                          330
                   ....*....|....*...
gi 1039732645 1630 EAAVKIEIEKKSPASDSQ 1647
Cdd:PTZ00121  1658 ENKIKAAEEAKKAEEDKK 1675
Peptidase_C2 pfam00648
Calpain family cysteine protease;
90-255 1.15e-03

Calpain family cysteine protease;


Pssm-ID: 459889 [Multi-domain]  Cd Length: 295  Bit Score: 42.87  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645   90 FEDPE--------GKIELPQSLKVFSWKRPQDfifsrtpvvvkneitfdlfspnehlLCSElmrwiiseiyavWKIFNGG 161
Cdd:pfam00648    1 FEDPEfpaddsslGYPPSPPPPRGVEWKRPKE-------------------------ICSN------------PQFIVDG 43
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645  162 ILSN-YHKGNLGELpilpwkpW-----------EHIysLCKAVKGHVPLFNSY-GKYVVKLYWMGCWRKITVDDFLPFde 228
Cdd:pfam00648   44 ASRFdICQGELGDC-------WllaaiasltlnPKL--LERVVPPDQSFEENYaGIFHFRFWRFGEWVDVVIDDRLPT-- 112
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039732645  229 ENNLLL---PATSYEFelWPMLLSKAIIKL 255
Cdd:pfam00648  113 RNGKLLfvhSRDKNEF--WSALLEKAYAKL 140
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1484-1640 1.54e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 43.02  E-value: 1.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732645 1484 APSpETLSVSQSQTKSSEEVVRQRsptiletspqqiRKALEFLDFSHYVRKTAAEAVLQTEELNKQQAMQKAEEIHQfrQ 1563
Cdd:pfam15709  319 DPS-KALLEKREQEKASRDRLRAE------------RAEMRRLEVERKRREQEEQRRLQQEQLERAEKMREELELEQ--Q 383
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732645 1564 HRSRILSIRDIDQE-ERFKQKDEVLEMYGEMRDSVDEARQKildiREVYRNKLLEAERLRMealaaQEAAVKIEIEKK 1640
Cdd:pfam15709  384 RRFEEIRLRKQRLEeERQRQEEEERKQRLQLQAAQERARQQ----QEEFRRKLQELQRKKQ-----QEEAERAEAEKQ 452
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
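The E-value threshold listed above can be checked against the per-hit summary lines. The following is a rough sketch under stated assumptions: the function name `parse_summary` and the regular expression are illustrative only and assume the "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." layout seen on this page; it is not an NCBI tool or API.

```python
import re

# Assumed layout of a hit summary line, as it appears on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }


# Summary lines copied from the hits listed above.
lines = [
    "Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.88  E-value: 2.30e-04",
    "Pssm-ID: 467745 [Multi-domain]  Cd Length: 37  Bit Score: 39.83  E-value: 2.72e-04",
    "Pssm-ID: 459889 [Multi-domain]  Cd Length: 295  Bit Score: 42.87  E-value: 1.15e-03",
]

E_VALUE_THRESHOLD = 0.01  # from the preset options above

hits = [parse_summary(line) for line in lines]
print(all(h["e_value"] <= E_VALUE_THRESHOLD for h in hits))  # True
```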

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.