NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1387194813|ref|XP_024853393|]
View 

MAX gene-associated protein isoform X5 [Bos taurus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.35e-137

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


:

Pssm-ID: 410321  Cd Length: 186  Bit Score: 427.23  E-value: 1.35e-137
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2365-2429 2.59e-33

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


:

Pssm-ID: 381481  Cd Length: 65  Bit Score: 123.74  E-value: 2.59e-33
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLT 2429
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
MGA_dom super family cl24582
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1041-1082 1.28e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


The actual alignment was detected with superfamily member pfam16059:

Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.28e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1387194813 1041 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1082
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1407-1708 7.62e-11

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 67.68  E-value: 7.62e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1407 STLSTVISKVASNAKVAasrkPRTLLPSTSNSKTASSSGTTTNRPgknlKAFVPAKRPIAARPSPGGVFTQFVMSKVGAl 1486
Cdd:pfam17823  128 QSLPAAIAALPSEAFSA----PRAAACRANASAAPRAAIAAASAP----HAASPAPRTAASSTTAASSTTAASSAPTTA- 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1487 qqKIPGVSTPQPLTGpqkfsIRPSPVMVVTPVVSSEPVQVcssvtaAVTTTTPQVFLENVPAVTPTTalsdVGTKETTYS 1566
Cdd:pfam17823  199 --ASSAPATLTPARG-----ISTAATATGHPAAGTALAAV------GNSSPAAGTVTAAVGTVTPAA----LATLAAAAG 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1567 SGATTAGVVEVSETNTSTLvTPTQSTAT----LNLIKTTGITT--PVASVAFPKSLVASPPTITLPVASTASTSIVVVTT 1640
Cdd:pfam17823  262 TVASAAGTINMGDPHARRL-SPAKHMPSdtmaRNPAAPMGAQAqgPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSV 340
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387194813 1641 AASSSMVTTPTS------SLSSVPIilsgidgsPPVSQRPENAPQIPVAPPQVSPNTVKRAGPRLLLIPVQQGS 1708
Cdd:pfam17823  341 ASTNLAVVTTTKaqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVAT 406
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1609-1918 1.01e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1609 SVAFPKSLVASPPTITLPVASTASTSIVVVTTAASSSmvTTPTSSLSSVPiilsgIDGSPPVSQrPENAPQIPVAPPQVS 1688
Cdd:pfam03154  160 SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAAT--AGPTPSAPSVP-----PQGSPATSQ-PPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1689 PNTVKRAGPRL-----LLIPVQQGSP----TLRPVPNTQLQG-----------------HRMVLQPVRSPSGMNLFRHPN 1742
Cdd:pfam03154  232 QQTPTLHPQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGqmppmphslqtgpshmqHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1743 GQIVQL---------LPLHQLRGSNNQPNLQPVMFRNPGSVMGIR---------LPTPSKPSETPPSSASPSAFSVVN-- 1802
Cdd:pfam03154  312 GPSPAApgqsqqrihTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlp 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1803 ---------------------------PVIQAVGSSPAM-NVITQAPSLLSSGPNFVSQSGTLTLRISPPEP-HSFTSkt 1853
Cdd:pfam03154  392 pppalkplsslsthhppsahppplqlmPQSQQLPPPPAQpPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPqHPFVP-- 469
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 1854 ASETKITYSSGGQPVGTASLIPLQSGSFALLQLPGqkPVPSSILQHVASLQMKRESQNADQKDET 1918
Cdd:pfam03154  470 GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEEALDEAEEPES 532
 
Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.35e-137

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410321  Cd Length: 186  Bit Score: 427.23  E-value: 1.35e-137
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
T-box pfam00907
T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box ...
77-260 1.17e-107

T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.


Pssm-ID: 459990  Cd Length: 182  Bit Score: 341.46  E-value: 1.17e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   77 VTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLGRV 156
Cdd:pfam00907    1 VSLENKELWKKFHELGTEMIITKSGRRMFPTLKVSVSGLDPNAKYSVLLDIVPVDDKRYKFHNGKWVVAGKAEPHSPPRV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  157 FIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaEKATEVIQLNGPGVHTFTFPQTEFFAV 236
Cdd:pfam00907   81 YIHPDSPATGSHWMKQPVSFDKLKLTNNKEDKNGHIILNSMHKYQPRLHIV--RVGGDEPSLPEENVKTFVFPETEFIAV 158
                          170       180
                   ....*....|....*....|....
gi 1387194813  237 TAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:pfam00907  159 TAYQNEEITQLKIDNNPFAKGFRD 182
TBOX smart00425
Domain first found in the mice T locus (Brachyury) protein;
75-264 1.59e-86

Domain first found in the mice T locus (Brachyury) protein;


Pssm-ID: 214656  Cd Length: 190  Bit Score: 281.08  E-value: 1.59e-86
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813    75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:smart00425    1 IKVSLEDKELWRKFHELGTEMIVTKSGRRMFPTLKYKVSGLDPNALYSVLMDLVPVDDKRYKFNNGKWVVAGKAEPHMPS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGH--IILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTE 232
Cdd:smart00425   81 RVYVHPDSPATGAHWMKQPVSFDKVKLTNNQSDKNGHlqIILNSMHKYQPRLHIV---EVDDISKEILSQFKTFVFPETQ 157
                           170       180       190
                    ....*....|....*....|....*....|..
gi 1387194813   233 FFAVTAYQNIQITQLKIDYNPFAKGFRDDGLN 264
Cdd:smart00425  158 FIAVTAYQNQKITKLKIDNNPFAKGFRDQGRR 189
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2365-2429 2.59e-33

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


Pssm-ID: 381481  Cd Length: 65  Bit Score: 123.74  E-value: 2.59e-33
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLT 2429
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
MGA_dom pfam16059
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1041-1082 1.28e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.28e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1387194813 1041 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1082
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1407-1708 7.62e-11

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 67.68  E-value: 7.62e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1407 STLSTVISKVASNAKVAasrkPRTLLPSTSNSKTASSSGTTTNRPgknlKAFVPAKRPIAARPSPGGVFTQFVMSKVGAl 1486
Cdd:pfam17823  128 QSLPAAIAALPSEAFSA----PRAAACRANASAAPRAAIAAASAP----HAASPAPRTAASSTTAASSTTAASSAPTTA- 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1487 qqKIPGVSTPQPLTGpqkfsIRPSPVMVVTPVVSSEPVQVcssvtaAVTTTTPQVFLENVPAVTPTTalsdVGTKETTYS 1566
Cdd:pfam17823  199 --ASSAPATLTPARG-----ISTAATATGHPAAGTALAAV------GNSSPAAGTVTAAVGTVTPAA----LATLAAAAG 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1567 SGATTAGVVEVSETNTSTLvTPTQSTAT----LNLIKTTGITT--PVASVAFPKSLVASPPTITLPVASTASTSIVVVTT 1640
Cdd:pfam17823  262 TVASAAGTINMGDPHARRL-SPAKHMPSdtmaRNPAAPMGAQAqgPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSV 340
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387194813 1641 AASSSMVTTPTS------SLSSVPIilsgidgsPPVSQRPENAPQIPVAPPQVSPNTVKRAGPRLLLIPVQQGS 1708
Cdd:pfam17823  341 ASTNLAVVTTTKaqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVAT 406
HLH pfam00010
Helix-loop-helix DNA-binding domain;
2364-2415 6.22e-09

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 54.00  E-value: 6.22e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1387194813 2364 YRRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLT 2415
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLpTLPPDKKLSKAEILRLAIEYIKHLQ 53
PHA03247 PHA03247
large tegument protein UL36; Provisional
1448-1848 6.95e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 6.95e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1448 TNRPGKNLKAFVPAKRPI--AARPSPGgvftqfvmsKVGALQQKIPGVSTPQPltgpqkfsiRPSPVMVVTPVvssepvq 1525
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRrrAARPTVG---------SLTSLADPPPPPPTPEP---------APHALVSATPL------- 2721
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1526 vcSSVTAAVTTTTPQVFLENVPAVTPTTALSDVGtkETTYSSGATTAGvvEVSETNTSTLVTPTQSTATLNLIKTTGITT 1605
Cdd:PHA03247  2722 --PPGPAAARQASPALPAAPAPPAVPAGPATPGG--PARPARPPTTAG--PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1606 ---PVASVAFPKSLVASPPTITLPVASTASTSIVVVTTAASSSMVTTPTSSLSSVPIILSGIDGSpPVSQRP--ENAPQI 1680
Cdd:PHA03247  2796 eslPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPpsRSPAAK 2874
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1681 PVAP----------PQVSPNTVKRAGPRLLLIPVQQGSPTLRPVPNTQLQGHRMVLQPVRSPSgmnLFRHPNGQIVQLLP 1750
Cdd:PHA03247  2875 PAAParppvrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP---RPQPPLAPTTDPAG 2951
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1751 LHQLRGSNNQPNLQPVMfrnPGSVMGIRLPTPS-KPSETPPSSASPSAFSVVNPVIQAVGSSPAMNVITQAP--SLLSS- 1826
Cdd:PHA03247  2952 AGEPSGAVPQPWLGALV---PGRVAVPRFRVPQpAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPpvSLKQTl 3028
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1387194813 1827 -GPNFVSQSGTLTLRIS-------------PPEPHS 1848
Cdd:PHA03247  3029 wPPDDTEDSDADSLFDSdsersdlealdplPPEPHD 3064
HLH smart00353
helix loop helix domain;
2369-2420 1.08e-05

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 44.90  E-value: 1.08e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1387194813  2369 TANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADK 2420
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLpTLPKNKKLSKAEILRLAIEYIKSLQEELQK 53
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
1616-1783 4.49e-04

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 45.68  E-value: 4.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1616 LVASPPTitlpvasTASTSIVVVTTAASSSMVTTPTSSLSSVPIILS-----GIDGSPPVSQRPENAPQIPVAP-PQVSP 1689
Cdd:cd22536    276 LVSTPIT-------TASVSTMPESPSSSTTCTTTASTSLTSSDTLVSsaetgQYASTAASSERTEEEPQTSAAEsEAQSS 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1690 NTVKRAGprllLIPVQQGSptlrpvpnTQLQGHRMVLQPVRSpsgMNLFRHPNGQIVQLLPLH--QLRGSNN-------- 1759
Cdd:cd22536    349 SQLQSNG----LQNVQDQS--------NSLQQVQIVGQPILQ---QIQIQQPQQQIIQAIQPQsfQLQSGQTiqtiqqqp 413
                          170       180
                   ....*....|....*....|....*.
gi 1387194813 1760 QPNLQPVMFRNPGSVMgIRLP--TPS 1783
Cdd:cd22536    414 LQNVQLQAVQSPTQVL-IRAPtlTPS 438
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1609-1918 1.01e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1609 SVAFPKSLVASPPTITLPVASTASTSIVVVTTAASSSmvTTPTSSLSSVPiilsgIDGSPPVSQrPENAPQIPVAPPQVS 1688
Cdd:pfam03154  160 SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAAT--AGPTPSAPSVP-----PQGSPATSQ-PPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1689 PNTVKRAGPRL-----LLIPVQQGSP----TLRPVPNTQLQG-----------------HRMVLQPVRSPSGMNLFRHPN 1742
Cdd:pfam03154  232 QQTPTLHPQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGqmppmphslqtgpshmqHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1743 GQIVQL---------LPLHQLRGSNNQPNLQPVMFRNPGSVMGIR---------LPTPSKPSETPPSSASPSAFSVVN-- 1802
Cdd:pfam03154  312 GPSPAApgqsqqrihTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlp 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1803 ---------------------------PVIQAVGSSPAM-NVITQAPSLLSSGPNFVSQSGTLTLRISPPEP-HSFTSkt 1853
Cdd:pfam03154  392 pppalkplsslsthhppsahppplqlmPQSQQLPPPPAQpPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPqHPFVP-- 469
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 1854 ASETKITYSSGGQPVGTASLIPLQSGSFALLQLPGqkPVPSSILQHVASLQMKRESQNADQKDET 1918
Cdd:pfam03154  470 GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEEALDEAEEPES 532
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1547-1655 1.14e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1547 PAVTPTTALSDVGTKETTYSSGATTAGVVEVSETNTSTLVTPTQSTATLNLIKTTGITTPVASVAFPKSLVAS-PPTITL 1625
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTtTSTTTT 171
                           90       100       110
                   ....*....|....*....|....*....|
gi 1387194813 1626 PVASTASTSIVVVTTAASSSMVTTPTSSLS 1655
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATTT 201
 
Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.35e-137

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410321  Cd Length: 186  Bit Score: 427.23  E-value: 1.35e-137
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
T-box pfam00907
T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box ...
77-260 1.17e-107

T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.


Pssm-ID: 459990  Cd Length: 182  Bit Score: 341.46  E-value: 1.17e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   77 VTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLGRV 156
Cdd:pfam00907    1 VSLENKELWKKFHELGTEMIITKSGRRMFPTLKVSVSGLDPNAKYSVLLDIVPVDDKRYKFHNGKWVVAGKAEPHSPPRV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  157 FIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaEKATEVIQLNGPGVHTFTFPQTEFFAV 236
Cdd:pfam00907   81 YIHPDSPATGSHWMKQPVSFDKLKLTNNKEDKNGHIILNSMHKYQPRLHIV--RVGGDEPSLPEENVKTFVFPETEFIAV 158
                          170       180
                   ....*....|....*....|....
gi 1387194813  237 TAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:pfam00907  159 TAYQNEEITQLKIDNNPFAKGFRD 182
T-box_TBX4_5-like cd20189
DNA-binding domain of T-box transcription factor 4 and 5, and related T-box proteins; This ...
75-260 3.30e-89

DNA-binding domain of T-box transcription factor 4 and 5, and related T-box proteins; This subfamily includes the T-box transcription factors TBX4 and TBX5 which play important roles in vertebrate limb and heart development, and in lung and trachea development. TBX4 is needed for normal skeletal and muscular hindlimb development and is involved in super-enhancer-driven transcriptional programs underlying features specific to lung fibroblasts. TBX5 plays a role in regulating cardiac conduction system function, and in coordinating forelimb muscle pattern. Mutations in human TBX5 and TBX4 are associated with Holt-Oram syndrome and Small Patella syndrome, respectively. Both syndromes are characterized by limb defects in addition to other abnormalities. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410315  Cd Length: 185  Bit Score: 288.56  E-value: 3.30e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20189      1 IKVFLENRELWQKFHEVGTEMIITKAGRRMFPSIKVKVTGLNPKTKYILLMDIVPADDHRYKFHDSEWVVAGKAEPAMPG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKaTEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20189     81 RLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDQFGHIILNSMHKYQPRIHIVQADD-NNAFGSKNTAFSTHVFPETAFI 159
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20189    160 AVTAYQNHQITQLKIENNPFAKGFRG 185
T-box_TBX6_VegT-like cd20190
DNA-binding domain of T-box transcription factor 6, VegT and related T-box proteins; This ...
75-260 6.29e-87

DNA-binding domain of T-box transcription factor 6, VegT and related T-box proteins; This subfamily includes the transcriptional regulators TBX6 and VegT. TBX6 plays an essential role in the fate determination of axial stem to become either neural or mesodermal. It also plays an essential role in the regulation of left/right patterning in mouse embryos through effects on nodal cilia and perinodal signaling. VegT (also known as Antipodean, Brat and Xombi) is required in early Xenopus embryos for the formation of both the mesoderm and endoderm germ layers. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved 1DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410316  Cd Length: 183  Bit Score: 282.16  E-value: 6.29e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20190      1 VSLSLEDRELWKEFSSVGTEMIITKSGRRMFPACKVSVTGLDPEAKYLFLLDVVPVDNARYKWNKRRWEPSGKAEPHLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20190     81 RVYIHPDSPAPGAHWMRQPISFHKLKLTNNTLDPHGHLILHSMHKYQPRIHLV---QSADLCSQHWGGMASFRFPETTFI 157
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20190    158 AVTAYQNPQITKLKIAANPFAKGFRE 183
T-box_VegT-like cd20197
DNA-binding domain of Xenopus VegT and related T-box proteins; VegT, (also known as Antipodean, ...
75-260 1.05e-86

DNA-binding domain of Xenopus VegT and related T-box proteins; VegT, (also known as Antipodean, Brat and Xombi), is a T-box transcription factor required in early Xenopus embryos for the formation of both, the mesoderm and endoderm germ layers. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410323  Cd Length: 183  Bit Score: 281.34  E-value: 1.05e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20197      1 VRASLEDQDLWKKFHQIGTEMIITKSGRRMFPQCKIRVSGLLPYAKYVMLVDFVPVDNFRYKWNKDQWEVAGKAEPQPPC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20197     81 RTYVHPDSPAPGSHWMKQPISFQKLKLTNNTLDQHGHIILHSMHRYQPRFHIV---QADDLFNVRWSLFQVFSFPETVFT 157
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20197    158 AVTAYQNEKITKLKIDNNPFAKGFRE 183
TBOX smart00425
Domain first found in the mice T locus (Brachyury) protein;
75-264 1.59e-86

Domain first found in the mice T locus (Brachyury) protein;


Pssm-ID: 214656  Cd Length: 190  Bit Score: 281.08  E-value: 1.59e-86
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813    75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:smart00425    1 IKVSLEDKELWRKFHELGTEMIVTKSGRRMFPTLKYKVSGLDPNALYSVLMDLVPVDDKRYKFNNGKWVVAGKAEPHMPS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGH--IILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTE 232
Cdd:smart00425   81 RVYVHPDSPATGAHWMKQPVSFDKVKLTNNQSDKNGHlqIILNSMHKYQPRLHIV---EVDDISKEILSQFKTFVFPETQ 157
                           170       180       190
                    ....*....|....*....|....*....|..
gi 1387194813   233 FFAVTAYQNIQITQLKIDYNPFAKGFRDDGLN 264
Cdd:smart00425  158 FIAVTAYQNQKITKLKIDNNPFAKGFRDQGRR 189
T-box_TBX2_3-like cd20188
DNA-binding domain of T-box transcription factor 2 and 3, and related T-box proteins; This ...
75-260 1.51e-81

DNA-binding domain of T-box transcription factor 2 and 3, and related T-box proteins; This subfamily includes the T-box transcription factors TBX2 and TBX3 and similar proteins. TBX2 is an oncogenic transcription factor implicated in developmental processes, including coordinating cell fate, patterning and morphogenesis of a wide range of tissues and organs. It is overexpressed in several cancers, including melanoma and breast, and plays a key role during cardiac development. TBX2 is a negative regulator of promyelocytic leukemia protein (PML) function in cellular senescence, and it interacts with HP1 to recruit a repression complex to EGR1-responsive promoters to drive the proliferation of breast cancer cells. TBX3 has also been implicated in oncogenesis in breast cancer and melanoma. The tbx3 gene is downregulated by PML. TBX3 directly represses TBX2 under the control of the PRC2 complex in skeletal muscle and rhabdomyosarcoma. Also included in this family is the Drosophila melanogaster optomotor-blind protein (Omb, also known as lethal(1)optomotor-blind, or L(1)omb, or protein bifid) which controls many developmental processes such as wing, eye, and abdominal tergites and optic lobes, and induces epithelial cell migration and extrusion in vivo. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410314  Cd Length: 185  Bit Score: 266.60  E-value: 1.51e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20188      3 PKVELEAKDLWDQFHKLGTEMVITKSGRRMFPPFKVRVSGLDKKAKYILLMDIVAADDCRYKFHNSRWMVAGKADPEMPK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20188     83 RMYIHPDSPSTGEQWMQKVVSFHKLKLTNNISDKHGFTILNSMHKYQPRFHIV---RANDILKLPYSTFRTYVFKETEFI 159
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20188    160 AVTAYQNEKITQLKIDNNPFAKGFRD 185
T-box_TBX6 cd20196
DNA-binding domain of T-box transcription factor 6, and related T-box proteins; TBX6 is a ...
75-260 2.13e-81

DNA-binding domain of T-box transcription factor 6, and related T-box proteins; TBX6 is a T-box transcription factor which plays an essential role in the fate determination of axial stem to become either neural or mesodermal. It also plays an essential role in the regulation of left/right patterning in mouse embryos, through effects on nodal cilia and perinodal signaling. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410322  Cd Length: 182  Bit Score: 265.96  E-value: 2.13e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20196      1 VRMSLENAELWKQFSSVGTEMIITKAGRRMFPQLRVSVSGLDPEARYLLLLDVVPVDGSRYRWQGNSWEASGKAEPRLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKatevIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20196     81 RVYIHPDSPATGAHWMRQPISFHRAKLTNNTLDPHGHIILHSMHRYQPRVHVVRARD----VLSWGGGCASFTFPETQFI 156
                          170       180
                   ....*....|....*....|....*.
gi 1387194813  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20196    157 TVTAYQNPKITQLKINSNPFAKGFRE 182
T-box cd00182
DNA-binding domain of the T-box transcription factor family; The T-box family is an ancient ...
75-252 2.73e-81

DNA-binding domain of the T-box transcription factor family; The T-box family is an ancient family of transcription factors which plays a multitude of diverse functions throughout development. The founding member of the family is Brachyury (also known as TBXT, or T). Members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns. The T-box factors in Caenorhabditis elegans have evolved very differently than those in other organisms; its genome contains 22 T-box genes which encode factors which are diverse in DNA-binding specificity, function and sequence, and only 3 of these factors fall into the conserved T-box subfamilies.


Pssm-ID: 410312  Cd Length: 176  Bit Score: 265.61  E-value: 2.73e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHV-L 153
Cdd:cd00182      1 ITVSLRNEELWKKFHELGTEMIVTKSGRRMFPTLEYSVSGLDPNKLYSVSLHFERVDNKRYKFNNGKWVPSGKAEPPPeP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  154 GRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLD-QEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPgVHTFTFPQTE 232
Cdd:cd00182     81 SRIYVHPDGPQTGSFWMKKGVSFDKVKITNNKEDkKEGHILLHSMHKYIPVLTIY---EVDDNGLLSKL-VKEFRFPETE 156
                          170       180
                   ....*....|....*....|
gi 1387194813  233 FFAVTAYQNIQITQLKIDYN 252
Cdd:cd00182    157 FIAVTAYQNDEITQLKIDNN 176
T-box_Drosocross-like cd20681
DNA-binding domain of Drosophila Dorsocross and related T-box proteins; Drosophila Dorsocross ...
75-260 2.76e-78

DNA-binding domain of Drosophila Dorsocross and related T-box proteins; Drosophila Dorsocross (Doc) includes three Dorsocross paralogs, Doc1-3. These are key cardiogenic T-box transcription factors during specification and differentiation of heart cells. Drosophila Doc also functions in caudal visceral mesoderm development, and modulates Notch signaling in the developing Drosophila eye by regulating the expression of Delta in the eye imaginal discs. Doc also functions in the morphogenesis of epithelial tissues: in Drosophila, which possesses a single extraembryonic (EE) membrane, it is essential for EE epithelia tissue maintenance while in Tribolium castaneum, which has 2 EE membranes, Doc plays a major role in EE morphogenetic events throughout development without affecting EE tissue specificity or maintenance. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410332  Cd Length: 186  Bit Score: 257.26  E-value: 2.76e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNG-RWWEPSGKAEP--H 151
Cdd:cd20681      1 VKVTLKNRDLWQQFHREGTEMIITKSGRRMFPSLRLSVSGLEPDARYCVLLEMVLASDCRFKYSGnGGWVPAGGAEPqpP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  152 VLGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQT 231
Cdd:cd20681     81 LPRRIYIHPDSPATGDHWMSQPISFSKVKLTNNTLDPQGNIVLTSMHKYQPRIHIV---RCSDTLALPWAPTASFTFPET 157
                          170       180
                   ....*....|....*....|....*....
gi 1387194813  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20681    158 EFIAVTAYQNERITKLKIDNNPFAKGFRE 186
T-box_TBXT_TBX19-like cd20192
DNA-binding domain of T-box transcription factor T, T-box transcription factor 19 and related ...
75-260 1.65e-77

DNA-binding domain of T-box transcription factor T, T-box transcription factor 19 and related T-box proteins; Tbx19 (also known as Tpit) is a T-box factor restricted to two pituitary (pro-opiomelanocortin) POMC-expressing lineages, the corticotrophs and melanotrophs; it controls terminal differentiation of these lineages. TBX19 activates POMC gene transcription with the cooperation of another transcription factor Pitx1. TBXT, also known as Brachyury protein, or protein T, is a transcription factor needed for posterior mesoderm formation and differentiation as well as for the notochord development during embryogenesis. It binds to a 24 base-pair (bp) palindromic site (called the T site) and activates gene transcription when bound to such a site. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. TBXT is the founding member of the T-box family, members of which share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410318  Cd Length: 180  Bit Score: 254.88  E-value: 1.65e-77
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW-NGRWWePSGKAEPHVL 153
Cdd:cd20192      1 IRVTLEDRELWKKFHSLTNEMIVTKSGRRMFPVLKVSVSGLDPNAMYSVLLDFVQVDNHRWKYvNGEWV-PGGKAEPPPP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  154 GRVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVPAEKateviQLNGPGVHTFTFPQTEF 233
Cdd:cd20192     80 SSVYVHPDSPNFGAHWMKGPVSFSKVKLTNK-PNGEGQIMLNSLHKYEPRVHIVRVGS-----NNHERLVSTFSFPETQF 153
                          170       180
                   ....*....|....*....|....*..
gi 1387194813  234 FAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20192    154 IAVTAYQNEEITALKIKYNPFAKAFLD 180
T-box_TBX1_10-like cd20187
DNA-binding domain of T-box transcription factor 1 and 10, and related T-box proteifactors; ...
74-260 4.47e-75

DNA-binding domain of T-box transcription factor 1 and 10, and related T-box proteifactors; This subfamily includes TBX1 and TBX10. TBX1 is a T-box transcription factor which plays an important role in heart development and has been implicated in DiGeorge or 22q11.2 deletion syndrome. This syndrome is associated with various types of cardiac outflow tract (OFT) and vascular defects. Wnt5a is regulated by TBX1 in the second heart field (SHF). TBX1 is required to maintain the integrity of extracellular matrix-cell interactions in the SHF and this interaction is critical for cardiac (OFT) development. TBX10 is a putative T-box transcription factor. Diseases associated with TBX10 include Isolated Cleft Lip and Cleft Lip/cleft lip with or without cleft palate. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410313  Cd Length: 189  Bit Score: 248.11  E-value: 4.47e-75
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   74 GITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPH 151
Cdd:cd20187      1 NVTVQLEMKALWDEFNQLGTEMIVTKAGRRMFPTFQVKIFGMDPMADYMLMMDFVPVDDKRYRYafHSSSWLVAGKADPA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  152 VLGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQT 231
Cdd:cd20187     81 MPGRIHVHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHIILNSMHRYQPRFHVVYVDPRKDSENSAEENFKTFIFPET 160
                          170       180
                   ....*....|....*....|....*....
gi 1387194813  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20187    161 KFTAVTAYQNHRITQLKIASNPFAKGFRD 189
T-box_TBX20-like cd20193
DNA-binding domain of T-box transcription factor 20 and related T-box proteins; TBX20 is a ...
75-260 3.63e-72

DNA-binding domain of T-box transcription factor 20 and related T-box proteins; TBX20 is a T-box transcriptional factor which functions in embryonic development and its deficiency is associated with congenital heart disease. It acts both as a transcriptional activator and a repressor required for cardiac development, and has key roles in maintaining the functional and structural phenotypes in the adult heart. The TBX20-cardiac transcription factor CASZ1 protein complex is protective against dilated cardiomyopathy and is essential for maintaining cardiac homeostasis. TBX20 has also been shown to regulate angiogenesis through the PROK2-PROKR1 (prokineticin receptor 1) pathway and is involved in both, pathological and developmental, angiogenesis. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410319  Cd Length: 190  Bit Score: 240.02  E-value: 3.63e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPHV 152
Cdd:cd20193      1 VQCHLETKELWDKFHELGTEMIITKSGRRMFPTVRVSFSGVDPDAKYIVLMDIVPVDNKRYRYayHRSSWLVAGKADPPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKAT-EVIQLNGPGVHTFTFPQT 231
Cdd:cd20193     81 PARLYVHPDSPFTGEQLLKQMVSFEKVKLTNNELDKHGHIILNSMHKYQPRVHIVKKKDHTaSLVNLKSEEFRTFIFPET 160
                          170       180
                   ....*....|....*....|....*....
gi 1387194813  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20193    161 VFTAVTAYQNQLITKLKIDSNPFAKGFRD 189
T-box_TBX15_18_22-like cd20191
DNA-binding domain of T-box transcription factor 15, 18 and 22, and related T-box proteins; ...
75-260 2.51e-71

DNA-binding domain of T-box transcription factor 15, 18 and 22, and related T-box proteins; This subfamily includes the transcriptional regulators TBX15, TBX18 and TBX22 which are involved in various developmental processes. TBX15 (also known as TBX14) plays an important role in the development of the skeleton of the limb, vertebral column and head, possibly through its control of the number of mesenchymal precursor cells and chondrocytes; it also plays a role in the differentiation of brown and brite adipocytes. TBX18 is involved in the developmental processes of a variety of tissues and organs, including the ureter, vertebral column. epicardium and coronary vessels; it is important for the development of the head portion of the sino atrial node (SAN). Mutations in the T-box transcription factor gene TBX22 are found in X-linked Cleft Palate with or without Ankyloglossia syndrome (CPX syndrome), and associated with cleft lip and palate, and tooth agenesis. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410317  Cd Length: 194  Bit Score: 237.86  E-value: 2.51e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPHV 152
Cdd:cd20191      3 IQVELQGSELWKRFHDIGTEMIITKAGRRMFPAIRVKVSGLDPHAQYIVAMDIVPVDNKRYRYvyHSSKWMVAGNADAPV 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEV----IQLNGPGVHTFTF 228
Cdd:cd20191     83 PPRVYIHPDSPASGETWMRQVVSFDKLKLTNNEMDDQGHIILHSMHKYQPRVHVIRKDSSTDLspkkPVPPGEGVKTFSF 162
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387194813  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20191    163 PETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 194
T-box-like cd20682
T-box DNA-binding domain; uncharacterized subfamily; The T-box family is an ancient group that ...
75-260 4.73e-70

T-box DNA-binding domain; uncharacterized subfamily; The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.


Pssm-ID: 410333  Cd Length: 191  Bit Score: 233.82  E-value: 4.73e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW---NGRwWEPSGKAEPH 151
Cdd:cd20682      1 IQVELCSRELWLQFHNLGNEMIITKAGRRMFPALKVKLTGLDPDKLYIVWVDIVPVDSNRYRYvyhSSK-WVVAGSGDVL 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  152 VLGRVFIHPESPSTGHYWMHQPVSFYKLKLTNN-TLDQEGHIILHSMHRYLPRLHLVPAE-KATEVIQLNGPGVH-TFTF 228
Cdd:cd20682     80 PPANRYIHPDSPASGKYWMSQIVSFDKLKLTNNkEPKQKGQISLHSMHKYQPRIHIQPVEdDGRNVEKAINSSKAlSFEF 159
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387194813  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20682    160 PETSFITVTAYQNQQITKLKIASNPFAKGFRD 191
T-box_TBR1_2_21-like cd20194
DNA-binding domain of T-box brain protein 1 and 2, T-box transcription factor 21 and related ...
77-260 2.86e-67

DNA-binding domain of T-box brain protein 1 and 2, T-box transcription factor 21 and related T-box proteins; TBX21 (also known as T-cell-specific T-box transcription factor T-bet or transcription factor TBLYM) is a lineage-defining transcription factor which directs T helper type 1 (Th1) cell differentiation. This subfamily includes TBR1 (also known as T-brain-1, or TES-56), which is a neuron-specific transcription factor involved in forebrain development, and TBR2 (also known as Eomesodermin, Eomes, or T-brain-2), which is associated with neurogenesis, cardiogenesis and tumor immune response. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410320  Cd Length: 185  Bit Score: 225.82  E-value: 2.86e-67
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   77 VTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG-R 155
Cdd:cd20194      4 VYLCNRDLWLKFHQHQTEMIITKQGRRMFPTLSFNLSGLDPTAHYNVFVDMVLADPNHWKFQSGKWVPCGKAEGLPQGnR 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  156 VFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHlvpaekateVIQLNGPG------VHTFTFP 229
Cdd:cd20194     84 VYVHPDSPNTGAHWMKQEISFSKLKLTNNKGADQGMIVLNSMHKYQPRIH---------VIEVGGNGpneqrnLQTHSFP 154
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1387194813  230 QTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20194    155 ETQFIAVTAYQNTDITQLKIDHNPFAKGFRD 185
T-box_TBX21 cd20203
DNA-binding domain of T-box transcription factor 21 and related T-box proteins; TBX21 (also ...
75-260 9.98e-65

DNA-binding domain of T-box transcription factor 21 and related T-box proteins; TBX21 (also known as T-cell-specific T-box transcription factor T-bet or transcription factor TBLYM) is a lineage-defining transcription factor which directs T helper type 1 (Th1) cell differentiation. It initiates Th1 lineage development from naive T helper precursor cells both by initiating the Th1 genetic programs and by inhibiting the opposing Th2 and Th17 lineage-commitment programs. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410329  Cd Length: 191  Bit Score: 218.67  E-value: 9.98e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20203      2 LQVLLNNHPLWSKFHKHQTEMIITKQGRRMFPFLSFNLTGLDPTAHYNVYVDVVLADQHHWRYQGGKWVQCGKAEGNMPG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 -RVFIHPESPSTGHYWMHQPVSFYKLKLTNN---TLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQ 230
Cdd:cd20203     82 nRLYVHPDSPNTGAHWMRQEVSFGKLKLTNNkgaSNNVTQMIVLQSLHKYQPRLHIVEVKEGETEEAYSSSKTHTFTFPE 161
                          170       180       190
                   ....*....|....*....|....*....|
gi 1387194813  231 TEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20203    162 TQFIAVTAYQNAEITQLKIDHNPFAKGFRD 191
T-box_TBX18_like cd20199
DNA-binding domain of T-box transcription factor 18 and related T-box proteins; TBX18 acts as ...
75-260 8.03e-64

DNA-binding domain of T-box transcription factor 18 and related T-box proteins; TBX18 acts as a transcription repressor involved in the developmental processes of a variety of tissues and organs, including the ureter, vertebral column. epicardium and coronary vessels. TBX18 is important for the development of the head portion of the sino atrial node (SAN); SAN is the pacemaker region of the heart that initiates each heartbeat. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410325  Cd Length: 195  Bit Score: 216.45  E-value: 8.03e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPHV 152
Cdd:cd20199      4 VRVDLQGADLWKRFHEIGTEMIITKAGRRMFPAMRVKITGLDPHQQYYIAMDIVPVDNKRYRYvyHSSKWMVAGNADSPV 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLN----GPGVHTFTF 228
Cdd:cd20199     84 PPRVYIHPDSPASGETWMRQVISFDKLKLTNNELDDQGHIILHSMHKYQPRVHVIRKECGEELSPVKpipsGEGVKAFSF 163
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387194813  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20199    164 PETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 195
T-box_TBX22-like cd20200
DNA-binding domain of T-box transcription factor 22 and related T-box proteins; TBX22 is a ...
75-260 1.18e-61

DNA-binding domain of T-box transcription factor 22 and related T-box proteins; TBX22 is a transcriptional regulator involved in developmental processes. Mutations in the T-Box transcription factor gene TBX22 are found in X-linked Cleft Palate with or without Ankyloglossia syndrome (CPX syndrome). TBX22 mutation is also associated with cleft lip and palate, and tooth agenesis. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410326  Cd Length: 194  Bit Score: 210.16  E-value: 1.18e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPHV 152
Cdd:cd20200      3 VQVELQGSELWKRFHEIGTEMIITKAGRRMFPSVRVKVKGLDPLKQYYIAMDVVPVDSKRYRYvyHSSQWMVAGNTDHSC 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  153 LG-RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLV---PAEKATEVIQLNGPGVHTFTF 228
Cdd:cd20200     83 ITpRLYVHPDSPCSGETWMRQIISFDRVKLTNNEMDDKGHIILQSMHKYKPRVHVIlqdSRFDLSQIQSLPAEGVKTFSF 162
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387194813  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20200    163 PETEFTTVTAYQNQQITKLKIDRNPFAKGFRD 194
T-box_TBX15-like cd20198
DNA-binding domain of T-box transcription factor 15 and related T-box proteins; TBX15 (also ...
75-260 5.02e-61

DNA-binding domain of T-box transcription factor 15 and related T-box proteins; TBX15 (also known as TBX14) plays an important role in the development of the skeleton of the limb, vertebral column and head, possibly through its control of the number of mesenchymal precursor cells and chondrocytes. TBX15 also plays a role in the differentiation of brown and brite adipocytes. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410324  Cd Length: 198  Bit Score: 208.43  E-value: 5.02e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW--NGRWWEPSGKAEPHV 152
Cdd:cd20198      7 IQVELQCADLWKRFHDIGTEMIITKAGRRMFPAMRVKITGLDPHQQYYIAMDIVPVDNKRYRYvyHSSKWMVAGNADSPV 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLN----GPGVHTFTF 228
Cdd:cd20198     87 PPRVYIHPDSLASGDTWMRQVVSFDKLKLTNNELDDQGHIILHSMHKYQPRVHVIRKDFSSDLSPTKpvptGDGVKTFSF 166
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387194813  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20198    167 PETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 198
T-box_TBR1 cd20204
DNA-binding domain of T-box brain protein 1 and related T-box proteins; TBR1 (also known as ...
77-260 1.08e-57

DNA-binding domain of T-box brain protein 1 and related T-box proteins; TBR1 (also known as T-brain-1 or TES-56) is a neuron-specific transcription factor of the T-box family and involved in forebrain development. It has been recognized as a high-confidence risk gene for autism spectrum disorders (ASD); it regulates the expression of ASD-related genes that are critical for cortical development. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410330  Cd Length: 191  Bit Score: 198.80  E-value: 1.08e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   77 VTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG-R 155
Cdd:cd20204      4 VYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNISGLDPTAHYNIFVDVILADPNHWRFQGGKWVPCGKADTNVQGnR 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  156 VFIHPESPSTGHYWMHQPVSFYKLKLTNN--TLDQEGH-IILHSMHRYLPRLHLVPA-EKATEviQLNGPG-VHTFTFPQ 230
Cdd:cd20204     84 VYMHPDSPNTGAHWMRQEISFGKLKLTNNkgASNNNGQmVVLQSLHKYQPRLHVVEVnEDGTE--DTSQPGrVQTFTFPE 161
                          170       180       190
                   ....*....|....*....|....*....|
gi 1387194813  231 TEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20204    162 TQFIAVTAYQNTDITQLKIDHNPFAKGFRD 191
T-box_Fungi_incertae_sedis cd20683
T-box DNA-binding domain; uncharacterized subfamily of fungi classified as Fungi incertae ...
76-261 6.89e-57

T-box DNA-binding domain; uncharacterized subfamily of fungi classified as Fungi incertae sedis; Fungi incertae sedis refers to a fungal taxonomic group where its broader relationships are unknown or undefined. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.


Pssm-ID: 410334  Cd Length: 214  Bit Score: 197.23  E-value: 6.89e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   76 TVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKW-NGRwWEPSGK------- 147
Cdd:cd20683      2 QLLLEDADLWAQFHSVQNEMIITKSGRCLFPLLRFRAVNLDPKALYSIALDIEQVSPNRFRFrNGR-WNPIDKdqrgdda 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  148 ------AEPHVLGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTL------------------DQEGHIILHSMHRYLPR 203
Cdd:cd20683     81 fssgtaDKSVLLPESYIHPDGPQTGAFWMANGISFAKIKLSNRQPnssdrdgpkenitnsisaLPDGHFFLTSFHKYQPR 160
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1387194813  204 LHLVPAEKATEVIQLngpgVHTFTFPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRDD 261
Cdd:cd20683    161 LHLIQHSAGDHDDIL----STTFTFEETEFIAVTHYQNEKVNILKKDYNPHAKGFKDD 214
T-box_TBXT cd20202
DNA-binding domain of T-box transcription factor T and related T-box proteins; TBXT, also ...
75-260 8.56e-57

DNA-binding domain of T-box transcription factor T and related T-box proteins; TBXT, also known as Brachyury protein, or protein T, is a transcription factor needed for posterior mesoderm formation and differentiation as well as for the notochord development during embryogenesis. It binds to a 24 base-pair (bp) palindromic site (called the T site) and activates gene transcription when bound to such a site. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. TBXT is the founding member of the T-box family, members of which share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410328  Cd Length: 179  Bit Score: 195.64  E-value: 8.56e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20202      2 LKVSLEESELWLRFKELTNEMIVTKNGRRMFPVLKVNVSGLDPNAMYSFLLDFVAADNHRWKYVNGEWVPGGKPEPQAPS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVpaekateviQLNGPG--VHTFTFPQTE 232
Cdd:cd20202     82 CVYIHPDSPNFGAHWMKAPVSFSKVKLTNK-LNGGGQIMLNSLHKYEPRIHIV---------RVGGPQrmITSHSFPETQ 151
                          170       180
                   ....*....|....*....|....*...
gi 1387194813  233 FFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20202    152 FIAVTAYQNEEITALKIKYNPFAKAFLD 179
T-box_TBX19-like cd20201
DNA-binding domain of T-box transcription factor 19 and related T-box proteins; Tbx19 (also ...
75-260 7.36e-56

DNA-binding domain of T-box transcription factor 19 and related T-box proteins; Tbx19 (also known as Tpit) is a T-box factor restricted to two pituitary (pro-opiomelanocortin) POMC-expressing lineages, the corticotrophs and melanotrophs; it controls terminal differentiation of these lineages. TBX19 activates POMC gene transcription with the cooperation of another transcription factor Pitx1. Mutations of the human TPIT gene cause early onset pituitary adrenocorticotrophic hormone (ACTH) deficiency. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410327  Cd Length: 183  Bit Score: 192.94  E-value: 7.36e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   75 ITVTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG 154
Cdd:cd20201      6 LQVSLEDAELWQRFKEVTNEMIVTKNGRRMFPVLKISVSGLDPNAMYSFLLDFAPADGHRWKYVNGEWVPAGKPEPHSHS 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVpaekateviQLNGPG--VHTFTFPQTE 232
Cdd:cd20201     86 CVYIHPDSPNFGAHWMKAPISFSKVKLTNK-LNGGGQIMLNSLHKYEPQIHIV---------RVGGPHrmVTNCSFPETQ 155
                          170       180
                   ....*....|....*....|....*...
gi 1387194813  233 FFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20201    156 FIAVTAYQNEEITALKIKYNPFAKAFLD 183
T-box_TBR2 cd20205
DNA-binding domain of T-box brain protein 2 and related T-box proteins; TBR2 (also known as ...
77-260 5.17e-55

DNA-binding domain of T-box brain protein 2 and related T-box proteins; TBR2 (also known as Eomesodermin, Eomes, or T-brain-2) is a member of the T-box family of transcription factors and is associated with neurogenesis, cardiogenesis and tumor immune response. This subfamily belongs to the T-box family of transcription factors which plays a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410331  Cd Length: 191  Bit Score: 191.05  E-value: 5.17e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813   77 VTLDNNNMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNMKYILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLG-R 155
Cdd:cd20205      4 VYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNITGLNPTAHYNVFVEVVLADPNHWRFQGGKWVTCGKADNNMQGnK 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813  156 VFIHPESPSTGHYWMHQPVSFYKLKLTNN---TLDQEGHIILHSMHRYLPRLHLVP-AEKATEviQLNGPG-VHTFTFPQ 230
Cdd:cd20205     84 VYVHPESPNTGAHWMRQEISFGKLKLTNNkgaNNNNTQMIVLQSLHKYQPRLHIVEvSEDGVE--DLNDSSkTQTFTFPE 161
                          170       180       190
                   ....*....|....*....|....*....|
gi 1387194813  231 TEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20205    162 NQFIAVTAYQNTDITQLKIDHNPFAKGFRD 191
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2365-2429 2.59e-33

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


Pssm-ID: 381481  Cd Length: 65  Bit Score: 123.74  E-value: 2.59e-33
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLT 2429
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
bHLHzip_MGA_like cd19682
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) ...
2365-2429 1.55e-21

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) family; The MGA family includes MGA, Schizosaccharomyces pombe ESC1 (spESC1) and similar proteins. MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites. spESC1 is a bHLHzip protein with homology to human MyoD and Myf-5 myogenic differentiation inducers. It is involved in the sexual differentiation process.


Pssm-ID: 381525 [Multi-domain]  Cd Length: 65  Bit Score: 90.41  E-value: 1.55e-21
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLT 2429
Cdd:cd19682      1 RLRHKKRERERRSELRELFDKLKQLLGLDSDEKASKLAVLTEAIEEIQQLKREEDELQKEKARLT 65
MGA_dom pfam16059
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1041-1082 1.28e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.28e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1387194813 1041 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1082
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1407-1708 7.62e-11

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 67.68  E-value: 7.62e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1407 STLSTVISKVASNAKVAasrkPRTLLPSTSNSKTASSSGTTTNRPgknlKAFVPAKRPIAARPSPGGVFTQFVMSKVGAl 1486
Cdd:pfam17823  128 QSLPAAIAALPSEAFSA----PRAAACRANASAAPRAAIAAASAP----HAASPAPRTAASSTTAASSTTAASSAPTTA- 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1487 qqKIPGVSTPQPLTGpqkfsIRPSPVMVVTPVVSSEPVQVcssvtaAVTTTTPQVFLENVPAVTPTTalsdVGTKETTYS 1566
Cdd:pfam17823  199 --ASSAPATLTPARG-----ISTAATATGHPAAGTALAAV------GNSSPAAGTVTAAVGTVTPAA----LATLAAAAG 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1567 SGATTAGVVEVSETNTSTLvTPTQSTAT----LNLIKTTGITT--PVASVAFPKSLVASPPTITLPVASTASTSIVVVTT 1640
Cdd:pfam17823  262 TVASAAGTINMGDPHARRL-SPAKHMPSdtmaRNPAAPMGAQAqgPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSV 340
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387194813 1641 AASSSMVTTPTS------SLSSVPIilsgidgsPPVSQRPENAPQIPVAPPQVSPNTVKRAGPRLLLIPVQQGS 1708
Cdd:pfam17823  341 ASTNLAVVTTTKaqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVAT 406
bHLHzip_Myc cd11400
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Myc family; The Myc family is a ...
2365-2442 9.59e-10

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Myc family; The Myc family is a member of the bHLHzip family of transcription factors that play important roles in the control of normal cell proliferation, growth, survival and differentiation. All Myc isoforms contain two independently functioning polypeptide chain regions: N-terminal transactivating residues and a C-terminal bHLHzip segment. The bHLHzip family of bHLH transcription factors are characterized by a highly conserved N-terminal basic region that may bind DNA at a consensus hexanucleotide sequence known as the E-box (CANNTG) followed by HLH and leucine zipper motifs that may interact with other proteins to form homo- and heterodimers. Myc heterodimerizes with Max enabling specific binding to E-box DNA sequences in the promoters of target genes. The Myc proto-oncoprotein family includes at least five different functional members: c-, N-, L-, S- and B-Myc (which is lacking the bHLH domain).


Pssm-ID: 381406 [Multi-domain]  Cd Length: 80  Bit Score: 57.17  E-value: 9.59e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKVSSL 2442
Cdd:cd11400      2 RRLHNVLERQRRNDLKNSFEKLRDLVpELADNEKASKVVILKKATEYIKQLQQEEKKLEKEKDKLKARNEQLRKKLERL 80
bHLHzip_spESC1_like cd19690
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Schizosaccharomyces pombe ESC1 (spESC1) ...
2365-2421 1.21e-09

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Schizosaccharomyces pombe ESC1 (spESC1) and similar proteins; spESC1 is a bHLHzip protein with homology to human MyoD and Myf-5 myogenic differentiation inducers. It is involved in the sexual differentiation process.


Pssm-ID: 381533  Cd Length: 65  Bit Score: 56.70  E-value: 1.21e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKL 2421
Cdd:cd19690      1 RVSHKLAERKRRKEMKELFEDLRDALPQERGTKASKWEILTKAISYIQQLKRHIREL 57
HLH pfam00010
Helix-loop-helix DNA-binding domain;
2364-2415 6.22e-09

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 54.00  E-value: 6.22e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1387194813 2364 YRRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLT 2415
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLpTLPPDKKLSKAEILRLAIEYIKHLQ 53
PHA03247 PHA03247
large tegument protein UL36; Provisional
1448-1848 6.95e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 6.95e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1448 TNRPGKNLKAFVPAKRPI--AARPSPGgvftqfvmsKVGALQQKIPGVSTPQPltgpqkfsiRPSPVMVVTPVvssepvq 1525
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRrrAARPTVG---------SLTSLADPPPPPPTPEP---------APHALVSATPL------- 2721
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1526 vcSSVTAAVTTTTPQVFLENVPAVTPTTALSDVGtkETTYSSGATTAGvvEVSETNTSTLVTPTQSTATLNLIKTTGITT 1605
Cdd:PHA03247  2722 --PPGPAAARQASPALPAAPAPPAVPAGPATPGG--PARPARPPTTAG--PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1606 ---PVASVAFPKSLVASPPTITLPVASTASTSIVVVTTAASSSMVTTPTSSLSSVPIILSGIDGSpPVSQRP--ENAPQI 1680
Cdd:PHA03247  2796 eslPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPpsRSPAAK 2874
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1681 PVAP----------PQVSPNTVKRAGPRLLLIPVQQGSPTLRPVPNTQLQGHRMVLQPVRSPSgmnLFRHPNGQIVQLLP 1750
Cdd:PHA03247  2875 PAAParppvrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP---RPQPPLAPTTDPAG 2951
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1751 LHQLRGSNNQPNLQPVMfrnPGSVMGIRLPTPS-KPSETPPSSASPSAFSVVNPVIQAVGSSPAMNVITQAP--SLLSS- 1826
Cdd:PHA03247  2952 AGEPSGAVPQPWLGALV---PGRVAVPRFRVPQpAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPpvSLKQTl 3028
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1387194813 1827 -GPNFVSQSGTLTLRIS-------------PPEPHS 1848
Cdd:PHA03247  3029 wPPDDTEDSDADSLFDSdsersdlealdplPPEPHD 3064
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1424-1735 1.09e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 57.28  E-value: 1.09e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1424 ASRKPRTLLPSTSNSKTASSSGTTTNRP-GKNLKAfvPAKRPIAARPSPGGVFTQFVMSKVGALQQKIPGVSTPQPltgP 1502
Cdd:pfam17823   65 AAPAPVTLTKGTSAAHLNSTEVTAEHTPhGTDLSE--PATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALP---S 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1503 QKFSIrPSPVMVVTPVVSSEPVQVCSSVTAAVTTTTPQVFLENVPAVTPTTALSDVGTKETTyssgATTAGVVEVSETNT 1582
Cdd:pfam17823  140 EAFSA-PRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS----SAPATLTPARGIST 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1583 STL--VTPTQSTATlnliKTTGITTPVasvafPKSLVASPPTITLPVASTASTSIVVVTTAA-----SSSMVT------- 1648
Cdd:pfam17823  215 AATatGHPAAGTAL----AAVGNSSPA-----AGTVTAAVGTVTPAALATLAAAAGTVASAAgtinmGDPHARrlspakh 285
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1649 TPTSSLSSVPIILSGIDGSPPVSQRPENAPQIPVAP---PQVSPNTVKRAGPRLLLIP---------VQQGSPTLRPVP- 1715
Cdd:pfam17823  286 MPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGeptPSPSNTTLEPNTPKSVASTnlavvtttkAQAKEPSASPVPv 365
                          330       340
                   ....*....|....*....|.
gi 1387194813 1716 -NTQLQGHRMVLQPVRSPSGM 1735
Cdd:pfam17823  366 lHTSMIPEVEATSPTTQPSPL 386
bHLH_SF cd00083
basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators ...
2372-2414 1.16e-07

basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. Members of the bHLH superfamily have two highly conserved and functionally distinct regions. The basic part is at the amino end of the bHLH that may bind DNA to a consensus hexanucleotide sequence known as the E box (CANNTG). Different families of bHLH proteins recognize different E-box consensus sequences. At the carboxyl-terminal end of the region is the HLH region that interacts with other proteins to form homo- and heterodimers. bHLH proteins function as a diverse set of regulatory factors because they recognize different DNA sequences and dimerize with different proteins. The bHLH proteins can be divided to cell-type specific and widely expressed proteins. The cell-type specific members of bHLH superfamily are involved in cell-fate determination and act in neurogenesis, cardiogenesis, myogenesis, and hematopoiesis.


Pssm-ID: 381392 [Multi-domain]  Cd Length: 46  Bit Score: 50.21  E-value: 1.16e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1387194813 2372 ERRRRGEMRDLFEKLKITLGLLH-SSKVSKSLILTRAFSEIQGL 2414
Cdd:cd00083      1 ERRRRDKINDAFEELKRLLPELPdSKKLSKASILQKAVEYIREL 44
bHLHzip_L-Myc cd11457
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in L-Myc and similar proteins; L-Myc, ...
2365-2444 1.48e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in L-Myc and similar proteins; L-Myc, also termed Class E basic helix-loop-helix protein 38 (bHLHe38), or protein L-Myc-1, or V-myc myelocytomatosis viral oncogene homolog, is a bHLHZip oncoprotein belonging to the Myc oncogene protein family. It binds DNA as a heterodimer with MAX. L-Myc is co-expressed with another Myc family member and has weaker transformation/transactivation activities. L-Myc knockout mouse did not exhibit any phenotypic abnormalities.


Pssm-ID: 381463 [Multi-domain]  Cd Length: 89  Bit Score: 48.64  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKVSSLS 2443
Cdd:cd11457      8 RKNHNFLERKRRNDLRSRFLALRDEVpGLASCSKTPKVVILSKATEYLRGLVSAERRMAAEKRQLKSRQQQLLRRIAQLK 87

                   .
gi 1387194813 2444 G 2444
Cdd:cd11457     88 G 88
bHLHzip_Mlx_like cd11404
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, ...
2365-2432 7.01e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, also termed Class D basic helix-loop-helix protein 13 (bHLHd13), or Max-like bHLHZip protein, or protein BigMax, or transcription factor-like protein 4, is a Max-like bHLHZip transcription regulator that interacts with the Max network of transcription factors. It forms a sequence-specific DNA-binding protein complex with some member of Mad family (Mad1 and Mad4) and Mondo family but not the Myc family and bind the E-box DNA to control transcription. The family also includes Saccharomyces cerevisiae INO4, which is a bHLH transcriptional activator of phospholipid synthetic genes (such as INO1, CHO1/PSS, CHO2/PEM1, OPI3/PEM2, etc.). It is required for de-repression of phospholipid biosynthetic gene expression in response to inositol deprivation in yeast.


Pssm-ID: 381410 [Multi-domain]  Cd Length: 70  Bit Score: 46.14  E-value: 7.01e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLTRKR 2432
Cdd:cd11404      3 RLNHVRSEKKRRELIKKGYDELCALVPGLDPQKRTKADILQKAADWIQELKEENEKLEEQLDELKEAA 70
HLH smart00353
helix loop helix domain;
2369-2420 1.08e-05

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 44.90  E-value: 1.08e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1387194813  2369 TANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADK 2420
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLpTLPKNKKLSKAEILRLAIEYIKSLQEELQK 53
bHLHzip_N-Myc_like cd11456
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in N-Myc and similar proteins; N-Myc, ...
2365-2439 3.48e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in N-Myc and similar proteins; N-Myc, also termed Class E basic helix-loop-helix protein 37 (bHLHe37), is a bHLHZip proto-oncogene protein that positively regulates the transcription of MYCNOS in neuroblastoma cells. It is also essential during embryonic development. N-Myc has a critical role in regulating the switch between proliferation and differentiation of progenitor cells. It binds DNA as a heterodimer with MAX. The family also includes S-Myc, encoded by rat or mouse intronless myc gene, which has apoptosis-inducing activity.


Pssm-ID: 381462 [Multi-domain]  Cd Length: 87  Bit Score: 44.51  E-value: 3.48e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKV 2439
Cdd:cd11456      6 RRNHNILERQRRNDLRSSFLTLRDHVpELVKNEKAAKVVILKKATEYVHSLQAEEQKLLLEKEKLQARQQQLLKKI 81
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1410-1721 3.81e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.53  E-value: 3.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1410 STVISKVASNA-----KVAASRKPRTLLPS-TSNSKTASSSGTTTNRPGKNlkaFVPAKRPIAARPSPGgVFTQFVMSKV 1483
Cdd:pfam05109  402 TLIITRTATNAtttthKVIFSKAPESTTTSpTLNTTGFAAPNTTTGLPSST---HVPTNLTAPASTGPT-VSTADVTSPT 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1484 GALQQKIPGVSTPQPltgpqkfSIRPSPVMVVTPVVSSEPVQVcssvtaavTTTTPQvflenvpAVTPTTALSDVGTKET 1563
Cdd:pfam05109  478 PAGTTSGASPVTPSP-------SPRDNGTESKAPDMTSPTSAV--------TTPTPN-------ATSPTPAVTTPTPNAT 535
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1564 TYSSGATTAGVVEVSETNTSTLVTPTQSTATLN-LIKTTGITTPVASVAFPKSLVASPP---------TITLPVASTAST 1633
Cdd:pfam05109  536 SPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNaTIPTLGKTSPTSAVTTPTPNATSPTvgetspqanTTNHTLGGTSST 615
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1634 SIVVVTTAASSSMVTT-----PTSSLSSVPIILSGIDG--SPPVSQRP-ENAPQIPVAPPQVSPNTVKRAGPRLLLIPVQ 1705
Cdd:pfam05109  616 PVVTSPPKNATSAVTTgqhniTSSSTSSMSLRPSSISEtlSPSTSDNStSHMPLLTSAHPTGGENITQVTPASTSTHHVS 695
                          330
                   ....*....|....*.
gi 1387194813 1706 QGSPTLRPVPNTQLQG 1721
Cdd:pfam05109  696 TSSPAPRPGTTSQASG 711
bHLHzip_Max cd11406
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, ...
2365-2412 2.56e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, also termed Class D basic helix-loop-helix protein 4 (bHLHd4), or Myc-associated factor X, is a bHLHZip transcription regulator that forms a sequence-specific DNA-binding protein complex with MYC or MAD which recognizes the core sequence 5'-CAC[GA]TG-3'. The MYC:MAX complex is a transcriptional activator, whereas the MAD:MAX complex is a transcriptional repressor. Max homodimer bind DNA but is transcriptionally inactive. Targeted deletion of max results in early embryonic lethality in mice.


Pssm-ID: 381412  Cd Length: 69  Bit Score: 41.57  E-value: 2.56e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQ 2412
Cdd:cd11406      2 RAHHNALERKRRDHIKDSFHSLRDSVPSLQGEKASRAQILKKATEYIQ 49
bHLHzip_Mad4 cd18929
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-associated protein 4 (Mad4) and ...
2364-2443 2.61e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-associated protein 4 (Mad4) and similar proteins; Mad4, also termed Max dimerization protein 4, or Max dimerizer 4 (MXD4), or Class C basic helix-loop-helix protein 12 (bHLHc12), or Max-interacting transcriptional repressor MAD4, is a bHLHZip Max-interacting transcriptional repressor that suppresses c-myc dependent transformation and is expressed during neural and epidermal differentiation. It is regulated by a transcriptional repressor complex that contains Miz-1 and c-Myc.


Pssm-ID: 381499 [Multi-domain]  Cd Length: 88  Bit Score: 42.30  E-value: 2.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 2364 YRRTHTANERRRRGEMRDLFEKLK--ITLGLLHSSKVSKSLiLTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKVSS 2441
Cdd:cd18929      2 NRSSHNELEKHRRAKLRLYLEQLKqlVPLGPDSTRHTTLSL-LKRAKMHIKKLEEQDRKALNIKEQLQREHRYLKRRLEQ 80

                   ..
gi 1387194813 2442 LS 2443
Cdd:cd18929     81 LS 82
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
1616-1783 4.49e-04

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 45.68  E-value: 4.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1616 LVASPPTitlpvasTASTSIVVVTTAASSSMVTTPTSSLSSVPIILS-----GIDGSPPVSQRPENAPQIPVAP-PQVSP 1689
Cdd:cd22536    276 LVSTPIT-------TASVSTMPESPSSSTTCTTTASTSLTSSDTLVSsaetgQYASTAASSERTEEEPQTSAAEsEAQSS 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1690 NTVKRAGprllLIPVQQGSptlrpvpnTQLQGHRMVLQPVRSpsgMNLFRHPNGQIVQLLPLH--QLRGSNN-------- 1759
Cdd:cd22536    349 SQLQSNG----LQNVQDQS--------NSLQQVQIVGQPILQ---QIQIQQPQQQIIQAIQPQsfQLQSGQTiqtiqqqp 413
                          170       180
                   ....*....|....*....|....*.
gi 1387194813 1760 QPNLQPVMFRNPGSVMgIRLP--TPS 1783
Cdd:cd22536    414 LQNVQLQAVQSPTQVL-IRAPtlTPS 438
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1609-1918 1.01e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1609 SVAFPKSLVASPPTITLPVASTASTSIVVVTTAASSSmvTTPTSSLSSVPiilsgIDGSPPVSQrPENAPQIPVAPPQVS 1688
Cdd:pfam03154  160 SSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAAT--AGPTPSAPSVP-----PQGSPATSQ-PPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1689 PNTVKRAGPRL-----LLIPVQQGSP----TLRPVPNTQLQG-----------------HRMVLQPVRSPSGMNLFRHPN 1742
Cdd:pfam03154  232 QQTPTLHPQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGqmppmphslqtgpshmqHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1743 GQIVQL---------LPLHQLRGSNNQPNLQPVMFRNPGSVMGIR---------LPTPSKPSETPPSSASPSAFSVVN-- 1802
Cdd:pfam03154  312 GPSPAApgqsqqrihTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlp 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1803 ---------------------------PVIQAVGSSPAM-NVITQAPSLLSSGPNFVSQSGTLTLRISPPEP-HSFTSkt 1853
Cdd:pfam03154  392 pppalkplsslsthhppsahppplqlmPQSQQLPPPPAQpPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPqHPFVP-- 469
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 1854 ASETKITYSSGGQPVGTASLIPLQSGSFALLQLPGqkPVPSSILQHVASLQMKRESQNADQKDET 1918
Cdd:pfam03154  470 GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEEALDEAEEPES 532
bHLHzip_USF3 cd18910
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix ...
2362-2422 1.10e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix domain-containing protein USF3 and similar proteins; USF3, also termed upstream transcription factor 3, is a bHLHzip protein that is involved in the negative regulation of epithelial-mesenchymal transition, the process by which epithelial cells lose their polarity and adhesion properties to become mesenchymal cells with enhanced migration and invasive properties.


Pssm-ID: 381480  Cd Length: 65  Bit Score: 39.59  E-value: 1.10e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1387194813 2362 AYYRRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLI 2422
Cdd:cd18910      3 EKKRESHNEVERRRKDKINAGINKIGELLPDRDAKKQSKNMILEQAYKYIVELKKKNDKLL 63
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1547-1655 1.14e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1547 PAVTPTTALSDVGTKETTYSSGATTAGVVEVSETNTSTLVTPTQSTATLNLIKTTGITTPVASVAFPKSLVAS-PPTITL 1625
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTtTSTTTT 171
                           90       100       110
                   ....*....|....*....|....*....|
gi 1387194813 1626 PVASTASTSIVVVTTAASSSMVTTPTSSLS 1655
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATTT 201
bHLHzip_c-Myc cd11458
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in c-Myc and similar proteins; c-Myc, ...
2365-2442 1.15e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in c-Myc and similar proteins; c-Myc, also termed Myc proto-oncogene protein, or Class E basic helix-loop-helix protein 39 (bHLHe39), or transcription factor p64, a bHLHZip proto-oncogene protein that functions as a transcription factor, which binds DNA in a non-specific manner, yet also specifically recognizes the core sequence 5'-CAC[GA]TG-3'. It activates the transcription of growth-related genes.


Pssm-ID: 381464 [Multi-domain]  Cd Length: 84  Bit Score: 40.25  E-value: 1.15e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKVSSL 2442
Cdd:cd11458      6 RRTHNVLERQRRNELKLSFFALRDQIpEVANNEKAPKVVILKKATEYILSMQADEQRLISEKEQLRRRREQLKHRLEQL 84
PHA03255 PHA03255
BDLF3; Provisional
1547-1691 1.31e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 42.97  E-value: 1.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1547 PAVTPTTALSDVGTKETTYSSGATTAGVVevseTNTSTLVTPTQSTATLNLIKTTGITTPVASVAFPKSLVASPPTITlp 1626
Cdd:PHA03255    53 PSTNQSTTLTTTSAPITTTAILSTNTTTV----TSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTS-- 126
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1387194813 1627 VASTASTSIVVVTTAASSSMVTTPTSSlssvpiilsgidgSPPVSQRPENAPQIPVAPPQVSPNT 1691
Cdd:PHA03255   127 NVTTRSSSTTSATTRITNATTLAPTLS-------------SKGTSNATKTTAELPTVPDERQPSL 178
bHLHzip_MLXIP_like cd11405
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MLX-interacting protein (MLXIP), ...
2365-2434 1.57e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MLX-interacting protein (MLXIP), MLX-interacting protein-like (MLXIPL) and similar proteins; The family includes MLXIP and MLXIPL. MLXIP, also termed Class E basic helix-loop-helix protein 36 (bHLHe36), or transcriptional activator MondoA, is a bHLHZip transcriptional activator that binds DNA as a heterodimer with Mlx. It binds to the canonical E box sequence 5'-CACGTG-3' and plays a role in transcriptional activation of glycolytic target genes. MLXIP is most highly expressed in skeletal muscle and functions as an indirect glucose sensor, by sensing glucose 6-phosphate and shuttling between the nucleus and the cytoplasm. MLXIPL, also termed carbohydrate-responsive element-binding protein (ChREBP), or Class D basic helix-loop-helix protein 14 (bHLHd14), or MLX interactor, or WS basic-helix-loop-helix leucine zipper protein (WS-bHLH), or Williams-Beuren syndrome chromosomal region 14 protein (WBSCR14), is a bHLHZip transcriptional factor integral to the regulation of glycolysis and lipogenesis in the liver. It forms heterodimers with the bHLHZip protein Mlx to bind the DNA sequence 5'-CACGTG-3'.


Pssm-ID: 381411 [Multi-domain]  Cd Length: 74  Bit Score: 39.57  E-value: 1.57e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLK---ITLGLLHSSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLtrKRNI 2434
Cdd:cd11405      4 RLSHISAEQKRRFNIKSGFDTLQsliPSLGQNPNQKVSKAAMLQKAAEYIKSLKRERQQMQEEAEQL--RQEI 74
bHLHzip_Mad cd11401
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Mad family; Members of the Mad ...
2365-2439 2.55e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Mad family; Members of the Mad family (Mad1, Mxi, Mad3, and Mad4) bear the bHLHzip domain (also known as basic-helix-loop-helix-leucine-zipper or bHLH-LZ domain), which mediates heterodimerization to Max and the sequence-specific DNA binding ability to E-box DNA. Mad family proteins can repress transcription at the E-box through their interaction with co-repressors. Mad family proteins antagonize Myc function in transactivation and transformation and they are growth/tumor suppressors. The developmental phenotypes of the individual Mad family member knockout mice are relatively mild- all these mice have been shown to be viable and normal.


Pssm-ID: 381407 [Multi-domain]  Cd Length: 76  Bit Score: 39.12  E-value: 2.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLK------------ITLGLlhsskvsksliLTRAFSEIQGLTDQADKLIGQKNLLTRKR 2432
Cdd:cd11401      1 RSTHNELEKNRRAHLRLCLERLKelvplgpdatrhTTLSL-----------LTKAKAYIKNLEDKEKRQRQQKEQLRREQ 69

                   ....*..
gi 1387194813 2433 NILIRKV 2439
Cdd:cd11401     70 RELKRRL 76
bHLHzip_Mlx cd19687
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) and similar ...
2365-2431 4.30e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) and similar proteins; Mlx, also termed Class D basic helix-loop-helix protein 13 (bHLHd13), or Max-like bHLHZip protein, or protein BigMax, or transcription factor-like protein 4, is a Max-like bHLHZip transcription regulator that interacts with the Max network of transcription factors. It forms a sequence-specific DNA-binding protein complex with some member of Mad family (Mad1 and Mad4) and Mondo family but not the Myc family and bind the E-box DNA to control transcription.


Pssm-ID: 381530 [Multi-domain]  Cd Length: 76  Bit Score: 38.17  E-value: 4.30e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLKITLGLLH------SSKVSKSLILTRAFSEIQGLTDQADKLIGQKNLLtRK 2431
Cdd:cd19687      3 REAHTQAEQKRRDAIKKGYDDLQDIVPTCQqqddigSQKLSKATILQRSIDYIQFLHQQKKKQEEELSAL-RK 74
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
1491-1717 5.14e-03

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 42.22  E-value: 5.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1491 PGVSTPQPLTGPQKFSIRPSPvmvvtpvvsSEPVQVCSSVTAAVTTTTPQVFLENVPAVTP---TTALSDVGTKETTYSS 1567
Cdd:cd22540    157 PVQVLQQPQQAHKPVPIKPAP---------LQTSNTNSASLQVPGNVIKLQSGGNVALTLPvnnLVGTQDGATQLQLAAA 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1568 GATTAGVVEVSETNTSTLVTPTQSTATLNLIKTTGITTPVASVAFpkSLVASPPTITLPVAStastSIVVVTTAASSSMV 1647
Cdd:cd22540    228 PSKPSKKIRKKSAQAAQPAVTVAEQVETVLIETTADNIIQAGNNL--LIVQSPGTGQPAVLQ----QVQVLQPKQEQQVV 301
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1387194813 1648 TTPTSSLSSVPiilSGIDGSPPVSQRPENAPQIPVAPPQVSPNTVKRAGPRLLLIPVQQG-SPTLRPVPNT 1717
Cdd:cd22540    302 QIPQQALRVVQ---AASATLPTVPQKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEApAATATPSSST 369
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1537-1721 6.96e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 6.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1537 TTPQVFLE-NVPAVTPTTALSDVGTKETTYSSGATTAGVVEVSETNTSTLVTPTQSTATLNLiKTTGITTPVASVAFPKS 1615
Cdd:COG3469     35 TAATATTVvSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATL-VATSTASGANTGTSTVT 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387194813 1616 LVASPPTITLPVASTASTSIVVVTTAASSSMVTTPTSSLSSVPIILSGIDGSPPVSQRPENAPQIPVAPPQVSPNTVKRA 1695
Cdd:COG3469    114 TTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180
                   ....*....|....*....|....*.
gi 1387194813 1696 GPRLLLIPVQQGSPTLRPVPNTQLQG 1721
Cdd:COG3469    194 TTPSATTTATTTGPPTPGLPKHVLVG 219
bHLH_ScINO2_like cd11388
basic helix-loop-helix (bHLH) domain found in Saccharomyces cerevisiae protein INO2 and ...
2365-2421 9.92e-03

basic helix-loop-helix (bHLH) domain found in Saccharomyces cerevisiae protein INO2 and similar proteins; INO2 is a positive regulatory factor required for depression of the co-regulated phospholipid biosynthetic enzymes in Saccharomyces cerevisiae. It is also involved in the expression of ITR1.


Pssm-ID: 381394  Cd Length: 68  Bit Score: 36.95  E-value: 9.92e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387194813 2365 RRTHTANERRRRGEMRDLFEKLkitLGLLH------SSKVSKSLILTRAFSEIQGLTDQADKL 2421
Cdd:cd11388      4 KWKHVEAEKKRRNQIKKGFEDL---INLINyprnnnEKRISKSELLNKAVDDIRGLLKANEQL 63
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH