NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217356123|ref|XP_047273248|]
View 

protein Largen isoform X3 [Homo sapiens]

Protein Classification

Largen family protein( domain architecture ID 12172577)

Largen family protein containing a DUF4589 domain, similar to Homo sapiens protein Largen (also called proline-rich protein 16), a regulator of cell size that promotes cell size increase independently of mTOR and Hippo signaling pathways

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
2-234 3.03e-51

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


:

Pssm-ID: 464592  Cd Length: 232  Bit Score: 166.13  E-value: 3.03e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123   2 TDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRLTPVKCEDPkrvvptanpVK 78
Cdd:pfam15252  19 PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RLTPPESEDP---------GK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123  79 TNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNEKVQYHGYCPDCDTRYNIKN 158
Cdd:pfam15252  83 GPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSDKVLYHALCCDDDERYDEDN 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123 159 RevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATVPPPTAPKPQKTILRKSTTT 232
Cdd:pfam15252 156 R----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRRGKTTRNSSTQTVSDKSTQT 230

                  ..
gi 2217356123 233 TV 234
Cdd:pfam15252 231 TL 232
 
Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
2-234 3.03e-51

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


Pssm-ID: 464592  Cd Length: 232  Bit Score: 166.13  E-value: 3.03e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123   2 TDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRLTPVKCEDPkrvvptanpVK 78
Cdd:pfam15252  19 PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RLTPPESEDP---------GK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123  79 TNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNEKVQYHGYCPDCDTRYNIKN 158
Cdd:pfam15252  83 GPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSDKVLYHALCCDDDERYDEDN 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123 159 RevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATVPPPTAPKPQKTILRKSTTT 232
Cdd:pfam15252 156 R----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRRGKTTRNSSTQTVSDKSTQT 230

                  ..
gi 2217356123 233 TV 234
Cdd:pfam15252 231 TL 232
 
Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
2-234 3.03e-51

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


Pssm-ID: 464592  Cd Length: 232  Bit Score: 166.13  E-value: 3.03e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123   2 TDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRLTPVKCEDPkrvvptanpVK 78
Cdd:pfam15252  19 PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RLTPPESEDP---------GK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123  79 TNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNEKVQYHGYCPDCDTRYNIKN 158
Cdd:pfam15252  83 GPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSDKVLYHALCCDDDERYDEDN 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217356123 159 RevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATVPPPTAPKPQKTILRKSTTT 232
Cdd:pfam15252 156 R----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRRGKTTRNSSTQTVSDKSTQT 230

                  ..
gi 2217356123 233 TV 234
Cdd:pfam15252 231 TL 232
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH