Protein

Protein accession
2zRsi [EnVhog]
Representative
1DG6C
Source
EnVhog (cluster: phalp2_5083)
Protein name
2zRsi
Lysin probability
94%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MRSTKALSPSKFFGGNRYTAYLDELTASGTIAGQKLSPSERKEGFKKRGDKINFEKFVNKVLEKKTGPAMSGGARALPGNGRGGAIVKSSGVTPFSTPPVSEKGQENIDDILKGIDSILETLKKEEKFKRQVSVKNRRKQESERRSSNEDKLENKSFKGLGNAISKVLKPVKSVFDSLMDFIINTLIGRFLTKFIDWFSDPKNEDKLNAIGRFLSDTWPALLAGFILFGTGLGGFLTSLVGLVTFFIPKLFKLTTSLLRFAGRNPMKALAISGVGALGYAAFTGTQASNDPERAAQGKTQLDDNIENLGGMMQNFSFFNAGGRVPGTGTKDTVPAMLTPGEFVMSRGAVSKFGMNTMSSMNSAGGGSGAPSLMSDGLLGYAQGGLVGGSPGNPRNPENRKIFLHWSGGFHNSIQGLPYHQTFSGSGKPASTNVNYGVDKYAHTAGHNTDSIGLGAAAMGHSGMSKNYYDEDKGWAENPITNAQTTAMAKEAAALLRAYGQTTTDVDKNVWTHGEWERHAVKKGLLDPPIQRWDLDSLTPPPYAKHPGGFWKTNQIYSDGGNKMRAKIKSFMTGAQIPVTSPEEPKSGNNLTGGAASMMSGSRPNNTNTSNANQPTITATKPKPTVKPLSAPDRAAFFKSIRELVDGPVPAPTSPSPGVNIPDLNAAMFQDPRKRQVLGIGG
Physico‐chemical
properties
protein length:681 AA
molecular weight:72877,8 Da
isoelectric point:9,77
hydropathy:-0,46
Representative Protein Details
Accession
1DG6C
Protein name
1DG6C
Sequence length
659 AA
Molecular weight
69353,93660 Da
Isoelectric point
9,75763
Sequence
MRSTKALSPSKFFGGNRYASYLDELTSSGTIAGQRLSPAERKEGFKKRADKINFEKFVNKVLEKKTGPAMGSTNRALPGGGRGGAIVKTSGNVAQSFVSSPVTEKTQENLDDVLKGIDSILETLRAEQKFKKQVVAKEKRKQERERRSASEDKLEKKAFSGLGKAVSKVLKPVKSIFDKLFDFIFTVLIGRVLIKLIDWFSDENNKGKINAIGRFLKDTWPAILAGFLLFGTGLGGFLKGLVGLVTFFIPKILKLTGKLISIAVRNPLKTLAVAGVGALGYAAYTGTQASNDPERAAQGKTQLDDNIENLGNMMQNISFYSAGGRVPGSGNADTVPAMLTPGEFVMSRGAVSKYGVGFMQGINSAGGGGRTSKPGYYSSGGEVPPNEEHGARISPDGSAKPNISASGVAKSSGSGGTLSLTAQDFRDLAYIVSAEAARNTDDEYGVAAAVLNRLTDPNWPNTIAAVGSQSGQFEAVYTGKAYDDPELAKKLASPQGQAKIAEALKILNGRTDFKGQSQLGNKGSTDPMFHPSGNFYHYTSQVGKSDPVPSNPPQNWRRLIGTGGPAVTLASTSGPSTTLSSGVTRGSGGGSSYSGNITAAKPKASVKPPSQADIKSMLDSIKLAVDAPMSITPSPGVSLPDLNAAVMHDPRKVKVLGIG
Other Proteins in cluster: phalp2_5083
Total (incl. this protein): 35 Avg length: 691,0 Avg pI: 9,51

Protein ID Length (AA) pI
1DG6C 659 9,75763
1B7rW 634 9,75028
1F3uY 766 9,78664
1RsEO 788 9,73713
1UM50 646 9,39564
1Ue0i 697 8,67469
1tsHw 597 9,75260
27lES 808 9,56358
2U5tY 696 8,88086
2f3MF 659 9,76711
2v5Qa 570 9,78342
2zODC 707 9,06891
2zYe6 655 9,93459
369dz 696 8,79679
3IOy0 806 9,80882
3J2sP 698 9,28707
3LZDy 731 9,40170
3M2np 705 9,17200
4DhP 671 9,87586
4M4Gf 697 8,88086
4RVsf 739 9,73287
6CnYX 746 9,67679
6JqZ4 594 9,61741
7EDMp 717 9,59955
7HfXE 732 9,49917
7IH9b 659 9,74834
7hwl 607 9,81075
8CxJ9 717 9,17174
8gAj7 673 9,89527
8pmp4 717 9,17174
8xAbs 705 9,17200
9jak 672 9,67537
Yyxd 597 9,75756
vHLO 742 9,52051
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7396
4DPZb
5 26,7% 646 6.924E-53
2 phalp2_38885
1VyAB
3 29,0% 455 1.464E-49
3 phalp2_35743
4atxr
25 25,2% 502 2.441E-41
4 phalp2_18050
3ARhs
32 25,3% 545 5.815E-39

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1DG6C) rather than this protein.
PDB ID
1DG6C
Method AlphaFoldv2
Resolution 61.26
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50