Protein

Protein accession
20C7L [EnVhog]
Representative
2AkIG
Source
EnVhog (cluster: phalp2_34681)
Protein name
20C7L
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MVLLATLSLLAIGTKGVAEAPNIPYAQKTFEEQVQETFKDDYQIMMAIFKAEGGIDKNGKPKLEAKNYNCFYYNEKGKRYSTYCKKEDRQKAWSVDCGIAQVNVKGQICPTRLVTIEGNIESAKKIKDEQGFEAWVVYKTGKYKKYL
Physico-chemical properties
protein length: 147 AA
molecular weight: 16743,1 Da
isoelectric point: 9,02
hydropathy: -0,55
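The length and hydropathy figures above can be cross-checked directly from the protein sequence. The sketch below assumes that the reported hydropathy is the GRAVY score (the mean Kyte-Doolittle hydropathy per residue). The record does not say which scale it uses, so this is an assumption, and the names `KYTE_DOOLITTLE`, `SEQ` and `gravy` are illustrative only. Decimal commas in the record are written as decimal points in the code.

```python
# Kyte-Doolittle hydropathy values per residue (standard published scale).
# ASSUMPTION: the record's "hydropathy" field is the GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of protein 20C7L, copied from the record.
SEQ = (
    "MVLLATLSLLAIGTKGVAEAPNIPYAQKTFEEQVQETFKDDYQIMMAIFK"
    "AEGGIDKNGKPKLEAKNYNCFYYNEKGKRYSTYCKKEDRQKAWSVDCGIA"
    "QVNVKGQICPTRLVTIEGNIESAKKIKDEQGFEAWVVYKTGKYKKYL"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(len(SEQ))              # sequence length in residues
    print(round(gravy(SEQ), 2))  # GRAVY rounded to two decimals
```

Under that assumption the script prints 147 and -0.55, which agrees with the length and hydropathy listed above.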
Representative Protein Details
Accession
2AkIG
Protein name
2AkIG
Sequence length
150 AA
Molecular weight
17191,71970 Da
Isoelectric point
9,21377
Sequence
MWYKLIVVVLVLASLLGYKPQRISADDPVITFEQHMTRIFGDKTEIAIAVLKHESGLRLDAKNYNCFYYNKNGKRYSTFCKPSDYGDAWSVDCGIAQINVPGTICPKRLLTLEGNVEQVEKIYREQGLNAWVSYTNGKYKQFLKKKPLTT
Other Proteins in cluster: phalp2_34681
Total (incl. this protein): 13; Avg length: 152,0 AA; Avg pI: 8,34

Protein ID Length (AA) pI
2AkIG 150 9,21377
1Xdl1 140 9,33136
1l143 158 6,14506
1leZk 162 9,09779
1luIR 139 9,24627
2kHUV 133 5,68710
2tgcx 192 8,92605
2ttXq 160 7,66226
3Mnxm 141 8,31624
4kHOx 145 8,78003
4wOtA 160 8,64510
UIj0 149 8,32785
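The cluster summary (13 proteins, average length 152,0, average pI 8,34) can be reproduced from the table once this protein (20C7L, 147 AA, pI 9,02) is added to the 12 listed members. The sketch below is a plain arithmetic check; the variable names are illustrative and decimal commas are written as decimal points.

```python
# (protein ID, length in AA, pI) for the 12 members listed in the table.
listed_members = [
    ("2AkIG", 150, 9.21377), ("1Xdl1", 140, 9.33136), ("1l143", 158, 6.14506),
    ("1leZk", 162, 9.09779), ("1luIR", 139, 9.24627), ("2kHUV", 133, 5.68710),
    ("2tgcx", 192, 8.92605), ("2ttXq", 160, 7.66226), ("3Mnxm", 141, 8.31624),
    ("4kHOx", 145, 8.78003), ("4wOtA", 160, 8.64510), ("UIj0", 149, 8.32785),
]

# The protein described by this record is counted in the total but not listed.
this_protein = ("20C7L", 147, 9.02)

cluster = listed_members + [this_protein]

total = len(cluster)
avg_length = sum(length for _, length, _ in cluster) / total
avg_pi = sum(pi for _, _, pi in cluster) / total

if __name__ == "__main__":
    print(total, round(avg_length, 1), round(avg_pi, 2))
```

The result is 13 members, an average length of 152.0 AA and an average pI of 8.34, matching the summary line.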
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2658 (6W2SL) 17 42,9% 121 1.975E-31
2 phalp2_1216 (gj2Y) 3 38,7% 116 5.728E-29
3 phalp2_5112 (1M0iw) 1 31,1% 135 2.296E-17
4 phalp2_10848 (4hnrw) 1 25,8% 143 1.299E-12
5 phalp2_14404 (4wK5S) 1 26,2% 118 4.518E-12
6 phalp2_2050 (2D0kD) 12 26,8% 119 7.435E-11
7 phalp2_5082 (1Dz46) 1 24,5% 122 1.384E-10
8 phalp2_14996 (ECkt) 4 26,1% 111 4.793E-10
9 phalp2_8005 (5nhTA) 1 27,3% 117 1.216E-09
10 phalp2_33305 (6dOqy) 114 29,0% 100 1.447E-08

Domains

Domain map (figure)
Annotated regions: Disordered region, Unannotated
Representative sequence (used for alignment): 2AkIG (150 AA)
Member sequence: 20C7L (147 AA)
Axis: positions 1 to 150 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage: Unknown from Metagenome; Taxonomy ID: UNKNOWN_ENVHOG [NCBI]; Lineage: No lineage information
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2AkIG) rather than this protein.
PDB ID 2AkIG
Method AlphaFold v2
Resolution 88.26
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
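The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of the exact boundary values 90, 70 and 50 (assigned here to the lower band) is an assumption rather than something the record specifies.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    ASSUMPTION: scores exactly equal to 90, 70 or 50 fall into the lower band,
    since the legend only gives strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Illustrative scores only; they are not taken from the record.
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```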