Protein

Protein accession: 21kc3 [EnVhog]
Representative: 8eGjW
Source: EnVhog (cluster: phalp2_20210)
Protein name: 21kc3
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MSIGHMKTSQNGIDLIKKFEGCRLTAYKDIVGVWTIGYGHTGSDVYSGLKITQAKAEALLKADLERFEKGVNRLDYIYDFNQNEFDALVSFSYNLGVGCLNQLTNNGMRSRQTIRSKWKAYCNAGGRYSRGLYNRRAAELDLFNKGNLSTVEPYLIPDGSCVLKYNTTRHDDVKDLQRALNDFAGVNLVVDGKFGPATRKAVMELQKRSGYLLVDGKYGKNTAAYLIKLARS
Physico-chemical properties
protein length: 232 AA
molecular weight: 25966,3 Da
isoelectric point: 9,43
hydropathy: -0,41
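The record does not say how the hydropathy value is computed. A common convention is GRAVY, the mean Kyte-Doolittle hydropathy index over all residues; the sketch below assumes that convention. The function names `parse_decimal_comma` and `gravy` are hypothetical helpers, not part of PhaLP. The sequence is copied from the record above.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_decimal_comma(text: str) -> float:
    """Convert a value printed with a decimal comma (e.g. '25966,3') to float."""
    return float(text.replace(",", "."))


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed GRAVY convention)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Protein sequence of 21kc3, copied from the record.
SEQUENCE = (
    "MSIGHMKTSQNGIDLIKKFEGCRLTAYKDIVGVWTIGYGHTGSDVYSGLKITQAKAEALL"
    "KADLERFEKGVNRLDYIYDFNQNEFDALVSFSYNLGVGCLNQLTNNGMRSRQTIRSKWKA"
    "YCNAGGRYSRGLYNRRAAELDLFNKGNLSTVEPYLIPDGSCVLKYNTTRHDDVKDLQRAL"
    "NDFAGVNLVVDGKFGPATRKAVMELQKRSGYLLVDGKYGKNTAAYLIKLARS"
)

print("length:", len(SEQUENCE), "AA")
print("GRAVY:", round(gravy(SEQUENCE), 2))
print("reported hydropathy:", parse_decimal_comma("-0,41"))
```

If the reported hydropathy is indeed a GRAVY score, the printed value should be close to the reported one; a mismatch would suggest the database uses a different scale or averaging window.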
Representative Protein Details
Accession: 8eGjW
Protein name: 8eGjW
Sequence length: 281 AA
Molecular weight: 31284,80290 Da
Isoelectric point: 5,79709
Sequence
MQIGKSGIALIKKWEGYHTKLPDGQCQAYLDRLVRPALRSPGYDGLWTIGYGCTEGVYEGLVWSEAEAEKHLLVEVNKHVGHVNRLLDGAEVNQNQFDALVSFSYNLGPGWLQKSDHPNGLLGLIKAGEHAKASNHFGSYVRAGGKVYKGLVNRRADEKKLFNTVSTKEVVVTSRRLSFTQNIRNFFATLSVGSFFTWQNFEQAKTFMSDNAGFILLGAGLTAFLGYKFIEYLSMNEYKEGRYTPSKQEEVPEAEVAEVDMPEEVLYGDEEQSGGPDHGTV
Other Proteins in cluster: phalp2_20210
Total (incl. this protein): 19; Avg length: 254,2 AA; Avg pI: 7,63

Protein ID  Length (AA)  pI
8eGjW       281          5,79709
1olMv       242          9,17761
2LN63       243          5,00737
36EWO       248          8,34693
39o87       243          4,88454
39wB4       243          4,88454
3Qv02       300          8,70357
4LBRJ       255          9,14872
4MnFZ       258          9,77162
4eSWn       264          6,09248
4gOQq       256          7,55853
4gcvO       245          9,53850
6ICxz       253          9,31376
7Pp13       254          5,19226
7PqFp       245          8,66676
8eGhr       255          8,62099
8tzI1       248          9,10863
vdRZ        264          5,80129
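The cluster summary can be cross-checked against the table. The sketch below is illustrative only: it copies the 18 listed rows, adds this protein (21kc3, 232 AA, pI 9,43) as the 19th member, and averages length and pI. The helper names `parse_decimal_comma` and `summarize` are hypothetical, and the values are parsed with decimal commas as printed on the page.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a value printed with a decimal comma (e.g. '5,79709') to float."""
    return float(text.replace(",", "."))


# Rows copied from the table: protein ID, length (AA), pI as printed.
CLUSTER_ROWS = """\
8eGjW 281 5,79709
1olMv 242 9,17761
2LN63 243 5,00737
36EWO 248 8,34693
39o87 243 4,88454
39wB4 243 4,88454
3Qv02 300 8,70357
4LBRJ 255 9,14872
4MnFZ 258 9,77162
4eSWn 264 6,09248
4gOQq 256 7,55853
4gcvO 245 9,53850
6ICxz 253 9,31376
7Pp13 254 5,19226
7PqFp 245 8,66676
8eGhr 255 8,62099
8tzI1 248 9,10863
vdRZ 264 5,80129
"""

# This protein is counted in the total but not listed in the table.
THIS_PROTEIN = ("21kc3", 232, "9,43")


def summarize(rows_text: str, extra: tuple) -> tuple:
    """Return (member count, mean length, mean pI) for the listed rows plus `extra`."""
    records = [extra]
    for line in rows_text.splitlines():
        protein_id, length, pi = line.split()
        records.append((protein_id, int(length), pi))
    count = len(records)
    mean_length = sum(r[1] for r in records) / count
    mean_pi = sum(parse_decimal_comma(r[2]) for r in records) / count
    return count, mean_length, mean_pi


count, mean_length, mean_pi = summarize(CLUSTER_ROWS, THIS_PROTEIN)
print(count, round(mean_length, 1), round(mean_pi, 2))  # prints: 19 254.2 7.63
```

The result agrees with the reported total of 19, average length of 254,2 and average pI of 7,63, which confirms that the summary includes 21kc3 itself.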
Similar Clusters (pHMM search)
#   Cluster               # Members  Identity (%)  Alignment Length  E-value
1   phalp2_2498 (5GMvl)   297        36,8%         182               4.399E-45
2   phalp2_33932 (23JqA)  85         35,2%         173               1.024E-36
3   phalp2_14481 (4Lj5N)  23         36,6%         169               7.811E-35
4   phalp2_23227 (7HiLX)  27         33,3%         189               1.450E-34
5   phalp2_26378 (1q5mp)  207        28,5%         256               1.975E-34
6   phalp2_9745 (82S7v)   113        32,9%         176               1.494E-32
7   phalp2_30561 (5EHHM)  5          38,2%         183               9.511E-32
8   phalp2_18164 (4uApd)  5          31,7%         170               1.272E-23
9   phalp2_16965 (86QY9)  61         28,9%         190               3.172E-23
10  phalp2_6008 (4MOEK)   54         29,5%         176               2.668E-22

Domains

Annotated regions: GH24, Disordered region
Representative sequence (used for alignment): 8eGjW (281 AA)
Member sequence: 21kc3 (232 AA)
Axis: 1 to 281 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00959

Taxonomy

        Name                            Taxonomy ID     Lineage
Phage   Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (8eGjW) rather than this protein.
PDB ID: 8eGjW
Method: AlphaFoldv2
Resolution: 79.28
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
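The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows, with two assumptions that the page does not state: scores lying exactly on a boundary (90, 70, 50) are assigned to the lower band, and the value 79.28 listed under "Resolution" is treated as a mean pLDDT purely for illustration. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    Assumption: a score exactly on a boundary falls into the lower band,
    because the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: 79.28 is interpreted here as a mean pLDDT, not a resolution.
print(plddt_band(79.28))  # prints: High
```

Per-residue scores from a model file could be passed through the same function to see which stretches of the representative structure fall into each band.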