Protein

Protein accession
4oYLg [EnVhog]
Representative
4e6HN
Source
EnVhog (cluster: phalp2_21800)
Protein name
4oYLg
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDVDYVRPCKTHEVRDNFDDHVKRHSKMPGLDYACKTGDKVFATAKGTIVSCSNNPDQVLGKNIAIRHPDGKHSYYLHLSKLEVRNGQRVKAGEVIALSGNTGTTSTGPHLHFAIKNARGTFIDPKKLLRKEIAEKRAEAAAIIPPVVDVVAEVIPE
Physico‐chemical
properties
protein length:157 AA
molecular weight:17263,6 Da
isoelectric point:9,13
hydropathy:-0,44
Representative Protein Details
Accession
4e6HN
Protein name
4e6HN
Sequence length
155 AA
Molecular weight
16774,86900 Da
Isoelectric point
9,42736
Sequence
MKLDYRRPCKTHTVRDNFEDHKKRGSNLPGLDYACNTGDQVYATADGMILSVSHLDNSASGINIVIRHPDGQKSYYLHLSRILVGVGKRVKAGDLIAKSGNTGHSTGPHLHFSIRNNKGVCVDPQKVIDGPRKPKGPAVKTSTVELPPAPVDAAE
Other Proteins in cluster: phalp2_21800
Total (incl. this protein): 71 Avg length: 156,8 Avg pI: 9,08

Protein ID Length (AA) pI
4e6HN 155 9,42736
16mXN 128 10,11743
17LYu 150 9,76388
17Nse 139 9,85897
19NSi 160 9,73635
1JPGL 165 10,20207
1JU2c 156 9,15227
1K1z1 168 9,35664
1K7tE 159 9,09754
1K9j7 157 8,90806
1KdU3 157 6,28721
1LcUk 146 9,62270
1M1El 142 9,44302
1haky 164 7,80759
1xRtE 161 9,14756
1ywlM 157 9,15253
25NmX 155 8,84940
2X1CF 146 9,72817
2d741 158 9,86252
2hDNS 137 9,74241
30LZK 162 8,89453
30vuh 160 6,74755
32zCF 157 9,93137
372ay 133 9,94388
38eeG 156 9,15227
3bqAf 157 7,13042
46ATU 155 8,49360
46Ihr 157 8,75985
46hLy 161 9,35019
46uhs 155 7,75178
46wm5 139 10,01815
490oZ 157 7,79411
495Jj 139 9,56280
49egQ 157 8,76095
4IW7C 164 7,80759
4IWeV 155 7,79411
4J9LV 157 8,95938
4advM 188 9,84053
4afp8 164 9,67240
4bZEJ 157 8,75992
4efdm 163 9,60497
4glmq 162 9,33239
4r6Hq 164 8,97511
4rmTU 158 9,12081
4rrA9 161 6,74721
52tH7 161 9,35019
54mp5 135 10,05044
55C2z 160 9,35058
55qx0 155 9,54121
56vp1 164 9,60361
57PCh 165 9,78619
57T2M 174 10,07926
57VOb 156 9,83377
5E6qb 161 9,18547
5bAIl 157 9,17264
5eshr 159 9,18573
5f6ra 163 9,69374
5kUWp 159 6,85986
5l5iR 142 9,61348
5l7PT 157 8,86738
5lAVH 162 9,18515
5m0cA 159 8,99026
5vGMD 165 9,70818
6ADRT 161 9,35019
6IsKl 160 9,94794
6xK1n 158 10,05083
82AZ2 157 8,97498
GZpL 158 10,11311
S7of 162 6,43516
dZz0 166 8,79273
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3247
2k2so
54 44,0% 109 7.738E-30
2 phalp2_18623
3ebV
83 45,5% 101 3.379E-28
3 phalp2_34915
4pbkX
159 41,6% 137 1.629E-27
4 phalp2_40658
5cY9d
663 43,3% 106 6.411E-25
5 phalp2_24104
8fsA2
7 38,0% 113 1.650E-21
6 phalp2_30262
4ib6k
5 30,1% 136 7.921E-21
7 phalp2_22862
2DiVF
1 34,6% 124 1.084E-20
8 phalp2_28378
1lkGC
5 37,4% 139 1.483E-20
9 phalp2_29190
3zo1
2 35,4% 124 3.801E-20
10 phalp2_38962
7VmTn
1 36,5% 104 3.412E-19

Domains

Domains
Representative sequence (used for alignment): 4e6HN (155 AA)
Member sequence: 4oYLg (157 AA)
1 155 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4e6HN) rather than this protein.
PDB ID
4e6HN
Method AlphaFoldv2
Resolution 89.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50