Protein

Protein accession
6UbUq [EnVhog]
Representative
2WD4c
Source
EnVhog (cluster: phalp2_21656)
Protein name
6UbUq
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSVNAQKAFLSMLGLPYRFAWQRKSSWKKFQHASAWYAFDTAASKVGALVVDGIPGPKTQVHVAASKRHHNRVAPNFALREFACPCGGRNAGCKGVWFSHVMVRKLQQVRADLYPSGLAILSGYRCRKVNASLPGSVPNSAHLDGLAADIPQRVTGWQLHRYGFHGIEVRRSTGKVSHVDLRADLKVNTVFYV
Physico‐chemical
properties
protein length:193 AA
molecular weight:21356,4 Da
isoelectric point:10,39
hydropathy:-0,19
Representative Protein Details
Accession
2WD4c
Protein name
2WD4c
Sequence length
183 AA
Molecular weight
20169,37370 Da
Isoelectric point
10,24649
Sequence
VAKIMSVTRQIATLKRLGLPYLTKAQRKASWIVFQEAQTWTAPLDPDGVPGPLTTAAVNMAAKHNYRVSPHFHLREFACPHCGKVKVARKLLVALEKVRKRNYPTGLRIVSGYRCAKHNAKVGGIPTSAHLKGEAADIPAKFKPDNFKGLGLHGIGYKARHGLVTHVDVKTGLKANTIFREDY
Other Proteins in cluster: phalp2_21656
Total (incl. this protein): 29 Avg length: 192,6 Avg pI: 10,21

Protein ID Length (AA) pI
2WD4c 183 10,24649
1IlqT 183 11,00967
1joPC 210 9,93795
1lI6x 185 9,89462
1osRl 195 10,81788
1otfG 195 10,81788
24T0j 181 7,83428
2K4bK 160 8,98581
2QxsW 202 10,44918
2QyQI 204 9,76266
2TivI 210 9,90326
38zqf 188 10,14966
3Mp22 189 10,03897
3XhOg 193 10,84379
4Ik0U 192 10,66186
4Ssj5 181 9,96077
4Vnlz 198 9,86561
5dhuR 180 11,05286
5km2D 195 10,17216
6Hvhq 209 10,14070
6N8Aq 199 10,56484
6U7H8 185 11,39519
7Gf5K 191 10,16694
7ohWI 190 10,81388
7siMy 190 10,69377
TzUk 210 9,93795
dDC2 207 9,76446
pINo 186 9,80443
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11075
4YH66
22 39,5% 162 8.826E-42
2 phalp2_18374
5Bl31
3 36,4% 148 2.844E-34
3 phalp2_20618
4JRfo
7 30,6% 173 1.673E-29
4 phalp2_29133
79Li3
3905 45,5% 123 7.035E-25
5 phalp2_14040
85wjf
1353 43,7% 128 2.366E-21
6 phalp2_18632
8g8s
5 36,0% 136 3.911E-20
7 phalp2_24008
45fcF
278 27,5% 156 8.794E-19
8 phalp2_22651
21uyP
33 28,7% 139 2.806E-15
9 phalp2_8512
1lzj9
23 29,0% 179 5.209E-15
10 phalp2_14159
2nMFq
1 26,0% 146 9.892E-13

Domains

Domains
Representative sequence (used for alignment): 2WD4c (183 AA)
Member sequence: 6UbUq (193 AA)
1 183 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2WD4c) rather than this protein.
PDB ID
2WD4c
Method AlphaFoldv2
Resolution 96.99
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50