Protein

Protein accession
71ukk [EnVhog]
Representative
20BaY
Source
EnVhog (cluster: phalp2_8611)
Protein name
71ukk
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MKSKKEIYRRNIRKRHRRRALFRLAVSFFAYTVKIILTALVALFLMKASAAPEKEEPGQQAQASEEGVFYQEARQQATPQKGSDPASGPSFVPLDVPMSEEDQKAIFDICNDYKIAYTLVMAMIEHESSFDASARSKTGDSGLMQINDCNSARLAELGFTDLYNARENVEAAVYILRKLFNKYGEVEAVLMCYNMGEAGAAALWEEGIFSSVYSSEIMAREAEFSSYIDNFITNK
Physico‐chemical
properties
protein length:235 AA
molecular weight:26416,7 Da
isoelectric point:5,44
hydropathy:-0,28
Representative Protein Details
Accession
20BaY
Protein name
20BaY
Sequence length
217 AA
Molecular weight
25067,10550 Da
Isoelectric point
9,68903
Sequence
MNNTIMIKKCLVVMITLLVMTPNAYAHKNKKHVHKHSHKVEKFHKPVVKPKAVNELNYGRLYSWCKDQAHPKVNDTDLQRIVDHVLVKKDPLLLISMIDTESEFRKNAISKKHAIGLMQVRPSVWLSELRKEFPHIRNYNDLLNINNNIDAGEYILAKYIEQTGSLKKALYCYSGGSSRYVNKVLRTYYTVSVASYLAQPIYQQYLATINVIGKLLI
Other Proteins in cluster: phalp2_8611
Total (incl. this protein): 8 Avg length: 197,6 Avg pI: 7,08

Protein ID Length (AA) pI
20BaY 217 9,68903
2w8CY 172 7,70358
4hWBO 170 9,43857
4ijm7 162 9,79122
6t2Vn 214 5,49039
8fvs7 225 4,56965
fkUU 186 4,50576
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21370
238mK
3 33,1% 151 2.769E-15
2 phalp2_23989
3MMD6
17 24,8% 205 3.181E-14
3 phalp2_18090
3WXWI
1 33,1% 145 1.076E-13
4 phalp2_37921
4UYf9
2 28,0% 150 4.917E-13
5 phalp2_9885
2tkXx
10 25,7% 140 3.406E-11
6 phalp2_19999
6ULRQ
3 26,8% 145 5.114E-10
7 phalp2_10183
ujbY
7 28,2% 170 5.114E-10

Domains

Domains
Representative sequence (used for alignment): 20BaY (217 AA)
Member sequence: 71ukk (235 AA)
1 217 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (20BaY) rather than this protein.
PDB ID
20BaY
Method AlphaFoldv2
Resolution 79.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50