Protein

Protein accession
4K66F [EnVhog]
Representative
4gCgU
Source
EnVhog (cluster: phalp2_5617)
Protein name
4K66F
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNYKIKRIVNRNLDTRYNHYIALKRQTRKKSKAEKVVLLVLINICVYSFLFPVMNSRRVVTFTSPIESAQASEYSPMCGLDVVECDNEPKKELKEKSIESMIYDTFPESPAIALAIAKAESGLNHHAVNHGNRNGSKDCGIYQINSVHRPTVEQCTNPAANIALARKIYDSRGDWSAWSAYNNGAYLRHIK
Physico‐chemical
properties
protein length:191 AA
molecular weight:21664,6 Da
isoelectric point:9,35
hydropathy:-0,44
Representative Protein Details
Accession
4gCgU
Protein name
4gCgU
Sequence length
178 AA
Molecular weight
19716,51590 Da
Isoelectric point
6,60301
Sequence
MKVQTKIKIERFKIGCKKLCLVLLGIVIGACWFFVLTEGKEFYVSSVARDVVVIAPLAEASTAPTDTDNGVKEGEANGGQPATSLSIEEKIRQMFPEKPDIMLAIAKAESKLNPHVVNRGNSNGTIDCGIFQINSVWGYDEEFLKNEDNNLKIARIVYEKQGITAWASYNNGAYLKWL
Other Proteins in cluster: phalp2_5617
Total (incl. this protein): 6 Avg length: 195,3 Avg pI: 7,87

Protein ID Length (AA) pI
4gCgU 178 6,60301
2dmqq 187 9,49885
36vA7 198 5,27281
4H8fo 238 8,79241
6DbOx 180 7,70830
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10848
4hnrw
1 35,5% 149 7.965E-17
2 phalp2_23720
IqL3
1 36,6% 120 9.548E-16
3 phalp2_12888
2t72B
3 31,7% 148 6.343E-13
4 phalp2_17407
4NSsy
3 31,5% 130 8.636E-13
5 phalp2_8905
3cvac
3 28,9% 169 1.600E-12
6 phalp2_32794
2wOgV
1 26,1% 168 1.600E-12
7 phalp2_2152
3Guf6
27 30,8% 136 5.489E-12
8 phalp2_37679
3XmBc
1 25,5% 133 2.536E-09
9 phalp2_27607
5OcqN
31 25,0% 124 6.338E-09
10 phalp2_40223
2uoeL
5 24,1% 112 2.246E-05

Domains

Domains
Disordered region
SLT_3
Representative sequence (used for alignment): 4gCgU (178 AA)
Member sequence: 4K66F (191 AA)
1 178 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18896

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4gCgU) rather than this protein.
PDB ID
4gCgU
Method AlphaFoldv2
Resolution 74.46
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50