Protein

Protein accession
RCEU [EnVhog]
Representative
4B2BD
Source
EnVhog (cluster: phalp2_33064)
Protein name
RCEU
Lysin probability
76%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
INVGARTFAIAPEGKPEPSTIMEDSPSQPPAKKFRIALDPGHSEWHTGAMGMAPDYPREEVMNVYFVNIVKSTLEATGRFVVDIKDMIEDHLDEIGAMAKGYDLFASCHHNALRRVDYYCTAMVHKSQASKGSIQFAMLASKEMAEAMGHKLFGGTDRDYPGVYFAGLSVLRAAEKTDCPACVLLEPYFLDAYGHMEAVQKRTSLAAQGFIRAILKHFDLT
Physico‐chemical
properties
protein length:221 AA
molecular weight:24409,7 Da
isoelectric point:5,98
hydropathy:-0,17
Representative Protein Details
Accession
4B2BD
Protein name
4B2BD
Sequence length
281 AA
Molecular weight
31092,39910 Da
Isoelectric point
6,01978
Sequence
MWYKVIVTENKDVIAVAFEGSDPIEKVTLLKYGAEVNIQALIRLLSTLKNNDLTYEETLFTTKTLAPIRKLMNEEGMFKSFTEPVVKKELDGRVIVLSPGHSEKEVGARGLAPDYPQEEDYNRLQVSIIASILTKSGAKVIIYDPSVDDLYAIGAKAKGADMFIDVHLNAYNRDKVDEYSCVMVHNRYKRDSDVKFAALCAARISTALGNPIFKGQSPRMPYGVYEAGLGVLNAATATGCPVCVLTEAFFIDAYGYNEIVKQRSIKAANAIAESVIAWFNR
Other Proteins in cluster: phalp2_33064
Total (incl. this protein): 3 Avg length: 242,0 Avg pI: 6,24

Protein ID Length (AA) pI
4B2BD 281 6,01978
41Cno 224 6,71271
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36993
itwj
7 37,0% 281 1.529E-44
2 phalp2_3504
4kB6G
5 39,1% 189 1.494E-32
3 phalp2_40500
4C5vv
18 31,3% 220 3.615E-22
4 phalp2_6106
5d1Yf
1 34,0% 194 1.017E-20
5 phalp2_40486
4Akhs
1 30,2% 192 9.338E-16
6 phalp2_24413
4m1OU
1246 25,0% 188 3.119E-09
7 phalp2_33525
85Tz
32 23,5% 187 2.375E-06
8 phalp2_21852
4t88M
271 23,8% 193 5.609E-06
9 phalp2_27233
3PKbz
18 22,1% 194 2.343E-05

Domains

Domains
Disordered region
Ami3
Representative sequence (used for alignment): 4B2BD (281 AA)
Member sequence: RCEU (221 AA)
1 281 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
RCEU
Method AlphaFoldv2
Resolution 86.42
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4B2BD) rather than this protein.
PDB ID
4B2BD
Method AlphaFoldv2
Resolution 79.18
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50