Protein

Protein accession
3buf0 [EnVhog]
Representative
d4bm
Source
EnVhog (cluster: phalp2_10112)
Protein name
3buf0
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTKRAFLTAVLNAATEARDHYRIPIVPAVVAAQAALESAWGQSELARAANNLFGVKAGKSWDGPTYTLPTREFDPERGWHTVPAAFRAYATWTECIRDYGDIIATRPWFRDAREAAIRGDAEGFLNGLLARPGREPGWATDPAYKDKILDIARRYGLLDGPARLLVDRLYLNGAEVEPVAVSLAETDTAGVKLYVTWRLLDRPNLLGRLRLAWQVFRAVGGRR
Physico‐chemical
properties
protein length:223 AA
molecular weight:24805,0 Da
isoelectric point:9,47
hydropathy:-0,23
Representative Protein Details
Accession
d4bm
Protein name
d4bm
Sequence length
223 AA
Molecular weight
24876,88650 Da
Isoelectric point
7,95928
Sequence
MAHIDAARQIFLDSIERYALEAARAQGVPVVPKAVAAQAALESDWGRSRLALEGNNLFGVKAGSSWRGPVIELPTWEVVDGHRIETVARFRRYEDFEAAVQDYVAIIGRLDWYRDAREAARYGDPYGFLYGLEGRGGEPGWATDPDYARKVLALMRTYNLLDGPHPVWADRVYLNGFNVGAERVSIAHTETAGVKVYVLARPKRMGLWTRLSGAWALVTGRWG
Other Proteins in cluster: phalp2_10112
Total (incl. this protein): 3 Avg length: 223,0 Avg pI: 8,98

Protein ID Length (AA) pI
d4bm 223 7,95928
38x3V 223 9,50581
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24548
7J4ML
47 48,2% 141 4.619E-39
2 phalp2_30438
7FbFs
6 38,5% 205 1.526E-27
3 phalp2_5346
2cQBB
648 40,5% 153 2.500E-26
4 phalp2_23271
4Y2T6
95 35,6% 157 2.066E-18
5 phalp2_14516
4U1py
1 39,7% 146 2.809E-18
6 phalp2_28108
7xE0g
17 33,1% 154 8.183E-17
7 phalp2_19667
4yvE1
134 32,4% 151 1.509E-16
8 phalp2_8561
1Ibzf
44 36,3% 154 3.776E-16
9 phalp2_374
7enmQ
47 33,9% 156 9.439E-16
10 phalp2_30397
4MmCW
1 34,2% 140 1.281E-15

Domains

Domains
GLUCO
Disordered region
Representative sequence (used for alignment): d4bm (223 AA)
Member sequence: 3buf0 (223 AA)
1 223 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (d4bm) rather than this protein.
PDB ID
d4bm
Method AlphaFoldv2
Resolution 79.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50