Protein

Protein accession: 7ub4f [EnVhog]
Representative: 6KFU2
Source: EnVhog (cluster: phalp2_30650)
Protein name: 7ub4f
Lysin probability: 88%
PhaLP type: endolysin (probability: 95%, predicted by ML model)
Protein sequence
MSSWASHQRRSPRSSEPGTDFFCPIGTPVLAPANGRIYDTGNSIVPATGRWVGVDLDNGMRFRAMHLSSIRRASGFVRRGEVLGWSGASGYGEEDWSWNVAETGGAHTHVTLWPTHASNFGYDRNGRPYTVDFMDYADTSGSASGGGGDEDDMSARAEQQIDAVYKALFGPNNGETATTSPLGWQNVYGDVQSSAYGLLPIVIHNQTLIASQAGRLAAIEEVVEQLAQGSGAVLDMNAISAAAERGAKKALDGLVLTADVEG
Physico-chemical properties
protein length: 262 AA
molecular weight: 27784,2 Da
isoelectric point: 4,91
hydropathy: -0,32
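
The hydropathy value is presumably a grand average of hydropathy (GRAVY) on the Kyte-Doolittle scale. The page does not name the scale, so treat that as an assumption. Under it, a minimal sketch of the calculation could look like this; the function name `gravy` is illustrative, and the example uses only the first ten residues of the sequence above rather than the full protein:

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumed scale: the page does not say which one it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


prefix = "MSSWASHQRR"  # first ten residues of 7ub4f, for illustration only
print(round(gravy(prefix), 2))
```

A negative score indicates an overall hydrophilic sequence, which is consistent with the sign of the value reported above.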
Representative Protein Details
Accession: 6KFU2
Protein name: 6KFU2
Sequence length: 233 AA
Molecular weight: 25553,85790 Da
Isoelectric point: 4,64161
Sequence
MSYTTPADVRVSASYQSHIDRNPPSGEPGTDYATNYGTDLRMAGDGIVVDLSNSNDGGTGRFLAVDMYDGRRFRYLHLSEIMAYIGQHVFEGQRGIVWSGASGFGSDYGYGSHVHVTLFPTHAYNFGSTLDFELYAGEDDDMNAEQDNRLKNVENLLQVAGAGYGWPEVGGKASQDVQSRIYIYDENGNKLYDVFQLLTNTTNSINRLVYITGGIGLLSLAGIITLIVNQFVN
Other Proteins in cluster: phalp2_30650
Total (incl. this protein): 5
Avg length: 260,8
Avg pI: 5,33

Protein ID   Length (AA)   pI
6KFU2        233           4,64161
1uOFz        256           5,87644
40wdD        278           6,07105
5hWMA        275           5,16595
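
The cluster summary can be reproduced from the four listed members plus this protein (7ub4f, 262 AA, pI 4,91). The sketch below is only a consistency check, not the database's own code; decimal commas are written as points and the variable names are illustrative:

```python
from statistics import mean

# (protein ID, length in AA, pI) as listed on this page.
members = [
    ("7ub4f", 262, 4.91),     # this protein
    ("6KFU2", 233, 4.64161),  # cluster representative
    ("1uOFz", 256, 5.87644),
    ("40wdD", 278, 6.07105),
    ("5hWMA", 275, 5.16595),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Total: {len(members)}  Avg length: {avg_length:.1f}  Avg pI: {avg_pi:.2f}")
# prints: Total: 5  Avg length: 260.8  Avg pI: 5.33
```

The result matches the reported averages, which confirms that they include this protein.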
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_37829 (4HRho)   7           51,2%          154                1.106E-37
2   phalp2_993 (2vbyj)     4           32,1%          146                3.102E-12
3   phalp2_3160 (8arnU)    9           29,0%          162                2.535E-11
4   phalp2_28943 (5FpkK)   1           28,8%          170                1.525E-10
5   phalp2_26514 (41roI)   79          27,1%          173                1.040E-07
6   phalp2_33817 (1joRs)   29          25,1%          183                1.396E-07
7   phalp2_7549 (5jaLN)    16          28,6%          164                1.538E-04

Domains

Annotated features: PET_M23, Disordered region
Representative sequence (used for alignment): 6KFU2 (233 AA)
Member sequence: 7ub4f (262 AA)
Axis: 1 to 233 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

        Name                             Taxonomy ID      Lineage
Phage   Unknown from Metagenome [NCBI]   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 7ub4f
Method: AlphaFoldv2
Resolution: 74.95
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
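
The confidence legend can be expressed as a small lookup. The sketch below assumes that scores of exactly 90, 70, or 50 fall into the lower band, since the legend uses strict inequalities and leaves those boundaries undefined. The function name `plddt_band` is illustrative. The values 74.95 and 70.70 are the ones this page lists under "Resolution" for its AlphaFold models; reading them as mean pLDDT scores is an assumption, not something the page states.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: boundary scores (exactly 90, 70 or 50) go to the lower band,
    because the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values listed under "Resolution" on this page (assumed to be mean pLDDT).
for model_id, score in [("7ub4f", 74.95), ("6KFU2", 70.70)]:
    print(model_id, plddt_band(score))
```

Under that reading, both models fall in the "High" band, with the representative close to the lower edge.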

The structures below correspond to the cluster representative (6KFU2) rather than this protein.
PDB ID: 6KFU2
Method: AlphaFoldv2
Resolution: 70.70
Chain position: -