Protein

Protein accession: 4qAEd [EnVhog]
Representative: 4LFwQ
Source: EnVhog (cluster: phalp2_21923)
Protein name: 4qAEd
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MATPMTADQFLRQLQKWGVAYVEAAGWRTRNRNAAGLWGPVTGVLLHHTGDDAPDSADQRVLTEGRADLPGPLCHWGMRDDGTVVLIGWGRANHAGRGAANVRNALLTEGYGTRPPFPGEDTVDGNQIFYGQETMYSGAQPPTSVAYAATVRVFAAVCDFHNWSGRSCIGHREWTARKPDPGHLDMTQFRADVDALLEAGPEGDDVATLDGDDLTAIRKQVFGAIQDYLERGGPQPGGTSPTRDIGDGAQSTLVRMIEQGAKASLAGKTVATDVRTAVLALTGQVASLDPVDEGELAAQLAPILAAALPTQIVHFTAGDLDSIATAVADEQARRQQE
Physico-chemical properties
Protein length: 337 AA
Molecular weight: 35803.4 Da
Isoelectric point: 4.90
Hydropathy: -0.30
Representative Protein Details
Accession: 4LFwQ
Protein name: 4LFwQ
Sequence length: 329 AA
Molecular weight: 35630.33770 Da
Isoelectric point: 9.10843
Sequence
MTPDQFLGALRKWSVQVKEFPGWRTRTRPYSFDSVNGVVIHHTGSDAQSESYDEWLFTVGRPDEGIPGPLAHATIDFDGDFHLGAAGTANHAGKGSSATLSKVVNENYDGYSAEIKPGSDNTNGNTHFYGFEVKYDGDQPMTPQQYRTAVLAAAAICDFYGWSALSVIGHREWTGRKNDPGNNPMTKFRADVAAALKAGPPGAAKPVVPVDQPGAAVPGHPNPPKGPDGRYLLDASVLAAQTLNDQASLAGKPRPYRLNTVQLAQVGLNRRTLQILLGRKGVKWQSNWSTKALVQGYQTYYLGRKGSTGIADTRTCKALAKAANYYYQD
Other Proteins in cluster: phalp2_21923
Total (incl. this protein): 4; Avg length: 307.3 AA; Avg pI: 6.36

Protein ID  Length (AA)  pI
4LFwQ       329          9.10843
6GE1n       276          5.58792
6SLHF       287          5.84671
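The cluster summary counts this protein (4qAEd) together with the three members listed in the table. As a quick consistency check, the following minimal sketch recomputes the averages from the lengths and pI values given in this record. The variable and function names are illustrative and not part of the database.

```python
# (protein ID, length in AA, isoelectric point), copied from this record.
members = [
    ("4qAEd", 337, 4.90),     # this protein (physico-chemical properties)
    ("4LFwQ", 329, 9.10843),  # cluster representative
    ("6GE1n", 276, 5.58792),
    ("6SLHF", 287, 5.84671),
]


def cluster_averages(rows):
    """Return (mean length, mean pI) over all cluster members."""
    n = len(rows)
    avg_len = sum(length for _, length, _ in rows) / n
    avg_pi = sum(pi for _, _, pi in rows) / n
    return avg_len, avg_pi


avg_length, avg_pi = cluster_averages(members)
print(f"members={len(members)} avg_length={avg_length} avg_pI={round(avg_pi, 2)}")
```

The mean length comes out as 307.25 AA, which matches the reported average if the page rounds half up to one decimal place; the mean pI rounds to 6.36.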
Similar Clusters (pHMM search)
#   Cluster (protein ID)    # Members  Identity (%)  Alignment Length  E-value
1   phalp2_30257 (4gSoB)    83         50.4          238               4.171E-67
2   phalp2_36765 (6EBV3)    40         42.6          218               7.295E-56
3   phalp2_7405 (4FizF)     295        46.5          232               1.059E-53
4   phalp2_19071 (1jII3)    225        40.3          265               3.236E-52
5   phalp2_40432 (4fxAl)    3          35.6          202               2.898E-30
6   phalp2_37674 (4urEs)    29         30.0          233               4.375E-24
7   phalp2_17841 (gsY5)     11         31.4          235               4.655E-20
8   phalp2_31672 (4Kwyh)    4          29.9          234               1.738E-17
9   phalp2_15221 (Q1Wv)     34         31.4          235               5.648E-17
10  phalp2_30116 (3dGzs)    45         31.5          225               1.064E-15
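To work with the hit list programmatically, the sketch below stores the ten rows as records and keeps only the stronger hits. The thresholds (identity of at least 40% and E-value of at most 1e-50) are arbitrary illustrative choices, not cutoffs used by the database, and the `Hit` type and `filter_hits` function are hypothetical names.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


# Rows copied from the similar-clusters table in this record.
hits = [
    Hit("phalp2_30257", 83, 50.4, 238, 4.171e-67),
    Hit("phalp2_36765", 40, 42.6, 218, 7.295e-56),
    Hit("phalp2_7405", 295, 46.5, 232, 1.059e-53),
    Hit("phalp2_19071", 225, 40.3, 265, 3.236e-52),
    Hit("phalp2_40432", 3, 35.6, 202, 2.898e-30),
    Hit("phalp2_37674", 29, 30.0, 233, 4.375e-24),
    Hit("phalp2_17841", 11, 31.4, 235, 4.655e-20),
    Hit("phalp2_31672", 4, 29.9, 234, 1.738e-17),
    Hit("phalp2_15221", 34, 31.4, 235, 5.648e-17),
    Hit("phalp2_30116", 45, 31.5, 225, 1.064e-15),
]


def filter_hits(rows, min_identity=40.0, max_evalue=1e-50):
    """Keep hits meeting both the identity and E-value thresholds (assumed values)."""
    return [h for h in rows if h.identity_pct >= min_identity and h.evalue <= max_evalue]


strong = filter_hits(hits)
for h in strong:
    print(f"{h.cluster}\t{h.identity_pct}%\t{h.evalue:.3e}")
```

With these example thresholds only the first four clusters in the table remain.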

Domains

Regions shown: Ami2, Unannotated
Pfam accessions: PF01510
Representative sequence (used for alignment): 4LFwQ (329 AA)
Member sequence: 4qAEd (337 AA)
Domain positions follow the representative sequence (residues 1 to 329); the member sequence is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4qAEd
Method: AlphaFoldv2
Resolution: 79.33
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
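
The confidence legend above can be expressed as a small lookup. This is a minimal sketch under two assumptions that the record does not state: that the value listed under Resolution for an AlphaFold model is a mean pLDDT score, and that scores falling exactly on a boundary (90, 70, 50), which the legend leaves unassigned, belong to the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: boundary values (exactly 90, 70 or 50) fall into the lower
    band, because the legend uses strict inequalities on both sides.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# 79.33 is the value listed under Resolution for 4qAEd, treated here
# as a mean pLDDT score (an assumption).
print(plddt_band(79.33))
```

Under these assumptions the model for 4qAEd falls into the High band.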

The structures below correspond to the cluster representative (4LFwQ) rather than this protein.
PDB ID: 4LFwQ
Method: AlphaFoldv2
Resolution: 89.58
Chain position: -