Protein

Protein accession
79ywr [EnVhog]
Representative
4gqz6
Source
EnVhog (cluster: phalp2_4603)
Protein name
79ywr
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAAVGSFLSGMGNGFFAGRSITDRKADLALREKELANRNSGGAAQRSVLGGATGPNLSSSQSGAGRSLNLSGGISENARSAYQRLTSNGVSPVMASALVGNMMQESGAGLNTGAVGDNGNAYGAGQWNGPRKRAYLDFAQSRGSNPDDLNTQVDFLLHEGQTSEKSAWQAIMSANDPQEAARIASNKFWRPGVPHTENRMAYAQKIYDALGQQPTHETSGRSIVDGMGTGDYMVARYGGGR
Physico‐chemical
properties
protein length:241 AA
molecular weight:25226,5 Da
isoelectric point:9,25
hydropathy:-0,55
Representative Protein Details
Accession
4gqz6
Protein name
4gqz6
Sequence length
232 AA
Molecular weight
24778,20160 Da
Isoelectric point
6,76994
Sequence
MTLGAFAGFLQGAAGSIERKRDRAERRSLLDAAERMGRPDPSLAGSGGGGADAGRVGERSPAFVYDGEISDRPAYAYNYLTENGVSPIMASGLVGNLMQESGKDIDPAASGDNGNAFGSAQWNGPRMRAYMGYAKGRGAEPTDFKTQLDFLLHEGRTTEKAAWSAIAAAKTPEEAALIASGKFWRPGVPHNERRQGYATAVYGRFGAAAPEPEIASDEPTDWLWFRNRKGAK
Other Proteins in cluster: phalp2_4603
Total (incl. this protein): 10 Avg length: 230,9 Avg pI: 6,54

Protein ID Length (AA) pI
4gqz6 232 6,76994
6FhP8 211 5,51341
6J28C 210 5,32305
6PPLI 230 6,84304
6R1G5 224 6,10794
7s2ZZ 234 5,13361
80mvB 235 7,98952
8ehsL 260 6,53077
8iZi7 232 5,97534
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5269
8e5oF
72 40,5% 148 1.775E-36
2 phalp2_3546
4AJc3
7 27,4% 175 7.219E-15
3 phalp2_31702
3nLVw
4 24,8% 173 2.489E-11
4 phalp2_12958
3aMd2
55 30,0% 143 1.112E-10
5 phalp2_5996
6I23q
11 29,7% 158 2.724E-10

Domains

Domains
Phage_lys2
Disordered region
Representative sequence (used for alignment): 4gqz6 (232 AA)
Member sequence: 79ywr (241 AA)
1 232 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4gqz6) rather than this protein.
PDB ID
4gqz6
Method AlphaFoldv2
Resolution 77.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50