Protein

Protein accession
8n1sx [EnVhog]
Representative
i4nR
Source
EnVhog (cluster: phalp2_21087)
Protein name
8n1sx
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDLDRKIFFDEIRGSVFEGSMTEKQVEGLDILLNIWEDDFSSYPDEFLAYCLGTTYHETARTMQPIREYGRGRGRKYGRPDPKTGQTYYGRGFVQLTWLFNYARAGRKLSMDLVAKPDKVMIPEVAATILYRGCIEGWFTTKKLSDYITETSCNYRQARRIVNGMDKATLIAGYAKKFAKAIEEARHEPVITAEKLEKAGSRTAKAARKGTDATGGVIAIGLTGALAEALKYLQGFSEKFGEWRGVLDALADGFIWLTTNSWVFAVAFGAWGLFQIRKMMQARIEDEEIIGRLKNANPD
Physico‐chemical
properties
protein length:299 AA
molecular weight:33716,2 Da
isoelectric point:8,77
hydropathy:-0,33
Representative Protein Details
Accession
i4nR
Protein name
i4nR
Sequence length
394 AA
Molecular weight
44337,90750 Da
Isoelectric point
8,63536
Sequence
MILTGLKYWSLNVLISLDQLLNTIFLGAPDETISSRAGKARARGDWWGCYLCLVLDWIDPRHCETSREDDEGSNAVLERLKRDKAHQGPFLMERPMSFNRSVFFREVRASFFSKGLSQSKVDGLNHLLDCWDANYSDYPTEFLAYCLATAYHETAHTFEPLREYGRGRGRRYGRPDPETGETYYGRGYVQLTWKYNYKKAGNKLGCNFVDNPDAVMRPDWAARILFTGCIEGWFTGKKLQHYINPKKSDYRQARRIVNGMDRASKIAGYARKFEHALDAATSEQIVTEKQLEQAGSRTIKNATGNKDAGGAVIGVGVLGVFAKILEYVKGLGEALGDWAGSIDTISNALTQLLTLWPVALLVCGGIVIWRNREIIKARISDEPLIGRLENATPD
Other Proteins in cluster: phalp2_21087
Total (incl. this protein): 12 Avg length: 264,7 Avg pI: 8,61

Protein ID Length (AA) pI
i4nR 394 8,63536
17aLL 299 8,76191
7Cvg6 354 5,49522
80Ybj 345 9,09580
hxY4 309 6,33592
W6AQZ3 210 9,38842
A0A481S282 201 9,34123
A0A873WKL4 201 9,40273
A0AA50KK63 188 9,34535
A0AA50Q5R6 188 9,34535
A0AA50Q9L7 188 9,34535
Similar Clusters

No similar clusters were found for representative i4nR.

Domains

Domains
Unannotated
GH19
Disordered region
Representative sequence (used for alignment): i4nR (394 AA)
Member sequence: 8n1sx (299 AA)
1 394 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00182

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (i4nR) rather than this protein.
PDB ID
i4nR
Method AlphaFoldv2
Resolution 74.91
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50