Protein

Protein accession
23FN9 [EnVhog]
Representative
40EaA
Source
EnVhog (cluster: phalp2_25237)
Protein name
23FN9
Lysin probability
99%
PhaLP type
endolysin
Probability: 93% (predicted by ML model)
Protein sequence
MFTGYFPKFTLSSGKSDDRHGDGLAMICNGQTMLVDGFEGGEPTKRMITWLKSNGVKKIDVAVLTHYHYDHYNGLLQIEADPDLHIDLVYCYDPRTLLHGVDNSSNGRSVKEDIANAYAWIRKMQSYGTRVKWIDRGDVLKFGDITWRIFRDQPKAFTHLDKGNAYAFVNDGSLILWSPEIELLLGGDGPNDLESAIRWFGAKVSGYDVSHHGNNCSQHNALALKSAGCVVAWQSCIERSGAGTTGWTEYGSRRVKQQGVPVWQQDQDILITAGAGKITFTQGGKTMSKSVPYKGTAVPAKGSWVQDSKGWYYRKADGSCAYGWALLPWSKGTNWFYFNDAGYCVYGWQYLKWSGGSHWFYFDEASAAMRTGWVYDNGSWYYLDPKDGYMHTGWIDWKGKKCYLEPVSGRNQGHAYRNEKAIIGGRYYSFDNDCYATEISAVTASSMVKPGAKVIDVSEFQPENIDWSRIKAGGYAVIARIGLRGSRPHTDRYRKVGYDYHFKQYINGIIAAGIPYSVYFFPTPMSDQEADEEANWIIMNVAGLDLSMPLWLDSEKVPGGVANDISTADRTRYLKRITDKLVAAGIPCGIYASTSWLQHQINMGQLQQQVRDNTWCAQYSTKCTYDGVYAMWQYSSNAHVDGINEKVDISEVKQAFNMSCRKTAPKTDAMAKDNVRIFPTTDPVKISNSGSDEHGNYKGGAAGDNTGKEWYIRDWYSRPWNCVLRHPDPAVRACIADLATKAANNNKIGYDQYQRQTYWIELQKVGYDPSKITTACEADCSAGVIANVKAAGHLLGRKELQGITCTYTGNMRSGLKAAGFACLTDSKYINGSSYLVAGDILLNDAHHTATAVTNGINSGNGSVTPASMPLIKKGSKGSAVLQLQKILNSKGYKLSEDSDFGPATEAAVKAFQRANHLEVDGEVGPLTWAALLK
Physico‐chemical
properties
protein length:933 AA
molecular weight:103446,1 Da
isoelectric point:8,18
hydropathy:-0,43
Representative Protein Details
Accession
40EaA
Protein name
40EaA
Sequence length
983 AA
Molecular weight
110079,72770 Da
Isoelectric point
7,91802
Sequence
MAIDAFFPKITLPKGETDIRKGEGLGIKYTSENGKVIVMVVDAFEGGEPTRGLIDWLKSIDVKQIDLAVATHAHGDHFGGFYDIAEAGIKIKDFRCYHIDSIREGNSESRKDSDNLLKLIRWLQARGTRVLFVDHGDVLEFGDISWHIYRKQPKGPAAKDDPNAWSYVNDGSLVLCSPELSGIIFGDGPAAQKEAIAYFQKKFGKTKFLIWVTVSHHGGHFSMSNAQSARNAGCEFAYESCVERRGPGKEEWTEFGARRLIQNGVTVWMQDENIYIHAAAGKITFKQGNKTLTYDIPYQGKESKVTGKWEYGTKGWWYKYSNGGYATGWKQIVYKGEPCWFLFDDDGWMLTGWRKDDGKWYFLDYKTGVMLKGWHKLPHGPNNVEDWYLLNKSGEMLSGWQWSTKDGKTGWYYLDPDNGMRTGWIYDDGEWYCLGNDGRMLTGWVTYKGRKCYLEPLSDKTHVQGVCYRSRTAVIDGKSYKFDKDGYAEEIAVGGGGKAELNGCDVASYQAGIDFAKMTTTDFAIIKGTQGTWYVNPYADIQYSGAKAAGKLLGMYHYAEGKDPIAEAQYFIRKVGSRVGSCILALDWEGHDNSKFNSDAEVAWVLKFAMEVYRLTKVHIFLYMSKSVTRRRNWAEVAKDVRLWCAQYANQEHTDYKSNPWTDNGGFGAWKYDTIRQYSSKGRVTGYGKDLDINRAYMSRAEWLAAAAGKNTVVATAAKPKTQWSAYVSMTTSPVKISNSGSDENGKYKGGKAGDQTGKEWRIRDWYNYPWNCVLRHPLAEVRACLATMAVKAAENDNIGYDQLQRDDYGIELAKNDYDPSKIKKPVESDCSKGVIDNTKGTGHTLGIPELQSIKATYTGNMRAAYKAAGFYVLTESKYLTSSDYLLAGDILLNDKHHTCTVITNGSKSGDTVVMPLVKKGSTGAAVEQLQEMLIAQGYSCGKWGADGDFGNDTLQAVKSYQRDHGLDVDGEVGPMTWGVLFK
Other Proteins in cluster: phalp2_25237
Total (incl. this protein): 18 Avg length: 920,2 Avg pI: 7,47

Protein ID Length (AA) pI
40EaA 983 7,91802
1cLZi 925 8,07139
1cMec 970 6,67144
1cNub 944 8,42661
23qEI 928 7,71711
23vwv 839 6,85446
23ysT 794 7,29019
23zUz 988 6,24828
38GEP 935 7,71353
3TNAy 880 5,83619
3VGNn 929 8,31592
3VLtx 930 7,73626
3ielO 907 7,48032
409eA 841 6,14392
7q1lF 981 8,30754
7q23E 935 7,21266
8130e 921 8,41952
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15776
3WJi7
6 27,8% 1032 2.564E-170
2 phalp2_12980
3ibDa
3 27,2% 801 2.351E-137
3 phalp2_24350
3WE0c
3 25,0% 706 5.705E-86

Domains

Domains
Unannotated
Choline_bind_3
GH25
Unannotated
PG_1
Representative sequence (used for alignment): 40EaA (983 AA)
Member sequence: 23FN9 (933 AA)
1 983 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF01471, PF19127

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40EaA) rather than this protein.
PDB ID
40EaA
Method AlphaFoldv2
Resolution 81.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50