Chemistry:Linusorb

From HandWiki
Short description: Type of peptide

}}

Linusorbs are small biologically active cyclic peptides synthesized in flaxseed (Linum usitatissimum) from two ribosome-derived precursors. The name is derived from Linum usitatissimum orbitide.

The molecules are very stable, not easily digested and stiff.[1]

Nomenclature for Linusorb: [1–#-NαC],[#-Xaa,#-Xaa]-linusorb #

The International Union of Pure and Applied Chemistry (IUPAC) Nomenclature for Natural Products supports the use of taxonomic names for the development of compound trivial names, while UniProt utilizes standard organism names as well as organism mnemonics in the naming of proteins. Linusorb product name according to Shim et al. (2015).[2]

Linkage [1–#-NαC] Occurs between amino acid 1 and amino acid “#” through the α-amino group that a N-C cyclization of the core peptide. The en dash (–) is used and placed in square brackets.

Modifications [#-Xaa,#-Xaa] The notation utilizes a prefix code of 3, 4 characters. Specifies the position and type of modified amino acids in the peptide. Each modified amino acid is identified by its position in the core peptide sequence using abbreviations found in UniProt and IUPAC. A comma separates multiple post-translational modifications.

Taxonomic name abbreviation -linus The first three letters of the genus name and first two letters of the species name are used to identify the origin of the orbitide. Based on the list of species maintained by UniProt (http://www.uniprot.org/docs/speclist).

Common suffix (-orb) Short for orbitide.

Amino acid residue numbering (#) Residue numbering begins at the N-terminal-most amino acid residue of the orbitide, based on DNA sequence. Notation for any post-translational modifications start with the lowest number and proceed to the highest number. Where multiple identical modifications are present, they are grouped together as illustrated for Met groups in [1–8-NαC],[1,3-(Rs,Ss)-MetO]-linusorb A1 (Table 1 and Figure 1; formerly cyclolinopeptide G, CLG). This more descriptive name for CLG reflects the complexity of fully and appropriately naming this compound as it contains the chiral amino acid methionine S-oxide. Clearly, what was formerly recognized as CLG is actually four distinct compounds. New full names for all “cyclolinopeptide” variants are suggested in Table 1.

Figure 1. Structures of Linusorbs A−E

Table 1. Elements of Linusorb Names

Proposed linusorb namea Literature nameb (code) Amino acid sequence (NαC-)c Gene
[1–8-NαC]-linusorb A1 CLM (1) Met-Leu-Met-Pro-Phe-Phe-Trp-Ile AFSQ01025165.1
[1–8-NαC],[1-(Rs)-MetO]-linusorb A1 [(Rs)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Ss)-MetO]-linusorb A1 [(Ss)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Rs,Ss)-MetO]-linusorb A1 CLH (2) [(Rs,Ss)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[3-(Rs)-MetO]-linusorb A1 Met-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[3-(Ss)-MetO]-linusorb A1 Met-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[3-(Rs,Ss)-MetO]-linusorb A1 CLN (3) Met-Leu-[(Rs,Ss)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1,3-(Rs,Rs)-MetO]-linusorb A1 [(Rs)-MetO]-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1,3-(Ss,Ss)-MetO]-linusorb A1 [(Ss)-MetO]-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Rs)-MetO,3-(Ss)-MetO]-linusorb A1 [(Rs)-MetO]-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Ss)-MetO,3-(Rs)-MetO]-linusorb A1 [(Ss)-MetO]-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1,3-(Rs,Ss)-MetO]-linusorb A1 CLG (4) [(Rs,Ss)-MetO]-Leu-[(Rs,Ss)-MetO]-Pro-Phe-Phe-Trp-Ile
[1–8-NαC]-linusorb A2 CLO (5) Met-Leu-Leu-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Rs)-MetO]-linusorb A2 [(Rs)-MetO]-Leu-Leu-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Ss)-MetO]-linusorb A2 [(Ss)-MetO]-Leu-Leu-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-(Rs,Ss)-MetO]-linusorb A2 CLD (6) [(Rs,Ss)-MetO]-Leu-Leu-Pro-Phe-Phe-Trp-Ile
[1–8-NαC],[1-MetO2]-linusorb A2 1-Msn-CLD (7) MetO2-Leu-Leu-Pro-Phe-Phe-Trp-Ile
[1–8-NαC]-linusorb A3 CLL (8) Met-Leu-Met-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Rs)-MetO]-linusorb A3 [(Rs)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Ss)-MetO]-linusorb A3 [(Ss)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Rs,Ss)-MetO]-linusorb A3 [1–8-NαC],[1-MetO]-CLF (9) [(Rs,Ss)-MetO]-Leu-Met-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[3-(Rs)-MetO]-linusorb A3 Met-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[3-(Ss)-MetO]-linusorb A3 Met-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[3-(Rs,Ss)-MetO]-linusorb A3 CLI (10) Met-Leu-[(Rs,Ss)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Rs)-MetO,3-(Rs)-MetO]-linusorb A3 [(Rs)-MetO]-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Ss)-MetO,3-(Ss)-MetO]-linusorb A3 [(Ss)-MetO]-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Rs)-MetO,3-(Ss)-MetO]-linusorb A3 [(Rs)-MetO]-Leu-[(Ss)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1-(Ss)-MetO,3-(Rs)-MetO]-linusorb A3 [(Ss)-MetO]-Leu-[(Rs)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC],[1,3-(Rs,Ss)-MetO]-linusorb A3 CLF (11) [(Rs,Ss)-MetO]-Leu-[(Rs,Ss)-MetO]-Pro-Phe-Phe-Trp-Val
[1–8-NαC]-linusorb B1 CLP (12) Met-Leu-Val-Phe-Pro-Leu-Phe-Ile AFSQ01016651.1
[1–8-NαC],[1-(Rs)-MetO]-linusorb B1 [(Rs)-MetO]-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–8-NαC],[1-(Ss)-MetO]-linusorb B1 [(Ss)-MetO]-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–8-NαC],[1-(Rs,Ss)-MetO]-linusorb B1 CLE (13) [(Rs,Ss)-MetO]-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–8-NαC],[1-MetO2]-linusorbB1 CLJ (14) MetO2-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–9-NαC]-linusorb B2 CLB (15) Met-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–9-NαC],[1-(Rs)-MetO]-linusorb B2 [(Rs)-MetO]-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–9-NαC],[1-(Ss)-MetO]-linusorb B2 [(Ss)-MetO]-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–9-NαC],[1-(Rs,Ss)-MetO]-linusorb B2 CLC (16) [(Rs,Ss)-MetO]-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–9-NαC],[1-MetO2]-linusorb B2 CLK (17) MetO2-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–9-NαC]-linusorb B3 CLA (18) Ile-Leu-Val-Pro-Pro-Phe-Phe-Leu-Ile
[1–9-NαC]-linusorb C1 [1–9-NαC]-CLQ (19) Met-Leu-Lys-Pro-Phe-Phe-Phe-Trp-Ile Gened
[1–9-NαC],[1-(Rs,Ss)-MetO]-linusorb C1 [1–9-NαC],[1-MetO]-CLQ (20) [(Rs,Ss)-MetO]-Leu-Lys-Pro-Phe-Phe-Phe-Trp-Ile
[1–9-NαC]-linusorb D1 [1–9-NαC]-CLS (21) Gly-Ile-Pro-Pro-Phe-Trp-Leu-Thr-Leu Genese
[1–9-NαC]-linusorb E1 [1–9-NαC]-CLX (22) Gly-Met-Leu-Val-Phe-Pro-Leu-Phe-Ile Genef
[1–9-NαC],[2-(Rs,Ss)-MetO]-linusorb E1 [1–9-NαC],[2-MetO]-CLX (23) Gly-[(Rs,Ss)-MetO]-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–9-NαC],[2-MetO2]-linusorb E1 (24) Gly-MetO2-Leu-Val-Phe-Pro-Leu-Phe-Ile
[1–10-NαC]-linusorb E2 [1–10-NαC]-CLV (25) Gly-Met-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–10-NαC],[2-(Rs,Ss)-MetO]-linusorb E2 [1–10-NαC],[2-MetO]-CLV (26) Gly-[(Rs,Ss)-MetO]-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–10-NαC],[2-MetO2]-linusorb E2 (27) Gly-MetO2-Leu-Ile-Pro-Pro-Phe-Phe-Val-Ile
[1–10-NαC]-linusorb E3 [1–10-NαC]-CLU (28) Gly-Ile-Leu-Val-Pro-Pro-Phe-Phe-Leu-Ile

Notes

aProposed orbitide name: [1–#-NαC], linkage occurs between amino acid 1 and amino acid “#” through the α-amino group that a N-C cyclization of the core peptide. Use the en dash (–) as in ranges and place the square brackets; [#-Xaa,#-Xaa], methionine S-oxide has a chiral sulfur. Two diastereomers are designated as Ss and Rs. Identical amino acids substituents are numbered and grouped; lin-, Genus name is recognized by three letters or from UniProt list; -us-, species name is recognized by 2 letters or name from UniProt list; -orb, common suffix for orbitide.

bName used in first literature description.

cThe methionine residues of amino acid sequences are highlighted in Figure 1. Abbreviations of the methionine residues are MetO for methionine S-oxide and MetO2 for methionine S,S-dioxide.

dAFSQ01016651.1.

eAFSQ01011783.1 and AFSQ01009065.1.

fAFSQ01016651.1.

References

  1. YouTube
  2. Shim, YY, LW Young, PG Arnison, E Gilding, MJT Reaney (2015) Proposed systematic nomenclature for orbitides. Journal of Natural Products, 78(4), 645-652, doi:10.1021/np500802p