Patent application title: NPP1 FUSION PROTEINS
Inventors:
IPC8 Class: AC12N916FI
USPC Class:
1 1
Class name:
Publication date: 2017-05-25
Patent application number: 20170145393
Abstract:
The present invention provides a novel fusion polypeptide containing a
catalytic domain of NPP1 fused to a targeting moiety, nucleic acids
encoding the fusion polypeptide, a vector containing the nucleic acid
integrated thereinto, a host cell transformed with the vector and
pharmaceutical compositions comprising the fusion polypeptide.Claims:
1. An isolated polypeptide comprising a targeting moiety and a C-terminus
catalytic domain of NPP1.
2. The polypeptide of claim 1, wherein said catalytic domain is a pyrophosphatase or phosphodiesterase domain of NPP1.
3. The polypeptide of claim 1, wherein said polypeptide is a recombinant protein.
4. The polypeptide of claim 1, wherein said targeting moiety is chemically linked to said catalytic domain of NPP1.
5. The polypeptide of claim 1, wherein said catalytic domain of NPP1 comprises A205 through L591 of SEQ ID NO:1.
6. The polypeptide of claim 5, wherein said catalytic domain of NPP1 comprises P99 through L591 of SEQ ID NO:1.
7. The polypeptide of claim 5, wherein said catalytic domain of NPP1 comprises A205 through D925 of SEQ ID NO: 1.
8. The polypeptide of claim 5, wherein said catalytic domain of NPP1 comprises P99 through D925 of SEQ ID NO: 1.
9. The polypeptide of claim 8, wherein said polypeptide is a consecutive fragment of the polypeptide having the amino acid sequence of P99 through D925 of SEQ ID NO:1 and containing at least the amino acid resides A205 through L591 of SEQ ID NO:1.
10. The polypeptide of claim 5, wherein said targeting moiety is a peptide comprising at least four negatively charged amino acid residues.
11. The polypeptide of claim 5, wherein said targeting moiety is a peptide comprising between about five and about fifteen negatively charged amino acid residues.
12. The polypeptide of claim 5, wherein said negatively charged amino acid residues comprise at least one aspartic acid or glutamic acid.
13. The polypeptide of claim 5, wherein said negatively charged amino acid residues comprise at least four aspartic acid residues.
14. The polypeptide of claim 5, wherein said negatively charged amino acid residues comprise at least four glutamic acid residues.
15. The polypeptide of claim 5, wherein said negatively charged amino acid residues comprise eight consecutive aspartic acid residues.
16. The polypeptide of claim 5, wherein said targeting moiety is fused to either N-terminus or C-terminus of said catalytic domain of NPP1.
17. The polypeptide of claim 5, wherein said targeting moiety is fused to N-terminus and C-terminus of said catalytic domain of NPP1.
18. The polypeptide of claim 12, wherein said polypeptide further comprises a polypeptide linker between said targeting moiety and said catalytic domain of NPP1.
19. The polypeptide of claim 1, wherein said polypeptide further comprises an Fc region of an immunoglobulin.
20. The polypeptide of claim 1, wherein said polypeptide further comprises a signal peptide.
21. The polypeptide of claim 1, wherein said polypeptide forms a homodimer.
22. The polypeptide of claim 1, wherein said polypeptide protein is a monomer.
23. An isolated nucleic acid encoding the polypeptide of claim 1.
24. A replication or expression vector carrying the isolated nucleic acid of claim 23.
25. A host cell transformed with the replication or expression vector according to claim 23.
26. The host cell of claim 25, wherein said host cell is selected from the group consisting of CHO cell, HEK293 cell, or COS cell.
27. The host cell of claim 20, wherein said host cell is an avian tumor cell.
28. A transgenic animal producing the polypeptide of claim 1.
29. The transgenic animal of claim 28, wherein said transgenic animal is mammalian or avian.
30. A method producing said polypeptide of claim 1 by an avian cell comprising culturing an avian cell transfected with at least one expression vector comprising a transcription unit having a nucleotide sequence encoding said polypeptide of claim 1 operably linked to a promoter and a transcriptional terminator, and wherein the cultured avian cell produces said polypeptide.
31. A method for producing said polypeptide of claim 1 in an egg of a chicken, the method comprising: (a) providing an avian leukosis viral vector comprising a nucleic acid sequence encoding said polypeptide, and a promoter operably linked to said sequence, wherein said promoter drives expression of the encoding sequence in the chicken oviduct; (b) introducing said vector into chicken stage X embryonic cells; (c) incubating said embryonic cells under conditions conducive to hatching live chicks; (d) nurturing growth of a mature chimeric chicken from said chicks; (e) mating said chimeric chicken, either naturally or via artificial insemination, with a non-transgenic chicken; (f) identifying a transgenic chicken by screening the progeny of step (e) for germ line incorporation of the protein encoding sequence; and (g) mating the transgenic progeny with non-transgenic chickens to produce eggs containing the exogenous protein.
32. A method of treating a disorder in a subject in need thereof comprising administering an effective amount of a pharmaceutical composition comprising the polypeptide of claim 1.
33. The method of claim 32, wherein said subject is a mammal.
34. The method of claim 33, wherein said mammal is a human.
35. The method of claim 32, wherein said disorder is arterial calcification, insulin resistance, hypophosphatemic rickets, or ossification of posterior longitudinal ligament of spine.
36. The method of claim 35, wherein said disorder is arterial calcification.
37. The method of claim 36, wherein the treatment reduces calcification in the artery.
38. The method of claim 36, wherein the arterial calcification is generalized arterial calcification of infancy.
Description:
RELATED APPLICATIONS
[0001] This application is a continuation of application Ser. No. 14/004,294, filed on Nov. 14, 2013, which is a U.S. National Stage of International Patent Application No. PCT/US2011/051858, filed Sep. 15, 2011 and is a continuation-in-part of International Patent Application No. PCT/US2011/028233, filed Mar. 11, 2011. The entire teachings of the above application are incorporated herein by reference.
SEQUENCE LISTING
[0002] The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jan. 27, 2017, is named 0812450070.text and is 109,060 bytes in size.
BACKGROUND OF THE INVENTION
[0003] Ectonucleotide pyrophosphatase/phosphodiesterase 1 (NPP1/ENPP1/PC-1) is a type II transmembrane glycoprotein that forms a homodimer. The protein cleaves a variety of substrates, including phosphodiester bonds of nucleotides and nucleotide sugars and pyrophosphate bonds of nucleotides and nucleotide sugars. NPP1 protein functions to hydrolyze nucleoside 5' triphosphatase to either corresponding monophosphates and also hydrolyzes diadenosine polyphosphates. Mutations in the NPP1 gene have been associated with idiopathic infantile arterial calcification (IIAC), insulin resistance, hypophosphatemic rickets, and ossification of the posterior longitudinal ligament of the spine.
[0004] IIAC, a rare autosomal recessive and nearly always fatal disorder, is characterized by calcification of the internal elastic lamina of muscular arteries and stenosis due to myointimal proliferation. There are more than 160 cases of IIAC that have been reported world-wide. The symptoms of the disease most often appear by early infancy, and the disease is lethal by 6 months of age, generally because of ischemic cardiomyopathy, and other complications of obstructive arteriopathy including renal artery stenosis. In more than a dozen reported cases of IIAC, periarticular calcifications of large joints also developed in infancy. Rutsch et al. (2003) reported that mutations in ENPP1 are associated with IIAC characterized by the spontaneous periarticular and aortic calcifications in early life and systemic lowering of nucleotide pyrophosphatase/phosphodiesterase activity in the affected individuals.
[0005] Although defects in the NPP1 protein have been implicated in such serious disease as IIAC, there is no treatment available in the art for those who are affected by the disease. Therefore, a dire need exists for an effective and safe composition, formulation and medicament for the treatment of IIAC, insulin resistance, hypophosphatemic rickets, and ossification of the posterior longitudinal ligament of the spine.
SUMMARY OF THE INVENTION
[0006] The present invention includes fusion proteins of truncated domains of NPP1 (i.e., an NPP1 component) fused to a targeting moiety. The targeting moiety functions to enhance the efficiency in targeting the NPP1 fusion protein to a site clinical or biological importance (e.g., site of calcification in a subject that needs lowering of calcification). Without wishing to limit the invention to any particular theory or mechanism of operation it is believed that the NPP1 component function to inhibit calcification by enhancing the formation of pyrophosphate (PPi) and/or by cleaving pyrophosphate to produce soluble phosphate (Pi) and/or by increasing the availability of adenosine monophosphate (AMP) and/or adenosine. It is contemplated that the targeting moiety can be attached to the N-terminus and/or the C-terminus of the NPP1 component at any useful position. Additionally, the NPP1 fusion protein disclosed herein can also include one or more of Fc fragment, PEG, polypeptide linker or other additional polypeptides to enhance the enzymatic activity, stability or targeting.
[0007] The fusion proteins of the invention can be used to treat a wide variety of conditions in a subject. Any condition that can be beneficially treated by the administering of a fusion protein of the invention is included within the scope of the invention. For example, treatment of conditions that can be improved by reducing and/or eliminating one or more calcification structures and/or preventing calcification structures from forming in a subject such as a mammal, for example, a human patient is within the scope of the invention. Conditions such as arterial blockage are contemplated for treatment by employing fusion proteins of the invention. In one particularly useful embodiment, the condition to be treated is generalized arterial calcification of infancy also referred to as idiopathic arterial calcification of infancy and arterial media calcification of infancy. Conditions such as insulin resistance, hypophosphatemic rickets, and ossification of the posterior longitudinal ligament of the spine are also contemplated for treatment.
[0008] Fusion proteins of the invention can be produced in any useful protein expression system including, without limitation, cell culture (e.g., CHO cells, COS cells, HEK203), bacteria such as Escherichia coli (E. coli) and transgenic animals, including, but no limited to, mammals and avians (e.g., chickens, quail, duck and turkey).
[0009] The manufacture of pharmaceutical compositions (or pharmaceutical formulations) is well known in the art and such pharmaceutical compositions are contemplated for use in accordance with fusion proteins of the invention.
[0010] Generally, the dosage of fusion protein administered to a subject will vary depending upon known factors such as age, health and weight of the recipient, type of concurrent treatment, frequency of treatment, and the like. Usually, a dosage of active ingredient (i.e., fusion protein) can be between about 0.0001 and about 50 milligrams per kilogram of body weight. Precise dosage, frequency of administration and time span of treatment can be determined by a physician skilled in the art of administration of therapeutic proteins.
BRIEF DESCRIPTION OF THE DRAWINGS
[0011] FIG. 1 illustrates the amino acid sequence of wild-type NPP1 protein. The cytosolic and transmembrane regions are underlined. The potential N-glycosylation sites are in bold. PSCAKE in italics is the start of soluble NPP1 which includes the cysteine rich region.
[0012] FIG. 2 illustrates the amino acid sequence of the catalytic domain(s) of NPP1 protein having no target moiety attached ("sssNPP1").
[0013] FIG. 3 illustrates the amino acid sequence of TAGsssNPP1. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the sssNPP1. The signal peptide is underlined, and the targeting moiety is indicated in bold.
[0014] FIG. 4 illustrates the amino acid sequence of TAGsssNPP1 that contains the targeting moiety of ten consecutive aspartic acid residues fused to the C-terminus of the sssNPP1. The signal peptide is underlined, and the targeting moiety is indicated in bold.
[0015] FIG. 5 illustrates the nucleic acid sequence of TAGssNPP1 fusion protein. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the ssNPP1.
[0016] FIG. 6 illustrates the amino acid sequence of TAGssNPP1 fusion protein. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the ssNPP1. The signal peptide is underlined, and the targeting moiety is indicated in bold.
[0017] FIG. 7 illustrates the nucleic acid sequence of ssNPP1.
[0018] FIG. 8 illustrates the amino acid sequence of ssNPP1. The signal peptide is underlined.
[0019] FIG. 9 illustrates the nucleic acid sequence of sNPP1.
[0020] FIG. 10 illustrates the amino acid sequence of sNPP1. The signal peptide is underlined.
[0021] FIG. 11 illustrates the nucleic acid sequence of TAGsNPP1. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the sNPP1.
[0022] FIG. 12 illustrates the amino acid sequence of TAGsNPP1. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the ssNPP1. The signal peptide is underlined, and the targeting moiety is indicated in bold.
[0023] FIG. 13 illustrates the nucleic acid sequence of TAGsNPP1. The targeting moiety of eight consecutive aspartic acid residues is fused to the C-terminus of the sNPP1.
[0024] FIG. 14 illustrates the amino acid sequence of TAGsNPP1. The targeting moiety of eight consecutive aspartic acid residues is fused to the N-terminus of the sNPP1. The signal peptide is underlined, and the targeting moiety is indicated in bold.
[0025] FIG. 15 illustrates the amino acid sequence of a linker peptide.
[0026] FIG. 16 illustrates the amino acid sequence of an immunoglobulin Fc segment.
[0027] FIG. 17 illustrates the amino acid sequence of TAGsssNPP1 which contains the targeting moiety of eight consecutive aspartic acid residues fused to the N-terminus of the sssNPP1 via a peptide linker. The Fc segment is fused to N-terminus of the target moiety. The signal peptide is underlined and the targeting moiety is indicated in bold. The peptide linker is in italics.
[0028] FIG. 18 illustrates the amino acid sequence of TAGsssNPP1 which contains the targeting moiety of eight consecutive aspartic acid residues fused to the C-terminus of the sssNPP1 via a peptide linker. The Fc segment is fused to N-terminus of the sssNPP1. The signal peptide is underlined and the targeting moiety is indicated in bold. The peptide linker is in italics.
[0029] FIG. 19 is a schematic representation of an expression vector (i.e., pTT22) containing a TAGsNPP1 construct. The targeting moiety of eight consecutive aspartic acid residues fused to the C-terminus of the sNPP1.
[0030] FIG. 20 illustrates Western blot analysis of TAGsNPP1. Reducing condition; NR, non-reducing condition.
[0031] FIG. 21 demonstrates the enzymatic activity of TAGsNPP1 produced and isolated from HEK293 cells.
[0032] FIGS. 22A-22C illustrate schematics of TAGNPP1 fusion protein constructs described herein.
[0033] FIG. 23 demonstrates calicification levels of human aortic smooth muscle cells treated with soluble sNPP1 (WT), TAGsNPP1 (D8 fused to sNPP1 C-terminus), and sNPP1-Fc.
[0034] FIG. 24 depicts a schematic representation of an expression vector (i.e., pTT22) containing a TAGsNPP1 construct. The targeting moiety of eight consecutive aspartic acid residues (D8) fused to the C-terminus of the sNPP1.
[0035] FIG. 25 depicts the nucleic acid sequence of a TAGsNPP1 fusion protein.
[0036] FIG. 26 depicts the amino acid sequence of a TAGsNPP1 fusion protein.
[0037] FIG. 27 depicts a schematic representation of an expression vector (i.e., pTT22) containing a sNPP1-Fc fusion construct.
[0038] FIG. 28 depicts the nucleic acid sequence of a sNPP1-Fc fusion protein.
[0039] FIG. 29 depicts the amino acid sequence of a sNPP1-Fc fusion protein.
DETAILED DESCRIPTION OF THE INVENTION
[0040] The present invention provides novel human NPP1 fusion proteins that are soluble and contain truncated and biologically active domain(s) of NPP1 (i.e., NPP1 components that contain at least one extracellular catalytic domain of naturally occurring NPP1 for the pyrophosphatase and/or phosphodiesterase activity) and one or more targeting moieties (i.e., "TAG"). The NPP1 fusion proteins of the present invention comprise at least the NPP1 domain essential to carry out the pyrophosphatase and/or phosphodiesterase activity. Accordingly, the invention features isolated fusion proteins comprising the amino acid residues A205 through L591 of SEQ ID NO:1 fused to one or more targeting moieties. The targeting moiety can be recombinantly fused or chemically bonded (e.g., covalent bond, ionic bond, hydrophobic bond and Van der Waals force) to the NPP1 component by methods well known in the art and direct the NPP1 component to a certain target site where the attached NPP1 component will have a desirable effect (e.g., catalysis of a reaction such as solubilizing a substrate such as PPi or preventing the formation of a substrate such as PPi) in a subject to which the fusion protein of the present invention is administered.
TAGNPP1s
[0041] All NPP1 fusion proteins ("TAGNPP1s") of the present invention have the N-terminal cytosolic and the transmembrane domains of the naturally occurring human NPP1 removed. Optionally, TAGNPP1s fusion proteins of the present invention can also contain C-terminal truncation of wild-type NPP1 in various lengths. The amino acid sequence of full-length wild-type NPP1 is set forth in SEQ ID NO: 1.
[0042] In one embodiment, the fusion protein contains a polypeptide comprising the amino acid residues A205 through L591 of SEQ ID NO:1 ("sssNPP1") fused a TAG on either the N- or C-terminus of the polypeptide ("TAGsssNPP1"). In one embodiment, the fusion protein comprises a polypeptide comprising the amino acid residues A205 through D925 of SEQ ID NO:1 ("ssNPP1") fused to a TAG on either the N- or C-terminus ("TAGssNPP1"). In one embodiment, the fusion protein comprises a polypeptide comprising the amino acid residues P99 through D925 of SEQ ID NO:1 ("sNPP1") fused to a TAG on either the N- or C-terminus of the polypeptide ("TAGsNPP1"). Also contemplated is any consecutive fragment of sNPP1 that comprises at least the amino acid resides A205 through L591 of SEQ ID NO:1 and the polypeptide fragment is fused to a TAG on either the N- or C-terminus.
[0043] When expressed in a cell culture or transgenic animal, the TAGNPP1 fusion proteins can further comprise a signal peptide (or leader sequence) at its N-terminus. The signal peptide co-translationally or post-translationally directs transport of the TAGNPP1 fusion proteins through the subcellular organelles of the cell expressing the TAGNPP1 fusion proteins and, thereby determining the post-translational modification of the TAGNPP1 fusion proteins. It is to be understood that because the signal peptide is cleaved at the co-translational or post-translational stage of the fusion protein, the TAGNNP1 fusion proteins are generally devoid of the signal peptide when once secreted and isolated. Accordingly, in the embodiments that are directed to the nucleic acid sequences encoding the TAGNPP1 fusion proteins are described, the leader sequences are also contemplated as used in the present invention. For example, the nucleotide sequence set forth in SEQ ID NO:2 contains an example of the leader sequence for TAGNNP1 at its 5' end.
[0044] Each of the fusion proteins disclosed herein is contemplated with one or more targeting moieties ("TAG"). The TAG component according to the present invention comprises four or more negatively charged amino acids such as aspartic acid and glutamic acid. The TAG component can be a stretch of negatively charged amino acid residues, for example, aspartic acids and/or glutamic acids that are between about 4 and about 20 amino acid residues in length. The TAG can be fused to either the N- or C-terminus of the NPP1 component. The TAG can be also fused to both N- and C-termini of the NPP1 component. Accordingly, the amino acid sequence of the TAGsNPP1 fusion protein, for example, includes PSCAKE through the C-terminus end of the NPP1 component and one or more targeting moieties (e.g., polyglutamic acid tag or polyaspartic acid tag) at the N- and/or C-terminus end of the NPP1 component. The fusion protein comprising the NPP1 component having the TAG fused to the C-terminus is a particularly useful embodiment. In one very specific embodiment, TAG having a stretch of eight aspartic acids is employed as can be seen in the exemplary sequences for TAGsNPP1 and TAGssNPP1 in Figures, though any useful number of negatively charged amino acid residues (e.g., 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18) can be used in accordance with the invention. The TAG component is indicated as "A" in Figures.
[0045] The invention also encompasses polynucleotides which encode various TAGNPP1 fusion proteins described herein. Accordingly, any nucleic acid sequence which encodes the amino acid sequence of any TAGNPP1 fusion protein can be used to generate recombinant molecules which express the corresponding TAGNPP1 fusion protein. In a particular embodiment, the invention encompasses the polynucleotide comprising the nucleic acid sequence of SEQ ID NO:2 as shown in FIG. 2.
[0046] Within certain specific embodiments, the fusion protein comprises multiple polypeptides as described herein, or that comprises at least one polypeptide as described herein and an unrelated sequence. Certain preferred polypeptide can, for example, assist in dimerization and stability or minimize aggregation of the fusion proteins. For example, the additional polypeptide can be the Fc region of the immunoglobulin G1 to increase stability in serum. The use of the Fc segment is well known in the art and described in U.S. Pat. No. 7,902,151; and U.S. Pat. No. 7,858,297, the entire teachings of which are incorporated herein by reference in their entirety. The cysteine rich region of wild-type NPP1 (i.e., PSCAKE through NEPQCP; the amino acid sequence from P99 to P204 of SEQ ID NO:1) can be employed to facilitates dimerization of the TAGNPP1 fusion proteins.
[0047] In another embodiment, the polyethylene glycol (PEG) can be conjugated to the TAGNPP1 fusion proteins. Other polypeptides may be selected so as to minimize aggregation and immunogenicity, to increase the solubility of the protein, or to enable the protein to be targeted to desired sites of clinical or biological importance.
[0048] TAGNPP1 can be also fused or conjugated to an appropriate polypeptide linker or other sequence for ease of identification, synthesis, or purification of the fusion protein, or to better preserve the native structure of the NPP1 component which can enhance the activity and targeting of the TAGNPP1. Specifically, a peptide linker sequence can be employed to separate the first and the second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures. Such a peptide linker sequence is incorporated into the fusion protein between the NPP1 component and the TAG component using standard techniques well known in the art. Suitable peptide linker sequences may be chosen based on their ability to adopt a flexible extended conformation and their inability to adopt a secondary structure that could interact with functional portion on the NPP1, TAG or other secondary polypeptides described herein (e.g., Fc). Preferred peptide linker sequences contain Gly, His, Asn and Ser residues. The useful peptide linkers include, without limitation, poly-Gly, poly-His, poly-Asn, or poly-Ser. Other near neutral amino acids, such as Thr and Ala can be also used in the linker sequence Amino acid sequences which can be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., Proc. Natl. Acad Sci. USA 83:8258-8262, 1986; U.S. Pat. No. 4,935,233 and U.S. Pat. No. 4,751,180. The linker sequence may be from 1 to about 20 amino acid residues in length. Preferably, the polypeptide linker is between about 8 and about 12 amino acids in length. In a preferred embodiment, the peptide linker used in the invention is GGGGSGGGGS (SEQ ID NO:15), although any functional combination of Gly, Ser, His, or Asn can be employed.
[0049] Fusion proteins can also comprise a TAGNPP1 of the present invention together with an unrelated polypeptide. Preferably the unrelated polypeptide is capable of enhancing the targeting of the fusion protein to the site of clinical or biological importance (e.g., site of calcification). For example, peptides that have high affinity to the bone are described in U.S. Pat. No. 7,323,542, the entire teachings of which are incorporated herein by reference.
[0050] TAGNPP1 can be prepared using standard methods, including recombinant techniques or chemical conjugation well known in the art. Techniques useful for isolating and characterizing the nucleic acids and proteins of the present invention are well known to those of skill in the art and standard molecular biology and biochemical manuals can be consulted to select suitable protocols for use without undue experimentation. See, for example, Sambrook et al, 1989, "Molecular Cloning: A Laboratory Manual," 2nd ed., Cold Spring Harbor, the content of which is herein incorporated by reference in its entirety. Briefly, DNA sequences encoding the polypeptide components can be assembled separately, and ligated into an appropriate expression vector. For example, the 3' end of the DNA sequence encoding the NPP1 component is ligated, with or without a peptide linker, to the 5' end of a DNA sequence encoding the second polypeptide component such as TAG PEG, or Fc so that the reading frames of the sequences are in phase. This permits translation into a single fusion protein that retains the biological activity of both component polypeptides. The ligated DNA sequences are operably linked to suitable transcriptional or translational regulatory elements including a promoter. The regulatory elements responsible for expression of DNA are located only 5' to the DNA sequence encoding the first polypeptide such as the leader sequence encoding a signal peptide. Similarly, stop codons required to end translation and transcription termination signals are only present 3' to the DNA sequence encoding the second polypeptide.
[0051] The invention also encompasses TAGNPP1 variants. A preferred TAGNPP1 variant is one having 80%, 85%, 90%, 95% and more preferably 96% amino acid sequence identity to the amino acid sequence A205 through L591 of SEQ ID NO:1. A most preferred TAGNPP1 variant is one having at least 97% amino acid sequence identity to amino acid sequence A205 through L591 of SEQ ID NO:1.
[0052] The invention also relates to a polynucleotide sequence comprising the complement of SEQ ID NO:2 or variants thereof. In addition, the present invention also features polynucleotide sequences which hybridize under stringent conditions to SEQ ID NO:2 and whose the antisense sequence is 85%, 90%, 95%, 97%, 98%, or 99% identical to SEQ ID NO:2. Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe, as taught in Wahl, G. M. and S. L. Berger (1987; Methods Enzymol. 152:399-407) and Kimmel, A. R. (1987; Methods Enzymol. 152:507-511), and can be used at a defined stringency.
[0053] The invention additionally contemplates nucleic acid sequences encoding polypeptides, oligonucleotides, peptide nucleic acids (PNA), fragments, portions or antisense molecules thereof.
[0054] Although nucleotide sequences which encode TAGNPP1 and its variants are preferably capable of hybridizing to the nucleotide sequence of the TAGNPP1 under appropriately selected conditions of stringency, it can be advantageous to produce nucleotide sequences encoding TAGNPP1 or its derivatives possessing a substantially different codon usage. Codons can be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding TAGNPP1 and its derivatives without altering the encoded amino acid sequences include the production of RNA transcripts having more desirable properties, such as a greater half-life.
[0055] Altered nucleic acid sequences encoding TAGNPP1 which are encompassed by the invention include deletions, insertions, or substitutions of different nucleotides resulting in a polynucleotide that encodes the same or a functionally equivalent of TAGNPP1. The encoded protein can also contain deletions, insertions, or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent TAGNPP1. Deliberate amino acid substitutions can be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the biological activity of TAGNPP1 is retained. For example, positively charged amino acid residues include Lys and Arg; negatively charged amino acid residues include Asp and Glu; and amino acids with uncharged polar head groups having similar hydrophilicity can include Leu, Ile, and Val; Gly and Ala; Asp and Gln; Ser and Thr; Phe and Tyr.
[0056] Expression Vector
[0057] Methods which are well known to those skilled in the art can be used to construct expression vectors containing sequences encoding TAGNPP1 and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such techniques are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y., the teachings of which are incorporated herein by reference in its entirety.
[0058] A variety of expression vector/host systems can be utilized to contain and express sequences encoding TAGNPP1. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems (e.g., pTT22 vector).
[0059] The control elements or regulatory sequences can includes those non-translated regions of the vector-enhancers, promoters, 5' and 3' untranslated regions-which interact with host cellular proteins to carry out transcription and translation. Such elements can vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including, tissue-specific, constitutive and inducible promoters, can be used. For example, when cloning in bacterial systems, inducible promoters such as the hybrid lacZ promoter of the Bluescript.TM. phagemid (Stratagene, LaJolla, Calif.) or pSport1.TM. plasmid (Gibco BRL) and the like can be used. In mammalian cell systems, promoters from mammalian genes or from mammalian viruses are preferred. If it is necessary to generate a cell line that contains multiple copies of the sequence encoding TAGNPP1, vectors based on SV40 or EBV can be used with an appropriate selectable marker. When an avian expression system is used, suitable vectors for expression various TAGNPP1 constructs are described in U.S. Pat. No. 6,730,822; U.S. Pat. No. 6,825,396; U.S. Pat. No. 6,875,588; U.S. Pat. No. 7,294,507; U.S. Pat. No. 7,521,591; U.S. Pat. No. 7,534,929; and U.S. patent application Ser. No. 11/376,023, the entire teachings of which are incorporated herein by reference in their entirety. Briefly, when an avian expression system is employed to express TAGNPP1, suitable oviduct-specific promoters, for example, and without limitation, ovomucoid promoters, ovalbumin promoters, lysozyme promoters, conalbumin promoters, ovomucin promoters, ovotransferrin promoters and functional portions of each of these promoters are contemplated. Suitable non-specific promoters can include, for example and without limitation, cytomegalovirus (CMV) promoters, MDOT promoters and rous-sarcoma virus (RSV) promoters, murine leukemia virus (MLV) promoters, mouse mammary tumor virus (MMTV) promoters and SV40 promoters and functional portions of each of these promoters. Non-limiting examples of other promoters which can be useful in the present invention include, without limitation, Pol III promoters (for example, type 1, type 2 and type 3 Pol III promoters) such as H1 promoters, U6 promoters, tRNA promoters, RNase MPR promoters and functional portions of each of these promoters. Typically, functional terminator sequences are selected for use in the present invention in accordance with the promoter that is employed.
[0060] Host Cells
[0061] The present invention includes the production of soluble TAGNPP1 in a transgenic avian (e.g., transgenic chicken) system as is well known in the art, for example, in U.S. Pat. No. 7,534,929, the disclosure of which is incorporated in its entirety herein by reference. Production in the avian system (e.g., in the avian oviduct) of an NPP1 component with or without to a targeting moiety (e.g., ssNPP1, sNPP1, TAGsNPP1 and TAGssNPP1) is within the scope of the invention. Furthermore, TAGNPP1 produced in any useful protein expression system including, without limitation, transgenic avians, transgenic mammal, cell culture (e.g., CHO cells, HEK293 cells, and COS cells), bacteria such as E. coli, transgenic animals such as mammals and avians (e.g., chickens, quail, duck and turkey) and in plant systems including duck weed, is contemplated herein.
[0062] A host cell strain can be chosen for its ability to modulate the expression of the inserted sequences or to process the expressed TAGNPP1s in the desired fashion. Such modifications of the polypeptide of TAGNNP1 include, without limitation, acetylation, carboxylation, sialylation, glycosylation, phosphorylation, lipidation, and acylation. Different host cells such as CHO, COS, HeLa, MDCK, HEK293, and W138, which have specific cellular machinery and characteristic mechanisms for such post-translational activities, can be chosen to ensure the correct modification and processing of the fusion protein of the present invention. Avian tumor cell line is also contemplated as a host cell for expressing the polypeptide of the present invention. Examples of useful avian cell lines (e.g., an avian oviduct tumor cell line) which can be employed in the present invention are described in U.S. Pat. Publication No. 2009/0253176, the entire teachings of which are incorporated herein by reference.
[0063] Production of TAGNPP1
[0064] TAGNPP1 can be produced using any of a variety of well-known techniques. TAGNPP1 encoded by DNA sequences as described above can be readily prepared from the DNA sequences using any of a variety of expression vectors described herein or well known to those of ordinary skill in the art. Expression can be achieved in any appropriate host cell that has been transformed or transfected with an expression vector containing a DNA molecule that encodes a recombinant polypeptide of the present invention. Supernatants from suitable host/vector systems which secrete recombinant fusion protein or polypeptide into culture media can be first concentrated using a commercially available filter. Following concentration, the concentrate can be applied to a suitable purification matrix such as an affinity matrix or an ion exchange resin. One or more reverse phase HPLC steps can be employed to further purify a recombinant polypeptide.
[0065] For high-yield production of recombinant proteins, stable expression is preferred. Cell lines stably expressing TAGNPP1 can be transformed using expression vectors which contain viral origins of replication and/or endogenous expression elements and/or a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells can be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells can be proliferated using tissue culture techniques appropriate to the cell type. Methods of producing exogenous protein in mammalian cell lines are well known in the art. Illustrative examples of this and other aspects and embodiments of the present invention for the production of heterologous polypeptides such as TAGNPP1 fusion proteins in avian cells are fully disclosed in U.S. patent application Ser. No. 09/877,374, filed Jun. 8, 2001, published as U.S. 2002/0108132-A1 on Aug. 8, 2002, and U.S. patent application Ser. No. 10/251,364, filed Sep. 18, 2002, each of which are incorporated herein by reference in their entirety. Examples of producing exogenous proteins in avian tumor cell lines are also described in U.S. Pat. Publication No. 2009/0253176, the entire teachings of which are incorporated herein by reference in entirety.
[0066] The invention specifically contemplates the production of the TAGNPP1 proteins disclosed herein in a transgenic avian system. In one particularly useful embodiment, the invention is drawn to the production of TAGNPP1 which can be produced in the oviduct of a transgenic avian, such as a chicken, in accordance with the invention. Examples of producing exogenous proteins in transgenic avian expression system are also described in U.S. Pat. No. 6,730,822, the entire teachings of which are incorporated herein by reference in entirety. Briefly, a suitable avian vector described above that contains a nucleic acid sequence encoding a TAGNPP1 fusion protein, operably linked to a tissue-specific or constitutive promoter that drives expression of the encoding sequence in the chicken oviduct are introduced into chicken stage X embryonic cells. The transformed embryonic cells are incubated under conditions conducive to hatching live chicks. Live chicks are nurtured into a mature chimeric chicken which are mated with a non-transgenic chicken naturally or via artificial insemination. A transgenic chicken is identified by screening progeny for germ line incorporation of the protein encoding sequence. The transgenic progeny can be mated with another transgenic or a non-transgenic chicken to produce eggs containing the TAGNPP1 fusion protein. The TAGNPP1 is then isolated and purified by methods well known in the art. Accordingly, the invention provides recombinant TAGNPP1 fusion proteins that have been produced by transgenic avians.
[0067] Pharmaceutical Composition
[0068] The present invention also features pharmaceutical compositions comprising isolated and substantially purified TAGNPP1 or a pharmaceutically acceptable salt thereof. Pharmaceutical composition of the present invention can also include a pharmaceutically acceptable carrier or excipient therefor. Compositions comprising such carriers, including composite molecules, are formulated by well-known conventional methods (see, for example, Remington's Pharmaceutical Sciences, 14th Ed., Mack Publishing Co., Easton, Pa.), the entire teachings of which are incorporated herein by reference. The carrier may comprise a diluent. In one embodiment, the pharmaceutical carrier can be a liquid and the fusion protein may be in the form of a solution. The pharmaceutical carrier can be wax, fat, or alcohol. In another embodiment, the pharmaceutically acceptable carrier may be a solid in the form of a powder, a lyophilized powder, or a tablet. In one embodiment, the carrier may comprise a liposome or a microcapsule.
[0069] The pharmaceutical compositions can be in the form of a sterile lyophilized powder for injection upon reconstitution with a diluent. The diluent can be water for injection, bacteriostatic water for injection, or sterile saline. The lyophilized powder may be produced by freeze drying a solution of the fusion protein to produce the protein in dry form. As is known in the art, the lyophilized protein generally has increased stability and a longer shelf life than a liquid solution of the protein.
DEFINITIONS
[0070] As used herein, the term "acceptable" with respect to a formulation, composition or ingredient, as used herein, means having no persistent detrimental effect on the general health of the subject being treated.
[0071] As used herein, the term "administration" or "administering" refers to providing a fusion protein of the invention to a subject in need of treatment.
[0072] "Alterations," as used herein, comprise any alteration in the sequence of polynucleotides encoding TAGNPP1 including deletions, insertions, and point mutations that may be detected using hybridization assays.
[0073] The term "animal" is used herein to include all vertebrate animals, including avians and mammals such as rat, mouse and human. It also includes an individual animal in all stages of development, including embryonic and fetal stages.
[0074] Where "amino acid sequence" is recited herein to refer to an amino acid sequence of a fusion protein molecule, "amino acid sequence" and like terms, such as "polypeptide" or "protein" are not meant to limit the amino acid sequence to the complete amino acid sequence associated with the recited protein or polypeptide molecule.
[0075] The term "avian" as used herein refers to any species, subspecies or strain of organism of the taxonomic class ava, such as, but not limited to, such organisms as chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites including ostrich, emu and cassowary. The term includes the various known strains of Gallus gallus, or chickens, (for example, White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New Hampshire, Rhode Island, Ausstralorp, Minorca, Amrox, California Gray, Italian Partridge-colored), as well as strains of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred in commercial quantities.
[0076] The phrase "based on" or "derived from" as in a retroviral vector being based on or derived from a particular retrovirus or based on a nucleotide sequence of a particular retrovirus mean that the genome of the retroviral vector contains a substantial portion of the nucleotide sequence of the genome of the particular retrovirus. The substantial portion may be a particular gene or nucleotide sequence such as the nucleotide sequence encoding the gag, pol and/or env proteins or other structural or functional nucleotide sequence of the virus genome such as sequences encoding the LTRs or may be substantially the complete retrovirus genome, for example, most (e.g., more than 60% or more than 70% or more than 80% or more than 90%) or all of the retrovirus genome, as will be apparent from the context in the specification as the knowledge of one skilled in the art. Examples of retroviral vectors that are based on or derived from a retrovirus are the NL retroviral vectors (e.g., NLB) which are based on the ALV retrovirus as disclosed in Cosset et al., Journal of Virology (1991) vol 65, p 3388-3394.
[0077] The term "biologically active," as used herein, refers to a fusion protein having structural, regulatory, or biochemical functions of pyrophosphatase/phosphodiesterase of a naturally occurring NPP1 protein.
[0078] The term "construct," as used herein, refers to a linear or circular nucleotide sequence such as DNA that has been assembled from more than one segments of nucleotide sequence which have been isolated from a natural source or have been chemically synthesized, or combinations thereof.
[0079] The term "complementary," as used herein, refers to two nucleic acid molecules that can form specific interactions with one another. In the specific interactions, an adenine base within one strand of a nucleic acid can form two hydrogen bonds with thymine within a second nucleic acid strand when the two nucleic acid strands are in opposing polarities. Also in the specific interactions, a guanine base within one strand of a nucleic acid can form three hydrogen bonds with cytosine within a second nucleic acid strand when the two nucleic acid strands are in opposing polarities. Complementary nucleic acids as referred to herein, may further comprise modified bases wherein a modified adenine may form hydrogen bonds with a thymine or modified thymine, and a modified cytosine may form hydrogen bonds with a guanine or a modified guanine.
[0080] A "deletion," as used herein, refers to a change in either amino acid or nucleotide sequence in which one or more amino acid or nucleotide residues, respectively, are absent.
[0081] The term "expressed" or "expression" as used herein refers to the transcription from a gene to give an RNA nucleic acid molecule at least complementary in part to a region of one of the two nucleic acid strands of the gene. The term "expressed" or "expression" as used herein can also refer to the translation of RNA to produce a protein or peptide.
[0082] The term "expression vector" as used herein refers to a nucleic acid vector that comprises a gene expression controlling region, such as a promoter or promoter component, operably linked to a nucleotide sequence coding at least one polypeptide.
[0083] "Functional portion" or "functional fragment" are used interchangeably and as used herein means a portion or fragment of a whole capable of performing, in whole or in part, a function of the whole. For example, a biologically functional portion of a molecule means a portion of the molecule that performs a biological function of the whole or intact molecule. For example, a functional portion of a gene expression controlling region is a fragment or portion of the specified gene expression controlling region that, in whole or in part, regulates or controls gene expression (e.g., facilitates either in whole or in part) in a biological system (e.g., a promoter). Functional portions may be of any useful size.
[0084] The term "gene expression controlling region" as used herein refers to nucleotide sequences that are associated with a coding sequence and which regulate, in whole or in part, expression of the coding sequence, for example, regulate, in whole or in part, the transcription of the coding sequence. Gene expression controlling regions may be isolated from a naturally occurring source or may be chemically synthesized and can be incorporated into a nucleic acid vector to enable regulated transcription in appropriate cells. The "gene expression controlling regions" may precede, but is not limited to preceding, the region of a nucleic acid sequence that is in the region 5' of the end of a coding sequence that may be transcribed into mRNA.
[0085] The terms "heterologous," "exogenous" and "foreign" are used interchangeably herein and in general refer to a biomolecule such as a nucleic acid or a protein that is not normally found in a certain organism or in a certain cell, tissue or other component contained in or produced by an organism. For example, a protein that is heterologous or exogenous to an egg is a protein that is not normally found in the egg. As used herein, the terms "heterologous," "exogenous" and "foreign" with reference to nucleic acids, such as DNA and RNA, are used interchangeably and refer to nucleic acid that does not occur naturally as part of a chromosome, a genome or cell in which it is present or which is found in a location(s) and/or in amounts that differ from the location(s) and/or amounts in which it occurs in nature. It can be nucleic acid that is not endogenous to the genome, chromosome or cell and has been exogenously introduced into the genome, chromosome or cell. Examples of heterologous DNA include, but are not limited to, a DNA comprising a gene expression control region and DNA that encodes a product or products, for example, RNA or protein product. Examples of heterologous DNA include, but are not limited to, gene expression controlling regions or promoters disclosed herein once isolated from the avian and as used thereafter, e.g., after re-introduction into an avian genome.
[0086] The term "isolated nucleic acid" as used herein covers, for example, (a) a DNA which has the sequence of part of a naturally occurring genomic molecule but is not flanked by at least one of the sequences that flank that part of the molecule in the genome of the species in which it naturally occurs; (b) a nucleic acid which has been incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such that the resulting vector or genomic DNA is not identical to naturally occurring DNA from which the nucleic acid was obtained; (c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR), ligase chain reaction (LCR) or chemical synthesis, or a restriction fragment; (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein, and (e) a recombinant nucleotide sequence that is part of a hybrid sequence that is not naturally occurring. Isolated nucleic acid molecules of the present invention can include, for example, natural allelic variants as well as nucleic acid molecules modified by nucleotide deletions, insertions, inversions, or substitutions.
[0087] An "insertion" or "addition," as used herein, refers to a change in an amino acid or nucleotide sequence resulting in the addition of one or more amino acid or nucleotide residues, respectively, as compared to the TAGNPP1 molecule.
[0088] The term "nucleic acid" as used herein refers to any linear or sequential array of nucleotides and nucleosides, for example cDNA, genomic DNA, mRNA, tRNA, oligonucleotides, oligonucleosides and derivatives thereof. For ease of discussion, non-naturally occurring nucleic acids may be referred to herein as constructs. Nucleic acids can include bacterial plasmid vectors including expression, cloning, cosmid and transformation vectors such as, animal viral vectors such as, but not limited to, modified adenovirus, influenza virus, polio virus, pox virus, retroviruses such as avian leukosis virus (ALV) retroviral vector, a murine leukemia virus (MLV) retroviral vector, and a lentivirus vector, and the like and fragments thereof. In addition, the nucleic acid can be an LTR of an avian leukosis virus (ALV) retroviral vector, a murine leukemia virus (MLV) retroviral vector, or a lentivirus vector and fragments thereof. Nucleic acids can also include NL vectors such as NLB, NLD and NLA and fragments thereof and synthetic oligonucleotides such as chemically synthesized DNA or RNA. Nucleic acids can include modified or derivatised nucleotides and nucleosides such as, but not limited to, halogenated nucleotides such as, but not only, 5-bromouracil, and derivatised nucleotides such as biotin-labeled nucleotides.
[0089] "Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide, or polynucleotide, and fragments or portions thereof, and to DNA or RNA of genomic or synthetic origin which may be single- or double-stranded, and represent the sense or antisense strand.
[0090] The term "operably linked" refers to an arrangement of elements wherein the components so described are configured so as to perform their usual function. Gene expression controlling regions or promoters (e.g., promoter components) operably linked to a coding sequence are capable of effecting the expression of the coding sequence. The controlling sequences need not be contiguous with the coding sequence, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the coding sequence and the promoter sequence can still be considered "operably linked" to the coding sequence.
[0091] The term "oviduct specific promoter" as used herein refers to promoters and promoter components which are functional, i.e., provide for transcription of a coding sequence, to a large extent, for example, primarily (i.e., more than 50% of the transcription product produced in the animal by a particular promoter type being produced in oviduct cells) or exclusively in oviduct cells of a bird. Examples of useful oviduct specific promoters include, without limitation, ovalbumin promoter, ovomucoid promoter, ovoinhibitor promoter, lysozyme promoter and ovotransferrin promoter and functional portions of these promoters, e.g., promoter components.
[0092] The terms "polynucleotide," "oligonucleotide," "nucleotide sequence" and "nucleic acid sequence" can be used interchangeably herein and include, but are not limited to, coding sequences, i.e., polynucleotide(s) or nucleic acid sequence(s) which are transcribed and translated into polypeptide in vitro or in vivo when placed under the control of appropriate regulatory or control sequences; controlling sequences, e.g., translational start and stop codons, promoter sequences, ribosome binding sites, polyadenylation signals, transcription factor binding sites, transcription termination sequences, upstream and downstream regulatory domains, enhancers, silencers, DNA sequences to which a transcription factor(s) binds and alters the activity of a gene's promoter either positively (induction) or negatively (repression) and the like. No limitation as to length or to synthetic origin are suggested by the terms described herein.
[0093] As used herein, the terms "polypeptide" and "protein" can be used interchangeably and refer to a polymer of amino acids of three or more amino acids in a serial array, linked through peptide bonds. The term "polypeptide" includes proteins such as fusion proteins, protein fragments, protein analogues, oligopeptides and the like. The term "polypeptides" includes polypeptides as defined above that are encoded by nucleic acids, produced through recombinant technology (e.g., isolated from a transgenic bird), or chemically synthesized.
[0094] As used herein, the term "promoter" refers to a DNA sequence useful to initiate transcription initiation by an RNA polymerase in an avian cell. A "promoter component" is a DNA sequence that can, by itself or, in combination with other DNA sequences effect or facilitate transcription. Specific promoter components such as ovalbumin promoter components, ovomucoid promoter components and lysozyme promoter components and other promoters and promoter components disclosed and claimed herein do not describe a specific promoter sequence. Rather, they encompass any sequence or sequence fragment of the respective promoter that is useful to effect or facilitate transcription of a coding sequence. For example, an ovomucoid promoter component includes, without limitation, the about 1.8 kb, the about 3.9 kb and the about 10 kb ovomucoid promoters disclosed in U.S. Publication Ser. No. 11/649,543, published May 17, 2007, which is incorporated in its entirety herein by reference. "Promoter components" can also encompass rearranged gene expression controlling regions which function to initiate RNA transcription and hybrid DNA molecules composed of naturally occurring DNA sequences and/or synthetic DNA sequences which function to initiate RNA transcription.
[0095] The terms "recombinant nucleic acid" and "recombinant DNA" as used herein refer to combinations of at least two nucleic acid sequences that are not naturally found in a eukaryotic or prokaryotic cell. The nucleic acid sequences may include, but are not limited to, nucleic acid vectors, gene expression regulatory elements, origins of replication, suitable gene sequences that when expressed confer antibiotic resistance, protein-encoding sequences and the like. The term "recombinant polypeptide" or "recombinant protein" is meant to include a polypeptide produced by recombinant DNA techniques such that it is distinct from a naturally occurring polypeptide either in its location, purity or structure. Generally, such a recombinant polypeptide will be present in a cell in an amount different from that normally observed in nature.
[0096] The term "stringent conditions," as used herein, is the "stringency" which occurs within a range from about Tm -5.degree. C. (5.degree. C. below the melting temperature (Tm) of the probe) to about 20.degree. C. to 25.degree. C. below Tm. As will be understood by those of skill in the art, the stringency of hybridization may be altered in order to identify or detect identical or related polynucleotide sequences.
[0097] As used herein, the term "subject" or "patient" encompasses mammals and non-mammals. Examples of mammals include, but are not limited to, humans, chimpanzees, apes monkeys, cattle, horses, sheep, goats, swine; rabbits, dogs, cats, rats, mice, guinea pigs, and the like. Examples of non-mammals include, but are not limited to, birds, fish and the like.
[0098] A "substitution," as used herein, refers to the replacement of one or more amino acids or nucleotides by different amino acids or nucleotides, respectively.
[0099] As used herein, the term "therapeutically effective amount" refers to any amount of a compound which, as compared to a corresponding subject who has not received such amount, results in improved treatment, healing, prevention, or amelioration of a disease, disorder, or side effect, or a decrease in the rate of advancement of a disease or disorder. The term also includes within its scope amounts effective to enhance normal physiological function.
[0100] As used herein, the terms "TAGNPP1," "fusion protein," "TAGNPP1 polypeptide" and "NPP1 component fused to a targeting moiety" are used interchangeably.
[0101] As used herein, the term "treat," "treating" or "treatment" refers to methods of alleviating, abating or ameliorating a disease or condition symptoms, preventing additional symptoms, ameliorating or preventing the underlying causes of symptoms, inhibiting the disease or condition, arresting the development of the disease or condition, relieving the disease or condition, causing regression of the disease or condition, relieving a condition caused by the disease or condition, or stopping the symptoms of the disease or condition either prophylactically and/or therapeutically.
[0102] A "variant" of TAGNPP1, as used herein, refers to an amino acid sequence that is altered by one or more amino acids. Preferably, a variant contains conservative substitutions. A "conservative substitution" is one in which an amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art of peptide chemistry would expect the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged Amino acid substitutions may generally be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the residues. For example, negatively charged amino acids include Asp and Glu; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include Leu, Ile and Val; Gly and Ala; Asp and Gln; and Ser, Thr, Phe and Tyr. Other groups of amino acids that may represent conservative changes include: Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr; (2) Cys, Ser, Tyr, Thr; (3) Val, Ile, Leu, Met, Ala, Phe; (4) Lys, Arg, His; and (5) Phe, Tyr, Trp, His. A variant may also, or alternatively, contain nonconservative changes. In a preferred embodiment, variant polypeptides differ from a native sequence by substitution, deletion or addition of five amino acids or fewer, or by, for example, replacement of a Gly with a Trp. Variants may also (or alternatively) be modified by, for example, the deletion or addition of amino acids that have minimal influence on the immunogenicity, secondary structure and hydropathic nature of the NPP1 component. Guidance in determining which amino acid residues can be substituted, inserted, or deleted without abolishing biological or immunological activity can be found using computer programs well known in the art.
[0103] The term "vector" and "nucleic acid vector" as used herein refers to a natural or synthetic single or double stranded plasmid or viral nucleic acid molecule that can be transfected or transformed into cells and replicate independently of, or within, the host cell genome. A circular double stranded vector can be linearized by treatment with an appropriate restriction enzyme based on the nucleotide sequence of the vector. A nucleic acid can be inserted into a vector by cutting the vector with restriction enzymes and ligating the desired pieces together.
[0104] The term "portion," as used herein, with regard to a fusion protein refers to fragments of that protein. The fragments may range in size from four amino acid residues to the entire amino acid sequence minus one amino acid. Thus, a protein "comprising at least a portion of the amino acid sequence of SEQ ID NO:1" encompasses the full-length TAGNPP1 and fragments thereof.
[0105] "Transformation" or "transfection," as used herein, describes a process by which exogenous DNA enters and changes a recipient cell using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being transformed and may include, but is not limited to, electroporation, particle bombardment, viral infection, and lipofection. Such "transformed" cells include stably transformed cells in which the inserted DNA is capable of replicating either as an autonomously replicating plasmid or as part of the host chromosome. They also include cells which transiently express the inserted DNA or RNA for limited periods of time.
EXAMPLES
[0106] The present invention is further exemplified by the following examples. The examples are for illustrative purpose only and are not intended, nor should they be construed as limiting the invention in any manner.
Example I
[0107] The TAGsNNP1 construct containing the targeting moiety having eight consecutive aspartic acids fused to sNPP1 was ligated into pTT22 vector using EcoRI and HindIII sites (pTT22-sNPP1.D8; FIG. 19). pTT22-sNPP1.D8 was transfected into HEK203E cells and the transformants were cultured to express TAGsNNP1. TAGsNNP1 was isolated from the culture media and partially purified as well known in the art. Following the purification, the pyrophosphase/phosphodiesterase activity of TAGsNPP1 was measured for its ability to hydrolyze thymidine 5' monophosphate p-nitrophenyl ester. Briefly, TAGsNPP1 was diluted to 1 ng/.mu.L in 50 mM Tris, 250 mM NaCl, pH 9.5. In a plate containing 50 .mu.L of 1 ng/.mu.L TAGsNPP1, 50 .mu.L of 10 mM thymidine 5' monophosphate p-nitrophenyl ester (Sigma.TM., Catalog #T4510) substrate was added. The enzyme activity of TAGsNPP1 was measured at 405 nm (absobance) in kinetic mode for 5 minutes. As shown in FIG. 21, the activity of TAGsNPP1 was detected above the level observed in control containing no TAGsNPP1. Particularly, TAGsNPP1 produced from HEK203D6 exhibited the highest level of the enzymatic activity. This results strongly suggested that the truncated NPP1 fused to a targeting moiety (i.e., D8) sufficiently maintained its normal function as nuclease.
Example II
[0108] This non-limiting prophetic example describes how to treat idiopathic infantile arterial calcification by administering a formulation comprising a TAGNPP1 fusion protein.
[0109] A clinician uses a diagnostic test to verify that a patient has high levels of calcification in the artery. A genetic test can be also performed for NPP1 defects as described in Rutsch et al. (2003), Nature Genetics 34:379-81.
[0110] The pharmaceutical compositions of the present invention are preferably administered intravenously, although interadermal, intramuscular or oral administration are employed in certain circumstances.
[0111] The clinician determines a dose which may vary depending on the gender, age, health, and weight of the patient. The determination of the appropriate dosage or route of administration is well within the skill of an ordinary physician.
[0112] The formulation containing TAGNPP1 can be infused, between about 10 mg/kg and about 1000 mg/kg per week weekly. 10-30 mg/kg can be administered once. During the infusion period, patients are monitored closely and appropriate clinical intervention is taken in the event of an adverse event. Treatment lasts at least 1 month or for the life of the patient. A window of 48 hours may be allowed for each infusion. An infusion schedule in which the rate of infusion increases with time reduces or eliminates adverse events. Infusions for infants can be administered according to the following schedule: 5-10 cc/hr for 60 minutes in each interval.
[0113] On the other hand, when continuous intravenous administration is desired, typical example of the slow release systems comprises that 1-100 mg/kg of effective TAGNPP1 proteins can be continuously released for more than 1 day.
Example III
Purification of Soluble TAGsNPP1-D8
[0114] The soluble form of TAGsNPP1 (C-terminally tagged D8) (see, SEQ ID NO:20 and FIG. 26) was purified from HEK293 conditioned medium. The conditioned medium (500 ml) was equilibrated to hydroxyapatite (HA)-Buffer A (10 mM Na.sub.3PO.sub.4, pH 6.8) and filtered with a 0.2 .mu.m Sartobran P Size 8 MidiCap Filter (Sartorius Stedim). A 20 ml HA-Ultrogel column (Pall Life Sciences) was equilibrated with 5 column volume (CV) of Buffer A before loading the filtered conditioned medium at 3 ml/min. The column was washed to UV baseline with Buffer A. The protein was eluted stepwise using 2.5 CV each of HA-Buffer B (150 mM Na.sub.3PO.sub.4, pH 6.8), HA-Buffer C (250 mM Na.sub.3PO.sub.4, pH 6.8), and HA-Buffer D (500 mM Na.sub.3PO.sub.4, pH 6.8) while collecting 5 ml fractions. Active fractions from the HA-column were pooled (50 ml) and equilibrated to WGA-Buffer A (20 mM Tris, pH 8.0, 150 mM NaCl, 0.7% CHAPS). After filtering, 6-7 mg total protein (15-18 ml) was loaded on a 1 ml Wheat Germ Agglutinin (WGA) gravity column (EMD Chemicals) that had been equilibrated with 5 CV of WGA-Buffer A. The column was washed with 7 CV of WGA-Buffer A, and then the protein was eluted with 5.times.1 ml WGA-Buffer B (20 mM Tris, pH 8.0, 150 mM NaCl, 500 mM N-Acetylglucosamine, 0.7% CHAPS buffer). The WGA-Buffer B was allowed to incubate with the column at room temperature for 10 minutes before collecting 1 ml fractions. The process was repeated until all of the starting material had been purified by the WGA-column (total of 4 runs). The active fractions (29 ml) were pooled and concentrated to 1.2 ml using a 100,000 molecular weight cut off (MWCO) Vivaspin-15 (Sartorius Stedim), buffer exchanging into PBS at the same time.
[0115] Active sNPP1-D8 (TAGsNPP1) was purified to 91.5% based on HPLC. When the final pool was analyzed by SDS-PAGE a dimer band corresponded to .about.210 kD under non-reducing conditions and under reducing conditions a monomer band corresponded to .about.105 kD.
Example IV
[0116] Human aortic smooth muscle cells were plated at 1.times.10.sup.4 cells per well in 48 well plates and maintained in Dulbecco's Modified Eagle Medium (DMEM) under standard conditions. After 2 days, DMEM was supplemented with NaPO.sub.4 and ATP at 3.8 mM and 50 uM, respectively. sNPP1 (WT; FIG. 9), TAGsNPP1 having eight consecutive aspartic acid residues (D8) fused to C-terminus (FIG. 26) and sNPP1-Fc (FIG. 29) were supplemented at 1 ug/ml. Media was replaced with same every other day. After 5 days, media was removed from the culture and replaced with 100 ul of 0.6N HCl and incubated for 16-20 hours at room temperature, dissolving calcium phosphates into the solution. Calcium levels were then quantified by a colorimetric Calcium-Cresolphthalein reaction using the Calcium Assay Kit (#700550) from Cayman Chemical Company, Ann Arbor, Mich.
[0117] As shown in FIG. 23, TAGsNPP1 demonstrated an increased inhibition on calcification as compared to that of sNPP1 (WT), suggesting that the D8 domain contained in TAGsNPP1 provides an increased homing ability to calcification sites and/or an enhanced inhibitory effect.
Example V
[0118] Enzymatic Activity Assay of sNPP1-Fc
[0119] Addition of 1 ug sNPP1-Fc (FIG. 29) was based on 70% purity (i.e. .about.1.1 .mu.g of sNPP1-Fc was used in the assay). HPLC size exclusion column (SEC) indicates that the sNPP1-Fc is .about.78% pure. The non-reduced gel showed a dimer at the expected size (.about.250 kD). The dimer band reduced to the expected monomer size (.about.125 kD) when treated with DTT. Western blot analysis confirmed the dimer and monomer bands. Treatment of the sNPP1-Fc sample with PNGase F reduced the monomer band from .about.125 kD to .about.100 kD, which is the expected molecular weight considering that sNPP1-Fc contained 12 N-glycan sites. At 78% purity, final quantitation yielded .about.0.808 mg sNPP1-Fc (2.2 .mu.g/ml conditioned media). The enzymatic activity of sNPP1-Fc was determined as known in the art.
TABLE-US-00001 TABLE 1 Enzymatic activity of sNPP1-Fc Sample OD.sub.405 Final OD.sub.405 Neg. Cntrl (non-NPP1 protein) 0.053 0.000 1 ug Wt sNPP1 (soluble sNPP1) 0.296 0.243 1 ug NPP1-Fc (c-terminus) 0.397 0.344
[0120] Each example in the above specification is provided by way of explanation of the invention, not limitation of the invention. In fact, it will be apparent to those skilled in the art that various modifications, combinations, additions, deletions and variations can be made in the present invention without departing from the scope or spirit of the invention. For instance, features illustrated or described as part of one embodiment can be used in another embodiment to yield a still further embodiment. It is intended that the present invention cover such modifications, combinations, additions, deletions, and variations.
[0121] All publications, patents, patent applications, internet sites, and accession numbers/database sequences (including both polynucleotide and polypeptide sequences) cited herein are hereby incorporated by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application, internet site, or accession number/database sequence were specifically and individually indicated to be so incorporated by reference.
Sequence CWU
1
1
221925PRTHomo sapiens 1Met Glu Arg Asp Gly Cys Ala Gly Gly Gly Ser Arg Gly
Gly Glu Gly 1 5 10 15
Gly Arg Ala Pro Arg Glu Gly Pro Ala Gly Asn Gly Arg Asp Arg Gly
20 25 30 Arg Ser His Ala
Ala Glu Ala Pro Gly Asp Pro Gln Ala Ala Ala Ser 35
40 45 Leu Leu Ala Pro Met Asp Val Gly Glu
Glu Pro Leu Glu Lys Ala Ala 50 55
60 Arg Ala Arg Thr Ala Lys Asp Pro Asn Thr Tyr Lys Val
Leu Ser Leu 65 70 75
80 Val Leu Ser Val Cys Val Leu Thr Thr Ile Leu Gly Cys Ile Phe Gly
85 90 95 Leu Lys Pro Ser
Cys Ala Lys Glu Val Lys Ser Cys Lys Gly Arg Cys 100
105 110 Phe Glu Arg Thr Phe Gly Asn Cys Arg
Cys Asp Ala Ala Cys Val Glu 115 120
125 Leu Gly Asn Cys Cys Leu Asp Tyr Gln Glu Thr Cys Ile Glu
Pro Glu 130 135 140
His Ile Trp Thr Cys Asn Lys Phe Arg Cys Gly Glu Lys Arg Leu Thr 145
150 155 160 Arg Ser Leu Cys Ala
Cys Ser Asp Asp Cys Lys Asp Lys Gly Asp Cys 165
170 175 Cys Ile Asn Tyr Ser Ser Val Cys Gln Gly
Glu Lys Ser Trp Val Glu 180 185
190 Glu Pro Cys Glu Ser Ile Asn Glu Pro Gln Cys Pro Ala Gly Phe
Glu 195 200 205 Thr
Pro Pro Thr Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala Glu Tyr 210
215 220 Leu His Thr Trp Gly Gly
Leu Leu Pro Val Ile Ser Lys Leu Lys Lys 225 230
235 240 Cys Gly Thr Tyr Thr Lys Asn Met Arg Pro Val
Tyr Pro Thr Lys Thr 245 250
255 Phe Pro Asn His Tyr Ser Ile Val Thr Gly Leu Tyr Pro Glu Ser His
260 265 270 Gly Ile
Ile Asp Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser Phe 275
280 285 Ser Leu Lys Ser Lys Glu Lys
Phe Asn Pro Glu Trp Tyr Lys Gly Glu 290 295
300 Pro Ile Trp Val Thr Ala Lys Tyr Gln Gly Leu Lys
Ser Gly Thr Phe 305 310 315
320 Phe Trp Pro Gly Ser Asp Val Glu Ile Asn Gly Ile Phe Pro Asp Ile
325 330 335 Tyr Lys Met
Tyr Asn Gly Ser Val Pro Phe Glu Glu Arg Ile Leu Ala 340
345 350 Val Leu Gln Trp Leu Gln Leu Pro
Lys Asp Glu Arg Pro His Phe Tyr 355 360
365 Thr Leu Tyr Leu Glu Glu Pro Asp Ser Ser Gly His Ser
Tyr Gly Pro 370 375 380
Val Ser Ser Glu Val Ile Lys Ala Leu Gln Arg Val Asp Gly Met Val 385
390 395 400 Gly Met Leu Met
Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu 405
410 415 Asn Leu Ile Leu Ile Ser Asp His Gly
Met Glu Gln Gly Ser Cys Lys 420 425
430 Lys Tyr Ile Tyr Leu Asn Lys Tyr Leu Gly Asp Val Lys Asn
Ile Lys 435 440 445
Val Ile Tyr Gly Pro Ala Ala Arg Leu Arg Pro Ser Asp Val Pro Asp 450
455 460 Lys Tyr Tyr Ser Phe
Asn Tyr Glu Gly Ile Ala Arg Asn Leu Ser Cys 465 470
475 480 Arg Glu Pro Asn Gln His Phe Lys Pro Tyr
Leu Lys His Phe Leu Pro 485 490
495 Lys Arg Leu His Phe Ala Lys Ser Asp Arg Ile Glu Pro Leu Thr
Phe 500 505 510 Tyr
Leu Asp Pro Gln Trp Gln Leu Ala Leu Asn Pro Ser Glu Arg Lys 515
520 525 Tyr Cys Gly Ser Gly Phe
His Gly Ser Asp Asn Val Phe Ser Asn Met 530 535
540 Gln Ala Leu Phe Val Gly Tyr Gly Pro Gly Phe
Lys His Gly Ile Glu 545 550 555
560 Ala Asp Thr Phe Glu Asn Ile Glu Val Tyr Asn Leu Met Cys Asp Leu
565 570 575 Leu Asn
Leu Thr Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu Asn 580
585 590 His Leu Leu Lys Asn Pro Val
Tyr Thr Pro Lys His Pro Lys Glu Val 595 600
605 His Pro Leu Val Gln Cys Pro Phe Thr Arg Asn Pro
Arg Asp Asn Leu 610 615 620
Gly Cys Ser Cys Asn Pro Ser Ile Leu Pro Ile Glu Asp Phe Gln Thr 625
630 635 640 Gln Phe Asn
Leu Thr Val Ala Glu Glu Lys Ile Ile Lys His Glu Thr 645
650 655 Leu Pro Tyr Gly Arg Pro Arg Val
Leu Gln Lys Glu Asn Thr Ile Cys 660 665
670 Leu Leu Ser Gln His Gln Phe Met Ser Gly Tyr Ser Gln
Asp Ile Leu 675 680 685
Met Pro Leu Trp Thr Ser Tyr Thr Val Asp Arg Asn Asp Ser Phe Ser 690
695 700 Thr Glu Asp Phe
Ser Asn Cys Leu Tyr Gln Asp Phe Arg Ile Pro Leu 705 710
715 720 Ser Pro Val His Lys Cys Ser Phe Tyr
Lys Asn Asn Thr Lys Val Ser 725 730
735 Tyr Gly Phe Leu Ser Pro Pro Gln Leu Asn Lys Asn Ser Ser
Gly Ile 740 745 750
Tyr Ser Glu Ala Leu Leu Thr Thr Asn Ile Val Pro Met Tyr Gln Ser
755 760 765 Phe Gln Val Ile
Trp Arg Tyr Phe His Asp Thr Leu Leu Arg Lys Tyr 770
775 780 Ala Glu Glu Arg Asn Gly Val Asn
Val Val Ser Gly Pro Val Phe Asp 785 790
795 800 Phe Asp Tyr Asp Gly Arg Cys Asp Ser Leu Glu Asn
Leu Arg Gln Lys 805 810
815 Arg Arg Val Ile Arg Asn Gln Glu Ile Leu Ile Pro Thr His Phe Phe
820 825 830 Ile Val Leu
Thr Ser Cys Lys Asp Thr Ser Gln Thr Pro Leu His Cys 835
840 845 Glu Asn Leu Asp Thr Leu Ala Phe
Ile Leu Pro His Arg Thr Asp Asn 850 855
860 Ser Glu Ser Cys Val His Gly Lys His Asp Ser Ser Trp
Val Glu Glu 865 870 875
880 Leu Leu Met Leu His Arg Ala Arg Ile Thr Asp Val Glu His Ile Thr
885 890 895 Gly Leu Ser Phe
Tyr Gln Gln Arg Lys Glu Pro Val Ser Asp Ile Leu 900
905 910 Lys Leu Lys Thr His Leu Pro Thr Phe
Ser Gln Glu Asp 915 920 925
2387PRTArtificial sequenceNPP1 catalytic domain (sssNPP1) 2Ala Gly Phe
Glu Thr Pro Pro Thr Leu Leu Phe Ser Leu Asp Gly Phe 1 5
10 15 Arg Ala Glu Tyr Leu His Thr Trp
Gly Gly Leu Leu Pro Val Ile Ser 20 25
30 Lys Leu Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg
Pro Val Tyr 35 40 45
Pro Thr Lys Thr Phe Pro Asn His Tyr Ser Ile Val Thr Gly Leu Tyr 50
55 60 Pro Glu Ser His
Gly Ile Ile Asp Asn Lys Met Tyr Asp Pro Lys Met 65 70
75 80 Asn Ala Ser Phe Ser Leu Lys Ser Lys
Glu Lys Phe Asn Pro Glu Trp 85 90
95 Tyr Lys Gly Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln Gly
Leu Lys 100 105 110
Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu Ile Asn Gly Ile
115 120 125 Phe Pro Asp Ile
Tyr Lys Met Tyr Asn Gly Ser Val Pro Phe Glu Glu 130
135 140 Arg Ile Leu Ala Val Leu Gln Trp
Leu Gln Leu Pro Lys Asp Glu Arg 145 150
155 160 Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp
Ser Ser Gly His 165 170
175 Ser Tyr Gly Pro Val Ser Ser Glu Val Ile Lys Ala Leu Gln Arg Val
180 185 190 Asp Gly Met
Val Gly Met Leu Met Asp Gly Leu Lys Glu Leu Asn Leu 195
200 205 His Arg Cys Leu Asn Leu Ile Leu
Ile Ser Asp His Gly Met Glu Gln 210 215
220 Gly Ser Cys Lys Lys Tyr Ile Tyr Leu Asn Lys Tyr Leu
Gly Asp Val 225 230 235
240 Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala Ala Arg Leu Arg Pro Ser
245 250 255 Asp Val Pro Asp
Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly Ile Ala Arg 260
265 270 Asn Leu Ser Cys Arg Glu Pro Asn Gln
His Phe Lys Pro Tyr Leu Lys 275 280
285 His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser Asp Arg
Ile Glu 290 295 300
Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln Leu Ala Leu Asn Pro 305
310 315 320 Ser Glu Arg Lys Tyr
Cys Gly Ser Gly Phe His Gly Ser Asp Asn Val 325
330 335 Phe Ser Asn Met Gln Ala Leu Phe Val Gly
Tyr Gly Pro Gly Phe Lys 340 345
350 His Gly Ile Glu Ala Asp Thr Phe Glu Asn Ile Glu Val Tyr Asn
Leu 355 360 365 Met
Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn Asn Gly Thr His 370
375 380 Gly Ser Leu 385
3420PRTArtificial sequenceD8 fused to NPP1 catalytic domain (TAGsssNPP1)
3Ile Gly Val Leu Leu Thr Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1
5 10 15 Leu Leu Phe Pro
Ser Met Ala Ser Met Asp Asp Asp Asp Asp Asp Asp 20
25 30 Asp Ala Gly Phe Glu Thr Pro Pro Thr
Leu Leu Phe Ser Leu Asp Gly 35 40
45 Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu Pro
Val Ile 50 55 60
Ser Lys Leu Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg Pro Val 65
70 75 80 Tyr Pro Thr Lys Thr
Phe Pro Asn His Tyr Ser Ile Val Thr Gly Leu 85
90 95 Tyr Pro Glu Ser His Gly Ile Ile Asp Asn
Lys Met Tyr Asp Pro Lys 100 105
110 Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe Asn Pro
Glu 115 120 125 Trp
Tyr Lys Gly Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln Gly Leu 130
135 140 Lys Ser Gly Thr Phe Phe
Trp Pro Gly Ser Asp Val Glu Ile Asn Gly 145 150
155 160 Ile Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly
Ser Val Pro Phe Glu 165 170
175 Glu Arg Ile Leu Ala Val Leu Gln Trp Leu Gln Leu Pro Lys Asp Glu
180 185 190 Arg Pro
His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp Ser Ser Gly 195
200 205 His Ser Tyr Gly Pro Val Ser
Ser Glu Val Ile Lys Ala Leu Gln Arg 210 215
220 Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu
Lys Glu Leu Asn 225 230 235
240 Leu His Arg Cys Leu Asn Leu Ile Leu Ile Ser Asp His Gly Met Glu
245 250 255 Gln Gly Ser
Cys Lys Lys Tyr Ile Tyr Leu Asn Lys Tyr Leu Gly Asp 260
265 270 Val Lys Asn Ile Lys Val Ile Tyr
Gly Pro Ala Ala Arg Leu Arg Pro 275 280
285 Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu
Gly Ile Ala 290 295 300
Arg Asn Leu Ser Cys Arg Glu Pro Asn Gln His Phe Lys Pro Tyr Leu 305
310 315 320 Lys His Phe Leu
Pro Lys Arg Leu His Phe Ala Lys Ser Asp Arg Ile 325
330 335 Glu Pro Leu Thr Phe Tyr Leu Asp Pro
Gln Trp Gln Leu Ala Leu Asn 340 345
350 Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly Ser
Asp Asn 355 360 365
Val Phe Ser Asn Met Gln Ala Leu Phe Val Gly Tyr Gly Pro Gly Phe 370
375 380 Lys His Gly Ile Glu
Ala Asp Thr Phe Glu Asn Ile Glu Val Tyr Asn 385 390
395 400 Leu Met Cys Asp Leu Leu Asn Leu Thr Pro
Ala Pro Asn Asn Gly Thr 405 410
415 His Gly Ser Leu 420 4422PRTArtificial
sequenceD10 fused to NPP1 catalytic domain 4Ile Gly Val Leu Leu Thr Gln
Arg Thr Leu Leu Ser Leu Val Leu Ala 1 5
10 15 Leu Leu Phe Pro Ser Met Ala Ser Met Ala Gly
Phe Glu Thr Pro Pro 20 25
30 Thr Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala Glu Tyr Leu His
Thr 35 40 45 Trp
Gly Gly Leu Leu Pro Val Ile Ser Lys Leu Lys Lys Cys Gly Thr 50
55 60 Tyr Thr Lys Asn Met Arg
Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn 65 70
75 80 His Tyr Ser Ile Val Thr Gly Leu Tyr Pro Glu
Ser His Gly Ile Ile 85 90
95 Asp Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys
100 105 110 Ser Lys
Glu Lys Phe Asn Pro Glu Trp Tyr Lys Gly Glu Pro Ile Trp 115
120 125 Val Thr Ala Lys Tyr Gln Gly
Leu Lys Ser Gly Thr Phe Phe Trp Pro 130 135
140 Gly Ser Asp Val Glu Ile Asn Gly Ile Phe Pro Asp
Ile Tyr Lys Met 145 150 155
160 Tyr Asn Gly Ser Val Pro Phe Glu Glu Arg Ile Leu Ala Val Leu Gln
165 170 175 Trp Leu Gln
Leu Pro Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr 180
185 190 Leu Glu Glu Pro Asp Ser Ser Gly
His Ser Tyr Gly Pro Val Ser Ser 195 200
205 Glu Val Ile Lys Ala Leu Gln Arg Val Asp Gly Met Val
Gly Met Leu 210 215 220
Met Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu Asn Leu Ile 225
230 235 240 Leu Ile Ser Asp
His Gly Met Glu Gln Gly Ser Cys Lys Lys Tyr Ile 245
250 255 Tyr Leu Asn Lys Tyr Leu Gly Asp Val
Lys Asn Ile Lys Val Ile Tyr 260 265
270 Gly Pro Ala Ala Arg Leu Arg Pro Ser Asp Val Pro Asp Lys
Tyr Tyr 275 280 285
Ser Phe Asn Tyr Glu Gly Ile Ala Arg Asn Leu Ser Cys Arg Glu Pro 290
295 300 Asn Gln His Phe Lys
Pro Tyr Leu Lys His Phe Leu Pro Lys Arg Leu 305 310
315 320 His Phe Ala Lys Ser Asp Arg Ile Glu Pro
Leu Thr Phe Tyr Leu Asp 325 330
335 Pro Gln Trp Gln Leu Ala Leu Asn Pro Ser Glu Arg Lys Tyr Cys
Gly 340 345 350 Ser
Gly Phe His Gly Ser Asp Asn Val Phe Ser Asn Met Gln Ala Leu 355
360 365 Phe Val Gly Tyr Gly Pro
Gly Phe Lys His Gly Ile Glu Ala Asp Thr 370 375
380 Phe Glu Asn Ile Glu Val Tyr Asn Leu Met Cys
Asp Leu Leu Asn Leu 385 390 395
400 Thr Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu Asp Asp Asp Asp
405 410 415 Asp Asp
Asp Asp Asp Asp 420 52265DNAArtificial
sequenceNucleotide sequence encoding D8 NPP1 fusion protein
5atgggtgtac tgctcacaca gaggacgctg ctcagtctgg tccttgcact cctgtttcca
60agcatggcga gcatggatga cgatgatgac gacgatgacg cagggtttga aacgcctcct
120acactcttgt tttctttgga tggattcagg gcagaatatt tgcacacttg gggtggactt
180cttcctgtta ttagcaaact caaaaaatgt ggaacatata ctaaaaacat gagaccggtg
240tatccaacaa aaactttccc caatcactac agcattgtca ccggattgta tccagaatct
300catggcataa tcgacaataa gatgtatgat cccaaaatga atgcttcctt ttcacttaaa
360agtaaagaga aatttaatcc ggagtggtac aaaggagaac caatttgggt cacagctaag
420tatcaaggcc tcaagtctgg cacatttttc tggccaggat cagatgtgga aattaacgga
480attttcccag acatctataa aatgtataat ggttcagtgc catttgaaga aaggattttg
540gctgttcttc agtggctgca gcttccaaaa gatgaaagac cacactttta cactttgtat
600ttggaagaac cagattcttc aggtcattca tatggaccag tcagcagtga agtcatcaaa
660gccttgcaga gggttgatgg tatggttggt atgctgatgg atggtctgaa agagctgaac
720ttgcacagat gcctgaacct catccttatt tcagatcatg gcatggaaca aggcagttgt
780aagaaataca tatatctgaa taagtatttg ggggatgtta aaaatattaa agttatctat
840ggacctgcag ctcgattgag accctctgat gtcccagata aatactattc atttaactat
900gaaggcattg cccgaaatct ttcttgccgg gaaccaaacc agcacttcaa accttatctg
960aaacatttct tgcctaagcg tttgcacttt gctaagagtg atagaattga gcccttgaca
1020ttctatttgg accctcagtg gcaacttgca ttgaatccct cagaaaggaa atattgtgga
1080agtggatttc atggctctga caatgtgttt tcaaatatgc aagccctctt tgttggctat
1140ggacctggat tcaagcatgg cattgaggct gacacctttg aaaacattga agtctataac
1200ttgatgtgtg atttgctgaa tttgacaccg gctcctaata acggaactca tggaagtctt
1260aaccaccttc tgaagaatcc tgtttatacg ccaaagcatc ccaaagaagt gcaccccctg
1320gtgcagtgcc ccttcacaag aaaccccaga gataaccttg gctgctcatg taacccttcc
1380attttgccga ttgaggattt tcaaacacag ttcaatctga ccgtggcaga agagaagatt
1440attaagcatg aaactttgcc ctatggaaga cctagagttc tccagaagga aaacaccatc
1500tgtcttcttt cccagcacca gtttatgagt ggatacagcc aagacatctt gatgcccctt
1560tggacatcct ataccgtgga cagaaatgac agtttctcta cggaagactt ctccaactgt
1620ctgtaccagg actttagaat tcctcttagt cctgtccata aatgttcatt ttataaaaat
1680aacaccaaag tgagttacgg gttcctctcc ccaccacaac tgaataagaa ttcaagtgga
1740atatattctg aagccttgct tactacaaat atagtgccaa tgtaccagag ttttcaagtt
1800atatggcgct actttcatga caccctcttg cgaaagtatg cagaagaaag aaatggtgtc
1860aatgtcgtca gtggtcctgt gtttgacttt gattatgatg gacgttgtga ttccttggag
1920aatttgaggc aaaaaagaag agtcatccgt aaccaagaaa ttttgattcc aactcatttc
1980ttcattgtgc tgacaagctg taaagataca tctcagacgc ctttgcactg tgaaaacctg
2040gacaccttgg ctttcatttt gcctcacagg actgataaca gcgagagctg tgtgcatggg
2100aagcatgact cctcatgggt tgaagaattg ttgatgttgc acagagcacg gatcacagac
2160gtcgagcaca tcactggact cagcttttat caacaaagaa aagagccagt ttcagacatt
2220ttgaagttga aaacacattt gccaaccttt agccaagaag attga
22656754PRTArtificial SequenceD8 NPP1 fusion protein 6Ile Gly Val Leu Leu
Thr Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1 5
10 15 Leu Leu Phe Pro Ser Met Ala Ser Met Asp
Asp Asp Asp Asp Asp Asp 20 25
30 Asp Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe Ser Leu Asp
Gly 35 40 45 Phe
Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu Pro Val Ile 50
55 60 Ser Lys Leu Lys Lys Cys
Gly Thr Tyr Thr Lys Asn Met Arg Pro Val 65 70
75 80 Tyr Pro Thr Lys Thr Phe Pro Asn His Tyr Ser
Ile Val Thr Gly Leu 85 90
95 Tyr Pro Glu Ser His Gly Ile Ile Asp Asn Lys Met Tyr Asp Pro Lys
100 105 110 Met Asn
Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe Asn Pro Glu 115
120 125 Trp Tyr Lys Gly Glu Pro Ile
Trp Val Thr Ala Lys Tyr Gln Gly Leu 130 135
140 Lys Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val
Glu Ile Asn Gly 145 150 155
160 Ile Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly Ser Val Pro Phe Glu
165 170 175 Glu Arg Ile
Leu Ala Val Leu Gln Trp Leu Gln Leu Pro Lys Asp Glu 180
185 190 Arg Pro His Phe Tyr Thr Leu Tyr
Leu Glu Glu Pro Asp Ser Ser Gly 195 200
205 His Ser Tyr Gly Pro Val Ser Ser Glu Val Ile Lys Ala
Leu Gln Arg 210 215 220
Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys Glu Leu Asn 225
230 235 240 Leu His Arg Cys
Leu Asn Leu Ile Leu Ile Ser Asp His Gly Met Glu 245
250 255 Gln Gly Ser Cys Lys Lys Tyr Ile Tyr
Leu Asn Lys Tyr Leu Gly Asp 260 265
270 Val Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala Ala Arg Leu
Arg Pro 275 280 285
Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly Ile Ala 290
295 300 Arg Asn Leu Ser Cys
Arg Glu Pro Asn Gln His Phe Lys Pro Tyr Leu 305 310
315 320 Lys His Phe Leu Pro Lys Arg Leu His Phe
Ala Lys Ser Asp Arg Ile 325 330
335 Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln Leu Ala Leu
Asn 340 345 350 Pro
Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly Ser Asp Asn 355
360 365 Val Phe Ser Asn Met Gln
Ala Leu Phe Val Gly Tyr Gly Pro Gly Phe 370 375
380 Lys His Gly Ile Glu Ala Asp Thr Phe Glu Asn
Ile Glu Val Tyr Asn 385 390 395
400 Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn Asn Gly Thr
405 410 415 His Gly
Ser Leu Asn His Leu Leu Lys Asn Pro Val Tyr Thr Pro Lys 420
425 430 His Pro Lys Glu Val His Pro
Leu Val Gln Cys Pro Phe Thr Arg Asn 435 440
445 Pro Arg Asp Asn Leu Gly Cys Ser Cys Asn Pro Ser
Ile Leu Pro Ile 450 455 460
Glu Asp Phe Gln Thr Gln Phe Asn Leu Thr Val Ala Glu Glu Lys Ile 465
470 475 480 Ile Lys His
Glu Thr Leu Pro Tyr Gly Arg Pro Arg Val Leu Gln Lys 485
490 495 Glu Asn Thr Ile Cys Leu Leu Ser
Gln His Gln Phe Met Ser Gly Tyr 500 505
510 Ser Gln Asp Ile Leu Met Pro Leu Trp Thr Ser Tyr Thr
Val Asp Arg 515 520 525
Asn Asp Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu Tyr Gln Asp 530
535 540 Phe Arg Ile Pro
Leu Ser Pro Val His Lys Cys Ser Phe Tyr Lys Asn 545 550
555 560 Asn Thr Lys Val Ser Tyr Gly Phe Leu
Ser Pro Pro Gln Leu Asn Lys 565 570
575 Asn Ser Ser Gly Ile Tyr Ser Glu Ala Leu Leu Thr Thr Asn
Ile Val 580 585 590
Pro Met Tyr Gln Ser Phe Gln Val Ile Trp Arg Tyr Phe His Asp Thr
595 600 605 Leu Leu Arg Lys
Tyr Ala Glu Glu Arg Asn Gly Val Asn Val Val Ser 610
615 620 Gly Pro Val Phe Asp Phe Asp Tyr
Asp Gly Arg Cys Asp Ser Leu Glu 625 630
635 640 Asn Leu Arg Gln Lys Arg Arg Val Ile Arg Asn Gln
Glu Ile Leu Ile 645 650
655 Pro Thr His Phe Phe Ile Val Leu Thr Ser Cys Lys Asp Thr Ser Gln
660 665 670 Thr Pro Leu
His Cys Glu Asn Leu Asp Thr Leu Ala Phe Ile Leu Pro 675
680 685 His Arg Thr Asp Asn Ser Glu Ser
Cys Val His Gly Lys His Asp Ser 690 695
700 Ser Trp Val Glu Glu Leu Leu Met Leu His Arg Ala Arg
Ile Thr Asp 705 710 715
720 Val Glu His Ile Thr Gly Leu Ser Phe Tyr Gln Gln Arg Lys Glu Pro
725 730 735 Val Ser Asp Ile
Leu Lys Leu Lys Thr His Leu Pro Thr Phe Ser Gln 740
745 750 Glu Asp 72238DNAArtificial
SequenceNucleotide sequence encoding ssNPP1 protein 7atgggtgtac
tgctcacaca gaggacgctg ctcagtctgg tccttgcact cctgtttcca 60agcatggcga
gcatggcagg gtttgaaacg cctcctacac tcttgttttc tttggatgga 120ttcagggcag
aatatttgca cacttggggt ggacttcttc ctgttattag caaactcaaa 180aaatgtggaa
catatactaa aaacatgaga ccggtgtatc caacaaaaac tttccccaat 240cactacagca
ttgtcaccgg attgtatcca gaatctcatg gcataatcga caataagatg 300tatgatccca
aaatgaatgc ttccttttca cttaaaagta aagagaaatt taatccggag 360tggtacaaag
gagaaccaat ttgggtcaca gctaagtatc aaggcctcaa gtctggcaca 420tttttctggc
caggatcaga tgtggaaatt aacggaattt tcccagacat ctataaaatg 480tataatggtt
cagtgccatt tgaagaaagg attttggctg ttcttcagtg gctgcagctt 540ccaaaagatg
aaagaccaca cttttacact ttgtatttgg aagaaccaga ttcttcaggt 600cattcatatg
gaccagtcag cagtgaagtc atcaaagcct tgcagagggt tgatggtatg 660gttggtatgc
tgatggatgg tctgaaagag ctgaacttgc acagatgcct gaacctcatc 720cttatttcag
atcatggcat ggaacaaggc agttgtaaga aatacatata tctgaataag 780tatttggggg
atgttaaaaa tattaaagtt atctatggac ctgcagctcg attgagaccc 840tctgatgtcc
cagataaata ctattcattt aactatgaag gcattgcccg aaatctttct 900tgccgggaac
caaaccagca cttcaaacct tatctgaaac atttcttgcc taagcgtttg 960cactttgcta
agagtgatag aattgagccc ttgacattct atttggaccc tcagtggcaa 1020cttgcattga
atccctcaga aaggaaatat tgtggaagtg gatttcatgg ctctgacaat 1080gtgttttcaa
atatgcaagc cctctttgtt ggctatggac ctggattcaa gcatggcatt 1140gaggctgaca
cctttgaaaa cattgaagtc tataacttga tgtgtgattt gctgaatttg 1200acaccggctc
ctaataacgg aactcatgga agtcttaacc accttctgaa gaatcctgtt 1260tatacgccaa
agcatcccaa agaagtgcac cccctggtgc agtgcccctt cacaagaaac 1320cccagagata
accttggctg ctcatgtaac ccttccattt tgccgattga ggattttcaa 1380acacagttca
atctgaccgt ggcagaagag aagattatta agcatgaaac tttgccctat 1440ggaagaccta
gagttctcca gaaggaaaac accatctgtc ttctttccca gcaccagttt 1500atgagtggat
acagccaaga catcttgatg cccctttgga catcctatac cgtggacaga 1560aatgacagtt
tctctacgga agacttctcc aactgtctgt accaggactt tagaattcct 1620cttagtcctg
tccataaatg ttcattttat aaaaataaca ccaaagtgag ttacgggttc 1680ctctccccac
cacaactgaa taagaattca agtggaatat attctgaagc cttgcttact 1740acaaatatag
tgccaatgta ccagagtttt caagttatat ggcgctactt tcatgacacc 1800ctcttgcgaa
agtatgcaga agaaagaaat ggtgtcaatg tcgtcagtgg tcctgtgttt 1860gactttgatt
atgatggacg ttgtgattcc ttggagaatt tgaggcaaaa aagaagagtc 1920atccgtaacc
aagaaatttt gattccaact catttcttca ttgtgctgac aagctgtaaa 1980gatacatctc
agacgccttt gcactgtgaa aacctggaca ccttggcttt cattttgcct 2040cacaggactg
ataacagcga gagctgtgtg catgggaagc atgactcctc atgggttgaa 2100gaattgttga
tgttgcacag agcacggatc acagacgtcg agcacatcac tggactcagc 2160ttttatcaac
aaagaaaaga gccagtttca gacattttga agttgaaaac acatttgcca 2220acctttagcc
aagaagat
22388746PRTArtificial sequencessNPP1 protein 8Ile Gly Val Leu Leu Thr Gln
Arg Thr Leu Leu Ser Leu Val Leu Ala 1 5
10 15 Leu Leu Phe Pro Ser Met Ala Ser Met Ala Gly
Phe Glu Thr Pro Pro 20 25
30 Thr Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala Glu Tyr Leu His
Thr 35 40 45 Trp
Gly Gly Leu Leu Pro Val Ile Ser Lys Leu Lys Lys Cys Gly Thr 50
55 60 Tyr Thr Lys Asn Met Arg
Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn 65 70
75 80 His Tyr Ser Ile Val Thr Gly Leu Tyr Pro Glu
Ser His Gly Ile Ile 85 90
95 Asp Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys
100 105 110 Ser Lys
Glu Lys Phe Asn Pro Glu Trp Tyr Lys Gly Glu Pro Ile Trp 115
120 125 Val Thr Ala Lys Tyr Gln Gly
Leu Lys Ser Gly Thr Phe Phe Trp Pro 130 135
140 Gly Ser Asp Val Glu Ile Asn Gly Ile Phe Pro Asp
Ile Tyr Lys Met 145 150 155
160 Tyr Asn Gly Ser Val Pro Phe Glu Glu Arg Ile Leu Ala Val Leu Gln
165 170 175 Trp Leu Gln
Leu Pro Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr 180
185 190 Leu Glu Glu Pro Asp Ser Ser Gly
His Ser Tyr Gly Pro Val Ser Ser 195 200
205 Glu Val Ile Lys Ala Leu Gln Arg Val Asp Gly Met Val
Gly Met Leu 210 215 220
Met Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu Asn Leu Ile 225
230 235 240 Leu Ile Ser Asp
His Gly Met Glu Gln Gly Ser Cys Lys Lys Tyr Ile 245
250 255 Tyr Leu Asn Lys Tyr Leu Gly Asp Val
Lys Asn Ile Lys Val Ile Tyr 260 265
270 Gly Pro Ala Ala Arg Leu Arg Pro Ser Asp Val Pro Asp Lys
Tyr Tyr 275 280 285
Ser Phe Asn Tyr Glu Gly Ile Ala Arg Asn Leu Ser Cys Arg Glu Pro 290
295 300 Asn Gln His Phe Lys
Pro Tyr Leu Lys His Phe Leu Pro Lys Arg Leu 305 310
315 320 His Phe Ala Lys Ser Asp Arg Ile Glu Pro
Leu Thr Phe Tyr Leu Asp 325 330
335 Pro Gln Trp Gln Leu Ala Leu Asn Pro Ser Glu Arg Lys Tyr Cys
Gly 340 345 350 Ser
Gly Phe His Gly Ser Asp Asn Val Phe Ser Asn Met Gln Ala Leu 355
360 365 Phe Val Gly Tyr Gly Pro
Gly Phe Lys His Gly Ile Glu Ala Asp Thr 370 375
380 Phe Glu Asn Ile Glu Val Tyr Asn Leu Met Cys
Asp Leu Leu Asn Leu 385 390 395
400 Thr Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu Asn His Leu Leu
405 410 415 Lys Asn
Pro Val Tyr Thr Pro Lys His Pro Lys Glu Val His Pro Leu 420
425 430 Val Gln Cys Pro Phe Thr Arg
Asn Pro Arg Asp Asn Leu Gly Cys Ser 435 440
445 Cys Asn Pro Ser Ile Leu Pro Ile Glu Asp Phe Gln
Thr Gln Phe Asn 450 455 460
Leu Thr Val Ala Glu Glu Lys Ile Ile Lys His Glu Thr Leu Pro Tyr 465
470 475 480 Gly Arg Pro
Arg Val Leu Gln Lys Glu Asn Thr Ile Cys Leu Leu Ser 485
490 495 Gln His Gln Phe Met Ser Gly Tyr
Ser Gln Asp Ile Leu Met Pro Leu 500 505
510 Trp Thr Ser Tyr Thr Val Asp Arg Asn Asp Ser Phe Ser
Thr Glu Asp 515 520 525
Phe Ser Asn Cys Leu Tyr Gln Asp Phe Arg Ile Pro Leu Ser Pro Val 530
535 540 His Lys Cys Ser
Phe Tyr Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe 545 550
555 560 Leu Ser Pro Pro Gln Leu Asn Lys Asn
Ser Ser Gly Ile Tyr Ser Glu 565 570
575 Ala Leu Leu Thr Thr Asn Ile Val Pro Met Tyr Gln Ser Phe
Gln Val 580 585 590
Ile Trp Arg Tyr Phe His Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu
595 600 605 Arg Asn Gly Val
Asn Val Val Ser Gly Pro Val Phe Asp Phe Asp Tyr 610
615 620 Asp Gly Arg Cys Asp Ser Leu Glu
Asn Leu Arg Gln Lys Arg Arg Val 625 630
635 640 Ile Arg Asn Gln Glu Ile Leu Ile Pro Thr His Phe
Phe Ile Val Leu 645 650
655 Thr Ser Cys Lys Asp Thr Ser Gln Thr Pro Leu His Cys Glu Asn Leu
660 665 670 Asp Thr Leu
Ala Phe Ile Leu Pro His Arg Thr Asp Asn Ser Glu Ser 675
680 685 Cys Val His Gly Lys His Asp Ser
Ser Trp Val Glu Glu Leu Leu Met 690 695
700 Leu His Arg Ala Arg Ile Thr Asp Val Glu His Ile Thr
Gly Leu Ser 705 710 715
720 Phe Tyr Gln Gln Arg Lys Glu Pro Val Ser Asp Ile Leu Lys Leu Lys
725 730 735 Thr His Leu Pro
Thr Phe Ser Gln Glu Asp 740 745
92556DNAArtificial sequenceNucleotide sequence encoding sNPP1 9atgggtgtac
tgctcacaca gaggacgctg ctcagtctgg tccttgcact cctgtttcca 60agcatggcga
gcatgccaag ttgtgccaaa gaagttaaaa gttgcaaagg tcgctgtttc 120gagagaacat
ttgggaactg tcgctgtgat gctgcctgtg ttgagcttgg aaactgctgt 180ttggattacc
aggagacgtg catagaacca gaacatatat ggacttgcaa caaattcagg 240tgtggtgaga
aaagattgac cagaagcctc tgtgcctgtt cagatgattg caaggacaag 300ggcgactgct
gcatcaacta cagttcagtg tgtcaaggtg agaaaagttg ggtggaagaa 360ccatgtgaga
gcattaatga gccacagtgc ccagcagggt ttgaaacgcc tcctacactc 420ttgttttctt
tggatggatt cagggcagaa tatttgcaca cttggggtgg acttcttcct 480gttattagca
aactcaaaaa atgtggaaca tatactaaaa acatgagacc ggtgtatcca 540acaaaaactt
tccccaatca ctacagcatt gtcaccggat tgtatccaga atctcatggc 600ataatcgaca
ataagatgta tgatcccaaa atgaatgctt ccttttcact taaaagtaaa 660gagaaattta
atccggagtg gtacaaagga gaaccaattt gggtcacagc taagtatcaa 720ggcctcaagt
ctggcacatt tttctggcca ggatcagatg tggaaattaa cggaattttc 780ccagacatct
ataaaatgta taatggttca gtgccatttg aagaaaggat tttggctgtt 840cttcagtggc
tgcagcttcc aaaagatgaa agaccacact tttacacttt gtatttggaa 900gaaccagatt
cttcaggtca ttcatatgga ccagtcagca gtgaagtcat caaagccttg 960cagagggttg
atggtatggt tggtatgctg atggatggtc tgaaagagct gaacttgcac 1020agatgcctga
acctcatcct tatttcagat catggcatgg aacaaggcag ttgtaagaaa 1080tacatatatc
tgaataagta tttgggggat gttaaaaata ttaaagttat ctatggacct 1140gcagctcgat
tgagaccctc tgatgtccca gataaatact attcatttaa ctatgaaggc 1200attgcccgaa
atctttcttg ccgggaacca aaccagcact tcaaacctta tctgaaacat 1260ttcttgccta
agcgtttgca ctttgctaag agtgatagaa ttgagccctt gacattctat 1320ttggaccctc
agtggcaact tgcattgaat ccctcagaaa ggaaatattg tggaagtgga 1380tttcatggct
ctgacaatgt gttttcaaat atgcaagccc tctttgttgg ctatggacct 1440ggattcaagc
atggcattga ggctgacacc tttgaaaaca ttgaagtcta taacttgatg 1500tgtgatttgc
tgaatttgac accggctcct aataacggaa ctcatggaag tcttaaccac 1560cttctgaaga
atcctgttta tacgccaaag catcccaaag aagtgcaccc cctggtgcag 1620tgccccttca
caagaaaccc cagagataac cttggctgct catgtaaccc ttccattttg 1680ccgattgagg
attttcaaac acagttcaat ctgaccgtgg cagaagagaa gattattaag 1740catgaaactt
tgccctatgg aagacctaga gttctccaga aggaaaacac catctgtctt 1800ctttcccagc
accagtttat gagtggatac agccaagaca tcttgatgcc cctttggaca 1860tcctataccg
tggacagaaa tgacagtttc tctacggaag acttctccaa ctgtctgtac 1920caggacttta
gaattcctct tagtcctgtc cataaatgtt cattttataa aaataacacc 1980aaagtgagtt
acgggttcct ctccccacca caactgaata agaattcaag tggaatatat 2040tctgaagcct
tgcttactac aaatatagtg ccaatgtacc agagttttca agttatatgg 2100cgctactttc
atgacaccct cttgcgaaag tatgcagaag aaagaaatgg tgtcaatgtc 2160gtcagtggtc
ctgtgtttga ctttgattat gatggacgtt gtgattcctt ggagaatttg 2220aggcaaaaaa
gaagagtcat ccgtaaccaa gaaattttga ttccaactca tttcttcatt 2280gtgctgacaa
gctgtaaaga tacatctcag acgcctttgc actgtgaaaa cctggacacc 2340ttggctttca
ttttgcctca caggactgat aacagcgaga gctgtgtgca tgggaagcat 2400gactcctcat
gggttgaaga attgttgatg ttgcacagag cacggatcac agacgtcgag 2460cacatcactg
gactcagctt ttatcaacaa agaaaagagc cagtttcaga cattttgaag 2520ttgaaaacac
atttgccaac ctttagccaa gaagat
255610852PRTArtificial seqeuncesNPP1 protein 10Ile Gly Val Leu Leu Thr
Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1 5
10 15 Leu Leu Phe Pro Ser Met Ala Ser Met Pro Ser
Cys Ala Lys Glu Val 20 25
30 Lys Ser Cys Lys Gly Arg Cys Phe Glu Arg Thr Phe Gly Asn Cys
Arg 35 40 45 Cys
Asp Ala Ala Cys Val Glu Leu Gly Asn Cys Cys Leu Asp Tyr Gln 50
55 60 Glu Thr Cys Ile Glu Pro
Glu His Ile Trp Thr Cys Asn Lys Phe Arg 65 70
75 80 Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys
Ala Cys Ser Asp Asp 85 90
95 Cys Lys Asp Lys Gly Asp Cys Cys Ile Asn Tyr Ser Ser Val Cys Gln
100 105 110 Gly Glu
Lys Ser Trp Val Glu Glu Pro Cys Glu Ser Ile Asn Glu Pro 115
120 125 Gln Cys Pro Ala Gly Phe Glu
Thr Pro Pro Thr Leu Leu Phe Ser Leu 130 135
140 Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly
Gly Leu Leu Pro 145 150 155
160 Val Ile Ser Lys Leu Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg
165 170 175 Pro Val Tyr
Pro Thr Lys Thr Phe Pro Asn His Tyr Ser Ile Val Thr 180
185 190 Gly Leu Tyr Pro Glu Ser His Gly
Ile Ile Asp Asn Lys Met Tyr Asp 195 200
205 Pro Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu
Lys Phe Asn 210 215 220
Pro Glu Trp Tyr Lys Gly Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln 225
230 235 240 Gly Leu Lys Ser
Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu Ile 245
250 255 Asn Gly Ile Phe Pro Asp Ile Tyr Lys
Met Tyr Asn Gly Ser Val Pro 260 265
270 Phe Glu Glu Arg Ile Leu Ala Val Leu Gln Trp Leu Gln Leu
Pro Lys 275 280 285
Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp Ser 290
295 300 Ser Gly His Ser Tyr
Gly Pro Val Ser Ser Glu Val Ile Lys Ala Leu 305 310
315 320 Gln Arg Val Asp Gly Met Val Gly Met Leu
Met Asp Gly Leu Lys Glu 325 330
335 Leu Asn Leu His Arg Cys Leu Asn Leu Ile Leu Ile Ser Asp His
Gly 340 345 350 Met
Glu Gln Gly Ser Cys Lys Lys Tyr Ile Tyr Leu Asn Lys Tyr Leu 355
360 365 Gly Asp Val Lys Asn Ile
Lys Val Ile Tyr Gly Pro Ala Ala Arg Leu 370 375
380 Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser
Phe Asn Tyr Glu Gly 385 390 395
400 Ile Ala Arg Asn Leu Ser Cys Arg Glu Pro Asn Gln His Phe Lys Pro
405 410 415 Tyr Leu
Lys His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser Asp 420
425 430 Arg Ile Glu Pro Leu Thr Phe
Tyr Leu Asp Pro Gln Trp Gln Leu Ala 435 440
445 Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly
Phe His Gly Ser 450 455 460
Asp Asn Val Phe Ser Asn Met Gln Ala Leu Phe Val Gly Tyr Gly Pro 465
470 475 480 Gly Phe Lys
His Gly Ile Glu Ala Asp Thr Phe Glu Asn Ile Glu Val 485
490 495 Tyr Asn Leu Met Cys Asp Leu Leu
Asn Leu Thr Pro Ala Pro Asn Asn 500 505
510 Gly Thr His Gly Ser Leu Asn His Leu Leu Lys Asn Pro
Val Tyr Thr 515 520 525
Pro Lys His Pro Lys Glu Val His Pro Leu Val Gln Cys Pro Phe Thr 530
535 540 Arg Asn Pro Arg
Asp Asn Leu Gly Cys Ser Cys Asn Pro Ser Ile Leu 545 550
555 560 Pro Ile Glu Asp Phe Gln Thr Gln Phe
Asn Leu Thr Val Ala Glu Glu 565 570
575 Lys Ile Ile Lys His Glu Thr Leu Pro Tyr Gly Arg Pro Arg
Val Leu 580 585 590
Gln Lys Glu Asn Thr Ile Cys Leu Leu Ser Gln His Gln Phe Met Ser
595 600 605 Gly Tyr Ser Gln
Asp Ile Leu Met Pro Leu Trp Thr Ser Tyr Thr Val 610
615 620 Asp Arg Asn Asp Ser Phe Ser Thr
Glu Asp Phe Ser Asn Cys Leu Tyr 625 630
635 640 Gln Asp Phe Arg Ile Pro Leu Ser Pro Val His Lys
Cys Ser Phe Tyr 645 650
655 Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe Leu Ser Pro Pro Gln Leu
660 665 670 Asn Lys Asn
Ser Ser Gly Ile Tyr Ser Glu Ala Leu Leu Thr Thr Asn 675
680 685 Ile Val Pro Met Tyr Gln Ser Phe
Gln Val Ile Trp Arg Tyr Phe His 690 695
700 Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu Arg Asn Gly
Val Asn Val 705 710 715
720 Val Ser Gly Pro Val Phe Asp Phe Asp Tyr Asp Gly Arg Cys Asp Ser
725 730 735 Leu Glu Asn Leu
Arg Gln Lys Arg Arg Val Ile Arg Asn Gln Glu Ile 740
745 750 Leu Ile Pro Thr His Phe Phe Ile Val
Leu Thr Ser Cys Lys Asp Thr 755 760
765 Ser Gln Thr Pro Leu His Cys Glu Asn Leu Asp Thr Leu Ala
Phe Ile 770 775 780
Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cys Val His Gly Lys His 785
790 795 800 Asp Ser Ser Trp Val
Glu Glu Leu Leu Met Leu His Arg Ala Arg Ile 805
810 815 Thr Asp Val Glu His Ile Thr Gly Leu Ser
Phe Tyr Gln Gln Arg Lys 820 825
830 Glu Pro Val Ser Asp Ile Leu Lys Leu Lys Thr His Leu Pro Thr
Phe 835 840 845 Ser
Gln Glu Asp 850 112580DNAArtificial sequenceNucleotide
sequence encoding a TAGsNPP1 fusion protein 11atgggtgtac tgctcacaca
gaggacgctg ctcagtctgg tccttgcact cctgtttcca 60agcatggcga gcatggatga
cgatgatgac gacgatgacc caagttgtgc caaagaagtt 120aaaagttgca aaggtcgctg
tttcgagaga acatttggga actgtcgctg tgatgctgcc 180tgtgttgagc ttggaaactg
ctgtttggat taccaggaga cgtgcataga accagaacat 240atatggactt gcaacaaatt
caggtgtggt gagaaaagat tgaccagaag cctctgtgcc 300tgttcagatg attgcaagga
caagggcgac tgctgcatca actacagttc agtgtgtcaa 360ggtgagaaaa gttgggtgga
agaaccatgt gagagcatta atgagccaca gtgcccagca 420gggtttgaaa cgcctcctac
actcttgttt tctttggatg gattcagggc agaatatttg 480cacacttggg gtggacttct
tcctgttatt agcaaactca aaaaatgtgg aacatatact 540aaaaacatga gaccggtgta
tccaacaaaa actttcccca atcactacag cattgtcacc 600ggattgtatc cagaatctca
tggcataatc gacaataaga tgtatgatcc caaaatgaat 660gcttcctttt cacttaaaag
taaagagaaa tttaatccgg agtggtacaa aggagaacca 720atttgggtca cagctaagta
tcaaggcctc aagtctggca catttttctg gccaggatca 780gatgtggaaa ttaacggaat
tttcccagac atctataaaa tgtataatgg ttcagtgcca 840tttgaagaaa ggattttggc
tgttcttcag tggctgcagc ttccaaaaga tgaaagacca 900cacttttaca ctttgtattt
ggaagaacca gattcttcag gtcattcata tggaccagtc 960agcagtgaag tcatcaaagc
cttgcagagg gttgatggta tggttggtat gctgatggat 1020ggtctgaaag agctgaactt
gcacagatgc ctgaacctca tccttatttc agatcatggc 1080atggaacaag gcagttgtaa
gaaatacata tatctgaata agtatttggg ggatgttaaa 1140aatattaaag ttatctatgg
acctgcagct cgattgagac cctctgatgt cccagataaa 1200tactattcat ttaactatga
aggcattgcc cgaaatcttt cttgccggga accaaaccag 1260cacttcaaac cttatctgaa
acatttcttg cctaagcgtt tgcactttgc taagagtgat 1320agaattgagc ccttgacatt
ctatttggac cctcagtggc aacttgcatt gaatccctca 1380gaaaggaaat attgtggaag
tggatttcat ggctctgaca atgtgttttc aaatatgcaa 1440gccctctttg ttggctatgg
acctggattc aagcatggca ttgaggctga cacctttgaa 1500aacattgaag tctataactt
gatgtgtgat ttgctgaatt tgacaccggc tcctaataac 1560ggaactcatg gaagtcttaa
ccaccttctg aagaatcctg tttatacgcc aaagcatccc 1620aaagaagtgc accccctggt
gcagtgcccc ttcacaagaa accccagaga taaccttggc 1680tgctcatgta acccttccat
tttgccgatt gaggattttc aaacacagtt caatctgacc 1740gtggcagaag agaagattat
taagcatgaa actttgccct atggaagacc tagagttctc 1800cagaaggaaa acaccatctg
tcttctttcc cagcaccagt ttatgagtgg atacagccaa 1860gacatcttga tgcccctttg
gacatcctat accgtggaca gaaatgacag tttctctacg 1920gaagacttct ccaactgtct
gtaccaggac tttagaattc ctcttagtcc tgtccataaa 1980tgttcatttt ataaaaataa
caccaaagtg agttacgggt tcctctcccc accacaactg 2040aataagaatt caagtggaat
atattctgaa gccttgctta ctacaaatat agtgccaatg 2100taccagagtt ttcaagttat
atggcgctac tttcatgaca ccctcttgcg aaagtatgca 2160gaagaaagaa atggtgtcaa
tgtcgtcagt ggtcctgtgt ttgactttga ttatgatgga 2220cgttgtgatt ccttggagaa
tttgaggcaa aaaagaagag tcatccgtaa ccaagaaatt 2280ttgattccaa ctcatttctt
cattgtgctg acaagctgta aagatacatc tcagacgcct 2340ttgcactgtg aaaacctgga
caccttggct ttcattttgc ctcacaggac tgataacagc 2400gagagctgtg tgcatgggaa
gcatgactcc tcatgggttg aagaattgtt gatgttgcac 2460agagcacgga tcacagacgt
cgagcacatc actggactca gcttttatca acaaagaaaa 2520gagccagttt cagacatttt
gaagttgaaa acacatttgc caacctttag ccaagaagat 258012860PRTArtificial
sequenceD8 TAGsNPP1 fusion protein 12Ile Gly Val Leu Leu Thr Gln Arg Thr
Leu Leu Ser Leu Val Leu Ala 1 5 10
15 Leu Leu Phe Pro Ser Met Ala Ser Met Asp Asp Asp Asp Asp
Asp Asp 20 25 30
Asp Pro Ser Cys Ala Lys Glu Val Lys Ser Cys Lys Gly Arg Cys Phe
35 40 45 Glu Arg Thr Phe
Gly Asn Cys Arg Cys Asp Ala Ala Cys Val Glu Leu 50
55 60 Gly Asn Cys Cys Leu Asp Tyr Gln
Glu Thr Cys Ile Glu Pro Glu His 65 70
75 80 Ile Trp Thr Cys Asn Lys Phe Arg Cys Gly Glu Lys
Arg Leu Thr Arg 85 90
95 Ser Leu Cys Ala Cys Ser Asp Asp Cys Lys Asp Lys Gly Asp Cys Cys
100 105 110 Ile Asn Tyr
Ser Ser Val Cys Gln Gly Glu Lys Ser Trp Val Glu Glu 115
120 125 Pro Cys Glu Ser Ile Asn Glu Pro
Gln Cys Pro Ala Gly Phe Glu Thr 130 135
140 Pro Pro Thr Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala
Glu Tyr Leu 145 150 155
160 His Thr Trp Gly Gly Leu Leu Pro Val Ile Ser Lys Leu Lys Lys Cys
165 170 175 Gly Thr Tyr Thr
Lys Asn Met Arg Pro Val Tyr Pro Thr Lys Thr Phe 180
185 190 Pro Asn His Tyr Ser Ile Val Thr Gly
Leu Tyr Pro Glu Ser His Gly 195 200
205 Ile Ile Asp Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser
Phe Ser 210 215 220
Leu Lys Ser Lys Glu Lys Phe Asn Pro Glu Trp Tyr Lys Gly Glu Pro 225
230 235 240 Ile Trp Val Thr Ala
Lys Tyr Gln Gly Leu Lys Ser Gly Thr Phe Phe 245
250 255 Trp Pro Gly Ser Asp Val Glu Ile Asn Gly
Ile Phe Pro Asp Ile Tyr 260 265
270 Lys Met Tyr Asn Gly Ser Val Pro Phe Glu Glu Arg Ile Leu Ala
Val 275 280 285 Leu
Gln Trp Leu Gln Leu Pro Lys Asp Glu Arg Pro His Phe Tyr Thr 290
295 300 Leu Tyr Leu Glu Glu Pro
Asp Ser Ser Gly His Ser Tyr Gly Pro Val 305 310
315 320 Ser Ser Glu Val Ile Lys Ala Leu Gln Arg Val
Asp Gly Met Val Gly 325 330
335 Met Leu Met Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu Asn
340 345 350 Leu Ile
Leu Ile Ser Asp His Gly Met Glu Gln Gly Ser Cys Lys Lys 355
360 365 Tyr Ile Tyr Leu Asn Lys Tyr
Leu Gly Asp Val Lys Asn Ile Lys Val 370 375
380 Ile Tyr Gly Pro Ala Ala Arg Leu Arg Pro Ser Asp
Val Pro Asp Lys 385 390 395
400 Tyr Tyr Ser Phe Asn Tyr Glu Gly Ile Ala Arg Asn Leu Ser Cys Arg
405 410 415 Glu Pro Asn
Gln His Phe Lys Pro Tyr Leu Lys His Phe Leu Pro Lys 420
425 430 Arg Leu His Phe Ala Lys Ser Asp
Arg Ile Glu Pro Leu Thr Phe Tyr 435 440
445 Leu Asp Pro Gln Trp Gln Leu Ala Leu Asn Pro Ser Glu
Arg Lys Tyr 450 455 460
Cys Gly Ser Gly Phe His Gly Ser Asp Asn Val Phe Ser Asn Met Gln 465
470 475 480 Ala Leu Phe Val
Gly Tyr Gly Pro Gly Phe Lys His Gly Ile Glu Ala 485
490 495 Asp Thr Phe Glu Asn Ile Glu Val Tyr
Asn Leu Met Cys Asp Leu Leu 500 505
510 Asn Leu Thr Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu
Asn His 515 520 525
Leu Leu Lys Asn Pro Val Tyr Thr Pro Lys His Pro Lys Glu Val His 530
535 540 Pro Leu Val Gln Cys
Pro Phe Thr Arg Asn Pro Arg Asp Asn Leu Gly 545 550
555 560 Cys Ser Cys Asn Pro Ser Ile Leu Pro Ile
Glu Asp Phe Gln Thr Gln 565 570
575 Phe Asn Leu Thr Val Ala Glu Glu Lys Ile Ile Lys His Glu Thr
Leu 580 585 590 Pro
Tyr Gly Arg Pro Arg Val Leu Gln Lys Glu Asn Thr Ile Cys Leu 595
600 605 Leu Ser Gln His Gln Phe
Met Ser Gly Tyr Ser Gln Asp Ile Leu Met 610 615
620 Pro Leu Trp Thr Ser Tyr Thr Val Asp Arg Asn
Asp Ser Phe Ser Thr 625 630 635
640 Glu Asp Phe Ser Asn Cys Leu Tyr Gln Asp Phe Arg Ile Pro Leu Ser
645 650 655 Pro Val
His Lys Cys Ser Phe Tyr Lys Asn Asn Thr Lys Val Ser Tyr 660
665 670 Gly Phe Leu Ser Pro Pro Gln
Leu Asn Lys Asn Ser Ser Gly Ile Tyr 675 680
685 Ser Glu Ala Leu Leu Thr Thr Asn Ile Val Pro Met
Tyr Gln Ser Phe 690 695 700
Gln Val Ile Trp Arg Tyr Phe His Asp Thr Leu Leu Arg Lys Tyr Ala 705
710 715 720 Glu Glu Arg
Asn Gly Val Asn Val Val Ser Gly Pro Val Phe Asp Phe 725
730 735 Asp Tyr Asp Gly Arg Cys Asp Ser
Leu Glu Asn Leu Arg Gln Lys Arg 740 745
750 Arg Val Ile Arg Asn Gln Glu Ile Leu Ile Pro Thr His
Phe Phe Ile 755 760 765
Val Leu Thr Ser Cys Lys Asp Thr Ser Gln Thr Pro Leu His Cys Glu 770
775 780 Asn Leu Asp Thr
Leu Ala Phe Ile Leu Pro His Arg Thr Asp Asn Ser 785 790
795 800 Glu Ser Cys Val His Gly Lys His Asp
Ser Ser Trp Val Glu Glu Leu 805 810
815 Leu Met Leu His Arg Ala Arg Ile Thr Asp Val Glu His Ile
Thr Gly 820 825 830
Leu Ser Phe Tyr Gln Gln Arg Lys Glu Pro Val Ser Asp Ile Leu Lys
835 840 845 Leu Lys Thr His
Leu Pro Thr Phe Ser Gln Glu Asp 850 855
860 132580DNAArtificial sequenceNucleotide sequence encoding C-terminal
D8 TAGsNPP1 protein 13gtgggtgtac tgctcacaca gaggacgctg ctcagtctgg
tccttgcact cctgtttcca 60agcatggcga gcatgccaag ttgtgccaaa gaagttaaaa
gttgcaaagg tcgctgtttc 120gagagaacat ttgggaactg tcgctgtgat gctgcctgtg
ttgagcttgg aaactgctgt 180ttggattacc aggagacgtg catagaacca gaacatatat
ggacttgcaa caaattcagg 240tgtggtgaga aaagattgac cagaagcctc tgtgcctgtt
cagatgattg caaggacaag 300ggcgactgct gcatcaacta cagttcagtg tgtcaaggtg
agaaaagttg ggtggaagaa 360ccatgtgaga gcattaatga gccacagtgc ccagcagggt
ttgaaacgcc tcctacactc 420ttgttttctt tggatggatt cagggcagaa tatttgcaca
cttggggtgg acttcttcct 480gttattagca aactcaaaaa atgtggaaca tatactaaaa
acatgagacc ggtgtatcca 540acaaaaactt tccccaatca ctacagcatt gtcaccggat
tgtatccaga atctcatggc 600ataatcgaca ataagatgta tgatcccaaa atgaatgctt
ccttttcact taaaagtaaa 660gagaaattta atccggagtg gtacaaagga gaaccaattt
gggtcacagc taagtatcaa 720ggcctcaagt ctggcacatt tttctggcca ggatcagatg
tggaaattaa cggaattttc 780ccagacatct ataaaatgta taatggttca gtgccatttg
aagaaaggat tttggctgtt 840cttcagtggc tgcagcttcc aaaagatgaa agaccacact
tttacacttt gtatttggaa 900gaaccagatt cttcaggtca ttcatatgga ccagtcagca
gtgaagtcat caaagccttg 960cagagggttg atggtatggt tggtatgctg atggatggtc
tgaaagagct gaacttgcac 1020agatgcctga acctcatcct tatttcagat catggcatgg
aacaaggcag ttgtaagaaa 1080tacatatatc tgaataagta tttgggggat gttaaaaata
ttaaagttat ctatggacct 1140gcagctcgat tgagaccctc tgatgtccca gataaatact
attcatttaa ctatgaaggc 1200attgcccgaa atctttcttg ccgggaacca aaccagcact
tcaaacctta tctgaaacat 1260ttcttgccta agcgtttgca ctttgctaag agtgatagaa
ttgagccctt gacattctat 1320ttggaccctc agtggcaact tgcattgaat ccctcagaaa
ggaaatattg tggaagtgga 1380tttcatggct ctgacaatgt gttttcaaat atgcaagccc
tctttgttgg ctatggacct 1440ggattcaagc atggcattga ggctgacacc tttgaaaaca
ttgaagtcta taacttgatg 1500tgtgatttgc tgaatttgac accggctcct aataacggaa
ctcatggaag tcttaaccac 1560cttctgaaga atcctgttta tacgccaaag catcccaaag
aagtgcaccc cctggtgcag 1620tgccccttca caagaaaccc cagagataac cttggctgct
catgtaaccc ttccattttg 1680ccgattgagg attttcaaac acagttcaat ctgaccgtgg
cagaagagaa gattattaag 1740catgaaactt tgccctatgg aagacctaga gttctccaga
aggaaaacac catctgtctt 1800ctttcccagc accagtttat gagtggatac agccaagaca
tcttgatgcc cctttggaca 1860tcctataccg tggacagaaa tgacagtttc tctacggaag
acttctccaa ctgtctgtac 1920caggacttta gaattcctct tagtcctgtc cataaatgtt
cattttataa aaataacacc 1980aaagtgagtt acgggttcct ctccccacca caactgaata
agaattcaag tggaatatat 2040tctgaagcct tgcttactac aaatatagtg ccaatgtacc
agagttttca agttatatgg 2100cgctactttc atgacaccct cttgcgaaag tatgcagaag
aaagaaatgg tgtcaatgtc 2160gtcagtggtc ctgtgtttga ctttgattat gatggacgtt
gtgattcctt ggagaatttg 2220aggcaaaaaa gaagagtcat ccgtaaccaa gaaattttga
ttccaactca tttcttcatt 2280gtgctgacaa gctgtaaaga tacatctcag acgcctttgc
actgtgaaaa cctggacacc 2340ttggctttca ttttgcctca caggactgat aacagcgaga
gctgtgtgca tgggaagcat 2400gactcctcat gggttgaaga attgttgatg ttgcacagag
cacggatcac agacgtcgag 2460cacatcactg gactcagctt ttatcaacaa agaaaagagc
cagtttcaga cattttgaag 2520ttgaaaacac atttgccaac ctttagccaa gaagatgatg
acgatgatga cgacgattga 258014859PRTArtificial sequenceC-terminal D8
TAGsNPP1 fusion protein 14Gln Gly Val Leu Leu Thr Gln Arg Thr Leu Leu Ser
Leu Val Leu Ala 1 5 10
15 Leu Leu Phe Pro Ser Met Ala Ser Met Pro Ser Cys Ala Lys Glu Val
20 25 30 Lys Ser Cys
Lys Gly Arg Cys Phe Glu Arg Thr Phe Gly Asn Cys Arg 35
40 45 Cys Asp Ala Ala Cys Val Glu Leu
Gly Asn Cys Cys Leu Asp Tyr Gln 50 55
60 Glu Thr Cys Ile Glu Pro Glu His Ile Trp Thr Cys Asn
Lys Phe Arg 65 70 75
80 Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys Ala Cys Ser Asp Asp
85 90 95 Cys Lys Asp Lys
Gly Asp Cys Cys Ile Asn Tyr Ser Ser Val Cys Gln 100
105 110 Gly Glu Lys Ser Trp Val Glu Glu Pro
Cys Glu Ser Ile Asn Glu Pro 115 120
125 Gln Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe
Ser Leu 130 135 140
Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu Pro 145
150 155 160 Val Ile Ser Lys Leu
Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg 165
170 175 Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn
His Tyr Ser Ile Val Thr 180 185
190 Gly Leu Tyr Pro Glu Ser His Gly Ile Ile Asp Asn Lys Met Tyr
Asp 195 200 205 Pro
Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe Asn 210
215 220 Pro Glu Trp Tyr Lys Gly
Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln 225 230
235 240 Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly
Ser Asp Val Glu Ile 245 250
255 Asn Gly Ile Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly Ser Val Pro
260 265 270 Phe Glu
Glu Arg Ile Leu Ala Val Leu Gln Trp Leu Gln Leu Pro Lys 275
280 285 Asp Glu Arg Pro His Phe Tyr
Thr Leu Tyr Leu Glu Glu Pro Asp Ser 290 295
300 Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val
Ile Lys Ala Leu 305 310 315
320 Gln Arg Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys Glu
325 330 335 Leu Asn Leu
His Arg Cys Leu Asn Leu Ile Leu Ile Ser Asp His Gly 340
345 350 Met Glu Gln Gly Ser Cys Lys Lys
Tyr Ile Tyr Leu Asn Lys Tyr Leu 355 360
365 Gly Asp Val Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala
Ala Arg Leu 370 375 380
Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly 385
390 395 400 Ile Ala Arg Asn
Leu Ser Cys Arg Glu Pro Asn Gln His Phe Lys Pro 405
410 415 Tyr Leu Lys His Phe Leu Pro Lys Arg
Leu His Phe Ala Lys Ser Asp 420 425
430 Arg Ile Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln
Leu Ala 435 440 445
Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly Ser 450
455 460 Asp Asn Val Phe Ser
Asn Met Gln Ala Leu Phe Val Gly Tyr Gly Pro 465 470
475 480 Gly Phe Lys His Gly Ile Glu Ala Asp Thr
Phe Glu Asn Ile Glu Val 485 490
495 Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn
Asn 500 505 510 Gly
Thr His Gly Ser Leu Asn His Leu Leu Lys Asn Pro Val Tyr Thr 515
520 525 Pro Lys His Pro Lys Glu
Val His Pro Leu Val Gln Cys Pro Phe Thr 530 535
540 Arg Asn Pro Arg Asp Asn Leu Gly Cys Ser Cys
Asn Pro Ser Ile Leu 545 550 555
560 Pro Ile Glu Asp Phe Gln Thr Gln Phe Asn Leu Thr Val Ala Glu Glu
565 570 575 Lys Ile
Ile Lys His Glu Thr Leu Pro Tyr Gly Arg Pro Arg Val Leu 580
585 590 Gln Lys Glu Asn Thr Ile Cys
Leu Leu Ser Gln His Gln Phe Met Ser 595 600
605 Gly Tyr Ser Gln Asp Ile Leu Met Pro Leu Trp Thr
Ser Tyr Thr Val 610 615 620
Asp Arg Asn Asp Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu Tyr 625
630 635 640 Gln Asp Phe
Arg Ile Pro Leu Ser Pro Val His Lys Cys Ser Phe Tyr 645
650 655 Lys Asn Asn Thr Lys Val Ser Tyr
Gly Phe Leu Ser Pro Pro Gln Leu 660 665
670 Asn Lys Asn Ser Ser Gly Ile Tyr Ser Glu Ala Leu Leu
Thr Thr Asn 675 680 685
Ile Val Pro Met Tyr Gln Ser Phe Gln Val Ile Trp Arg Tyr Phe His 690
695 700 Asp Thr Leu Leu
Arg Lys Tyr Ala Glu Glu Arg Asn Gly Val Asn Val 705 710
715 720 Val Ser Gly Pro Val Phe Asp Phe Asp
Tyr Asp Gly Arg Cys Asp Ser 725 730
735 Leu Glu Asn Leu Arg Gln Lys Arg Arg Val Ile Arg Asn Gln
Glu Ile 740 745 750
Leu Ile Pro Thr His Phe Phe Ile Val Leu Thr Ser Cys Lys Asp Thr
755 760 765 Ser Gln Thr Pro
Leu His Cys Glu Asn Leu Asp Thr Leu Ala Phe Ile 770
775 780 Leu Pro His Arg Thr Asp Asn Ser
Glu Ser Cys Val His Gly Lys His 785 790
795 800 Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu His
Arg Ala Arg Ile 805 810
815 Thr Asp Val Glu His Ile Thr Gly Leu Ser Phe Tyr Gln Gln Arg Lys
820 825 830 Glu Pro Val
Ser Asp Ile Leu Lys Leu Lys Thr His Leu Pro Thr Phe 835
840 845 Ser Gln Glu Asp Asp Asp Asp Asp
Asp Asp Asp 850 855 1510PRTArtificial
sequenceLinker peptide 15Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 1
5 10 16232PRTArtificial sequenceFc amino acid
sequence 16Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro
Ala 1 5 10 15 Pro
Glu Ala Ala Gly Ala Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
20 25 30 Lys Asp Thr Leu Met
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val 35
40 45 Val Asp Val Ser His Glu Asp Pro Glu
Val Lys Phe Asn Trp Tyr Val 50 55
60 Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg
Glu Glu Gln 65 70 75
80 Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
85 90 95 Asp Trp Leu Asn
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 100
105 110 Leu Pro Ala Pro Ile Glu Lys Thr Ile
Ser Lys Ala Lys Gly Gln Pro 115 120
125 Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu
Met Thr 130 135 140
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser 145
150 155 160 Asp Ile Ala Val Glu
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr 165
170 175 Lys Thr Thr Pro Pro Val Leu Asp Ser Asp
Gly Ser Phe Phe Leu Tyr 180 185
190 Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val
Phe 195 200 205 Ser
Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys 210
215 220 Ser Leu Ser Leu Ser Pro
Gly Lys 225 230 17662PRTArtificial
sequenceN-terminus Fc-D8 TAGsssNPP1 fusion protein 17Ile Gly Val Leu Leu
Thr Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1 5
10 15 Leu Leu Phe Pro Ser Met Ala Ser Met Glu
Pro Lys Ser Cys Asp Lys 20 25
30 Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Ala Ala Gly Ala
Pro 35 40 45 Ser
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser 50
55 60 Arg Thr Pro Glu Val Thr
Cys Val Val Val Asp Val Ser His Glu Asp 65 70
75 80 Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly
Val Glu Val His Asn 85 90
95 Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
100 105 110 Val Ser
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu 115
120 125 Tyr Lys Cys Lys Val Ser Asn
Lys Ala Leu Pro Ala Pro Ile Glu Lys 130 135
140 Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
Gln Val Tyr Thr 145 150 155
160 Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr
165 170 175 Cys Leu Val
Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu 180
185 190 Ser Asn Gly Gln Pro Glu Asn Asn
Tyr Lys Thr Thr Pro Pro Val Leu 195 200
205 Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
Val Asp Lys 210 215 220
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu 225
230 235 240 Ala Leu His Asn
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly 245
250 255 Lys Asp Asp Asp Asp Asp Asp Asp Asp
Gly Gly Gly Gly Ser Gly Gly 260 265
270 Gly Gly Ser Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe
Ser Leu 275 280 285
Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu Pro 290
295 300 Val Ile Ser Lys Leu
Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg 305 310
315 320 Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn
His Tyr Ser Ile Val Thr 325 330
335 Gly Leu Tyr Pro Glu Ser His Gly Ile Ile Asp Asn Lys Met Tyr
Asp 340 345 350 Pro
Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe Asn 355
360 365 Pro Glu Trp Tyr Lys Gly
Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln 370 375
380 Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly
Ser Asp Val Glu Ile 385 390 395
400 Asn Gly Ile Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly Ser Val Pro
405 410 415 Phe Glu
Glu Arg Ile Leu Ala Val Leu Gln Trp Leu Gln Leu Pro Lys 420
425 430 Asp Glu Arg Pro His Phe Tyr
Thr Leu Tyr Leu Glu Glu Pro Asp Ser 435 440
445 Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val
Ile Lys Ala Leu 450 455 460
Gln Arg Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys Glu 465
470 475 480 Leu Asn Leu
His Arg Cys Leu Asn Leu Ile Leu Ile Ser Asp His Gly 485
490 495 Met Glu Gln Gly Ser Cys Lys Lys
Tyr Ile Tyr Leu Asn Lys Tyr Leu 500 505
510 Gly Asp Val Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala
Ala Arg Leu 515 520 525
Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly 530
535 540 Ile Ala Arg Asn
Leu Ser Cys Arg Glu Pro Asn Gln His Phe Lys Pro 545 550
555 560 Tyr Leu Lys His Phe Leu Pro Lys Arg
Leu His Phe Ala Lys Ser Asp 565 570
575 Arg Ile Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln
Leu Ala 580 585 590
Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly Ser
595 600 605 Asp Asn Val Phe
Ser Asn Met Gln Ala Leu Phe Val Gly Tyr Gly Pro 610
615 620 Gly Phe Lys His Gly Ile Glu Ala
Asp Thr Phe Glu Asn Ile Glu Val 625 630
635 640 Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr Pro
Ala Pro Asn Asn 645 650
655 Gly Thr His Gly Ser Leu 660
18662PRTArtificial sequenceC-termininal Fc-D8 TAGsssNPP1 fusion protein
18Ile Gly Val Leu Leu Thr Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1
5 10 15 Leu Leu Phe Pro
Ser Met Ala Ser Met Glu Pro Lys Ser Cys Asp Lys 20
25 30 Thr His Thr Cys Pro Pro Cys Pro Ala
Pro Glu Ala Ala Gly Ala Pro 35 40
45 Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
Ile Ser 50 55 60
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp 65
70 75 80 Pro Glu Val Lys Phe
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn 85
90 95 Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr
Asn Ser Thr Tyr Arg Val 100 105
110 Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
Glu 115 120 125 Tyr
Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys 130
135 140 Thr Ile Ser Lys Ala Lys
Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr 145 150
155 160 Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn
Gln Val Ser Leu Thr 165 170
175 Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
180 185 190 Ser Asn
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu 195
200 205 Asp Ser Asp Gly Ser Phe Phe
Leu Tyr Ser Lys Leu Thr Val Asp Lys 210 215
220 Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser
Val Met His Glu 225 230 235
240 Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
245 250 255 Lys Ala Gly
Phe Glu Thr Pro Pro Thr Leu Leu Phe Ser Leu Asp Gly 260
265 270 Phe Arg Ala Glu Tyr Leu His Thr
Trp Gly Gly Leu Leu Pro Val Ile 275 280
285 Ser Lys Leu Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met
Arg Pro Val 290 295 300
Tyr Pro Thr Lys Thr Phe Pro Asn His Tyr Ser Ile Val Thr Gly Leu 305
310 315 320 Tyr Pro Glu Ser
His Gly Ile Ile Asp Asn Lys Met Tyr Asp Pro Lys 325
330 335 Met Asn Ala Ser Phe Ser Leu Lys Ser
Lys Glu Lys Phe Asn Pro Glu 340 345
350 Trp Tyr Lys Gly Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln
Gly Leu 355 360 365
Lys Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu Ile Asn Gly 370
375 380 Ile Phe Pro Asp Ile
Tyr Lys Met Tyr Asn Gly Ser Val Pro Phe Glu 385 390
395 400 Glu Arg Ile Leu Ala Val Leu Gln Trp Leu
Gln Leu Pro Lys Asp Glu 405 410
415 Arg Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp Ser Ser
Gly 420 425 430 His
Ser Tyr Gly Pro Val Ser Ser Glu Val Ile Lys Ala Leu Gln Arg 435
440 445 Val Asp Gly Met Val Gly
Met Leu Met Asp Gly Leu Lys Glu Leu Asn 450 455
460 Leu His Arg Cys Leu Asn Leu Ile Leu Ile Ser
Asp His Gly Met Glu 465 470 475
480 Gln Gly Ser Cys Lys Lys Tyr Ile Tyr Leu Asn Lys Tyr Leu Gly Asp
485 490 495 Val Lys
Asn Ile Lys Val Ile Tyr Gly Pro Ala Ala Arg Leu Arg Pro 500
505 510 Ser Asp Val Pro Asp Lys Tyr
Tyr Ser Phe Asn Tyr Glu Gly Ile Ala 515 520
525 Arg Asn Leu Ser Cys Arg Glu Pro Asn Gln His Phe
Lys Pro Tyr Leu 530 535 540
Lys His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser Asp Arg Ile 545
550 555 560 Glu Pro Leu
Thr Phe Tyr Leu Asp Pro Gln Trp Gln Leu Ala Leu Asn 565
570 575 Pro Ser Glu Arg Lys Tyr Cys Gly
Ser Gly Phe His Gly Ser Asp Asn 580 585
590 Val Phe Ser Asn Met Gln Ala Leu Phe Val Gly Tyr Gly
Pro Gly Phe 595 600 605
Lys His Gly Ile Glu Ala Asp Thr Phe Glu Asn Ile Glu Val Tyr Asn 610
615 620 Leu Met Cys Asp
Leu Leu Asn Leu Thr Pro Ala Pro Asn Asn Gly Thr 625 630
635 640 His Gly Ser Leu Gly Gly Gly Gly Ser
Gly Gly Gly Gly Ser Asp Asp 645 650
655 Asp Asp Asp Asp Asp Asp 660
192580DNAArtificial sequenceNucleotide sequence encoding D8 TAGsNPP1
fusion protein 19atgggtgtac tgctcacaca gaggacgctg ctcagtctgg
tccttgcact cctgtttcca 60agcatggcga gcatgccaag ttgtgccaaa gaagttaaaa
gttgcaaagg tcgctgtttc 120gagagaacat ttgggaactg tcgctgtgat gctgcctgtg
ttgagcttgg aaactgctgt 180ttggattacc aggagacgtg catagaacca gaacatatat
ggacttgcaa caaattcagg 240tgtggtgaga aaagattgac cagaagcctc tgtgcctgtt
cagatgattg caaggacaag 300ggcgactgct gcatcaacta cagttcagtg tgtcaaggtg
agaaaagttg ggtggaagaa 360ccatgtgaga gcattaatga gccacagtgc ccagcagggt
ttgaaacgcc tcctacactc 420ttgttttctt tggatggatt cagggcagaa tatttgcaca
cttggggtgg acttcttcct 480gttattagca aactcaaaaa atgtggaaca tatactaaaa
acatgagacc ggtgtatcca 540acaaaaactt tccccaatca ctacagcatt gtcaccggat
tgtatccaga atctcatggc 600ataatcgaca ataagatgta tgatcccaaa atgaatgctt
ccttttcact taaaagtaaa 660gagaaattta atccggagtg gtacaaagga gaaccaattt
gggtcacagc taagtatcaa 720ggcctcaagt ctggcacatt tttctggcca ggatcagatg
tggaaattaa cggaattttc 780ccagacatct ataaaatgta taatggttca gtgccatttg
aagaaaggat tttggctgtt 840cttcagtggc tgcagcttcc aaaagatgaa agaccacact
tttacacttt gtatttggaa 900gaaccagatt cttcaggtca ttcatatgga ccagtcagca
gtgaagtcat caaagccttg 960cagagggttg atggtatggt tggtatgctg atggatggtc
tgaaagagct gaacttgcac 1020agatgcctga acctcatcct tatttcagat catggcatgg
aacaaggcag ttgtaagaaa 1080tacatatatc tgaataagta tttgggggat gttaaaaata
ttaaagttat ctatggacct 1140gcagctcgat tgagaccctc tgatgtccca gataaatact
attcatttaa ctatgaaggc 1200attgcccgaa atctttcttg ccgggaacca aaccagcact
tcaaacctta tctgaaacat 1260ttcttgccta agcgtttgca ctttgctaag agtgatagaa
ttgagccctt gacattctat 1320ttggaccctc agtggcaact tgcattgaat ccctcagaaa
ggaaatattg tggaagtgga 1380tttcatggct ctgacaatgt gttttcaaat atgcaagccc
tctttgttgg ctatggacct 1440ggattcaagc atggcattga ggctgacacc tttgaaaaca
ttgaagtcta taacttgatg 1500tgtgatttgc tgaatttgac accggctcct aataacggaa
ctcatggaag tcttaaccac 1560cttctgaaga atcctgttta tacgccaaag catcccaaag
aagtgcaccc cctggtgcag 1620tgccccttca caagaaaccc cagagataac cttggctgct
catgtaaccc ttccattttg 1680ccgattgagg attttcaaac acagttcaat ctgaccgtgg
cagaagagaa gattattaag 1740catgaaactt tgccctatgg aagacctaga gttctccaga
aggaaaacac catctgtctt 1800ctttcccagc accagtttat gagtggatac agccaagaca
tcttgatgcc cctttggaca 1860tcctataccg tggacagaaa tgacagtttc tctacggaag
acttctccaa ctgtctgtac 1920caggacttta gaattcctct tagtcctgtc cataaatgtt
cattttataa aaataacacc 1980aaagtgagtt acgggttcct ctccccacca caactgaata
agaattcaag tggaatatat 2040tctgaagcct tgcttactac aaatatagtg ccaatgtacc
agagttttca agttatatgg 2100cgctactttc atgacaccct cttgcgaaag tatgcagaag
aaagaaatgg tgtcaatgtc 2160gtcagtggtc ctgtgtttga ctttgattat gatggacgtt
gtgattcctt ggagaatttg 2220aggcaaaaaa gaagagtcat ccgtaaccaa gaaattttga
ttccaactca tttcttcatt 2280gtgctgacaa gctgtaaaga tacatctcag acgcctttgc
actgtgaaaa cctggacacc 2340ttggctttca ttttgcctca caggactgat aacagcgaga
gctgtgtgca tgggaagcat 2400gactcctcat gggttgaaga attgttgatg ttgcacagag
cacggatcac agacgtcgag 2460cacatcactg gactcagctt ttatcaacaa agaaaagagc
cagtttcaga cattttgaag 2520ttgaaaacac atttgccaac ctttagccaa gaagatgatg
acgatgatga cgacgattga 258020859PRTArtificial sequenceC-terminal D8
TAGsNPP1 fusion protein 20Met Gly Val Leu Leu Thr Gln Arg Thr Leu Leu Ser
Leu Val Leu Ala 1 5 10
15 Leu Leu Phe Pro Ser Met Ala Ser Met Pro Ser Cys Ala Lys Glu Val
20 25 30 Lys Ser Cys
Lys Gly Arg Cys Phe Glu Arg Thr Phe Gly Asn Cys Arg 35
40 45 Cys Asp Ala Ala Cys Val Glu Leu
Gly Asn Cys Cys Leu Asp Tyr Gln 50 55
60 Glu Thr Cys Ile Glu Pro Glu His Ile Trp Thr Cys Asn
Lys Phe Arg 65 70 75
80 Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys Ala Cys Ser Asp Asp
85 90 95 Cys Lys Asp Lys
Gly Asp Cys Cys Ile Asn Tyr Ser Ser Val Cys Gln 100
105 110 Gly Glu Lys Ser Trp Val Glu Glu Pro
Cys Glu Ser Ile Asn Glu Pro 115 120
125 Gln Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe
Ser Leu 130 135 140
Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu Pro 145
150 155 160 Val Ile Ser Lys Leu
Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met Arg 165
170 175 Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn
His Tyr Ser Ile Val Thr 180 185
190 Gly Leu Tyr Pro Glu Ser His Gly Ile Ile Asp Asn Lys Met Tyr
Asp 195 200 205 Pro
Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe Asn 210
215 220 Pro Glu Trp Tyr Lys Gly
Glu Pro Ile Trp Val Thr Ala Lys Tyr Gln 225 230
235 240 Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly
Ser Asp Val Glu Ile 245 250
255 Asn Gly Ile Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly Ser Val Pro
260 265 270 Phe Glu
Glu Arg Ile Leu Ala Val Leu Gln Trp Leu Gln Leu Pro Lys 275
280 285 Asp Glu Arg Pro His Phe Tyr
Thr Leu Tyr Leu Glu Glu Pro Asp Ser 290 295
300 Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val
Ile Lys Ala Leu 305 310 315
320 Gln Arg Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys Glu
325 330 335 Leu Asn Leu
His Arg Cys Leu Asn Leu Ile Leu Ile Ser Asp His Gly 340
345 350 Met Glu Gln Gly Ser Cys Lys Lys
Tyr Ile Tyr Leu Asn Lys Tyr Leu 355 360
365 Gly Asp Val Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala
Ala Arg Leu 370 375 380
Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly 385
390 395 400 Ile Ala Arg Asn
Leu Ser Cys Arg Glu Pro Asn Gln His Phe Lys Pro 405
410 415 Tyr Leu Lys His Phe Leu Pro Lys Arg
Leu His Phe Ala Lys Ser Asp 420 425
430 Arg Ile Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln
Leu Ala 435 440 445
Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly Ser 450
455 460 Asp Asn Val Phe Ser
Asn Met Gln Ala Leu Phe Val Gly Tyr Gly Pro 465 470
475 480 Gly Phe Lys His Gly Ile Glu Ala Asp Thr
Phe Glu Asn Ile Glu Val 485 490
495 Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn
Asn 500 505 510 Gly
Thr His Gly Ser Leu Asn His Leu Leu Lys Asn Pro Val Tyr Thr 515
520 525 Pro Lys His Pro Lys Glu
Val His Pro Leu Val Gln Cys Pro Phe Thr 530 535
540 Arg Asn Pro Arg Asp Asn Leu Gly Cys Ser Cys
Asn Pro Ser Ile Leu 545 550 555
560 Pro Ile Glu Asp Phe Gln Thr Gln Phe Asn Leu Thr Val Ala Glu Glu
565 570 575 Lys Ile
Ile Lys His Glu Thr Leu Pro Tyr Gly Arg Pro Arg Val Leu 580
585 590 Gln Lys Glu Asn Thr Ile Cys
Leu Leu Ser Gln His Gln Phe Met Ser 595 600
605 Gly Tyr Ser Gln Asp Ile Leu Met Pro Leu Trp Thr
Ser Tyr Thr Val 610 615 620
Asp Arg Asn Asp Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu Tyr 625
630 635 640 Gln Asp Phe
Arg Ile Pro Leu Ser Pro Val His Lys Cys Ser Phe Tyr 645
650 655 Lys Asn Asn Thr Lys Val Ser Tyr
Gly Phe Leu Ser Pro Pro Gln Leu 660 665
670 Asn Lys Asn Ser Ser Gly Ile Tyr Ser Glu Ala Leu Leu
Thr Thr Asn 675 680 685
Ile Val Pro Met Tyr Gln Ser Phe Gln Val Ile Trp Arg Tyr Phe His 690
695 700 Asp Thr Leu Leu
Arg Lys Tyr Ala Glu Glu Arg Asn Gly Val Asn Val 705 710
715 720 Val Ser Gly Pro Val Phe Asp Phe Asp
Tyr Asp Gly Arg Cys Asp Ser 725 730
735 Leu Glu Asn Leu Arg Gln Lys Arg Arg Val Ile Arg Asn Gln
Glu Ile 740 745 750
Leu Ile Pro Thr His Phe Phe Ile Val Leu Thr Ser Cys Lys Asp Thr
755 760 765 Ser Gln Thr Pro
Leu His Cys Glu Asn Leu Asp Thr Leu Ala Phe Ile 770
775 780 Leu Pro His Arg Thr Asp Asn Ser
Glu Ser Cys Val His Gly Lys His 785 790
795 800 Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu His
Arg Ala Arg Ile 805 810
815 Thr Asp Val Glu His Ile Thr Gly Leu Ser Phe Tyr Gln Gln Arg Lys
820 825 830 Glu Pro Val
Ser Asp Ile Leu Lys Leu Lys Thr His Leu Pro Thr Phe 835
840 845 Ser Gln Glu Asp Asp Asp Asp Asp
Asp Asp Asp 850 855
213255DNAArtificial sequenceNucleotide sequence encoding sNPP1-Fc fusion
protein 21atgggtgtac tgctcacaca gaggacgctg ctcagtctgg tccttgcact
cctgtttcca 60agcatggcga gcatgccaag ttgtgccaaa gaagttaaaa gttgcaaagg
tcgctgtttc 120gagagaacat ttgggaactg tcgctgtgat gctgcctgtg ttgagcttgg
aaactgctgt 180ttggattacc aggagacgtg catagaacca gaacatatat ggacttgcaa
caaattcagg 240tgtggtgaga aaagattgac cagaagcctc tgtgcctgtt cagatgattg
caaggacaag 300ggcgactgct gcatcaacta cagttcagtg tgtcaaggtg agaaaagttg
ggtggaagaa 360ccatgtgaga gcattaatga gccacagtgc ccagcagggt ttgaaacgcc
tcctacactc 420ttgttttctt tggatggatt cagggcagaa tatttgcaca cttggggtgg
acttcttcct 480gttattagca aactcaaaaa atgtggaaca tatactaaaa acatgagacc
ggtgtatcca 540acaaaaactt tccccaatca ctacagcatt gtcaccggat tgtatccaga
atctcatggc 600ataatcgaca ataagatgta tgatcccaaa atgaatgctt ccttttcact
taaaagtaaa 660gagaaattta atccggagtg gtacaaagga gaaccaattt gggtcacagc
taagtatcaa 720ggcctcaagt ctggcacatt tttctggcca ggatcagatg tggaaattaa
cggaattttc 780ccagacatct ataaaatgta taatggttca gtgccatttg aagaaaggat
tttggctgtt 840cttcagtggc tgcagcttcc aaaagatgaa agaccacact tttacacttt
gtatttggaa 900gaaccagatt cttcaggtca ttcatatgga ccagtcagca gtgaagtcat
caaagccttg 960cagagggttg atggtatggt tggtatgctg atggatggtc tgaaagagct
gaacttgcac 1020agatgcctga acctcatcct tatttcagat catggcatgg aacaaggcag
ttgtaagaaa 1080tacatatatc tgaataagta tttgggggat gttaaaaata ttaaagttat
ctatggacct 1140gcagctcgat tgagaccctc tgatgtccca gataaatact attcatttaa
ctatgaaggc 1200attgcccgaa atctttcttg ccgggaacca aaccagcact tcaaacctta
tctgaaacat 1260ttcttgccta agcgtttgca ctttgctaag agtgatagaa ttgagccctt
gacattctat 1320ttggaccctc agtggcaact tgcattgaat ccctcagaaa ggaaatattg
tggaagtgga 1380tttcatggct ctgacaatgt gttttcaaat atgcaagccc tctttgttgg
ctatggacct 1440ggattcaagc atggcattga ggctgacacc tttgaaaaca ttgaagtcta
taacttgatg 1500tgtgatttgc tgaatttgac accggctcct aataacggaa ctcatggaag
tcttaaccac 1560cttctgaaga atcctgttta tacgccaaag catcccaaag aagtgcaccc
cctggtgcag 1620tgccccttca caagaaaccc cagagataac cttggctgct catgtaaccc
ttccattttg 1680ccgattgagg attttcaaac acagttcaat ctgaccgtgg cagaagagaa
gattattaag 1740catgaaactt tgccctatgg aagacctaga gttctccaga aggaaaacac
catctgtctt 1800ctttcccagc accagtttat gagtggatac agccaagaca tcttgatgcc
cctttggaca 1860tcctataccg tggacagaaa tgacagtttc tctacggaag acttctccaa
ctgtctgtac 1920caggacttta gaattcctct tagtcctgtc cataaatgtt cattttataa
aaataacacc 1980aaagtgagtt acgggttcct ctccccacca caactgaata agaattcaag
tggaatatat 2040tctgaagcct tgcttactac aaatatagtg ccaatgtacc agagttttca
agttatatgg 2100cgctactttc atgacaccct cttgcgaaag tatgcagaag aaagaaatgg
tgtcaatgtc 2160gtcagtggtc ctgtgtttga ctttgattat gatggacgtt gtgattcctt
ggagaatttg 2220aggcaaaaaa gaagagtcat ccgtaaccaa gaaattttga ttccaactca
tttcttcatt 2280gtgctgacaa gctgtaaaga tacatctcag acgcctttgc actgtgaaaa
cctggacacc 2340ttggctttca ttttgcctca caggactgat aacagcgaga gctgtgtgca
tgggaagcat 2400gactcctcat gggttgaaga attgttgatg ttgcacagag cacggatcac
agacgtcgag 2460cacatcactg gactcagctt ttatcaacaa agaaaagagc cagtttcaga
cattttgaag 2520ttgaaaacac atttgccaac ctttagccaa gaagatgagc ccaaatcttg
tgacaaaact 2580cacacatgcc caccgtgccc agcacctgaa ctcctggggg gaccgtcagt
cttcctcttc 2640cccccaaaac ccaaggacac cctcatgatc tcccggaccc ctgaggtcac
atgcgtggtg 2700gtggacgtga gccacgaaga ccctgaggtc aagttcaact ggtacgtgga
cggcgtggag 2760gtgcataatg ccaagacaaa gccgcgggag gagcagtaca acagcacgta
ccgtgtggtc 2820agcgtcctca ccgtcctgca ccaggactgg ctgaatggca aggagtacaa
gtgcaaggtc 2880tccaacaaag ccctcccagc ccccatcgag aaaaccatct ccaaagccaa
agggcagccc 2940cgagaaccac aggtgtacac cctgccccca tcccgggagg agatgaccaa
gaaccaggtc 3000agcctgacct gcctggtcaa aggcttctat cccagcgaca tcgccgtgga
gtgggagagc 3060aatgggcagc cggagaacaa ctacaagacc acgcctcccg tgctggactc
cgacggctcc 3120ttcttcctct atagcaagct caccgtggac aagagcaggt ggcagcaggg
gaacgtcttc 3180tcatgctccg tgatgcatga ggctctgcac aaccactaca cgcagaagag
cctctccctg 3240tccccgggta aatga
3255221084PRTArtificial sequencesNPP1-Fc fusion protein 22Met
Gly Val Leu Leu Thr Gln Arg Thr Leu Leu Ser Leu Val Leu Ala 1
5 10 15 Leu Leu Phe Pro Ser Met
Ala Ser Met Pro Ser Cys Ala Lys Glu Val 20
25 30 Lys Ser Cys Lys Gly Arg Cys Phe Glu Arg
Thr Phe Gly Asn Cys Arg 35 40
45 Cys Asp Ala Ala Cys Val Glu Leu Gly Asn Cys Cys Leu Asp
Tyr Gln 50 55 60
Glu Thr Cys Ile Glu Pro Glu His Ile Trp Thr Cys Asn Lys Phe Arg 65
70 75 80 Cys Gly Glu Lys Arg
Leu Thr Arg Ser Leu Cys Ala Cys Ser Asp Asp 85
90 95 Cys Lys Asp Lys Gly Asp Cys Cys Ile Asn
Tyr Ser Ser Val Cys Gln 100 105
110 Gly Glu Lys Ser Trp Val Glu Glu Pro Cys Glu Ser Ile Asn Glu
Pro 115 120 125 Gln
Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe Ser Leu 130
135 140 Asp Gly Phe Arg Ala Glu
Tyr Leu His Thr Trp Gly Gly Leu Leu Pro 145 150
155 160 Val Ile Ser Lys Leu Lys Lys Cys Gly Thr Tyr
Thr Lys Asn Met Arg 165 170
175 Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn His Tyr Ser Ile Val Thr
180 185 190 Gly Leu
Tyr Pro Glu Ser His Gly Ile Ile Asp Asn Lys Met Tyr Asp 195
200 205 Pro Lys Met Asn Ala Ser Phe
Ser Leu Lys Ser Lys Glu Lys Phe Asn 210 215
220 Pro Glu Trp Tyr Lys Gly Glu Pro Ile Trp Val Thr
Ala Lys Tyr Gln 225 230 235
240 Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu Ile
245 250 255 Asn Gly Ile
Phe Pro Asp Ile Tyr Lys Met Tyr Asn Gly Ser Val Pro 260
265 270 Phe Glu Glu Arg Ile Leu Ala Val
Leu Gln Trp Leu Gln Leu Pro Lys 275 280
285 Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu
Pro Asp Ser 290 295 300
Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val Ile Lys Ala Leu 305
310 315 320 Gln Arg Val Asp
Gly Met Val Gly Met Leu Met Asp Gly Leu Lys Glu 325
330 335 Leu Asn Leu His Arg Cys Leu Asn Leu
Ile Leu Ile Ser Asp His Gly 340 345
350 Met Glu Gln Gly Ser Cys Lys Lys Tyr Ile Tyr Leu Asn Lys
Tyr Leu 355 360 365
Gly Asp Val Lys Asn Ile Lys Val Ile Tyr Gly Pro Ala Ala Arg Leu 370
375 380 Arg Pro Ser Asp Val
Pro Asp Lys Tyr Tyr Ser Phe Asn Tyr Glu Gly 385 390
395 400 Ile Ala Arg Asn Leu Ser Cys Arg Glu Pro
Asn Gln His Phe Lys Pro 405 410
415 Tyr Leu Lys His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser
Asp 420 425 430 Arg
Ile Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gln Trp Gln Leu Ala 435
440 445 Leu Asn Pro Ser Glu Arg
Lys Tyr Cys Gly Ser Gly Phe His Gly Ser 450 455
460 Asp Asn Val Phe Ser Asn Met Gln Ala Leu Phe
Val Gly Tyr Gly Pro 465 470 475
480 Gly Phe Lys His Gly Ile Glu Ala Asp Thr Phe Glu Asn Ile Glu Val
485 490 495 Tyr Asn
Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn Asn 500
505 510 Gly Thr His Gly Ser Leu Asn
His Leu Leu Lys Asn Pro Val Tyr Thr 515 520
525 Pro Lys His Pro Lys Glu Val His Pro Leu Val Gln
Cys Pro Phe Thr 530 535 540
Arg Asn Pro Arg Asp Asn Leu Gly Cys Ser Cys Asn Pro Ser Ile Leu 545
550 555 560 Pro Ile Glu
Asp Phe Gln Thr Gln Phe Asn Leu Thr Val Ala Glu Glu 565
570 575 Lys Ile Ile Lys His Glu Thr Leu
Pro Tyr Gly Arg Pro Arg Val Leu 580 585
590 Gln Lys Glu Asn Thr Ile Cys Leu Leu Ser Gln His Gln
Phe Met Ser 595 600 605
Gly Tyr Ser Gln Asp Ile Leu Met Pro Leu Trp Thr Ser Tyr Thr Val 610
615 620 Asp Arg Asn Asp
Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu Tyr 625 630
635 640 Gln Asp Phe Arg Ile Pro Leu Ser Pro
Val His Lys Cys Ser Phe Tyr 645 650
655 Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe Leu Ser Pro Pro
Gln Leu 660 665 670
Asn Lys Asn Ser Ser Gly Ile Tyr Ser Glu Ala Leu Leu Thr Thr Asn
675 680 685 Ile Val Pro Met
Tyr Gln Ser Phe Gln Val Ile Trp Arg Tyr Phe His 690
695 700 Asp Thr Leu Leu Arg Lys Tyr Ala
Glu Glu Arg Asn Gly Val Asn Val 705 710
715 720 Val Ser Gly Pro Val Phe Asp Phe Asp Tyr Asp Gly
Arg Cys Asp Ser 725 730
735 Leu Glu Asn Leu Arg Gln Lys Arg Arg Val Ile Arg Asn Gln Glu Ile
740 745 750 Leu Ile Pro
Thr His Phe Phe Ile Val Leu Thr Ser Cys Lys Asp Thr 755
760 765 Ser Gln Thr Pro Leu His Cys Glu
Asn Leu Asp Thr Leu Ala Phe Ile 770 775
780 Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cys Val His
Gly Lys His 785 790 795
800 Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu His Arg Ala Arg Ile
805 810 815 Thr Asp Val Glu
His Ile Thr Gly Leu Ser Phe Tyr Gln Gln Arg Lys 820
825 830 Glu Pro Val Ser Asp Ile Leu Lys Leu
Lys Thr His Leu Pro Thr Phe 835 840
845 Ser Gln Glu Asp Glu Pro Lys Ser Cys Asp Lys Thr His Thr
Cys Pro 850 855 860
Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe 865
870 875 880 Pro Pro Lys Pro Lys
Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val 885
890 895 Thr Cys Val Val Val Asp Val Ser His Glu
Asp Pro Glu Val Lys Phe 900 905
910 Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
Pro 915 920 925 Arg
Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr 930
935 940 Val Leu His Gln Asp Trp
Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val 945 950
955 960 Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys
Thr Ile Ser Lys Ala 965 970
975 Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
980 985 990 Glu Glu
Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly 995
1000 1005 Phe Tyr Pro Ser Asp
Ile Ala Val Glu Trp Glu Ser Asn Gly Gln 1010 1015
1020 Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
Val Leu Asp Ser Asp 1025 1030 1035
Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
1040 1045 1050 Trp Gln
Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala 1055
1060 1065 Leu His Asn His Tyr Thr Gln
Lys Ser Leu Ser Leu Ser Pro Gly 1070 1075
1080 Lys
User Contributions:
Comment about this patent or add new information about this topic: