Patent application title: NUCLEIC SEQUENCE AND DEDUCED PROTEIN SEQUENCE FAMILY WITH HUMAN ENDOGENOUS RETROVIRAL MOTIFS, AND THEIR USES
Inventors:
Patrick M. Alliel (Clamart, FR)
Jean-Pierre Perin (Le Plessis-Robinson, FR)
Francois Rieger (Boulogne, FR)
Assignees:
INSTITUT NAT'L DE LA SANTE ET DE LA RECHERCHE MED.
IPC8 Class: AC12Q100FI
USPC Class:
435 4
Class name: Chemistry: molecular biology and microbiology measuring or testing process involving enzymes or micro-organisms; composition or test strip therefore; processes of forming such composition or test strip
Publication date: 2009-10-22
Patent application number: 20090263783
Agents:
OBLON, SPIVAK, MCCLELLAND MAIER & NEUSTADT, L.L.P.
Origin: ALEXANDRIA, VA US
Abstract:
The present invention provides a translational product encoded by the
nucleotide sequence of SEQ ID NO: 2, which corresponds to the gag gene of
an endogenous human retrovirus named HERV-7q. The present invention also
provides methods for diagnosing a neurological or autoimmune disease in a
patient.
Claims:
1. A translational product encoded by the nucleotide sequence of SEQ ID
NO: 2, corresponding to the gag gene of an endogenous human retrovirus
named HERV-7q.
2. The translational product according to claim 1, which is selected from the group consisting of the sequences SEQ ID NOs: 29 to 34, 36, and 58.
3. A method for diagnosing a neurological or autoimmune disease in a patient, comprising the detection of a translation product according to claim 1 in a biological sample containing proteins obtained from said patient.
4. The method according to claim 3, wherein said neurological or autoimmune disease is multiple sclerosis.
5. The method according to claim 3, wherein said detection is an immunodetection.
6. A method for diagnosing a neurological or autoimmune disease in a patient, comprising comparative analysis of the expression of the translational product according to claim 1 between a biological sample containing proteins obtained from said patient and a sample from a normal individual used as a reference.
7. The method according to claim 6, wherein said neurological or autoimmune disease is multiple sclerosis.
8. A method for diagnosing a neurological or autoimmune disease in a patient, comprising comparing the protein sequences present in a biological sample from said patient with the sequence of the translation product according to claim 1.
9. The method according to claim 8, wherein said neurological or autoimmune disease is multiple sclerosis.
Description:
[0001]The present invention relates to a novel nucleic sequence and
deduced protein sequence family with complete or partial human endogenous
retroviral motifs, and sequences flanking or adjacent to said sequences,
and controlled by the latter; modification of the expression or
impairment of the structure (polyadenylation, alternative splicing and
the like) of said flanking sequences.
[0002]The invention also relates to the detection and/or use of said nucleic sequences and of said corresponding protein sequences in the context of diagnostic, prophylactic and therapeutic applications, in particular for neuropathological conditions with an autoimmune component such as multiple sclerosis.
[0003]The invention also relates to the production of antisense double-stranded and single-stranded nucleic probes and of ribozymes capable of modulating viral replication (T. R. Cech, Science, 1987, 236, 1532-1539; R. H. Symons, Trends Biochem. Sci., 1989, 14, 445-450), of the corresponding recombinant molecules, and of associated antibodies.
[0004]Retroviruses are viruses which replicate by reversing the conventional flow of genetic information. This process, called reverse transcription, is mediated by an RNA-dependent DNA polymerase, or reverse transcriptase, encoded by the pol gene. Retroviruses also carry at least two additional genes. The gag gene encodes the proteins of the skeleton, matrix, nucleocapsid and capsid. The env gene encodes the envelope glycoproteins. Retroviral transcription is regulated by promoter regions or "enhancers" situated in highly repeated regions called LTRs (Long Terminal Repeats), which are present at both ends of the retroviral genome.
[0005]During the infection of a cell, the reverse transcriptase makes a DNA copy of the RNA genome; this copy may then integrate into the human genome. Retroviruses do not kill the cells which they infect, but on the contrary often enhance their rate of growth. Retroviruses can infect germ cells or embryos at an early stage; they can, under these conditions, integrate into the germ line and be passed on by vertical Mendelian transmission, which constitutes the closest relationship between a host and its parasite. These endogenous viruses can degenerate over generations of the host organism and lose their initial properties. However, some of them may conserve all or part of their properties or of the properties of their constituent motifs, or acquire novel functional properties having an advantage for the host organism, which would explain the preservation of their sequence.
[0006]The existence of endogenous motifs having long open reading frames and/or subjected to a strong selection pressure can therefore be an indication of a preserved or acquired biological function, which may correspond to a benefit for the host organism. These retroviral sequences can also undergo, over the generations, discrete modifications which will be able to trigger some of their potentials and generate or promote pathological processes. It has recently appeared necessary to carry out a review and to identify these sequences so as to be able to evaluate their functional impact.
[0007]Human endogenous retroviral sequences or HERVs represent a substantial part of the human genome. These retroviral regions exist in several forms:
[0008]complete endogenous retroviral structures combining gag, pol and env motifs, flanked by repeat nucleic sequences which exhibit a significant analogy with the LTR-gag-pol-env-LTR structure of infectious retroviruses,
[0009]truncated retroviral sequences; for example the retrotransposons lack their env domain and the retroposons do not possess the env and LTR regions.
[0010]Up until now, the study of these regions of the genome has been neglected in humans for essentially two reasons:
[0011]the existence of insertions/deletions which can shift the reading frame and of mutations which modify the sequence. These modifications cause impairment of the structure and consequently of the biological function of these motifs,
[0012]the absence of confirmed associations with human pathological conditions.
[0013]The recent knowledge of fragments which are significantly representative of the human genome and an orientation of research studies toward a study of structure/function of endogenous retroviral motifs have made it possible to specify the importance of these regions. The involvement of truncated or complete endogenous sequences in pathological conditions in animals is documented; for example their association with tumor processes has been clearly demonstrated (S. K. Chattopadhyay et al., 1982, Nature, 295, 25-31). Research aimed at specifying the association or the influence of HERVs in human pathological conditions is now therefore justified.
[0014]A classification of the HERV elements has been proposed (Tonjes R. R. et al., AIDS & Hum. Retrovirol., 1996, 13, p 261-p 267; A. M. Krieg et al., FASEB J., 1992, 6, 2537-2544). It is based on the homology of these sequences with retroviruses isolated in animals, established with the aid of heterologous retroviral probes. Indeed, in general, the HERVs exhibit relatively little homology with known human infectious retroviruses.
[0015]The class I families exhibit a sequence homology with the type C mammalian retroviruses; there may be mentioned in particular the ERI superfamily, close to the MuLV virus (murine leukemia virus) and to the BaEV virus (baboon endogenous virus).
[0016]The class II families exhibit a sequence homology with the type B mammalian retroviruses such as MMTV (mouse mammary tumor virus) or the type D retroviruses such as SRV (simian retrovirus).
[0017]Other families have also been described; among these, there may be mentioned HERVs which exceptionally exhibit partial homology with HTLV-1 (RTVL-H) or primate viruses; HRES-1, for example, exhibits sequence homology with HTLVs.
[0018]Large-scale programmes for sequencing the human genome now make available a significant number of novel retroviral sequences, and data processing software packages make it possible to identify and analyse these genes. In this context, a systematic search covering all the information available to date has been initiated in order to identify novel human endogenous retroviral sequences according to certain analytical criteria:
[0019]presence of long open reading frames conserved during evolution of the host organism and which may suggest a biological function,
[0020]analogy with sequences already characterized outside or inside the retrovirus domain,
[0021]location in regions of susceptibility for certain pathological conditions or close to essential genes, for example in the cancer domain, regulation of the immune system or in certain neuropathological conditions.
[0022]The work carried out by the inventors on sequence databases allowed them to identify a set of endogenous retroviral sequences or motifs whose normal or pathological expression can promote or disrupt a protective effect in relation to pathological processes, or play a role in the onset or worsening of pathological conditions.
[0023]The subject of the present invention is a purified nucleic acid fragment, characterized in that it comprises all or part of a sequence encoding a human endogenous retroviral sequence, which has at least env-type retroviral motifs, corresponding to the sequence SEQ ID NO: 1 or to a sequence exhibiting a level of homology with said sequence SEQ ID NO: 1 greater than or equal to 80% on more than 190 nucleotides or greater than or equal to 70% on more than 600 nucleotides for the env-type domains.
[0024]The expression homologous sequence is understood to mean both a sequence which exhibits complete or partial identity with the above-mentioned sequence SEQ ID NO: 1 and a sequence which exhibits partial similarity with said sequence SEQ ID NO: 1.
[0025]According to an advantageous embodiment of said fragment, it has retroviral motifs corresponding to an env domain and corresponding to the sequence SEQ ID NO: 1 and retroviral motifs corresponding to a gag domain and corresponding to the sequence SEQ ID NO: 2 or to a sequence exhibiting a level of homology greater than or equal to 80% on more than 190 nucleotides or greater than or equal to 70% on more than 600 nucleotides for the env-type domains and a level of homology greater than or equal to 90% on more than 700 nucleotides or greater than or equal to 70% on more than 1 200 nucleotides for the gag-type domains, said motifs having no insertion or deletion of more than 200 nucleotides.
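The homology thresholds stated in the two preceding paragraphs can be written as a simple test. The following is only a minimal sketch: the function names are hypothetical, and the identity percentage and aligned length are assumed to come from an alignment tool that is not specified in the text.

```python
def meets_env_criterion(identity_pct: float, aligned_nt: int) -> bool:
    """env-type domains: >= 80% on more than 190 nt, or >= 70% on more than 600 nt."""
    return (identity_pct >= 80 and aligned_nt > 190) or (
        identity_pct >= 70 and aligned_nt > 600
    )


def meets_gag_criterion(identity_pct: float, aligned_nt: int) -> bool:
    """gag-type domains: >= 90% on more than 700 nt, or >= 70% on more than 1 200 nt."""
    return (identity_pct >= 90 and aligned_nt > 700) or (
        identity_pct >= 70 and aligned_nt > 1200
    )


if __name__ == "__main__":
    # Hypothetical alignment results, for illustration only.
    print(meets_env_criterion(82.0, 250))   # short but highly similar env match
    print(meets_gag_criterion(75.0, 1000))  # too short for the 70% gag threshold
```

The sketch does not check the further condition that the motifs have no insertion or deletion of more than 200 nucleotides, which would require the alignment itself.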
[0026]Said fragments constitute a novel family of human endogenous retroviral sequences (HERV-7q family) which exhibits sequence homology with the MSRV retroviruses, as described in International Application WO 97/06260; said fragments according to the present invention have:
[0027]two repeat nucleotide motifs of 711 bp (FIG. 3), having characteristic signals identified in LTRs (Long Terminal Repeats): transcription promoters of the TATAA or CCAAT box type. These repeat domains delimit three deduced motifs of the gag, pol and env type (FIG. 2),
[0028]an env-type motif (positions 6965 nt-9550 nt on the sequence SEQ ID NO: 3 or in FIG. 1) which contains a long open reading frame of 1 620 nucleotides (positions 7874-9493 of the sequence SEQ ID NO: 3 and FIG. 1) encoding a protein of 540 amino acids, with a previously unpublished sequence, called enverin (FIG. 4 and SEQ ID NO: 26, and underlined fragment in FIG. 18). Inside the transmembrane domain of this env domain there is a peptide motif of the CKS-25/CKS-17 type (FIG. 5), recognized as having immunosuppressive functions on the host lymphocytic cells (M. Mitani et al., 1987, Proc. Natl. Acad. Sci. USA, 84, 237-240). A zinc finger type domain HX3-4HX22-33CX2C (Kulkosky et al., 1992, Mol. Cell. Biol., 12, 2331-2338), which is present in integrase-type domains, is identified in another reading frame. This particular env domain is the characteristic signature of the novel endogenous retroviral motifs,
[0029]the motif of the gag type (positions 3065 nt-4390 nt on the sequence SEQ ID NO: 3), encoding protein motifs according to FIG. 6 (SEQ ID NO: 58) (positions 3118-4198 of SEQ ID NO: 3), was identified by virtue of analogies with known gag domains. The region of major homology QX3EX7R is for example present (Benit et al., 1997, J. Virol., 71, 5652-5657). The nucleic acid binding motif CX2CX3-4HX4C, situated at the C-terminal position, is identified in another reading frame (Covey et al., 1986, Nucleic Acids Res., 14, 623-633). Upstream of the gag domain, a motif of 182 nucleotides is detected which is repeated twice (FIG. 1),
[0030]the pol domain exhibits the conventional consensus of a retrovirus pol region at the level of the protease, reverse transcriptase and RNAse H domains. A motif close to the consensus LLDTGA is found in pol (Weber et al., 1988, Science, 243, 928-931). The motifs D and AF, LPQ and SP, and YVDD (Xiong and Eickbush, 1990, EMBO J., 9, 3353-3362) are respectively found in the 3rd, 4th and 5th homology boxes. The motifs YTDGSS and TDS are present in the RNAse H region,
[0031]the gag and pol regions could be considered as being joined, with a passage from the gag region to the pol region by a reading frame shift.
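The consensus motifs quoted above (HX3-4HX22-33CX2C, QX3EX7R and CX2CX3-4HX4C, where X is any residue) translate directly into regular expressions, and the stated length of the env open reading frame can be checked by simple arithmetic. The sketch below is illustrative only: the dictionary keys, function names and the toy peptide are hypothetical and are not sequences from the application.

```python
import re

# Consensus motifs from the text, with X read as "any amino acid".
MOTIFS = {
    "zinc_finger_HX3-4HX22-33CX2C": re.compile(r"H.{3,4}H.{22,33}C.{2}C"),
    "major_homology_QX3EX7R": re.compile(r"Q.{3}E.{7}R"),
    "nucleic_acid_binding_CX2CX3-4HX4C": re.compile(r"C.{2}C.{3,4}H.{4}C"),
}


def find_motifs(protein: str) -> dict:
    """Return the 0-based start positions of each motif in a protein string."""
    return {
        name: [m.start() for m in pattern.finditer(protein)]
        for name, pattern in MOTIFS.items()
    }


def orf_length(first_nt: int, last_nt: int) -> int:
    """Length in nucleotides of a region given inclusive 1-based positions."""
    return last_nt - first_nt + 1


if __name__ == "__main__":
    # Positions 7874-9493 of SEQ ID NO: 3, as stated for the env reading frame.
    length = orf_length(7874, 9493)
    print(length, length // 3)  # nucleotides, codons

    # Hypothetical toy peptide built to contain two of the motifs.
    toy = "MA" + "QAAAEAAAAAAAR" + "GG" + "CAACAAAHAAAAC"
    print(find_motifs(toy))
```

The inclusive positions 7874-9493 give 1 620 nucleotides, that is 540 codons, which agrees with the figures in the text.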
[0032]The present invention includes the sequences belonging to the HERV-7q family as defined above (presence of the SEQ ID NO: 1 sequence or of a homologous sequence or presence of both the sequences SEQ ID NO: 1 and SEQ ID NO: 2) and in particular the sequences SEQ ID NO: 3-22, 28 and 61; it also includes the complementary nucleic sequences and the reverse sequences complementary to the preceding sequences as well as fragments derived from the coding regions of the preceding sequences corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences (SEQ ID NO: 37-57, 59-60 and 121-122).
[0033]These various fragments may be advantageously used as primers or as probes (reagents A); they hybridize specifically under high stringency conditions to a sequence of the HERV-7q family.
[0034]Among these fragments, the following fragments may be preferably mentioned: [0035]a fragment of 182 nucleotides, repeated twice, situated upstream of the gag domain at positions 2502-2611/2613˜2865 of SEQ ID NO: 3:
[0036]Primers and Probes Specific for the gag Region
[0037]a sense primer G1F located in the region upstream of the gag domain of HERV-7q:
5'GGACCATAGAGGACACTCCAGGACTA3' (SEQ ID NO: 37)
[0038]an antisense primer G1R located in the terminal 3' region of the gag domain:
5'CCTCAGTCCTGCTGCTGGATCATCT3' (SEQ ID NO: 38)
[0039]the fragment of 1505 nt amplified by the pair G1F-G1R is used in order to generate the probes capable of hybridizing the various PCR amplification products:
a nested sense primer G2F: 5'CCTCCAAGCAGTGGGAGGAAGAGAATT3' (SEQ ID NO: 39)
a nested antisense primer G2R: 5'CCTTCCCTGTGTTATTGTGGACATCATT3' (SEQ ID NO: 40)
a nested sense primer G4F: 5'GGAAGAAGTCTATGAATTATTCAATGATGT3' (SEQ ID NO: 41)
a nested sense primer G3F: 5'GGGACACAGAATCAGAACATGGAGATT3' (SEQ ID NO: 42)
a nested antisense primer G4R: 5'GCCTTCAGAAGAGTCAGGTGACAGAGA3' (SEQ ID NO: 43)
a nested antisense primer G5R: 5'GAGCCTCCAAAGTCCACTTGCCTGA3' (SEQ ID NO: 44)
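When working with sense and antisense primers such as G1F and G1R, it is often useful to compute the reverse complement (the form in which an antisense primer appears on the sense strand) together with the length and GC content. The following is a minimal sketch using only the two primer sequences given above; the helper names are hypothetical and no melting-temperature model is implied.

```python
# Primer sequences as listed in the text (5' to 3').
G1F = "GGACCATAGAGGACACTCCAGGACTA"  # SEQ ID NO: 37, sense
G1R = "CCTCAGTCCTGCTGCTGGATCATCT"   # SEQ ID NO: 38, antisense

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    for name, primer in (("G1F", G1F), ("G1R", G1R)):
        print(name, len(primer), round(gc_fraction(primer), 2))
    # Sequence to look for on the sense strand when locating G1R.
    print(reverse_complement(G1R))
```

Searching a sense-strand sequence for G1F and for the reverse complement of G1R would be one way to locate the expected amplicon, assuming the full sequence is available.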
[0040]Primers and Probes Specific for the env Region
a sense primer E1F: 5'GATTTCAGTATCTACTAGTCTGGGTAGAT3' (SEQ ID NO: 45)
an antisense primer E1R: 5'CTAGGAAATCCAGCTAGTCCTGTCTCA3' (SEQ ID NO: 46)
[0041]the fragment of 2529 nt, amplified by the pair of primers E1F-E1R, is used to generate the probes capable of hybridizing the various PCR amplification products:
a sense primer E2F: 5'CCAAGACAGCCAACTTAGTTGCAGACAT3' (SEQ ID NO: 47)
an antisense primer E2R: 5'GGACGCTGCATTCTCCATAGAAACTCTT3' (SEQ ID NO: 48)
a sense primer E3F: 5'GCAATACTACATACACAACCAACTCCCAA3' (SEQ ID NO: 49)
an antisense primer E3R: 5'GGGGGAGGCATATCCAACAGTTAGTA3' (SEQ ID NO: 50)
a sense primer E4F: 5'CCATCTACACTGAACAAGATTTATACACTT3' (SEQ ID NO: 51)
an antisense primer E4R: 5'AATGCCAGTACCTAGTGCACCTAGCACT3' (SEQ ID NO: 52)
a sense primer E5F: 5'CGAATACAACGTAGAGCAGAGGAGCTTCGAA3' (SEQ ID NO: 53)
a sense primer E6F: 5'AGCCCAAGATGCAGTCCAAGACTAAGAT3' (SEQ ID NO: 54)
a primer E5R: 5'GCGTAGTAGAGGTTGTGCAGCTGAGAT3' (SEQ ID NO: 55)
a primer ExF: CCCTTACCAAGAGTTTCTATGGAGAAT (SEQ ID NO: 56)
a primer ExR: ACCGCTCTAACTGCTTCCTGCTGAATT (SEQ ID NO: 57)
[0042]All the oligonucleotides are designed so that a sense primer and an antisense primer can be generated by shifting the sequence of the reference primer by 1 to 7 nucleotides toward the 5' side or toward the 3' side; this modification of the sequence may change the size of the primer by 1 to 7 nucleotides, depending on the case. The primers chosen may be optimized, depending on the case, by shortening or extension affecting 1 to 9 nucleotides.
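One reading of the shift described above is a sliding window: the primer keeps its length while its position on the template moves by 1 to 7 nucleotides in either direction. The sketch below illustrates that reading only. The function name, the sign convention and the toy template (the G2F primer padded with arbitrary flanks) are assumptions; the real flanking sequence would come from SEQ ID NO: 3.

```python
def shifted_primers(template: str, start: int, length: int, max_shift: int = 7) -> dict:
    """Return primer variants obtained by sliding a window along the template.

    Keys are shifts in nucleotides (negative: toward the 5' side, positive:
    toward the 3' side); values are the shifted primer sequences. Shifts that
    would run off the template are skipped.
    """
    variants = {}
    for shift in range(-max_shift, max_shift + 1):
        new_start = start + shift
        if shift == 0 or new_start < 0 or new_start + length > len(template):
            continue
        variants[shift] = template[new_start:new_start + length]
    return variants


if __name__ == "__main__":
    G2F = "CCTCCAAGCAGTGGGAGGAAGAGAATT"  # SEQ ID NO: 39
    # Hypothetical template: arbitrary flanks around G2F, for illustration only.
    template = "ACGT" * 3 + G2F + "TTGCA" * 3
    for shift, primer in sorted(shifted_primers(template, 12, len(G2F)).items()):
        print(f"{shift:+d} {primer}")
```

Shortening or extension of a primer by 1 to 9 nucleotides, also mentioned in the text, would correspond to varying `length` as well as `start`.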
[0043]Preferably, the hybridization, cloning, subcloning, production, preparation and analysis of the nucleic acids, peptides and antibodies, the sequencing of the nucleic acids and peptides, the in situ hybridization and the immunohistochemistry are carried out under the conditions described in the following books:
[0044]Current Protocols in Molecular Biology, Eds. F. M. Ausubel, R. Brent & R. E. Kingston et al., Greene Publishing Associates and Wiley Interscience.
[0045]Molecular Cloning: a laboratory manual. Eds. J. Sambrook, E. F. Fritsch & T. Maniatis, Cold Spring Harbor Laboratory Press, Cold Spring Harbor.
[0046]The Practical Approach series. Eds. D. Rickwood & B. D. Hames, IRL Press and Oxford University Press. In particular Antibodies I & II; DNA cloning I, II, III; Nucleic acid and protein sequence analysis; Nucleic acid hybridization; Nucleic acid sequencing; Oligonucleotide synthesis; Protein purification applications; Protein purification methods; Protein sequencing; Transcription and translation; Gel electrophoresis of nucleic acids; Gel electrophoresis of proteins; Genome analysis; HPLC of macromolecules; Human genetic diseases; Microcomputing in biology; Molecular neurobiology; Mutagenicity testing; Essential molecular biology I & II.
[0047]Proteome research: New frontiers in functional genomics, Eds. M. R. Wilkins et al., Springer.
[0048]The human endogenous retroviral sequence (SEQ ID NO: 3) situated on the long arm of chromosome 7 corresponds to the HERV-7q sequence; it has 10.5 kb (FIGS. 1 and 2) and satisfies the criteria defined above.
[0049]The search for domains exhibiting total or partial similarity with the gag and env regions of HERV-7q resulted in the identification of novel endogenous retroviral sequences. These sequences may have the structure of a complete endogenous retrovirus, such as the endogenous retroviral sequence situated close to the gene for the alpha and delta subunits of the T cell receptor, and consequently called HERV-TcR; by way of example, FIG. 7 shows the comparison of the nucleic alignments of the respective gag domains of HERV-7q and HERV-TcR (sequence HG12, SEQ ID NO: 19). Partial retroviral structures also exist. These retroviral domains, similar to HERV-7q, are identified in independent nucleic sequences as shown by their chromosomal location. Nucleic motifs (called here HEx or HGx, and analogous to env or gag type domains, respectively) resembling the env or gag domains of HERV-7q were found, with the aid of the above-mentioned databases:
[0050]HE2: chromosome 17 (SEQ ID NO: 4),
[0051]HE3 and HG3: chromosome 6 (SEQ ID NO: 5 and 6),
[0052]HE4: chromosome X (SEQ ID NO: 7),
[0053]HE5: chromosome X q22 (SEQ ID NO: 8),
[0054]HE6 and HG6: chromosome 1 q23.3-q24.3 (SEQ ID NO: 9 and 10),
[0055]HE7: chromosome 7 p15 (SEQ ID NO: 11),
[0056]HE8 and HG8: chromosome 19 (SEQ ID NO: 12 and 13),
[0057]HE9: chromosome X (SEQ ID NO: 14),
[0058]HE10: chromosome X q13.1-21.1 (SEQ ID NO: 15),
[0059]HE11 and HG11: chromosome 7 q21-22 (SEQ ID NO: 16 and 17),
[0060]HE12 and HG12, in HERV-TcR: chromosome 14 q11.2 (SEQ ID NO: 18 and 19),
[0061]HE13 (SEQ ID NO: 61): chromosome 6 q24.1-24.3
[0062]The present invention also includes the coding and noncoding fragments for all or part of enverin comprising at least 14 nucleotides and in particular the fragments encoding the C-terminal part of enverin, either from amino acid 291, or from amino acid 321, starting from the first methionine.
[0063]These fragments comprise in particular a critical zone where two inserts of 12 nucleotides were characterized:
[0064]a first insert (sequence A) was identified in individuals of 2 groups (patients and controls). This insert, situated between amino acids 487 and 488, results in the insertion of the tetrapeptide VLQM. A comparative analysis shows that this insert is also found in a homologous region situated in the sequence HE13, belonging to the HERV-7q family. The amplification of the HE13 type sequence could indicate that there is an impairment of the enverin sequence of HERV-7q, which would promote the amplification of the sequence contained in HE13. This observation also makes it possible to use this insert as a specific element for amplification of sequences of the HE13 type.
[0065]A second insert (sequence B) was identified in a patient with MS. The insert of 12 nucleotides is situated at the level of amino acid 495 and encodes the tetrapeptide MQSM. It is remarkable to observe that this insert is also identified in a homologous region situated in HE13.
Sequence A: TAAACTACAAATGGTTCTTCAAATGGAGCCCA (SEQ ID NO: 59)
Sequence B: GATGCAGTCCAAGATGCAGTCCATGACTAAGA (SEQ ID NO: 60)
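The tetrapeptides attributed to the two 12-nucleotide inserts can be checked by translating sequences A and B with the standard genetic code. The sketch below is a plain-Python illustration; the `translate` helper and the choice of frame (starting at the second nucleotide of each sequence) are assumptions made for this check and are not stated in the application.

```python
# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}

SEQ_A = "TAAACTACAAATGGTTCTTCAAATGGAGCCCA"  # SEQ ID NO: 59
SEQ_B = "GATGCAGTCCAAGATGCAGTCCATGACTAAGA"  # SEQ ID NO: 60


def translate(seq: str, frame: int = 0) -> str:
    """Translate complete codons of seq, starting at 0-based offset `frame`."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    for name, seq, peptide in (("A", SEQ_A, "VLQM"), ("B", SEQ_B, "MQSM")):
        for frame in range(3):
            product = translate(seq, frame)
            print(name, frame, product, peptide in product)
```

In the frame starting at the second nucleotide, sequence A reads KLQMVLQMEP and sequence B reads MQSKMQSMTK, so each contains the tetrapeptide named in the text (VLQM and MQSM respectively).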
[0066]These observations demonstrate modifications of the enverin sequence of the HERV-7q type which constitute the basis for a detection strategy by allele-specific amplification (AS-PCR), making it possible to detect these differences in a population and which could correspond either to a mutation/deletion associated with a degree of susceptibility, or to a polymorphism, or to a mutation/deletion associated with a pathological condition such as multiple sclerosis.
[0067]The alignments of the env (FIG. 8) and gag (FIG. 9) domains explain the levels of homology observed between the sequences described above and the homologous sequences in HERV-7q. The analogies can extend to the flanking retroviral motifs.
[0068]Analysis of the sequence tags available in databases shows that transcripts belonging to some members of this family, in particular HERV-7q, are essentially expressed in tissues of foetal or placental origin.
[0069]Polypeptide sequences generated by these transcripts can therefore potentially be produced, and biological functions or activities can be envisaged by analogy with biologically active polypeptides of viral or retroviral origin; for example, the peptide motifs of the CKS-17 type (Haraguchi et al., PNAS, 1995, 92, 5568-5571) (FIG. 5) or CKS-25 type (Huang S. S. and Huang J. S., J. Biol. Chem. 1998, 273, 4815-4818), which have immunomodulatory functions on the lymphocytic host cells. The differences in sequence which are observed, and possible normal or pathological modifications, are in particular responsible for modulation of the function.
[0070]HERV-7q represents the paradigm of the novel family of human endogenous retroviral sequences or of endogenous retroviral motifs.
[0071]HERV-7q and some of the endogenous retroviral sequences belonging to its family have a pol-type domain analogous to pol-type retroviral sequences such as for example the pol region identified in the MSRV retrovirus associated with multiple sclerosis and described by H. Perron et al. (1997, Proc. Natl. Acad. Sci. USA, 94, 7583-7588; International Application PCT WO 97/06260).
[0072]However, the sequences according to the present invention are distinguishable from the infectious exogenous retroviral sequences analogous to MSRV previously described in that the gag and env sequences according to the invention are significantly different according to the criteria defined above and as a function of certain specific characteristics, for example the long open reading frame of the env domain of HERV-7q; they could serve as a signature of a pathological condition when they carry insertions, deletions, reading frame shifts or mutations.
[0073]Indeed, the differences observed between the human sequences of the HERV-7q type, which are isolated from individuals reputed to be normal, and the sequences derived from some samples of pathological origin are not randomly distributed. Comparisons carried out between the gag region obtained from infectious retroviral particles (EMBL accession No.: A60168, A60200, A60201, A60171 and the like) and the corresponding gag sequence of HERV-7q (FIG. 9) make it possible to observe that the mutations preferentially affect non-sense codons. For example, two non-sense codons in HERV-7q are replaced by an arginine codon in A60200, which makes it possible to obtain a deduced sequence of 109 amino acids for HERV-7q and of 166 amino acids for A60200. The base changes consequently make it possible to extend the reading frame and to potentially encode larger sized polypeptide structures (FIG. 10).
[0074]Likewise, an env-type sequence obtained from infectious retroviral particles exhibits a significant analogy with the env domain of HERV-7q (FIG. 11). These marked analogies between exogenous and endogenous retroviral sequences could be responsible for the triggering or worsening of certain pathological processes, in particular certain autoimmune diseases such as multiple sclerosis. In this regard, it is possible to note that certain endogenous retroviral sequences described in the invention are situated close to or in regions reputed to exhibit susceptibility for multiple sclerosis: for example HERV-7q and the 7q21-22 region of chromosome 7; HE12 and HG12 in HERV-TcR and the region of the gene encoding the alpha and delta chains of the T cell receptor; HE2 and chromosome 17; HE3, HE13 and HG3 and chromosome 6; the sequences HE11 and HG11, around the region 7q21-22; or HE4, HE5, HE6, HE9, HE10 or HG10 on the X chromosome. These sequences would therefore be capable of providing the means for locating or identifying the genes for predisposition.
[0075]No significant homology is observed with endogenous retroviral sequences already described; on the other hand, a limited homology may be noted, which makes it possible to identify a general structure of the env domain; however, said homology is less than the criteria defined according to the invention between the env domains of the sequence HERV-7q (SEQ ID NO: 1) and the sequence HERV-9 (FIG. 12). FIG. 11 shows extensive homologies between the sequence HERV-7q with an exogenous retroviral sequence (accession No. EMBL: A60170).
[0076]The human endogenous retroviral sequences belonging to the HERV-7q family can protect against attacks linked to the environment or can be beneficial for the individual. This beneficial effect could be one of the possible reasons for the selection pressure exerted on some of these sequences and the potentially functional character of the deduced protein structures identified: for example the long open reading frame capable of encoding a novel protein and corresponding to the env domain of HERV-7q.
[0077]The human endogenous retroviral sequences belonging to the HERV-7q family could be associated, for example, with pathological conditions related to processes linked to cancer, to neuropathological conditions with an autoimmune component or to any other pathological process in association or otherwise with endogenous or exogenous viruses or retroviruses. Their action could be related to the outbreak, the worsening, the modification of the time of appearance or the protection against the disease.
[0078]In the context of application to autoimmune pathological conditions (such as for example lupus, Sjogren's syndrome, rheumatoid arthritis, multiple sclerosis and the like), significant analogies may be detected between the endogenous retroviral motifs identified and motifs found in retroviral structures characterized in patients with autoimmune pathological conditions such as multiple sclerosis; for example, fragments of gag domain (recently available in databases) obtained from infectious retroviral particles or the complete sequence of the pol domain corresponding to the MSRV virus associated with multiple sclerosis. These retroviral motifs possess significant analogies with homologous endogenous sequences of the HERV-7q type, which makes it possible to envisage direct or indirect association with pathological processes, including multiple sclerosis, in association or otherwise with MSRV.
[0079]The importance of these sequences goes beyond the context of autoimmune diseases. Apart from the general importance of retroviral motifs in the triggering or worsening of a tumor process, which is well established in particular in murine models (H. Fan in The Retroviridae, 1994, ed. J. A. Levy, Plenum, New York, p. 313-353), these sequences could be present close to or inside important genes and could alter the expression thereof: for example HERV-TcR and the genes for the alpha and delta subunits of the receptor for the T cells involved in disruptions of the immune system.
[0080]The present invention includes, in addition, the use of sequences combined with the sequences of the HERV-7q family for the detection and/or prognosis of various autoimmune diseases (neuropathological conditions in particular); these sequences encode all or part of a factor whose function, the regulation/de-regulation or alteration (polyadenylation, alternative splicing), is associated with the normal or pathological expression or with the regulation/deregulation of the motifs belonging to the HERV-7q family and correspond to transcripts or cDNAs of the nucleotide sequences encoding genes situated in regions flanking or delimiting retroviral sequences of the HERV-7q family.
[0081]The expression flanking region is understood to mean any region situated close to (contained in or including) an endogenous retroviral sequence belonging to the HERV-7q family, as defined above, up to and including the genes immediately contiguous and/or situated at a distance which cannot exceed 120 kb.
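The distance criterion in this definition can be expressed as an interval comparison on chromosomal coordinates. The following is a minimal sketch under stated assumptions: coordinates are inclusive positions on the same chromosome, the 120 kb limit is taken as 120 000 nucleotides measured between the nearest ends, and the function names and example coordinates are hypothetical.

```python
MAX_FLANK_NT = 120_000  # 120 kb limit from the definition, taken as 120 000 nt


def gap_between(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Distance between two intervals on one chromosome (0 if they overlap)."""
    if a_end < b_start:
        return b_start - a_end
    if b_end < a_start:
        return a_start - b_end
    return 0


def is_flanking(region: tuple, herv: tuple, max_gap: int = MAX_FLANK_NT) -> bool:
    """True if the region overlaps the HERV element or lies within max_gap of it."""
    return gap_between(*region, *herv) <= max_gap


if __name__ == "__main__":
    herv = (500_000, 510_500)            # hypothetical 10.5 kb element
    nearby_gene = (450_000, 491_000)     # hypothetical gene ending 9 kb upstream
    distant_gene = (100_000, 150_000)    # hypothetical gene 350 kb upstream
    print(is_flanking(nearby_gene, herv), is_flanking(distant_gene, herv))
```

The sketch covers only the distance part of the definition; deciding which genes are "immediately contiguous" would require a gene annotation that is not part of the text.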
[0082]The inventors have now found that the presence of the retroviral sequences as defined above disrupts the expression or impairs the structure of the flanking sequences defined below.
[0083]The transcripts of said flanking sequences (and fragments thereof, in particular those underlined or in italics in FIGS. 14-16, 22-26), as defined below:
[0084]at 1021 bp upstream of HERV-7q, there is identified an endogenous retroviral sequence called RH7 (SEQ ID NO: 62 and FIG. 22); this sequence is situated in 5' of the HERV-7q sequence; in FIG. 22, the portion in italics corresponds to the beginning of the HERV-7q sequence; the RH7 sequence is underlined; two putative polyadenylation sites are in bold. This sequence SEQ ID NO: 62 exhibits significant homology, on more than 6 kb, with RGH-type endogenous retroviral sequences (FIG. 13). Sequences belonging to this family are expressed in particular in patients with rheumatoid arthritis (Nakagawa et al., (1997), Arthritis Rheum., 40, 627-638). The present invention also includes fragments of the sequence SEQ ID NO: 62, comprising between 14 and 50 nucleotides (used as primers), preferably between 14 and 25 nucleotides, or at least 25 nucleotides (used as probe), which fragments have the following characteristics: the 4 nucleotides of the 3' end are different from the corresponding motifs of the sequence RGH2 (bottom sequence in FIG. 13, GenBank accession No.: D110 18),
[0085]at less than 9 kb upstream of HERV-7q, there is identified the sequence RAM75 (SEQ ID NO: 63 and FIG. 14) containing the 24 coding exons (which cover close to 41 kb) of the gene for peroxisomal ATPase PEX1. PEX1, in combination with PEX6, is responsible for the import of peroxisomal proteins and for stabilizing the PEX5 receptor. A disruption/alteration affecting PEX1 is responsible for various neuropathological conditions such as Zellweger syndrome, neonatal adrenoleukodystrophy and the infantile form of Refsum's disease (Reuber et al., (1997), Nature Genet., 17, 445-448). It can be recalled that the main function of the peroxisomes is associated with the metabolism of fatty acids, in particular by β-oxidation processes. Impairment of the gene identified in the sequence RAM75, or of its expression, by modification of the function of the regulatory 5' and 3' regions or by modification of the splicing or polyadenylation processes, in particular under the influence of neighboring retroviral motifs, would be able to disrupt the expression and the structure of the ATPase and consequently to disrupt one of the peroxisomal functions, in particular the metabolism of lipids, notably myelin lipids, with consequences for certain pathological conditions, including neuropathological conditions such as multiple sclerosis; the underlined portions (FIG. 14) correspond to the 24 coding exons.
[0086]The present invention also includes the fragments of the sequence SEQ ID NO: 63, included in the abovementioned 24 coding exons and comprising at least 14 nucleotides.
[0087]Analysis of the expression profile (transcripts and proteins) of the sequence RAM75 (SEQ ID NO: 63) is a good indicator for the differential diagnosis of neuropathological conditions with an autoimmune component.
[0088]In FIG. 14, the coding exons are underlined. The initiation and non-sense codons as well as the putative polyadenylation sites are in bold and underlined;
[0089]at 0.7 kb downstream of the sequence HERV-7q and over nearly 17 kb (SEQ ID NO: 64 and FIG. 15), there is identified the nucleotide sequence RAV73, in which there are detected sequence tags and potential exons capable of producing one or more polypeptide sequences; the invention also includes fragments of this sequence SEQ ID NO: 64 included in the sequence tags and the potential exons as they appear (underlined portions) in FIG. 15, which fragments comprise at least 14 nucleotides,
[0090]at 120 kb upstream of the sequence HG3, and over 15 kb, there is the nucleotide sequence RBP3 (SEQ ID NO: 65 and FIG. 23), which covers the 3' end of the gene encoding a transcription factor of the Blimp-1 family (SEQ ID NO: 119 and FIG. 25), a protein of 789 amino acids which is a repressor of the expression of the interferon-beta gene (Keller and Maniatis, Genes Dev., (1991), 5, 868-879), which is already associated with certain malignant pathological conditions (Mock et al., Genomics, (1996), 37, 24-28), and which could play a role in the differentiation and the pathogenesis of B cells. The possible association between the endogenous retroviral sequence containing the motifs HG3 and HE3 and Blimp-1 is of considerable interest in the case of pathological conditions, and in particular multiple sclerosis. Blimp-1 acts in particular on B cells, whose contribution to the inflammatory processes associated with multiple sclerosis is known. Blimp-1 is capable of blocking the viral induction of the IFNβ promoter; the capacity of IFNβ to reduce the frequency of attacks and the progression of lesions in patients with MS is known. Disruption in the expression or the structure of Blimp-1, in relation to a retroviral element of the HERV-7q type, is consequently associated with neuropathological conditions or with diseases having an autoimmune character, such as multiple sclerosis; this nucleotide sequence RBP3 (SEQ ID NO: 65) contains nucleotide motifs identified in the nucleic sequence encoding the Blimp-1 gene; the invention also includes the detection of the mRNA sequences for the Blimp-1 protein (SEQ ID NO: 119),
[0091]the endogenous retroviral sequence of the HERV-7q type, containing HE3 and HG3, is situated in the HI3 region corresponding to an intron extending over more than 46 kb (SEQ ID NO: 66), of a gene which could encode the analogue of APS (FIG. 24), a protein of 275 amino acids specific to apoptosis, overexpressed in various cells in culture after triggering an apoptotic process (Hammond et al., FEBS Lett., (1998), 425, 391-395). The intron is situated at the level of amino acid 231 of APS. The end of HE3 is at more than 12 kb from the 5' end of the intron, whereas HG3 is situated at more than 28 kb from the 3' end of the intron. Apoptotic processes are associated with multiple sclerosis. In particular, there has been described an apoptotic process affecting astrocytes and oligodendrocytes in the presence of a purified fraction of cerebrospinal fluid of patients suffering from multiple sclerosis (Menard et al., J. Neurol. Sci., (1998), 154, 209-221).
[0092]Finally, it should be stressed that the nucleic region containing HE3, HG3, HI3 and RBP3 is located at the level of the short arm of chromosome 6, in 6p21, which is a proposed region of susceptibility to multiple sclerosis (The Multiple Sclerosis Genetic Group, Nature Genet., (1996), 13, 469-472).
[0093]The interaction between the HERV-7q type sequences and the flanking sequences and the importance of establishing a profile of expression including one or more of the abovementioned sequences in order to establish a differential diagnosis of a neuro-pathological condition is even more evident because it is observed that the sequences HG12 and HE12 are situated in an intron region of the gene encoding the alpha and delta subunits of the T cell receptors. The T cell receptors are involved in the immune regulation process and their influence has been proposed in the case of autoimmune diseases, including multiple sclerosis.
[0094]The subject of the invention is also transcripts generated from the abovementioned sequences as well as those optionally exhibiting modifications in the reference sequences described in the invention when they are expressed in certain patients.
[0095]Indeed, the systems for regulating the expression of the retroviral proteins of HERV-7q, which are present in the LTR type motifs, could influence the expression of genes situated in the close or distant chromosomal vicinity and could induce disruptions of an immunological and/or neurological character. For example, the endogenous retroviral sequence HERV-TcR exists in the immediate vicinity of the genes for the alpha and delta subunits of the T cell receptor previously described. The LTR-type motifs could also encode superantigens (Acha-Orbea and Palmer, 1991, Immunol. Today, 12, 356-361). In general, retroviral proteins of the HERV-7q or related type, or their truncated or partial forms could be involved in cytotoxicity or superantigenicity phenomena, such as for example those derived from the long open reading frame identified in the env domain (FIG. 4).
[0096]Sequences of the HERV-7q 5' and 3' LTR type, which are highly conserved, are involved in such regulatory effects. By way of example, LTX is described, which is a sequence comparable to that of an HERV-7q LTR (SEQ ID NO: 67 and FIG. 16), and which is present in the center of an intron of more than 49 kb, but at 2 kb from the donor 5' site of the FMR2 gene associated with fragile X and encoding a protein of 1311 amino acids (FIG. 26). The LTRs modulate the alternative splicing (Kapitonov and Jurka, (1999), J. Mol. Evol., 48, 248-251), the expression of the gene, the binding to nuclear proteins (Akopov et al., (1998), FEBS Lett., 421, 229-233), or allow the production of an alternative polyadenylation signal (Goodchild et al., (1992), Gene, 121, 287-294).
[0097]In general, there may be noted the existence of several endogenous retroviral sequences of the HERV-7q type (HE4, HE5, HE9, HE10), situated at the level of chromosome X which represents the chromosome associated with the largest number of pathological conditions.
[0098]In this regard, it should be noted that retroviral motifs derived from defective regions are capable of having biological functions; for example, the envelope protein p15E, derived from defective retroviral motifs, possesses anti-inflammatory and immunosuppressive activity (Snyderman and Cianciolo, 1984, Immunol. Today, 5, 240-244).
[0099]These structures are probably capable of causing breaks or of amplifying deregulations in the immune defense processes. Some of the motifs of the gag, env and LTR-type domains may be associated with a particular function or may contribute to the normal or pathological function of the flanking domains as defined above (SEQ ID NO: 62-67). Recombinations with an element of exogenous, retroviral origin or otherwise can give rise to the production of nucleic or protein motifs which could either protect or trigger or promote or worsen a pathological condition. Likewise, a retroviral structure containing endogenous retroviral elements according to the invention would be capable of causing a pathological process after passing through an exogenous transient cycle followed by reintegration into a sensitive or critical region of the human genome.
[0100]It is thus possible to obtain expression profiles (transcripts and optionally proteins) which correspond to the abovementioned neuropathological conditions.
[0101]Likewise, the combination of motifs belonging to the HERV-7q family, or of elements induced by motifs belonging to the HERV-7q family, with motifs of exogenous origin or induced exogenously would be capable of triggering or worsening a pathological process or on the contrary of promoting protection or partial remission or a complete and permanent cure.
[0102]The detection of HERV-7q type domains, which is now made possible, suggests applications at the prophylactic, prognostic and diagnostic levels; for example, immunological approaches or gene amplification, which make it possible to compare patients with normal individuals serving as reference, would be capable of promoting screening, of improving early detection of the outbreak of the disease and/or of monitoring the progression of a pathological condition in patients who may exhibit a susceptibility, in patients in whom there has been an outbreak of the disease, or in individuals considered to be normal on the basis of current clinical criteria.
[0103]The specific nucleic and immunological probes, as defined in the present invention, are capable of promoting the identification and detection of motifs which are abnormally expressed in the context of pathological conditions associated with cancer, or of neuropathological conditions, in particular autoimmune pathological conditions, at the forefront of which is multiple sclerosis.
[0104]The subject of the present invention is also hybrid nucleic sequences, characterized in that they comprise sequences or motifs belonging to the HERV-7q family, or of elements induced by motifs belonging to the HERV-7q family, with motifs of exogenous origin or induced exogenously (exogenous retroviral sequences); such hybrid sequences are probably capable of triggering or worsening a pathological process or on the contrary of promoting protection or partial remission or a complete and permanent cure.
[0105]The subject of the present invention is also a diagnostic reagent for the differential detection of complete or partial human endogenous nucleic sequences, having retroviral motifs, selected from the sequences SEQ ID NO: 1 and/or SEQ ID NO: 2, characterized in that it is selected from the group consisting of the sequences SEQ ID NO: 1-22, 28, 37-57, 59-61 and 121-122, the complementary nucleic sequences and the reverse sequences complementary to the preceding sequences, of nucleotide fragments capable of defining or of identifying the sequences SEQ ID NO: 1 and/or SEQ ID NO: 2 and any flanking sequence or any sequence overlapping them as well as of fragments derived from the coding regions of the sequences SEQ ID NO: 1-22 and 61, corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences, optionally labeled with an appropriate marker as well as of sequences as defined in FIGS. 18-21.
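By way of illustration only, the notion of fragments "corresponding to a shifting frame greater than or equal to 14 nucleotides" may be read as a sliding window moved one nucleotide at a time along a sequence. The following sketch assumes that reading; the function name `shifting_frame_fragments` and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
def shifting_frame_fragments(sequence, size=14):
    """Return every contiguous fragment of `size` nucleotides.

    Assumption: a "shifting frame" is a window that advances one
    nucleotide at a time; the minimum size of 14 follows the text.
    """
    if size < 14:
        raise ValueError("fragments must comprise at least 14 nucleotides")
    # An empty list is returned when the sequence is shorter than the window.
    return [sequence[i:i + size] for i in range(len(sequence) - size + 1)]


# Toy 16-nucleotide sequence (hypothetical, for illustration only)
toy = "ACGTACGTACGTACGT"
fragments = shifting_frame_fragments(toy)
print(len(fragments), fragments[0], fragments[-1])
```

A sequence of n nucleotides thus yields n - 13 fragments of 14 nucleotides; the complementary fragments would be obtained in the same way from the complementary strand.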
[0106]The sequences of the nucleic, ribonucleic and oligonucleotide probes used will be chosen from the env and gag regions or their flanking regions; for example the oligonucleotide primers for HERV-7q will be chosen from the regions situated between nucleotides 3065 and 4390, nucleotides 6965 and 9550 or nucleotides 2502-2865 of SEQ ID NO: 3, as well as from any adjacent sequence (upstream or downstream) capable of allowing specific amplification (FIG. 1).
[0107]Among the appropriate markers, there may be mentioned radioactive isotopes, enzymes, fluorochromes, chemical markers (biotin), haptens (digoxigenin) and antibodies or appropriate base analogues.
[0108]Preferably:
[0109]said reagent is selected from the sequences SEQ ID NO: 37-57 and is capable of being used as a primer,
[0110]said reagent is selected from the following sequences:
[0111]a fragment of 1505 nt amplified by the pair of primers SEQ ID NO: 37 and SEQ ID NO: 38 (primers G1F and G1R),
[0112]a fragment of 2529 nt amplified by the pair of primers SEQ ID NO: 45 and SEQ ID NO: 46 (primers E1F and E1R),
[0113]a fragment of 182 nucleotides, repeated twice, situated upstream of the gag domain at positions 2502-2611/2613-2865,
[0114]fragments encoding or not encoding all or part of enverin, comprising at least 14 nucleotides and in particular the fragments encoding the C-terminal portion of enverin, either from amino acid 291, or from amino acid 321, starting from the first methionine,
[0115]and is capable of being used as a probe.
[0116]The subject of the present invention is also a method for the rapid and differential detection of the endogenous retroviral nucleic sequences of the env or env and gag type, their normal or pathological variants, by hybridization and/or gene amplification, carried out using a biological sample, which method is characterized in that it comprises:
[0117](a) a step in which a biological sample to be analysed is brought into contact with at least one probe as defined above, and
[0118](b) a step in which the product(s) resulting from the nucleotide sequence-probe interaction is detected by any appropriate means.
[0119]Said method may further comprise:
[0120]prior to step (a):
[0121]a step of preparing the relevant biological tissue or fluid,
[0122]a step of extracting the nucleic acid to be detected, and
[0123]at least one gene amplification cycle,
and subsequent to step (b):
[0124]a step of comparing the nucleic sequences obtained in said biological sample with the human endogenous retroviral sequences according to the invention by any appropriate means and in particular by sequencing, Southern blotting, restriction cleavage, SSCP or any other method which makes it possible to identify an insertion, a deletion or a point mutation between the various sequences compared.
[0125]In accordance with the invention, the human endogenous retroviral sequences according to the invention are thus compared with the nucleic sequences present in the biological sample to be analysed and allow the detection of homologous sequences from patients suffering from pathological conditions likely to involve a modification of their genome.
[0126]Advantageously, said gene comparisons are carried out using genomic DNA obtained from control individuals and from patients.
[0127]A conventional gene amplification by PCR will be carried out with the aid of 5'-sense and 3'-antisense primers delimiting or comprising the zone to be studied (env zone or gag zone).
[0128]Also advantageously, the sequences of the nucleic, ribonucleic and oligonucleotide probes used are chosen from the env and gag regions or their flanking regions; for example the oligonucleotides which are primers for HERV-7q will be chosen from the regions situated between nucleotides 3065 and 4390 and nucleotides 6965 and 9550, and from any adjacent sequence (upstream or downstream) capable of allowing specific amplification (FIG. 1), as specified above. They are preferably selected from the group consisting of
[0129]a fragment of 1505 nt amplified by the pair of primers SEQ ID NO: 37 and SEQ ID NO: 38 (primers G1F and G1R),
[0130]a fragment of 2529 nt amplified by the pair of primers SEQ ID NO: 45 and SEQ ID NO: 46 (primers E1F and E1R).
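By way of illustration only, the amplification of a fragment delimited by a 5'-sense primer and a 3'-antisense primer may be sketched in silico as follows. The template and primers below are toy sequences invented for the example (they are not SEQ ID NO: 37, 38, 45 or 46), exact matching is assumed, and the function names are hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon(template, sense_primer, antisense_primer):
    """Return the region delimited by a primer pair, or None if a primer
    does not match. Assumption: perfect matches on a single template strand."""
    start = template.find(sense_primer)
    if start == -1:
        return None
    # The antisense primer anneals to the sense strand, so its reverse
    # complement is searched downstream of the sense primer.
    site = reverse_complement(antisense_primer)
    end = template.find(site, start + len(sense_primer))
    if end == -1:
        return None
    return template[start:end + len(site)]


# Toy template and primers (hypothetical)
template = "TTTACGTACGGGGGCATTGAAAA"
product = amplicon(template, "ACGTAC", "TCAATG")
print(product, len(product))
```

The length of the returned string corresponds to the expected size of the amplified fragment, which is the quantity quoted above for each primer pair.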
[0131]The gene amplification step is in particular carried out with the aid of one of the following gene amplification techniques: amplification using Qβ-replicase, PCR, LCR, ERA, CPR or SDA.
[0132]The subject of the present invention is also chimeric sequences, characterized in that they consist of a fragment of 17 to 40 nucleotides of a flanking sequence as defined above combined with an endogenous retroviral motif of the HERV-7q type comprising between 17 and 40 nucleotides, as defined above.
[0133]The subject of the present invention is also a method of detecting transcripts as defined above, characterized in that it comprises: [0134]collecting messenger RNAs obtained from control biological samples (biological tissues, cells or fluids) and from a similar sample collected from patients, and [0135]the qualitative and/or quantitative analysis of said mRNAs by in situ hybridization, by dot-blot, Northern blotting, RNAse mapping or RT-PCR, with the aid of a diagnostic reagent as defined above.
[0136]The subject of the present invention is also a method for the detection and/or evaluation of an overexpression/underexpression or of a modification of at least one of the endogenous retroviral sequences or fragments of sequences of the HERV-7q type and/or of their associated flanking sequences, characterized in that it comprises:
[0137]depositing on an appropriate support, such as for example a nylon filter, a glass slide or their equivalent, cDNA or its equivalent obtained from clones, PCR products obtained from genomic DNA, RT-PCR products obtained from transcripts or from specific oligonucleotide sequences, said DNA sequences being endogenous retroviral sequences or fragments of sequences of the HERV-7q type and/or their flanking sequences, as defined above, consisting of transcripts and cDNAs of the genomic sequences, which encode all or part of a factor, whose function, regulation/de-regulation or alteration is associated with the normal or pathological expression or with the regulation/deregulation of motifs belonging to said HERV-7q family, these sequences corresponding to nucleotide sequences encoding genes situated in flanking regions situated upstream and/or downstream of a retroviral sequence of said HERV-7q family and in which one of the ends cannot be at a distance exceeding 120 kb, and/or a chimeric sequence as defined above,
[0138]the hybridization of said support with at least one appropriately labeled probe obtained, for example, by reverse transcription of an RNA mixture obtained from biological cells, tissues or fluids obtained from controls reputed to be normal, from members of various ethnic populations, or from patients suffering from pathological conditions often associated with expression of retroviruses, such as tumor processes or autoimmune diseases, and
[0139]the detection of the hybrids formed.
[0140]According to an advantageous embodiment of said method, said transcript or cDNA is selected from the group consisting of the sequences SEQ ID NO: 62-67 and 119 and their fragments corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences.
[0141]According to another advantageous embodiment of said method, said support comprises, in addition, any endogenous or exogenous retroviral sequence.
[0142]The DNA chip method (Bowtell, (1999), Nature Genet., 21, 25-32) is used to evaluate modifications in the expression of all or part of some of the sequences of retroviral origin of the HERV-7q type and of the flanking sequences. Briefly, DNA obtained from clones, PCR products obtained from genomic DNA, RT-PCR products obtained from transcripts, or specific oligonucleotide sequences are deposited on a support, such as for example a nylon filter, a glass slide or their equivalent. The deposited nucleic sequences cover the various retroviral domains described above, as well as the contiguous sequences and the flanking genes. In order to detect possible alternative splicing processes, specific DNAs are synthesized in steps of 500-600 nucleotides, with an overlap of 250-300 nucleotides on either side. The alternative splicings already identified will be the subject of a specific synthesis. The hybridization is carried out with the aid of a probe obtained, for example, by reverse transcription of an RNA mixture obtained from biological cells, tissues or fluids from controls reputed to be normal, from members of the various ethnic populations, or from patients suffering from pathological conditions often associated with expression of retroviruses, such as tumor processes or autoimmune diseases, including multiple sclerosis. In this case, from a fraction of a μg up to a few μg of mRNA, or from a few μg up to a few tens of μg of RNA, depending on the method used and the size of the DNA chip involved, are sufficient for the synthesis of the nucleic probe. The nucleic probe is suitably labeled so as to allow subsequent detection, for example by fluorescence or by an equivalent method.
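By way of illustration only, the overlapping layout of the deposited DNAs may be sketched as follows. The sketch assumes one admissible choice within the stated ranges (segments of 500 nucleotides overlapping their neighbors by 250 nucleotides) and uses 0-based, end-exclusive coordinates; the function name `tile_segments` is hypothetical.

```python
def tile_segments(region_length, size=500, overlap=250):
    """Return (start, end) coordinates of overlapping segments covering a region.

    Assumptions: 0-based, end-exclusive coordinates; consecutive segments
    advance by `size - overlap`; the last segment is truncated at the end
    of the region.
    """
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    if region_length <= 0:
        return []
    step = size - overlap
    segments = []
    start = 0
    while True:
        end = min(start + size, region_length)
        segments.append((start, end))
        if end == region_length:
            break
        start += step
    return segments


# A hypothetical region of 1200 nucleotides
print(tile_segments(1200))
```

With these values, every internal nucleotide is covered by two segments, so that an exon boundary falling near the edge of one segment lies well inside the neighboring one.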
[0143]The use of bi- or even multicolored probes makes it possible to specify the concerted expression of several genes in parallel, while taking advantage, furthermore, of a precise normalization. The results are acquired automatically, such as for example by a laser scanning system or its equivalent.
[0144]Two types of DNA chips are designed, on the one hand chips having an exhaustive set of sequences, and on the other hand specific DNA chips enabling targeting to a more specific application.
[0145]For example, a critical sequence, that is to say one containing a difference such as a deletion or a mutation, is detected with the aid of specific oligonucleotides (Wang et al., (1998), Science, 280, 1077-1082). The polymorphism associated with a base or with a mutation is detected with the aid of four oligonucleotides, each possessing one of the four sequence possibilities at the level of a base (A, C, G or T); for each point difference, the 4 oligonucleotides are deposited and the hybridization intensities are compared. Furthermore, an alternative splicing is detected using DNAs corresponding to a single effective or putative exon; the gene is therefore analyzed exon by exon. The DNA chips also relate, by extension, to any endogenous or exogenous retroviral sequence (such as for example ERV-9, ERV-K, ERV-L, ERV-H, ERV-4, ERV-6, ERV-8, ERV-10, ERV-15, ERV-16, ERV-17, ERV-18, ERV-21, ERV-24, ERV-33, ERV-34, ERV-36, ERV-40, ERV-42, ERV-MLN, ERV-FRD, ERV-FTD and the like), as well as to all the putative exon sequences (identified by the existence of sequence tags and corresponding transcripts) or effective exon sequences which are situated on either side, up to a distance of 120 kb, of the endogenous retroviral sequences of the HERV-7q type.
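By way of illustration only, the design of the four oligonucleotides for a point difference, and the comparison of their hybridization intensities, may be sketched as follows. The reference sequence, the flank length and the intensity values are hypothetical, and the function names are introduced for this example only.

```python
def allele_probes(reference, position, flank=7):
    """Return four oligonucleotides differing only at `position` (0-based).

    Assumption: each oligonucleotide is centered on the interrogated base
    with `flank` reference nucleotides on either side, where available.
    """
    left = reference[max(0, position - flank):position]
    right = reference[position + 1:position + 1 + flank]
    return {base: left + base + right for base in "ACGT"}


def call_base(intensities):
    """Return the base whose oligonucleotide gives the strongest signal."""
    return max(intensities, key=intensities.get)


# Toy reference and toy intensities (hypothetical)
reference = "AAAACCCCGGGGTTTT"
probes = allele_probes(reference, position=8, flank=3)
print(probes)
print(call_base({"A": 0.1, "C": 0.2, "G": 0.9, "T": 0.1}))
```

In practice the decision would rest on the ratio between the strongest and the next strongest signal rather than on the maximum alone; that refinement is left out of the sketch.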
[0146]The comparative study is carried out between a control sample and the sample to be tested, in a prophylactic, diagnostic or therapeutic perspective, such as for example the early detection of a modification of the expression of one of the sequences, in a cell, a tissue or an organism, the identification of a sequence associated with a susceptibility or with any pathological condition, the monitoring of the progression of the pathological condition or the monitoring of a treatment and the evaluation of its efficacy.
[0147]Apart from the applications already mentioned, the method makes it possible, more generally, to make an assessment of the changes observed in an individual, which constitutes to a certain extent an identity card; this facilitates an epidemiological approach, which makes it possible to establish novel correlations between a particular observed profile and a pathological condition, in the absence of any a priori assumption regarding this pathological condition.
[0148]The subject of the present invention is also a kit for the detection and/or evaluation of an auto-immune disease and in particular of neuropathological conditions with an autoimmune etiology, characterized in that it comprises, in addition to the buffers necessary for carrying out the methods as defined above: [0149]diagnostic reagents A as defined above, and [0150]reagents B consisting of the transcripts and cDNAs of the genomic sequences, which encode all or part of a factor, whose function, regulation/de-regulation or alteration is associated with the normal or pathological expression or with the regulation/de-regulation of motifs belonging to said HERV-7q family, these sequences corresponding to nucleotide sequences encoding genes situated in flanking regions situated upstream and/or downstream of a retroviral sequence of said HERV-7q family, of which one of the ends cannot be at a distance exceeding 120 kb, [0151]which reagents are preferably attached to an appropriate support.
[0152]According to an advantageous embodiment of said kit, said reagents B are selected from the group consisting of the sequences SEQ ID NO: 62-67 and 119 and their fragments corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences, as well as the sequences represented in FIGS. 13-17, 22-26.
[0153]The subject of the present invention is also products of translation, characterized in that they are encoded by a nucleotide sequence as defined above.
[0154]The subject of the present invention is also a peptide, characterized in that it is capable of being expressed with the aid of a nucleotide sequence selected from the group consisting of the sequences SEQ ID NO: 1-22, 28 and 61, as defined above, according to the combinations offered by the use of the various possible reading frames (see also FIGS. 18-21).
[0155]Said peptide also includes the derived peptides or polypeptides comprising between 5 and 540 amino acids (SEQ ID NO: 23-36 and SEQ ID NO: 58 and their fragments of at least 5 amino acids) and in particular a fragment of 538 amino acids, starting at the first methionine of the sequence SEQ ID NO: 26 (enverin).
[0156]According to an advantageous embodiment of said peptides they are in particular selected from the sequences SEQ ID NO: 23-36, 58, in particular the sequence SEQ ID NO: 26 and its C-terminal fragments, either from the amino acid 291, or from the amino acid 321, starting from the first methionine.
[0157]According to another advantageous embodiment of said peptides, they are obtained from nucleic sequences as defined above, in which at least one non-sense codon may be replaced with a codon encoding one of the following amino acids: Phe (F), Leu (L), Ser (S), Tyr (Y), Cys (C), Trp (W), Gln (Q), Arg (R), Lys (K), Glu (E) or Gly (G).
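By way of illustration only, the amino acids that can take the place of a non-sense codon may be enumerated as follows. The sketch assumes the standard genetic code and, as a simplifying assumption not stated in the text, a single-nucleotide substitution in the non-sense codon; the names used are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
STOP_CODONS = [codon for codon, aa in CODON_TABLE.items() if aa == "*"]


def single_base_replacements(stop_codon):
    """Map each sense codon one substitution away from `stop_codon`
    to the amino acid it encodes."""
    result = {}
    for index, base in product(range(3), BASES):
        if base == stop_codon[index]:
            continue
        codon = stop_codon[:index] + base + stop_codon[index + 1:]
        if CODON_TABLE[codon] != "*":
            result[codon] = CODON_TABLE[codon]
    return result


reachable = sorted(
    {aa for stop in STOP_CODONS for aa in single_base_replacements(stop).values()}
)
print(STOP_CODONS, "".join(reachable))
```

Under this single-substitution assumption, the sketch yields Cys, Glu, Gly, Lys, Leu, Gln, Arg, Ser, Trp and Tyr, that is, every residue of the above list except Phe, whose codons (TTT, TTC) differ from each non-sense codon at two positions.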
[0158]The invention thus includes the deduced peptides or the deduced proteins corresponding to all or part of the nucleic sequences described in the invention, and optionally exhibiting modifications with the reference sequences described in the invention, when they are expressed in some patients. In particular, the invention includes the complete or partial sequences obtained according to the 3 sense reading frames and the 3 reverse and complementary reading frames (see FIGS. 18-21).
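By way of illustration only, the 3 sense reading frames and the 3 reverse and complementary reading frames may be obtained as sketched below. The standard genetic code is assumed, non-sense codons are written "*", and the short sequence is a toy example, not one of the sequences of the invention; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons only; a trailing partial codon is ignored."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def six_frames(seq):
    """Return translations keyed by (strand, offset) for the six frames."""
    frames = {}
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[(strand, offset)] = translate(strand_seq[offset:])
    return frames


# Toy sequence (hypothetical)
frames = six_frames("ATGGCCTAA")
for key, peptide in frames.items():
    print(key, peptide)
```

Each "*" in a frame marks a non-sense codon; the deduced peptides are the segments lying between two such marks.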
[0159]Advantageously, the analysis of the structure of the env domain of HERV-7q, called enverin, made it possible to demonstrate successively:
[0160]an N-terminal signal peptide (region 1-21) and two transmembrane domains (region 320-340; 455-477), responsible for interactions with membrane lipid or protein motifs,
[0161]an immunomodulatory motif of the CKS-17 (Haraguchi et al., (1995), 92, 5568-5571)/CKS-25 type. It is possible to note, in this regard, the presence of an RalD motif inside the peptide of the CKS-17/CKS-25 type of HERV-7q and a motif RvaD at position 363 which correspond to the consensus W/RxxD, proposed for the active site of the TGF-βs (Huang et al., J. Biol. Chem., 1997, 272, 27155-27159), potent factors associated with growth, with differentiation and with morphogenesis and which are associated with many human pathological conditions, such as tumor processes (Tang et al., (1998), Nat. Med., 4, 802-807) or neuro-degenerative diseases (Flanders et al., (1998), Prog. Neurobiol., 54, 71-85). The peptides according to the invention containing these motifs can advantageously serve as antagonists by inhibiting the attachment of the TGF-βs to their natural receptors,
[0162]N-glycosylation motifs. The glycosylation of the envelope proteins of retroviruses appears to be directly associated with their functional properties, for example by influencing the number of determinants available in the T cells or by promoting recognition of antigens by the T cells. Glycosylation could play a role in the outbreak or the spread of a pathological condition with an autoimmune component. The glycosylations are necessary for maintaining the conformation of certain epitopes, in particular during the production of a recombinant envelope protein so as to develop a diagnostic reagent and to promote the efficacy of a possible vaccine. Positions 171, 210, 216, 236, 244, 283 and 411. Expected number at random: 3.2,
[0163]prenylation sites. Prenylation is an essential mechanism for attachment to the cell membrane and for the targeting of certain proteins. This targeting process could be essential for the production of specific therapeutic agents capable of interfering with the production and regulation of the traffic of cellular complexes calling into play proteins involved in the cell interactions, growth and movement. Positions 188 and 290. Expected number at random: 1.8,
[0164]targeting sites in the endoplasmic reticulum. These sites could make it possible to bring about the targeting toward the endoplasmic reticulum in order to carry out the modifications necessary for promoting membrane crossing. Positions 353 and 431. Expected number at random: 0.2.
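By way of illustration only, the search for short motifs of the kind discussed above may be sketched with regular expressions. The two patterns below are assumptions: the W/RxxD consensus is written as `[WR]..D`, and the N-glycosylation motif is taken to be the commonly used sequon N-X-S/T with X other than proline, which is not necessarily the definition used for the analysis reported above. The peptides are taken from Table I, and the numbering convention of the output (position of the first residue of the motif) is a choice of this sketch.

```python
import re

# Hypothetical motif definitions (see the assumptions stated above)
MOTIFS = {
    "N-glycosylation sequon": r"N[^P][ST]",
    "W/RxxD consensus": r"[WR]..D",
}


def scan_motifs(peptide, first_position=1):
    """Return (motif name, position of first motif residue, matched text).

    `first_position` is the location in enverin of the first amino acid of
    the peptide, as given in the Location column of Table I.
    """
    hits = []
    for name, pattern in MOTIFS.items():
        # A lookahead is used so that overlapping occurrences are reported.
        for match in re.finditer(f"(?=({pattern}))", peptide):
            hits.append((name, first_position + match.start(), match.group(1)))
    return hits


print(scan_motifs("ERVADSLVTL", 359))  # SEQ ID NO: 101
print(scan_motifs("RRALDLLTA", 383))   # SEQ ID NO: 97
print(scan_motifs("CYYVNQSGI", 405))   # SEQ ID NO: 83
```

Applied to the complete enverin sequence (SEQ ID NO: 26) with `first_position=1`, the same function would list every candidate occurrence of both motifs.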
[0165]Moreover, the inventors have shown that a number of peptides derived from the env protein of HERV-7q (enverin) have a high affinity/half-life for the class I HLA alleles. CADD analysis has made it possible to select candidate peptides, for which the best scores are indicated in Table I:
TABLE I
Location  Sequence     HLA molecule  Score  Sequence No.
399       FLGEECCYYV   A-0201        7214   SEQ ID NO: 68
462       LLFGPCIFNL   A-0201        1792   SEQ ID NO: 69
189       CLPLNFRPYV   A-0201        1453   SEQ ID NO: 70
439       GLLSQWMPWI   A-0201        488    SEQ ID NO: 71
263       CLPSGIFFV    A-0201        5103   SEQ ID NO: 72
444       WMPWILPFL    A-0201        897    SEQ ID NO: 73
252       IRWVTPPTQI   B-2705        3000   SEQ ID NO: 74
432       LRNTGPWGLL   B-2705        2000   SEQ ID NO: 75
158       LRTHTRLVSL   B-2705        2000   SEQ ID NO: 76
316       KRVPILPFVI   B-2705        1800   SEQ ID NO: 77
25        CRCMTSSSPY   B-2705        1000   SEQ ID NO: 78
137       TRVHGTSSPY   B-2705        1000   SEQ ID NO: 79
124       AREKHVKEVI   B-2705        600    SEQ ID NO: 80
478       SRIEAVKLQM   B-2705        600    SEQ ID NO: 81
442       SQWMPWILPF   B-2705        500    SEQ ID NO: 82
405       CYYVNQSGI    Kd            2400   SEQ ID NO: 83
346       FYYKLSQEL    Kd            2400   SEQ ID NO: 84
244       TYTTNSQCI    Kd            2400   SEQ ID NO: 85
291       SFLVPPMTI    Kd            1600   SEQ ID NO: 86
406       YYVNQSGIV    Kd            1200   SEQ ID NO: 87
167       LFNTTLTGL    Kd            1152   SEQ ID NO: 88
463       LFGPCIFNL    Kd            960    SEQ ID NO: 89
253       RWVTPPTQI    Kd            480    SEQ ID NO: 90
449       LPFLGPLAAI   B-5102        2200   SEQ ID NO: 91
3         LPYHIFLFTV   B-5102        1210   SEQ ID NO: 92
331       GALGTGIGGI   B-5102        798    SEQ ID NO: 93
321       LPFVIGAGVL   B-5102        550    SEQ ID NO: 94
499       RRPLDRPAS    B-2705        600    SEQ ID NO: 95
194       FRPYVSIPV    B-2705        600    SEQ ID NO: 96
383       RRALDLLTA    B-2705        600    SEQ ID NO: 97
39        WRNQRPGNI    B-2705        600    SEQ ID NO: 98
423       DRIQRRAEEL   B14           1800   SEQ ID NO: 99
158       LRTHTRLVSL   B14           600    SEQ ID NO: 100
359       ERVADSLVTL   B14           540    SEQ ID NO: 101
463       LFGPCIFNLL   Kd            1658   SEQ ID NO: 102
345       QFYYKLSQEL   Kd            1152   SEQ ID NO: 103
443       QWMPWILPFL   Kd            691    SEQ ID NO: 104
405       CYYVNQSGIV   Kd            500    SEQ ID NO: 105
474       NFVSSRIEAV   Kd            480    SEQ ID NO: 106
221       GPLVSNLEI    B-5102        1320   SEQ ID NO: 107
190       LPLNFRPYV    B-5102        726    SEQ ID NO: 108
449       LPFLGPLAAI   B-5101        1144   SEQ ID NO: 109
488       EPKMQSKTKI   B-5101        968    SEQ ID NO: 110
3         LPYHFLFTV    B-5101        629    SEQ ID NO: 111
125       REKHVKEVI    Kk            1000   SEQ ID NO: 112
312       KPRNKRVPIL   B7            800    SEQ ID NO: 113
378       VVLQNRRAL    Db            792    SEQ ID NO: 114
377       AVVLQNRRAL   Db            660    SEQ ID NO: 115
321       LPFVIGAGV    B-5101        629    SEQ ID NO: 116
304       DLYSYVISK    A3            540    SEQ ID NO: 117
301       TEQDLYSYVI   Kk            500    SEQ ID NO: 118
[0166]This Table I gives an estimate of the dissociation half-life of a peptide of enverin bound to an allele of the class I HLA system (calculated from the tables of Parker coefficients: J. Immunol. (1994), 152, 163-175). The location indicates the position, in the enverin sequence, of the first amino acid of each peptide tested. The one-letter code is used for the amino acid sequence. Scores of around 500 or greater were selected. By way of comparison, an analysis was carried out on a concatenation of peptides (polypeptide of 4968 amino acids) reputed to bind the molecules of the class I major histocompatibility complex (Rammensee, Immunogenetics, (1995), 41, 178-228); the ten best scores recorded for nonapeptides and the HLA type A-0201 are respectively 4984, 4047, 2406, 1267, 800, 705, 607, 591, 591 and 577.
[0167]It can be seen from this Table I that some molecules of the type I major histocompatibility complex are capable of binding peptides derived from enverin, thus assimilated with peptides of viral or tumor origin, at the level of the endoplasmic reticulum. The complexes formed at the level of the endoplasmic reticulum are then transported to the cell surface, which causes the destruction of the target cell by the cytotoxic T lymphocytes. The peptides identified generally comprise 8 to 10 amino acids. Studies have shown that some alleles of the class I HLA system are thus associated with certain pathologies, in particular with an autoimmune character, such as HLA-B27 with rheumatoid spondylitis or HLA-B51 with Behcet's disease.
[0168]A peptide capable of binding a particular class I molecule is consequently capable of functioning as a T cell epitope.
[0169]Consequently, the present invention also includes the fragments 399-471 and 244-271 of enverin which advantageously group together several epitopes having high affinity for various haplotypes of the class I HLA system. The use of all or some of these polypeptides is consequently capable of promoting an increase in the T cell repertoire, by allowing better efficacy of the immune response in the context of the various immunotherapeutic, prophylactic or vaccine strategies. These polypeptides may be advantageously delivered for example by the use of viral vectors, viral or synthetic particles, lipopeptides, conventional adjuvants, naked nucleic acids or nucleic acids adsorbed on particles, or liposomes.
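By way of illustration only, the grouping of epitopes in the fragments 399-471 and 244-271 can be checked against Table I with a short script. The following is a minimal sketch: the rows are transcribed from Table I, while the function names `parse_table` and `peptides_within` and the data layout are assumptions made for this example and form no part of the described method.

```python
# Rows transcribed from Table I: location, sequence, HLA molecule, score.
TABLE_I = """
399 FLGEECCYYV A-0201 7214; 462 LLFGPCIFNL A-0201 1792; 189 CLPLNFRPYV A-0201 1453
439 GLLSQWMPWI A-0201 488; 263 CLPSGIFFV A-0201 5103; 444 WMPWILPFL A-0201 897
252 IRWVTPPTQI B-2705 3000; 432 LRNTGPWGLL B-2705 2000; 158 LRTHTRLVSL B-2705 2000
316 KRVPILPFVI B-2705 1800; 25 CRCMTSSSPY B-2705 1000; 137 TRVHGTSSPY B-2705 1000
124 AREKHVKEVI B-2705 600; 478 SRIEAVKLQM B-2705 600; 442 SQWMPWILPF B-2705 500
405 CYYVNQSGI Kd 2400; 346 FYYKLSQEL Kd 2400; 244 TYTTNSQCI Kd 2400
291 SFLVPPMTI Kd 1600; 406 YYVNQSGIV Kd 1200; 167 LFNTTLTGL Kd 1152
463 LFGPCIFNL Kd 960; 253 RWVTPPTQI Kd 480; 449 LPFLGPLAAI B-5102 2200
3 LPYHIFLFTV B-5102 1210; 331 GALGTGIGGI B-5102 798; 321 LPFVIGAGVL B-5102 550
499 RRPLDRPAS B-2705 600; 194 FRPYVSIPV B-2705 600; 383 RRALDLLTA B-2705 600
39 WRNQRPGNI B-2705 600; 423 DRIQRRAEEL B14 1800; 158 LRTHTRLVSL B14 600
359 ERVADSLVTL B14 540; 463 LFGPCIFNLL Kd 1658; 345 QFYYKLSQEL Kd 1152
443 QWMPWILPFL Kd 691; 405 CYYVNQSGIV Kd 500; 474 NFVSSRIEAV Kd 480
221 GPLVSNLEI B-5102 1320; 190 LPLNFRPYV B-5102 726; 449 LPFLGPLAAI B-5101 1144
488 EPKMQSKTKI B-5101 968; 3 LPYHFLFTV B-5101 629; 125 REKHVKEVI Kk 1000
312 KPRNKRVPIL B7 800; 378 VVLQNRRAL Db 792; 377 AVVLQNRRAL Db 660
321 LPFVIGAGV B-5101 629; 304 DLYSYVISK A3 540; 301 TEQDLYSYVI Kk 500
"""


def parse_table(text):
    """Turn the flattened text into (location, sequence, allele, score) tuples."""
    rows = []
    for chunk in text.replace("\n", ";").split(";"):
        fields = chunk.split()
        if len(fields) == 4:
            location, sequence, allele, score = fields
            rows.append((int(location), sequence, allele, int(score)))
    return rows


def peptides_within(rows, first, last):
    """Rows whose peptide lies entirely inside residues first..last (1-based, inclusive)."""
    return [r for r in rows if r[0] >= first and r[0] + len(r[1]) - 1 <= last]


ROWS = parse_table(TABLE_I)

for first, last in [(399, 471), (244, 271)]:
    hits = peptides_within(ROWS, first, last)
    alleles = sorted({r[2] for r in hits})
    print(f"{first}-{last}: {len(hits)} peptides; HLA molecules: {', '.join(alleles)}")
```

Under this reading of the locations, the decapeptide LFGPCIFNLL (location 463) ends at residue 472 and therefore falls just outside the fragment 399-471, whereas the nonapeptide LFGPCIFNL at the same location is included.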
[0170]For the purposes of the present invention, the peptides may be chemically or biochemically modified; some of the amino acids may be replaced with an analogous amino acid, according to conventional criteria for homologies (A or G; S or T; I, L or V; F, Y or W; N or Q; D or E).
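The substitution criteria listed above can be expressed as a simple residue-by-residue comparison. The sketch below is illustrative only: the function names and the two variant peptides are hypothetical, and only the six substitution groups are taken from the text.

```python
# Substitution groups taken from the text: A or G; S or T; I, L or V; F, Y or W; N or Q; D or E.
GROUPS = ["AG", "ST", "ILV", "FYW", "NQ", "DE"]
GROUP_OF = {aa: index for index, group in enumerate(GROUPS) for aa in group}


def is_analogous(a, b):
    """True if the two residues are identical or belong to the same substitution group."""
    if a == b:
        return True
    return a in GROUP_OF and b in GROUP_OF and GROUP_OF[a] == GROUP_OF[b]


def is_homologue(peptide, candidate):
    """True if candidate differs from peptide only by substitutions within a group."""
    return len(peptide) == len(candidate) and all(
        is_analogous(a, b) for a, b in zip(peptide, candidate)
    )


reference = "FLGEECCYYV"  # SEQ ID NO: 68 in Table I
print(is_homologue(reference, "YLGDECCYYI"))  # hypothetical variant: F->Y, E->D, V->I
print(is_homologue(reference, "FLGKECCYYV"))  # hypothetical variant: E->K is not in any group
```

Residues outside the six groups (for example C, K or R) are treated here as matching only themselves, which is one possible reading of the criteria.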
[0171]The subject of the present invention is also immunogenic or vaccine compositions for protecting against autoimmune diseases, in particular in at-risk subjects, characterized in that they comprise at least one peptide comprising at least one motif of the CKS type and/or at least one peptide consisting of a motif having affinity for one of the haplotypes of the class I or class II HLA system, and a pharmaceutically acceptable vehicle.
[0172]According to an advantageous embodiment of said composition, said motif is selected from the group consisting of peptides, as defined in Table I above.
[0173]According to another advantageous embodiment of said composition, said peptide has the following sequence:
TABLE-US-00008 (SEQ ID NO: 120) sequence CKH: LQNRRALDLLTAERGGTclFLGEECCYYV
[0174]It is notable that, at position 380 of the enverin protein, the motif of the CKS-17 type (underlined) is contiguous with the peptide having the highest score (in bold; see the peptide at position 399 in Table I, SEQ ID NO: 68) in the sequence CKH.
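The positions quoted above can be cross-checked with a few lines of code. This is a minimal sketch which assumes that position 380 is the first residue of the CKH sequence (an assumption consistent with the Table I entry 378 VVLQNRRAL); the variable names are chosen for this example only.

```python
# CKH sequence as printed (SEQ ID NO: 120); case is normalized for comparison.
CKH = "LQNRRALDLLTAERGGTclFLGEECCYYV".upper()
CKH_START = 380  # assumed position of the first CKH residue in enverin

TOP_PEPTIDE = "FLGEECCYYV"  # highest score in Table I (SEQ ID NO: 68)

offset = CKH.find(TOP_PEPTIDE)          # 0-based offset inside CKH
position = CKH_START + offset           # position in the enverin numbering

print(f"{TOP_PEPTIDE} starts at enverin position {position}")
print("peptide closes the CKH sequence:", CKH.endswith(TOP_PEPTIDE))
```

With these assumptions the computed start position agrees with the location 399 given in Table I, and the peptide forms the C-terminal end of CKH.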
[0175]The clonal activation of the subgroups of lymphocytes, for example of cytotoxic lymphocytes, by the peptides in Table I and by extension their homologues, is blocked by conventional immunotherapy means such as for example serotherapy and vaccination.
[0176]The combination of two sequences or of the sequences analogous to the CKH peptide (SEQ ID NO: 120), is capable of causing a synergistic process in the immune response, which could bring into play additional signaling and activation pathways capable of modulating the lymphocyte activation.
[0177]The vaccination relates to the production of antibodies directed against the peptides of Table I, according to the rules of the prior art and according to the methods of release controlled by artificial or cellular implants using a composition as defined above and by using gene therapy means, such as for example expression of nucleic sequences encoding the peptides of Table I. Consequently, the subject of the invention is also immunogenic or vaccine compositions, characterized in that they comprise a vector including at least one nucleic sequence encoding a peptide as defined in Table I, optionally combined with a sequence encoding a motif of the CKS-17 type.
[0178]The serotherapy relates to the use of neutralizing antibodies produced from the peptides of Table I and their homologues.
[0179]The protein products generated by the endogenous retroviral sequences or produced in parallel may be advantageously characterized by micro-methods of analysis and quantification of peptides and proteins: HPLC/FPLC or equivalent, capillary electrophoresis or equivalent, microsequencing techniques (Edman method or equivalent, mass spectrometry and the like).
[0180]The subject of the invention is also antibodies directed against one or more of the peptides described above and their use either for carrying out a method, in particular a differential method, of in vitro detection of the presence of such a sequence in an individual, or for the preparation of a composition capable of being used in serotherapy in neuropathological conditions with an autoimmune component.
[0181]Said antibodies are advantageously polyclonal or monoclonal antibodies obtained by an immunological reaction from a human, mammalian or avian organism or other species toward the proteins, as defined above.
[0182]The subject of the present invention is a method for the differential immunological screening of normal or pathological human endogenous retroviral sequences of the HERV-7q family, characterized in that it comprises bringing a biological sample into contact with an antibody according to the invention, the reading of the result being visualized by an appropriate means, in particular EIA, ELISA, RIA, fluorescence.
[0183]By way of illustration, such an in vitro diagnostic method according to the invention comprises bringing a biological sample collected from a patient into contact with antibodies according to the invention and detecting with the aid of any appropriate method, in particular with the aid of labeled anti-immunoglobulins, the immunological complexes formed between the proteins produced normally or pathologically and the antibodies.
[0184]Monoclonal or polyclonal antibodies, produced from antigens corresponding to synthetic peptides, or recombinant polypeptide or proteins make it possible to monitor the expression of the peptides or proteins produced normally or pathologically. The analysis is preferably carried out by ELISA or equivalent, Western blotting or equivalent, or by immunohistochemistry.
[0185]The peptides or proteins, derived from the endogenous retroviral sequences or whose expression is associated with the expression of these endogenous retroviral sequences, are tested for and identified.
[0186]The subject of the present invention is also a method for the identification and detection of endogenous retroviral motifs which are abnormally expressed in the context of pathological conditions associated with cancer, or of neuropathological conditions, in particular autoimmune neuropathological conditions, at the forefront of which is multiple sclerosis, characterized in that it comprises the comparative analysis of the sequences extracted from a biological sample and the sequences according to the invention.
[0187]The subject of the present invention is also the application of the nucleic sequences or of the protein sequences according to the invention to the diagnosis of, to the prognosis of, to the evaluation of genetic susceptibility to, any induced, congenital or acquired human diseases, in particular those with cancerous, autoimmune and/or neurological components, such as multiple sclerosis, the associated syndromes and the neurodegenerative diseases in which all or part of the nucleic sequences according to the invention and related endogenous or exogenous forms are involved.
[0188]The subject of the present invention is also hybrid nucleic sequences, characterized in that they comprise nucleic sequences or motifs according to the invention, combined with sequences or motifs of endogenous origin or of exogenous origin or induced exogenously.
[0189]The subject of the present invention is, in addition, a recombinant cloning or expression vector, characterized in that it comprises a nucleic sequence in accordance with the invention.
[0190]Therapeutic strategies may be envisaged by using some of the nucleic sequences contained in HERV-7q and the sequences of the same family or deduced polypeptide structures or by the use of peptides or proteins, or of specific antibodies.
[0191]In accordance with the invention, all or part of the endogenous retroviral nucleic sequences of the HERV-7q type may be used as a vector or as vector elements for therapeutic use, in particular the LTR sequences and the gag region (SEQ ID NO: 2, 21 and 22).
[0192]The advantage of such sequences lies in the safety of the vector thus formed, in the possibility of a targeted specific insertion in a well-defined region by a strategy similar to homologous recombination, in cellular targeting, which is optionally transient in the case of a placental expression in women. Another aspect relates to the possibility of combining with the genes of interest the biologically active retroviral motifs (immunomodulatory peptides, as represented in the sequences SEQ ID NO: 68-118, below, fusogenic peptide and the like).
[0193]The subject of the present invention is also transgenic animals, characterized in that they comprise all or part of a sequence of the HERV-7q type (SEQ ID NO: 1-22 and 61).
[0194]Table II below establishes the correspondences between the sequence numbers as they appear in the sequence listing and the name of the various sequences.
TABLE-US-00009 TABLE II
SEQ ID NO:  DESIGNATION
1    Nucleic acid: 7 env
2    Nucleic acid: gag
3    Nucleic acid: HERV-7q
4    Nucleic acid: HE2
5    Nucleic acid: HE3
6    Nucleic acid: HG3
7    Nucleic acid: HE4
8    Nucleic acid: HE5
9    Nucleic acid: HE6
10   Nucleic acid: HG6
11   Nucleic acid: HE7
12   Nucleic acid: HE8
13   Nucleic acid: HG8
14   Nucleic acid: HE9
15   Nucleic acid: HE10
16   Nucleic acid: HE11
17   Nucleic acid: HG11
18   Nucleic acid: HE12
19   Nucleic acid: HG12
20   Nucleic acid: R1
21   Nucleic acid: RIF
22   Nucleic acid + deduced env protein: HERV-7q
23   Fragment of deduced env protein according to SEQ ID NO: 22
24   Fragment of deduced env protein according to SEQ ID NO: 22
25   Fragment of deduced env protein according to SEQ ID NO: 22
26   Protein: enverin
27   Fragment of deduced env protein according to SEQ ID NO: 22
28   Nucleic acid + protein deduced from gag: HERV-7q
29   Fragment of deduced gag protein according to SEQ ID NO: 28
30   Fragment of deduced gag protein according to SEQ ID NO: 28
31   Fragment of deduced gag protein according to SEQ ID NO: 28
32   Fragment of deduced gag protein according to SEQ ID NO: 28
33   Fragment of deduced gag protein according to SEQ ID NO: 28
34   Fragment of deduced gag protein according to SEQ ID NO: 28
35   env protein: reading frame 1
36   gag protein
37   Nucleic acid: G1F (primer)
38   Nucleic acid: G1R (primer)
39   Nucleic acid: G2F (primer)
40   Nucleic acid: G2R (primer)
41   Nucleic acid: G4F (primer)
42   Nucleic acid: G3F (primer)
43   Nucleic acid: G4R (primer)
44   Nucleic acid: G5R (primer)
45   Nucleic acid: E1F (primer)
46   Nucleic acid: E1R (primer)
47   Nucleic acid: E2F (primer)
48   Nucleic acid: E2R (primer)
49   Nucleic acid: E3F (primer)
50   Nucleic acid: E3R (primer)
51   Nucleic acid: E4F (primer)
52   Nucleic acid: E4R (primer)
53   Nucleic acid: E5F (primer)
54   Nucleic acid: E6F (primer)
55   Nucleic acid: E5R (primer)
56   Nucleic acid: ExF (primer)
57   Nucleic acid: ExR (primer)
58   Protein gag
59   Nucleic acid: Sequence A (insertion sequence)
60   Nucleic acid: Sequence B (insertion sequence)
61   Nucleic acid: HE13
62   Nucleic acid: RH7
63   Nucleic acid: RAM75
64   Nucleic acid: RAV73
65   Nucleic acid: RBP3
66   Nucleic acid: HI3
67   Nucleic acid: LTX
68   Peptide Table I
69   Peptide Table I
70   Peptide Table I
71   Peptide Table I
72   Peptide Table I
73   Peptide Table I
74   Peptide Table I
75   Peptide Table I
76   Peptide Table I
77   Peptide Table I
78   Peptide Table I
79   Peptide Table I
80   Peptide Table I
81   Peptide Table I
82   Peptide Table I
83   Peptide Table I
84   Peptide Table I
85   Peptide Table I
86   Peptide Table I
87   Peptide Table I
88   Peptide Table I
89   Peptide Table I
90   Peptide Table I
91   Peptide Table I
92   Peptide Table I
93   Peptide Table I
94   Peptide Table I
95   Peptide Table I
96   Peptide Table I
97   Peptide Table I
98   Peptide Table I
99   Peptide Table I
100  Peptide Table I
101  Peptide Table I
102  Peptide Table I
103  Peptide Table I
104  Peptide Table I
105  Peptide Table I
106  Peptide Table I
107  Peptide Table I
108  Peptide Table I
109  Peptide Table I
110  Peptide Table I
111  Peptide Table I
112  Peptide Table I
113  Peptide Table I
114  Peptide Table I
115  Peptide Table I
116  Peptide Table I
117  Peptide Table I
118  Peptide Table I
119  Nucleic acid: BLIMP-1
120  Peptide: CKH
121  Nucleic acid: F645 (primer)
122  Nucleic acid: PS5D (primer)
[0195]In addition to the preceding arrangements, the invention also comprises other arrangements which will emerge from the description which follows, which refers to exemplary embodiments of the method which is the subject of the present invention as well as to the appended drawings, in which:
[0196]FIG. 1. Human nucleic sequence HERV-7q, whose analysis and treatment make it possible to characterize a novel endogenous retroviral structure. The repeat nucleic regions of type R1 and R2 and the gag, pol and env domains are underlined. The gag and env type domains are in italics. The region homologous to a noncoding 3' portion of Rab7 is double underlined.
[0197]FIG. 2. Map of the human endogenous retroviral region HERV-7q. The upper part of the figure corresponds to an anonymous region of the human genome situated on the long arm of chromosome 7. The repeat domains (1), gag (2), pol (3) and env (4) of HERV-7q can be identified. The C-terminal env region (4.3) is prolonged upstream in the form of a long open reading frame (4.2). The domain 4.1 corresponds to the N-terminal region of the env domain.
[0198]FIG. 3. Comparison of the repeat nucleic sequences situated at the boundaries of HERV-7q. The 5' (top) and 3' (bottom) repeat nucleic regions are compared and the identical bases are indicated by two dots.
[0199]FIG. 4. Deduced sequence having an open reading frame in the env-type domain of HERV-7q according to the longest open reading frame rule.
[0200]FIG. 5. Sequences around the CKS-17 domain identified in various deduced env domains of the HERV-7q family and comparison with reference CKS-17 motifs.
[0201]1) HE2; 2) HERV-7q; 3) GenBank accession No.: M85205; 4) HE7; 5) HE9; 6) CKS-17 (the peptide motif endowed with immunomodulatory properties is underlined); 7) gp20 of retrovirus type D (SRV-Pc).
[0202]FIG. 6. Possible deduced sequence of the gag-type domain identified in HERV-7q established according to the longest open reading frame rule. X and / correspond to a non-sense codon and to a reading frame shift, respectively. The underlined sequence corresponds to the beginning of the pol domain.
[0203]FIG. 7. Comparison of the nucleic regions covering the gag region of HERV-7q (top) and HERV-TcR (bottom) and their flanking regions. The identical bases are specified by two dots.
[0204]FIG. 8. Example of nucleic alignments of the env-type domain of HERV-7q with similar env-type domains present in human endogenous retroviral sequences of the same family. The non-sense codons are underlined: 1) HERV-7q; 2) HE2; 3) HE3; 4) HE4.
[0205]FIG. 9. Nucleic alignments between the gag domain of HERV-7q and the corresponding domains belonging to the same family. Comparison with fragments of gag domains isolated from infectious retroviral agents. Sequences of infectious retroviral origin: EMBL database accession No.: 1) A60168; 2) A60201; 3) A60200; 4) A60171. Human endogenous retroviral sequences: 5) HERV-7q; 6) HG11; 7) HG3. The figures indicated in the endogenous sequences correspond to the number of nucleotides inserted in order to optimize the alignment with the gag-type sequences identified in retroviruses of infectious origin.
[0206]FIG. 10. Alignment of a deduced gag protein motif (top) belonging to an infectious retrovirus (EMBL accession No.: A60200) with the deduced gag protein motif (bottom) identified in HERV-7q. The non-sense codons are in bold and underlined. The identical amino acids are specified by 2 dashes. One dash indicates a deletion or a homologous amino acid.
[0207]FIG. 11. Alignment of an env motif (top) belonging to an infectious retrovirus (EMBL accession No.: A60170) with the env motif (bottom) identified in HERV-7q. The homologous nucleotides are specified by two dots and the deletions by a dash.
[0208]FIG. 12. Comparison between the env domain of HERV-7q (top) and the env domain of HERV-9 (bottom). The 66% homology is limited to the 3' region of the env domain of HERV-7q and HERV-9, respectively between nucleotides 8976 nt and 9500 nt of HERV-7q and nucleotides 2898 nt and 3465 nt of HERV-9 (GenBank accession No.: X57147). Numerous insertions/deletions are also observed.
[0209]FIG. 13. Homology between a portion of the sequence of the transcript encoding RH7 (top, SEQ ID NO: 62) and an RGH2 motif (bottom; GenBank accession No.: D11018).
[0210]FIG. 14. Identification of the sequence of the transcript encoding RAM75 (SEQ ID NO: 63), corresponding to the gene for an ATPase of PEX1 type.
[0211]The coding exons are underlined. The initiation and non-sense codons as well as the putative polyadenylation sites are in bold and underlined. The region in italics corresponds to the beginning of the endogenous retroviral sequence RH7.
[0212]FIG. 15. Sequence of the transcript encoding RAV73 (SEQ ID NO: 64), situated at 0.7 kb downstream of HERV-7q; the nucleic sequences capable of encoding one or more polypeptides are underlined.
[0213]FIG. 16. Comparison between the 3' LTR sequence (top) of HERV-7q and the intron sequence LTX (SEQ ID NO: 67), situated in the FMR2 gene, associated with fragile X (bottom).
[0214]FIG. 17. Detection of modifications in the nucleotide sequence (SEQ ID NO: 3) in patients suffering from MS. The bases modified in at least one patient are underlined. The primers used are in italics (sequences SEQ ID NO: 121 and 122). The initiation ATG and the non-sense codon are in bold.
[0215]FIG. 18. The env coding portion of the HERV-7q sequence (SEQ ID NO: 3), with 3 reading frames.
[0216]FIGS. 19, 20, 21. Separate presentation of the env protein according to the 3 reading frames.
[0217]FIG. 22. Nucleic sequence containing the retroviral sequence RH7 situated in 5' of the HERV-7q sequence. The sequence in italics corresponds to the beginning of the HERV-7q sequence. The RH7 sequence is underlined. Two putative polyadenylation sites are in bold.
[0218]FIG. 23. Sequence of the transcript encoding RBP3 containing nucleotide motifs identified in the nucleic sequence encoding the Blimp-1 gene.
[0219]FIG. 24. Sequence of the transcript encoding APS.
[0220]FIG. 25. Sequence of the transcript encoding Blimp-1; the coding portion is underlined; the initiation and termination codons are in bold.
[0221]FIG. 26. Sequence of the transcript encoding FMR2. The coding portion is underlined. The initiation and non-sense codons are in bold.
[0222]It should be clearly understood, however, that these examples are given solely by way of illustration of the subject of the invention and do not in any manner constitute a limitation thereto.
Example 1
Detection, by Gene Amplification, of a Nucleic Sequence Belonging to a Domain of the gag or env Type According to the Invention, in a Genomic DNA Sample of Human or Mammalian Origin
[0223]The gene amplification is carried out using genomic DNA isolated from blood. An anticoagulant treatment is carried out with 1 ml of a citrate solution (per liter: 4.8 g of citric acid, 13.2 g of sodium citrate, 14.7 g of glucose) per 6 ml of fresh blood. After centrifugation of 20 ml of blood for 15 min at 130 000 g, the supernatant is removed and the fraction enriched with white blood cells is transferred into a new tube and then recentrifuged under the same conditions as above. The fraction enriched with white blood cells is resuspended in an extraction buffer (10 mM Tris-HCl, 0.1 M EDTA, 20 μg/ml of pancreatic RNAse treated so as to eliminate the DNAses, 0.5% SDS, pH 8.0), and then incubated for 1 hour at 37° C. Proteinase K is added at a final concentration of 100 μg/ml. The suspension of lysed cells is incubated at 50° C. for 3 hours, with occasional stirring, and then treated with an equal volume of phenol equilibrated with 0.5 M Tris-HCl, pH 8.0. The emulsion formed is placed on a wheel for one hour and then centrifuged at 5 000 g for 15 min at room temperature. The aqueous solution is deproteinized by a triple phenol extraction in order to obtain a level of purification corresponding to a final A260/A280 absorbance ratio greater than 1.75. The aqueous fraction is precipitated with 0.2 vol. of 10 M sodium acetate and 2 vol. of ethanol. The DNA is then either collected with the tip of a bent Pasteur pipette, or centrifuged at 5 000 g for 5 min at room temperature. The DNA or the DNA pellet is washed twice with 70% ethanol and then taken up in 1 ml of TE, pH 8.0 so as to be eluted, with gentle stirring, for 12 to 24 hours.
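The purity criterion stated above (a final A260/A280 ratio greater than 1.75) lends itself to a simple check. The sketch below is illustrative only; the function names and the absorbance readings are hypothetical, and only the 1.75 threshold is taken from the text.

```python
PURITY_THRESHOLD = 1.75  # A260/A280 ratio required by the protocol above


def purity_ratio(a260, a280):
    """Ratio of the absorbances measured at 260 nm and 280 nm."""
    return a260 / a280


def is_sufficiently_pure(a260, a280, threshold=PURITY_THRESHOLD):
    """True when the ratio exceeds the threshold, i.e. no further extraction is needed."""
    return purity_ratio(a260, a280) > threshold


# Hypothetical spectrophotometer readings for two DNA preparations.
for a260, a280 in [(0.90, 0.50), (0.80, 0.50)]:
    verdict = "acceptable" if is_sufficiently_pure(a260, a280) else "repeat the phenol extraction"
    print(f"A260={a260:.2f} A280={a280:.2f} ratio={purity_ratio(a260, a280):.2f}: {verdict}")
```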
[0224]Oligonucleotides specific for the endogenous sequences described according to the invention are chosen in order to amplify the gag or env region of the endogenous retroviral regions described according to the invention. The genomic DNA studied is obtained from patients having pathological conditions such as multiple sclerosis and from individuals reputed to be healthy.
[0225]The thermostable DNA polymerases used were chosen for their high accuracy during the amplification process, such as Vent DNA polymerase (Biolabs) and the like, and are used according to the conditions recommended by the supplier.
[0226]The amplification strategy uses, depending on the case, a simple PCR, or a nested or seminested PCR.
[0227]Oligonucleotides used to amplify the gag region:
[0228]primer G1F, sense, located in the region upstream of the gag domain of HERV-7q (SEQ ID NO: 37),
[0229]primer G1R, antisense, located in the 3' terminal region of the gag domain (SEQ ID NO: 38).
[0230]The fragment of 1505 nt amplified by the pair G1F-G1R is used to generate the probes capable of hybridizing with the various PCR amplification products.
[0231]primer G2F, sense nested (SEQ ID NO: 39),
[0232]primer G2R, antisense nested (SEQ ID NO: 40),
[0233]primer G4F, sense nested (SEQ ID NO: 41),
[0234]primer G3F, sense nested (SEQ ID NO: 42),
[0235]primer G4R, antisense nested (SEQ ID NO: 43),
[0236]primer G5R, antisense nested (SEQ ID NO: 44).
[0237]Oligonucleotides used to amplify the env region of HERV-7q:
[0238]primer E1F, sense (SEQ ID NO: 45),
[0239]primer E1R, antisense (SEQ ID NO: 46).
[0240]The fragment of 2529 nt amplified by the pair of primers E1F-E1R is used to generate the probes capable of hybridizing with the various PCR amplification products.
[0241]primer E2F, sense (SEQ ID NO: 47),
[0242]primer E2R, antisense (SEQ ID NO: 48),
[0243]primer E3F, sense (SEQ ID NO: 49),
[0244]primer E3R, antisense (SEQ ID NO: 50),
[0245]primer E4F, sense (SEQ ID NO: 51),
[0246]primer E4R, antisense (SEQ ID NO: 52),
[0247]primer E5F, sense (SEQ ID NO: 53),
[0248]primer E6F, sense (SEQ ID NO: 54),
[0249]primer E5R (SEQ ID NO: 55),
[0250]primer ExF (SEQ ID NO: 56),
[0251]primer ExR (SEQ ID NO: 57).
[0252]The PCR is carried out using 50 to 200 ng of genomic DNA. The PCR conditions are those recommended by the supplier. The amplification is carried out in a volume of 50 μl with the following cycle: denaturation at 94° C. for 1 min, hybridization at 70° C. for 1 min, and extension at 72° C. for 1 to 2 min, depending on the amplified fragments. After 35 cycles, a terminal extension is carried out at 72° C. for 10 min. Automated sequencing of the amplified samples is carried out with the aid of an Applied Biosystems type ABI 377 sequencer or another comparable model, according to the protocols provided by the manufacturer.
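As a worked illustration of the cycling program described above, the nominal hold time can be computed as follows. This is a sketch only: the function name is hypothetical, and ramp times between temperatures, which depend on the thermocycler, are ignored.

```python
def cycling_minutes(extension_min, cycles=35, denaturation_min=1,
                    hybridization_min=1, terminal_min=10):
    """Nominal program duration in minutes, ignoring temperature ramp times."""
    per_cycle = denaturation_min + hybridization_min + extension_min
    return cycles * per_cycle + terminal_min


# Extension lasts 1 to 2 min depending on the amplified fragment.
shortest = cycling_minutes(extension_min=1)
longest = cycling_minutes(extension_min=2)
print(f"Nominal program duration: {shortest} to {longest} min")
```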
[0253]In the case of a nested or seminested PCR, the same experimental conditions are used, the only difference being that the genomic DNA sequence is replaced with 5 to 10 μl of the amplification product derived from the first PCR.
[0254]Two independent amplifications are carried out using the same sample. A control reaction is carried out by replacing the DNA sample with water in order to detect possible contaminants.
Example 2
Detection, by Gene Amplification, of a Nucleic Sequence According to the Invention in a Biological Sample of Genomic DNA Collected from Patients having an Existing Candidate Pathological Condition or Suspected of having this Pathological Condition
[0255]The amplification protocol is the same as in Example 1, apart from the origin of the sample which is obtained from patients having a candidate pathological condition. A genomic DNA sample reputed to be normal is systematically integrated into the set of amplified pathological samples and then analyzed.
[0256]The PCR products are separated on a 1.5% agarose gel and then transferred in the presence of 0.4 N sodium hydroxide onto a charged nylon membrane. Hybridization is carried out with a specific probe corresponding to the PCR fragments amplified either with the pair G1F-G1R or the pair E1F-E1R. The probe is labeled by incorporating dUTP-digoxigenin according to the supplier's protocol (Boehringer Mannheim). The hybridization is carried out in a hybridization buffer (5×SSC, 50% formamide, 0.1% lauroylsarcosine, 0.02% SDS, 2% blocking reagent Boehringer) overnight at 42° C. The Southern blot is washed twice for 5 min at room temperature in a 2×SSC solution containing 0.1% SDS. Next, a high stringency wash is carried out twice for 15 min at 55° C. in a 0.1×SSC solution containing 0.1% SDS. The hybridization is visualized according to the supplier's protocol (Boehringer Mannheim), in the presence of a chemiluminescent substrate for alkaline phosphatase, of the CSPD or CDP-STAR type. The filter is visualized after a 15 min exposure at 60° C.
[0257]SSCP (single strand conformation polymorphism) analysis makes it possible to detect discrete modifications of the sequence of the fragments amplified by PCR. The PCR is carried out in the presence of dCTP labeled with 32P. The sample to be analyzed is denatured at 95° C. for 10 min in the presence of loading buffer, and then immediately loaded onto a 10% polyacrylamide gel containing 7.5% glycerol. The migration is carried out at 4° C. at 8-10 W. The gel is dried and then autoradiographed.
[0258]The PCR fragments likely to exhibit an alteration of their nucleotide sequence are sequenced according to Example 1.
[0259]Hybridization with the aid of a specific oligonucleotide (17-mer to 20-mer) corresponding to the modified nucleotide region makes it possible to identify the samples having an identical modification (ASO method). Briefly, the Southern blot is hybridized with an oligonucleotide which is distally labeled either with 32P or with digoxigenin (according to the Boehringer Mannheim protocol) and then washed under stringent conditions at 65° C. in a 6×SSC solution containing 0.05% sodium pyrophosphate.
[0260]For example, an automated nucleotide sequencing was carried out on six PCR fragments obtained from 5 patients suffering from MS and a control reputed to be normal, and which were amplified using the primers F645: CTTCAAACAACAACCAGGAGG (SEQ ID NO: 121) (situated 26 nucleotides upstream of the initiation methionine of enverin) and PS5D: TTGGGGAGGTTGGCCGACGA (SEQ ID NO: 122) (situated 6 nucleotides downstream of the non-sense codon of enverin). Modifications of the sequence of enverin were observed on the DNA from some patients (FIG. 17).
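The two primers quoted above can be characterized with a few elementary sequence operations. The following is a minimal sketch; the helper names are assumptions made for this example, and the reverse complement is shown simply because it is the form in which an antisense primer would be searched for on the sense strand.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PRIMERS = {
    "F645": "CTTCAAACAACAACCAGGAGG",  # SEQ ID NO: 121
    "PS5D": "TTGGGGAGGTTGGCCGACGA",   # SEQ ID NO: 122
}


def reverse_complement(seq):
    """Complement each base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_count(seq):
    """Number of G and C bases in the sequence."""
    return seq.count("G") + seq.count("C")


for name, seq in PRIMERS.items():
    gc_percent = 100 * gc_count(seq) / len(seq)
    print(f"{name}: {len(seq)} nt, GC {gc_percent:.0f}%, "
          f"reverse complement {reverse_complement(seq)}")
```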
Example 3
Detection of a Protein According to the Invention in a Biological Sample
[0261]Preparation of a Purified Protein Fraction of Cerebrospinal Fluid from Patients Suffering from MS
[0262]After a treatment at 56° C. for 30 min and removal of the immunoglobulins on a HiTrap protein G column (Pharmacia), the equivalent of 10 ml of CSF is deposited on a DEAE Sepharose CL-6B column (Pharmacia). The elution is carried out in 20 mM Tris-HCl, pH 8.8, with a gradient from 0 to 0.4 M NaCl, and then the fraction is dialyzed twice against a phosphate-NaCl buffer (PBS). After concentration on Ultrafree-MC (Millipore), the fraction is deposited on a Superose 12 column (FPLC Pharmacia) and eluted in the presence of PBS. After separation by polyacrylamide-SDS gel electrophoresis and electrotransfer onto an Immobilon-P membrane (Millipore), the protein bands are subjected to controlled trypsin hydrolysis.
[0263]Analysis of the Protein Fraction by Mass Spectrometry
[0264]The peptides digested in the presence of trypsin are analyzed by the MALDI-TOF method, which allows the analysis of peptides present in a mixture (COTTRELL J. S., Pept. Res., 1997, 7, 115-124). The peptides characterized according to their mass are compared with the proteins and with the associated proteins according to the invention.
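For the comparison step described above, the expected tryptic peptides of a candidate protein are usually predicted in silico before their masses are matched to the spectrum. The sketch below illustrates only the digestion step, applied to the CKH sequence (SEQ ID NO: 120). It assumes the commonly used rule that trypsin cleaves on the C-terminal side of K or R except before P, which is an assumption not stated in the text; the function name is hypothetical, mass calculation is omitted, and Python 3.7 or later is needed for splitting on an empty match.

```python
import re


def trypsin_digest(protein):
    """Split after every K or R that is not followed by P (assumed cleavage rule)."""
    fragments = re.split(r"(?<=[KR])(?!P)", protein)
    return [fragment for fragment in fragments if fragment]


CKH = "LQNRRALDLLTAERGGTCLFLGEECCYYV"  # SEQ ID NO: 120, written in upper case

peptides = trypsin_digest(CKH)
for peptide in peptides:
    print(peptide)
```

The predicted fragments reassemble into the starting sequence, which provides a simple sanity check on the digestion routine.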
Example 4
Detection of Specific Antibodies to the env Domain of HERV-7q
[0265]The identification of a long open reading frame in the env sequence of HERV-7q made it possible to determine a deduced protein sequence (SEQ ID NO: 22 and 35 and FIGS. 18-20) for a region of the said gene.
[0266]The protein sequences deduced from SEQ ID NO: 22 and 35 and FIGS. 18-20 are positioned as follows with respect to FIG. 1 or SEQ ID NO: 3:
[0267]SEQ ID NO: 22 (reading frame 1) and FIG. 19: beginning of the coding sequence: position 7874; end of the coding sequence: first nonsense codon (position 9493)
[0268]SEQ ID NO: 35: beginning of the coding sequence: position 7874; end of the coding sequence: first nonsense codon (position 9493) (reading frame 1)
[0269]FIG. 19: beginning of the coding sequence: position 6970; end of the coding sequence: first nonsense codon (position 9493) (reading frame 1)
[0270]FIG. 20: beginning of the coding sequence: position 6971, the end of the reading frame is shifted depending on the case by 1, 2 or 3 codons
[0271]FIG. 21: beginning of the coding sequence: position 6972, the end of the reading frame is shifted depending on the case by 1, 2 or 3 codons
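The presentation of a coding region in three reading frames, with starting positions offset by one nucleotide each, can be illustrated as follows. This is a generic sketch using the standard genetic code and a short toy sequence chosen for the example; it does not reproduce the HERV-7q sequence or the positions given above, and the names used are hypothetical.

```python
# Standard genetic code, codons ordered T, C, A, G at each position; "*" marks a nonsense codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame):
    """Translate dna starting at offset frame (0, 1 or 2); unknown codons give '?'."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "?") for codon in codons)


TOY_SEQUENCE = "ATGGCCTAA"  # toy example, not taken from the sequence listing

for frame in range(3):
    print(f"reading frame {frame + 1}: {translate(TOY_SEQUENCE, frame)}")
```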
[0272]Various peptides corresponding to all or part of SEQ ID NO: 22 (see SEQ ID NO: 23-27 and 35) were synthesized by genetic engineering in order to test their antigenic specificity toward sera or tissues from patients suffering from MS, for example. Briefly, all or part of the env region of HERV-7q is subcloned into the vectors pQE30, 31 and 32. The vectors pQE30, 31 and 32 contain, in 5' of the multiple cloning site, the consensus sequences for transcription (the strong T5 bacteriophage promoter, 2 operators of the lactose operon) and translation (one synthetic ribosome binding site). Likewise, pQE30, 31 and 32 possess, in 3', the phage λ transcription terminator as well as a Stop codon for translation. The expression of the protein is carried out after transformation in E. coli M15. The plasmids pQE30, 31 and 32 possess, upstream of the multiple cloning site, the coding sequence for a succession of 6 histidines having affinity for nickel ions. This stretch allows the purification of the expressed chimeric protein by adsorption on a resin consisting of a chelating ligand, nitrilotriacetic acid (NTA), charged with 4 nickel ions (Ni-NTA resin, Qiagen).
[0273]The transformation is carried out by electroporation or by treatment with calcium chloride. For example, an E. coli M15 colony is incubated in 100 ml of LB medium containing 250 μg of kanamycin, with stirring at 37° C., until an OD600 of 0.5 is obtained. After centrifugation for 5 minutes at 2000 g at 4° C., the bacterial pellet is taken up in 30 ml of TFB1 solution (100 mM rubidium chloride, 50 mM manganese chloride, 30 mM potassium acetate, 10 mM CaCl2, 15% glycerol, pH 5.8) and kept at 4° C. for 90 minutes. After centrifugation for 5 minutes at 2000 g at 4° C., the bacterial pellet is taken up in 4 ml of TFB2 solution (10 mM rubidium chloride, 10 mM MOPS, 75 mM CaCl2, 15% glycerol, pH 8). The cells may be kept at -70° C. in aliquots of 500 μl. 20 μl of the ligation and 125 μl of competent cells are mixed and placed on ice for 20 minutes. After a heat shock at 42° C. for 90 seconds, the cells are stirred for 90 minutes at 37° C. in 500 μl of Psi-broth medium (LB medium supplemented with 4 mM MgSO4 and 10 mM potassium chloride). The transformed cells are plated on LB-agar dishes supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin, and the dishes are incubated overnight at 37° C.
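The TFB1 composition is given as molar concentrations, so the mass of each salt to weigh out follows from mass = concentration × volume × molar mass. The Python sketch below works this through under stated assumptions: the hydration states of the salts and the 100 ml batch volume in the usage note are not given in the text and are chosen here purely for illustration.

```python
# Molar masses in g/mol. The hydration states are assumptions: the text
# names the salts but does not say which hydrates were used.
MOLAR_MASS = {
    "RbCl": 120.92,
    "MnCl2.4H2O": 197.91,
    "potassium acetate": 98.14,
    "CaCl2.2H2O": 147.01,
}

# TFB1 salt concentrations in mM, as listed in the protocol.
TFB1_MILLIMOLAR = {
    "RbCl": 100.0,
    "MnCl2.4H2O": 50.0,
    "potassium acetate": 30.0,
    "CaCl2.2H2O": 10.0,
}


def grams_needed(conc_mm: float, volume_ml: float, molar_mass: float) -> float:
    """Mass in grams for a solute at `conc_mm` (mM) in `volume_ml` (ml)."""
    return (conc_mm / 1000.0) * (volume_ml / 1000.0) * molar_mass


def tfb1_recipe(volume_ml: float) -> dict[str, float]:
    """Grams of each TFB1 salt for a batch of `volume_ml` millilitres."""
    return {
        salt: grams_needed(conc, volume_ml, MOLAR_MASS[salt])
        for salt, conc in TFB1_MILLIMOLAR.items()
    }
```

Calling `tfb1_recipe(100)` gives roughly 1.21 g of rubidium chloride for a hypothetical 100 ml batch; glycerol (15%) is added by volume and the pH is adjusted separately.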
[0274]The potentially recombinant clones are sub-cultured in an orderly manner on a nylon filter deposited on an LB-agar dish supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin. After one night at 37° C., the recombinant clones are located by hybridization of the plasmid DNA with the nucleotide probe amplified by PCR with the pair of primers according to SEQ ID NO: 45 and SEQ ID NO: 46.
[0275]An independent colony containing the insert is inoculated into 20 ml of LB medium supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin. After one night at 37° C., with stirring, 500 ml of the same medium are inoculated at 1/50 with this preculture and incubated until an OD600 of 0.8 is obtained, and then IPTG is added to a final concentration of 1 to 2 mM. After 5 hours, the cells are centrifuged for 20 minutes at 4 000 g.
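The inoculation and induction steps reduce to simple dilution arithmetic (C1·V1 = C2·V2). The sketch below illustrates it in Python; the 1 M IPTG stock concentration used in the usage note is an assumption, since the text gives only the final concentration.

```python
def inoculum_volume_ml(culture_ml: float, dilution: float) -> float:
    """Volume of preculture for a 1/`dilution` inoculation of `culture_ml`."""
    return culture_ml / dilution


def stock_volume_ml(final_mm: float, culture_ml: float, stock_mm: float) -> float:
    """Volume of stock (ml) giving `final_mm` in `culture_ml`.

    The small volume added by the stock itself is neglected.
    """
    return final_mm * culture_ml / stock_mm
```

With these helpers, `inoculum_volume_ml(500, 50)` returns 10 ml of preculture, and `stock_volume_ml(1, 500, 1000)` returns 0.5 ml of a hypothetical 1 M IPTG stock for a final concentration of 1 mM.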
[0276]A portion of the cellular pellet is taken up in 5 ml of sonication buffer (50 mM sodium phosphate, pH 7.8, 300 mM NaCl) and then placed on ice. After rapid sonication, the cells are centrifuged for 20 minutes at 10 000 g. A portion of the cellular pellet is taken up in 10 ml of a 30 mM Tris/HCl-20% sucrose solution, pH 8. The cells are incubated for 5 to 10 minutes, with stirring, after addition of 1 mM EDTA. After centrifugation for 10 minutes at 8 000 g at 4° C., the pellet is taken up in 10 ml of ice-cold 5 mM MgSO4. After 10 minutes on ice, with stirring, the cells are centrifuged for 10 minutes at 8 000 g at 4° C.
[0277]The pellet is taken up at 5 ml/g in buffer A (6 M GuHCl (guanidine hydrochloride), 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 8) for 1 hour at room temperature. The lysate is centrifuged for 15 minutes at 10 000 g at 4° C., and the supernatant is supplemented with 8 ml of Ni-NTA resin, pre-equilibrated in buffer A. After 45 minutes at room temperature, the resin is poured into a column, washed with 10 column volumes of buffer A and then with 5 column volumes of buffer B (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 8). The column is washed with buffer C (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 6.3) until the A280 is less than 0.01. The recombinant protein is eluted with 10 to 20 ml of buffer D (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 5.9), then with 10 to 20 ml of buffer E (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 4.5), and then with 20 ml of buffer F (6 M GuHCl, 0.2 M acetic acid). After SDS-PAGE analysis, the purified fraction(s) containing the chimeric protein are used to produce antibodies in rabbits. The antibodies obtained are tested by Western blotting, with visualization by a secondary antibody coupled to alkaline phosphatase.
[0278]Antibodies are obtained in the same manner, using peptides synthesized chemically according to the Merrifield technique (G. Barany and B. Merrifield, 1980, in The peptides, 2, 1-284, E. Gross and J. Meienhofer, Academic Press, New York).
[0279]The specific antibodies obtained are used for detection of the serum or tissue expression of all or part of the endogenous retroviral sequences according to the invention, in normal and pathological cases.
[0280]The proteins of serum or tissue origin are separated on an acrylamide-SDS gel and then transferred onto a nitrocellulose filter with the aid of a Novablot 2117-2250 apparatus (LKB). The transfer is carried out on a Hybond C-extra sheet (Amersham) using a 100 mM CAPS buffer pH 11, methanol, water (V/V/V: 1/1/8) containing 1 mM CaCl2. After a transfer of 1 hour at 0.8 mA/cm2, the sheet is saturated for 1 hour at room temperature in PBS-0.5% gelatin. The sheet is brought into contact with the specific antibody at a dilution of 1/1 000 in PBS-0.25% gelatin. After 2 hours, the filter is washed 3 times for 15 minutes in PBS-0.1% Tween-20 and then incubated for 30 minutes in the presence of a secondary antibody coupled to alkaline phosphatase (Promega), diluted 1/7 500 in PBS-0.25% gelatin. After three washes in PBS-0.1% Tween-20, the filter is equilibrated in alkaline phosphatase buffer (100 mM Tris-HCl, pH 9.5, 100 mM NaCl, 5 mM MgCl2). The visualization is carried out in the presence of 45 μl of NBT at 75 mg/ml and 35 μl of BCIP at 50 mg/ml, per 10 ml of alkaline phosphatase buffer.
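The quantities in the blotting protocol can be checked with a few lines of arithmetic. The Python sketch below computes the total transfer current, the antibody volume for a given dilution, and the final NBT and BCIP concentrations; the membrane dimensions and the 10 ml incubation volume used in the usage note are assumptions chosen for illustration.

```python
def transfer_current_ma(width_cm: float, height_cm: float,
                        density_ma_per_cm2: float = 0.8) -> float:
    """Total current (mA) for a membrane of the given size."""
    return width_cm * height_cm * density_ma_per_cm2


def antibody_volume_ul(total_ml: float, dilution: float) -> float:
    """Microlitres of antibody for a 1/`dilution` dilution in `total_ml`."""
    return total_ml * 1000.0 / dilution


def final_conc_mg_per_ml(stock_mg_per_ml: float, stock_ul: float,
                         total_ml: float) -> float:
    """Final concentration after adding `stock_ul` of stock to `total_ml`."""
    return stock_mg_per_ml * (stock_ul / 1000.0) / total_ml


# Total staining volume: 10 ml of buffer plus 45 ul NBT and 35 ul BCIP.
STAIN_TOTAL_ML = 10.0 + (45.0 + 35.0) / 1000.0
NBT_FINAL = final_conc_mg_per_ml(75.0, 45.0, STAIN_TOTAL_ML)
BCIP_FINAL = final_conc_mg_per_ml(50.0, 35.0, STAIN_TOTAL_ML)
```

For a hypothetical 8 cm × 10 cm membrane, `transfer_current_ma(8, 10)` gives 64 mA, and `antibody_volume_ul(10, 1000)` gives 10 μl of primary antibody for a 10 ml incubation. The stated NBT and BCIP additions work out to roughly 0.33 mg/ml and 0.17 mg/ml respectively.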
[0281]The chimeric proteins obtained by genetic engineering are also used for tests of biological activity, such as for example the test for biological activity of the CKS-17-type peptide identified in the env domain of HERV-7q (FIG. 5).
Example 5
Production of Ribonucleic Probes Encoding the env Sequences of HERV-7q
[0282]The PCR fragments obtained are subcloned into the plasmid pGEM-4Z (Promega), which possesses, on either side of its multiple cloning site, promoter sequences for the SP6 and T7 RNA polymerases.
[0283]The transformation method used is electroporation. The plasmid and the PCR fragment are ligated in a ratio of 50 ng of vector (SmaI cleavage) to 100 ng of PCR fragment (made blunt ended by treatment with the Klenow fragment of DNA polymerase). The incubation takes place overnight at 22° C. in ligation buffer (66 mM Tris-HCl, pH 7.5, 5 mM MgCl2, 1 mM dithioerythritol, 1 mM ATP) in the presence of 1 U of T4 DNA ligase and is then stopped by denaturation for 10 minutes at 65° C. In parallel, the E. coli JM 105 strain is inoculated and grown overnight at 37° C. in LB medium. This preculture is diluted 1/500 and placed at 37° C. until an OD600 equal to 1 is obtained. For the remainder of the procedure, the cells are always kept cold. After centrifugation for 5 minutes at 3 500 g at 4° C., the cellular pellet is resuspended in 1/4 vol. of ultra-pure ice-cold water. This step is repeated 5 to 6 times. The pellet is then resuspended in 1/4 000 vol. of water; 10% of sterile glycerol is added, allowing preservation of the electrocompetent cells, in aliquots of 10 μl at 20° C. 1 μl of the ligation is added to 50 μl of electrocompetent cells; the mixture is subjected to an electrical discharge of 12.5 kV/cm, applied for 5.8 ms. The cells are rapidly resuspended in SOC medium, incubated for 1 hour at 37° C. and then plated in the presence of 2% X-Gal in dimethylformamide, and 10 mM IPTG, on an LB-agar dish supplemented with ampicillin (100 μg/ml). After one night at 37° C., the potentially recombinant white clones are subcultured in an orderly manner on an LB/ampicillin dish and in parallel on a nylon filter deposited on an LB/ampicillin dish. These two dishes are incubated overnight at 37° C. The recombinant clones are then located by hybridization with a nucleic probe amplified by PCR with the pair of primers according to SEQ ID NO: 45 and SEQ ID NO: 46 and labeled with digoxigenin.
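The vector and insert amounts are stated by mass, whereas ligation efficiency depends on the molar ratio of the two. The Python sketch below converts the masses to picomoles and also derives the voltage setting from the stated field strength. Several values are assumptions not found in the text: an average of 650 g/mol per base pair, a vector length of 2746 bp, an insert length of 1000 bp, and a 0.2 cm cuvette gap.

```python
AVG_G_PER_MOL_PER_BP = 650.0  # common approximation for double-stranded DNA


def dsdna_pmol(mass_ng: float, length_bp: int) -> float:
    """Picomoles of a double-stranded DNA fragment of the given mass."""
    return mass_ng * 1000.0 / (length_bp * AVG_G_PER_MOL_PER_BP)


def insert_to_vector_ratio(insert_ng: float, insert_bp: int,
                           vector_ng: float, vector_bp: int) -> float:
    """Molar ratio of insert to vector in a ligation mix."""
    return dsdna_pmol(insert_ng, insert_bp) / dsdna_pmol(vector_ng, vector_bp)


def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage to set for a target field strength across a cuvette gap."""
    return field_kv_per_cm * gap_cm
```

With the hypothetical lengths above, `insert_to_vector_ratio(100, 1000, 50, 2746)` is about 5.5, so the 100 ng : 50 ng mass ratio would correspond to a molar excess of insert. `pulse_voltage_kv(12.5, 0.2)` gives 2.5 kV for the assumed 0.2 cm cuvette.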
[0284]The recombinant clones are cultured in 50 ml of LB/ampicillin medium (100 μg/ml), with stirring, overnight at 37° C. After centrifugation at 3 500 g for 15 minutes at 4° C., the bacterial pellet is taken up in 4 ml of P1 buffer (50 mM Tris-HCl, 10 mM EDTA, 400 μg/ml RNase A, pH 8) and 4 ml of P2 buffer (200 mM NaOH, 1% SDS). The mixture is incubated at room temperature for 5 minutes. After addition of 4 ml of P3 buffer (2.55 M potassium acetate, pH 4.8), the mixture is centrifuged at 12 000 g for 30 minutes at 4° C. The supernatant is applied to a Qiagen type 100 column, pre-equilibrated with 2 ml of QBT buffer (750 mM NaCl, 50 mM MOPS, 15% ethanol, pH 7); the column is washed twice with 4 ml of QC buffer (1 M NaCl, 50 mM MOPS, 15% ethanol, pH 7) and the DNA is eluted with 2 ml of QF buffer (1.2 M NaCl, 50 mM MOPS, 15% ethanol, pH 8). The DNA is precipitated with 0.8 vol. of isopropanol and centrifuged at 12 000 g at 4° C. for 30 minutes. The pellet is washed with ice-cold 70% ethanol and then the plasmid DNA is taken up in two portions of 150 μl of TE buffer.
[0285]The ribonucleic probes are used as specific probes, in particular for the detection of the transcripts expressed by the endogenous retroviral sequences according to the invention.
Example 6
Construction of a Transgenic Mouse Containing all or Part of the Gene for Enverin
[0286]A transgenic mouse containing all or part of the HERV-7q sequence (SEQ ID NO: 3) is constructed so as to identify the sequences responsible for the tissue specificity, and to evaluate the role of all or part of the endogenous retroviral motifs of the HERV-7q type, in particular all or part of the peptide motifs of enverin. The microinjection technique used is the conventional technique (Hogan et al., (1994), Manipulating the mouse embryo, Cold Spring Harbor, Cold Spring Harbor Laboratory Press) or one of its equivalents. Forms of the HERV-7q-type motifs, including enverin, that are identical to the normal human molecule, or forms that are mutated, deleted, truncated or carry insertions, are tested in order to determine the motifs which are critical from both the normal and the pathological point of view, and more particularly during fetal development and during tumor processes.
BIBLIOGRAPHIC REFERENCES
[0287]Benit L. et al., 1997. Cloning of a new murine endogenous retrovirus, MuERV-L, with strong similarity to the human HERV-L element and with a gag coding sequence closely related to the Fv1 restriction gene. J. Virol. 71, 5652-5657.
[0288]Coffin J. M. 1985. Endogenous retrovirus. In: "RNA tumor viruses" (Weiss R. A., Varmus H. E., Teich N. M., and Coffin J. M. eds), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
[0289]Conrad B., Weissmahr R. N., Boni J., Arcari R., Schupbach J., and Mach B. 1997. A human endogenous retroviral superantigen as candidate autoimmunogene in type 1 diabetes. Cell 90, 303-313.
[0290]Covey S. N. 1986. Amino acid sequence homology in gag region of reverse transcribing elements and the coat protein gene of cauliflower mosaic virus. Nucleic Acids Res. 14, 623-633.
[0291]Hertig C., Coupar B. E., Gould A. R., and Boyle D. B. 1997. Field and vaccine strains of fowlpox virus carry integrated sequences from the avian retrovirus, reticuloendotheliosis virus. Virology 235, 367-376.
[0292]Hohenadl C., Leib-Mosch C., Hehlemann R., and Erfle Y. 1996. Biological significance of human endogenous retroviral sequences. J. Acquir. Immune Defic. Syndr. Hum. Retrovirol. 13, S268-S273.
[0293]Kulkoski J. K., Jones S., Katz R. A., Mack J. P. G., and Skalka A. M. 1992. Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases. Mol. Cell. Biol. 12, 2331-2338.
[0294]La Mantia G. et al., 1991. Nucleic Acids Res. 19 (7), 1513-1520.
[0295]Patience C., Wilkinson D. A., and Weiss R. A. 1997. Our retroviral heritage. Trends Genet. 13, 116-120.
[0296]Pearson W. R. 1994. Using the FASTA program to search protein and DNA sequence databases. Methods Mol. Biol. 24, 307-331.
[0297]Perron H., Garson J. A., Bedin F., Beseme F., Paranhos-Baccala G., Komurian-Pradel F., Mallet F., Tuke P. W., Voisset C., Blond J. L., Lalande B., Seigneurin J. M., Mandrand B. and the Collaborative Research Group on Multiple Sclerosis. 1997. Molecular identification of a novel retrovirus repeatedly isolated from patients with multiple sclerosis. Proc. Natl. Acad. Sci. USA 94, 7583-7588.
[0298]Tonjes R. R. et al., 1996. J. AIDS and Hum. Retrovirol. 13, S261-S267.
[0299]Vitelli R., Chiarillo M., Lattero D., Bruni C. B., and Bucci C. 1996. Molecular cloning and expression analysis of the human Rab7 GTP-ase complementary deoxyribonucleic acid. Biochem. Biophys. Res. Commun. 229, 887-890.
[0300]Weber L. T., Miller M., Jaskolski M., Leis J., Skalka M., and Wlodawer A. 1989. Molecular modeling of the HIV-1 protease and its substrate binding site. Science 243, 928-931.
[0301]Wilkinson D., Mager D. L., and Leong J. A. C. 1994. Endogenous human retroviruses. In: "The Retroviridae" (Levy J. A. ed), Plenum Press, New York, Vol. 3, 465-535.
[0302]Xiong Y., and Eickbush T. 1990. Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 9, 3353-3362.
[0303]As is evident from the above, the invention is not at all limited to its embodiments, implementations and applications which have just been described more explicitly; it embraces on the contrary all the variants which may occur to a specialist in this field, without departing from the framework or scope of the present invention.
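In the sequence listing that follows, each row carries groups of ten bases with running position counters interleaved between them (for example `ttaccctgga 60gaagactggc`). A minimal Python sketch for recovering a contiguous sequence from such rows is given here; the function name `clean_sequence_rows` is illustrative, and the sketch assumes it is given base rows only, not the header rows that name the sequence number, length and organism.

```python
import re


def clean_sequence_rows(rows: list[str]) -> str:
    """Join listing rows into one sequence, dropping counters and spaces.

    Raises ValueError if anything other than a, c, g, t or n remains,
    which is what happens when a header row is passed in by mistake.
    """
    joined = re.sub(r"[\d\s]+", "", "".join(rows)).lower()
    unexpected = set(joined) - set("acgtn")
    if unexpected:
        raise ValueError(f"non-sequence characters found: {sorted(unexpected)}")
    return joined
```

The length of the returned string can then be compared with the length declared for the corresponding SEQ ID NO as a consistency check.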
Sequence CWU
1
Number of SEQ ID NOs: 143
SEQ ID NO: 1 (2599 nucleotides, DNA, Homo sapiens)
atcccctgcc ttaatcgcca agctccttca ggagaacaaa
gaacaggcca ttaccctgga 60gaagactggc aactgatttt acccacaagc ccaaacctca
gggatttcag tatctactag 120tctgggtaga tactttcacg ggttgggcag aggccttccc
ctgtaggaca gaaaaggccc 180aagaggtaat aaaggcacta gttcatgaaa taattcccag
attcggactt ccccgaggct 240tacagagtga caatagccct gctttccagg ccacagtaac
ccagggagta tcccaggcgt 300taggtatacg atatcactta cactgcgcct gaaggccaca
gtcctcaggg aaggtcgaga 360aaatgaatga aacactcaaa ggacatctaa aaaagcaaac
ccaggaaacc cacctcacat 420ggcctgctct gttgcctata gccttaaaaa gaatctgcaa
ctttccccaa aaagcaggac 480ttagcccata cgaaatgctg tatggaaggc ccttcataac
caatgacctt gtgcttgacc 540caagacagcc aacttagttg cagacatcac ctccttagcc
aaatatcaac aagttcttaa 600aacattacaa ggaacctatc cctgagaaga gggaaaagaa
ctattccacc cttgtgacat 660ggtattagtc aagtcccttc cctctaattc cccatcccta
gatacatcct gggaaggacc 720ctacccagtc attttatcta ccccaactgc ggttaaagtg
gctggagtgg agtcttggat 780acatcacact tgagtcaaat cctggatact gccaaaggaa
cctgaaaatc caggagacaa 840cgctagctat tcctgtgaac ctctagagga tttgcgcctg
ctcttcaaac aacaaccagg 900aggaaagtaa ctaaaatcat aaatccccat ggccctccct
tatcatattt ttctctttac 960tgttctttta ccctctttca ctctcactgc accccctcca
tgccgctgta tgaccagtag 1020ctccccttac caagagtttc tatggagaat gcagcgtccc
ggaaatattg atgccccatc 1080gtataggagt ctttctaagg gaacccccac cttcactgcc
cacacccata tgccccgcaa 1140ctgctatcac tctgccactc tttgcatgca tgcaaatact
cattattgga caggaaaaat 1200gattaatcct agttgtcctg gaggacttgg agtcactgtc
tgttggactt acttcaccca 1260aactggtatg tctgatgggg gtggagttca agatcaggca
agagaaaaac atgtaaaaga 1320agtaatctcc caactcaccc gggtacatgg cacctctagc
ccctacaaag gactagatct 1380ctcaaaacta catgaaaccc tccgtaccca tactcgcctg
gtaagcctat ttaataccac 1440cctcactggg ctccatgagg tctcggccca aaaccctact
aactgttgga tatgcctccc 1500cctgaacttc aggccatatg tttcaatccc tgtacctgaa
caatggaaca acttcagcac 1560agaaataaac accacttccg ttttagtagg acctcttgtt
tccaatctgg aaataaccca 1620tacctcaaac ctcacctgtg taaaatttag caatactaca
tacacaacca actcccaatg 1680catcaggtgg gtaactcctc ccacacaaat agtctgccta
ccctcaggaa tattttttgt 1740ctgtggtacc tcagcctatc gttgtttgaa tggctcttca
gaatctatgt gcttcctctc 1800attcttagtg ccccctatga ccatctacac tgaacaagat
ttatacagtt atgtcatatc 1860taagccccgc aacaaaagag tacccattct tccttttgtt
ataggagcag gagtgctagg 1920tgcactaggt actggcattg gcggtatcac aacctctact
cagttctact acaaactatc 1980tcaagaacta aatggggaca tggaacgggt cgccgactcc
ctggtcacct tgcaagatca 2040acttaactcc ctagcagcag tagtccttca aaatcgaaga
gctttagact tgctaaccgc 2100tgaaagaggg ggaacctgtt tatttttagg ggaagaatgc
tgttattatg ttaatcaatc 2160cggaatcgtc actgagaaag ttaaagaaat tcgagatcga
atacaacgta gagcagagga 2220gcttcgaaac actggaccct ggggcctcct cagccaatgg
atgccctgga ttctcccctt 2280cttaggacct ctagcagcta taatattgct actcctcttt
ggaccctgta tctttaacct 2340ccttgttaac tttgtctctt ccagaatcga agctgtaaaa
ctacaaatgg agcccaagat 2400gcagtccaag actaagatct accgcagacc cctggaccgg
cctgctagcc cacgatctga 2460tgttaatgac atcaaaggca cccctcctga ggaaatctca
gctgcacaac ctctactacg 2520ccccaattca gcaggaagca gttagagcgg tctcggccaa
cctccccaac agcacttagg 2580ttttcctgtt gagatgggg
2599
SEQ ID NO: 2 (1326 nucleotides, DNA, Homo sapiens)
gccgcctggc actcctgagg
gaagtataaa ttataacacc atcttacagc tagacctctt 60ttgtagaaaa ggcaaatgga
gtgaagtgcc ataagtacaa actttctttt cattaagaga 120caactcacaa ttatgtaaaa
agtgtgattt atgccctaca ggaagccttc agagtctacc 180tccctatccc agcatccccg
actccttccc caactaataa ggacccccct tcaacccaaa 240tggtccaaaa ggagatagac
aaaagggtaa acagtgaacc aaagagtgcc aatattcccc 300aattatgacc cctccaagca
gtgggaggaa gagaattcgg cccagccaga gtgcatgtgc 360ctttttctct cccagactta
aagcaaataa aaacagactt aggtaaattc tcagataacc 420ctgatggcta tattgatgtt
ttacaagggt taggacaatt ctttgatctg acatggagag 480atataatgtc actgctaaat
cagacactaa ccccaaatga gagaagtgcc accataactg 540cagcctgaga gtttggcgat
ctctggtatc tcagtcaggt caatgatagg atgacaacag 600aggaaagaga atgattcccc
acaggccagc aggcagttcc cagtctagac cctcattggg 660acacagaatc agaacatgga
gattggtgct gcagacattt gctaacttgt gtgctagaag 720gactaaggaa aactaggaag
aagtctatga attactcaat gatgtccacc ataacacagg 780gaagggaaga aaatcctact
gcctttctgg agagactaag ggaggcattg aggaagcgtg 840cctctctgtc acctgactct
tctgaaggcc aactaatctt aaagcgtaag tttatcactc 900agtcagctgc agacattaga
aaaaaacttc aaaagtctgc cgtaggcccg gagcaaaact 960tagaaaccct attgaacttg
gcaacctcgg ttttttataa tagagatcag gaggagcagg 1020cggaacagga caaacgggat
taaaaaaaag gccaccgctt tagtcatgac cctcaggcaa 1080gtggactttg gaggctctgg
aaaagggaaa agctgggcaa attgaatgcc taatagggct 1140tgcttccagt gcggtctaca
aggacacttt aaaaaagatt gtccaagtag aagtaagccg 1200ccccctcgtc catgcccctt
atttcaaggg aatcactgga aggcccactg ccccagggga 1260caaaggtcct ctgagtcaga
agccactaac cagatgatcc agcagcagga ctgagggtgc 1320ctgggg
1326
SEQ ID NO: 3 (10499 nucleotides, DNA, Homo sapiens)
ccctggggcg ggcttccttt ctgggatgag ggcaaaacgc ctggagatac agcaattatc
60ttgcaactga gagacaggac tagctggatt tcctaggccg actaagaatc cctaagccta
120gctgggaagg tgaccacgtc cacctttaaa cacggggctt gcaacttagc tcacacctga
180ccaatcagag agctcactaa aatgctaatt aggcaaagac aggaggtaaa gaaatagcca
240atcatctatt gcctgagagc acagcaggag ggacaacaat cgggatataa acccaggcat
300tcgagctggc aacagcagcc cccctttggg tcccttccct ttgtatggga gctgttttca
360tgctatttca ctctattaaa tcttgcaact gcactcttct ggtccatgtt tcttacggct
420cgagctgagc ttttgctcac cgtccaccac tgctgtttgc caccaccgca gacctgccgc
480tgactcccat ccctctggat cctgcagggt gtccgctgtg ctcctgatcc agcgaggcgc
540ccattgccgc tcccaattgg gctaaaggct tgccattgtt cctgcacggc taagtgcctg
600ggtttgttct aattgagctg aacactagtc actgggttcc atggttctct tctgtgaccc
660acggcttcta atagaactat aacacttacc acatggccca agattccatt ccttggaatc
720cgtgaggcca agaactccag gtcagagaat acgaggcttg ccaccatctt ggaagcggcc
780tgctaccatc ttggaagtgg ttcaccacca tcttgggagc tctgtgagca aggacccccc
840ggtaacattt tggcaaccac gaacggacat ccaaagtggt gagtaatatt ggaccacttt
900cacttgctat tctgtcctat ccttccttag aattggagga aaataccggg cacttgtcgg
960ccagttaaaa acgattagtg tggccaccgg acttaagact caggtgtgag gctatctggg
1020gaagggcttt ctaacaaccc ccaacccttc tgggttgggg acttggtttg cctcaagcca
1080gcttccactt tcagttttct tggggaagcc gagggccgac tagaggcaga aagctgtcgt
1140cctgaactcc cggcagtagc cggttgagat catggtgtag ccagaagtct caacagtcgc
1200ccatgcatgc acccctatct ttccttctga cccatacctc ctgggtccca accacaactt
1260tcttcaaagt gtagccccaa aattctcctt acctctgaat atacttcctc tgatccctgc
1320ctcctaggta ctattggttc agacttccat ttcctctagc aagttgtatc tccaaaggga
1380tctaaggaag ctctgcgctg cgtccttagg cacctaggct ataacccagg gagtcttatc
1440cctggtgtcc ctcccaattt aggcatacag ctcttgacat gggcagttat gtaggaccca
1500ctccccacca cccttgccag ggccccaagt ttgtaaatgg ctgagggaaa agagagacag
1560aggagagaga gagaaatgga ggagaaagag agagagacag agaggagaga gagacagtga
1620gagagacaga agagagagag agacaaagag gagagagaga gagtcaaaga gagaaagaaa
1680gagaaagaaa tagtaaaaaa cagtgtgccc tattccttta aaagccaggg taaatttaaa
1740acctgtactt gataattgaa ggtcttctct gtgaccctat agcactccaa tccactttgt
1800ggtcagtgta aataagagca taggccgaaa gcactgaggc cattgacaac ccgtagcttc
1860cctatcaaaa atccttaacc cagtaacccg cagatggacc aaatgcattc agtcggtagc
1920gcaactgctt tgctaaaagt agaaaagtaa cttttagagg aaacctcatt gtgagcacac
1980ctcacctgtt cagaattatt ctaataaaaa aagcaaaaag gtagcttact aactcaaaaa
2040tcttaaagta tggggctatt ctgttagaaa aaggtaatgt aactccaacc actgataatt
2100cccttaaccc agcagatttc ctaacgggat ttaaatctta attaccatac aaaggtccga
2160ccagacctag gcggaactcc cttcaggaca ggacgataga tggttcctcc caggtgattg
2220aggaaaaaaa ccacaatggg tattcagtaa ttgatacggg gactcttgtg gaagcagagt
2280tagaaaaatt gcctaataac tggtctcctc aaacgtgtga gctgtttgca ctcagccaag
2340ccttaaagta cttacagaat caaaagacta tctcaatcct gattcaaaag gttagctaca
2400ccctctctgt aatgcatttg cataagaact tgtttatggg aatgcatctt gatggggcag
2460ctgggttgtt ataaaatagg aacccagccc agctctagga ctcacccctg agcgcaaagg
2520caatgttggg catgctggta aaggaccact agaatccagc agcccagacc cctttctttg
2580tggtcaagaa aggcgggaaa aggggtgcag gactgctaca tcggtaagca taactaatcc
2640gataaacaga ggtccatggg tggttacgca ccctggaaag gaactcaccc ctgagcacaa
2700aggcaatgtt gggcacgctg gtaaaggacc actagaatcc agcagcctgg acccctttct
2760ttgtggtcaa gagaggcagg aaaacaggtg caggactgca acatcagtga gcataactaa
2820ttcgataagc agaggtccat gggtggtgat gcaccctgga aagaataagc attaggacca
2880tagaggacac tccaggacta aagctcatcg gaaaatgact agggttgctg gcatccctat
2940gttctttttt cagatgggaa acgttccccg caagacaaaa acgcccctaa gacgtattct
3000ggagaattgg gaccaatttg accctcagac actaagaaag aaacgactta tattcttctg
3060cagtgccgcc tggcactcct gagggaagta taaattataa caccatctta cagctagacc
3120tcttttgtag aaaaggcaaa tggagtgaag tgccataagt acaaactttc ttttcattaa
3180gagacaactc acaattatgt aaaaagtgtg atttatgccc tacaggaagc cttcagagtc
3240tacctcccta tcccagcatc cccgactcct tccccaacta ataaggaccc cccttcaacc
3300caaatggtcc aaaaggagat agacaaaagg gtaaacagtg aaccaaagag tgccaatatt
3360ccccaattat gacccctcca agcagtggga ggaagagaat tcggcccagc cagagtgcat
3420gtgccttttt ctctcccaga cttaaagcaa ataaaaacag acttaggtaa attctcagat
3480aaccctgatg gctatattga tgttttacaa gggttaggac aattctttga tctgacatgg
3540agagatataa tgtcactgct aaatcagaca ctaaccccaa atgagagaag tgccaccata
3600actgcagcct gagagtttgg cgatctctgg tatctcagtc aggtcaatga taggatgaca
3660acagaggaaa gagaatgatt ccccacaggc cagcaggcag ttcccagtct agaccctcat
3720tgggacacag aatcagaaca tggagattgg tgctgcagac atttgctaac ttgtgtgcta
3780gaaggactaa ggaaaactag gaagaagtct atgaattact caatgatgtc caccataaca
3840cagggaaggg aagaaaatcc tactgccttt ctggagagac taagggaggc attgaggaag
3900cgtgcctctc tgtcacctga ctcttctgaa ggccaactaa tcttaaagcg taagtttatc
3960actcagtcag ctgcagacat tagaaaaaaa cttcaaaagt ctgccgtagg cccggagcaa
4020aacttagaaa ccctattgaa cttggcaacc tcggtttttt ataatagaga tcaggaggag
4080caggcggaac aggacaaacg ggattaaaaa aaaggccacc gctttagtca tgaccctcag
4140gcaagtggac tttggaggct ctggaaaagg gaaaagctgg gcaaattgaa tgcctaatag
4200ggcttgcttc cagtgcggtc tacaaggaca ctttaaaaaa gattgtccaa gtagaagtaa
4260gccgccccct cgtccatgcc ccttatttca agggaatcac tggaaggccc actgccccag
4320gggacaaagg tcctctgagt cagaagccac taaccagatg atccagcagc aggactgagg
4380gtgcctgggg caagcgccat cccatgccat caccctcaca gagccctggg tatgcttgac
4440cattgagggc caggaggttg tctcctggac actggtgcgg tcttcttagt cttactcttc
4500tgtcccggac aactgtcctc cagatctgtc actatctgag ggggtcctaa gacgggcagt
4560cactagatac ttctcccagc cactaagtta tgactgggga gctttattct tttcacatgc
4620ttttctaatt atgcttgaaa gccccactac cttgttaggg agagacattc tagcaaaagc
4680aggggccatt atacacctga acataggaga aggaacaccc gtttgttgtc ccctgcttga
4740ggaaggaatt aatcctgaag tctgggcaac agaaggacaa tatggacgag caaagaatgc
4800ccgtcctgtt caagttaaac taaaggattc cacctccttt ccctaccaaa ggcagtaccc
4860cctcagaccc aaggcccaac aaggactcca aaagattgtt aaggacctaa aagcccaagg
4920cctagtaaaa ccatgcagta acccctgcag tactccaatt ttaggagtac agaaacccaa
4980cagacagtgg aggttagtgc aagatctcag gattatcaat gaggctgttg ttcctctata
5040gccagctgta cctagccctt atactctgct ttcccaaata ccagaggaag cagagtggtt
5100tacagtcctg gaccttcagg atgccttctt ctgcatccct gtacatcctg actctcaatt
5160cttgtttgcc tttgaagata cttcaaaccc aacatctcaa ctcacctgga ctattttacc
5220ccaagggttc agggatagtc cccatctatt tggccaggca ttagcccaag acttgagcca
5280atcctcatac ctggacactt gtccttcggt aggtggatga tttacttttg gccgcccatt
5340cagaaacctt gtgccatcaa gccacccaag cgctcttcaa tttcctcgct acctgtggct
5400acatggtttc caaaccaaag gctcaactct gctcacagca ggttacttag ggctaaaatt
5460atccaaaggc accagggccc tcagtgagga acacatccag cctatactgg cttatcctca
5520tcccaaaacc ctaaagcaac taaggggatt ccttggcgta ataggtttct gccgaaaatg
5580gattcccagg tatggcgaaa tagccaggtc attaaataca ctaattaagg aaactcagaa
5640agccaatacc catttagtaa gatggacaac tgaagtagaa gtggctttcc aggccctaac
5700ccaagcccca gtgttaagtt tgccaacagg gcaagacttt tcttcatatg tcacagaaaa
5760aacaggaata gctctaggag tccttacaca gatccgaggg atgagcttgc aacctgtggc
5820atacctgact aaggaaattg atgtagtggc aaagggttga cctcattgtt tacgggtagt
5880ggtggcagta gcagtcttag tatctgaagc agttaaaata atacagggaa gagatcttac
5940tgtgtggaca tctcatgatg tgaatggcat actcactgct aaaggagact tgtggctgtc
6000agacaactgt ttacttaaat gtcaggctct attacttgaa gggccagtgc tgcgactgtg
6060cacttgtgca actcttaacc cagccacatt tcttccagac aatgaagaaa agataaaaca
6120taactgtcaa caagtaattt ctcaaaccta tgccactcga ggggaccttt tagaggttcc
6180tttgactgat cccgacctca acttgtatac tgatggaagt tcctttgtag aaaaaggact
6240tcgaaaagtg gggtatgcag tggtcagtga taatggaata cttgaaagta atcccctcac
6300tccaggaact agtgctcagc tagcagaact aatagccctc acttgggcac tagaattagg
6360agaagaaaaa agggcaaata tatatacaga ctctaaatat gcttacctag tcctccatgc
6420ccatgcagca atatggaaag aaagggaatt cctaacttct gagagaacac ctatcaaaca
6480tcaggaagcc attaggaaat tattattggc tgtacagaaa cctaaagagg tggcagtctt
6540acactgccgg ggtcatcaga aaggaaagga aagggaaata gaagagaact gccaagcaga
6600tattgaagcc aaaagagctg caaggcagga ccctccatta gaaatgctta taaaacaacc
6660cctagtatag ggtaatcccc tccgggaaac caagccccag tactcagcag gagaaacaga
6720atggggaacc tcacgaggac agttttctcc cctcgggacg gctagccact gaagaaggga
6780aaatactttt gcctgcaact atccaatgga aattacttaa aacccttcat caaacctttc
6840acttaggcat cgatagcacc catcagatgg ccaaatcatt atttactgga ccaggccttt
6900tcaaaactat caagcagata gtcagggcct gtgaagtgtg ccagagaaat aatcccctgc
6960cttatcgcca agctccttca ggagaacaaa gaacaggcca ttaccctgga gaagactggc
7020aactgatttt acccacaagc ccaaacctca gggatttcag tatctactag tctgggtaga
7080tactttcacg ggttgggcag aggccttccc ctgtaggaca gaaaaggccc aagaggtaat
7140aaaggcacta gttcatgaaa taattcccag attcggactt ccccgaggct tacagagtga
7200caatagccct gctttccagg ccacagtaac ccagggagta tcccaggcgt taggtatacg
7260atatcactta cactgcgcct gaaggccaca gtcctcaggg aaggtcgaga aaatgaatga
7320aacactcaaa ggacatctaa aaaagcaaac ccaggaaacc cacctcacat ggcctgctct
7380gttgcctata gccttaaaaa gaatctgcaa ctttccccaa aaagcaggac ttagcccata
7440cgaaatgctg tatggaaggc ccttcataac caatgacctt gtgcttgacc caagacagcc
7500aacttagttg cagacatcac ctccttagcc aaatatcaac aagttcttaa aacattacaa
7560ggaacctatc cctgagaaga gggaaaagaa ctattccacc cttgtgacat ggtattagtc
7620aagtcccttc cctctaattc cccatcccta gatacatcct gggaaggacc ctacccagtc
7680attttatcta ccccaactgc ggttaaagtg gctggagtgg agtcttggat acatcacact
7740tgagtcaaat cctggatact gccaaaggaa cctgaaaatc caggagacaa cgctagctat
7800tcctgtgaac ctctagagga tttgcgcctg ctcttcaaac aacaaccagg aggaaagtaa
7860ctaaaatcat aaatccccat ggccctccct tatcatattt ttctctttac tgttctttta
7920ccctctttca ctctcactgc accccctcca tgccgctgta tgaccagtag ctccccttac
7980caagagtttc tatggagaat gcagcgtccc ggaaatattg atgccccatc gtataggagt
8040ctttctaagg gaacccccac cttcactgcc cacacccata tgccccgcaa ctgctatcac
8100tctgccactc tttgcatgca tgcaaatact cattattgga caggaaaaat gattaatcct
8160agttgtcctg gaggacttgg agtcactgtc tgttggactt acttcaccca aactggtatg
8220tctgatgggg gtggagttca agatcaggca agagaaaaac atgtaaaaga agtaatctcc
8280caactcaccc gggtacatgg cacctctagc ccctacaaag gactagatct ctcaaaacta
8340catgaaaccc tccgtaccca tactcgcctg gtaagcctat ttaataccac cctcactggg
8400ctccatgagg tctcggccca aaaccctact aactgttgga tatgcctccc cctgaacttc
8460aggccatatg tttcaatccc tgtacctgaa caatggaaca acttcagcac agaaataaac
8520accacttccg ttttagtagg acctcttgtt tccaatctgg aaataaccca tacctcaaac
8580ctcacctgtg taaaatttag caatactaca tacacaacca actcccaatg catcaggtgg
8640gtaactcctc ccacacaaat agtctgccta ccctcaggaa tattttttgt ctgtggtacc
8700tcagcctatc gttgtttgaa tggctcttca gaatctatgt gcttcctctc attcttagtg
8760ccccctatga ccatctacac tgaacaagat ttatacagtt atgtcatatc taagccccgc
8820aacaaaagag tacccattct tccttttgtt ataggagcag gagtgctagg tgcactaggt
8880actggcattg gcggtatcac aacctctact cagttctact acaaactatc tcaagaacta
8940aatggggaca tggaacgggt cgccgactcc ctggtcacct tgcaagatca acttaactcc
9000ctagcagcag tagtccttca aaatcgaaga gctttagact tgctaaccgc tgaaagaggg
9060ggaacctgtt tatttttagg ggaagaatgc tgttattatg ttaatcaatc cggaatcgtc
9120actgagaaag ttaaagaaat tcgagatcga atacaacgta gagcagagga gcttcgaaac
9180actggaccct ggggcctcct cagccaatgg atgccctgga ttctcccctt cttaggacct
9240ctagcagcta taatattgct actcctcttt ggaccctgta tctttaacct ccttgttaac
9300tttgtctctt ccagaatcga agctgtaaaa ctacaaatgg agcccaagat gcagtccaag
9360actaagatct accgcagacc cctggaccgg cctgctagcc cacgatctga tgttaatgac
9420atcaaaggca cccctcctga ggaaatctca gctgcacaac ctctactacg ccccaattca
9480gcaggaagca gttagagcgg tctcggccaa cctccccaac agcacttagg ttttcctgtt
9540gagatggggg actgagagac aggactagct ggatttccta ggctgactaa gaatccctaa
9600gcctagctgg gaaggtgacc acatccacct ttaaacacgg ggcttgcaac ttagctcaca
9660cctgaccaat cagagagctc actaaaatgc taattaggca aagacaggag gtaaagaaat
9720agccaatcat ctattgcctg agagcacagc aggagggaca atgatcggga tataaaccca
9780agtcttcgag ccggcaacgg caaccccctt tgggtcccct ccctttgtat gggagctctg
9840ttttcatgct atttcactct attaaatctt gcaactgcac tcttctggtc catgtttctt
9900acggcttgag ctgagctttc gctcgccatc caccactgct gtttgccgcc accgcagacc
9960cgccgctgac tcccatccct ctggatcatg cagggtgtcc gctgtgctcc tgatccagcg
10020aggcacccat tgccgctccc aatcgggcta aaggcttgcc attgttcctg catggctaag
10080tgcctgggtt catcctaatt gagctgaaca ctagtcactg ggttccatgg ttctcttctg
10140tgacccacag cttctaatag agctataaca ctcaccgcat ggcccaaggt tccattcctt
10200gaatccataa ggccaagaac cccaggtcag agaacacgag gcttgccacc atcttgggag
10260ctctgtgagc aaggaccccc aagtaacaca accatgaggg tgcaaatgca tgggccacta
10320atggtagagc aagaaaacag aagggccctg gttcctcgaa ggcatcagtg agctgaaatg
10380cctgccctgg atgtcctatt cctaggtgtt tttctgcctg aagcagatta aaccctttgt
10440tcacttctcc aagtagggct tctattacag cccaaatcaa tccccacccc agatgacat
10499
SEQ ID NO: 4 (2784 nucleotides, DNA, Homo sapiens)
ctccttcagg agaacaaaga acaggccact acccaagaga
agactggcaa ctagatttta 60cccatatgcc caaatctcag ggatttcagt atctactagt
ttgggtagat actttcactg 120gttgggcaga ggccttcccc tgtaggacag aaaaggccca
agaggtaata aacgttcatg 180aaataattcc cagattcgga cttccccaag gcttacagag
tgacaatggc cctgctttca 240aggctacagt aacccaagga gtatcccagg tgttaggtat
acaatatcac tcacactgcg 300cctggaggcc acagtcctca ggaaaggtgg agaaaatgaa
caaaacactc aaatgacatc 360taaaaaagct aatccaggaa acccacctcg catggcctgc
tctgttgcct atagccttac 420taagaatccg aaactctccc caaaaagcag gacttagtcc
atacaaaatg ctgtatggac 480ggcccttcct aaccaatgaa cttgggcttg accgagagac
agccaactta gttgcagaca 540tcatctcctt agccaaatat caacaggttc ttaaaacatt
acagggagcc tgtccccaag 600aagagggaaa ggaactattc caccctggtg acatggtatt
agtcaagtcc cttccctcta 660attccccatc cctagataca tcctgggaag gaaactaccc
agccatttta tctaccctaa 720cggcagttaa agtggctgga gcggagtctt ggatacatca
cactcaagtc aaaccctgga 780tactgccaaa ggaactcaaa aatccatgag acaatgctag
ctattcctgt gaacctctag 840aggatctgcg cctgctcttc aaatgacaac cagggggaaa
gtaactaaaa tcgtaaatcc 900cctggccctc ccttatcata tttttctctt tactgttctc
ttaccccctt tcactctcac 960tgcaccccgt ccatgccact gcaccccgtc catgccccgt
ccatgccagt agctcccctt 1020agcaagagtt tctatggaga atgcagcgtc ccggaaatat
tgatgcccca ttgtatagga 1080gtttatctaa gggaaccccc accttcactg cccacaccca
tatgccccac aactgctata 1140actctgccac tctttgcatg catgcaaata ctcattattg
gacaggaaaa acgattaatc 1200ccagttgtcc tggaggactt ggaggactca cttcactcat
accagtatgt ctgatggggg 1260tggagttcaa gatcaggcaa cagaaaaaca cataaaggaa
gtaatctccc aactgacctg 1320ggtacatagc acccctggcc cctacaaagg actagatctc
tcaaaactac atgaaaccct 1380ccatacccat actggcctgg taagcctatt taataccacc
ctgactgggc tccatgaggt 1440ctcggcccaa aaccctacta actgttggat gtgcctcccc
ctgcacttta ggccatacat 1500ttcaatccct atacctgaac aatggaacaa cttcagcaca
gaaataaaca ccacttctgt 1560tttagtaggt cctctttcca atctggaaat aacccatacc
tcaaacctca cctgtgtaaa 1620atttagcaat actatagaca cagccaactc ccaatgcatc
aggtgggtaa ctcctcccac 1680acgaatagtc tgcctaccct caggaatatt ttttgtctgt
ggtacctcag cctatcattg 1740tttgaatggc tcttcagaat ctgtgtgctt cctctcattc
ttagtggccc ctatgcccat 1800ctacactgaa caagatttat acaatcatgt catacctaag
ccccgcaaca aaagagtacc 1860cattcttcct tttgttattg gagcaggagt gctaggcgga
gtagctactg gcattggcgg 1920tatcacaacc tctactcagt tctactacaa actgtctcaa
gaactaaatg gtgacatgga 1980atgggtcgct gataccctgg tcaccttgca agatcaactt
aactccctag cagcagtagt 2040ccttcaaaat cgaagagctt tagacttgct aaccgcggaa
agcgggggaa cctttttatt 2100tttagaggaa aaatgctgtt gttatgttaa tcaatccgga
atcatcaccg agaaagttaa 2160agaaattcaa ggtcgaatat aacgtagagc aaaggagctg
caaaacactg gaccctgggg 2220cctcctcagc caatggatgc cctggattct ccccttctta
ggacctctag cagctataat 2280attgttactc ctctttggac cctgtatctt taacctcctt
gttaagtttg tcttttccag 2340aatcgaagca gtaaaactac aaatcgttct tcaaatggag
ccccagatgc agtccatgag 2400taaaatctac cacggacccc tggaccggcc tgctagccca
tgctctgatg ttaatgacat 2460caaaggcacc cctcccgagg aaatctcaac tgcacaacct
ctactacgcc ccaattcagc 2520aggaagcagt tagagtggtt gttggccaac ctccccaaca
gcagttgggt tttcctgttg 2580agagggggga ctgagagaca ggaataacta gatttcctag
accaactaag aatccctaag 2640actagctggg aaggtgaccg cttccacctt taaacaccgg
gcttgcaact tagctcacgc 2700ccaaccaatc agatactaaa gagagctcac taaaatgcta
attaggcaaa aacaggagat 2760aaagaaatag ccaatcatct gttg
2784
5 1799 DNA Homo sapiens 5
gggattctta gtcggcctag
gaaatccagc taatcctgtc tctcagtccc cccactcaac 60aggaaaaccc aagtgctgtt
ggggaggttg gctgacgacc agtctaactg cttcctgcgg 120aattggggca tagtaggggt
tgtgcagttg agatttcctc gggaggggtg cgttcgatat 180cattacaatt ggagcatggg
ctagtaggcc ggtccagggg tccacggtag atcttagtca 240tggacttcat ctggggttcc
atttgaagaa cgatttgtag ctttacaact ttgattctgg 300aagagacaaa cttaacaagg
aggttaaaga tacagggtcc aaagaggagt atcaatatta 360gagctgctag agatcctaag
aaggggagaa tccagggcat ccattggctg aggaggcccc 420agggtctggt gtttttgaag
ctcctctgtt ctacgttgta ttcaatctcg aatttcttca 480actttctctg tgacaattca
ggattgatta acataataac aacattcttc cgctaaaata 540acataataac aacattcttc
ccctaaaaat aaacagcttc cccctctttc agaggttagc 600aagtctaaag ctcttcaatt
ttgaaggact actgatgcta ggaagttaag ttgatcttgc 660aaggtgacca gggagtcggc
aacccattcc atgtcaccat tgagttcttg agatagtttg 720tagtagaact gagtagaggt
tgtggtaccg ccaatgccag aacctagtcc acctagcact 780cctgctccga taacaaaagg
aagaatgagt actcttttgt tgtggggctt aggtacaaca 840taattgtata aatcttgttc
agtgtaaatg gtcatggggg cactaagaat gagaggaagc 900acatagattc tgaagagcca
ttcaaacaac gataggctaa ggtaccacag acaaaaaata 960ttcctgaggg taggcagact
attcgtgtgg gaggagttac ccacctgatg cattgggagt 1020tggttgtgtc tacagtattg
ctaaatttta cacaggtgag gtttgaggta tgggttattt 1080ccagattgga aacaagaggt
cctactaaaa cggaagtggt gtttatttct gtgctgtagt 1140tgttccattg ttcaggtaca
gggattgaaa tgcatggcct gaaatacagg gggaggcaca 1200accaacagtt agtagggttt
tggaccgaga cctcatggag cccagtgagg gtggtattaa 1260ataggcttac caggcaagta
tgggtatgga gggtttcatg tagttttaag agatctagtc 1320ctttgtaggg gctaggggtg
ctatgtaccc gggtcagttg ggaggttact tcctttacat 1380gtttttctct tgcctgatct
tgaactccac ccccctcaga cataccagta tgggtgaagt 1440aagtccgaca gacagtggct
ccaagtcttc caggacaact aggattaatc attttccctg 1500tccaataatg agtatttgca
tgcatgcaaa gagtggcaga gttatagcag ttgtggggca 1560tatgggtgtg ggcagtgaag
gtggagtttc ctttaggtaa actcctattt gatggggcat 1620caatatttct gggaagccgc
attcttcata gaaactcttg gtaaggggag ctgctggttg 1680tacagcagca tggagggggt
gcagtgagag tgaaaggggg taagagaaca gtaaagagaa 1740aaatatgata agggagggcc
atggggattt acgattttag ttactttcct cacggttgt
1799
6 1489 DNA Homo sapiens 6
tggtgcttgc cccgggcact ctcagtcctg ctgctggatc atctggttag tggcttctga
60ctcagaggac ctacgtcccc tggggcagtg ggccttacag tgattccctt gacacgaggt
120gcatggacga gggggcggct tatttctatt tggacaatct tttttaaagt gtccttgtag
180accgcactgg aagcaaaccc tattaggcat ttgatttgcc tagcttttcc cttttccagt
240gcctccaaag tccgcttgcc tgagggccat gactaaagcg gtggcctttt ttttatccca
300tttgtcccat tctgcctgct catcctgatc tctattataa aaaactgagg ttgccaagtt
360caatagggtt tctaagtttt gttccgggcc taaggcagac ttttgaagtt ttttcctaat
420gtctgtagct gactgagtga taaacttatc ctttaagatt agttggcctt cagtagagtc
480agttgacaga gagaggtatg cttcctcaat gcctccgtta gtcactccag aaaggcggta
540ggattttctt cctttccctg tgttatagtg gacatcattg aataactcac aggcttcttt
600ctagttttcc ttagtccttc tagcacgcaa gttagcaaat gtctgcggca ccaatctcca
660tgttctgatt ctgtgtccca gtgagggtct acactgggaa ctgcctgctg gcctgtgggg
720aatcgttctc tttcctctgt tgtcgaccta tcattgacct gactgagata ccagagatcg
780ccaaactctc aggctgcagt tacggcgaca cttctgtcat ttggggttag tgtctgattt
840agcagtaaca ttatatctct ccatatcaga tcaaaggatt gtcctaaacc ttgtaaaaca
900tcaatatagc cattagggtt atctgagaat ttacctaggt ctattttaat ttaaagtctg
960ggagagaaaa aggcacatgc actctggctg ggccgaattc tcttcctccc actgcgtctg
1020agagagaaaa aggtacgtgc actctggctg ggccgaattc tcctcccacc gcttggaggg
1080ggcataatcg gggaatattg gcattctttg gttagttgtt tacccctttg tctatctcct
1140tttggaccgt ttgggttgaa ggggggtcct tattatttgg ggaaggagtc tgggggatgc
1200tggggtaggg aggtagactc tgagggcttc ctgtagggca taaatcacac tttttacata
1260attgcgagtt gtctcttaat gaaaagaaag tttgtacgta tgacacttca caccatttgc
1320cttcttttct acaaaagagg tctagctgta agatggtgtt ataatttatg cttccctcag
1380gatgccaggt ttctccccct taaagagtat atcgttgcca ggcggtactg cagaagaata
1440tgtctttttt ttcttagcat ctgagagtca aattggtccc aattctcca
1489
7 1216 DNA Homo sapiens 7
taaagataca gggattgaaa tgtatggcct gaagtgcagg
gtcatatagg tgtgggtggt 60gaaaatgggg tttcctttag aaaaactcct atacgatggg
tcatcaatat ttccaggaag 120ccgcattctc catagaagct cttggtaatg ggagctactg
gtagtacagt ggcatggagg 180gggtgcagtg agagtgaaag agggtaaaag aacagtaaag
agaaaaatat gataagggag 240gggttcagtg agagtgaaag ggggtaagag aacagtaaag
aaaaaaatat gacaaggagg 300gccatgagga tctacgattc tagttacttt cctcacggtt
gtcgcttgaa gagcaggtgc 360agatcctcta gaggttcaca ggaatagcta gcgttgtctc
ctggattttc gggttccttt 420ggcagtatac agagtttgac tcgagtgtga tgtattcaag
actccactcc agccacttta 480accgcagttg gggtagataa aatgactggg tagggtcctt
cccaggatgt atctaaggat 540ggggacttag aaggaaggga cttgactaat accatgtcac
cagggtgcaa taattacttt 600ccctcttctc gggaacaggt tccctgtaat gttttaagaa
cttgttgata tttggccaag 660gaggtgatgt ctgcaactaa gctggccatc tctcggtcaa
gcacaaggtc cttggttagg 720aagggccatc catacagcat tttgtatggg ctaagtcctg
ctttttgggg agagttttgg 780attcttagta aggctgtagg caacagagca ggccatgcaa
ggtgggtttc ttgggttagc 840ttttttaaat gtcgtttgag tgcttcattc attttcttga
cttttcctga ggattgtggc 900ctccacgcgc agtgtaagtg atattgtatg cctaatgcct
gggatactcc ctgggttact 960gtagccttga aaacggggcc attgtcactc tgtaagcctc
ggggaagtcc gaatctggga 1020attatttcat gaattagtgc ctttattaca tcttggtcct
tttctgtcct acaaaggaag 1080gcctctgccc aaccagtgaa aatatctacc cagactagta
gatactgaaa tccctgagat 1140ttgggcatgt gggtaaaatc tagttgccag tcttctcctg
agtaatggcc tgttctttgt 1200tctcctgaag gagctt
1216
8 976 DNA Homo sapiens 8
agtgataatg gaatacttga
aagtaatccc ctcactccag gaactagtgc tgagctggcc 60aaactaatag ccctcactcg
ggcactagaa ttaggagaag agaaaagggt aaatatatat 120acagactata agtatgctta
cctagtcctt catgcccatg cagcaatatg gagagaaagg 180gaattcctaa cttccaaagg
aacacctatc aaacatcagg aagccattag gatattatta 240ttggtggtac agaaacctaa
agaggtggca gtcctacact gctggggtca tcagaaaaaa 300aaggaaaggg aaatagaagg
gaactaccaa gcagatattg aagccaaaag agccgcaagg 360caggaccctc cattagaaat
gcttatagaa ggacccctag tgtggggtaa ccccctccag 420gaaagcaatc cccagtactc
agcaggagaa ataaaatgga gaacctcacg aggacatact 480ttcctcccct caggatggct
agccaccaaa gaaggaaaaa tgcttttgcc tgcagctaac 540caatggaaat tacttaaaac
ccttcaccaa acctttcact taggattgat agcacccatc 600agatggccaa attattattt
actggatcag gccttttcaa aactatcaag caggtagtca 660gggcctgtaa agtgtgccaa
agaaataatc tcctgcactg caagccatac atttcaatcc 720ctgtatcttt aacctccttg
ttaagtttgt ctcttccaga atcaaagctg taaaactaca 780aatggttctt caaatggagt
ctcagatgca gtccatgact aagatatacc gcagccccct 840ggagggggcc tgctagccca
tgctccaatg ttaatgacat cgaaggcacc cctcccgggg 900aaatctcaac tgcacaaccc
ctactatgtc ccaattcagc aggaagcagt taaagcggtc 960atcggccaac ctcccc
976
9 942 DNA Homo sapiens 9
agaggagaac agcagcataa gcggctggca gaggtaggga aagaccagca agaagaaaag
60agagaaagag aaagagaaag tcagagaaag agacagagag aggaagagac aaagagacag
120aaagtcaaag aggtagtagt cagaaacaga gacaaaaaaa aggagtcaga aagagggaca
180gacacagaaa gtcaaaaaaa aagttaagaa gaaaggaaaa gacaaagaag aagtcgaaga
240ggagaaagag agagatagaa gtagtaaaga aaaaaacagc atatcccatt cctttaaagc
300cagggtaaat ttctatctac ccagccaagg catattctac ttatgtggat cttcaaccca
360tatctgcctc tcagacagtt tgcaagaaat aatgaaatct atccttactt tacaatccca
420aatagactct ttggcagcag tgactctcca aaactgcaga ggcctagacc tcctcactgc
480tgaaaaagga ggacactaca ccttcttagg ggaagaatgt tgtttttaca ctaaccagtc
540ggggatagta tgagatgctg cccggagttt acaggaaaag gcttctgaaa tcagacaacg
600cctttcaaat tcttatacca acttctggag ttaggcaaca tggcttctcc cctttctagg
660tcctgtggca gccatcttgc tgttactcgc ctttgggccc tgtattttta accttcttgt
720caaatttgtt tcctctagaa tcgaggccat caagctacag atggtcttac aaatggaacc
780ccaaaagagt tcaactaaca acttctaccg aggacccctg gatcaaccca ctggcacttc
840ccctggccta gagagttccc ctctgaagga caccgcaact gcagggccct tctttgcccc
900atccagcagg agtagctaga gtggtcatcg gccaaattgc ca
942
10 1375 DNA Homo sapiens 10
ccccaatatt ctctttctga tggggaaaaa tggccacctg
agggaagcac aaattacaat 60actatcctgc agcttgatct tttctgtaag agggaaggca
aatggagtga aataccttat 120gtccaagctt tcttttcatt gagggagaat acacaactat
gcaaagcttg caatttacat 180cccacaggag gacccctcag cttaccccca tatcctagcc
tccctatagc ttcccttcct 240attgatgata ctcctcctct aatctcccct gcccagaagg
aaataagcaa agaaatctcc 300aaaggtccac aaaaaccccc gggctatcgg ttatgtcccc
ttcaagctgt agggggaggg 360gaatttggcc caacccgggt gcatgtcccc ttctccctct
ctgatttaaa gcagatcagg 420cagacctggg gaagttttca gatgatcctg ataggtacat
agatgtccta cagggtctag 480ggcaaacctt tgacctcact tggagagacg tcatgctact
gttagatcaa accctggcct 540ttaatgaaaa gaatgcggct ttagctgcag cctgagagtt
tggagatacc tggtatccta 600gtcaagtaaa tgaaagaatg acagccgaag aaagggacaa
cttccctact ggtcagcaag 660ccatccccag tatggatccc cactgggact ttgactcaga
tcatggggac tggagtcgta 720aacatctgtt gatctgtgtt ctggaaggac taaggagaat
tgggaaaaag cccatgaatt 780attcaatgat atccaccata acccagggaa aggaagaaaa
tccttctgcc ttcctcgagc 840ggctacaaga ggccttaaga aaatatactc ccctgtcacc
cgaatcactc gagggtcaat 900tgattctaaa agataagttt attacccaat cagccacaga
tatcaggaga aagctccaaa 960agcaagccct gagccctgaa caaaatctag agacattatt
aaacctggca accttggtgt 1020tctataatag ggaccaagag gaacaggccc aaaaggaaaa
gcgagatcag agaaaggccg 1080cagccttagt catggccctc agacaaacaa accttggtgg
ttcagagagg tcagaaaatg 1140gagcaggcca atcacctggt acggcttgtt atcagtgcgg
tttactagga cactttaaaa 1200aagattgtcc aataagaaac aagctgcccc ctcatccgtg
tccactatgc cgaggcaatc 1260actggaaggt gcactgcccc agaggatgaa ggttccctgg
gttagaagcc cccaaccaga 1320tgatccaaca acaggactga gggtgcccgg ggcaagcacc
agctcatgtc atcac
1375
11 944 DNA Homo sapiens 11
acctaggagg aactgtcttc
aggacaggac tatagatgct tcctcccagg cgattaaggg 60aaaaagacac aatgggtatt
cagtaagtga taaggaaact cttgtagaag cagagttagg 120aaaattgcct aataattggt
ctgctcaaat gtgcgagctg tttgcactca gccaaacctt 180aaaagtatta cagaatcagg
aagaagccat ctataccaat tctaagttaa tatggactga 240acgagaactt attaatagca
aagaataatt gaaatcccaa acttacaagg ttttcaacaa 300aagcacagtt tgctaaaagt
taactgtgta acatgtatta tcctactacc acaaactctc 360aaatgatttc tcagacagtt
tgcaagaaac aatgaaacct atccttactc tacaatccca 420aatagactct ttggcagcag
tgactctcca aaaccaccaa ggcctagacc tcctcactgc 480tgagaaagga ggactctgca
ccttcttagg ggaagattgt tgtttttaca ctaaccagtc 540agggatagtg tgagatgcca
cccagcgttt acaggaaaag gcttctgaaa tcagacacaa 600tgcttttcaa accttatagc
aacctctgga gttcggcgac tggcttttcc cctttctagg 660tcctgtgaca gccatcttgc
tattactcgc cttcgggccc tgtattttta acctcctcgt 720caaatttgtt tcctctagga
tcgaggccat caagctacag atggtcttac aaatggaacc 780ccaaatgagc tcgactaaca
acttctactg aggacccctg gaccgaccca ctggcccttt 840aactggctta aagagtttcc
ctctggagga cactacaact gcagggcccc ttctttgccc 900catccacagg aagttagcta
gagcagtcat cacccaattc ccaa
944
12 963 DNA Homo sapiens 12
tacaggaacc ccataatacg tccttggcaa attctattca gctccaactg ctaggagtgg
60cccatttgtc ctgaaccctc aaatcatggg aatgagaaat gaatttagac tgaccacagc
120ccttatgagt tttcagctac aggggtgtat agaaccctga taaggagttt tctttgtgtg
180tggaagatcc ttctatattt gcctccccac caactggaca ggaacttgta ctttagccta
240catagtacct cctgtgactt atccttttca gaagaggcag tagctgtgcc cattcatgct
300aagcttcagc cgagagcaat ctcactactt cctctattgg ctggtttagg atttactacc
360acctaggaag tggactcaca gcctagatga aatctctctc caacttactc aaatccagga
420ccaaatagac tcattagcag ctgtggttct ccgaaccagt gagcactaga tctccaatct
480cctcactgcc gaaaggggag gaacatgcct ttttctgaac aaggaatgtt gtttttatgt
540caataaatca ggcatagtga gagatggaat taaatgactt caggatagag ctagcagact
600acatggtggg acaaccgaaa ctacctcagg gttctcacag cctgttctcc actggcttct
660tccattttta ggtcccttcc ttatgattat tctaggagta acctttggcc catgtctttt
720cagttccttc atcctttcgt ttcttcctga atagaatcaa tgaaactaga aatgttactg
780cagatggaac ctcagatgac ttcaaccagc acctattatc aaggacccct aaaccagcct
840gccggcccat acccggacgt tgacacccaa accacctctc acgaggaaac ctcagctaca
900gaaccccttc tatgccccta ttcagcagga agcaattaga gtggtcatcc tcccacaccc
960caa
963
13 1362 DNA Homo sapiens 13
ccacaatatc ctcttccagg aggagaacga tggccacctg
agggaagtat acactataat 60accatcctgc aactagatct gttttgtaaa caagaaggca
agtggattta ggtaccatat 120gttcagacct ttttctcatt aagggatgat aacccacgat
tgtgtaagac atgtaacctg 180caccccacag ggagtcctca aattctaccc ccatacccag
tcctccccac ggctcctcct 240actaatgcca aaccctctct ggcttctaca gcccaaaagg
gaacaaataa aagagccttc 300agagagccaa gagaccccac tggcccctgg ctatgtcctc
ttcaggctgt aggaggggaa 360tttggcccaa cccgagtaca tgttcccttt tctctctctg
atctaaagca aattaaggca 420gacttggatg aaagttctca gatgacccca atagatacgt
agatggcctg ctgggtctgg 480gacaatcttt tgacctttcc tggagagaga tcatgttatt
gcttgatcag acctaacctc 540taatgagaag aatgctgctt taacaggagc ccgagagttt
ggggatacct ggtacctcag 600ttaagtaagt gatagaatga catcagaaga gagcagtttc
ctactggcca gcaagcagtc 660cccagtatgg atccccactg ggaccctgac tcggatcatg
gggactggag tcacaaacat 720ttactgacct gtatcctaga agggttaagg agaactagga
aaaagcccat gaactattca 780atgatgtcta ctataaccca agggaaggaa gaaaacccta
ttgccttcct caaaaggctg 840agggaggctt tgagaaaata tactcccctg tcaccagatt
ccctcgaagg ccagttaatt 900ttaaaggaca aatttattac tcagtcagct gcagacatta
ggaaaaagct ccaaaagtta 960gccttgggcc gagcaaaatt tggaggcatc attaaacctg
gcaacctcag tgttctatca 1020tagggaccaa gaggaacagg ccgaaaagga aaagcaggat
aagagaaagg ctgcagattt 1080agtcatgccc tcagacaaac cttggcggtt caaagaggag
aaaaaatgga gcaggccaat 1140cacccagcag ggcttattat cagtgcagtt tacaaggaca
ctttaaacaa gattgtccaa 1200agagaaataa gccgccctct cacccatgtc cactatgcca
aggtgatcac tggaaggcac 1260actgtcccag aggacaaagg ttctctgggc cagaagtccc
caaccagatg atccagcaac 1320aggatggagg gtgcccgggg caagcaccag ctcgtgttgt
ca
1362
14 945 DNA Homo sapiens 14
ttgcagatca atctcagact
gctgtgctag caatgagtga ggcttcgtgg gcatgggacc 60ctctgagcca ggcatgggat
ataatgtcct tgtgtgccat ttgctaagac tgttggaata 120gcacagtatt agggtgggag
tggcccgatt ttccaggtgc tgtctgtcac cgcttccctt 180ggctaggaaa gagaattccc
tgaccccttg ttcttcccag gtaaggcagt gcctcaccct 240gcttcagctc acactcaggt
gactgcaccc actgtcctgc ccccactgtc ggacaagccc 300cagtgagatg aacctggtac
ctcagttgga aatgcagaaa tcacctgtct tctgcgtcac 360tcacactggg agctgtagac
tggagctgtt cctatttggc catcttggaa ccatctccca 420aatagactct ttggcagcag
tgactctcca aaaccaccaa ggcctagacc tcctcattgc 480tgagaaagga ggactctgca
ccttcttagg ggaggagtgt tgtttttata ctgaccagtc 540agggatggta cgagatgcca
cccgatgttt acaggaaaag gcttctgaaa tcacacaaca 600cctttcaaac tcttatacca
acctctggag ttgggcaaca tggcttctcc cctttctcgg 660tcccattgca gccatcttgc
tattactcgc cttcaggctg tgtattttta acctccttgt 720caaatttgtt tcctctagaa
ttgaggccgt caagctacag atggtcttac aaatgggacc 780ccaaatgagc tcaactaaca
acttctgcca aggacccctg gaccaacctg ctggcccttt 840cactggcctt aagagttccc
ctctggaggg cactacaact gcagggcccc ttctttgccc 900ctatccagca ggaagtagct
agagcagtca tcacccaatt cccaa
945
15 939 DNA Homo sapiens 15
agagctacct tggcaagtac tctaggagta tgggaaaatg aaaacaacaa actcacacac
60cattttaaca tacacaatca ggtctgccca cccagcaagg tatattcttt gtatgtggaa
120catcgaccta tatctgcctc cccactaact agacagccac ctgaatctta gtctttctaa
180gtcccaacag taacattgcc ccaggaaatc agaccatatc agtatccctc aaagctcaag
240tctgtcagtg cagagccata caactaatac ccctacttat agggtaagga atggctactg
300ctacaggaac cagaatagct agtttgttta cttcattatc ctactaccac acactctcaa
360atgatttctc agacagtttg caagaaataa cgaaatctat ccttactcta caatcccaaa
420tagactcctt ggcagcagtg accctccaaa acggctgagg cctagacctc ctcactgcca
480agaaaggagg actctgcatt ttcttagggg aagagtgttt ttacactaac cagtcaggga
540cagtatgaga tgccactcgg agtttacagg aaaaggcttc tgaagtcaga caatgccttt
600caaactctat accaaactct ggagttgggc aacatggctt ctcccctttc taggtcccgt
660gacagccatc ttgctattat ttgcctttga gccctgtatt tttaatctcc ttttcaaatt
720tgtttcctct ggatcgaggc catcgagcta cagatggtct tcacaaatgg aaccccaaat
780gagctcaact aacaacttct actgaggacc cctggactaa cctgctgacc ctttcactgg
840cctgaagaat tcccctctgg aggacactac aactgcaggg ctccttcttt gcccctatcc
900agcaggaagt agctagagct gtcattgcct aattcctaa
939
16 979 DNA Homo sapiens 16
agtgataatg gaatacttga aagtaatccc ctcactcccc
aggaactagt gctcagctgg 60cagaactaat agccctcact cgggtactag aatcaggaga
aggaaaaagg gtaaatatat 120atacagactc taagtgtgct tacctagtcc tccatgccca
tgcagcaata tggagagaaa 180gggaattcct aacttccgag ggaacaccta tcaaacatca
ggaagccatt aggaaattat 240tattggctgt acagaaacct aaagaggtgg cagttttaca
ctgccggggt catcagaaag 300gaaaggaaag ggaaatacaa gggagccacc aagttgatat
tgaagtcaaa agagccacaa 360ggctggaccc tccattagaa atgcttatag gaggacccct
agtatggggt aatcccctcc 420gggaagccaa gccccagtac tcagcaggag aaatagaata
gggaacttca tgaggacata 480cttccctccc ctccagatgg ctagccacca ataaaggaaa
aatacttttg cctgcagcta 540accaatagaa attacttaaa acccttcatc aaaccttcca
cttaggcatt gatagcaccc 600atgagatggc caaattatta tttactggac caggcctttt
caaaactatc aagcagatag 660tcagggcctg taaagtctgc caaagaaata atcccctgca
ctgcaggcca tacatttcaa 720tccctgtatc tttaacctcc ttcttaaatt tgtctcttcc
agaatcaaag ctgtaaaatt 780acaaatagtt cttcaaatgg agccacagat gcagtccatg
actaagatcc accacagacc 840cctggaccag cctgctagcc catgctccaa tgttaatgac
atcgaaggca ccccctcctg 900aggaaatctc aactgcacaa cccctactac gccccaattc
agcagaaagc agttagagtg 960gtcatcagcc aacctcccc
979
17 1774 DNA Homo sapiens 17
catgctggta aaggaccgct
agaatccagc agccaggacc actttctttg tggtcaagaa 60aggtgggaaa acaggtgcag
gactgctaca ctggtaagca taactaatcc gataagcaga 120ggtccatggg tggttacgca
ccctggaaag gaataagcat taggactata gaggacactc 180taggactaat gctcatcgga
aaatgactag gggtactggc atccctatgt tcttttttca 240gatgggaaat gttcccccca
aggcagaaat gcccctaaga tgtattctgg agaaatggga 300ccaatctgac catcagacac
taagaaagaa atgacttata ttcttctgca gtaccacctg 360gccacaatat cttcttcaag
gggcagaaac ctggcctcct gagggaagta taaattataa 420caccatctta cagctagacc
tcttttgtag aaaagaaggc aaatggagtg aagtgccata 480tgtacaaact ttcttttcat
taagagataa ctcccaatta tgtaaaaagt gtgatttatg 540ccctacagga agccctcaga
gtctacctcc cgaccccagc aagaccccaa ctccttctcc 600aactaataag gacccccctt
caacccaaat ggtccaaaag gagatagaca aaggggtaaa 660caatgaacca aagagtgcca
atattacacg attatactcg ctccaagcag tgggaggaga 720atttggccca gccagcgtgc
atgtaccttt ttctctctca gatttaaagc aaattaaaat 780agacctaggt aaattctcag
ataaccctga tggctatatt gatgttttac aagggttagg 840acaatccttt gatctgacat
ggagagatat aatgttactg ctaaatcaga cactaacccc 900aaatgaaaaa agtgctgcca
taacagcagc ctgagagttt ggcgaactct ggtatctcag 960tcaggtcaat gataggatga
caacagatga aagagaatga ttccccacag gccagcaggc 1020agttcccagt gtagaccctc
attaggacac agaatcagaa cttggagatt ggtgccacag 1080acatttgcta acttgcgtgc
tagaaggact aaggaaaact aggaagaagc ccatgaatta 1140ttcaatgatg tcccctataa
cacagggaaa ggaagaaaat cctactgcct ttctggagag 1200actaagggaa ggattgagga
agcatacctc cctgtcacct gactctatta aaggccaact 1260aatcttaaag gataagttta
tcactcagtc agctgcagag attaagaaaa aacttcaaaa 1320gtatgcctta ggcccagagc
aaaacttaga aaccctactg aacttggcaa cctcagtttt 1380ttataataga gatcaggaag
agcaggggaa tgggacaaat gggataaaaa aaaaaaaaaa 1440aggtgactgc tttagtcgtg
gccctcaggc aaatggactt tggaggctcc agaaaaggga 1500aaagctgagc aaattgaatg
cctaacaggg cttgcttcta gtgtggtcta caaggacact 1560ttaaaaaaga ttgtccaagt
agaaacaagc tgcccccttg tccatgcccc ttatgtcaag 1620ggaatcactg gaaggcccac
tgccccagga gatgaaggtc ctctgagtca gaagccacta 1680accagataat ccagcagcag
gactgaggat gcccagggca agcgccagcc catgccatca 1740ccctcacaga gccttgggta
tgcttgacca ttga
1774
18 938 DNA Homo sapiens 18
tgtaggaaga actcccttca ggacaggaca atagatggtt cctcccaggt gattaaggaa
60aaaagacaca gtattcagta agtgataagg aaactcttgt agaagcagag ttagaaaaat
120tgcctaataa ttggtctgct caaatgtgtg agttgtttgc actcagccaa atcttaaagt
180acttacagaa tcaggaagca gccatctata ccaattctaa gttaatatgg actaaacgag
240gttttattag tagcaaagaa aaattaaaat cccaaactta caaggttttc aactaaagtt
300tgccaaaagt taacagtgta acatgtatta tcctactatc acacactctc aaaggatttc
360tcagacagtt tgcaagaaat aacgtaatct atccttactc tacagtccca aatagactct
420ttggtagcag tgactctcca aaactgccga ggtctagacc tcctcaatgc tgagaaagga
480gaactctgca ccttcttagg ggaagagtgc tgtttttaca ctaaccagtc agggatagta
540tgagatactg cctgacgttt acaggaaaag gcttctgaaa tcagacaacg cctttcaagc
600tcttatacca acctctggag ttgggcaaca tggcttctcc ccttgctagg tcctgtggca
660gccatcttgc tattacttgc cttcgggccc tgtattttta acctccttgt caaatttgtt
720tcctctagga tcaaggccat caagctacag atggtcttac aaatggaacc ccaaatgagc
780tcaactaaca acttctactg aggacacctg gactgaccca ctggcccttt cactggccta
840aagagttccc ttctggagga cactacaact gcagggcccc gtcttcaccc ctatccagca
900ggaagtagct agatcagtca ttgcccaatt cccaacag
938
19 1308 DNA Homo sapiens 19
gatgcttgcc ccaggcaccc tcagtcctgt tgttggatca
tctggtcggg ggcttctggc 60ccaaagaacc tttgtcctct gaggcagtgc accttccagt
gattgcctca gcattgtgga 120catgggcaag ggggcagctt gtttctcact ggacaatctt
ttttaaggtg tccttccaaa 180ccacactggt aacaagccct accaggtgat tggcctgctc
tattttctgt cctctctgaa 240ccaccaaggt ttgtctgtct gagggtcatg actaaggctg
tggcctttct ctgatcttgc 300ttttcctttt tggcctgttc ctcttggtac ctattataga
acactgaggt tgccaggttt 360aacaatggct ccagattttg ttcagggcac agggctcatt
ttggagcttt ctcctgatat 420ctgcagctga ttgggtaata aacttatctt ttaggatcaa
ttgactctca agagagttgg 480gtgacagggg agtatatttc cttgaggcct cccatagccg
ctctaggaag gcagaaggat 540tttcttcctt tccctgagtt ataaaagaca tcattgaaca
actcatggac tttttcccaa 600ttctccgtag tccttctaga acacaggtca gcagatgttt
acgactccag tccccatgat 660ctgagtctag acaccagtgg ggatccatac tggggatggc
ctgctgactg gtagggaatt 720tgtccctttc tttggctgtc attctatcat ttacttgact
aagataccaa gtatctccaa 780attctcaggc tgcagctaaa gctgcattct tttcattaaa
ggccagggtt tgatctaata 840gcatgacatc tctccaagtg aggtcaaagg tttgccctag
atccatagga catcagagaa 900ggagaagggg acatacacct gagttagcca aattcccctc
cctctacagc ttgaagggga 960cataagcaat agcctgggga tttttgtggt cctttggaga
tttctttgct tgtttccttc 1020tgggtggggg agattagagg aggcttatca gtaataggaa
ggggagctat agggaggcta 1080ggatatgggg gtaagctgag aggtcatctt gtgggatgta
aattgcaagc tttgcatagt 1140tgtggatttt ccttacaatg aaaataaagc ttggacataa
ggtatttcac tccatttgcc 1200ttccctctta cagaaaaggt caagctgcag gatagtactg
taatttatac ttccttcagg 1260tggccatttc ttcccatcag agagagaata ctggggctgg
gccatagt
1308
20 711 DNA Homo sapiens 20
actgagagac aggactagct
ggatttccta ggccgactaa gaatccctaa gcctagctgg 60gaaggtgacc acgtccacct
ttaaacacgg ggcttgcaac ttagctcaca cctgaccaat 120cagagagctc actaaaatgc
taattaggca aagacaggag gtaaagaaat agccaatcat 180ctattgcctg agagcacagc
aggagggaca acaatcggga tataaaccca ggcattcgag 240ctggcaacag cagcccccct
ttgggtccct tccctttgta tgggagctgt tttcatgcta 300tttcactcta ttaaatcttg
caactgcact cttctggtcc atgtttctta cggctcgagc 360tgagcttttg ctcaccgtcc
accactgctg tttgccacca ccgcagacct gccgctgact 420cccatccctc tggatcctgc
agggtgtccg ctgtgctcct gatccagcga ggcgcccatt 480gccgctccca attgggctaa
aggcttgcca ttgttcctgc acggctaagt gcctgggttt 540gttctaattg agctgaacac
tagtcactgg gttccatggt tctcttctgt gacccacggc 600ttctaataga actataacac
ttaccacatg gcccaagatt ccattccttg gaatccgtga 660ggccaagaac tccaggtcag
agaatacgag gcttgccacc atcttggaag c
711
21 711 DNA Homo sapiens 21
actgagagac aggactagct ggatttccta ggctgactaa gaatccctaa gcctagctgg
60gaaggtgacc acatccacct ttaaacacgg ggcttgcaac ttagctcaca cctgaccaat
120cagagagctc actaaaatgc taattaggca aagacaggag gtaaagaaat agccaatcat
180ctattgcctg agagcacagc aggagggaca atgatcggga tataaaccca agtcttcgag
240ccggcaacgg caaccccctt tgggtcccct ccctttgtat gggagctctg ttttcatgct
300atttcactct attaaatctt gcaactgcac tcttctggtc catgtttctt acggcttgag
360ctgagctttc gctcgccatc caccactgct gtttgccgcc accgcagacc cgccgctgac
420tcccatccct ctggatcatg cagggtgtcc gctgtgctcc tgatccagcg aggcacccat
480tgccgctccc aatcgggcta aaggcttgcc attgttcctg catggctaag tgcctgggtt
540catcctaatt gagctgaaca ctagtcactg ggttccatgg ttctcttctg tgacccacag
600cttctaatag agctataaca ctcaccgcat ggcccaaggt tccattcctt gaatccataa
660ggccaagaac cccaggtcag agaacacgag gcttgccacc atcttgggag c
711
22 2055 DNA Homo sapiens CDS (1)..(2055) 22
ccc aag aca gcc aac tta gtt gca
gac atc acc tcc tta gcc aaa tat 48Pro Lys Thr Ala Asn Leu Val Ala
Asp Ile Thr Ser Leu Ala Lys Tyr1 5 10
15caa caa gtt ctt aaa aca tta caa gga acc tat ccc tga gaa
gag gga 96Gln Gln Val Leu Lys Thr Leu Gln Gly Thr Tyr Pro Glu
Glu Gly 20 25 30aaa gaa
cta ttc cac cct tgt gac atg gta tta gtc aag tcc ctt ccc 144Lys Glu
Leu Phe His Pro Cys Asp Met Val Leu Val Lys Ser Leu Pro 35
40 45tct aat tcc cca tcc cta gat aca tcc
tgg gaa gga ccc tac cca gtc 192Ser Asn Ser Pro Ser Leu Asp Thr Ser
Trp Glu Gly Pro Tyr Pro Val 50 55
60att tta tct acc cca act gcg gtt aaa gtg gct gga gtg gag tct tgg
240Ile Leu Ser Thr Pro Thr Ala Val Lys Val Ala Gly Val Glu Ser Trp 65
70 75ata cat cac act tga gtc aaa tcc tgg
ata ctg cca aag gaa cct gaa 288Ile His His Thr Val Lys Ser Trp
Ile Leu Pro Lys Glu Pro Glu80 85
90aat cca gga gac aac gct agc tat tcc tgt gaa cct cta gag gat ttg
336Asn Pro Gly Asp Asn Ala Ser Tyr Ser Cys Glu Pro Leu Glu Asp Leu95
100 105 110cgc ctg ctc ttc
aaa caa caa cca gga gga aag taa cta aaa tca taa 384Arg Leu Leu Phe
Lys Gln Gln Pro Gly Gly Lys Leu Lys Ser 115
120atc ccc atg gcc ctc cct tat cat att ttt ctc ttt act gtt ctt tta
432Ile Pro Met Ala Leu Pro Tyr His Ile Phe Leu Phe Thr Val Leu Leu125
130 135 140ccc tct ttc act
ctc act gca ccc cct cca tgc cgc tgt atg acc agt 480Pro Ser Phe Thr
Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser 145
150 155agc tcc cct tac caa gag ttt cta tgg aga
atg cag cgt ccc gga aat 528Ser Ser Pro Tyr Gln Glu Phe Leu Trp Arg
Met Gln Arg Pro Gly Asn 160 165
170att gat gcc cca tcg tat agg agt ctt tct aag gga acc ccc acc ttc
576Ile Asp Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr Phe
175 180 185act gcc cac acc cat atg ccc
cgc aac tgc tat cac tct gcc act ctt 624Thr Ala His Thr His Met Pro
Arg Asn Cys Tyr His Ser Ala Thr Leu 190 195
200tgc atg cat gca aat act cat tat tgg aca gga aaa atg att aat cct
672Cys Met His Ala Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro205
210 215 220agt tgt cct gga
gga ctt gga gtc act gtc tgt tgg act tac ttc acc 720Ser Cys Pro Gly
Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr 225
230 235caa act ggt atg tct gat ggg ggt gga gtt
caa gat cag gca aga gaa 768Gln Thr Gly Met Ser Asp Gly Gly Gly Val
Gln Asp Gln Ala Arg Glu 240 245
250aaa cat gta aaa gaa gta atc tcc caa ctc acc cgg gta cat ggc acc
816Lys His Val Lys Glu Val Ile Ser Gln Leu Thr Arg Val His Gly Thr
255 260 265tct agc ccc tac aaa gga cta
gat ctc tca aaa cta cat gaa acc ctc 864Ser Ser Pro Tyr Lys Gly Leu
Asp Leu Ser Lys Leu His Glu Thr Leu 270 275
280cgt acc cat act cgc ctg gta agc cta ttt aat acc acc ctc act ggg
912Arg Thr His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Gly285
290 295 300ctc cat gag gtc
tcg gcc caa aac cct act aac tgt tgg ata tgc ctc 960Leu His Glu Val
Ser Ala Gln Asn Pro Thr Asn Cys Trp Ile Cys Leu 305
310 315ccc ctg aac ttc agg cca tat gtt tca atc
cct gta cct gaa caa tgg 1008Pro Leu Asn Phe Arg Pro Tyr Val Ser Ile
Pro Val Pro Glu Gln Trp 320 325
330aac aac ttc agc aca gaa ata aac acc act tcc gtt tta gta gga cct
1056Asn Asn Phe Ser Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro
335 340 345ctt gtt tcc aat ctg gaa ata
acc cat acc tca aac ctc acc tgt gta 1104Leu Val Ser Asn Leu Glu Ile
Thr His Thr Ser Asn Leu Thr Cys Val 350 355
360 aaa ttt agc aat act aca tac aca acc aac tcc caa tgc atc agg tgg
1152Lys Phe Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gln Cys Ile Arg Trp365
370 375 380gta act cct
ccc aca caa ata gtc tgc cta ccc tca gga ata ttt ttt 1200Val Thr Pro
Pro Thr Gln Ile Val Cys Leu Pro Ser Gly Ile Phe Phe 385
390 395gtc tgt ggt acc tca gcc tat cgt tgt
ttg aat ggc tct tca gaa tct 1248Val Cys Gly Thr Ser Ala Tyr Arg Cys
Leu Asn Gly Ser Ser Glu Ser 400 405
410atg tgc ttc ctc tca ttc tta gtg ccc cct atg acc atc tac act gaa
1296Met Cys Phe Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu
415 420 425caa gat tta tac agt tat gtc
ata tct aag ccc cgc aac aaa aga gta 1344Gln Asp Leu Tyr Ser Tyr Val
Ile Ser Lys Pro Arg Asn Lys Arg Val 430 435
440ccc att ctt cct ttt gtt ata gga gca gga gtg cta ggt gca cta ggt
1392Pro Ile Leu Pro Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly445
450 455 460act ggc att ggc
ggt atc aca acc tct act cag ttc tac tac aaa cta 1440Thr Gly Ile Gly
Gly Ile Thr Thr Ser Thr Gln Phe Tyr Tyr Lys Leu 465
470 475tct caa gaa cta aat ggg gac atg gaa cgg
gtc gcc gac tcc ctg gtc 1488Ser Gln Glu Leu Asn Gly Asp Met Glu Arg
Val Ala Asp Ser Leu Val 480 485
490acc ttg caa gat caa ctt aac tcc cta gca gca gta gtc ctt caa aat
1536Thr Leu Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn
495 500 505cga aga gct tta gac ttg cta
acc gct gaa aga ggg gga acc tgt tta 1584Arg Arg Ala Leu Asp Leu Leu
Thr Ala Glu Arg Gly Gly Thr Cys Leu 510 515
520ttt tta ggg gaa gaa tgc tgt tat tat gtt aat caa tcc gga atc gtc
1632Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val525
530 535 540act gag aaa gtt
aaa gaa att cga gat cga ata caa cgt aga gca gag 1680Thr Glu Lys Val
Lys Glu Ile Arg Asp Arg Ile Gln Arg Arg Ala Glu 545
550 555gag ctt cga aac act gga ccc tgg ggc ctc
ctc agc caa tgg atg ccc 1728Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu
Leu Ser Gln Trp Met Pro 560 565
570tgg att ctc ccc ttc tta gga cct cta gca gct ata ata ttg cta ctc
1776Trp Ile Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile Ile Leu Leu Leu
575 580 585ctc ttt gga ccc tgt atc ttt
aac ctc ctt gtt aac ttt gtc tct tcc 1824Leu Phe Gly Pro Cys Ile Phe
Asn Leu Leu Val Asn Phe Val Ser Ser 590 595
600aga atc gaa gct gta aaa cta caa atg gag ccc aag atg cag tcc aag
1872Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys Met Gln Ser Lys605
610 615 620act aag atc tac
cgc aga ccc ctg gac cgg cct gct agc cca cga tct 1920Thr Lys Ile Tyr
Arg Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser 625
630 635gat gtt aat gac atc aaa ggc acc cct cct
gag gaa atc tca gct gca 1968Asp Val Asn Asp Ile Lys Gly Thr Pro Pro
Glu Glu Ile Ser Ala Ala 640 645
650caa cct cta cta cgc ccc aat tca gca gga agc agt tag agc ggt cgt
2016Gln Pro Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser Ser Gly Arg
655 660 665cgg cca acc tcc cca aca
gca ctt agg ttt tcc tgt tga 2055Arg Pro Thr Ser Pro Thr
Ala Leu Arg Phe Ser Cys 670 675
23 28 PRT Homo sapiens 23
Pro Lys Thr Ala Asn Leu Val Ala Asp Ile Thr Ser Leu Ala Lys Tyr1
5 10 15Gln Gln Val Leu Lys Thr
Leu Gln Gly Thr Tyr Pro 20 25
24 55 PRT Homo sapiens 24
Glu Glu Gly Lys Glu Leu Phe His Pro Cys Asp Met Val Leu Val
Lys1 5 10 15Ser Leu Pro
Ser Asn Ser Pro Ser Leu Asp Thr Ser Trp Glu Gly Pro 20
25 30Tyr Pro Val Ile Leu Ser Thr Pro Thr Ala
Val Lys Val Ala Gly Val 35 40
45Glu Ser Trp Ile His His Thr 50 55
25 38 PRT Homo sapiens 25
Val Lys Ser Trp Ile Leu Pro Lys Glu Pro Glu Asn Pro Gly Asp
Asn1 5 10 15Ala Ser Tyr
Ser Cys Glu Pro Leu Glu Asp Leu Arg Leu Leu Phe Lys 20
25 30Gln Gln Pro Gly Gly Lys
35
26 540 PRT Homo sapiens 26
Ile Pro Met Ala Leu Pro Tyr His Ile Phe Leu Phe
Thr Val Leu Leu1 5 10
15Pro Ser Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser
20 25 30Ser Ser Pro Tyr Gln Glu Phe
Leu Trp Arg Met Gln Arg Pro Gly Asn 35 40
45Ile Asp Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr
Phe 50 55 60Thr Ala His Thr His Met
Pro Arg Asn Cys Tyr His Ser Ala Thr Leu65 70
75 80Cys Met His Ala Asn Thr His Tyr Trp Thr Gly
Lys Met Ile Asn Pro 85 90
95Ser Cys Pro Gly Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr
100 105 110Gln Thr Gly Met Ser Asp
Gly Gly Gly Val Gln Asp Gln Ala Arg Glu 115 120
125Lys His Val Lys Glu Val Ile Ser Gln Leu Thr Arg Val His
Gly Thr 130 135 140Ser Ser Pro Tyr Lys
Gly Leu Asp Leu Ser Lys Leu His Glu Thr Leu145 150
155 160Arg Thr His Thr Arg Leu Val Ser Leu Phe
Asn Thr Thr Leu Thr Gly 165 170
175Leu His Glu Val Ser Ala Gln Asn Pro Thr Asn Cys Trp Ile Cys Leu
180 185 190Pro Leu Asn Phe Arg
Pro Tyr Val Ser Ile Pro Val Pro Glu Gln Trp 195
200 205Asn Asn Phe Ser Thr Glu Ile Asn Thr Thr Ser Val
Leu Val Gly Pro 210 215 220Leu Val Ser
Asn Leu Glu Ile Thr His Thr Ser Asn Leu Thr Cys Val225
230 235 240Lys Phe Ser Asn Thr Thr Tyr
Thr Thr Asn Ser Gln Cys Ile Arg Trp 245
250 255Val Thr Pro Pro Thr Gln Ile Val Cys Leu Pro Ser
Gly Ile Phe Phe 260 265 270Val
Cys Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu Ser 275
280 285Met Cys Phe Leu Ser Phe Leu Val Pro
Pro Met Thr Ile Tyr Thr Glu 290 295
300Gln Asp Leu Tyr Ser Tyr Val Ile Ser Lys Pro Arg Asn Lys Arg Val305
310 315 320Pro Ile Leu Pro
Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly 325
330 335Thr Gly Ile Gly Gly Ile Thr Thr Ser Thr
Gln Phe Tyr Tyr Lys Leu 340 345
350Ser Gln Glu Leu Asn Gly Asp Met Glu Arg Val Ala Asp Ser Leu Val
355 360 365Thr Leu Gln Asp Gln Leu Asn
Ser Leu Ala Ala Val Val Leu Gln Asn 370 375
380Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly Gly Thr Cys
Leu385 390 395 400Phe Leu
Gly Glu Glu Cys Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val
405 410 415Thr Glu Lys Val Lys Glu Ile
Arg Asp Arg Ile Gln Arg Arg Ala Glu 420 425
430Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu Leu Ser Gln Trp
Met Pro 435 440 445Trp Ile Leu Pro
Phe Leu Gly Pro Leu Ala Ala Ile Ile Leu Leu Leu 450
455 460Leu Phe Gly Pro Cys Ile Phe Asn Leu Leu Val Asn
Phe Val Ser Ser465 470 475
480Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys Met Gln Ser Lys
485 490 495Thr Lys Ile Tyr Arg
Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser 500
505 510Asp Val Asn Asp Ile Lys Gly Thr Pro Pro Glu Glu
Ile Ser Ala Ala 515 520 525Gln Pro
Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser 530 535
540
SEQ ID NO: 27, length 15, PRT, Homo sapiens
Ser Gly Arg Arg Pro Thr Ser Pro Thr Ala Leu Arg Phe Ser Cys
1               5                   10                  15
SEQ ID NO: 28, length 1080, DNA, Homo sapiens; feature: CDS (1)..(1080)
acc tct ttt gta gaa aag gca aat
gga gtg aag tgc cat aag tac aaa 48Thr Ser Phe Val Glu Lys Ala Asn
Gly Val Lys Cys His Lys Tyr Lys1 5 10
15ctt tct ttt cat taa gag aca act cac aat tat gta aaa agt
gtg att 96Leu Ser Phe His Glu Thr Thr His Asn Tyr Val Lys Ser
Val Ile 20 25 30tat gcc
cta cag gaa gcc ttc aga gtc tac ctc cct atc cca gca tcc 144Tyr Ala
Leu Gln Glu Ala Phe Arg Val Tyr Leu Pro Ile Pro Ala Ser 35
40 45ccg act cct tcc cca act aat aag gac
ccc cct tca acc caa atg gtc 192Pro Thr Pro Ser Pro Thr Asn Lys Asp
Pro Pro Ser Thr Gln Met Val 50 55
60caa aag gag ata gac aaa agg gta aac agt gaa cca aag agt gcc aat
240Gln Lys Glu Ile Asp Lys Arg Val Asn Ser Glu Pro Lys Ser Ala Asn 65
70 75att ccc caa tta tga ccc ctc caa gca
gtg gga gga aga gaa ttc ggc 288Ile Pro Gln Leu Pro Leu Gln Ala
Val Gly Gly Arg Glu Phe Gly80 85
90cca gcc aga gtg cat gtg cct ttt tct ctc cca gac tta aag caa ata
336Pro Ala Arg Val His Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile95
100 105 110aaa aca gac tta
ggt aaa ttc tca gat aac cct gat ggc tat att gat 384Lys Thr Asp Leu
Gly Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp 115
120 125gtt tta caa ggg tta gga caa ttc ttt gat
ctg aca tgg aga gat ata 432Val Leu Gln Gly Leu Gly Gln Phe Phe Asp
Leu Thr Trp Arg Asp Ile 130 135
140atg tca ctg cta aat cag aca cta acc cca aat gag aga agt gcc acc
480Met Ser Leu Leu Asn Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala Thr
145 150 155ata act gca gcc tga gag ttt
ggc gat ctc tgg tat ctc agt cag gtc 528Ile Thr Ala Ala Glu Phe
Gly Asp Leu Trp Tyr Leu Ser Gln Val 160 165
170aat gat agg atg aca aca gag gaa aga gaa tga ttc ccc aca ggc
cag 576Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Phe Pro Thr Gly
Gln 175 180 185cag gca gtt ccc agt
cta gac cct cat tgg gac aca gaa tca gaa cat 624Gln Ala Val Pro Ser
Leu Asp Pro His Trp Asp Thr Glu Ser Glu His 190 195
200gga gat tgg tgc tgc aga cat ttg cta act tgt gtg cta gaa
gga cta 672Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu
Gly Leu205 210 215 220agg
aaa act agg aag aag tct atg aat tac tca atg atg tcc acc ata 720Arg
Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser Met Met Ser Thr Ile
225 230 235aca cag gga agg gaa gaa aat
cct act gcc ttt ctg gag aga cta agg 768Thr Gln Gly Arg Glu Glu Asn
Pro Thr Ala Phe Leu Glu Arg Leu Arg 240 245
250gag gca ttg agg aag cgt gcc tct ctg tca cct gac tct tct
gaa ggc 816Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro Asp Ser Ser
Glu Gly 255 260 265caa cta atc tta
aag cgt aag ttt atc act cag tca gct gca gac att 864Gln Leu Ile Leu
Lys Arg Lys Phe Ile Thr Gln Ser Ala Ala Asp Ile 270
275 280aga aaa aaa ctt caa aag tct gcc gta ggc ccg gag
caa aac tta gaa 912Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu
Gln Asn Leu Glu285 290 295
300acc cta ttg aac ttg gca acc tcg gtt ttt tat aat aga gat cag gag
960Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr Asn Arg Asp Gln Glu
305 310 315gag cag gcg gaa cag
gac aaa cgg gat taa aaa aaa ggc cac cgc ttt 1008Glu Gln Ala Glu Gln
Asp Lys Arg Asp Lys Lys Gly His Arg Phe 320
325 330agt cat gac cct cag gca agt gga ctt tgg agg
ctc tgg aaa agg gaa 1056Ser His Asp Pro Gln Ala Ser Gly Leu Trp Arg
Leu Trp Lys Arg Glu 335 340
345aag ctg ggc aaa ttg aat gcc taa
1080
Lys Leu Gly Lys Leu Asn Ala 350
SEQ ID NO: 29, length 20, PRT, Homo sapiens
Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys
1               5                   10                  15
Leu Ser Phe His
            20
SEQ ID NO: 30, length 63, PRT, Homo sapiens
Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile Tyr Ala Leu Gln Glu
1               5                   10                  15
Ala Phe Arg Val Tyr Leu Pro Ile Pro Ala Ser Pro Thr Pro Ser Pro
            20                  25                  30
Thr Asn Lys Asp Pro Pro Ser Thr Gln Met Val Gln Lys Glu Ile Asp
        35                  40                  45
Lys Arg Val Asn Ser Glu Pro Lys Ser Ala Asn Ile Pro Gln Leu
    50                  55                  60
SEQ ID NO: 31, length 79, PRT, Homo sapiens
Pro Leu Gln Ala Val Gly Gly Arg Glu Phe Gly Pro Ala Arg Val His
1               5                   10                  15
Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile Lys Thr Asp Leu Gly
            20                  25                  30
Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu
        35                  40                  45
Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp Ile Met Ser Leu Leu Asn
    50                  55                  60
Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala Thr Ile Thr Ala Ala
65                  70                  75
SEQ ID NO: 32, length 21, PRT, Homo sapiens
Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln Val Asn Asp Arg Met Thr
1               5                   10                  15
Thr Glu Glu Arg Glu
            20
SEQ ID NO: 33, length 142, PRT, Homo sapiens
Phe Pro Thr Gly Gln Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp
1               5                   10                  15
Thr Glu Ser Glu His Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys
            20                  25                  30
Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser
        35                  40                  45
Met Met Ser Thr Ile Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe
    50                  55                  60
Leu Glu Arg Leu Arg Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro
65                  70                  75                  80
Asp Ser Ser Glu Gly Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln
                85                  90                  95
Ser Ala Ala Asp Ile Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro
            100                 105                 110
Glu Gln Asn Leu Glu Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr
        115                 120                 125
Asn Arg Asp Gln Glu Glu Gln Ala Glu Gln Asp Lys Arg Asp
    130                 135                 140
SEQ ID NO: 34, length 29, PRT, Homo sapiens
Lys Lys Gly His Arg Phe Ser His Asp Pro Gln Ala Ser Gly Leu Trp
1               5                   10                  15
Arg Leu Trp Lys Arg Glu Lys Leu Gly Lys Leu Asn Ala
            20                  25
SEQ ID NO: 35, length 685, PRT, Homo sapiens; misc_feature (29)..(29): Xaa is any amino acid
Pro Lys Thr Ala Asn
Leu Val Ala Asp Ile Thr Ser Leu Ala Lys Tyr1 5
10 15Gln Gln Val Leu Lys Thr Leu Gln Gly Thr Tyr
Pro Xaa Glu Glu Gly 20 25
30Lys Glu Leu Phe His Pro Cys Asp Met Val Leu Val Lys Ser Leu Pro
35 40 45Ser Asn Ser Pro Ser Leu Asp Thr
Ser Trp Glu Gly Pro Tyr Pro Val 50 55
60Ile Leu Ser Thr Pro Thr Ala Val Lys Val Ala Gly Val Glu Ser Trp65
70 75 80Ile His His Thr Xaa
Val Lys Ser Trp Ile Leu Pro Lys Glu Pro Glu 85
90 95Asn Pro Gly Asp Asn Ala Ser Tyr Ser Cys Glu
Pro Leu Glu Asp Leu 100 105
110Arg Leu Leu Phe Lys Gln Gln Pro Gly Gly Lys Xaa Leu Lys Ser Xaa
115 120 125Ile Pro Met Ala Leu Pro Tyr
His Ile Phe Leu Phe Thr Val Leu Leu 130 135
140Pro Ser Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr
Ser145 150 155 160Ser Ser
Pro Tyr Gln Glu Phe Leu Trp Arg Met Gln Arg Pro Gly Asn
165 170 175Ile Asp Ala Pro Ser Tyr Arg
Ser Leu Ser Lys Gly Thr Pro Thr Phe 180 185
190Thr Ala His Thr His Met Pro Arg Asn Cys Tyr His Ser Ala
Thr Leu 195 200 205Cys Met His Ala
Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro 210
215 220Ser Cys Pro Gly Gly Leu Gly Val Thr Val Cys Trp
Thr Tyr Phe Thr225 230 235
240Gln Thr Gly Met Ser Asp Gly Gly Gly Val Gln Asp Gln Ala Arg Glu
245 250 255Lys His Val Lys Glu
Val Ile Ser Gln Leu Thr Arg Val His Gly Thr 260
265 270Ser Ser Pro Tyr Lys Gly Leu Asp Leu Ser Lys Leu
His Glu Thr Leu 275 280 285Arg Thr
His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Gly 290
295 300Leu His Glu Val Ser Ala Gln Asn Pro Thr Asn
Cys Trp Ile Cys Leu305 310 315
320Pro Leu Asn Phe Arg Pro Tyr Val Ser Ile Pro Val Pro Glu Gln Trp
325 330 335Asn Asn Phe Ser
Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro 340
345 350Leu Val Ser Asn Leu Glu Ile Thr His Thr Ser
Asn Leu Thr Cys Val 355 360 365Lys
Phe Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gln Cys Ile Arg Trp 370
375 380Val Thr Pro Pro Thr Gln Ile Val Cys Leu
Pro Ser Gly Ile Phe Phe385 390 395
400Val Cys Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu
Ser 405 410 415Met Cys Phe
Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu 420
425 430Gln Asp Leu Tyr Ser Tyr Val Ile Ser Lys
Pro Arg Asn Lys Arg Val 435 440
445Pro Ile Leu Pro Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly 450
455 460Thr Gly Ile Gly Gly Ile Thr Thr
Ser Thr Gln Phe Tyr Tyr Lys Leu465 470
475 480Ser Gln Glu Leu Asn Gly Asp Met Glu Arg Val Ala
Asp Ser Leu Val 485 490
495Thr Leu Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn
500 505 510Arg Arg Ala Leu Asp Leu
Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu 515 520
525Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val Asn Gln Ser Gly
Ile Val 530 535 540Thr Glu Lys Val Lys
Glu Ile Arg Asp Arg Ile Gln Arg Arg Ala Glu545 550
555 560Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu
Leu Ser Gln Trp Met Pro 565 570
575Trp Ile Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile Ile Leu Leu Leu
580 585 590Leu Phe Gly Pro Cys
Ile Phe Asn Leu Leu Val Asn Phe Val Ser Ser 595
600 605Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys
Met Gln Ser Lys 610 615 620Thr Lys Ile
Tyr Arg Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser625
630 635 640Asp Val Asn Asp Ile Lys Gly
Thr Pro Pro Glu Glu Ile Ser Ala Ala 645
650 655Gln Pro Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser
Xaa Ser Gly Arg 660 665 670Arg
Pro Thr Ser Pro Thr Ala Leu Arg Phe Ser Cys Xaa 675
680 685
SEQ ID NO: 36, length 360, PRT, Homo sapiens; misc_feature (21)..(21): Xaa is any amino acid
Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys
Tyr Lys1 5 10 15Leu Ser
Phe His Xaa Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile 20
25 30Tyr Ala Leu Gln Glu Ala Phe Arg Val
Tyr Leu Pro Ile Pro Ala Ser 35 40
45Pro Thr Pro Ser Pro Thr Asn Lys Asp Pro Pro Ser Thr Gln Met Val 50
55 60Gln Lys Glu Ile Asp Lys Arg Val Asn
Ser Glu Pro Lys Ser Ala Asn65 70 75
80Ile Pro Gln Leu Xaa Pro Leu Gln Ala Val Gly Gly Arg Glu
Phe Gly 85 90 95Pro Ala
Arg Val His Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile 100
105 110Lys Thr Asp Leu Gly Lys Phe Ser Asp
Asn Pro Asp Gly Tyr Ile Asp 115 120
125Val Leu Gln Gly Leu Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp Ile
130 135 140Met Ser Leu Leu Asn Gln Thr
Leu Thr Pro Asn Glu Arg Ser Ala Thr145 150
155 160Ile Thr Ala Ala Xaa Glu Phe Gly Asp Leu Trp Tyr
Leu Ser Gln Val 165 170
175Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Xaa Phe Pro Thr Gly Gln
180 185 190Gln Ala Val Pro Ser Leu
Asp Pro His Trp Asp Thr Glu Ser Glu His 195 200
205Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu
Gly Leu 210 215 220Arg Lys Thr Arg Lys
Lys Ser Met Asn Tyr Ser Met Met Ser Thr Ile225 230
235 240Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala
Phe Leu Glu Arg Leu Arg 245 250
255Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro Asp Ser Ser Glu Gly
260 265 270Gln Leu Ile Leu Lys
Arg Lys Phe Ile Thr Gln Ser Ala Ala Asp Ile 275
280 285Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu
Gln Asn Leu Glu 290 295 300Thr Leu Leu
Asn Leu Ala Thr Ser Val Phe Tyr Asn Arg Asp Gln Glu305
310 315 320Glu Gln Ala Glu Gln Asp Lys
Arg Asp Xaa Lys Lys Gly His Arg Phe 325
330 335Ser His Asp Pro Gln Ala Ser Gly Leu Trp Arg Leu
Trp Lys Arg Glu 340 345 350Lys
Leu Gly Lys Leu Asn Ala Xaa 355 360
SEQ ID NO: 37, length 26, DNA, Homo sapiens
ggaccataga ggacactcca ggacta 26
SEQ ID NO: 38, length 25, DNA, Homo sapiens
cctcagtcct gctgctggat catct 25
SEQ ID NO: 39, length 27, DNA, Homo sapiens
cctccaagca gtgggaggaa gagaatt 27
SEQ ID NO: 40, length 28, DNA, Homo sapiens
ccttccctgt gttattgtgg acatcatt 28
SEQ ID NO: 41, length 30, DNA, Homo sapiens
ggaagaagtc tatgaattat tcaatgatgt 30
SEQ ID NO: 42, length 27, DNA, Homo sapiens
gggacacaga atcagaacat ggagatt 27
SEQ ID NO: 43, length 27, DNA, Homo sapiens
gccttcagaa gagtcaggtg acagaga 27
SEQ ID NO: 44, length 25, DNA, Homo sapiens
gagcctccaa agtccacttg cctga 25
SEQ ID NO: 45, length 29, DNA, Homo sapiens
gatttcagta tctactagtc tgggtagat 29
SEQ ID NO: 46, length 27, DNA, Homo sapiens
ctaggaaatc cagctagtcc tgtctca 27
SEQ ID NO: 47, length 28, DNA, Homo sapiens
ccaagacagc caacttagtt gcagacat 28
SEQ ID NO: 48, length 28, DNA, Homo sapiens
ggacgctgca ttctccatag aaactctt 28
SEQ ID NO: 49, length 29, DNA, Homo sapiens
gcaatactac atacacaacc aactcccaa 29
SEQ ID NO: 50, length 26, DNA, Homo sapiens
gggggaggca tatccaacag ttagta 26
SEQ ID NO: 51, length 30, DNA, Homo sapiens
ccatctacac tgaacaagat ttatacactt 30
SEQ ID NO: 52, length 28, DNA, Homo sapiens
aatgccagta cctagtgcac ctagcact 28
SEQ ID NO: 53, length 31, DNA, Homo sapiens
cgaatacaac gtagagcaga ggagcttcga a 31
SEQ ID NO: 54, length 28, DNA, Homo sapiens
agcccaagat gcagtccaag actaagat 28
SEQ ID NO: 55, length 27, DNA, Homo sapiens
gcgtagtaga ggttgtgcag ctgagat 27
SEQ ID NO: 56, length 27, DNA, Homo sapiens
cccttaccaa gagtttctat ggagaat 27
SEQ ID NO: 57, length 27, DNA, Homo sapiens
accgctctaa ctgcttcctg ctgaatt 27
SEQ ID NO: 58, length 420, PRT, Homo sapiens; misc_feature (21)..(21): Xaa is any amino acid
Thr Ser Phe Val Glu
Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys1 5
10 15Leu Ser Phe His Xaa Glu Thr Thr His Asn Tyr
Val Lys Ser Val Ile 20 25
30Tyr Ala Leu Gln Glu Ala Phe Arg Val Tyr Leu Pro Ile Leu Pro Ala
35 40 45Ser Pro Thr Pro Ser Pro Thr Asn
Lys Asp Pro Pro Ser Thr Gln Met 50 55
60Val Gln Lys Glu Ile Asp Lys Arg Val Asn Ser Glu Pro Lys Ser Ala65
70 75 80Asn Ile Pro Gln Leu
Xaa Pro Leu Gln Ala Val Gly Gly Arg Glu Phe 85
90 95Gly Pro Ala Arg Val His Val Pro Phe Ser Leu
Pro Asp Leu Lys Gln 100 105
110Ile Lys Thr Asp Leu Gly Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile
115 120 125Asp Val Leu Gln Gly Leu Gly
Gln Phe Phe Asp Leu Thr Trp Arg Asp 130 135
140Ile Met Ser Leu Leu Asn Gln Thr Leu Thr Pro Asn Glu Arg Ser
Ala145 150 155 160Thr Ile
Thr Ala Ala Xaa Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln
165 170 175Val Asn Asp Arg Met Thr Thr
Glu Glu Arg Glu Xaa Phe Pro Thr Gly 180 185
190Gln Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp Thr Glu
Ser Glu 195 200 205His Gly Asp Trp
Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu Gly 210
215 220Leu Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser
Met Met Ser Thr225 230 235
240Ile Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe Leu Glu Arg Leu
245 250 255Arg Glu Ala Leu Arg
Lys Arg Ala Ser Leu Ser Pro Asp Ser Ser Glu 260
265 270Gly Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln
Ser Ala Ala Asp 275 280 285Ile Arg
Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu Gln Asn Leu 290
295 300Glu Thr Leu Leu Asn Leu Ala Thr Ser Val Phe
Tyr Asn Arg Asp Gln305 310 315
320Glu Glu Gln Ala Glu Gln Asp Lys Arg Asp Xaa Lys Lys Gly His Arg
325 330 335Phe Ser His Asp
Pro Gln Ala Ser Gly Leu Trp Arg Leu Trp Lys Arg 340
345 350Glu Lys Leu Gly Lys Leu Asn Ala Xaa Xaa Gly
Leu Leu Pro Val Arg 355 360 365Ser
Thr Arg Thr Leu Xaa Lys Arg Leu Ser Lys Xaa Lys Xaa Ala Ala 370
375 380Pro Ser Ser Met Pro Leu Ile Ser Arg Glu
Ser Leu Glu Gly Pro Leu385 390 395
400Pro Gln Gly Thr Lys Val Leu Xaa Val Arg Ser His Xaa Pro Asp
Ser 405 410 415Ser Ser Arg
Thr 420
SEQ ID NO: 59, length 32, DNA, Homo sapiens
taaactacaa atggttcttc aaatggagcc ca 32
SEQ ID NO: 60, length 32, DNA, Homo sapiens
gatgcagtcc aagatgcagt ccatgactaa ga 32
SEQ ID NO: 61, length 1740, DNA, Homo sapiens
aggttggctg acaaccgctc ttaactgctt catgctgaat
tggggcatag taggggtcgt 60gcagttgaga tttccttggg aggggtgcct tcaatgtcat
caacattgga gcatgggcta 120gcaggccagt ccaggggtcc gcggtagatc ttagtcatgg
actgcatctg gggctccatt 180tgaagaacca tttgtagttt tacagcttcg attctggaag
agacaaacgt aacaaggagg 240ttaaagatac aaggattgaa atgtacggcc tgaagtgcag
gggcatatga gtgtgggcgg 300tgcaagtggg gtttccttta gaaaaactcc gatacaatag
ggcatcaata tttctaggaa 360gccacattct ccatagaagc tctcggtaag gggagctact
ggtagtacag cagcatacag 420ggggtgcagt gagagtgaaa gggggtaaga gaacagtaaa
aagaaaaata tgacaaggga 480gggccaagag gatctacgat tctagttact ttcctcacgg
ttgtcgcctg aagagcaggc 540gcagatcctc tagaggttca caggaatagc tagcattgtc
tgctggattt tcgggttcct 600ttggcagtat ccagggtttg gctcgagtgt gacttatcca
agactccact ccagccactt 660aactgcggtt agggtagata aaatgactgg gtagggtcct
tcccaggatg tgtgtaggga 720tggggaatta aaggggaagg gacttgacta ataccatgtc
accagggtgg aataattcct 780ttccctcctc tcagggacag gttccctgta atgttttaag
aactcgttga tatttggcta 840aggaggtgat gtctgcaact aagttggccg tctctcagtc
aagcacaagg tcattggtta 900ggaagggctg tccatacagc atctcatatg gactaagtcc
tgctttttgg ggacagtttc 960ggattcttag taaggctata ggcaacagag caggccatgc
aaggtgggtt tcttgggtta 1020gcttttttag atgtcgtttg agtgtttcat tcattttctc
aacttttcct gaggatcgtg 1080gcctccaggc acagtgtaag tgatattgta tacctaacgc
ctgggatact ccctgcgtta 1140ctgcagcctt gaaattgggg ccattgtcac tctgtaaacc
tcagggaagt ccgaatctgg 1200gaattatttc atgaattagt acttttatta cctcttgggc
cttttctgtc ctacaaggga 1260aggcctccac ccaaccagtg aaagtaccca gattagtaga
tactgaaatc tctgagattt 1320gggcatgtgg gtaaaatcta gttgctagtc ttctcctggg
taatggcctg ttctttgttc 1380tcctgaagga gcttggcaat aaggcagggg attatttctt
tggcacactt cacaggccct 1440gactatctgc ttgacagttt tgaaaaggcc tggtccagta
aataatgatt tggccatctg 1500atgggtgctg tcaatgccta agtgaaaggt ctggtgaagg
gttttaagta atttccattg 1560gttagctgca ggcaaaagta ttttttcttt ggtggctggc
catcctgagg agaggaaact 1620atgtcctcgt gagtttcccc attccatttc ttctgctgag
tactggagct tggtttccca 1680gaggggatta ccccatacta ggggtccttc tgtaagcatt
tctaatggag agtcctgcct 1740
SEQ ID NO: 62, length 7140, DNA, Homo sapiens
ttggtcttaa gaacacaaat
gatatggctc caatgactgg aggaacacca gggtccttgg 60tctcacgctg atttagataa
aacgactgtc aggcctctga gcccaagcta agccatcctc 120ccctgtgacc tgcacgtata
catccagatg gcctgaagta accaaagaat cacaaaagca 180gtgaaaatgg cctgttcctg
ccttaactga tgacattcca ccattgtgat ttgttcctgc 240cccatcttaa ctgagcgatt
aaccttgtga aattccttct cctggctcaa aacctccccc 300actgagcacc ttgtgacccc
cgcccctgcc cctaagagaa aacccccttt gattataatt 360ttccactacc cacccaaatc
ctataaaatg gccccacccc tatctccctt cgctgactcc 420tttttcggac tcagcccgcc
tgcacccagg tgaaataaac agccttgttg ctcacacaaa 480gcctgtttgg tggactctct
tcacacggac gctcatgaca tttggtgcca aaacctggga 540taggaggact ccttcaggag
accagtcccc tgtccttgcc ctcactctgt gaggacatcc 600acctacaacc ttgggtcctc
agaccaacca gcccaaggaa cagctcacca atttcaaatc 660aggtaagcag tcttttcact
ctcttctcca gcctctcttg ctacccttca aactccctct 720ctcactaccc ttcaatctcc
ctgtccttcc aattccagtt ctttttcatc tctagtagag 780acaaaggaga cacattttat
ccatggaccc aaaactccag caccagtcac ggacttggga 840agacagtctt cccttggtgt
ttaatcactg cggggacgcc tgcctgatta ttcacccaca 900ctccattggt gtctgatcac
ggtggggaca cctgccttgg tcactcaccc acattccctt 960ggtggtacgt caactgcaaa
agcaggggac gcctgctttg gctgctcacc cacccccttc 1020tctgtgtctc tacctttctc
tttaaactta cctccttcac tatgggcaaa cttctgccct 1080ccattccccc ttcttctccc
ttagcctgtg ttcttaaaaa cctaaaacct cttcaactca 1140cacctgacct aaaacctaaa
tgccttattt tcttctgcaa cactgcgtgg ctgcagtaca 1200aacttgataa tagctttaaa
tggccagaat atggcacttt caatttctcc atcctacaag 1260atctagataa tttttgtgga
aaaatggaaa aatggtctga gatgcctgac gtccaggcat 1320tcttttacac attggtccct
ccctagtctc tgctcccaat gcgactcatc ccaaatcttt 1380cttctttctc tcctgtctgt
tccttcagtc tccaccccaa gctctgagtc ctttgaatcc 1440tcctttgcta cagacccatc
tgaactctcc cctcctcccc aggctgctcc tcaccaggcc 1500gagccaggtc ccaattcttc
ctcagcctct gctcccccac cctataatcc ttttatcacc 1560tcctctcctc acactcagtc
cggcttacag tttcgttctg tgactagccc tcccccatct 1620gcccaacaat ttcctcttaa
agaggtggct ggagctaaag gcatagtcaa ggttaatgct 1680cctttttctt tatctgacct
ctcccaaatc agttagcgtt tacgctcttt ttcatcaaat 1740ataaaaaccc agccagttca
tggcccatct ggcaacaacc cttacaggct ttacagccct 1800agaccctgaa gggtcagaag
gccgtcttat tctcaatatg cattttatta cccaatccgc 1860tcccaacatt aaataaagct
ccaaaaatta aattctggcc ctcaaacccc acaacaggac 1920ttaattaacc tcacttcaag
gtgtacaaga atagagtaga ggcagccaag tagcaacgta 1980tttgagttgc aattccttgc
ctcaactctg agagaaaccc cagccacatc tccagcaaac 2040aagaacttca aaacacctga
actgcagcag ccaggcgttc ctccaggacc acctccccca 2100ggatcttgct tcaagtgccg
gaaatctgac cattgggcca aggaatgcct gcagcccagg 2160attcctccta agccacgtcc
catttgtgca ggaccccact ggaaatcgga ctgtccaact 2220cacccggcag ccaatcccag
agcccctgga actctggccc aaggctctct gactgactcc 2280ttcccagatc ttctcggctt
agcagctgaa gactgacact gcccgatcac ttcagaagtc 2340ccctggacca tcacggatac
tgagcttcag gtaactctca cagtggaggc taagtccatc 2400ccctgtttaa tcgatacagg
ggctacccac tccacatcac cttcttttca agggcctgtt 2460tccctttccc ccataactgt
tgtgggtatt gacggccaag cttcaaaacc ccttaaaact 2520cccccactct ggtgccaact
tggacaacat tcttttatgc actctttttc agttatcctc 2580acctgcccag ttcccttatt
aggccgagac attttaacca aattatctgc ttccccgact 2640attcctgggc tacagccaca
tctccttgcc gcccttcttc ccaacccaaa gcctccttca 2700tatcttcctc tcatatcccc
ccaccttaac ccacaagtat gggacacctc tactccctcc 2760ctggcaaccg atcacacgcc
cattactatc ccattaaaac ctaatcaccc ttaccctgct 2820caatgccagt atcccatacc
acaacaggct ttaaagggat tgaagcctgt tatcacttgc 2880ctgctacagc acgggcttct
aaaacctata aactctccat acaattcccc cattttacct 2940gtctaaaaac cagataagtc
ttacaggtta gttcagaatc tgcaccttat caaccaaatt 3000gttttgccta tccaccctgt
agcacccaac tcgtacactc ttttgtcctc aatgccttcc 3060cccacaactc actattccgt
tcttgatctt aaagatgctt ttttcactat tcccctgcac 3120ccctcatccc agcctctctt
tgcttttacc tggactgacc ctgacaccca tcagtcccag 3180cagcttacct gggctgtact
gccgcaaggc ttcagggaca gccctcatta cttcagccaa 3240gctctttctc atgatttact
ttctttccac ctctctgctt ctcaccttat tcaatatatt 3300gatgaccttc tactttgtag
cccctccttt aaatcttctc aacaagacac cctcctgctc 3360cttcaacatt tgttctccaa
aggatatcgg gtatccccct ccaaagctca aatttcttct 3420ccatctgtta catacctcgg
cataattctt catgaaaaca catgtgctct ccctgccaat 3480tgcgtctcca actgatctct
caaatcccaa cctcttctac aaaacaacaa ctcctttccc 3540tcctaggcat ggttggatac
ttttgccttt ggatacctgg ttttgccatc ctaacaaaat 3600cattatataa actcacaaaa
ggaaacctag ctgaccccat agattctaaa tcctttcccc 3660actcctcttt ccattccttg
aagacagctt tagagactgc tcccacacta gctctccctg 3720tctcatccca acccttttca
ttacacacag ccgaagtgca gggctgtgca gtcggaattc 3780ttacacaagg accgggacca
tgccctgtag cctttttgtc caaacaactt gaccttactg 3840ttttaggctc gccatcatgt
ctccatgcgg tagcttccgc tgccctaata cttttagagg 3900ccctcaaaat cacaaactat
gctcaactca ctctctacag ctctcacaac ttccaaaatc 3960tattttcttt ctcacacctg
acgcatatac tttctgctcc ccggctcctt cagctgtatt 4020cactctttgt tgagtctccc
acaattacca ttcttcctgg cccagacttc aatctggcct 4080cccacattat tctggatacc
acacctgacc ctgatgattg tatgtctctg atctacctga 4140cattcacccc atttccccat
atttccttct tttctgttcc tcatgttgat cacatttggt 4200ttactgacgg cagttccacc
aggcctgatc gccactcacc agcaaaggca ggctatgcta 4260tagaatcttc cacatccatc
attgaggcta ctgctctgcc cccctccact acctctcagc 4320aagccgaact gattgcctta
actcgggcct tcactcttgc aaagggacta cacgtcaata 4380tttatactga ctctaaatat
gccttccata tcttgcacca ccatgctgtt atatgggctg 4440aaagaggttt cctcactacg
caagggtcct ccatcattaa tgcctcttta ataaaaactc 4500ttctcaaggc tgctttactt
ccaaaggaag ctggagtcac acactgcaag ggccaccaaa 4560aggcgtcaga tcccattact
ctaggaaatg cttatgctga taaggtagct aaagaagcac 4620ctagcgttcc aacttctgtc
cctcatggcc agtttttctc cttcccatca gtcattccca 4680cctactcccc cattgaaact
tccgcctatc aatctcttct cacacaaggc aaatggttct 4740tagaccaagg aaaatatctc
cttccagcct cacaggccca ttctattctg tcatcatttc 4800ataacctctt ccatgtaggt
tacaagccac tagtccacct cttagaacct ctcatttcct 4860tccatcgtgg aaacatatcc
tcaaggaaat cacttctcag tgttccatct gctattctac 4920tacccctcag ggattgttca
ggccccctcc cctccctaca catcaagctc ggggatttgc 4980ccctgcccag gactggcaaa
ttgactttac tcacatgccc tgagtcagga aactaaaata 5040cctcttggtc tgggtagaca
ctgtcactgg atgggtagag gcctttccca cagggtctga 5100gaaggccact gcagtcattt
cttcccttct gtcagacata attccttggg ttggccttcc 5160cacctctata cagtccaata
acggagcagc ctttattagt caaatcacct gagcagtttt 5220tcaggctctt ggtattcagt
ggaaccttcg taccccttac tgtcctcaat cttcaggaaa 5280ggtagaatgg actaatggtc
ttttaaaaac acaccccacc aaactcagcc tccaacttaa 5340aaaggaggat agagcccaaa
aactcgcaac caagctagta attatgctga acccccttgg 5400gcactctcta attggatgtc
ttaggtcctc ccaaatctta gtcctttaat atctgttttt 5460ctccttctct tattcggacc
ttgtgtcttc cgtttagttt ttcaattcat acaaaaccgc 5520atccaggcca tcaccaatcg
ttctatacaa taaatgctcc ttctaacaac cccacaatat 5580cgccccttac cacaaaatct
tccttcagct taatctctcc cactctaggt tcccatgccg 5640cccataatcc ctctcgaagc
agccctgaga aacatagccc attatctctc cataccaccc 5700ccaaaatttt tgctgcccca
acacttcaac actattttac attatttttc ttattaatat 5760aagaagacag caatgtcagg
cctctgagcc caagccatca tatcccctgt gacctgcaca 5820tatacatcca gatggcctga
agtaactgaa gaatcacaaa agaagtgaaa atggcctgtt 5880cctgccttaa ccgatgacat
tccaccactg tgatttgttc ctgccccacc ttaactgagc 5940aattaacctt gggaaattcc
ttctcctggc tcaaaacctc ccccactgag caccttgtga 6000cccctgcccc tccactaccc
acccaaatcc tataaaatgg ccccacccca tctcccttag 6060ctgactcctt ttttggactc
agcccgcctg cacccaggtg aaataaacag ccttgttgct 6120cacacaaagc ctgtttggtg
gactctcttc acagggacgg gggtgacaac aacacggaca 6180cacatggagt ggttttaagg
agcagagagt ttaatacgca aaaaagaagg aagaggctcc 6240cctgtacaga cacagaggga
gggggctcca agccgagaga aggaaacccc atgtgcagtg 6300gaaaagtggt tgattatact
gggaggctgg aggaggcggt gtctgatttg cacagggccc 6360aggggattgg gttgaccagg
tgtatcattc atgtaccccg caaaaaacct ggccctccca 6420cctcagccct ttaatatgca
aatgtgggtt gccatgatgt tctgaaaaca catgaattat 6480ctggaggggg ccatgacact
tggtacatgt gctgacaaga agagggtggg aatcgccatg 6540gtggccatgt tgggtggacc
tagtttttaa tagcctgcat ttgcatatca aagtttgctg 6600gcctggctct ttaagctgtc
ttttctgtta gaaaaggaat ggtttggaat gggtgagggt 6660tgcttcttat tacaagaaaa
tttccaaaaa cctttactct ttctagctgc caaaaaacta 6720tttcttaata acttatgtat
taccataatt aggcagcacc aaagatccct gcaggtcaga 6780ccactgcaat taacatgctg
gctttactgc tgattatggt agctgcatcc acctagcctc 6840tcatattgca actgcctgac
ctctgccacc ccacgagcca cttatcccca cttataatca 6900gcccatttcg attgtaacat
ctgccactta ttcccgacgt tgtggtatat cctatagatg 6960aattcattca acatccattc
caacaccacc tctcttgcct tcctatactc tctggagagt 7020gaattactga gtcacatgat
cttcactgca gtcatttgtg gctatgtgac atagttctgg 7080acagtgaaca tagacagaag
tccctggggc gggcttcctt tctgggatga gggcaaaacg 7140
SEQ ID NO: 63, length 44100, DNA, Homo sapiens
tgcctttatt tccgtaggct ggtcatatgg cgctagcact cacataaagc taccgaggag
60agcgaatgaa accaaaatca ctttaccttc acagcacgag gccgtcgtcc ctctcgatat
120ttggcccgtg tgtcgcatac cgccctctgg acgtggtgat caaataaact ccctagctcc
180ccgccgctcg acgccatctt gcctactttg atcctcgcag ggaggacaac atccgcccta
240ctgagctccc ttttatccaa taagagagcg ggatgagtta aggagtgcca ggattggctg
300gagaatcgac agcgtcggcc atcgtttcct gcgtgcgaag atttgatgaa cgaggtgccg
360cccccgagcg gctcggcgga gaggcgcggt gggtgacaga agctttcttg tcccacccac
420tacaggctta cggcaggatg cgcagcgggg agagggggcg gggccgcagg gggcggggcc
480gatcgatctc ctccggctcc gacgtcctcg gcctgccggg tcccgggtcc tttgcggcgc
540tagggtgggc gaacccagag cgacgctccg ggacgatgtg gggcagcgat cgcctggcgg
600gtgctggggg aggcggggcg gcagtgactg tggccttcac caacgctcgc gactgcttcc
660tccacctgcc gcggcgtctc gtggcccagc tgcatctgct gcaggtaacc tgccggcccc
720gagccacctg atcttcagcc tggggtcgga cgaggccgaa gcctctcagg gacgcggcgg
780gacaccggct gccacccggg cgccgccgaa gcgcgcagag atcagggtcc ctcgacggca
840gggcccttct gggtagtctc tggatcccac aagtccagtg cagccctggg ctcgtcttat
900cccaggtctt ttcacttggt gaaactgaac ctagaaacgt cctaatattc taccactgtt
960tttataaata ttccttattc caggctggaa aagctcctga gaagtggttt gtttttatta
1020ttttaaaagg tgttttcctt gccagccatt tccagttaac ctgcgctgct gccgtccggg
1080ccgcgagagc gggacgcaga gttgttggcg gagcccctgt cggttcccgg ggactaagca
1140ccgcgtccca tgagcgggaa aggttaatac aatgatggtt ctgccctgcg tcgctgacgc
1200ggaacacagc tgtagtgtgt taggaacaca taacgtagtt aagatcactt gaagctctgc
1260gatcagtcgc ccttctggac gttgtggtta ggatgtttca cagttctaac cactggtgga
1320gatacagcgt ccatattttc ataattaaaa atagaggcac atggtctcac gagtttgagt
1380gtacttatgg gggcaaaagg acggcgtatt tgaaatcctc ataaatcctg gatgcatggt
1440acccaccagt ggctaatcta tgcaatgaat agagtttgca ataatttcaa gcatcccttc
1500tttccacttg agttacttcc ccatacctag gggaagatat ttttggtcca ctgaaaacat
1560gagttcagca gaatcctcct atcatcgtcg ttattatttt ttaccactaa gtagacaatc
1620ttttggtttt tgatgggctt tatggctaga gacaaatcag tcactgtcac caagttccag
1680gtagaagttg gttcagtgct ctgtcagctt cgatgggatt tttcaacatg ttttcaaatc
1740tgcacttaat agtaggaatg ctttcttaca gtaactctaa tttgatccta agatgtagtt
1800gttaccttac attcatcact gtttaagaat ttagtggtct tgatctttgt tttaaatttt
1860gagccttcgg gaagtactta taagaattaa ttcatgcata tctttttgaa atgtaaatgt
1920ctttagccct ggaacaaatt gctgtttctg ttcagcccat attagcagaa taggtcaact
1980ttactttcta attatcaatg taataagttt attactttat agattccata aatctataca
2040tttattcctc gatgaattat ataaatttat agaatttatg ttttatagaa aatttggaaa
2100gcatggaaaa ttattaacaa gaaaataagt tacccataat cccagaactt agaggtgact
2160aatgttgaca gtttggatca aatcttccag ttttgtttct aatctttatt tttaacataa
2220atgaggtcct gtatacacac gtacagtttt gtgtcctggt gtttttattt aatgttatta
2280tgagtgtttt attttgttaa aaggtcatca ttttaagttg ttaattagta ttctagcaca
2340aatttgccat aatttattta attgtttact atgattgacc atttagattg tacttaattt
2400ttaggcatta gaagtgataa actatatttt aatcagacgt tgaaaataac acatctttgt
2460ttagaaaaca tcattttatt tctggttgtc taggatagat tcccagaatt cttgggttag
2520aggccataga taattatgaa agcagaaaga ttcacaagtt gggagttaat acttgaatta
2580ctttatttgg ggtgaagcat tgagtgcata atacagatca tgcagtaatg ggaagaaggg
2640ttggaacaat ggttttctgg cctatgtcag acttaccttg aagcttttaa gaatacagat
2700gttctgatca accctcagac ctattaaatc agacctaaaa tcttagggaa taggctttag
2760gcatctctaa ttttaaaaaa tttattcagg ctacttggat gcacaaaaga gttgagacct
2820actgtcctag aatcatagaa ttttaatgac gatagagacc ttaagcatct aggtcgtttc
2880tgtactttta catgtaagga aactggcatt cctaggccag taccattgcc atgcagctaa
2940tttgccctct tgtctatagc tcactctgca tcacccaacc taccgttctc actgtttctt
3000ctataaccaa tctccttccc acttctgttc tcttactcat gccattcttc cctcagtcat
3060ttttcttcct tccatacaaa ttccatgtct ttaaaaagga ataatcctac ctcctccaca
3120tagctttcca attctctgtt gcccacattt gtctcccttt caatacttct ctgttgtgtt
3180atgtgacaca tcacatttga tatactctgt actgtgtttc aagtattgta ttctcttgtt
3240tactcaagtc attatttcag gactgactac ccagtagatg ctttaagtca ggatttctca
3300accttggcac tgttgacatt ttgagctgga taattttttg ttttgggggc tctcctgtac
3360attttaagat gtttaacagc acccttggcc tctatccagt agacgcctgt actgcctccc
3420cctatctgtg acaaccaaaa aggtcttcag acattgtcag atgtctactg aaggacaaaa
3480tcacctctgg ttgagaacca ccgcttcaac taagttatct tctctgtact cagaacttga
3540tgtgattgca gcagggggag aggattcata tacacagtga atgcaaacga acctaaatca
3600ccattcggat atggccacac aattttcatt tcccttgtgt tagcaagaga taccctaggc
3660tttggacctg attattccta aggcattctg atgtatggtt ttacctgcag atttcctggt
3720aatactgata cctcagtttg ggtcaaagaa ggtcaattaa ttgattgatt tgatttgact
3780cctggaaaag acgctccttt ctagctgtct ctttcttctc tttacctgaa tagccagggc
3840tctgtggttc aagtgaagta ttttgacata aaaattaact tagaacattg gtctgcagag
3900tttgctcaat ataactgagc acatattgtg gctttatgga gctggttact actttttgac
3960caaataaata attagaagta tttttcctcc tcaataaggt tcatttttcc ttttttcagt
4020gagctggtag agtttccttt tttgatattt cagggcatct ttcatatttc catctcttaa
4080gtttcttcat atgaagtaga atttatctgg attatgtatt gctgactctg atgaaaaccc
4140atagaaagca tctggggctt gatcaccttc attcttgtaa tagctcacac ggttacagct
4200gatatggtaa cttaagactt ttgattccaa atctaggcaa aatacactca gttgaaagaa
4260tttgtcagcc agaacagttg gactgttctg tgaaaattgt gagaaaaatt acacaactaa
4320gtgatacatg atgatggctt tcttaaatat aaaattgtaa taacatggtt aatttccagt
4380acgttatatt gtcccagaag tggctccaac attgtttgaa atttgtctca tttaaagaaa
4440cataagctgg ctatggtggc tcacgcctgt aatcccagca ctttgggagg ctgaggcagg
4500cagatcacct gaggtcagga gttcgagacc agcctggcca acatggtaaa accccatctc
4560tactaaaaat acaaaaatta gccgggcatt tggtgggggc ctgtaatccc agctacttgg
4620gaggctgagg caggagaatt gcttgaatct gggaggtgga ggttgcagtg agccgagatt
4680gtgccactgc cctccagcct gggtgacaga gtgagtctcc gtctcaagaa aaaaaaaaaa
4740aaaagcaaga aacataaaga ctgggcatgt tggctcatgc ctgtaatccc agcactttga
4800gagactgagg tgggaagatc acttgagccc aggaggttaa ggctgcagtg agccgtgatt
4860ttgccactgt actcgagcct gggcaacaca gtgagatcct gtctcaggaa aaaaaaaatt
4920gcatgtaaat gaatgaattt gatatttaat attttaaatt atgaaaactg ttctgtagag
4980atgtagatct tgccatgttg cccaggctgg ctttgaactt ctgggctcaa acaatcctcc
5040tgtctcagtc tcccaaagta taaagattac acatgtgagc cactgcacct ggcctaatat
5100ttttaactta atgaatttat tttgatataa ataaattaat aacactgaag cttcctgata
5160taataagtct ttttgtgtgt gtgacgggtt ctcactctgt tgcccagact ggagtgtaat
5220ggcactatca tggctcactg tagcctcaac ctccctgact caagtgatcc tcccacctcg
5280gcttcctgag tagatgggac cacaggcgta tgccaccaca cctggctgat ttttaaaatt
5340tattattgat acatattaat aaaattattt ttattttaaa aatgatatat gtggctgggc
5400atggtggctc atgcctgtaa tcccgacagt ttgggaggcc gaggtgggag gatcacttga
5460gaccaggagc ttaagaccag cctaagcaac atagtgagat cccatctcta tagaaaaaaa
5520aaatggctag gtgtggtggt gtatgcctat attcccagct actcaggaga ctgaggtgag
5580aggattgcta gagcccagga gtttcaagtt acagtgacct atgattgtgc cagtgcactc
5640cagcctgggc aacagagcaa aatcctgtct caaaaaaaaa aaaagttcga aaatgcttat
5700gatgcaatat aagtagtgga aaaggatatt aaattgtgcc tatatgaaca caactatatg
5760aaaaacttgc acatagagaa aaggattaac aagaaataga ccaaattgtt cacatggttg
5820tcttgtttgt ggagagaata tcagtagttc atttgtttcc ttccaagttt atatgttttc
5880cgaggtctct ataatgagtt tgtaattgtt taatcataga aaaccctttt ttggtccttg
5940gccacaaact tacatgtttt aatgtaattg cttttttaat gagaataaat gttatatttt
6000gcttttttaa aacctatatt cccatagtta tatgagccct tacaattatt aagaggctgc
6060ataatataac gtttctggaa gggtacagaa gaaacagcag taattacctc tgagaacaga
6120gacatggctt cacattttac ccttttgtac gttttgtgct tttgccacat gcatttatta
6180ttcttccaat aaataagtaa ataaatatgg attgtatact ccatctggtt ggtgtttcat
6240aattctaaaa ttatattgct acatttttaa agatgatatg tgtttctact tattaacgta
6300tatgttaaaa tagtaaattt atatcttatt taataatttc cctattgata gacatttaag
6360acagtctcaa gtgttcacta tcatagaaaa tactgcacag atagcttttg ctatagtttc
6420ttttttcttt gaatcgttaa ttgggaataa atgctcaaat agttatatgt ggctcaactg
6480ctatttaagt ttattgactg actgctgcca ttttgaattc tgaaggggtt gattaaattt
6540ataatgctgc cataagaata taagggtatt ggcttcatta gcatccacca gcattgggtg
6600ttggaaatga ttatagattt ttaaatgcta caacaaatgt agataacaga gaactatcta
6660tagaactctt tttggacatg tgaattgtaa taatagttta ttttcatgtg aatccagaaa
6720aatgtatacg aaaacctttt ttcctctcat ttcttatatg aatagaatca agctatagaa
6780gtggtctgga gtcaccagcc tgcattcttg agctgggtgg aaggcaggca ttttagtgat
6840gggggacagg taagcacatg tgatggcaat aactttcttc taatatcaca taatatagca
6900atagaaataa aattaaaagt ttagattttt tgttaaagga ggtgagatgt cacctaattt
6960gtatgctatt atgtaactag tctaggatat tgaagctgac tatactctgt ttttaggtca
7020ttatcttgta gtttaccata ctccctactt gcttcttatt ctactattta actcattttc
7080cacatcccct aattttggtt tcatgaaatt atttttcctt ctgaattact aggttctact
7140tactattatt aaactttatt tctgacatat tttataacct tccatggtct cacttgatta
7200aaaataaaaa attcagctgg gtgcggtggc tcacacctat aatcccagca ctttgggagg
7260ccaaggtggg cggataattt gaggtcagga gttggagacc agcctgccca acgtggtgaa
7320accccccctc tctactaaaa attcaaaaat tagctgggca tggtggcagg tgcctgtaat
7380cccagctact caggaggctg aggcaggaga attgcttgaa cctgggaggt ggaggttgca
7440gtgagctgag attgcactgc tgcacttcag ctgggtgaca agagcgaaac aatgtcttga
7500aaaaaaataa aaaataaaaa attctacaac acagggttat tatttttcca tttttgtttt
7560cccttatgag tttaatatgt ttagattata aacctgaaag cttgaatacc tatgtctatc
7620ttttgttttc ttatgtttat caagttattc ctttaaacat tttctaaact gtaagaataa
7680tgtgaggctg ggctcaatgg cttatgcctg taatcccagt gctttgggag gccaaggtgg
7740gaggaccact tgaggccacg agttcaagat tagcctggct aggcaacata gcaagaccct
7800atctctataa aaaaattaaa aaaattagct gggcatggta gcaaatgctt gtagtcccag
7860ctactcagca gactgaggta ggaggaatgc ttgagaccag gaatttgagt gacctatgat
7920tatgcactcc agcccgggca atagcaagac cctatctctt aaaagaagaa gatgtagtaa
7980taatacatat tcattataac tattttacca ttgaaagtaa aaaatgagtt tttacctttt
8040cccagtccca tcctcagaat ggggatctca gtagaccttt aggattggaa gaatgagatc
8100attcatattt tctgcaatta ttaccccaca aaatatttca gatacctttc catgtattac
8160aaacaatgtg catttaacat gtctctctct ttctctctct ctctgtgtgc gtcttcatga
8220tcctctgttg cagccctgcc agtaagacac tatctcctga agaatcactg ataggaacag
8280aaagtggact ggctaggcca ggagtcctta gcttcttagg gggcaggagc tgctttgtgc
8340tttctcagaa tcagatatat atgtggactg aaacatttaa aaacagaata gccaagggtg
8400ctatacgttt aaaacttata tagatggggc tacattgctc tctattacta atttcccatg
8460acaatacacg agagtgccat gtctttttaa cttgttttga gcacagacta atcttgttta
8520tgcatgtttt ttgatgagaa taggctactc atgagaaatc tgtaaaccta acactagtcc
8580cttgcatact ctaaattgtt gctagaatct taaaatttta gcaccagacg gaccttagaa
8640atcattaact ttggtgcttt gttctacaat acaaggagat ggaatatttt acccaggatt
8700gcttagcagg ttacagttct gccctctgag tacccagcac ttccctgtgg gcaacatcaa
8760cttcctgatt ttcaagtctt aattagtact ctgaagaatc ctacttgttt ttaactccca
8820tttgctttga agtgacttta cctgattttt ttagatccct tattgcagca atgccactaa
8880gaaactgagt ctctagcttc ttggtgggca ggagctgctt tgtgcttgct cagaatcatc
8940cttttcagta agggagatat tgaagagaaa tctactgagg agtctggggg tgaggcactc
9000agggaaatcc tgctccagtc cacaaaagca gagaggaagg gttggttacc tagagtattt
9060aacatgcaga ggctttggat tttactcctt taatccttgg aaatgcctat ggaaggggaa
9120aggaagtaag atggtgactc cagcttatag acatactagt gttacatata tttaaactat
9180aataggaggg tattattagt tttacttaac tttcaactgt gaaggattat acttctcaat
9240atttgtctcc agtgtctatt tcagtgtatt tttcactttt cttgaagcag catgtctgtt
9300gcaaaacttc tagaaataat gagaatattt atatattaga tcaagccata acttgatgat
9360atagtcattt cttcttatat tttttactta catttttaca ttttaatgat tactttcatt
9420tttgaaaaac atgtcatgct gagatgtatt tttcttcatt ctgtaattag ttatgaaaca
9480gtttttccta aaatgctgag tatatcaagt cttggctaag aataagtaat aaatatttgc
9540cacatgaaag actacacata tagccaggtg cagtggcttg cacctgtttt cccagctacc
9600caggaggctg aggcaggagg attgcttgag cccagggttt ccaggctgca gtgaactatg
9660attgtaccac tctactccag aatgggtgac agagccaggc cccatctctc aaaacagaaa
9720agaaagatta catagactac atatacaccc ccatccaaaa catacacaca catctactta
9780acctaaaatg gtaagaagat aacttcttat tttctaatat atgacacaga aaagtttttt
9840taaagtagtt ttaaattttt aattttttct aggtatttct caagccatgt tcccatgtgg
9900tatcttgtca acaagttgag gtggaacccc tctcagcaga tgattgggag atactggtaa
9960agaaaaccaa ataagaacta tctcatttaa ggttaaatta cttcacaata tcaatgtctt
10020tagctttctc taagctttat tatatattct gagttggttt tgaattataa gaatgaattg
10080gggccaggca cagtagctca tgcctatagt cccagcactt tgggaggcca aggcaggtgg
10140attgcttgag tccaggagtt caagaccagg ctgggcaaca tggtgaaacc ccgtatctac
10200taaaaataca aaaattagcc aggcatggta gtgcatgcca ttagtcccag tcacttggga
10260ggctgaggca ggagaatcgc ttgagcccgt aaagtcaagg ctgcagtgag tcaggatctt
10320gccattgtac tccagtctgg aaaacagagt gagaccttgt ctcaaataaa aaaagaatga
10380attgatagag atctaatgta caacctgaca actataggta ataaaattgt attggggatt
10440catgttaaat gagtagattt taactactct taccacaaaa acacaaaagt gggtaactgt
10500gagatgatgt atatgttaat ttacttcact atagtaacca ttatactatc tatatgtagc
10560tcataacacc atgtcgtgta tattaaatat gcacattaaa atttgttttt taaaaaaaga
10620attgagattt tttttaacta gatatggagt ggacaaaatg taaagtgaat tgatcttttc
10680gtctgttggt tctaggagct gcatgctgtt tcccttgaac aacatcttct agatcaaatt
10740cgaatagttt ttccaaaagc catttttcct gtttgggttg atcaacaaac gtacatattt
10800atccaaattg gtaggtgcta ttgtaatatt tgctgtcata ttctacacta tagcattgag
10860tccaaagtag aaatgaatgt gcactaatga gctttatttt ctacacagtt gcactaatac
10920cagctgcctc ttatggaagg ctggaaactg acaccaaact ccttattcag ccaaagacac
10980gccgagccaa agagaataca ttttcaaaag ctgatgctga atataaaaaa cttcatagtt
11040atggaagaga ccagaaagga atgatgaaag aacttcaaac caagcaactt cagtcaaata
11100ctgtgggaat cactgaatct aatgaaaacg agtcagagat tccagttgac tcatcatcag
11160tagcaagttt atggactatg ataggaagca ttttttcctt tcaatctgag aagaaacaag
11220agacatcttg gggtttaact gaaatcaatg cattcaaaaa tatgcagtca aaggttgttc
11280ctctagacaa tattttcaga gtatgcaaat ctcaacctcc tagtatatat aacgcgtcag
11340caacctctgt ttttcataaa cactgtgcca ttcatgtatt tccatgggac caggaatatt
11400ttgatgtaga gcccagcttt actgtgacat atggaaagct agttaagcta ctttctccaa
11460agcaacagca aagtaaaaca aaacaaaatg tgttatcacc tgaaaaagag aagcagatgt
11520cagagccact agatcaaaaa aaaattaggt cagatcataa tgaagaagat gagaaggcct
11580gtgtgctaca agtagtctgg aatggacttg aagaattgaa caatgccatc aaatatacca
11640aaaatgtaga agttctccat cttgggaaag tctgggttag tataaatttt ataacttggg
11700agaaatttta tgtggcttaa acatccccaa attatgaatt agaatagtat ttcatatata
11760aattgaaaat caattaaaaa gaaacacagt gcctaaaggc acttggggga cacatttacg
11820ctttgcagta aagtccttgt ttggataaag attgtatgtt ttctggccaa gtaagcttga
11880ataggtacaa gcttagatag gttcaggcca gagaggtcaa aattacttgc ctgagattgc
11940atagctagtg ttacaactag gattcaaacc caggcagatt gacttggggg ttcatcagga
12000tggagtgccc tacaaagcct cccatcttta atgcttgcag atttgttccc cagttaccga
12060aagcaacttg ttaatattag ggaaaagggc cagtgtaggg agagatccat ggcatgaggt
12120aaccttcctg ctgcatgtgg tggcacctgg attggaatgc atccaggagc tgcttaccct
12180gccggtgtct gctctttaat ttgtgtataa cggagaggaa gtagacaggg caactagtgc
12240tccagcccct catcctggcc acaaatatta atgctacctt tatatgacat aagtcactag
12300tccatttatt ggaacctaaa tttgaaccac tgtaaagtaa gacttcatag tgataaagag
12360aggaacttgt taggaaagag aataaaatag aaagagaagg ttgtctcctt ttgtagattt
12420tttttttttc tccaacagtt ttacctgtga cctttataca aataactgac aaagcattaa
12480tctctttggc ctacatcatt ttcttttcta tttttttttt ccacaagatg gagtttcact
12540cttcttgccc aagctggagt gcagtggcat gatctggctc actgcaacct ccgcctccca
12600cgttcaagtg gttctcctgc ctcagcctcc tgagtagctg ggactacagg catgcaccac
12660cacgcctggc taattttttg tatttttagt agaaactggg tttcaccatg ttagccagcc
12720tggtctggaa ctcctgacct caggtgatct gcctgcctcg gcctcccaaa gtgctgggat
12780tacaggcatg agccactgct cctggccggc ctacatcatt ttctaaagct ccagaccatt
12840cttttctttt cttttctttt cttttctttt cttttctttt cttttctttt cttttttctc
12900ttctcttctc ttctcttctc ttctcttctc ttctcttctc ttttcttttc tttttttgag
12960ttagaagctt gctttgttgc ccaggctgga gtgcagtggc accacctcca ctcactacaa
13020cctccacctc ccaggttcaa atgattctcc tgcctcagcc ttcagagtag ctgggactac
13080aagtgtgcgc caccactcct ggctaatttt tgtattttta gtagggacga ggtttcacca
13140tgttggccag gctagtcttg aactcctggt ctcaagtgat ccgcctgcct cagtctccca
13200aggtgctggg attacaggcg tgagccactg tgcctggcct cagatcatta ttttctgtta
13260gctttaaact gtccgttcag gagatcccac tgcatcctca aattcaaaat atctaacact
13320gagcttatga tttagctggt tctgtcatta gatgggaata tccttttatt tccttgaaat
13380tatatggtga gaacagggag aagtgctgat ggtaaagtcc tgtgattaag atagcaataa
13440ggactccgcc cttcccactc cactgaaggt tgaagagcca tggacaatga gaagtcacag
13500taggtgaaat caggtactaa aatggacttg gcttgagaga tcaaaattga tcacttggtg
13560atacaactaa caaattcatg ttaacttgaa cctttattac cctgtgaagc atggtgatta
13620aaaaaaaaca acaaacaaac aggaaacttg attgttaaat tctctttaag tcagaatatg
13680taccttagag tttttattta tgcttttgtc taccattaat atgtctgcac ctgctcttta
13740gaagttaata gagagtaaag tcgtctttat gtctttcagt gcttacttat atttgggaag
13800ttgagaaaaa tttttaacat cattattgat atatatatat atatatatat atatatatat
13860atatatatat atatatatat atagataatt tttttttttt tcttgagacg gagtctcact
13920ctgtcgccca ggccggagtg tggtggcgat ctccactcaa tgcaagctct gcctcccagg
13980ttcaagcgat tctcttgcct cagcctcccg agtagctagg atacaggctc ccaccaccac
14040gcctggctaa tttttgtagt tttagtagag acgaggtttc accatattgg ccacgctggt
14100ctcaaactcc tgaccttgtg atccgcccac ctcggcctcc caaagtgctg ggattacagg
14160cgtgagccac tgcgcccggc tgaggtaaaa tttaaagtgt acaattcagt catttttagt
14220atatttatac tagttgtaca gccatcacca caatctaagt ttagaacatt ttcattaggg
14280ggtgggagaa attttactct gctttttaga ttaagtttct gtctggatct aatcatttaa
14340tcagacaatc aggcagattg tctgtgatta gttttggcca ttccagcttc ttcattggtt
14400gttaactttc acaaataaag gctgctcaaa gattagaaat aacatttaat ttgaatgtaa
14460atgtgccata gtttaaaaga tgggtttggt gaatacagtc aaatacatac atttaaagct
14520ctaattctga agattatgta aagaaaagga aagaaatgta gggagaggat tgaaatgttc
14580atggtataac aatatctgaa catccatctg gtcacaccgt tggtatttga atgttttgtc
14640ctcctcaaat tcatatgtcg aaatcccaac tcccaaggtg atcgtattag gaggtgtggt
14700ctttgggaag tgattaggtc atgaaggtga agccttcatg aatgggattc gtgctcttat
14760aaaagagaac tgtgagaaat aagtttctgt cgtttgttag ccacccagtt taggatattt
14820tgatatagca gcctgcatgg actgagacaa ctatgagtta ttatgatagc ttctgttatt
14880tcacctaaat tcatagaagc taatatatca atatttatgc tatgaaatat ttcttaacca
14940agctttgaat atatttatat ttttgtttat ttttaaattt cagattccag atgacctgag
15000gaagagacta aatatagaaa tgcatgccgt agtcaggata actccagtgg aagttacccc
15060taaaattcca agatctctaa agttacaacc tagagagaat ttagtgagtt caaatatata
15120tgttacatca aaattctttt acacgttttg taagatttct agttgcttta gctaagtaat
15180aagaatgttg tattcctttt tgatacaaat ctttttttat tgtgttaaac tatatataac
15240ataaaatatg ccatgttcgc catttttaag tgtataattc aaaggcatta attacattca
15300taatattgta caaccatcac cactatctat atccagaact tttccatcac cccaaagaga
15360aacttggtac ccattaaaca ataattcccc gtccactcct ttccccagtc cctggtaatc
15420tctaatgtat attgtgtctc tatgaattta cttattctag atatttcata tataagtaga
15480agtatgcatt tgtcttatgt atctgactta tttcatttaa cataatgttt tcaaggctca
15540tctgtgttgt atgtatcaga atgttattcc ttttcatggc tgaatactat tccattgact
15600gcatatacca catttgttta tccattcatc tgttgatgga cacttgggtt gtttccacat
15660ttttggctgc tgtgaataat gctacagtga acattggtgt acaagtatct gtttgagttc
15720ctcttttcag ctcctttggg atatacctag gaattatgtt taactttttg agaagctgag
15780aaatctttaa taaatgataa cacaaatact tatatttgcc aatgcaaata tgaatatttt
15840tggcttttaa gagattgatc attttgccac gtggttgtaa ttaaaaaaaa ttgtcccatg
15900ttgtttcagt attaatattg tagcctaaaa gagtgctaga ctgttttact ttttactcag
15960ttaattcttt ggatactggt agagtcagga aatgagatat tgaacttaaa gatctttgca
16020ggtggggtcc agtggctcac acctgtaatc ctagcacttt gggaagctga ggtgggagga
16080ttgcttgagg ccaagagttt gagaatagcc tgggcaacat agcaagaccc catctctaca
16140aaaaaattaa aaaaaaaatt aagccaggcg tggtagctca cgcctgttat cccaacactt
16200cgggaggctg agatgggtgg atcacttgag gtcaggagtt ggagaccagc ctggccaaca
16260tggtgaaacc ccatctctac taaaaatacc aaaattatcg gggcgtggtg ctaatcctgt
16320aatctcagct actcaggagg ctgaggcagg agaaccactt gaactgagga ggtggaagtt
16380gcagtgagcc tagatctcac cactgcactc cagcctgggt aacagagcga gactctattt
16440caaaaaaagt aaaaataaaa attagacaca tgtggtggca catgcctgta gtcctagcta
16500ctcaggaggc tgactgaagt gggaggatct cttgagccca ggagttccac actgcagtga
16560gctatgattg tgccactgca ctccagccta ggcaatatct caaaaaaaat ttttttaaat
16620agattattag gccagacgtg gtggctcatg ccagtaatcc cagcactttg gaaggccaag
16680gcaggcggat cacctgaggc caggagtttg agaccagcct ggccaacatg gtgaaacccc
16740atgtctacca aaaatacaaa aattagctgc aatgtctata atcccagcta cttgggagcc
16800tgaggcaagc gaatcgcttg aacccgggag gcagaggttg cagtgagtgg agactgcgcc
16860actgcactcc agcctgggcg atacagcgag attctgtctc aaagaaaaag gaatttgttt
16920tcctgtcttt atcgtagagg gaggaaaggg agaatggggt tggaatggtt attgagtgag
16980ccacattatg gtagatgtat cactgggcat agagaaaagg agcatttaaa acttttccgc
17040ctaacagatg tttcttcagg ctacactgca ctcattgtgc taactgtaat gtcaaatccc
17100agacctgtgc ctatagaaca tgaacatcct tcattggatt tgtttggtca ggcttacact
17160ttattaggaa gatcagatgt taaaataagg gtgttaaagt taagttcaga tatgaggata
17220attcattact attccttttt ctggcagcct aaagacataa gtgaagaaga cataaaaact
17280gtattttatt catggctaca gcagtctact accaccatgc ttcctttggt aatatcagag
17340gaagaattta ttaagctgga aactaaagat ggtgagtaca tttgttattt tgactttttt
17400ttctatttaa atagttgtac atttttaatt gttcttgcaa cctgtcatac ctgtgaacag
17460tatgtgaata gtgaaatata attatgataa ttaaacagta gtttttatgt attgaaaaat
17520atctttggcc gggtgcagtg gctcatgcct gtaatcccag cactttggga ggccgaggca
17580ggcggatcac ttgaggccag gagttcgaga gcagcctgcc aacatggcgc aaccctatct
17640atacaaaaaa atacaaaaat tagcctgaca tagtggtgta tgcctgtagt cccagctact
17700tgggaggctg aggcagaagg atcacttgag cccaggaggt ctgtgttcct gccactgcac
17760tccagcctgg gcagcagagt gagaccctgt tggggggaaa aaaaaaaaag tctttaactt
17820aaataaattt gacatttaaa atcttaaatt atttcatctc tgtttcagta ctaactctgc
17880atttattact ttctttttaa taggactgaa ggaattttct ctgagtatag ttcattcttg
17940ggaaaaagaa aaagataaaa atatttttct gttgagtccc aatttgctgc agaagactac
18000aatacaagta atagcatgtt attgaatatt taataaaata ctatttgtta catatgattg
18060ataataaagt atgaagttcc ttgtaacacc ttgcattgtg aagtgtatta aaaacctgct
18120aagagtaagg aataacttga tttaaaatat tttattctgt aatctcttta aattatctgt
18180acaaattatt gacttaacct aaatttaaaa atgaatgcct tagcacaatt aagttccaag
18240aatagagttg atcatgttaa ctggtaaatg gatcatgatt taaaattctt ctaggattga
18300aacaaatgaa aacgtagttt taagggtttg attttttaaa ttcctatttt tacatgcaat
18360tttactgcac aacccatctt attttgacag ttcttaaatt cgcaactctt cagaaatatt
18420atcagatcac ttttctttgc ttccataagt ttttttatta ttatattatt attttttttt
18480tttaaaagac ggtgtctcac tttgtcgccc aggctggagt gcagtggcat gatcatggct
18540cactgcagcc tcgacctccc aggctcaggt gattctccca cctcagcctc ccaagtagct
18600gggaccacag gcgaatgcca tgatgcctgg ctaatttttg tatgttttgt agagataggg
18660tttcaccatg ttgcccagaa ttgtcttgaa ctcctgggtt caagcagttg ttctgccttg
18720cccacccaaa gttgtgggat tacaagtgtg agccactgcg cccagctatt ctagaagtat
18780tttaagagtc atcttttttt tttttttgag atggagtctc actctgtcac ccaggctgga
18840gtgcagtggc acactctcgg ctcactgcaa cctccacctc ctgggttcaa gtgattctcc
18900tgcctcagct tccctagtag ctaggattac aggcgcatgc caccatgccc tgctattttt
18960tgtagtttta gtagagacga gatttcacca tgttggccag gctgctcttg aactcctgac
19020ctcaagtgat ctgccctcct cagcctccca aagtgctggg attctaagtg taaaccacca
19080cacccagcca agagtggtct ttttacaata ttattttttg attaggacat tcattcttgt
19140cataaaattg aagatactct agtcatttag aatttcattg ttttggaact agacattgtt
19200tctttatttt tgaaatgtta ttgaaggaat accatttgga gaagatacaa atgtaagaat
19260tgtgaaaagg ataattgtga cacaaatcaa aattatagat aaaaatatac ctgtaaaatg
19320tattaaggca ataacattct ttctgcttgt tgaccataaa tatttatatt ccctggatgg
19380gtacattgtt attgtcaagg gtgtttaaat aatgatcttg catgcataat ttattctctc
19440tggtataaca gaatcagcaa tttagttttc tgggacccga gaaaaacatg caaaagacat
19500actttgaaat gtaaaactga tttttccttg caactgtagg tccttctaga tcctatggta
19560aaagaagaaa acagtgagga aattgacttt attcttcctt ttttaaagct gagctctttg
19620gggtaagaag ttatggccaa actagcatgt tagacatgtt tttaacacta tatctggcag
19680agttttcaat gtaaatatta aagtagatgt taatgtcaat aagtgatctt aataatgcat
19740cagtagatat tttttcaagg attgtctcta tcttcacgcc tagcttataa tttgccttgt
19800cgtctttttt tttttctctt tatttttatg tttttatcca tccctggtgg taggggataa
19860ccttgtcttc ttcgataaca agaagtctga agcttattag aaattttact ttgagaattg
19920atcgatgaga agaaagcaac tagatatcac gtggatcata tatgcttgaa taaaacaata
19980attcttagaa caaataaata cattttaaaa gttaaagcca aaaacattag ttgaatgttt
20040aaaaatattt caaattaagt tattccttca ctgtcttgta ttactgtaat aatttggatt
20100atttgtgttt ttctcaactt ttaaaacaaa tatttaaaaa attcctcttt tgattaagta
20160gggctagata aaatataaaa aatatttttt aaactcctct taatttccat atttcttata
20220taatatgaga atctcttata aacactacct cttagaagtc tccacagaag ctttggtaga
20280tgtagtagta gggatttgat ttcttagaat ggtataatct gtaaatgttt tagtaaaagg
20340attaaacgat aaagtcaaaa tgtttatagc acagtgttta ttaatataaa ataaaatctc
20400tttttttttt tttgagatgg actctcactt tgtcactcag gctggagtgc agtgttgcaa
20460tctcagctca ttgcaacctc cgcctcctgg gttcaagcaa tccttccgca tcagcctcct
20520aagtagctgg gattacaagc atgcaccacc acacctgcct aattttttgt atttttagta
20580gagatggggt ttcaccatgt tggccaggct ggtctcaagt gatccgcctg cctcagcctc
20640ccaaagtgct gggattacag gcgtgaacca ctgtgcccag cataaagtaa aatctcttca
20700gactctcatg tgatcatgta aagtggcagg cagtcacagt caagaagtag tttaaagttc
20760atgtttgtaa aatataatct acagattgat actggatttc ataggtaatg tttaagagaa
20820aataagtttt tagttatcct cagtacttca aaagcaccca tttatgatta tgttgattac
20880taaactaaat catttggggg ctagaggtgt ttttttatgt gttaagattc cttaaggagt
20940tctattaggg caaaactttt agtaactgca tattttaaaa gtaataaaac taattttaaa
21000agcttggagg ctgggcgcgg tggctcacac ctgtaattcc agcactttgg gaggccaagg
21060cgggtggatc acttgaggtc aggagtttga gacgagcctg agcaacatgg tgaaaccttg
21120tctctactaa aaatacagaa attagccagg tgtggtggtg ggcacctgta atcccagcta
21180ctcgggaggc taaggcagga gaattgctcg aacttgggag gcagaggttg cagtgagccg
21240agatcatgcc actgcactcc agcctgggtg acagagcaag actccgtctc aaaaaaaaaa
21300aaaaaaaaaa gcttgaagtc agattcgaca ttaatcagta tactttctct caagtagggg
21360acaatttcta agattttagt cttttaaaat ttattaacta gtctgagcat ggtggcttgt
21420gtctataatc ccagcacttt gtggggccga ggcagatgga tcacttgagc ccaggagttg
21480gagactagcc tgggcaacat ggcaaaaccc cgtctctaca acaaatgcac acacaaaaaa
21540cccaatcagc tgggtgtggt gttacactcc tgaagtccca gctactcggg aggctgaggc
21600aggaggatca cctttgccag ggcgtttgag gctgcaggga gctgggttca caccactgcg
21660ctccagcctg gatgacacag caagcccctt tctcaaaaaa aaaaagataa aaaattaaat
21720taaattaatt aactacactg ggaaggcaaa attcagcatt tttttatagc taaattttat
21780cctgcttcag tcttttatca tgtaactatg tatatttttt acagaggagt gaattcctta
21840ggcgtatcct ccttggagca catcactcac agcctcctgg gacgcccttt gtctcggcag
21900ctgatgtctc ttgttgcagg acttaggaat ggagctcttt tactcacagg aggaaaggta
21960agtggttaag gtgtgttcat ttttctgtaa catttaataa cttttcattt atctttcttt
22020gggttttgac catctattat atagggtggg ttttgaccat ctattatata gggtttatac
22080gacatatgga aagcattcat ttattcacta atatttctgt gtgtctgctt ttaggtgttg
22140ggggagtgat gacgaataag actgatgttc tccatgccct ttttctgtgt cagttgatac
22200aattatatgg tttttctttt ttaggctatt aggtgttgat agggttgagt aacttacaaa
22260tgttgaacca gccttgcata cctgtgataa ataccacgta gttgtggtgt atcattcttt
22320ctacattgct gagttttatc tgctaatgtt ctgttgagct tttgtccatt taagtttgaa
22380agtgattagt ttgcagtttt ctgtttttgt gttgtctttg tctggttttg ctatccgtgt
22440aaatctggcc tcataaaatg agatgggaag tattctctcc tcttcttttg tttttttgga
22500agaggttgta taaaattgag gctgaatctt ggtggttgcc acaatgacag gaactatttc
22560tgtgactgaa tatattggga attcctataa agcaattatt ttctagggaa gtggaaaatc
22620aactttagcc aaagcaatct gtaaagaagc atttgacaaa ctggatgccc atgtggagag
22680agttgactgt aaagctttac gaggtatgag tatggtaaca ctctatataa atcccttttt
22740cattagaaag acaggaatgt tatacataat gctgtcaatc taataaatac acatatcatc
22800tagtctttaa cttttctgtt tatcatttag tcattaaaat ttctttggct ttctaatgtt
22860tttgataaaa tttctaaaac tctccatatt taatggaggc ctattttttt ttctagccag
22920aactttttgt agactacatt tctggaagtg ctcactgaca ccactctgaa aaattagtac
22980ttagaatata ctctaattgg tataaatgat ctctgaattg ctatggaaaa ctgggagaat
23040ggttgcttca ggggagagaa agtaggaggc tgtggacagc aatgaggaga attacagttc
23100accatataac acttttgtac ttttaaagtc cttaacattt acattattat ctattcaatt
23160aaaaaatatt gggaagattt tactttgaac agttaatttt tcccccatgg gtaccgctgt
23220catatagttc caactaatca tgaacttgtg tatttcctgt tctttgtaaa tttaaacttt
23280gtaactcacc aggaagtttg aagccaaatt tgtgtttcaa atatagcaac tccaggatct
23340ctaggcagat gcatttgcat ttgattttaa atgaatcttg atcccttact ctcacttatg
23400ttttcccaca tcctactttt tttattttgt tgtaagccat ctaaaattct caatgggatg
23460aaactgggta taaatgaata catgcataca ggaattatag tagcatattc cttttctttt
23520ttcttttttt ttttttttga gacagagtct tgctctgtag cccaggctgg agtgcagtgg
23580tgcgatctcg gctcactata gcctccacct cccaggttca agcaattctc gtgcctcaac
23640ctcccgagta attgggacta caggtgcatg ccaccacacc tggctaattt ttgtattttt
23700tagtagagat ggggtttcac catgttggcc aggctgatct caaactcctg acctcaaagt
23760gatctgcctg ccttggtttc ccaaagtgct gggattacta gcataagcca ctgcacctgg
23820cctccttttc tgagttttat aaaatttgat actttactgc acgctttgag actgtattaa
23880ttgaaccatg ttgatgaaca agtttttgtg atgggtatat taataaaata tagatcaaat
23940ttttatagtt aaatcaatat cgagcttttc tagtgctttc aaaaggacaa cctgaatttt
24000cccagcactg aaatgatact gaaaccattt catatcttct gtattaagga aaaaggcttg
24060aaaacataca aaaaacccta gaggtggctt tctcagaggc agtgtggatg cagccatctg
24120ttgtcctgct ggatgacctt gacctcattg ctggactgcc tgctgtcccg gaacatgagc
24180acagtcctga tgcggtgcag agccagcggc ttgctcatgg taaatgcatc caccactggc
24240ttaaggtctt gttcttttgt cagtcagcat ttttagtctt aacaataaat ctactctctt
24300cagagaataa tatatgtgtt atgttaagtg ttgtgtttga ggcccctgat ggcattctac
24360agttgtccta tagactgtaa tagcaaaatt ggtagagtaa aaacagtgtg aaaattctgc
24420aacttcatgg ttagtccttt agggtttttc attctccctt acttattgtt taatttacag
24480atttactctt ttgttcattt gacaaatatt tgtcaaatgc ttgtgcacag tctgtattct
24540caaattctag gagaaaaaga agggtgaaca gtattagcgc agaacgatac taataatgat
24600ggctactgtg tatgagtagc cagccctttc ttggctttct tggattgctt tgtattctac
24660atgaagatat tccctgggct ttacaggtca ataaatggaa attcagagag attaatttga
24720ccagggtgac caacaaggag atgacagcat acactatgcg agaagtatac acagagtagt
24780gtaggagcat ataacctaaa ctgggggtga ggtgggataa ggagttatca gggaaggctt
24840tttggaggag ttgacaactg agccgagttt tgatggaaga gtagaaatta gcatgaacca
24900atttcatgct aataaagaag caaaggaagc gtggtctaca ggcaaaagca cagaggtaca
24960ggaagtaatg atatgttggg gaataccctg ttgactggag cttagagtgc aaggagagga
25020gtgctaggga ggtgaggttg gagggtttgg cagcattgac ttgcttcaag gttcttaaga
25080gctgaaatag atataaaatg caactaagag tggcttggat tattattacc tagtgtgtta
25140atctcaaatt ttgaaatcta tagcatctat aggactggtg ttactaatct tacactcgat
25200ctgttactgt tcttatacta gatctattag tccagtgttt aagggagtgg tgcagatttc
25260taggtcagga caggactcag atgtacatta ttaatgccta tttcagttct gaccttctca
25320tatgaaacct tataagacct ggggtaggaa gagattgttc tggaagtcat aggaatatga
25380actgtatttt gtttaacaaa caatacagta tggaaattta tcacccttcc agaatattta
25440tttcagagac aaatttttat cattcgttca tttatttcat aagatccacg agtagggaac
25500ctcactagac attgctctga gtatatggtc tgagtttgca gtacctcttg tgtctccatt
25560agatttatta ggtcctcaat agataaatca gggaataact agatggattc attttttaaa
25620gacatgaaag agcgatacca tacatactgc accttaaagg tcaaccttag agtatcatta
25680tttttaatga atgtataatt tttaaatttc atgtttactt ttcctaagct tttgcactat
25740attgcttaat tccagctttg aatgatatga taaaagagtt tatctccatg ggaagtttgg
25800ttgcactgat tgccacaagt cagtctcagc aatctctaca tcctttactt gtttctgctc
25860aaggagttca catatttcag tgcgtccaac acattcagcc tcctaatcag gtaatacact
25920acttgtaagg attattgaat tatgtccctt ttatagaaat tatttttcaa ttttattagt
25980aattcgtggc tttaaattta tgcttctctt aatgatttta aggatatgta agtcaacatt
26040tggtgcatat tgtgctagag gcataaatta taatttatag ccacctgaaa tgttagtatg
26100cgctttccaa gaaaatgact tttttgaaaa tggtatttct ttgaatgaga aagaacagag
26160agaaatagat agatggcttt taaacacttc attaattaaa cttttttttt ccaccatcac
26220ataatggcac ttagtcccct ttgggaactc atgagggttt tagtggtagt gagctgaaag
26280aaatatgttc caggactggc aaacatattc taaattcttt aaaattttca cctagcatct
26340accctaaata ttcagaccct gtgctagtta actgctattg aagaacaaag gtattatatc
26400tattattaag gataatagaa tggtatttga gatattggtc attgaatatg aatatgtttt
26460gagaaataag ttttatagga accaaaaaaa aattcttaaa ggaaccatat attactaaaa
26520atgcttctta ttggagaaag aaatgacaat catttattaa tgtgattttt tcacaacttt
26580attaagatat aatttaagta caacaaactc acataaagtg tacaatttga tcagttttaa
26640catatgtaga tgccatgaaa ccatcaccac aattaaggaa acaaacattt tcatcactcc
26700agaagtctcc tagccctttt actacccatt cctcccctgc tccatcccca gacaactacc
26760aatttgcttt ctgtcactat agatttgtca acctgatttt ctccaaatat acattcaaaa
26820atatacagtt gaatacaatt ggaaattcga attttgtgtt tttttcttta ggaacaaaga
26880tgtgaaattc tgtgtaatgt aataaaaaat aaattggact gtgatataaa caagttcacc
26940gatcttgacc tgcagcatgt agctaaagaa actggcgggt ttgtggctag agattttaca
27000gtacttgtgg atcgagccat acattctcga ctctctcgtc agagtatatc caccagagaa
27060agtatgtttt actattaaaa cctgaacttg gaatcttctt tctattgtgg agaaatgtaa
27120ttgtagtaag acaagaatta aatatattcc attgtagtat ttgaataagc agttatttga
27180gtagaaaatt agtgtttcca gctaagatga tggcatattt tgaaaattca tatagtgaat
27240ataactagta aaagaagttt tgtttatttt taaacagaat tagttttaac aacattggac
27300ttccaaaagg ctctccgcgg atttcttcct gcgtctttgc gaagtgtcaa cctgcataaa
27360cctagagacc tgggttggga caagattggt gggttacatg aagttaggca gatactcatg
27420gatactatcc agttacctgc caaggtatgt ttaaaaaaag aaaaagtgaa tacttactcc
27480cagaagaacc actgtattat tggctttggc tttatgtgtc agcttgccca atctccgtgt
27540gagtcaacaa gtgtttactg agttaccaaa taaatgtctt aacactattt taggtacttt
27600aacaaatttt aattttatta attaattttt tattagaatt gagacctcac tctgtcatct
27660aggctggagt acactcacag ctcactgcaa cctcaaactc ctgggctcaa gcaatcctcc
27720tgcctcagcc tccccagtag ctagaactac aggcatgaac caccatgccc ggccaactct
27780ttaattttct tagagacgga gtcttgctat gttgcccagg cagacagatt ttaatgtgta
27840tgatgcagtc tttgatgata agaaacttat aatggaaagc tgaggtgata gttacagtaa
27900atacattttg atgtataatt ctgtttgctt taatcattca aattgtagta aagcaagatg
27960aactgtctgc tgggatttga gcagaaatgg ataggaataa actaggaggt agaagagtta
28020tcaaggttca caggactgat gggtgaagct agatttccag acccgggatg tcagtccttg
28080aaaagcagac ttggcaggca tagacgaggc agatagcagg ataaaggaga caaatgtaga
28140ttgttcttca gaagatcaga tggtagagtc taggaggtag tgtgttttaa tcagagatct
28200gagaggcaaa gatcattgca tgagatcagg gacccatgca aaggagtgag aaaaaaaact
28260gggttaagga gcctgctgca tggcaactcc tgggaacagt ggccactggg gcctgggaca
28320tgttgattgc agcccaggac tgttaaaacc agtgtgagag aacatgggta tggaagtact
28380agctagcagg atcatgaccc cgatgctggg atggggcatc aagcattagt acatggagat
28440tcagtacatc cagatgcagt acatggagac tatatgcgta actgctgact ttgggcttct
28500ttcagattgg agcagaggta gaggtgagtg ggaatattct caatagaggg aactaaatag
28560gcatacctaa taaaggagac caggatattg cagacagtag cctcatgttt ggctcacctg
28620ttcaaaaagt tctcttgttc ttgagcagtg gtgccttaaa aggtaacttg agaagcagtc
28680gattatttgt tcagcctgga gactcttggg atattttact atctttgatt gaatagattt
28740aaatgtacac agctctcata acttgcccca tgaagcatat ccatgaaagg cactatactt
28800gttaaaagat tggtttgtac tttttaaatg tagtactttt aataaaacag gaaaaataga
28860agttctgatg cagttatatg cattttatat agaatgtgtt cttaattgga aaaaatttgt
28920cgtagttcct ttgagttcat ttacagtttt tagtaggaat tgtattttct actgttgtac
28980ttgctgttac taaagaaaga tggtcgtgat taccatctga attttttttc tatacattga
29040tctttagctg ctacttagtc atttctgttt agacttgagc tctttttcat attttttttt
29100tttgtttctc agtatccaga attatttgca aacttgccca tacgacaaag aacaggaata
29160ctgttgtatg gtccgcctgg aacaggaaaa accttactag ctggggtaat tgcacgagag
29220agtagaatga attttataag tgtcaaggta tgttgtctac ttatcttctt tttttattta
29280ggtaaaatta acataaatgc agttagccat ttcaaagtgt aaattcactg gcatttagtg
29340cattcacaat gctatgcaac caccacctct ctctaatttc aaaacttttt cattccactc
29400ctcctcttgc ttatcccctg gcaaccattc atctgctttt tgtctctatg gatttgcctt
29460ttctgtatat ttcatataaa acaaatcatg caatatgtga ccttttttgt ctggcttctt
29520tcacttatgt aatgttttca tggttcatcc aggtagtagc atgtatcagt acttcattcc
29580tttgcatgac tgaataatgt taccatactt tgtttatcca cttatcagtg gtgaacattt
29640gaattgtttc taccttttga ctattatgaa taatgttgct gtaaatattc atgcacaaat
29700ttctccacgg atatgttttc atttctcttg ggtataaact gaggagtaga attcttgggt
29760cttagggtaa ttctctaact tttcaaagaa ccaccaaact gtctttcaca ccaactgcac
29820cattcccact agcagtgtgg ggggttcctg attctccaca tctttaccaa caccattatg
29880tttctcaatt gtgggctagt ctcacatttg gaaagctagt gggagcagcg atccatctat
29940taaaagttgt atgaaattga gtaatgagcc acctctctct tgtagggctt attatgttct
30000tgcttaaggc aatcttcatg cattgtgaac agaattatac ataaatgctc agataaaagg
30060gcaaaccatt cttaaaggga gtagacaact agaggcagga gaccatactg aggcaggaag
30120ctggggtttt tatggttctg ttacttttga ctatatctca ccattgcttt tgtcaaagtg
30180agactaggtc taagtttttt tcaggtataa ggtgagtgtg gtaattaagg ggcatgctag
30240cagatcattt tgggtaatgc ttcacagtcc accactggtg tgtcattgtg gtcgcagatc
30300cagtatctta gctgtgtaat ttcagacatc agcaatatta gtttaacaaa gggcaattag
30360attccaagac aaaggaatcg tgtattattc tagccttatt caaacttgat ttataaatca
30420gtttagtaat ttatttattt gtttctgtat ttatttttat ttctttgaga tggagtctca
30480ctctattggc caggctggag tgtagtgatg caatcttggc ttactgcaac ctctgcctcc
30540tgggttcaag ctattctcct gcctcagcct cccgagtagc tgggattaca ggctaatttt
30600tgtattttta gtagagatgg ggtttcacca tgttggccag gctggtcttg aactcctgac
30660ctcgagtgat ctgcccgcct tggcctccca aagttctggg attacagacg tgagctaccg
30720tgcccagctc agtttagtaa tgtataactg ggttttaccc agttgtaaat tactcttttg
30780tcgtgttttt ttgagaactg gcaatgacgg agaaactaaa agtgccaggc tgttgccttg
30840ttcctgttat tttgccttag tttttttttt tttttttttt ttctctgaga ctgagtcttg
30900ttgtgttacc aggctagagt ggagtggcat gatctcggct cactgcaacc tctgcctcct
30960gggttcaagt gattcctgcc tcagcctccc gagtagctgg gattacaggc gcctgccacc
31020gcacccggtg aatttttgta tttttagtag agacgggatt ttaccatgtt ggccaggctg
31080gcctcgacct cctgacctca tgatccacca gcttcggcct cccaaagtgc tgggattaca
31140ggcgagaacc accgtgcccg gtcttgcctt agttatttct tgttccctcc tctagtccta
31200tagttctctg actgtattga ggaaatgtaa ttaaatatta ttatgttaat agatatttat
31260gtggttgaat attagaaatt ccttattttg gtcacatatc ctgatcagta gttggtcttc
31320tggagatagt gatttttcac tagagatgac tttaggacct attcaggttt tttttaagat
31380cccaatttaa ggaaagacta ttctcattat tgattttgct atatgcaggg aaatttattt
31440cgaaaggttt ttcagttggc ttttagggaa gattatatat tctctttttt tttttttggc
31500cttttcccac atgttctaaa aatgatatat tctttaactc ctatgaaaat acattgtttc
31560agtaattgaa gatgctgatt aaagtcatat ctctacacat tttttaaaat ttgagataga
31620tgggactttg tcccttctta caccattcac ttattcactt ggaaaaacta ttatccaata
31680cttatgtggc agacactgtt tctggcacaa gggattcagc agtgaacaaa actgcctttt
31740tggagtttac attctactag tggaaagcga caacaagcag atagacacat tcagtatata
31800attcactgtc agatggtggt ggtaagtcct atgtaggaag aaaagcaggg taaggaggct
31860tggagtaact ggagtgagtc atagatggac ttgtcaggaa agggtttctg aagaggtggt
31920atttgggcag agatctaaat aaaatgaagc aacaagccat gagaatatcc gggggaaaat
31980gttctgggca gaagcatcaa gcatagaact tgtggtatga tatttattct agcacacatt
32040aattttaaaa atgtataaaa gacatccatt taatcatatt aaagatttcc atgattcatt
32100tagacttagt cagaaaccaa atttatattt tctttttaaa taattttatc tcaactctta
32160ttttacccaa taggggccag agttactcag caaatacatt ggagcaagtg aacaagctgt
32220tcgggatatt tttattaggt tggtagccta tgaatgtttt taaagtaact gactctgtta
32280ttatttatca atcagtgctt tttttggtct tgttttttga agaactgata tttgaaacct
32340gtggtttatg tgaattatta ataagctaga ggacgtggat tctctatttc atcaaataat
32400acaaaacatt ttagatatta aattttggaa attatttggt tttgttttac aatagaaata
32460ctcctcaaag tggaatcgaa gtggttattc aaagaaatct cagagtagat tcttatatga
32520agcaaataat tgcccctaat ttatctctaa attttgtaag ttctaaattc ttttttcccc
32580cagtttctaa tttatctctt ataagtcaag agtccatctg gccaatttaa tttcagtgag
32640tgtaactatt ttgcatatat taaaaaactg tatatgaata cagaagatgg tatttaagga
32700tgaaaataat tattcaaatg tgatagcatt atggggagtt ttaaaataaa agttactgtt
32760ttattcttcc aaaaatttta ttataaagta tacagttaag agaatataca taaaatacat
32820atgcagctta aggaagaata ataaaatgaa tacttcatgt attcaccacc gagtttacca
32880ggaaaaagca taaacaaaat aaacctcttc cacgtaattc ctgggttaaa gagaagttat
32940agtggaaaat atttgggagc aaacgataat gaaaatacta tccattaaaa ttgttagatg
33000ttgcaaaact gatttcaagg aaaatttata gtgttaaatg tttagaaaag aaaaaaggtt
33060agaagttaac cacttatgta tctatctcat gaaattagga aaattataga tataaactaa
33120aaaatatgtt aaaagggaaa taataaagat aagaatgaag tttaatgaaa cacaaaacag
33180agaagctcac aaagccaaga tttatttttt gaacaccgag tacaattgac aaatctctaa
33240caagtttgat taagaaaaaa gaaagcatga ataaacaatt ttagggataa aaagggaaac
33300atcgctaaag atatcccaga aatgtaaaag ataataaggg aatattatga aaatattcat
33360gccaatacat ttgaaaactt aggtgacata gacaaaaaca aaattgacca aaattgagca
33420aaaaagaaac aaaatctgag tagtcctgta acttagtaaa aattgagtta gaaaagttaa
33480agaagtcttt acacaaatca aacatcagac tcagttttct aggagagttt tgccaaacat
33540tcaagtagca gataattctg gtctattttt ggccccagaa gatatatttt acttgccatg
33600catttaatga gatagctgtt gatttttttc aatcaccgtg acaggtgttt tatattaggt
33660gttattcgcc agacatctag tccacctgtt gccagatatg gaattaatat tcacttattt
33720tgaattaaaa tttgttaata aattaataaa acaaagtcaa agttcaaatt attaaaaaag
33780taaaagaaat aaaatatatt ttatagagag cccttacaaa acagtaccaa cataatgagc
33840tttccaaatt ttgaatgggc aaaataaatg aataggcatt tcacaaaaga aggaagggtg
33900gccaataagt atatattaat ataaaaatgg ttacttgtaa taggaatcaa aagtgtttga
33960cttattgact aagagtcagt ttttgttttg atccctgtta gtctatccag aaggcatggg
34020tcttaataaa caccttgacc tcaacagttt actgaataca agggtaattt catatgcctt
34080gccttcttta agggtttgtt gtaaagatta aaataaatac ataaatatat ataaatacat
34140ttatatgtat ttatatgtaa ttacatacaa cttgccttct ttaagggttt gttgtaaaaa
34200ttaaaagaag tatataaata tatataaata cataaaataa atacattcat atatgtatat
34260gaaatcactt tgccaactat gaagcctgat tcaaatatga aatgttgttt gtttttccca
34320gagcacaggc tgcaaagccc tgcattcttt tctttgatga atttgaatcc attgctcctc
34380ggcggggtca tgataataca ggagttacag accgagtagt taaccagttg ctgactcagt
34440tggatggagt agaaggctta cagggtaata attataaata cagaaataga atgttataac
34500aaaatgtcat catgtcatca gattttggta aaaaaatgtt cttttttcct ctaggtgttt
34560atgtattggc tgctactagt cgccctgact tgattgaccc tgccctgctt aggcctggtc
34620gactagataa atgtgtatac tgtcctcctc ctgatcaggt gacaatttca tatttagagt
34680ccaaaaccca acaaatgcta cactctttcc ttgtgagctt tacttctgcc aggtaatggc
34740aattgtcctt agaagaccag ctttcttagg gaaaagcttt agccactgtt tgctcaaagc
34800ataaaaagat tctgaattag atgcaaagcc tttttttggc ccagtgcaag tctgaaaact
34860ttgtaatcct tctgtgttgg ctgattgggg aaaaaaaaat gcaagaaacc taatgtatta
34920tattttcaca ttatcttctg ttcaaagatt acatacttcc attatcctgt caaaaaaaaa
34980actctgatac agaatcaagc atgtgaatcg taagcatgta agcaggtttc atagagataa
35040tttttcaact cttccttgtc ctgtgttgtt ccaactctta ttctccaatt tagaagcaaa
35100caaataaatg aatgaaagaa cagatagaca aatgaatagt caaaggtata aagtatctgt
35160atatatgtta catgtagcta ttatttaaat tatttagatt ttccttttga aataccttct
35220tggcacactt gcctaaatct agaaaataag cactgtgtga ataagaaatt atttacactg
35280aatattttgt aggtttttgg gtttttgttt ttcagacaag gtctcacttt gtcacccagg
35340ctggagtaca ctggtacgat cacaactcac tgcagcctct atggcccagg ctcaagcaat
35400ctccccacct cagcctcccg agtagctggg accacaggca cacgctacca tgcccagata
35460attttattat taatttttgt atagagatgg ggtctccctg tgttgcccag gctttcttga
35520actccagggc tcaagtgatc ctcccacctc aacctcccaa agtgttggga ttacaggcgt
35580gagccaccat gcccagcctt aagagtgttt gattttcatt cattttccta tatatattat
35640ttctgttggg gaaaaaattc caaggaagat aaatagtagg ctgttggtac atttctcaac
35700ttacttataa agctttttag atatataagg ttaatttatg aagaaaatca taagatacac
35760aatttaagat aatattttta attttatttt ttatttgtta aataaatttt tctcctttca
35820ggtgtcacgt cttgaaattt taaatgtcct cagtgactct ctacctctgg cagatgatgt
35880tgaccttcag catgtagcat cagtaactga ctcctttact ggagctgatc tgaaagcttt
35940actttacaat gcccaattgg aggccttaca tggaatgctg ctctcgagtg gactccaggc
36000aagttatatg aggaagttgt tatgacattt tatgagtgat aaaagaagta caatgtcaaa
36060atttccacct taaaaaatgc tattttttaa acaactttgg taaaactgta tagaaacata
36120aatttacctt tagttgaatg ttccatagtt ggaatatggg ttttgcagag aatttataat
36180tatgaagttt gatgtctgtt tctttaacat taccttaata ttggcaaaaa catgttggtg
36240tttgcaagga tattatttaa attgggatac catgaattaa atactacaaa caaaaataat
36300tagagttttt tgtttgtttg tactttaact tttaaaaaat aatcagttaa agttgttgtt
36360ttgaagctca cattgttcca atctggccaa taggagcccc ttttgtatgg ctcctgtatc
36420tttatgacat gtcctcatca ttcttgaatc acttcctcac ttccagatac agtaagttat
36480tcttggccag gtgcagtggt tcacgcctgt aatcccagca ctttggcagg ccaaggcagg
36540aggatcattt gggcctagtt tgagaccaaa tcatggttgc acaaactgta cccactatgg
36600acaacagagt gggatcttgt ctctgtgaaa aatttaaaaa ttagctgggc atggtggcac
36660atacctgtag tcctagcttc ttgggagagg ctgtggcagg aggatcgctt gagtaaatcc
36720aggatgcagt gagccatgct tgtgccactg cactccagca tggatgacag aatgagaccc
36780tgcccccaaa aaagaaaaat attcttggtt tatcttgtac tttctgtatc ccagccctag
36840catcagcctt ttctctaaag acagtattat gattttaata tttacagtag atatttgaac
36900tgttacatta tagactttac catatatttt ctaggaagga ttattctatt actcttcttt
36960accacatttg tttggaatgt ctacagaacc tacagtttct aaatcagaaa ctccctaggt
37020ttttgctatt ttggcaagcc attgaagttc ttccctctcc ctttactacc agaaaggtgt
37080gtatttgtag agctctctat aatgagaaag cactctataa catggttgat tcatcatttt
37140ggagtagaaa agtatgaatg gaaagtcaga gacataaaaa taaagcccag aggtctgagt
37200cttagcttca ttacagactt tcttggggga tggttggtaa attatctaca cattctatct
37260tgtctttata attttaatag ttaaattttt accatgtgcc tcaaaaccgt tagagaatta
37320atgagctctt tgaaaaatgc ttctaagttt cttgtattgc tctaatagaa tgctatctat
37380gttattattt atttctgaga ctaaaattgt ttacatcttt aaactggttg tccttttgtg
37440tattttagga tggaagttcc agctctgata gtgacctaag tctgtcttca atggtctttc
37500ttaaccatag cagtggctct gacgattcag ctggagatgg agaatgtggc ttagatcagt
37560cccttgtttc tttagagatg tccgagatcc ttccagatga atcaaaattc aatatgtacc
37620ggctctactt tggaagctct tatgaatcag aacttggaaa tggaacctct tctgatttgg
37680tatcttgtgc agtcatcatt atacagttct gaaatataaa gctatatgtt ggtgtaaagt
37740tgcagtgatt tctctcctaa ccagccccac atattcttcc tggttggttg gttcttcagt
37800aaaatagtct tgtttcttgc ttacactaat tggtaatttg cattccttgt taagattttc
37860aagacagggc tgggagcaag gaaccaaagt agcgcgtggt tgtgattacc tttggtttct
37920ttgaggtttc tcttacctag tggctttaaa acatctttag gagcagttcc attttatagt
37980aaacttaaat tctgttatca tgaacagttg aggataatga ataatttgat acaataatgt
38040aagaaattcc tgaaaacaaa gtgttatctg tgatactttt gctgcatagt aagcacaatg
38100aagtgtactg ataatgtttc aacaggaaag tgttttgatt aaatgtgggc agtatcactg
38160ttctactagc attcaacatc tcttctaaaa attaatagtg gttcactgta attttattgg
38220tacatgtaac atctgtacat gtgtttggtt atctatatgt ttcctggttt tttgtacatt
38280tgctttatta atttaggctt tttttttttt ttttttttga gacagtctca ctctatcatc
38340cagactagag tgcagtggca caattatggc tcactgcagc cttgacctcc tgggcttagg
38400tgattcttcc acctcagcct cctgagtagc tgggactaca ggcacatgcc accatgccca
38460gctaattttt gtatgttttg tagagacgag gtttcaccat attgcccagg ctggtctcaa
38520actcctgggc tcaagctatc tgcgtgcctt gacctcccaa agtgctagga ttacaggtgt
38580gagccactat gcctagccta actcagactt taaaaatata aaagcaattc atttttattc
38640ccaagaacag taaggtggtg gtttaatttt agtctttaat tctgttttta atttattcta
38700tttagaaatg tcccagaaac ttagtataac tttactttct gaaaatgaag aaacctgtcc
38760ttgggcatta gtgtgttgga tttaagcaac aaagttaaaa aaacctaccc tgtgttatgg
38820caattttcac ttgatggtgg ttctataaca caggtatcag tgaaccttta taaaagatga
38880acaacttttc agcttgctta atttcagtta attaacatgt atacttatct atgttaatgt
38940tttattgctt aaaatgttta atttttatat ttggtaaaca gatagttttt tctctccccc
39000tcttccttcc atctttcatt actacaattt accatgcaga gctcacaatg tctctctgca
39060ccaagctcca tgactcagga tttgcctgga gttcctggga aagaccagtt gttttcacag
39120cctccagtgt taaggacagc ttcacaagag ggttgccaag aacttacaca agaacaaaga
39180gatcaactga gggcagatat cagtattatc aaaggcagat accggagcca aagtggagta
39240tggctttttc cccctcatta taattgttaa aacttcttaa aaattgtttc acccttttga
39300tatatatttc tttgacttat aaacgagcta tatttataaa caagggacca gaacacatta
39360actcagtcat ggttatgtgc ttccttgctt tcaatgtttc attatcttat aaggaagaga
39420acgtatggtc tcttgaaaaa actgacaata agaagtaaca actggactac cacatttttt
39480tttacatcct taatttaact cttcgtcaat ttcttttttt acttaaggag gacgaatcca
39540tgaaccaacc aggaccaatc aaaaccagac tggctattag tcagtcacat ttaatgactg
39600cacttggtca cacaagacca tccattagtg aagatgactg gaagaatttt gctgagctgt
39660aagtaacaga ttctgttttg gaagtacagc tactattaca agtgacatag tattacactt
39720aaacctttaa agttcgtgtt taaaataaaa atattttgaa tatttaaaag ctaattcaaa
39780aaatatgtgt cgtagctatg cattaaaaaa ccccaaaatg tcagaagtac agaagtcaaa
39840attgagtttt cattaaccag ttcatttgat tatatttgaa ttattcataa tggactcatt
39900taattttagt aactttgggc tgggtgctgt ggctcatgcc tgtaatccca gctctttggg
39960aggccaaggc aggtggatca cctgaggtca ggagttcgag gcaagcctaa ccaacacggg
40020gaaaccccat ctctactaaa aatacaaaaa ttagccaggt gtggtggcat gtgcctgtag
40080tcccagctac ttgggaggct gagacaggag aattgcttga acccaggagg tggaggttgc
40140agtgagccga gattgcacca ctgcactcca tccagcctgg gccacagagc gagactgtgt
40200ctcaaaaaaa aaaaaaaaaa atttagtaac ttcgaagaaa taagaaggaa aattaaaagt
40260tgaaagtgat tctaatgtat agtttataaa attttgttat aaaaatacct gttttgcctt
40320caaaataatt tatattaata ttttattgac ctcaagaaca tttaaataca ttcagattta
40380ttcatttgtg gaccacattt gttatacatt ggatttaaag gatccttgca attgagttta
40440tggccaccta tgcatctgag acccatggac tgggaaccat tctaggtcaa tgattcagtg
40500tgattcaatt taagagatgt ttattcctgg tctttagaag ctgctacctt ttgttatcta
40560attttgcagt actttgaagt atgtatgtat gtgtacatac gttagtgcta tgtatttatt
40620aaagaagaat cagaaaacag aggtaaggaa aaataaggaa acaaatttct gttaagccca
40680ccacctccca aagcatattt gtttatatgc ttatatatgt tttcctatta tggtaagaac
40740agtctgtaca tattgctata tagcagtccc cctttatcca catacatcct gaaaattgtt
40800ttacatttta aatgttaact actttattgt ttttaaatgt cattttatag tgtagctatg
40860ccacaatatc caatttttag acatttaaat tgctcccagg caatgtggta atgaacattc
40920ttgcagctga atatatgcac atatctaatt gtttcactag gatagaggtg gaattgtata
40980acagggagct cacatttttt aaggcttttg aaatgtattg ccaaattgcc tgccagatat
41040actgcaccat cactaacatt gtgtgttgca gtatttttct aaacttggcc cttttgattt
41100tagaaaaatg atatcaataa tttacatttc tttgattaaa gtgtagaagt tataattttt
41160catattattc attgtcattt gtattttatc ttttctaact tgtctcttca tcccctttgc
41220tccgttttct attggagtgc aactttattt gtaagaattc tttttaattt ctgtgactgg
41280aatttttttt tctagtttgt tatttcccgt tcatttctta aaatataatt gtgtttgcca
41340acaatccatt atcttttgtt ttgtaatggt agtatttata catattaaat tatctctttc
41400ttttttcaga tatgaaagct ttcaaaatcc aaagaggaga aaaaatcaaa gtggaacaat
41460gtttcgacct ggacagaaag taactttagc ataaaatata cttctttttg atttggttct
41520gttaagtttt ttgatggctt ttccatatgt tgtaacagga aaaaaatggt gtctatgaat
41580ttcttcttaa tttaacaaat ttggttaatt tataaaatca cagattggta aatgctataa
41640ttatgtaatg atcaggattg agattaatac tgtagtataa attgggacat tataacagat
41700tccatatttt atttcctaaa atctaaattc agtctttaat gaaataatat tagccaaatg
41760gtggaactaa tttatttctt ttgaggaaaa gataataaag aatgtaatta aatttaaatt
41820tcttggaatt cccagttgta tattcatcac ctttgtagca tttgacaaat tttatgctta
41880gcagcttctt cactgttttg aaataaaata tcctattacc tactgataca attatctgtt
41940ctttgtatat caaaaaatgt gaaatttaca cataattcaa atacatttaa ttatccgctc
42000aaccagaaat gaaatcacat ccctctacta tactacatcc agctccaagc ccaagatatt
42060taaatgacat ccattcctct cctagttcca gttatgattt tatcttgata ttctctcata
42120tatgaactaa attataaagt tagccaccat caatacaatc tgcgtatcta atatcttaac
42180tatatagtaa tggggtaagg gaacagcaaa aaggagaaca ttaattaaaa tatacaagta
42240agcctgggca acatagtgag accccatctc ttaaaaaaaa aattagccat gcatgatggt
42300atgcctctag tcccagctac ttgggaggct gaggtaggag gatcacttgc tcccaggagg
42360ttcaaggttc taaaccagca aagctcagaa tcccagggga tagaaacaaa gacttagtgg
42420atcactagta ttaaactgag acacgtcacc ctgcattgca ctttgtttct cagttctttg
42480atgaaatcac tgagctgaca tacctgccct cttttcacca taaagtgagt ttcatgatca
42540gaagcaatgt ctatgggata gcctaacaaa caatgtaaaa accatttagt aagttcatga
42600agggtggtgg tggtaaaaat ttggagaaca tacaaaacaa atacaattcc aaggtgtgtc
42660ccctccagga aggacaaatt gctgcctgct ctgtgataga agaggatcag atgtaatcaa
42720cctgccgtca gacttgggct gttctctcct gggtgtggac ttgcctggtt ggtcactgct
42780gctgacaagt aggctgtcaa tatagctggg ttgtcatgtc agctgtggtg agggggaagt
42840ccacattgtg gaggccacat ccctgcactc ttggccaatt tgaccatgaa tcttaagcac
42900tggggtggct ggaaaagaca gccgattgac atccatacag aggtcatctt gaccacttga
42960ttagtataag cactgaaggc ttttaactga gcattcacat aggacacaaa tattctgatt
43020ctttgggccc attccaagaa ctctgggcat acttttcctc cagacctcat acccagttgt
43080gttctttcca aatttctggt catctggtta tgttattagc cactatctgt gaatcagcat
43140agatttttat atcagacatc tctacctcct gacagaatgg aggagatatg ttacttaaca
43200attctgttcc cttggaagat ttcctgtctc cactgtttgt aagggctact ccctcaatgt
43260agcagtaatg ctttcactct gatgggaagt cacagtggaa ttctgggtct ccaagaatta
43320gtgttagtgc atacacagtg tctgataatc cccagagtgt ctggtgccct tggatcctgt
43380gaagaaggct tggagaaaag aagattcatg gcaagaactt gtgatgtgat gacagggcct
43440tttctctggc tcttcattct tagtctgacc taggtgtgag aattaggtca ggggccatga
43500ctatattgtg gtgactcaaa ccaggccttt gtttactaac tgggagattt ttacattgta
43560agaatcaagt aggatctttg cccatgtatt ttggtcttaa gaacacaaat gatatggctc
43620caatgactgg aggaacacca gggtccttgg tctcacgctg atttagataa aacgactgtc
43680aggcctctga gcccaagcta agccatcctc ccctgtgacc tgcacgtata catccagatg
43740gcctgaagta accaaagaat cacaaaagca gtgaaaatgg cctgttcctg ccttaactga
43800tgacattcca ccattgtgat ttgttcctgc cccatcttaa ctgagcgatt aaccttgtga
43860aattccttct cctggctcaa aacctccccc actgagcacc ttgtgacccc cgcccctgcc
43920cctaagagaa aacccccttt gattataatt ttccactacc cacccaaatc ctataaaatg
43980gccccacccc tatctccctt cgctgactcc tttttcggac tcagcccgcc tgcacccagg
44040tgaaataaac agccttgttg ctcacacaaa gcctgtttgg tggactctct tcacacggac
44100
SEQ ID NO: 64; length: 16869; type: DNA; organism: Homo sapiens
Sequence 64:
aagctttagt agagatctca aaaatggttg gatggtagca
aattactaag aactctcaaa 60gtttctaaag ccttagtttc agcttgctag aaaacctatg
ttgagtatta tggctagttc 120catagttgag ttgggaaatg tctttgagga gacacttttt
cactttgtat tcatctgtac 180attttctgtt acttgcattc tgtcatgctc aggctattag
agcaggtaca tttttataac 240tggaatgttt atgtgtagtg aagctctgag aggactttgc
attagatctc agcagcataa 300tcagaaggtt gtcctttgtc tcagcaattt ttaagctaat
agtagcagaa attgcagtgg 360aaatagactg ctttgccaca acattcagaa aatcatttat
ctttttattg cagttcttgt 420caccaaacaa tacattttag tacttctcaa attgcagaac
tctcataggg ctgggaaaat 480gcctgtagac acatacatac tatgaatgtg ctaatgtttt
ttgtattttc atagcccatc 540aaagctcctg agtcagtttc cactataatc actgcagaat
caatcttcta caaggtaagc 600ttttgtagag ttactgaagg aagagttggg cctagtgggt
aatgtgccac taaaatgttg 660gattagtcta aaggtctctg ctactcttta tttgtataag
gtgtgattat actttttgtt 720cccttcttag ctgttttccc ccataagtgg ctgttattaa
aacatctcat ctagagctga 780agtgggagga gaaagtgcct actgacacat gatgtgagga
tcttaagtat ttttttttag 840tgtagattgt aggaattatt cttaaaatgc tgattgtata
gtgtggagcc atggaagact 900gagccgttag tgcgatggca ttgaagaatg agaaggacag
agacaggatt tggactagta 960gaggttgtcg actgtggtgt caaatgggta gagtaggccc
agagattcta aaatgccttt 1020aagtggagtt gagctgagta agggcagtag tgaggattaa
cacctactag aaattcatag 1080tgagaggaat tccaagatgt tttgataaaa gaatgaggag
gtcaggtttc ccagggccaa 1140agtccatgaa catctgatac ctcagtgaga gaagtgacag
attgttgtgt ttaaaccaga 1200agtcttagga aaggaattag aacatagacc cccaaggctc
ggcaggcctg gcacggcaca 1260ggcagcaacc attgaaggct atttggtgtt tcgggatctg
aactgtcatt taggggacag 1320tggtgtgagt tagtacttta tacttgaccc aggtggactg
agaaactcaa gtgatgatgc 1380ccttaagtat actttttttt aagcccacaa tctatatagt
cgaagtctgt tcctcccaac 1440aggggtacac tggcattcct cagcagggct gggaaaaacc
aacaacaaaa aaagtctgta 1500cacaggcaaa catctctctt atttttccaa catttaatac
attgttaata aaatatctaa 1560agtttagcaa acagttgctg tgtatcagtg gctgagcatt
ttgcatgctt tatttcattc 1620agttcactct atgaggtgga tactactatc cccattttct
agatgagaac attgaggcac 1680agcgaggtta attaacttgt ccaagatcac atagccaaca
agtcatggag tgaggcagtc 1740tcatgccaga gcttaagcct agagcatagt tcctggctct
acagctttag caagtgactg 1800gctatgtgac gaggaccaac ctctctaatg tctcatctgt
aaaataggaa ttgtaaatag 1860ttactacctc agtgggtcaa atgaaatcat atgtgttaag
cacttagcag agtaagcact 1920caatgaatag taggagttat cacatcttcg tatttgtgca
ttaccttcac agtttacaga 1980ttaaggccag aagcaacttg ttgagctacg ggtttagtgt
actaacagtt tccatgtgtg 2040tctccatgga agggtgtgtg ggacctgtta ttgtgactgt
ctgtactttc gtattgttgt 2100ctgccaccca tgtttattaa atgataagga caataatgca
acaaagtagt caagtaatgt 2160tgcaaatgcc cagtattgta gtggctatca cagcagtgcc
actggcaggc agcaccatgg 2220tggcaagttc aagaggtcac tgccagccac tgagctagag
cccagatcag gcatgcaaga 2280ggagcctgag tgggagccac tggggatcac ggccaagagt
gtgaccaccc aagacccaga 2340atggctgagt ggcctccctg gagcatggca gtggcagaac
aactccatga actcagatct 2400ggtgatgcct aaactagtgc tgttctcgtg tggacccctt
ttctctacca gaaaccttga 2460atcctctcag caaatgagga gactactcag atcagtgact
tagtcctgtt tggtgttata 2520tatgtgtaca caacacagca catattaata aatacctact
atgtgccagg cactgcctac 2580cactggaatc tttcactaag acattgtttt tactttgcat
ttctgccttt acactatgaa 2640agtagatgtt ttggattcat attcattcag catacatttg
aatatgctgt gttatgcata 2700gtaagcctat gataagcaag tattctcatt tagaatttgg
gaatattgat tatacatgtg 2760gacaaacaaa ccataaatgc aaactattta tatgataaat
aactttggac tgatggctgg 2820gaggaaggac cagctattga tgggtaggaa ctagcaagta
gcggactgtg gcctgcatag 2880accagaccca tccgtagtga tccagatgaa acagccaccc
tcagacactt ggataaaggg 2940tccaccagga aaaaactcct ggcctatcag gtgctatgtt
acagttcagt tactggaagt 3000atttcctcaa aagtgttttt atggttgagg tacacattcc
tacagcttta cctgctgcca 3060agtccctgtt tcaagggaag cagcaatgaa ttacactgtt
cccgtagtca aggacagtat 3120atcttaccaa gaactatacc cacttaagga ggtgctggat
gtcataaaga tttggatcaa 3180ccattatggg tgttcagagg agagattatt tccagctcaa
gacccaggga agaggacata 3240ggatggatac cagagtcata gggaggattt aacacaggac
atgtacacat tagttagttg 3300ggtataaagt ggaacagaaa tgaatgagac acaaagcctt
gaatgccaga aatactagta 3360gtcctgttgt ggaaggatat aaaactcaac tgggagtgga
agagaaaggc agcagtgagt 3420ctaggagatg tacagtaggt tgaggtaaac atatcctgaa
gactataatc caaagattat 3480ttttggtttg aatttgtttt ggtttgaatt catggtatct
attttctttg agtggatggt 3540tggggagggt ggcatgtaga atgcattctt accaaatcag
catgattttc aagacagtac 3600agagaaaaga ctgctgagct gatgtaggag ctttggctgc
agtctctatg gctttcagca 3660agccgtttaa ccttactact gcttcatgac tgtggctaac
aaagtaggga tagtacggag 3720cacagaggat ttttagggcg gtgaaactat taatactctc
tttgtatgat actataatgg 3780tgggtacatg tcattataca tttgcccaac cccacagaat
acacagcacc aagagtgaac 3840cctaatgtga actctggtct ttgatgatgc tatgtcagtg
tacgttcatc cgtgtaacaa 3900gtgtaccact ctagtggtgg gaggggttat tgataatagg
ggaggatgtg catgtgtggg 3960ggcaggaagt atatgggaaa tctctctact tctgctcaat
tttgctgtaa acctaaaacc 4020tctgtaaaaa ataaagtcta ttttttaaaa agtggggatg
gtattacggc aatataaaat 4080caaaatactt tatgaacaaa tcttttctcc agatgtaaac
tgtcatatat gcaccctcgt 4140atgtgtatgt ataattttca ttcaaacgtg aaacaacttt
agaattggca ccaaacatat 4200aaacactgat acattagact atctcgaaca ccttttactg
accactttga aaacttgctt 4260acctattaag gttcattcat agctgtgatg ttctattttt
attttcaatg tgggattatc 4320ttctgtttcc cccagggagt atattaccaa attggtgatg
ttgtttctgt gattgatgaa 4380caagatggaa agccctacta tgctcaaatc agaggtttta
tccaggacca gtattgcgag 4440aagagtgcag cactgacgtg gctcattcct accctctcta
gccccagaga ccaatttgat 4500cccgcctcct atatcatagg taagtttgac aaatggcaca
ggtttttttt taacttagtt 4560aactctccaa tattatgtaa aagagtgtgt tagtcagctt
gggctgtcag gacaaaatat 4620cacagactga gtggcttaaa caacagaaag tcactttctc
acagttgtgg aggctgaagt 4680ccaacatcaa ggtgctggca acacggattt ctggggaggc
ttttcttcct ggcatataga 4740tggtcacctt cttgctgtgt cctcacatgg cctttcatgg
agtgagagct ctttggtgta 4800tcttcttata aggacaccat ttctgtcaga tgagggcccc
acccttatgg tttcatttaa 4860ccttaattgc ctccctaaag gtctcatctc caagtaccat
cacattgggg attagggctt 4920caacatataa atttggaggg tggcgggggg ggatgcaatt
cagtccataa caaaaaaagc 4980atgagtatta ttaagtacaa aaaaattaga gagctttata
gaaaatatga ggcattttat 5040gtagctggag tgtgagtgct atcagttatt ttgagttaga
gcaatgtgca tctactaaga 5100agtggtatgg ataagatttt tttggagtga cccagggtta
aactgtacta caagaatgta 5160ttgctcagga actaggttat ttaggttact tatttataca
aacctattca aaaataattt 5220aggaaagaac tatcccagtt atcccatact tgcaaattct
caatatgtgt gcctctgcat 5280gctacacatg tcatcttagg cctttatagt ataaaggctg
atagttgaaa tggcagctgc 5340tgtgcttttg ttaatttcaa agctgccaaa acagttgtga
gatagactca caagaattta 5400ctgattaata caatttttaa agttttcaga tttttacagt
tacttcagac tttttatctt 5460tctgcagtga gcatgcatca ttacttttgc atcctgagaa
caagcataag tgtgtttttg 5520gagagaactc cagggacaaa taatatacca ctgttattct
cacctatatg tcaagtttga 5580tacattacca aacaattcta gccttctgct tataagtata
tagaattttt atttacctta 5640tctatggatc aggatctcag cagaggcagt gatgtatcag
aatcaccttc gggattcctc 5700tactgcctcc tctttctaat ccccagattc tgatatgcat
ccttgtccta cagcgaggca 5760gcatggcatg aggtcagaac accagttctg gagccagact
gtctaggttc acagcctgcc 5820atttaccggc catgtgactt tggcaagttt cttagtctct
cttgcctcac tttcctcata 5880tgtaaaatgg gaataataat agtgcctacc tcagaaggtt
gatgtgagga atgaaggtat 5940tgatacatgt aaacttagag cagtgtgggt acaaaataaa
catgatgcaa gtgttcaatc 6000actgtttttg ggagaatgcc atattcttta agccgttaaa
gaagaaaaaa tgattaagaa 6060taatttcaaa gtaatgcatg tttcaagggc taatgccagg
ttgctcccag agtggtctct 6120cccagtgtct agaaatttta acatcttatg aaaatgatat
atatggtcaa aaatgtattt 6180aacctttccc ttggctgcct tccagggcca gaggaagatc
ttccaaggaa gatggaatac 6240ttggaatttg tttgtcatgc accttctgag tatttcaagt
cacggtcatc accatttccc 6300acagttccca ccagaccaga gaagggctac atatggactc
atgttgggcc tactcctgca 6360ataacaatta aggaatcagt tgccaaccat ttgtagttca
caaattaaaa ctgggtttcc 6420aggcctggtg tggtggctca cgcctgtagc cccagctatt
gcaccactgc tctccaagct 6480gggcaatgga gtcagattct ctttcttaaa aaaccacaaa
aaaactggat ttccagttct 6540ctaatattct tagtaccaca agatatgtca taggtatctt
taaatgaaat tcttagctgg 6600aaaagtgact aaaaagtttt tctcctgcta cctagtaata
aacaaatcat tgtttattac 6660tggtcactta gaaaattaaa agggataggg ccaggcacag
tggcttatgc ctgtaattgc 6720agcactttta gaggccgagg caggcggatc acctgaggtc
gggaagtgga tcgcctgagg 6780tcaggagttc gagaccagcc tggccaacat ggcgaaaccc
cgtcgctact aaaaatacaa 6840aaattagcca ggtgtggtgg catgtgcctg taatcccagc
tatttgggag gctgaggcag 6900gagaatcgcc taaacccagg aggtggaggt tgtagtgagc
caagattgca ccgctgtgct 6960ccagcctggg caacagagtg agactcttgt ctcggaaaaa
aaaaaaaaaa aaaaaggctg 7020ggcacagtgg ctcacgcctt taatcccagc actttgggag
gctgaggcag atggatcgcc 7080tgaggttggg agttcgagac cagcctggcc agcatggtga
aaccctgtct ctactaaaaa 7140tacaaaaatt agccaggtgt ggtggcgcac acctgtagtc
ccagctactc gggaggctga 7200ggcaggagaa ttggttgaac ccaggaggcg gaggttgcag
tgagcagaga tcgtgccact 7260gcactccagc ctgggtggac agagcaagac tccgtctcaa
agaaacaaac aaaaaattaa 7320aagggataga atataatgaa atatattttg aacttaaatt
atattctata tgtgtatctt 7380cctaggcaaa agctgtaatt tccagagaga ccattaggaa
caggtagtat ctatttttct 7440ccattattta tttctagaaa ctcataaaat ggattgtatt
tttctataag aacaaaatat 7500taattaaggt atagatgact gaccaagggc ttaatcaaat
aaaatgacta acagcatcta 7560tcataaagcc acacaagcct tatgttctca tctcaaaaat
gctgtgacag ctttttggct 7620gctttaacca taagaaaaat gattggtgga tgattttatt
agcccaggct tttaaaaact 7680ttcatctagg ccacgtgcgg tggctcatgc ctgtaatccc
ggcactttgg gaggcctgag 7740tggatggatc acttgaggtc aggagttcag gaccagcctg
gccaacatga tgaaaccctg 7800tctctactaa atatacaaaa attagttggg tgttatggtg
catgcctgta atcccagcta 7860ctcgggaggc tgaggcagga gaattgcttg aactcgggag
gtggagattg cagtaagccg 7920agatcgtgcc actgcactcc agcctgggtg atagagcaag
actgtctcaa aaaagaaaaa 7980aaagaaaaaa ttttaattta atccttctgt agaaacaggc
attcagaacc attccattga 8040tcttaataaa gctgctcttt actgtttcta gtcaaaaatg
agacttcgat caaaccataa 8100gattttatac tgcagatagt cagcttcacc aaagccgcag
aggaaacatg tcgagatcag 8160gcttcctgct tgatagtctc ttgactacca ttaaaacgaa
tattgggagg tcatgaaagt 8220cattggtagg ccattagcat tgatatcttt aaaacatcta
ccctaaacca tctgctatgg 8280acccataata agaggcctgt tgtatatgaa attgtctaga
attcaggtgc aggtctttgc 8340cggttaagta agggagcaac acgtaaaatg ggagaggagt
ggggtgtact cacttgcctc 8400ctcttttgtc ctgatttaac cagcattttt caaccctggg
aaaatttgca gaatctaagt 8460tgattgtaat gattttgagc tgcagcagct ttaactctta
ccctttttcc acatagttat 8520ggtgtttgag ttggaaagaa acaactatag gtagctacac
gtacataatt atctctttat 8580tcacaaaggg tatagtaaaa ttgattgtaa ataactttct
aagtgccaat attcaaaact 8640tttggattaa aatgtatttt tcaccgtgca tttactttgg
atgtatttat ttcatttaaa 8700caatttaaat ggggctcttt aaccaaaaat ggtatttaaa
accaaaacag tatcgtactt 8760agaatttgga gtagaggccg ggcacagtgg ctcacgcctg
taatcccagc actttggaag 8820gctgaggcag gcggatcacc tgaggtcagg agttcgagac
cagcctggtc aacatgaaac 8880cccgtctcta ctaaaaatac aaaaattagc tgggcgtggt
ggcgtgcgcc tataatccca 8940gctagtctac tcgggaggct gaggcaggag aatcgctgga
actcaggagg cagagactgc 9000agtgagccga gatcgcgcca ctgcactcca gtctgggtga
cggcatgact ccatctccaa 9060aaaaaaaaaa aaaagatttt ggagtagatt catcattaat
aagtaacaga ttttaggaaa 9120atcaaaaaat ggctaataaa atgaacacaa tgtaaaacat
ttattaaaat gtagactttt 9180aaaaatctat aaattgatca tctgtttata aattggcaga
tggttgtgta ccatctttta 9240aaataaagat tgaatttcac ccagtgtgat ggttcccatt
gcttatattt ctcctgctga 9300ggccggacct gatatggccc tggtctgtgt tcccagcctt
gtttcctcat taccactaaa 9360atctttcccc tgtatgcccg cccaattttt ctggctctga
gtccttgttc atactgttct 9420ctccaattct accttccaaa ggcctttctt aacaccttcg
gattctttct ttgagaactt 9480tccagattcc catgcctttt tggaatcaat ctctatccta
ttgtcatcac atttaagttt 9540ctacttccat catcctcact cctatccctt tggtcctggg
atgacaggga tgctgtgttt 9600tatttactca tctttgtaac ttccacataa cctaaccccg
gttcttgctt atgggagatg 9660ctgattgtag ggtctgagtt agatactgtt aactaaaatg
cttgttgata ttttagttat 9720taattcatat taactttggc tgaaactttt aaattctatt
gtgaatagtc aagtaaaatt 9780tagattgtta cattctgggt tagtattaga ttgtttttaa
gattgtttta aacaagatgt 9840ttttaagatg agttttaaat agttctctta acacaaataa
agcttaatat gagtatttga 9900aggaaattat cccaaaccat tccagttcct ggctgtgaaa
ggcttttcca ggcctaataa 9960gttttccact tcagccgtaa gtaggtgaaa tcaaatgaac
aatagaggga aatgtattta 10020tttgctttat acacatgcat gtgtgttgtg tctacatata
aacattgcac acgcttagaa 10080tgaagtttct gtcatgccca gaaaagggag aggcattttt
gtggattttg tctggctgcc 10140ctggggatgt ttgaagaact gtgctgttta cttcatacca
ggtgtgtgag ccataccttt 10200ggtaggaggg tatacctcct acacccaaga aatataagcc
aggagaaggt ctgtgccaag 10260agaaggaacc caaatgaccc acaagaggtg ggccattaat
tattgggtca gatgcataaa 10320tgcacagtaa tttatttaag cacctcttaa tggtgaccca
caaggaagat tgctcgtagt 10380agcggaaagg ttcacaataa ataagagaaa aaagcagaat
gtagaactgt atgatagcaa 10440ttctgcaaac aagaagcatc ttttataaaa gatggaagga
gcccaggcac agtagctcat 10500gcctgtaatc ccagcacttt aagaggctga ggtggaggat
cacttgagct gcagtgaccc 10560atgattgtgc caccactcca gcctgggtga tagaagtgag
accttctctc aaaaaaaaaa 10620aaaaaaaaaa aaagacggaa attcctccag aattttaaca
tgtcaacaga ggttttctgc 10680agctactttt ttcagcttta tacttcgcag tattttccaa
attttctcta acaagcagta 10740ttttccaaat tttttacaat aagcacacac acacacacac
gtttgtttgc ataagtgccc 10800aactggtggt gaacaaccgc tggcttttag tctatacata
tctagaatat tttataaata 10860gtagttctta aacccttgaa agggagtgaa tgaccagctg
agaaaataaa gtcagtgatt 10920tcattatttt cctatattca catcatgatt ctaggaaaga
acttgggagt gacttccttc 10980agcttcagcc actcctgggc caggcgcatg cttagctctg
tggtaaaggt caccagcttc 11040ttctgcaggg tgcctgtatc atctgaattg gaggtttggc
gagggtaaga gactgatgta 11100ggttcaagtt tttctttcct gtcctccact tgaaatctgt
cttcccttcc agactgcctg 11160cgctgctgac ttaaggcccc aacaccaaac acagaagcaa
cagccttaca cagagtgttc 11220agcaagctcc aacaattgtg taaggtaaag tttcctttat
agattccttt tctatatcgc 11280tcctagtggt tctgtttctc tgatcgaatt ctggctgata
acagttgctg agactctgaa 11340agagaaggca aggaactact gtttctcatt ataaactgtt
tagaattatt tggccatctt 11400tttgctatga atatgtagtg ctttgataca ttttttaaat
caaaaagtaa tgaaagagat 11460cacataggga aagatagatt ggattatttt taaagtttat
atactaaatt gaaaagcaaa 11520gaataaaatg ggagaaacag ctccctcatg tggctgttgg
caggaagctt ccattcctct 11580ctgtgggcct ccacaggttt gctcacagca aatggtccgt
gacagaaaga cgcaagggca 11640gttgcaccca agatggaagc caccatcttt tctataacct
aatctgaaag aagggacata 11700ccagcacttc tgccatatgc tgttgggtca cacagaccaa
ctctggtaca gtgtgaacac 11760aggaccacac aagggcgtga attccaaggg cagagaccac
tagggaccac ctcagaggca 11820cagagggaca ccctatccag ctggtggcca atgtaaatta
acatagcttt ttagaatagc 11880aatatgtatc tataatctta aaagtattaa aagtacttct
tgatccagta atttcatttc 11940taagaatcca tgctaagagg atttaaaatg tggaccaaaa
aatgggtata aaaagaagtt 12000gttaacagta tttaaagttg tgaaaaacca gaaacaatct
aaaggtccaa caataggaaa 12060atgaattttg atatttttct aatagaattt tatgctgtca
tcagaaatac catttacaaa 12120taatttttaa taacgcaaaa aaaagtttat aaaatgttta
gtgtaaaacc tggacacaac 12180tacataatga ttctgatttt gtaaaaaaaa aaaacaaaaa
cacacacata tacacatgca 12240tacatatgca tataaagaaa actggaacaa acaaaataac
aagcatagtt ggaattacag 12300tcattttaat attctttatg cttttaaaaa ttttgaagtt
tgtattacta gcatccacta 12360cttacgtagt caggaaaaaa atacaacttt aaaatagata
tttaggtcca aagatggtaa 12420tctaaatggt gttacaggct gaatgtgtgc ctgatcccca
tgccccaagt tcatatgtta 12480aagccctggc ccccaaggca atggtattag gggagtaggg
cctttgggag gtaatcagat 12540ttctacgagg tcatgagggt ggagcccgca tagtggaatt
agtgtccttt taggaagagg 12600agaacagacc aaagccttcc tttctctcct cactatgtaa
gaagacagcc agaaggtggc 12660cacagccagg aagagagctc tcaccagaac ccaaatctgc
tagcaccttg ctcttgggtt 12720ctcagcatcc agaactgtga gaaatgaatg tgtgttgttt
aaaccactca ggctacggta 12780ttttgttgca gcagcccaag ctgacagaga tagaaacaac
acaaggaccc atcagcagac 12840gaatggatga tcaaaacgtg gtgaggtcgt gcagtgggat
attattcagc cgtagaagga 12900atgaaattct gatacatgct ataatgatga accttgaaaa
catgttaatg gaaataagcc 12960aaacttaaaa ggacaaatat tgtataattc cacttatatg
agttagttac ctagaatagg 13020caaattatgt catagataca gaacattaga ggttaccagg
gttgtgggaa gaggggtatt 13080gtgggtacaa attttcggtt tggagtgatt ttgaaaaaat
tctggaaatg ggtagtgaca 13140gtagtcaaca tgatgaatgt acttaatgac actaaattgt
acacttaaaa atggttaata 13200ctgggctggc gcagtggctc atggctgtaa atcccagaac
tttgggaggc caagacaggc 13260ggatcatgag gtcaggagat tgagaccatt ctggctaaca
tggtgaaacc ctgtctctac 13320taaaaaataa aaacaaataa aaaaaaaatt agccgggcat
ggtggcaggc acctgtagtc 13380ccagctactc gggaggctga ggcaggagaa tggtgtgacc
tgggagtcgg agcttgcagt 13440gagctgagat cgcgccactg cactccagcc tgggcaacag
agccagattc cgtctcaaaa 13500aaaaaaaaaa aaaggttgat acctgggtgc ggtggctcat
gcctgtaatt tcagcacttt 13560gggaggccaa ggcaggcaga tcagttgagg tcaagagtta
aggaccagcc tggccaacgt 13620ggcgaaaccc catctctatt aaaaatacaa aaattagtcg
agtgtggtgg tgggtgcctg 13680tagtcccagc tgctgggagg atgaggccta ggaattgctt
gaacccagga ggcagaggtt 13740gcagtgagtt gagattgcgc cactgcactc cagcctgggg
gacagagcga gacttagtct 13800caaaaaaaag gttaaaattg taagttttgt tatgcatatt
ttaccataat ctttaaaaaa 13860tagatatata ggagataaag tcaacagaat ttaataacca
gttgtaaata gagactgagt 13920gaggaggatg aattaaggaa gacattgagt acaacttttt
ggtaggtgaa aaactcttaa 13980aaaaatacgt gggcaaagat cctacttgat tcttataatt
taaaaatctc ccagttagta 14040aacaaggcta ggtggagatt tgcatgtgat gtgaggtgtg
tgttctgttt tgtaatgtga 14100ggactgtgag ccatctcctg gacttgaata tccattagat
aattgaaaat acggatttga 14160gaactcagga gacgtgcaat gcagtaacaa aactctgcac
ctagttgatt tctgtctcct 14220aatttaatgc ttttatggga caaactgtta ggcaggtggg
caagatggac agccatattt 14280ttgtgggttt ctggcctgtg ggccagcctc agtgctcact
ctgaggtcat gtccaaactt 14340agaacacatt caggcctacc acagtcaagg ctccctttct
caactctagt cctctgcaca 14400aatatccgaa gcctagaaat aataatcatc tgtccttgtg
tcttgcatta tgaaagccta 14460ggaaagggcc ttgggaatta agaagaatgg aaaaactggt
ctaactgctg catgcttcag 14520cttgcagggg aatcactgaa atggggacag gccataaaag
gacaaccaga agagtggctt 14580cagcaaaggc atcgtttttc agagcaagct agagaatcct
gccagcgtcc tcaggcaggg 14640cccctgggca cagaggttag gcaagggagt gtcccagcat
gttgatgccc tgagcatcag 14700aataatgcca tagaggagct tccaaagagt tcatttcagg
ttttgtaagc cgaacatttc 14760taggcaaata aaatttgatt ttgtgaataa agcttgtttc
ttcaactcca gtgcagattc 14820tcatagattg atagtggctt gtgatccaga taaagaaaac
aatttttcaa agattcatat 14880tctttgtaga tgtacggatt tagagaccat ctaatctaac
tccctcattc tacagatagg 14940aaaaatgagg cctaaagaag ttaagaaaat accatggaaa
tgtcactgct gaactgccat 15000acgtaggatc cgaaagaaat tgggtaaatg ctactgtgag
aaatacagta ctaggtccaa 15060agaatctaat acaaattaaa aatctaaatg ttatttctaa
agcatccctg cacatggctg 15120aacttacata gtttcatttt ctttcttttc tgttgaagaa
gaggcaattg gctgggtgca 15180gtggctcatg cctgtaatcc tggcactttg agaggccgag
gcgggtggat cacctgaggt 15240caggagtttg agaccagcct ggccaacatg gtgaaacccc
atctctacta aaaatacaaa 15300aattagctgg ctgtggtggc cgctgcctgt aatcccagct
actccagagg ctgaggcagg 15360agaattactt gaatctggga ggtggaggtt gcagtgagcc
aagatcacgc cattgcactc 15420tagcctggat gacaagaggg aaactccatc tcaaaaaaaa
aaagaaaaaa agcaatcact 15480aacctgtgtt gtttattaaa catgacagac tggcatgaag
taattaccaa actgtaaaca 15540aaaaagctac aatctgccag gcatggtggc tcatgcctgt
aatcccccac cttgggaggc 15600caggttgggg gatcacctga ggcctggagt tcaagactag
cctggtcaac atggtgaaac 15660ctcgtctcta ctaaaaatac aaaaattagc ccggcgtggt
ggcacatccc tgtaatccca 15720gttactcagg aggctgaggc aggagaatca cttgaacctg
ggcagtgggg aggttgcagt 15780gagccaagat cgcaccgttg tactccagtc tgggccgaca
gagtgagact cggtctcaaa 15840aaaaagaaaa aagaaaagct acaaccttaa tctcaacttc
tcataacatc atctctactt 15900ctgattagaa gagtggaagt ggggaggttt attacaaaaa
gactgttata ccttacacac 15960ttctccccat gaatagtgaa ggtgtgagtg aaaaagacag
caattttatt ttttttttga 16020aacaggttct tgcactgtca cccgggctgg agtgcactgt
tgtgatcact gctcactgca 16080gcctccacct cccaggctca agtgatcctc ctacctcagc
ctcctgagta gctgggacca 16140cagttgtgca ctaccatgcc cagctatttt tttttaagag
atggggtctc actatattgc 16200ttaggctagt tctcaaactc ctggcctcaa gcagtcctcc
gaccttggcc tcccaaaggg 16260ttgtgattac aggcataagc caccacaccc agccagcagt
tttagaataa agggtgaagg 16320tgctgttggg gaaatataat ttaaaaaaca aaatcttctc
tcaacccaga aatcctctcc 16380atgaaggcag tagagaaaga taagctttat tattgaataa
aaattaaatg agaatgtgat 16440gcacatcaca ggcactttgc taagagatca caaagacaga
aggaaatttc accattttgt 16500acagccaagc aggtacagcc cattacatgt atgttttcga
gataaatagt cctcaactaa 16560gagaacttga cagcaccact ggtcacacag ttcattctaa
ctttacctga taattgatgt 16620gaccacttgt gttatctaag atatcaactt ttcgggggtg
ggggagtgtg gaaacaggag 16680ttacttttat agcttggtgc aaggtactca ttaagattag
gctgttaccc tcccacagaa 16740actggaagat aggtatgcta tctggtaatg tttacatttc
ccagatcctt gagaaagaca 16800ttcctaggtc ataaagctga caaaaggctg attcagtttt
taaatatata tatctgtata 16860tgtatttca 16869
SEQ ID NO: 65, length 15000, DNA, Homo sapiens
gatctcttga tcccaggagg
tcaaggctgc aatgagctaa gatcaagcca ctgcattcca 60gcctgagtga tagtgggaga
ccttgtcttt aaaacacaca cacacacaca cacacacacg 120agggcctttg accactcttg
agtagaagac tcgagaagaa caaagtagaa ggccagagaa 180gaacaaagtt acttgaaaga
tctcttatta aagagaatgt acaagctatg aaaaaaaaaa 240aacacacaca cacacacaaa
cctcatctgg aatgaaaaaa acataatgca tttggtttct 300ggttccttag gctgttatgg
aacaaccaaa gaacattatt ttggtttctg aggtcagaac 360tattttattc ccctcaagca
cactatgctt atggtttgag ggagaatgag aaataggaaa 420ctaggaacag gctgaaatgg
tctaatcttg accatctaat tctgcagtgt cttattctca 480ttctaaaaga gaatggttat
attcgctgtt ctagcataaa aagtaatgat aaaaataaaa 540gatcccgtat taccagacaa
taatccccta gactgtttta atgcttggtt gagtatttgc 600ttatgatctc agactttaaa
agatggtctc cccctatggt gaagcttgtt aattatgtag 660gcatcattaa tgtctgttta
cttatcaaaa ttttatcatt gttagttgta ttactacttg 720acagtccaat ttatttaatt
gaaaagattg gttaacattt tatagtcaaa gtaattgttt 780cctgtgtttt ttcctgttta
ggttattgga gtgatgagta aagaatacat accaaagggc 840acacgttttg gacccctaat
aggtgaaatc tacaccaatg acacagttcc taagaacgcc 900aacaggaaat atttttggag
ggtaagtaag ggaaatttct tcagacccat taaatgttag 960gaaaaaatgg agctaaaaga
gctgggtggc tcacctttct catcctgtgc tgagaaatgc 1020tggggctcac ccataagtat
ccagcatccc catggacaca gggaattctg aacaaatgtg 1080atgaaaccga tgaaatgtct
ggcctgtagg tggttagtga tggagatacg ggctatatgt 1140gaatcttgat ttttgcaatt
cattagagct ttgtaatgaa aggaaacagt ttgttgcttg 1200ctttaaggat aggttcattt
gcatttctcc gcaaggaagt agtaatgagt taccaagcct 1260tagatttcac ccctttttga
tttcttgctg acttaacttt aattgaatgg aagagttatc 1320acaaatgaat tatctttttg
gttttttttt ttttgagatg gagtctcact ctgtcaccag 1380gctggagtgc aatggcatga
tctcggctca ctgcaacctc cgcctcccag gttcaagcaa 1440ttgtcctgcc tcagcctccc
gagtagctgg gactaaggtg cgcgccacca tgcccagtta 1500atttttgtat ttttagtaga
gacggggttc cactatgttg gccatgatgg tctcgatctc 1560tggacctcgt gatccgccca
ccttggcctc ccaaagtgct ggaattacag gcaagagcca 1620ccgcgcccag ccaggaatga
caaatgaatt accttataag taaatgccat taaggaagga 1680tagctggaag atgggttgag
gggaatggag gaccacagaa ctagtcctat ttaaatacat 1740gtgcatggta aaatgattcc
atttgacaat aggttaatta tctcatagca taaggaaaat 1800gcttaacagt catatgcaag
atgataagct ttcctatagc atccaaccaa aagatctagc 1860cagtacaatt tcctttgcta
tattagggtt agaaaggccc ccagaggtga accaattaga 1920tggaatcctt gaataaaaca
ctggattagc agtgaacaga aaaaagtcag attgctttcc 1980ttcttcccat agatgtctca
gggatattta gtttcctcag aagataaaga atttagtaag 2040cgtttttttg tgcatactta
catgaaatgt acattatttg aattctttaa aaagaaacag 2100ctgcatgata acaaaaattg
tgttatgctt gctttagctg gtatttttgc ctagaacgat 2160tatatcgttc ggacaagaag
ctattcctaa gaaacaatat ttttaatcca ggaagttttt 2220catttttaga aatttatctt
actatttccc aagcaaaaga gggtagttac agattcacta 2280agaatcatgt gctcacaatt
tttatttaat aattattcct ccttaaaata tattaatcac 2340ctgacttaca atggtggaac
catgagtgca tttttgcctt tattgtcaat aacgtcttct 2400cagaagtgag ccacaaaggt
gcatagttct tggagttaaa ggtctgaatt aagacaatcc 2460agcataagtc tcattaatgt
gtgattattt tgagaaaagg caagaagtac ctaagaatct 2520ccccctcact gtccagttcc
ctgtttcatt taaagattca ctgtaagtaa ctgaaaggct 2580ttccttggga ggatttattt
gaatcagtct ttcacatgca aaggatattg tagaacatct 2640cgtttttgct ggcaggaata
tgaacatctg ttgtgaggaa agaaaaagtt tcatgcaaat 2700tacactgcca aagaagggat
gttcaagttg agaaaccagt gacatttctt gtaactgtac 2760tatgaatcag cgcattttaa
tcttctagat aatatatgga agtgcaggaa ggtggtagga 2820aacggtgttc attttacata
tgcgttattt tattctgtgt gagtgacttc atggcaccga 2880cattgctgtt tttaaatgag
gatacagtaa attgcagtcc gaggaaggct aactggaatc 2940aacatacccg tagctttaga
aagcagtttc cgcaccagcg aagagtacaa gagcgatgga 3000accccatgtt cctggaagtt
tgcacatcag agtaaacaaa cttgaaaacc cctcttgata 3060gcagaattca cccagccttg
ttccattttc tcttaacaaa acacaccgca aaagctctca 3120caagctgctt tgatgaagcc
acatgtattt cccccttcac aatttacagg aagttactct 3180taaaagaaag tgattctggt
gtttaccgcc tgtgttaaag ggacagagtt cctttttatt 3240tctgataacg tttgagcgaa
atacagaaac tatctgtaga ctagcatagt cggtacgtga 3300gtaaggaaaa gcaataacct
gctgtccggt gagcacaaaa ttcctgctac gaacagtgcc 3360ttactgctgc ttggagactg
caagtcgcag atcacactag gtattgactg attgtataag 3420gaaatttctt aaagtctaaa
gtaaaggtgg tacctcctaa aaagagggga agagagaaaa 3480ctttgtgtgg aaggataagg
agtgtgttta tagtttcagt aagagtgtac gttttaattt 3540ttcttcttcc tctgcctctt
tgccaagtag cctgagtgca tctgttatcc agaagtagta 3600ttactctagg acaaacttca
aattcttcat tctgcgttgc ctttaaggaa caacatactt 3660tcttcctgtt ctttttccaa
aaacacacgc ctatggctct gtgtgtggtg ttttagccag 3720cctcctccca gataaggggt
tcccttccct cctttgcatt gaaaggaaag tgcaagtctg 3780gacatgttta tcaagaggaa
aagtgacttc tcagtaatag actgtcaaat tcgggctgct 3840gcccgagtgt tcgctttgtt
atggcaggtg aagttcacct ttgccccacc cagtgtttcc 3900acaaaaaggc aaggttccaa
gtattcatat gaacaagtgt tactttagga cttggagggt 3960tgggggtgga ggatgtttgc
atagttgaag ccttgggcgg gggtgtagga aacggcgagt 4020acagaggcca tagaaaaagc
taagactcag tttgacgtcg tcagccggct tggtcttcta 4080cccagtgact caaagcacta
aaagtcagca taatcggaac tgaagtcagt agcatcgccc 4140atttgccatt cactgcagta
gcaaaagtag tactctgtgg tgggttaatc ggtttgaggc 4200agctccttaa atgaacattt
gtgtttcatt tttctgttat tttcccgaac atgaaaagac 4260gataaaactg aaatggaaaa
ggtaactgac aaaagtgtgc cttacctgtt tccgccctga 4320tttctgctga ttcaagacta
ttctggctaa actgattgga ttctttttct aactaggcag 4380taggggatca gaaatcacac
acggtaccgg ctgtgtttat tctgagaggt gctggggagc 4440tttgggtctg acttcctttt
acatgcctgt cttctctttt ggacagatct attccagagg 4500ggagcttcac cacttcattg
acggctttaa tgaagagaaa agcaactgga tgcgctatgt 4560gaatccagca cactctcccc
gggagcaaaa cctggctgcg tgtcagaacg ggatgaacat 4620ctacttctac accattaagc
ccatccctgc caaccaggaa cttcttgtgt ggtattgtcg 4680ggactttgca gaaaggcttc
actaccctta tcccggagag ctgacaatga tgaatctcag 4740taagtggatt acagaacaaa
aaaataaaaa atgccagtaa tgtcggttct gcccctttga 4800actaataaca tgttgtttaa
ttatacggct ttgtcatgtg ttggatgaag taggtggctt 4860aagctaggga ctaggaagag
gaaaaacatt ttttgagtcc ctattaacta ttaggaaact 4920tgatcattta aaagtatata
tatatatgag gagctacctt gagttttgaa ttcaggatgt 4980tacaggaaga aatatatgtc
caattctaat ttatccaaaa gcagttggga gaattacagg 5040gattggtcca gacatgctgc
gtatgcaagg tatagccctc atctgtggta ctttggcagg 5100gcttagactg catcaaaata
tttatagatg tacatttgag tgtacagtta ggatctgatg 5160tggaacattg taagatcatt
gctagaaaaa ctttgtcata atttttcaat attattctaa 5220gtgaataacc gtaaagattt
tacatcttag cttccttcct tacagtaaaa aaactatctg 5280atctcttgat cagtattata
gtagccacct atcactttat cttaacaaat tctcaattcc 5340ttaggtttat gtgcttttac
ttcttttatt tgattaaaat tgctgtcatg acctctctct 5400gcagagggct gcatcatttt
ggtcattctc aagtgatctc tttgagcaat ttaagaattg 5460ccataagatt ctaacctctg
ctgtaactat ggttgtgtgt tcttggttag accactaaat 5520cttattagca gttttaaaaa
ttattccttt tggtttagaa gttaagacta aatgctgaag 5580tttttgtaac ttttggtttt
gatatcattt caaacttaag aaaacatttg aagaaaagga 5640caaagaattt ccacttaccc
tttacccagg tttaccagtt attgataagt atatccattt 5700gctttaccag aaggctaact
tgttttagtt ctcattttca cctttgagac atttggaata 5760aatatcaatg ttaacataaa
ttggaatttt gactttgatt ttaggaccaa tgaacaagcc 5820aagtacttac cctagtcata
tataatccaa ctgtatggtt atttggtatt cattccacac 5880ttcattttac ttgatctccc
ttaagattgc aagattgtgt ttgcagtttt tctgaaaatc 5940tggggctata aaagcatcag
gacctccccc gtaggggagg tcgtgtgttt ggggtcctta 6000cacaacaggt tacccttgag
cttcaggaaa agaactggct ctcagttccc cagttccagc 6060ttaatgggtc taattaggtc
ctgaccaaaa aggtggcagt tcttttccct catgtctctt 6120cagcgctccc cgagactctg
gagactctgt catatcccta gggctgagcc tcccaggaac 6180cattcggctg ttgtggcatc
tgtgtatgcc atgcccagtg ctgaggacct agtaacaaac 6240gacaaatgca caggcacagt
ggcatttttg tggaactcgt attccagctg tgcgtctcag 6300aagaagcgca cagctccctc
ctggctttct taacatagtg agccacttcc acttaagggt 6360ctccttacat tccttgagtt
taatcattca tggattcaga ggaaagtctt ttgatttttg 6420cttttcttta aacagttcat
ttgaggtgac ctaccccagt gactttgcac caaccaccaa 6480gaaacttttt tgcatgcttc
ccgcaccctg tgccaatcaa gggaagggtt taaaggcctg 6540gcgtttttat tcctcaaaga
aaggttttgc acagtatttt aaggttcaag tgcttctact 6600ttgtgttcag aagcaactgt
catatatact gtgaaatgac accttttatt tatccctttt 6660tatttatgca gtatgtcccc
ttttattttg gcagaatttt ttctaaatgg tggtttaaca 6720ttttcaagca catttcattg
tccaatattc atagtaaaga atgagagtta acaataacca 6780gtcacattaa aacaagattc
ctgctgccag ttgtgaaacc ggttgtctta ggcgtggcag 6840ctgatgattg agactgtgat
caggaaaatt tccactattt catcaggcct aataggtaga 6900ttgtgtctcc aaatgaactg
tgttgggttt ccatgcttaa agcacaatag aggtggtgca 6960agaatctcca tgagggctta
aatggcagtg atggttcagg cggtagagtt tggagaagaa 7020gggatttgaa acaaaccaaa
ggaaagaaaa gtaagtagcc agaaatcaca aaatggcatt 7080tttctaaaaa caaaggaaaa
ggaataaaag aactaataag tttgaaaccc ctacccctcc 7140caaatttggc agggggggag
gtattttttt tctatctatc taactaaccc atctagaaaa 7200cagttgacca aattatagac
ttctaaatgt taatctgctt tctcagtttc agttgaaaag 7260agactttgtt ttgcctactg
cagaacttct aggttctttc ttatagtctt ggggttctta 7320ttatagatcg aaaatgtgag
tcggcataat taagccattc ggagtcttca gaagcagttc 7380actcttgaaa tgactccgtc
cgcctacagc catttaagat ttcagaacaa aaacagatct 7440tgattttctt tttcatgtta
actcaagctg ttgctgagtg ggagagtcag aaatgacacc 7500agctccactg attactcagc
tgctgaagga tgatttttta aaatgcacct ttactgtata 7560tggacttcct aatttccacc
tgtagagcat cttagggagg ctaacatgtc actctggatg 7620ttcttttaga ataagatgca
aatctatttt tctgaaggca ttagagatag caaacattta 7680ttgtgagttt actatatact
aggcactgtg ctaagtgttt tgcatagaaa gtttaaaatt 7740ctggcttttt tgttggccca
atcataagtt tcatatcagt tcaacattca aattatatta 7800aggtacttaa gaagaatccc
tggctaaatg tgaggggcag tgccacagat ggactgaaac 7860tttatgctta ttgcacattt
atgctattat tatttgttga attatagaac caagggagtg 7920tggaagccac tggaaaaaat
atgagactta gatacataat ttgagtaaaa atggctcaaa 7980gtcatgaggg taaagttttt
tgtatttcca ttttattcga gcggcatcgt ttttaaaaat 8040cattatgaat ttgaccctat
atagatgttt ccaaataatt ctttttcacc ttcataaaat 8100tccttcctgt ggctgtgaga
tgccttgcct atcagttttc aagcttagtt gtctttctca 8160tcctttacca ttttagcttt
aaaaaacaaa agtgacaatt agaacttcct gcctgctggg 8220cctcactgaa agaccgatat
tggcctgata aggagatatt tattttgttt tagtggcttc 8280agaaatccct ctccctcagc
aagctttcca tcacggcccc cccgtcagca tcttccctga 8340tagcgttctt ctctgtgttt
attctggggc ttcaggctcg cccaggagga actgataacc 8400gctggcagga gataacattc
tctaaggggc tctcaaattg gaatcgaatc cctcaagcca 8460gtcagcctag agaatacatt
taaagggttc agttctggag tttcacagag ttcatttcta 8520gacctatcag atagcaagtg
tggagttctt tctcaactaa attcaagcag agacattttt 8580tagacgatga aggatatttg
cacaaaggct tcagcatgat cccccaaacc tgctgcctct 8640gaaggcatct ccacacattg
acagccaatg ccttcagtgc gttcctaggg caggtgtcct 8700ggcttgagtg actgtcctcc
aataatcaga gctcaaacta aacatcgtat gttttacttt 8760tggtttccag gcaaggctga
gcagggaatt ttcagttttc cctgcccaga tgggtgtttt 8820ttcctgaagg catcatttat
tgtgtagcga ggagacaggg ctggctgtgg cagggatagt 8880ctagaactgt cctcattgct
gctgttccta aatagtatct ttaccaagta ataacgtgcc 8940gtctttggga ataagtgctt
tcctcttagc ctgttctgtt ttcttgggtg cgctaagtaa 9000ttgaactggc tcaggaagta
cctattgtgg tttggcagag gtgactgtca cgccttgtga 9060ctccaggggc cagcactgct
gggatcctgg ctagaccaga cagagccttg gtgaagtgct 9120taggctgtct gcacatcgcg
aggaaggtgg tattcacttc gctaagctcc ttggcatagg 9180cagtttgaac agggctttat
caaattcgta ttcaacaaga gtagaagcga aaattgatga 9240ctgtgtatta cttgaaatga
gtcttaatct ttcacattta gttctcaggg tatgctgatt 9300tcctttaggt aaaccatgaa
catcagaaag acttttatta acctatgaca gggtccccac 9360cccagtattt ttccactcca
ttaaaatgga agtttttttt ttttttttct tttttgagac 9420agagttttgc tcttgttgcc
cagtctggag tgcaatggca caatctcggc tcaccacaac 9480ctccacctcc cagattcaag
cgattcttct gcctcagcct cccaagtagc tgggattaca 9540ggtgtgcgcc accacgccca
gctaattttg tatttttagt agagatgggg tttctccatg 9600ttggtcaggc tggtctcgaa
cttccgacct caggtgatcc gcccacctcg gcctcccaaa 9660gtgctgggat tacaggcaag
agccactgca tccagcttag gctatcttac tccagcctaa 9720acagcaattt tctatcataa
ggtctgtact aatgaaaaca gaatcaccca aggctgctgt 9780ttgttctgtc tgtgctgcca
ttgtccgcat tttgctgagg aggaaacgga actgcacttt 9840tgagtgagtg gcccagagcc
ttctagaatg agagtgcgtt ggaagccaga tatgtggcga 9900ttgtgtcgcc agctgttact
caggttttct caagaaggag gagcaacttt ggcagttttg 9960cttcagttct ctctagccct
ctgtgtaatc gccccttttt ctttatttca gcacaaacac 10020agagcagtct aaagcaaccg
agcactgaga aaaatgaact ctgcccaaag aatgtcccaa 10080agagagagta cagcgtgaaa
gaaatcctaa aattggactc caacccctcc aaaggaaagg 10140acctctaccg ttctaacatt
tcacccctca catcagaaaa ggacctcgat gactttagaa 10200gacgtgggag ccccgaaatg
cccttctacc ctcgggtcgt ttaccccatc cgggcccctc 10260tgccagaaga ctttttgaaa
gcttccctgg cctacgggat cgagagaccc acgtacatca 10320ctcgctcccc cattccatcc
tccaccactc caagcccctc tgcaagaagc agccccgacc 10380aaagcctcaa gagctccagc
cctcacagca gccctgggaa tacggtgtcc cctgtgggcc 10440ccggctctca agagcaccgg
gactcctacg cttacttgaa cgcgtcctac ggcacggaag 10500gtttgggctc ctaccctggc
tacgcacccc tgccccacct cccgccagct ttcatcccct 10560cgtacaacgc tcactacccc
aagttcctct tgccccccta cggcatgaat tgtaatggcc 10620tgagcgctgt gagcagcatg
aatggcatca acaactttgg cctcttcccg aggctgtgcc 10680ctgtctacag caatctcctc
ggtgggggca gcctgcccca ccccatgctc aaccccactt 10740ctctcccgag ctcgctgccc
tcagatggag cccggaggtt gctccagccg gagcatccca 10800gggaggtgct tgtcccggcg
ccccacagtg ccttctcctt taccggggcc gccgccagca 10860tgaaggacaa ggcctgtagc
cccacaagcg ggtctcccac ggcgggaaca gccgccacgg 10920cagaacatgt ggtgcagccc
aaagctacct cagcagcgat ggcagccccc agcagcgacg 10980aagccatgaa tctcattaaa
aacaaaagaa acatgaccgg ctacaagacc cttccctacc 11040cgctgaagaa gcagaacggc
aagatcaagt acgaatgcaa cgtttgcgcc aagactttcg 11100gccagctctc caatctgaag
gtaggccttg agagagagca gtccaagggg ctgtgagtgc 11160atgcttgtgt ttgtatttag
cttgctttcc atggggtatc gattgcattt gcagtagtat 11220gagcccccgg ttggggatag
tgggtatgga ttccgcctgg cttttgccac ttctagctct 11280ttgactttgg acaagtgact
tcccttctcc tgattttctt ctgaataata aaaaaattag 11340gggtttggac tagaagatta
ggtgaaactc cctgctagcc tgtgattttt gtgcttttaa 11400gaaaaacacc attctgaaaa
catgaagatt tcttcttttt aagactgtct tgatgctttt 11460cttaagatat ttgcatcaac
acttgagtct tggagcagaa atgttaggtc tcagagccag 11520cttgagagca gagctaacac
atgtggcttc ttcccaggtc cacctgagag tgcacagtgg 11580agaacggcct ttcaaatgtc
agacttgcaa caagggcttt actcagctcg cccacctgca 11640gaaacactac ctggtacaca
cgggagaaaa gccacatgaa tgccaggtgc gcagtatttt 11700ctgggtagac cttctgacct
ttgtagaaaa tgtctgtgag tcaccctccc atgtcctata 11760tagcccgtag ttaaagccaa
caccagattc tgcgttgtcc catcctggac tgatggcact 11820atggtccttc ccagtacttt
gtatctgctg atgacttgag atggcacagc cagcttccag 11880tgggtgggaa aatggtaggg
gaaataaaca gcccctcgtg tgctgtgtgc ccacatcccc 11940ccgtttgctt aataccacac
tggaggtgcc acaaggaggc ttctcacctc ctaggttgct 12000gggcgttggc cggtaagcct
gcccctcccg ttggcaactc ttaatcttct ggccttcctg 12060tctcccttcc ctgctgtctc
tctcccctac actgtaggtc tgccacaaga gatttagcag 12120caccagcaat ctcaagaccc
acctgcgact ccattctgga gagaaaccat accaatgcaa 12180ggtgtgccct gccaagttca
cccagtttgt gcacctgaaa ctgcacaagc gtctgcacac 12240ccgggagcgg ccccacaagt
gctcccagtg ccacaagaac tacatccatc tctgtagcct 12300caaggttcac ctgaaaggga
actgcgctgc ggccccggcg cctgggctgc ccttggaaga 12360tctgacccga atcaatgaag
aaatcgagaa gtttgacatc agtgacaatg ctgaccggct 12420cgaggacgtg gaggatgaca
tcagtgtgat ctctgtagtg gagaaggaaa ttctggccgt 12480ggtcagaaaa gagaaagaag
aaactggcct gaaagtgtct ttgcaaagaa acatggggaa 12540tggactcctc tcctcagggt
gcagccttta tgagtcatca gatctacccc tcatgaagtt 12600gcctcccagc aacccactac
ctctggtacc tgtaaaggtc aaacaagaaa cagttgaacc 12660aatggatcct taagattttc
agaaaacact tattttgttt cttaagttat gacttggtga 12720gtcagggtgc ctgtaggaag
tggcttgtac ataatcccag ctctgcaaag ctctctcgac 12780agcaaatggt ttcccctcac
ctctggaatt aaagaaggaa ctccaaagtt actgaaatct 12840cagggcatga acaaggcaaa
ggccatatat atatatatat atatatctgt atacatatta 12900tatatactta tttacacctg
tgtctatata tttgcccctg tgtattttga atatttgtgt 12960ggacatgttt gcatagcctt
cccattacta agactattac ctagtcataa ttattttttc 13020aatgataatc cttcataatt
tattatacaa tttatcattc agaaagcaat aattaaaaaa 13080gtttacaatg actggaaaga
ttccttgtaa tttgagtata aatgtatttt tgtcttgtgg 13140ccattctttg tagataattt
ctgcacatct gtataagtac ctaagattta gttaaacaaa 13200tatatgactt cagtcaacct
ctctctctaa taatggtttg aaaatgaggt ttgggtaatt 13260gccaatgttg gacagttgat
gtgttcattc ctgggatcct atcatttgaa cagcattgta 13320cataacttgg gggtatgtgt
gcaggattac ccaagaataa cttaagtaga agaaacaaga 13380aagggaatct tgtatatttt
tgttgatagt tcatgttttt cccccagcca caattttacc 13440ggaagggtga caggaaggct
ttaccaacct gtctctccct ccaaaagagc agaatcctcc 13500caccgccctg ccctccccac
cgagtcctgt ggccattcag agcggccaca tgacttttgc 13560atccattgta ttatcagaaa
atgtgaagaa gaaaaaaatg ccatgtttta aaaccactgc 13620gaaaatttcc ccaaagcata
ggtggctttg tgtgtgtgcg atttgggggc ttgagtctgg 13680gtggtgtttt gttgttggtt
tttgttgctt tttttttttt ttttttttta atgtcaaaat 13740tgcacaaaca tggtgctcta
ccaggaagga ttcgaggtag ataggctcag gccacacttt 13800aaaaacaaac acacaaacaa
caaaaaacgg gtattctagt catcttgggg taaaagcggg 13860taatgaacat tcctatcccc
aacacatcaa ttgtattttt tctgtaaaac tcagattttc 13920ctcagtattt gtgtttttac
attttatggt taatttaatg gaagatgaaa gggcattgca 13980aagttgttca acaacagtta
cctcattgag tgtgtccagt agtgcaggaa atgatgtctt 14040atctaatgat ttgcttctct
agaggagaaa ccgagtaaat gtgctccagc aagatagact 14100ttgtgttatt ctatctttta
ttctgctaag cccaaagatt acatgttggt gttcaaagtg 14160tagcaaaaaa tgatgtatat
ttataaatct atttatacca ctatatcata tgtatatata 14220tttataacca cttaaattgt
gagccaagcc atgtaaaaga tctacttttt ctaagggcaa 14280aaaaaaaaaa aaaaaaaaaa
gaacactcct ttctgagact ttgcttaata cttggtgacc 14340tcacaatcac gtcggtatga
ttgggcaccc ttgcctactg taagagaccc taaaaccttg 14400gtgcagtggt ggggaccaca
aaacaaccag ggaggaagag atacatcatt ttttagtatt 14460aaggaccatc taagacagct
ctattttttt tttgccactt tatgattatg tggtcacacc 14520caagtcacag aaataaaaaa
ctgactttac cgctgcaatt tttctgtttt cctccttact 14580aaatactgat acattactcc
aatctatttt ataattatat ttgacatttt gttcacatca 14640actaatgttc acctgtagaa
gagaacaaat ttcgaataat ccagggaaac ccaagagcct 14700tactggtctt ctgtaacttc
caagactgac agctttttat gtatcagtgt ttgataaaca 14760cagtccttaa ctgaaggtaa
accaaagcat cacgttgaca ttagaccaaa tacttttgat 14820tcccaactac tcgtttgttc
tttttctcct tttgtgcttt cccatagtga gaatttttat 14880aaagacttct tgcttctctc
accatccatc cttctctttt ctgcctctta catgtgaatg 14940ttgagcccac aatcaacagt
ggttttattt tttcctctac tcaaagttaa aactgaccaa 15000
SEQ ID NO: 66, length 46340, DNA, Homo sapiens
tattttactt cagtaacaga aaatgaaaga aatgttttaa tgttgctgat tgtattacct
60tcaggatcaa tagcagaagg acaaacttct ttgaggagat ctcctagtgt gtgcaactgt
120ccatctgcag ccacaggacg aaacagcttc tgaatgaaag gtctttcagt cgttgtctat
180ttgaaaaagg aaaaaatgat tcaagcaatt aagtctttgt tgctgccaat tacaaattta
240tatatcataa actttatgtt ggcattaggt gccttttgat acggtgttag cataattaca
300caacatcaca gatgtggtat cactgtgaaa aatgtttaac atgataaatt caggtaaatc
360taattctgag gaaacagaca aatccaaagt tgggtgggac attctaaaga taattggctg
420ggacccttca aaaacttaaa gacattaaaa agcaaacaac acaaaaagat atcaacaaaa
480gcattttttc tcagtatctc ttaaagagac taacaaagca aatacaaaac ataaaccatg
540gctgaatact aaattgaaga aggacatttt ttagaaatcc aactatgaaa cacagttttg
600ggataaatgg ggaaatacag aatggacaac tgataatatt attgagttaa tgtcaaattt
660cttaggtaca ataaggacaa tccttatttt taagaaattc attgttcaag tgtttaggaa
720agaagtgcca tgatatccaa aacttaatct tctttctctt tttttggaga cagagtctcg
780ctctgccacc ccggctggag tgcagtggcg cgatctcagc tcactgcaac ctctactttc
840caggttcaag tgattctcat ggctcagcct cccaagtagc tgggactaca ggagtgcgcc
900accatgtcca gctaactttt tgtattttta ctagagatgg ggtttcacca tgttgcccag
960gctggtctca aactcctgag ctcaggcaat ctgccggctt cggcctccca gagtgttagg
1020gttacaggcg tgagccaacc gctcctggcc ccaaaactta accatctaat ggttgagaga
1080gagacagaga gagagagaaa gagagagaca gagaatgtgt gtgtgtgtga agacaaagca
1140aaaataaaaa aatattaact aatggtgatt ctaggtagag ggtgtatgat tttagtagtt
1200tcattatttc aacttttcga taggtttcac aatttccaaa acagcagatc cagccatttc
1260atctgacaaa aactgttagc agcactacat cgtaatttat tgctaataat ctcattgttt
1320tactcttaaa attgtttcat ttactaaatt tccttagtga tgatggaggc tttatcatga
1380cagagtacag aggctctgaa atgagccagt gtctatgaag agcaccactg tttgcaagat
1440ctatgatctt gtacccagtt tcctttatct gttaatttgg gacattccat atctcttgag
1500tttgttgtgg aaataaatga gcaactttgc caaccacaga gtaaataaat aaatgttaaa
1560gagaataaaa gcatttttac ctcctctctc cctcttaacg gttatttcac tttaagatgg
1620taaattttaa gctttctgag atgaaaaatc attaaaactt aacaagaaca gagaaatgcc
1680atacatacat attttttgtt tgcttgtttc ctgagacaag gtttcactct gtcacccagg
1740ttgaattgca gtggtgcaac ccccaagttg caatcctcca cctaagcctc cagagtagct
1800gggactacag gtgtgagcca ccatgctcag ctaatttttt tacttttttg tagaaggggg
1860tctcactatg ttgcccaggc tgcctcatat tttataagaa tatgacttca aacacttagg
1920cattagcgac aaggttttgt ttttgtcttt taatgacaga ggtatacctc aacatatttg
1980acacaactgt tagagatttg gtttaaaaag aaatagacat ggatgaagct ggaaactatc
2040attctcagca aactaacaca ggaacagaaa accaaacacc tcatgttctc actcacaact
2100gggagctgaa caacgagaac acatggacac aggcagggga acatcacaca ccaaggcctg
2160tcggggagta gggggctagg ggagggatag cattaggaga aatacctaac gtagatgagg
2220ggctgatggg tgcagcaaac caccatggca catgcatatc tatgtaacaa acctgcacat
2280tctgcacatg tattccagaa cttaaagtat aatacaaaat gaaaaaataa ataaaaataa
2340gtagaaaaaa taaacatgta agcatgtgag ctgcctttcc taattctatg tttatgtatt
2400cactgaatac atagtatttt aaaatagtaa tccaataata tatttgagtg tttgtgacaa
2460gtatgaaaat tgtaattttt aaaaaatctt gataatatgc attgaatatg atttaattca
2520cttcactatt tgaactcttt agggattatt tttaaaaata tgattgatat cctttgatat
2580gttttggctc tgtgtttcca tccaaatctc atctcaaatt gtaatcccca cccgtctagg
2640gagggactgt aatccccatg tgtcgaggga gggaggtgat tgggtcatag gggtggtttt
2700cctcatgttg ttctcgtgat actgagtgaa ttctcatgag atctgatggt tttaaaagtg
2760gcagtttttc ctgcactctc atctctcttt cctgctggct tgtgaaggtg cctgcttccc
2820tttctgccat gattttaagt ttcctgaggc ccccacaagc catacggaac tgtgagtcaa
2880ttaaaccttt tgcctttata aattatccag tctcagatat ttctttaaag cagagtgaaa
2940acagactaat acattcttca atttaaaaag ccatactttc tcatacaagt tgaaaccaag
3000aacaatatca tgcataatca agtgattaac tgtgtaaaga taataaggtt gaggagttca
3060gagaagaaaa gaaatgaata gggaactgta gtgataattt aaaatagcca tccctcactc
3120agggtttttg atcttcaggc catgaagaag cttttaatgc tttttagcaa aggaagtaat
3180gttggtgaaa ggctttttct gacgactaat ggaaagcagt gctatgtatg gtgacttggt
3240tatgaaccaa aaccagaatg actggtgaga ggctgactga atacagcaag cttatgtgaa
3300gacaactgga gctggtgcag tggaaaagga agacagcagg actgtaccca caactcaaag
3360aaaaaagtca gaaggtacct cccgcagtcc aacctgaaaa caacaaagtc aaaggaatct
3420tttcaagaat ttggagctct cattcatatc ctaattagtg tatgaaatgt gaggtggctt
3480tgctataatg aaattacctg gaatatttct aacacaaaga aataataaat gcttgaggtg
3540gtgaatatcc tcatttgatc attacacatt gcatgcttat agcaaaagat tacatgtacc
3600ccataaataa ttgcaactat tatgtatcca taataattaa aactaaaaga ttaaaaatta
3660cctgaaaaaa aatgctaaac aggaaaggcc aactagtctt ggttacatat taaaaaacag
3720aaattcttct ctaacctcac tattggagaa atatcctgtt atttttatat atcttttttt
3780tcaccctttc ccaaatctga gcaagtatta taaaggtata accttcaaca atcttttatg
3840atgaggtatt tgcttactgg ggacaaagcc ccagtgctat tacatagtgt agctaaacgc
3900tgtagaatgg taaaaacaag aaaatgctca gcaaagtgtt gtttctcatt taatgaaaat
3960cttattttaa aacacaaaaa ctcaatatac cccaaccaaa aatctgatga acattttctg
4020tttaatattt attatacagt acctttaaaa acgtaatatt cttattctta aaaatttagt
4080gtgctagcaa atagcaatta agtacctaag tcaatcagga cgacaaaaaa atactcaatt
4140tggggagtta gttacttcta tcatctgaat gcgtccctcc aaaattcatg ctgaaaccta
4200ttcctcatca tggcagtatt aagaggtgaa gcctttgaga ggtaattagg tcatgagggc
4260agagtcctca agaatgggat caatgctctt ataaaagagg ccccagggag cttgtaaggc
4320ttttgcccct tctgccatgt tgggggggtg ggggtggggg cgcagcaacc agtgctaact
4380ctgaagcaga gagcagccct caccagaaac cgaatctgtt gaagccttga tctctgactt
4440cccagcctcc agaactgtga gaaataattt tctgttgttt ataaattacc cagtctaggc
4500tgggcgtggt ggatcacctg aggtcaggag ttcaagacca gcctggccaa tatggtgaaa
4560ccccatctct actaaaaata cagaaaatta gctgggcata gttgtgggcg cctgtaatcc
4620cagctactca ggaggctgag gcaggagaat cacttgaacc cagaaggcag aggttgcagt
4680gaatcaagat catgccattg aactccagcc tgggcaacaa gagggaaact gtctcaaaaa
4740aaaaaaaaaa aagtacacac tctaacatat tttggtatag cagcccaaat ggaatggact
4800aagacaatta cccttaaaat aaaagctccc atagagagat catgcattca agtacagagg
4860ttcttaaggg caatgggaat ggaggacata ttcctgcaaa cttttcaaca gctctcatta
4920gcccgatgtt agagctctgc aaagaagact aaattatact gagaaatatt tttaaatctc
4980cacaaatagg aatgctgtaa acgttgattt agtatatata aaattagaca agactaacaa
5040tatccaatgc aatctaaatc ttaggttgac agacaagaaa gccactgcaa acaggaatat
5100accacaatac ctgatcttgc cacatatttg taaatatgca aagtatttca ataacttcca
5160agaaacagta ttactctcat gagaaataac atgatgtaag tcacctttga aactgtcctt
5220gttacttttt caaatgtatg ttagtcattt cttaacacca aatgaaatga aaaactgagg
5280tggtaatggc tggctgctcc catctctcct ctactcatgt gccttcacca atacagcaat
5340cattttttct tatatgggaa atttacagtg ttgatatagc tcagagatat attgaagaaa
5400agcagaaaaa cgaaacttat aaacatttta ggaaacctta tgtattttct taaatagttc
5460aagtgtaaaa cttagaattc ttataaataa tgtgtgttac agctatattg taaatggtgg
5520ctcatgcctg taatcccagc acttcaggag accgaggtgg gaggagagct tgagcccatg
5580agtttgagac tcacccgggc aacacagaga gacctcatct cttaaaaaaa aaagaaagaa
5640agaaagaaat gaaatgcaaa gaaaaagtct ctatttcaaa tgtagccagt agagccaata
5700ggttaaccaa tattaacatt aacgttgata aaacaagaaa tgatgattta ctataagctg
5760aaaatcagac aatgtatgga ctttaagagt aacaggcacg atcatcacaa acttaaatca
5820ggtttgagtc ctatgagtta tatacagtta catgatgcaa caaaagatgc cagccagttg
5880ttaaagagta ttagattcgg ctgggggtgg tggctcatgc ctgtaattcc agcactttgg
5940gaggccgagg agggaggatc acgaggtcgg gagtccgaga ccagcctggc caatatagtg
6000aaacctgatc tctactaaaa atacaaaaac tagtcaggca tggtggcacg tgcctgtaat
6060cccagctact cgggaggctg aggcaggaga attgcttgaa cccagggggc ggaggttgca
6120gtgagccgaa atcgcgccac tgcactctag cctgggcaac agagcaagac tctgtctcaa
6180aaaagagtat tagattcaag tcctgtttct gtcatttatt atggaaccat ggacacaact
6240acctatcttt cctgaacctc agttttttca actgcaaaac aggaatatat acatatgtgt
6300atatatacat ctgtgtaaac acatatgtgt atatatacat ctgtgtaaac acatatgtat
6360atgtataaat ggagataata cctacattat agtttctgag ataataaaat gcacaacaca
6420attctgacac ataacaattt gtaacttaaa acataccatc accagggcca ctagttttag
6480aacactgtaa tgcatagtct aatttaatac tatgcaaact gtgttcactc aaggttttat
6540ttccttttaa tttcattcat ttactcttca gttgtttgta agctaaaaag tccagaatca
6600tgaaattcag aagtttacgt tttaatgttt ttctatatgg caaggaaaaa aaaaagggca
6660aagtcatttt aacactactt tcaaaatcag cctagaactt aacactaaag gcatgaccca
6720taaaagggaa tactaataaa tagacttaat taaaattaaa caacaacaac aacagctaag
6780cttttgttct gcaaaagatc ctgtgaagag aatgaaaaca taagccgcag gctgggagaa
6840aatatttgca aaccatattt ccgagaaagg tcttgtgtct ataatatata agaactccca
6900aaattcaaca gtttttaaaa aaagcaaata atccaattag aaaatgggca aaagacatga
6960acagacattt taccaaagag aatatatagg tggcaaataa gcatatgaaa acatatctca
7020cacatcatta gccattaaag aaatgcaaat taaaaccaca atgtgatatc attacacacc
7080taccaaaata tccaaaataa aaattagtgg taacaccaaa tgctggtgcg catgtggaaa
7140aatagtcctt cacacactga tggtacaaat gcaaaacagt acagtccctc aggaaaggag
7200tatggcagtt tcttacaaaa ctaaacatgc acttaccata tgaccaagta attatactct
7260tgaatattcc cagaagtaaa aatgtcttct ccaaaaaact tatacatgaa cgttcatagc
7320tgttttattc gtgagagtca aaaacagaaa gcaatcccag ggctacccat taaaacaggt
7380gaatgcttat aaactgactg taataggtct gtcccacgga atactactca gcaataaaaa
7440ggaacaaact actggtatat gcaacaactt ggatagatct caagggagtt atgttatgtg
7500aaaaaagtca atctcaaaag gttacacact gcatgactcc actgatataa cattagtgaa
7560atgacaaaaa ttttagaaat ggaaaacaaa ttagtagttg tcagaggtta gggaagaaat
7620gcagtaaggt aggtggctgt ggctataaaa gggtagccta agagatcctt ctgttgaaac
7680gggtatattt tgaatatagg gtgaatttac atatgtgata aagattgcat agaactaaat
7740acacacacac agtatatgta aaactaagga aatctgagta aggtttgtgg attatattaa
7800tacaatttcc tggttgtgat actgtactgt aattatgcaa gatgttagaa ttgggggaaa
7860ctagatgaag ggtatgtaga tctttctgta ttatttctta caattgcatg tgaatctgta
7920attatctcaa aataaaaatt tttttcaaaa tttcaaaaca actagtctag agctttgtta
7980atcaaagttt tctctgagga cctgtagcat tttggttatc acctggatct tattaaaatg
8040tagattctca ggctgcatat tggaattcct gaattggaat ccgcatttta acaagatttc
8100caagtgattc atgtttaaag tttgagaagc actagtctac aacaatgact tttaaccttt
8160caacctactc taacacactt gaaggccata acaaaattca catcaataac agttgctcgg
8220ttggacagtg actctcaaca caaatgagtg aggaaaggtg gggactcaag actcaggtag
8280caggaaaagc cccttaggtg atcctgatga aatgttttct ccatcctggc tgaaaaaccc
8340agaacagtca attaaggctc aaaacaaaag taatgtttat aatactggag atctttaaaa
8400ggcagataat atatactata acagagcaaa ggtaattatt acaatgtata aatcttataa
8460gaaccaaaat cagaattaaa atcactaagc acataatgaa aatcctttaa aaagtataaa
8520aatgaatgta gtctaagtaa atactaataa tggcagttat agtgagaaaa gctctagagt
8580cttttactct tcatacttcc tagtcacaaa catctatttc caaaactgac ccttcgtatt
8640tcaaataatt tatggcctgg tacagtaata agagcatgat atttaaagcc agtcagaaga
8700cacatattct agctctggat ggcacttgat gacgatggat tcagcttatg gttccaatcc
8760cagctctgtc aattagtacc tatatgaccc tagtcaaata cttaaacctt cttgtgttac
8820ttgtgtgtca attgtatcat ctataaaatg aggatattaa cagtatatac ctcatagatt
8880tttttgtgaa ggttatacaa ttaattcata taaagtattt agaacaatgt ctagcacagt
8940gaattctcaa tgagtgttat aattgttctt tttaaatgtg acttgactct caacagaact
9000ctactgaatt ctaatatgta ttctgtattg agctgtcaaa aaaaataagg attataataa
9060catatactat tcttgtagtc aaccctgtta ctatgttatt actagtgtca gttttgttgt
9120tttggtcata catattgttt tacatacatt aagaattatt agaaatgttg gtttattaaa
9180aatgaccatt tatggctaga agggtatata tctggctcac tgactgtgga gtcaatgtcc
9240ataaagagga ggaagaatgc catcagagta aaaggagatt ctattcactg aaacaaagtg
9300ataaaaagct atgaaagaga aaaacataaa ataaccaaag gggtgaaact taacagatgc
9360ccagtagatg cacaatgcac tgggttgtaa aacttaaaat ggccttaatt aaaagccaag
9420cacggatgga ggtgctgggg gagtctccta cggacacagc aggcagaatg taacaatgac
9480aaggggctca agtttattta aaaagagatt ggacaggccg ggcgtggtgg ctcacgcctg
9540taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcgggag ttcgaggcca
9600gcctggccaa catggcgaaa cctcatctct actaaaaata aaaaaaatta gccgggagtg
9660gtggcgtgca tctgtagtcc cagctactca ggaggctgag gcaggagaat cacttgaacc
9720tgggaggcaa aggttgcagt gagctgagat catgtcactg cactccagcc tgggcaacag
9780agtgagactg ctcaggatct cccaaagacc caaatccctg taaactgaat gcataatatc
9840atttgctcca gtgaggctta gatggacatt ctagtcttct tggttgagct gaagaaacaa
9900atattatatt gataatttat gtatgttgta tttttcaagg tatagcaaca agtttttatt
9960catcagctac tttgtgtgtg tgctttgttt ttaagtcttt tgaaacagga tggtgattta
10020ctacatttat aagtaaaatt tatttgattt acaagggttg cttaagtgta tcacaggatt
10080tcacttgtta tatttgcagg tgcttaaaaa atcagctata ctaaactata actggaatta
10140gcaaagttca tttattgatt aatcaagaat ataattagat ttgcctaact atataagtag
10200tactatgtgt tatttaagaa ttaaatctag aaaagggatg gactctggaa atatcaagaa
10260gtgaaaaaga ctgctctcat ttttgtacaa caattactaa atttctaagt agcattaatt
10320gaactgaaaa ggcattttag aaaaactaga ttttacaatt tataactcta ataaaacaca
10380actaactatg agtgtgcttg ttcatgccca aaagctacct tccaaaatta aaaaccctat
10440tggatggctg ggtgcagagg ctcatgcctg taattccagc actttgggag gccaaggcgg
10500gcggatcacc tgaggtcagg agttcgagat cagcctggcc aatatggtga acccgtctct
10560aacaaaaata caaaaattag ccgggcgctg tggcgggtgc ttgtaatccc agctactcgg
10620gaggctgagg caggagaatc acttgatcct gtgaggcgga ggttgcagtg agctgacacc
10680gtcccactgc actccagcct gggcgagagc ccagagcgag actccgtata ttaaacaaaa
10740caaaacaaaa ctcaaaaaac cctattggca attactaggg ccatcaaatc agtatatttt
10800cacttgacac acaattttga gataatgaac cgaacttact atttttgaaa atattacata
10860ataaatatta gtgaagcttc attgctgaaa tggtgacaaa gatgaatagc aataaaactt
10920ttcttataga tctttagcaa aaacaaaaaa accccaagca tactatggta cattacttta
10980gagaatcaag tagctgctag ttgagtaata gtggtaatag gcactacaat gatataaaca
11040aattacaaca aagaatattg tttttatttc ctgtccatgt tttaaaaaag ctttggtttt
11100acctatgttt aacaaaagca taggtacaac aacgactact actactaaca tataagtagc
11160ctggatagaa ttatcttaat agtagtaccc aagtgcagga tctctaagta atgatcagaa
11220ggcaggaata aattttatca gaaatcttca ttcattacat atttactatg catttaccag
11280ggtatcacta tgctaatgga tacaaagata aataacatgc aaacaactgt aatacagtgt
11340tatgtgataa cagaaatatg tacaaagcac tatgaaaaaa attacaaagc ttgagcacaa
11400attttaactc tggacttact ggcatttaga gcaaaaccaa aacaatccta actggttaat
11460ttcattttct aagagttgga agctatatca gtaggtacaa agtaaaatat gctaattgtg
11520gtagaaagta aaatattaca acagtagaga atttcaaaag aagataaaaa taatggaggg
11580aatatagaag gtcttcaagc ttccagcttg aaatacatat ttttttttaa atagagaaag
11640agataaagtc atttgagtat tcagagggca gactgaatat aatggtactt ctgagaaatc
11700agtggataag gagagaaaag tggactaaag gccatagcat atagagcttg gaatgtcaaa
11760tgtagtggaa ataacaaagg tttggttgga atcccaactc ccaacaacgt actgtgtatc
11820tagagcaaat tacatcaacc tttgggagta ctgtttctga atctgaaaaa tgaggaaaac
11880ttatctttga acaattgatg tgataattaa atgagatata tgaaatatct aatgtaacaa
11940gtgcttaaca atgactagtt cttttcattc ctctcttgaa ccattgtgaa acgtagaacc
12000aagaaaggta acagtattta gttgttacag aacccattaa gagagaataa aaaataactg
12060gtattctaac ttcagtttcc tttgaagtct tgttaatgag aataaatatt atgtggcaca
12120aagaaaaaga aaacaggggt ttacacagga tatgctgcca gactttacca acaatgacac
12180atgatatctg cttcaactgt cccatgcata tttggcttaa gatatattca tgcatatcaa
12240attttacatc acatggtttt caaaagaaga ttcattaaaa ttagcttaag aatgtacaca
12300atatacaata cctcattaaa taaaaagaac agaccatttc caaatgaatg cttttagagc
12360tttacagtaa acagtctttt ggtggtagaa agagggggaa cagagagggg agtgggtggg
12420agtctgtagc acttatcaga ctacttttat cctttatgta gagaaatagg agagttgaaa
12480ataagcactt tctgtactta tgttgagagt ctgaagccca cttttaatag tcttgacaac
12540actaaaaaat aataattaac atttgaaaag ctgtcattat tatagtcagg gacacttaat
12600ctccaaagga gaagtttctt aattgatact atgattaaat aaaagcatcc atcagaatta
12660tatccacaat ctggtttgga gtttatgttt tgtcttattt aaattgttat acttattata
12720attctgtcta gacagtgcca aatgtacttt gtcatacaaa cacttgaggc aaattttctt
12780caaataagcg caacactttg tttcctcttc gtatcctttg actgaataac gtgtggtaca
12840gagaagtaat acttcccttt cttgggatcg agatcaattt gatgcttgtt ataagcccat
12900ttacagaaca aatggtattg cttttaaatt tttatatgaa cttatcagta gactagccaa
12960aaaagaagct tcatataaaa gtgctaggat tgatattctt agtaataatt aggtaaattc
13020tctaaaattt tctcccaaaa gatctgaaaa atcataccaa gggaagtata gtttaaattt
13080cattatatat aatagcttta aaatatcttt gctaattcta cccaaagcca cactaaaaag
13140actaatacaa aaagaatgta attaataaac tattttcctc tgaagaatca aagggcactt
13200ctgcatatga acatgtttta tccttttggt gtacttacat aaaataatta agaaacactt
13260ttaattagta taaacaaaga aatcaaaata gcaagaagaa atgtctgagt aaaagcagct
13320gtgctgacct caaaagtgaa attctgttct cttgatgccc agttaagtgt ctaacccagg
13380gaaaagtgat tctaaacctg ggctaggagc tagtggagct cttcaaacag tctcacctac
13440cctcacccct caaggaatgg tctatgggtt ctgtggtgaa cgctaaagtt tataacatgg
13500gaatatttat tattttgttt ctaacacaaa taatttttaa aaatttattc tactaaagta
13560acatcaaagg gaaatttcat aaaaattctt ttgaaatttt tagaagtagc aaataaaggc
13620aagtgataaa tattttacag atttcaccac ttacgtaatc tgatcaacaa attttaaaaa
13680catagcactt gaatactatt aaaaatatat taaaaaggta acatagtaaa actataaaat
13740tctttaaaaa aaatataaga ggaaaccttc gtgaccttgg attaggaaat ggtttcttac
13800atacggcaac ctaaaaatac aagcaaccaa agaaaaaaac agacaaactg gacttcatca
13860aagttaaaaa cttttgttct tcaaatgaca tcatcaagaa aataaatccc acagaatggg
13920acaaaatatt tgcaaaccat atctgataag agaccactat tcagaatatg taaagaattt
13980gtaaaactta taaataaaaa gttaaagaag tcaattttaa aatgagcaaa ggatctgaag
14040acaattctcc taagaaatac gaatggctag ttaaatgcat gaaaagatgt ttagcatcac
14100tggtcattag gaaagagcaa aaaccaaaat gatatactcc ttcataccca ctaagactgc
14160tgtaattaaa actatagaaa ataagcgttg gcaaggatgt ggacaaattg gaaccctcct
14220catacactga tggtagaaat gtaaaatggt gcagatgctt tggaaaacag tctgacaata
14280ccccaaaggt ttaaacgtgg aattaccatg caacccagca attctactcc taagtatcta
14340cccaagagaa atgaaaatat atgttcacca aaacatttgt acataaatat taactgcagc
14400ttttattcat aatagccaaa aagtggagac aatccacatg tctatcaatt ggtgaattga
14460taaacaaaat gtggtatctt catacaacta ttactgggcc ataaaaagaa tgatgtattg
14520atacatgcta caaaatgaat gaaccttaaa aacaatatgc aagcaaaaga aaccagacac
14580aaaaggccat atattacatg atgctaatta cataaaatgt ccagaaggga gaaataaatt
14640agtagttgcc aagggctgga gggaggggga atgatataag tgactgccaa tgggcatggg
14700gtttcttttt agggtgatga aaatgttctg aaattttatc acgggaatgg ttgcacaact
14760ctgtgtaact tagaattcag tgactcctaa aaccaatgaa tagcatgctt taaaaggtga
14820cctttgctga gcatagtggc tatagtccta gctacttggg aagctgaggc aagaggatca
14880cttgagccag gagttccagg ctgtactgca ctatgatcat acctgtaaat agccaccata
14940cacaccagcc tgggcaacac agaccatgtc tctaaataaa taaacaaata aataaataaa
15000agggtgacct ctgtagtatt gagattatac ttcaagtaag ctgttattaa aaaaaaaaaa
15060gttatcatat gggtggcagg ggaaatcatt ctgggatgat ggctaacttc atcagtattt
15120gatttatacc tatgcatcat accttatgtt tgttttatgc attttgtggg ttttttaaaa
15180aaattatatt tcataaaaac aaattttaaa aaaattaaag tcaagaaccc caaaacaaca
15240aagatcagag atacatttct accttatcaa ttcagaaaaa ttacaagttt ttttcttaaa
15300aattgtatag catcatggtg attttaagtt acctgtagga atttaaataa ctttgtctta
15360actgttcacc aaaactcatt taatattcat gttctgatac tgaaaatgaa gctgaaaagt
15420tttgaaatta caatatgcta gtttaaaaag gtttactaaa atacataatt tcattataag
15480gagtaatatg aaataaaagt atcaaatatg ggaccattaa aaatgtcctt actaacaaat
15540tgctacccac attgtggact cactgcgtcc actgtttgcg agcttttcca gaacgctcgc
15600caccagttag ggtagccaag aactcctcat cttcactttc ttcctcacta gcttggaacc
15660tctggattcc cacccacact gctgtgacct gaatggggaa gagaaacgcc atagtaaggg
15720aactcttcct tttatagatt tctgaattag aatctggcat tacaaaagaa caatgttata
15780aatccaggtc agagtttata gttctatttc actattactt atatggcttg tcctaggaac
15840ttaactatta tttacaatgt aagtacctat ttccacaaaa aaattcaaaa ttttggaata
15900caatatctga agagagaatg gtctattgaa tccaaagtag gctgatacat cccaacagta
15960tttcagattg agataataat aataccacca attcatcaag tcaaattata tgcttatttt
16020ccacaatgga agttttaaaa tagtataaac attttaatat atagcaggct taacttatga
16080ttattaaaca gggttctaag aaaatagtat acatcaaata ttaatgtgct tcttgtataa
16140tttaggtgac aatttatcca tctgagaaat gcaaaagaga ctttggtaag gggttgagta
16200aggagcattc tgtgtcaaag aattcactag caaaagaggg tatactgtag ttacaagcta
16260taatcactgt acttatttta aatccctctt cagaaccagg tcttaaaaga tgataaacat
16320ggcctcatga ataactatca accaaactat agaaaagagt gcaagagtgt ggtgttctaa
16380cttaaaatat ggtgttttat tcaaataatt ttatttaagg ctccaaaagc agcagcctca
16440ttccccagaa atcatagtta aatgaaatct tccttactaa aggaaaaatg aatcacaata
16500tttaacgtga acattttaaa aacactctaa agcaacaaaa ctattcaatt gtatgtgata
16560tggcttagaa aggcatgtag gtaaaaagga ctaaaaactc taataatggt tgggccaaaa
16620gtaaatttgt tagttctact ccattaagca ttcctcaagc agtgtaaaaa tcagagttca
16680agttacactt tgatgtgtag atcctttgaa agccactcta ccctgtttta tatgaagcat
16740ccgcagctaa aatgaacacc tagtgaagag tatgaatgct gcaatacata agcagacgtc
16800agaattgtcc caagctgatt ctaagttact ttaaacatgt atgcagagtc agaatatgac
16860ttacttctta gaagtaacag ataattacct ttggcataat gaaaaaaact ttaaatgtaa
16920gttaatacag gtattttccc tttagcaaag ctttgctttt aaaagaaaac ttcaaaactt
16980aaattaaaat aggaaatgct ctactatgta gtaaaaatac tttttagatt actgaagcaa
17040agaaaaggaa ggattctatg agggaggaaa agtgggagaa aaatgtaaag aaaaaaagga
17100agaaggaaag aaaagagaaa aggaggaaag aacacaagga cagaaaggcc tattgaaata
17160tattatttct ttcaaatttt aaacgagcag aataaattct tttgttttat aactatgaaa
17220taatctatgt tcctcttatc tatgcttgga aaatttagac aaaatgttaa gagtaagtac
17280tacattggat ttccgggtct tcagctctga aaacaagctg tttcttaaca tacgtcaatt
17340ttctatattt catgtcattt ctatttgcaa atgttataaa gttcaatatg atgtaaaaca
17400tggttaaatg aagttcaaaa ataagtataa catacattag tttggctatt ccaaatttca
17460tgcacattaa ctcagccaca catctaacac agtcagccct ccctatccag gggttctgca
17520tctgcagatt caactaacca tgggtcgaaa atgtttttgt accaaacatg tacaggcttt
17580ttttcttgtt atcattccct aactacagta taacaactat tttcacagtg tgtacatgtg
17640tatgaaatat tataagtaat ctacagataa tttaaagtat acaagagggt atgcataggt
17700tatatgcaaa tactacacca ttttatatca gactctcaaa catcagtaga atttggtaac
17760ccagggaggt cctggaacta atcacccaga ggtatcgaca gatggctata tataaatcac
17820tcagtgaatt caggattcac attatttcac aactagtata attttatgtt gttcacataa
17880ttgtgtcaca acatatacat gcagacaggt gactttcatg aaaagattac acccaagata
17940gacatatggt ctactcaaat acggtttcca aatgtgtatc caatcttgtt taattataat
18000caaactcacc attccattga taagcgacct ctaccaacct gcttatcccc tccaagcaat
18060ataacagtgg ttctctgaac caatattgac cctcctttaa attgatagcc tttttttaaa
18120aagctaacca ttgagaagta catactgttg aagacagaac atattctgta aaatgctccc
18180aagatatcaa agtcagatga tacaactgaa tgtttatgct agattatatt tctaagctga
18240gaattacatt ttaatatacc ataagcaatc tgcaaaagaa gcaacttgcc taaagatttc
18300aggagtttca agtatgcata tgtcaatatc tgtatcaata tgtaatatca atataatcaa
18360tgcacacaac aatacgtaac tgtacttata tcatctcctt agcactaatt attacaaaca
18420atctgcatgc actgcaaagc aaaagtataa tataaaatcc caaaaaacct tgaaaattta
18480ataaaaccaa aaaacaggca tcacacacaa gaactgaggc gtatacttca ttaatgagta
18540tgatatcctg atatgaaatg tcaaacaaaa ttacccaggc tcaggttaga aataaagata
18600ggacattagt ctttgtattt ttaaattgat tttttcttct aatattcctt aatgataacc
18660ctatatatta cctacttaaa attattagca aatagttatt ttaaaagtat gagtaattag
18720accaaaagca actctcatat ttacccaaaa gaaggaacca ctaccaagaa tcaaagccta
18780gtaattctgt tcttaacaga caggtgttgt gtattctggc atgttacatg aaaatcactt
18840atgagaagaa cagaaaaaaa aattagaagg tagttttcac tatggaaata ggtaagtgat
18900taagcagatt ttcttacacc atgaaattgt cagcagactc aataatcacc ctaaggggca
18960tcattctgga tgccgacatt ctctatgatg gaaagggact gaaagtaaaa tgcactaatg
19020acataaagaa accaatatcc aatagtaaag ttgaagaaat aaacattctt tggacaggaa
19080ctaagctgaa gtttgcaact accaagaatg tattatgcca gcagtaaatt aggaaactaa
19140agcccatgtc aaccaatgaa aaatgggagg actgaaatca atcattaaag cagcagcaag
19200gttctaacta ttctaaggta taggctacct ctggcgtata ttatcagagt tgacaattct
19260tccaagaaat tctaacatca actgtaatct gaggtccttt aaaaaataat ataaaccagg
19320cagtagactt acattttgta atattttctt ctaagagctg tacattaaga ttttatttgt
19380gatataaata ctatcaaata attagctata gaacagctct attttcaaca gttataacat
19440tttaagccat ctcacattta acctaaactt ttatcaaatg tcaaaactga ggccgggtac
19500ggtggctaac acctgtagtc ccagcacttt gggaggccaa gatgggcgga tcacttgagc
19560ccaggaattc gagaccaacc tgggcaacat ggtgaaaccc catctctata aaaaatacaa
19620aaattagctg cgcctggtgg tgtgcgcctg tagtcccagc tactagagag gctgagggag
19680gagaatcacc agggcctggg agatcaaagc tgcagtgagc tgagatcgtg ccactgcact
19740ccaccctggg tgacagagtg agaccctgtc tcaaaaaaaa aaaaaaaaag aaagaaagaa
19800aaaaaaatca aaactgatca cttgaggtcc aacttatgtt tactatatct acttatattc
19860ccaaagacat cttaaggaga gatgaaatca taaaaaggtg aggatgagaa agaaaatagt
19920aagtcagtaa ggtcaatttt tacatatatt aggctagcat aataaaaata tgagtgtctt
19980attattattt ttttttgaga cagagtcttg ctctgttgcc caggctggag tgcagtggtg
20040caatcatggc ttactgcaat gtctgccttc caggttcaag caatccttgt gcctcagcct
20100cctgagtagc tgggattaca ggtgtgcgtc accctgccca gctaattttt gtattttcag
20160tagagacagg gtttcaccac gttaaaccat gagtttggcc aggatggtct caaactccca
20220aagtgctagg attacatgcg tgagccactg cgtctggcct aaagtgtctt attataacca
20280agaatttatt tgtggagaga ggtaaagaaa actcattttt agtgaaataa ttaaaactgc
20340atcattcaca atctatcttt caaaatgagg tattaactat tttggcttct aaaattaccc
20400catatactac atgcatgagc atgggaattg aagttatttt attcctaagt ttgagacttc
20460atgttttaat gtgatcacta aaaatttcct aattgatgat taggaaaata actttctgta
20520aaattccaga attttagctg tttcaatctc ttcatattaa ggggagaaca ttatgttttt
20580actttctgtg catgcacttt ctttattaga agaaaatgga ctgagggcag taagcaaccg
20640aaaaggaaga gtaataagaa gcctgatgtg tgtgaaaact ggagaacagt ctcaaatcat
20700aaaaagttat gacagaagag gcataaaaaa taaaagtaat gaacttaata tatgaaaggt
20760aataatgatt aagagcatag gctataaagc cagactggac tccctggatt caaatcctgg
20820ctcttctaat tactaggtag gtaaccctga gcaagtttca atgaccaatc tttttctcaa
20880ttacctcagg tatataaagg ggacagtaac agcatttaac ccagaggaca ataaggatta
20940aataaataca tgtaaaataa tttaaaacag tacctggtat tcaataaagc gcaataaatg
21000ttagctgcta ttattattca tctaaacttt actttcatta ccagcaatat tttttaatct
21060taaaaatatt gaataaaaca atgacctagc ttagtaaata aattcataat gagaaaatgt
21120tgatttcatt taataataac tttagtagtt tgggataaca ctttgcatat tttaatttcc
21180ccagctataa ataactcaaa taatttgcca tcagatgatc tgttattttg aagttaacaa
21240ataaagcatt tcctaaaaaa gttctaatac ataacttttg ctctcatctt atgttttaaa
21300aacaaaatgg caaatcatct gcatcaaata gttcctactc ttataacatg acaattgttt
21360taaaatatat ctgctggaaa aagcaactga agtcctagaa aatagaaatg taattttaaa
21420ctattccaat aaagctggag gaggaagggg aaaaacatat ctgccaaata agcttataat
21480taatagttgt tttcagtttt caaaaatcca cataggaagc aatttaagcc taaattgcct
21540aagtctcaat ctcagcgtag tagatagctt agggcaatca aaacttgctg tgttgggctg
21600ccccctacag gactcaattt acctatttct tttaaaaggt gtgtaagtag gaaatatgat
21660tcaagtttta cattaacaat attaatgcta aagcagatga ttatcattca cgcattcact
21720ataggaggaa acagtctctg agaaccatct atagagatac agagagaaat gaaacaatcc
21780ttgtccttga ggaattaata gtttactgct tacagagaaa ctacatacat ggtgaaatat
21840ttaaaaatag ctcatgatat cctctatgat attatgtttg ctatagaaaa agaacaaggc
21900tgaagatcta agatccaagt tctactgttg gctctgccat caaacaataa gctaaacaat
21960gtacaagtca gttttgggga agctgtctta ttcccaaaat gaggaggtta aattagttaa
22020ttcttccagc ctctatggct ctaatattcc acagttacat ttgtcaaaac aaaaggtaga
22080aggaaatgtt tcaaaaacag acttcgcaga aagaacatct atatgatatg aagggctggg
22140gcatatgtga agaaatcaag gaagacttct tgaggaaggt gacatctgaa gtaactttag
22200aagcactctg ggagccaagg ctattcccag gagttaacag agtcagataa taaaagatca
22260aagatgttta ggggaatagc atgcagtgtt atttggttgc agtctagcta tattttagga
22320aacatcaaat taatatcagt ataaaactca acagaatgga gggagaaaaa gcaggtagaa
22380aaatctaaga accactaaaa tagttcatct agaagataaa ggacccatga gctaaatcag
22440tgcaaatggc aagaagggaa taaatgaaga cagttctggt ccattagaac tgcaactcaa
22500caaaagtgat caaaagagtt attccaaagt attgacctgg taacttgaag aaaagtaaag
22560aaagaggaaa ctggacactg aaacagaaga agtagattat gtatttggta gtgaatggaa
22620gtagattggt gggaccagtt agaacctcac agagaagaac tatgttaaga ccagaaatac
22680ggccaggtgc ggtggctcat gcctgtaatc ccagcacttt gggaggcctg ggtgggcgga
22740tcacctgagg tcaggagttc aagaccagcc tgacaaagat ggagaaaccc tgtctcccct
22800gtctgtacta atacaaaatt agccaggtgt ggtggtgcat gcctgtaatc ccagctactc
22860aggaggctga ggtaggagaa tcgcttgaac ccgggaggcg gaggttgcag tgagctgaga
22920tcgcaccatt gcactccagg ctgggcaaaa agagcgaaac tcttgtctca aaaaacaaac
22980aaacaaaaca aaacaaaaca cagaaataca tcaattaaaa aagtgagcta ttcaccagat
23040atgttccact ggtcataaaa caaaagaata caggaggcat gacaagccat catcattgct
23100gttaaaataa ctcacagcaa aattataatg atttaagtca ataacatcta ataattccag
23160ctatagtgtg caatttaatt tattatgtgc caggcacaat agtttattaa aggtattacc
23220tctaattttc acaataaccc tattttacag attataaaat ggaggcccag agatgtaagg
23280tgaacgagcc aaatcaccta gttacctgga atataaactc agaactgcct aaatcaaaag
23340ctctcaatct taaccacatg ctatactgat gcatgtcaaa gattcaattc attcagattt
23400ttcaaggtta tcggaaaacc tatgtagata aaaatttcca aaataatcaa ggatatgtaa
23460cttttacaga aagcaatcac tgatcatcta ttgcaatact catgttctta agcaatatac
23520tgagttgaaa tttttatatt ttataaataa ttagaaagaa tacatttttt aaaactttaa
23580aaaacacctc agtttttatt ctcttcccca aatttcaaca aaatccattt atccaaactt
23640gaggttgaat cattaaagtg gtgatatcat cagtaatagc agagtgagga ccctgaatat
23700actctcctcc ataaaagcaa caagaacaca aaaattctca aaatgaactt tttctgaaat
23760ctttcaaaag ccccactctc agaaaactgt cattatttga tctgccagtt ccctagaaaa
23820acctccctca taggacatta tttgacttga ctcagagctc actcagtgca aacaatttta
23880tcaccaggag agtttgtgga aaatcagtgg caattgttaa acatcacatc tgccatgaga
23940tagcaataac agatgggaca aacaagctaa ccaaaaaatt aaaagaaaaa cctgggaaat
24000aagaaatcca aagggggtct gaaaagttct aacatatttc tgataatcca gaaagccata
24060cacatgtata gagctgtgta cacgctcaaa aaacatctac gaaggcccta aactctcacc
24120tatgggaaac cctgaggctc tgtacaagaa gaaagtaaaa tccagttata aattgcttgc
24180cgtatcattg aaggcaatgc cccaacattc acacataggc ccctggcaaa gattggaaga
24240tactctagtt ctaggcattc aagaaaatct cttctaatca tcagatgatc actaaactca
24300ccaagcagta actttagggg cctgtgtgat aaaaaataaa aacctgaaag aattagttca
24360ggaaagaaac taaacaagca acagcaacaa caaaaacaga ccttgggaaa ggggggaagc
24420atctggtttc cagagttatt ctgttatact atataaaata ttcaggtctc aacaacaaca
24480aaattacaaa gacatgcaaa gaaacaagta taagccacaa actgggggga aaaagcagca
24540gaaactggcc ctgaaaaaga ccagatgctg gacttactgg acaaagactt taagagagtt
24600attttaaata tgcgcaaaga actaaaaaaa agtttatcta aagaactaca ggaaagtatc
24660agaacaatat ttctgatcct tcagaagaac cactttttgt cactacagat tagttctgtc
24720tggtctagaa cttcttaaaa acagaatcat agagtatatt ctctttatat cagctctttt
24780tactcaacac aatgttgtgt gagatttatc catgttgttg catgtatcat tcccaaacag
24840aaatagaaat tatagagata aataggagtt acaaaaaagt accaaacaaa aattctggag
24900ttgaaaagca caaaaactga attaacttga ggggctcaac agctgatttg ggcagccaga
24960agaatgaatc agcaaatcta aagataggtc aattgcgaga aagagaggga agaaggaagg
25020aaggaaggaa aggaggctca gagacccaag agacaccatc aggcatacca atatacatat
25080aatgagaggc ccagaagaag atgcagaaaa agggtcagag tatctgaaaa aataatggcc
25140ctaaacttcc cgaacttgac cccaaaaatt aatctacaca tccaagaaga taaacaaact
25200aaaaagaata aaatcaaagc gatccacacc taggtacatc ataatcaaat gactgaaata
25260taaagagaga ctctcaaaac aggcaaggga cttatgtaca aaacatcttc agattaataa
25320caaatttctc atcagaaatg atgttgtcaa taggcaatca gatgacataa tcaaagcact
25380gaaagaagta gaatgtctgg gacctggaat gctggtggac acctgtaatc tcagtatttt
25440gggtggccaa ggtgggagga tcacttgagg caaggagttg aagaccagcc tgggcagcag
25500aaagaggctc tgtctctaca aagaataaaa agattggctg aatgtggtgg tgtggacctg
25560tagtcccagc tactcaggcg gctaaggtgg aaagatcgct tgagcccagg agttggaggc
25620tgcagtgagc tatgactgtg ccactgcact cttgcagtgg agaccctgtc tctataaaga
25680aaaaatgtca accaaaaact acatgcagaa aaactgcact tcaagaaatg atcagtacct
25740tgaagctctg aaggtgctta agactgtaga tcaataccat agaaaataat ttagtattta
25800ggaatgtaag aaaattaaga cagccttgtt tgataactac acataatact gtaactgttc
25860ttgcactgtt ctggttattg tcaagctatg agcacaaact gatgactgaa atacagaata
25920cagaacagga tataaaatct tatcaggtaa agttaggcaa gcaattacta gttgtaattc
25980aacttgaagg agaaggaata aggaaccaac tcaaaccagg cagcaatgaa ttgtaaaaaa
26040gcttaaggta aaacaaacag ggaaataaaa caactcagaa cctaagcata tcgtaagaac
26100ctaatctaac aaggaggggc ttaaactgat tattttacag cttgggtgca attatcccac
26160aaaaaacttt caggagtttc accagtccat aaactatttg gttattagaa aatagcttta
26220ttgggctacc ctctttgggt cccctccctt tgtatgggag ctctgttttc actctattaa
26280atcttgcaac tgcactcttc tggtccgtgt ttgttacggc tcgagctgag ctttcactct
26340ccatccacca ctgctgtttg ccgccatcgc aggcctgcca ctgacttcca tccctctgga
26400tctagcaggg tgtccgttgt gctcctgatc cagtgagacg cccattgccg atcccgactg
26460ggctaaagac ttgccattgt tcctacgcgg ctaagtgccc gggttcatcc taattgagct
26520gaacactagt cactgggttc cacggttctc ttctgtgacc cgtggcttct aatagagcta
26580taacactcac cgcgtggccc aagattccat ttattggaat ccatgaggcc aagaacccca
26640ggtcagagaa cacgaggctt gccatcatct tagaagcagc ccgccaccat cttcggagtt
26700ctgggagcaa ggaccccctg gtaacaattt ggcgaccaca aagggacctg aacccgcaac
26760catgaaggga tctccaaagc ggtaatattg gaccactttt gcttgctact ctggcctatc
26820ccttagaatt ggaggaaaat actgggcacc tgtcggccgg ttaaaaacga ttagcatggc
26880cgccagactt tagactcagg tatgaggcta tctggggaag ggctttctaa caaccctcaa
26940cccttctggg ttgggaacct tggtctgcct ggagccagct tccactttca attttcctgg
27000ggaagccaag ggctgactag aggcagaaag ctgtcgtccc gaactcccgg cattagccgg
27060ttgagatcat gtcgcagcca gaagtctcta ctcaacagtc gcccatgcgt gcgctcctac
27120cttcccttct gtcccacacc tcctgggtcc caaccacgac tttcttgaaa gtgtagcccc
27180aaaattctcc ttacctctga atctacttcc tctgatccct gcctcctagg tactaatggt
27240tgagactttc atttcctcta gcaagttgta tctccaaagg gatctaagga agctctatgc
27300tgcgccctta ggcatctagg ctataaaccc agggagtctt gtccctggtg tccctcctga
27360tttaggtata cagctctaga catgggcagt tatgtgggac ctgttcccca ccacccttgc
27420cagggcccca agtttgtaaa tggctaagag aggaaacaga gagagacaga gagaaagaga
27480cagtgagaga cagacagaga cagagagaga gagagacaga gaggagagag agagagacag
27540ggaggacagg gagagagaca gagaggagag ggagagagac aaagaggaga aagaggcaga
27600gagacaaaca gggagtcaga gaaagaaaga caaagataga aatagtaaaa aaaaacagtg
27660tgccctattc ctttaaaagc cagggtaaat gtaaaaccta taattgataa ttgaaggtct
27720tctccgcgac cctataacac tccaatacta ccttgttgtc agcgtaaaca agggcgtagc
27780ctgaaaacac taagaccact gacaacccat agccttccta tcaaaaatcc ttaacatcca
27840gtgacctgcg gatggcccaa atgcattcaa tctgtagcgg caactgcttt gctaacagaa
27900aaaagtagaa aagtaacttt tagaggaaac ctcattgtga gcacacctca ccggttcaga
27960attattctaa gtcaaaaaag caaaaaggta gcttattaac tcaaaaatat taaagtatgg
28020ggctattctg tcagaaaaag gtaatttaac actaaccact gataattccc ttaaccctgc
28080agatttcctt acaggggatt taaatcttaa ttaccataca aaggtccgac cagacctagg
28140aggaactccc ttcaggacag gatgatagat ggttcctccc aaatgactga ggaaaaaacc
28200acaatgggta ttcagtaatt gatagggaga ctcttgtgga agcagagtta gaaaaattgc
28260ctaataattg gtctcctcaa atgtcagagc tgtttgcact cagccaagcc ttaacgtact
28320taccgaatca aaaagactat ctcaatcctg actcaaaagc ttacttatac cctctctgaa
28380acgaatttgc ctaagaactg ttgtttatgg gaatgcatct tgatggagca gctgggttgt
28440tatgaaatac tcaggaactc agcctagctc taggactcac ccctgagcac aaaggcaatg
28500ttgggcacgc tggtaaagga ccactagaat ccagcagccc ggaccccttt ctttgtgatc
28560aagaaaggcg ggaaaagggg tgagggctgc tacatcagtg agcataacta atccgataag
28620cagaggtcca tgggtggtta cacaccccgg aaaggaataa gcattaggac catagaggac
28680gctctaggac taatgctcat cggaaaatga ctagtggtgc tggcatccct atgttctttt
28740ttcagatagg aaacgttccc ctcaaggcaa aaacacccct aagatgtatt ctggagaatt
28800gggaccaatt tgactctcag atgctaagaa aaaaaagaca tattcttctg cagtaccgcc
28860tggcaacgat atactcttta agggggagaa acctggcatc ctgagggaag cataaattat
28920aacaccatct tacagctaga cctcttttgt agaaaagaag gcaaatggtg tgaagtgtca
28980tacgtacaaa ctttcttttc attaagagac aactcgcaat tatgtaaaaa gtgtgattta
29040tgccctacag gaagccctca gagtctacct ccctacccca gcatccccca gactccttcc
29100ccaaataata aggacccccc ttcaacccaa acggtccaaa aggagataga caaaggggta
29160aacaactaac caaagaatgc caatattccc cgattatgcc ccctccaagc ggtgggagga
29220gaattcggcc cagccagagt gcacgtacct ttttctctct cagactttaa attaaaatag
29280acctaggtaa attctcagat aaccctaatg gctatattga tgttttacaa ggtttaggac
29340aatcctttga tctgatatgg agagatataa tgttactgct aaatcagaca ctaaccccaa
29400atgacagaag tgtcgccgta actgcagcct gagagtttgg cgatctctgg tatctcagtc
29460aggtcaatga taggtcgaca acagaggaaa gagaacgatt ccccacaggc cagcaggcag
29520ttcccagtgt agaccctcac tgggacacag aatcagaaca tggagattgg tgccgcagac
29580atttgctaac ttgcgtgcta gaaggactaa ggaaaactag aaagaagcct gtgagttatt
29640caatgatgtc cactataaca cagggaaagg aagaaaatcc taccgccttt ctggagtgac
29700taacggaggc attgaggaag catacctctc tctgtcaact gactctactg aaggccaact
29760aatcttaaag gataagttta tcactcagtc agctacagac attaggaaaa aacttcaaaa
29820gtctgcctta ggcccggaac aaaacttaga aaccctattg aacttggcaa cctcagtttt
29880ttataataga gatcaggatg agcaggcaga atgggacaaa tgggataaaa aaaaggccac
29940cgctttagtc atggccctca ggcaagcgga ctttggaggc actggaaaag ggaaaagcta
30000ggcaaatcaa atgcctaata gggtttgctt ccagtgcggt ctacaaggac actttaaaaa
30060agattgtcca aatagaaata agccgccccc tcgtccatgc acctcgtgtc aagggaatca
30120ctgtaaggcc cactgcccca ggggacgtag gtcctctgag tcagaagcca ctaaccagat
30180gatccagcag caggactgag agtgcccggg gcaagcacca gcccatgcca tcaccctcac
30240agagccctgg gtatgcttga ccattgacgg ccaggaggct aactgtctcc tggacactgg
30300tgtggccttc tcagtcttat tttcctgtcc cagacaacgg tcctccagag ctgtcactat
30360ccaaggggtc ctaggacagc cagtcactag atacttctcc cagccactaa gttgtgactg
30420gggaacttca ctcttttcac atgcttttct aattatgcct gaaagcccaa ctcccttgtt
30480agggagagac attctagcaa aagcaggggc cattatacac ctgaacatag gagaacaccc
30540gtttgttgtc ccctgcttga ggaaggaatt aatcttgaag actgggcaac agaaggacaa
30600tatggacgag caaagaatgc ccgtcctgtt caagttaaac taaaggattc tgcctccttt
30660ccccaccaaa ggcagtaccc ccttagaccc gaggctcaac aaggactcca aaagattaag
30720gacctaaaag cccaaggcct agtaaaagca tgcaatagcc cctacaataa tccaacttta
30780ggagtacaga aacccagtgg acagtggagg ttagtgcaag atctcaggat tatcaatgag
30840gtcactgtcc ctctatacct agctgtacct aacccttata ttctgctttc ccaaatacca
30900gaggaagcag agtggtttac agacctggac cttaaggatg cctttttctg catccctgta
30960catcctgact ctcaattctt atttgccttt gaagatcctt caaacccaat gtctcaactc
31020acctggactg tttcacccca agggttcagg gatagccccc atctatttgg ccaggcatta
31080gcccaagact tgagccggtt ctcatacctg ggcactcttg tcctttggta tgtggatgat
31140ttttactttt agccgccagt tcagaaacct tgtgccatca agtcacccaa gtgctcttaa
31200attttctcgc tacctgtggc tacaaggttt ccaaaccaaa ggctcagctc tgctcacagc
31260aggttaaata cttagggcta aaattatcca aaggcaccag ggccctcagt gcctattctg
31320gcttatcctc atcccaaaac cctaaagcaa ctaagaggat tccttgacat aacaggtttc
31380tgccaaatat ggattcccag gtacggcgaa atagccagac cattatatac actaattaag
31440gaaactcaga aagccaatac ccatttagta agatggacac ctgaagcaga agcggctttc
31500caggccctaa agaaggccct aacccaagcc ccagtgttta gcttgccaac ggggcaagac
31560ttttctttac atgtcacaga aaaaaacaga aatagctcta ggagtcctta cacaggtcga
31620tgagcttgca acccatggca tacctgagta aggaaattga tgtagtggca aagggttggc
31680ctcattgttt atgggtagtg gcggcagtag cagtcttagt atctgaagca gttaaaataa
31740tacaaggaag agatctgtgt agacatctca taacgtgaac ggcatactca ctgctaaagg
31800agacttgtgg ctgtcagaca accgtgagga aagtaactaa aatcgtaaat ccccatggcc
31860ctcccttatc atatttttct ctttactgtt ctcttacccc ctttcactct cactgcaccc
31920cctccatgct gctgtacaac cagcagctcc ccttaccaag agtttctatg aagaatgcgg
31980cttcccagaa atattgatgc cccatcaaat aggagtttac ctaaaggaaa ctccaccttc
32040actgcccaca cccatatgcc ccacaactgc tataactctg ccactctttg catgcatgca
32100aatactcatt attggacagg gaaaatgatt aatcctagtt gtcctggaag acttggagcc
32160actgtctgtc ggacttactt cacccatact ggtatgtctg aggggggtgg agttcaagat
32220caggcaagag aaaaacatgt aaaggaagta acctcccaac tgacccgggt acatagcacc
32280cctagcccct acaaaggact agatctctta aaactacatg aaaccctcca tacccatact
32340tgcctggtaa gcctatttaa taccaccctc actgggctcc atgaggtctc ggcccaaaac
32400cctactaact gttggatgtg cctccccctg tatttcaggc catgcatttc aatccctgta
32460cctgaacaat ggaacaacta cagcacagaa ataaacacca cttccgtttt agtaggacct
32520cttgtttcca atctggaaat aacccatacc tcaaacctca cctgtgtaaa atttagcaat
32580actgtagaca caaccaactc ccaatgcatc aggtgggtaa ctcctcccac acgaatagtc
32640tgcctaccct caggaatatt ttttgtctgt ggtaccttag cctatcgttg tttgaatggc
32700tcttcagaat ctatgtgctt cctctcattc ttagtgcccc catgaccatt tacactgaac
32760aagatttata caattatgtt gtacctaagc cccacaacaa aagagtactc attcttcctt
32820ttgttatcgg agcaggagtg ctaggtggac taggttctgg cattggcggt accacaacct
32880ctactcagtt ctactacaaa ctatctcaag aactcaatgg tgacatggaa tgggttgccg
32940actccctggt caccttgcaa gatcaactta acttcctagc atcagtagtc cttcaaaatt
33000gaagagcttt agacttgcta acctctgaaa gagggggaag ctgtttattt ttaggggaag
33060aatgttgtta ttatgttatt ttagcggaag aatgttgtta ttatgttaat caatcctgaa
33120ttgtcacaga gaaagttgaa gaaattcgag attgaataca acgtagaaca gaggagcttc
33180aaaaacacca gaccctgggg cctcctcagc caatggatgc cctggattct ccccttctta
33240ggatctctag cagctctaat attgatactc ctctttggac cctgtatctt taacctcctt
33300gttaagtttg tctcttccag aatcaaagtt gtaaagctac aaatcgttct tcaaatggaa
33360ccccagatga agtccatgac taagatctac cgtggacccc tggaccggcc tactagccca
33420tgctccaatt gtaatgatat cgaacgcacc cctcccgagg aaatctcaac tgcacaaccc
33480ctactatgcc ccaattccgc aggaagcagt tagactggtc gtcagccaac ctccccaaca
33540gcacttgggt tttcctgttg agtgggggga ctgagagaca ggattagctg gatttcctag
33600gccgactaag aatcccaaag cctagctggg aaggtgacca catccacctt taaacactgg
33660gcttgcaact tagctcacac ccgaccaatc aggtagtaaa gagagctcac taaaatgcta
33720attagacaaa aacaggaggt aaaaaaatag ccaatcatct atcgcctgag agcacagcgg
33780gaaggacaat gatcgggata taaacccagg cattcaagcc ggcaacggct accttctttg
33840ggtcccctcc ctttgtatgg gagctctctc tgtcttcact ctattaaata ttgcaactgc
33900aaaaaaaaaa tagcttaatt gaagaataaa ttaatacaat aaaaggaata cattttaagt
33960atacagttca aactgtaaca gtgttacagt ttcaagagga ccccttcaac aagatattgg
34020gcatttccat catgccctaa aagttccttc ttgtccctta ctggttgggt ccatctctac
34080tacaccctcc tgacctggcc cagaccttgg cctcagaaga atcatttttt tgtcactaca
34140tattagtttt gtctgttcta gaacttctta aaaacagaat catagagtat gttctctttg
34200tattggttct ttttactcaa tgtaatgttc tgtgacattt atccatatta ttgcatgtat
34260tattcctttt aatcctgaat agtatgctgt tttaggaata taatgcaatt gtttattcat
34320ttacctgttg acagatatct gagctattat gatggatatt atgaataatt ctgctatgaa
34380cacttctgta caatgttttc tcggacatat attttcattt ttcttgagtg gagctgttag
34440aactgttgga tcagaaagta agcatatgtt gaattttgaa agaaactggt aaactcttgt
34500ctaaagtgat ttgtaccatt ttacactcct actaataatg tatgagagtt atatttgctc
34560cacagccttt ttactacttt gttaatcttt ttagtactgt caaccttttt aatttatcca
34620atctagggaa cgtgaagtag tatctcactg ttattttcat tttcctgatg agtaacaata
34680tcgtgtatct tttcatgtgc ttattagcca ttcctatatc ttttgtgaaa tagttaactt
34740aaatttgtaa ctaaaggtgc tttcctgagt ttcaggtagt aagcctattt ccctcaagtg
34800aataaactac agtcttggaa tgaaaaatta aacacagtgg agacattttt tgtataagtt
34860gttttactct gtgtatgtct ggtttgctta gtctattatt atatgcccca tgaaagcaaa
34920cacagtgctt atttcactaa tgagtatcac tagcacatag aactgtgctt gcccaaagca
34980tgaactcaat aaatatgtta atgtgtatgc atgcacatac atctacatgc atgtacatct
35040atacacacat ataaacatat attaattttt agacccacaa atctaagaaa actaattctt
35100gagcctctgg tttgaagaat tctcaaatta ttaacatatc tttatgttcc actccacatc
35160cactgtacct gaaatagccc tactgttcta ctttggtaaa tcaggcaaat ttaatttttt
35220aaataattaa gattccaact aattttaaaa tataatttga aagttaacaa tgaaatacat
35280tacataaaaa gaaaatttta aataaaagca aaactaaacc caataagagg aaagaaagtt
35340gggctgtatt tctttaatcc tttaaaattc aaatcacaca atgctccaat gaaatcttca
35400ttaactgaac caaactatgc ccatgaaaga tctcatatgc aactgctaaa acctcaataa
35460acatattcat cttcttgcaa aaaagatatt tctttataat atgcacatgc agtatatact
35520attttgaggc agatttgtac tttagtcctt gttccattgc ttaccggctg gctgtccttt
35580gtctggtcat tgacctccaa cttaaaaaat aatacttgcc ttgtctaccc cacagaagtg
35640ttatgaaagt caaacaaggt agcataaagg tattttacaa gatataaagt gctataatac
35700agattttaaa aatcactcta catcccataa tactttgttg tacaatttta gagcaatagt
35760agaaaataac aattattgcc taattgaaaa tccagtcccg aattccataa aatgtatgat
35820atgaacatta tagtacatca tattacgagc cccaaataat cactgcttat atagttggtt
35880aggatttcct tagtttgttc atatagttta tatatttatg cagtccctat tttgtgagag
35940gcattgtgag gagcataaag acataagcac agtacagagc cttagcttct ctacatttac
36000taaagaagac ttcttcttgg gtatttaatc aatatttaaa gtattctggg aagaaatgaa
36060attaacttca tagactgacc ttagattact atcattacaa aaagatgcct gagtgatctg
36120tctttaacat accagtattt atcttataac tgttatattt acttgaatca gaagtgaagt
36180ccttttaagc actaagcatc cattctatac tttcttgtct ttacatatga gatacaaatc
36240atatttttaa aacttttatt tacttttatt ttttagagac ggagtcttgc tctgtagccc
36300aggctggagt acagtggcat gatcttggct caccacaatc tccacctcca cttcccaggg
36360ttcaagtgaa caaatcatac ttttaagcac agattctcaa catgtatcct agcatgctac
36420tgccataact agggtgtgaa ttaagtatta aagacagctt accccaaata ttactgtaac
36480atatatctct aaatgaaaaa gaacatatta acaactatac ttggatggga ttctgggagc
36540taacccatcc ctctctcccc tttcctccaa attccatctc ctattaacac accagctctc
36600ctgagctaag cagctcctgg ggttggggaa gggtgtacat ggagaaagct agaacctcta
36660cagtgttttc ctctctggga ggaactagca ggcatacgaa cagaaaaagc tgaataaaag
36720gctgaatcct ttctattcct gaggcagaca gagagaagac cagggaacaa agagacttcg
36780accaagagcc ctgccaggta ttgatacctt tgatactgag aaaatatctg ggatatgaaa
36840tacaaatgct aaataagtat ctttgaaata ggggtaaaag aataaagggt cttgatgagt
36900aaaatgggta gtatttttta ataacctgat aatgagcttt aggaaaaggg aaggtcaacg
36960ttatggaatg aaaacacaga ggtaccaaat ttaaaagcat aaaaaaaagt ggaggggggg
37020aacccaataa cttcatcaaa ctagcaaata acttagtatc atttctaatt agaaacgcta
37080gaaggaaatc acttagatct gataaagact aggctataat tctaactgat gaaacactta
37140aactgtatca attaatacca gaaaacaaac acagaaaagt ctactagaac catcattatt
37200cagcacagtc ttggtaatgc aatactataa tagcaatgca ataaagcaag aaaaaaaaaa
37260gtttgtaaaa acacaatagg atgagatttt tgtttttcca atgccataaa taactagaaa
37320tggaaacaaa ataaagaaaa acaaaatcta caaaacacct ggaaataaaa agaaaaatgg
37380tctatttgaa gaaaacctta aaatctatgc agaacataaa acaaaatctg aataaaaaga
37440aatatcatgt tcttgtctgg gaagacttaa tatcataaga aagtgaatta tatcaaaatt
37500taaatcgaaa tttaatgtat ttccatctct aatcagacag gacactatgg ggaactgaat
37560aagtgatttt aaaagtcatg gaaaattaat aactgagaat aaccatgaaa agtatgaaaa
37620aaggagacaa atgaattgct ccaacagata tcagaacgct aaaattaaat aaaaatacta
37680ctaggataag aaaatacata tactgatgta atgaataaag aatccagaat tagattccag
37740taagtcaaac tactttacta taaaccaggg gtggcatatt catccagtgg gaaaaggaca
37800gtaagaagtg agtaaactat ggcccactgg ccaaattgtg gcctctgcct atttttgcaa
37860ataaagtttt actgggacaa agccaagcct atcatttgca aattgtctat aaatattttc
37920atgttacaga atcacacagt ttcaacagag accatcttgt ctacaaagct gaaaatatct
37980actatctggc ccttgaagaa agtttgccaa accttagttt atataataaa agatcagcta
38040tctcatagac acctatctca cacaacacat tgtgggaaag gaccttcttt tttttttgag
38100acggggtctt gctctgttga ccaggctgga ctgtagtggc atgatcatgg ctcactgcag
38160cctcaacctc ccaggttcaa gtaatgctcc caccacagaa tcccaaacag ctgggagaga
38220tgtgtgccac tacgcctggc taaggggcct ttttaacaga gaaagaaatc cacatactac
38280taagaaaaag aagggcatat ttgatatata tttatatttt ttatatagat atcataaaaa
38340tcaagatgaa ttatacagtt atattttgca atgtgtttga cggtaaaagt ttaatatcta
38400taaaaattat tttataaaat atctttaata tatttataga tattataata taaaatatct
38460ataaaattat tttataaaat aaaaagttaa gaagaaaaga taggcaaaac aaaatacagt
38520gcaatttaca gaaaaccaag tccaaatggt caacaaagat aaaacagatt tataaactca
38580ctaagtgtga gagaattatt agttaaagta aaaatatctc tctataccca caatactact
38640aaaaatcaga gttataatgc cctattgctg gtggagatgt aaggggagaa gcatgctctc
38700atatactgtt agtgaaaatt taaactaata catttttgaa aagtaagctg gcaatttttt
38760ttttaatctc taccttttga tgcaaaaact catttttggg tacctattcc ataccttaaa
38820aaaaatacat atgcttactg tagtactgtt tataatggta aaaactagaa aaaaagaaaa
38880cttgatagtg aatactgaac aaattacagt gcatctacag attaaacata atgcagccat
38940taaaaaagaa taaattaggc tgggtgcggt ggctcatgcc cgtaatccca gcactttggg
39000aggccaaagc aggcggatca cttgaggcca ggagttcgag accagcctgg ccaacatggc
39060aaaaccctgg ctctacaaaa aatacaaaaa ttagtcgggc atggtggtgg gcacctgtag
39120tcccagctac tcaggaggct gaggcaggag aatcacttga gcctgggaga cagagattgc
39180agtgagccaa gatcatgcca cagcattcca gtccaggtga cagaacgaga ctctgtctca
39240acaaaaagaa caaattaaac cctacaactc atcaacaaaa atacccaaac ccaattcaaa
39300aatgggcaaa ggacttgaat agacatttct tcaaggatga taaacaagca catgaaaaga
39360tgcagagcac tattcattag tgattacatc ccacatgcat taggatggct agtatgaaga
39420acagaaaata ataaatattg gtgaagatct gaaaaacaga aacctttgtg cactgttggt
39480gggaatgtaa agtggtacag ctactacgga aaacagtatg gccattcctc aagaaaataa
39540aaataaaatt atcttatgat aggaatatgc atttctgggt aaatacccca aataactgaa
39600aacagggtgt acacccattt caacatttac atgtcaattc aactgggcca gaatacccag
39660atatttgttc aaatattctt ctggatgctt ctatatatat gttttttggc tgaggttaac
39720atttaaattg gtggattctg agtacagcag attaccatcc acaatgtagg tgggcctcat
39780ctactcagtt gaaggtctta cagaaaaaga ctgacctccc ttgagcaaga aagaattcag
39840gcaacagact gcctttggac tcaactgcaa ctcttccttg agtcaacagc ccatcccatc
39900accctggctt ggtgagtcca gggtctgatg aggtaggctg cagactcaag gaagagctgc
39960caaaaccagg aaagccaatt cattaaaata aatctctctc tacacaaaca cacacacaca
40020ctaccaccac caccatgatg gttctgtttc tctggagaat gctaatacac ccctgttcat
40080ggcagcatta ttcacaatag ccaaaaggtg gaagcaactc cagcagatga atggagaagc
40140aaaatgtggt atgtatatac aatggaatat tattaagcct ttaaaaagtg gaaattatat
40200ctatctatat ctatacacac atactcacac acacacacac acatttatag aagacagggt
40260ttcaccatgt tgtcaaggct ggtctcgaac tcctgggctc aagcaaaccg cctgcctcag
40320cttcccaaag tgctgagatt acatgtgtga gccaccacac ccagccaaaa aaaggacatt
40380ctgacacata atacaatata gataaacaat gaggacatca tgatatgcga aataagcctg
40440tcacaaaaag gcaattagtg tatgattcct cttgtatgag gtacctatgg atgtcaaatc
40500cataaagtag aatggggaaa cagagagttg tttaatgggt atagagtttg ttttgcaaga
40560agaaaagagt tttggagaat gaatgtacaa cagtgtgaac ataattaaca ctactgaaaa
40620tggttaagat tataaatttt atgttacatt tattttacca tgattaaaaa ttaaaacaaa
40680ataatattaa ggaaaaatac tataaataac aacaacaaaa aaaacacctc aagcaactta
40740cattcacctg ggaaacagaa tacatcctat tctgctagag atatatctgc agttcaaaat
40800ttattacaaa tgatgttgtg tatctttttg aaatgactga aaaactaaat taaaagcaat
40860aatattcagt ttactaacca gtaagtcctt ctttcatggt tcctgacttt tctgtaagat
40920gttattgcaa gatatctact aaaatggaaa acaactgaaa aggcaaaatt ataatttctt
40980atcaacatcg ctaaaaccct ggaggggaag aatcctaaca aacatggcca taatttgcca
41040catatttcta ctgtcctcac ttttcaaaat ccagaaatca acatttctgg aaacaaaaca
41100gagtctaaaa tttggctcct tcttcagttt agaaggtgcc aagttaatcc ctgacatcct
41160agtttccatt ttcaaaaatg tactttttct ctccccaaac cggtatctag attcttaaat
41220atttttagca catagaagtt aaatagattt gcttaaccaa aatagccagt aaacctccca
41280aaagaattaa aatattaatg gcgctttaat gatacaaatg aacaacttta cattcaatcg
41340tcaatgggaa aggaagcaga attctgagga ttatgaaagt aaacaaaacg aagttcaaat
41400tctactttat tttacttttt tgtaactaat gaacaacttc ttccaaagac aagtaggaaa
41460tacaaaaatt agccaggcat ggcacatgcc tgtagtcctg gttacttgga aggctgaagt
41520gggtggatcg cttgagccgg gaaggcagag gctgtagtga gctgagatca catcactgca
41580ctcaagcctg ggtgacagag caagaccctc tctggggaaa aaaaaaaaaa aaataggctg
41640ggcgcagtgg ctcacacttg taattccagc actttgggag gctgaggcag gtggttcacc
41700tgaggtcagg agttctagac cagcctgacc aatatggtga aaccctgtct ctactaaaaa
41760tacaaaaatt agccaggcat ggtggtgggc aattgtaatc ctagctactc gggaggctga
41820ggcaggaaaa tcgcctgaac ccaagaggcg gaggtttcag tgagccgaga ttgcactagt
41880gcactccagc ctgggcgaca gagcaagact tcatctcaaa ataaataaat aagtaagtaa
41940ataaaattaa aaaatatata aaaataaaac aaagataagt aggaaccatc cttttttttt
42000tttttttttt ttttttttaa agatagggtc tgtttctgat gcccaggctt gagtgtagtg
42060gcatgatcat ggctcactgc aaccttgacc tctcaaatac aagtgactct cctacctcag
42120cctcccaagt agctgggact acaggtgctt accaccccat ccggctcatt taaaaaaatt
42180tttttgtaga ggtggggtct cactatgttg tatccaggct ggtctcattt taactttatt
42240agaaaacaag cattgtttta tcagcttctt gtttttttaa aactaaaaat aacactgcta
42300ggttgtttct atgaagattc tctaaattta tttataacct taagaataac atgtagaaca
42360aagtagatga ctgaatgatc tttgttgaat aaatatgaat ggatattcaa ataattaaaa
42420atctcttaag atctcccatt ctttacagga tacagagaaa actcgttaat atggcctgac
42480ttttaccttt gcagccttat ccaaactctg tggtcaagac aaacaggttg tccttatact
42540tacaacgtcc ccctttgcct acaaagctct tctcatgact ctttgcctat cttaagttca
42600cctatctgtc aaatctctgg gaatgcaaca tttcctcaag gtagccttct ctcctcccaa
42660actagaacaa attcttcctg gggcattagg tttttattgc actgtatgtc tcttcttcac
42720agcaatcaca gttccaatgt tatatttgta ttcttagttg atttgtttct ttccaccttt
42780agactataac cttctaaggg gtcacacata atatcgatca tcagttgtat cccttgtgca
42840tagcacaggg catggcaggc aaatatgtgt gtaaataaac ttgttgaatg aatcaatgag
42900acacactttt cttacccaaa gtataatggc aggataacat ttatcaatct attgcttctt
42960gaaaaacaga tatgatgtgc ttaattttca ttttacatct caaataccaa tgcctaagga
43020attcacagtc attttacaaa tctttttgac aaatgccttc attaatcacc acctgtttac
43080aagtgctaaa taacattttg gttacattct gtaacatttc ctgcacttaa tgtcatctct
43140agaatactgg ctaatatgaa gcacctggac ttcaggaaca caaacctgaa actaacacac
43200caaactaaac tgttatgtaa atgacagaaa tgacacattt tggtctgcaa catctctaga
43260tggcttttgg accaattcaa cttttaccac taaaaatcgg tcacctgact atagtcattt
43320tgagctcatg ataaatgaat tacagatgaa aaataaatag tttgatgaca atctttacaa
43380aagtttatct tcaaagaata ccaccagtca caggtattct aggctcctat caacttattt
43440ggtcagggca gacttcactt ttcatgataa ttatgttctg aaaattctac aaacttaatg
43500attacaaaca aaagtcatag tttgctcata aatcaggcct aggtctggat tctagttctt
43560ccatttttca tttgttcact gaggcaagtg acttaaaatt ccctagcctc agtttcctca
43620catgtaaaat cagataatga ttcctattcc taagatggtt ttgaggcttc aacaagataa
43680gatgggcctc actcaagcat gctcagtact ctgtctctct ctctccggtt atgcagaaat
43740tctattagga ttctgcaaag taaaataaat atttcagtaa aaattatgcc ctttattaat
43800gaatctagat tttcagattt tccttaaatt tacttagtaa cttaagggct caaatattat
43860agagatttgt atctagtatt ttaaagaaat gaaaggtgtt aatcaaaatg ctgcacaaat
43920aaatgctaca tttaacaaac agaatatcac aaccatacaa actaatcaga tataaagaag
43980tcagcaacag aaatctgatg ttgcctttag atcacacaat taggcaaaca aaaatagagt
44040tccatcctcc tttggtcaag gccatggttg aagactgaat accaaatagg gaaataggaa
44100aagccaggaa atggcaaatt agcaaaaact ggactcctta atttttatat tcattttcat
44160atctcacttc taaaacttta attaaattca aataaaaacc aaaatggaac tgagataaag
44220ccaaaaggaa agttatgtag gtcaaatgag aacctatatt gtccttaggc tctttgttgc
44280tttctgttta aggaaaaact gcccaagtgc cttgacacat taaagatcaa gcaggaggtt
44340ctgccgagag tccccatctg gcagccaggt tttgtcaagc aaattttgag aattctctac
44400cctcccactt tctatctaat tatagcactt tataaaaacc attctctctc tgtctctgtc
44460tctctctctc tctctctctc acacacacac acacacacac acacacacac acacacaccc
44520tttctctctc tctctctctg aaacttatct gtattataat aacacaacac taggtatgga
44580ttaatctgac aattttcccc taaaacagaa taaattcaaa aaggaaaacc tttcctctgt
44640acacatgcac tatattctga caataataat tcctaaatta agtataatac attttcccta
44700caggagttta aagaagttac agtaaagaat ctcttgtata aatatatatg ccagaacttg
44760acccaaataa gtgctgagag gtataaatct caaaacagtt tccggactct ttgtgaaatg
44820tcttcagagt ctgcgatata ttttcttcaa ctaaattata caagtaagat attttgctgg
44880gctgtgggaa tgccttacgg catgttactg tggagctcat ggtaaaatag aaagaatata
44940aataattaaa ataaaattga caaatgataa atgatttaat aaattagaaa ttcaaatgcc
45000gggcactttt ctagaacctg gacacaaagc atgaacctaa caataacccc gccttcatga
45060aaaatatgga ctatttgaaa attatacctg caacactaaa taaatattct tcattcttcc
45120agtatattga gatgtttact ttcaattaga caatttgctt tcctctctga acacatagtt
45180atgtgatggc tctataaaag attttaaaat aactatagaa ggaactattg gtaaagactg
45240tgggatacta aaaatggcta caaagaaagt tatgacaaaa cctctgagtt tgaatggaag
45300tcctactaga ttagagtcta agcctgtgac attatgcttc tggttcttgt tcttaaatgc
45360ttttctcatt aatagtatgt aacttacttc ctggaatgcc attcattaaa aaaatattta
45420atatttgcta aatgtcaata tttatgccag cacttttaaa gtacagaaac atggagtttc
45480tttacctcat gcaaatatgc tgtgagaaag acttaagagc ctattgccta ctttgtggta
45540caacactgaa gactcaccat ccaaaacaaa cagacttagt aaattcttgt gatttgcagt
45600agttctgttc tataaggtta ccacaaacac tgaaatcatc gctcctgggg gaatacaagg
45660ttatgtttcc gtgagccctc ggtcacaaca tgttcattaa ctgatcaata cataaccttg
45720ttctatgtgt gtttctgttt aaaaagagca cttcagtgct acatttggag tctgttttaa
45780acagcaaaat cactaataaa aagcacaaaa atgtaaaagc atggcactac atacactgtg
45840acaagaaggc ttgtttatag tatgacagct gagacaagaa ggtagagcct cgctttgatc
45900aacctctgct gggaaatgag catcaggtga atcaattttt caccactctg aatgaccgta
45960aaagtgctcc aagtactgac tttggggtta cacataaatt ttagtaagca tgtgaatctg
46020ccaatatgaa atctacaaat aatgagtacc aaatgcatat gagtcaaata tttcagtgcg
46080gtatctgact tgattgccac tgaaagacac agtttggaaa acccctaata aataccgttt
46140agttactatg cagacaaaga gttctacact agagtgcttc aattaagatg tctgaggctt
46200tcataaatgg atgtttttta aaatgttatt tcctacctga tatattctaa aggggatata
46260acgaaatcca ttttcttctg caggatattc catgagtttc cgattgatgg cccaaaactg
46320gtcaaatctg tctgtaatga
4634067773DNAHomo sapiens 67actgagagac aggactagct ggatttccta ggctgactaa
gaatccctaa gcctagctgg 60gaaggtgacc acatccacct ttaaacacgg ggcttgcaac
ttagctcaca cctgaccaag 120gaaggtgacc acaccctcct ttaaacacag agcttgtaac
tcagctcaca cccgaccaat 180caggtagtaa agagagctca ctaaaatacc aattaggcta
aaaacaggag gtaaagaaat 240aatcaaatca tctatcgcct gagagcacag ggggagggac
aatgatcggg atataaaccc 300aggcatttga gccagatcag gtaaccctct ttgggtcccc
tcacactgta tgggagctct 360gttttcactc tattaaatct tgcaactgca cactcttctg
gtccatgttt gttccggctc 420aagctgagct tttgctcgcc gtccaccact gctgaatgcc
gccattgcag acctgccctt 480gacttccacc cctccggatc cggcagagtg tccgctgcac
tcctgatcca gcgaggcacc 540cattgccact cccgatcagg ctaaaggctt gccattgttc
ctgcacagct aagtgcctgg 600gttcatccta atcaggctga acactggtcg ctgggttcca
cggttctctt ccatgactca 660cagcttctaa tagagctata acactcacca catggcccaa
ggttccattc gttggaatcc 720atgaggccaa gaaccccagg tcagagaata aaaggcccgc
cccatcttgg gag 7736810PRTHomo sapiens 68Phe Leu Gly Glu Glu Cys
Cys Tyr Tyr Val1 5 106910PRTHomo sapiens
69Leu Leu Phe Gly Pro Cys Ile Phe Asn Leu1 5
107010PRTHomo sapiens 70Cys Leu Pro Leu Asn Phe Arg Pro Tyr Val1
5 107110PRTHomo sapiens 71Gly Leu Leu Ser Gln
Trp Met Pro Trp Ile1 5 10729PRTHomo
sapiens 72Cys Leu Pro Ser Gly Ile Phe Phe Val1 5739PRTHomo
sapiens 73Trp Met Pro Trp Ile Leu Pro Phe Leu1 57410PRTHomo
sapiens 74Ile Arg Trp Val Thr Pro Pro Thr Gln Ile1 5
107510PRTHomo sapiens 75Leu Arg Asn Thr Gly Pro Trp Gly Leu
Leu1 5 107610PRTHomo sapiens 76Leu Arg
Thr His Thr Arg Leu Val Ser Leu1 5
107710PRTHomo sapiens 77Lys Arg Val Pro Ile Leu Pro Phe Val Ile1
5 107810PRTHomo sapiens 78Cys Arg Cys Met Thr Ser
Ser Ser Pro Tyr1 5 107910PRTHomo sapiens
79Thr Arg Val His Gly Thr Ser Ser Pro Tyr1 5
108010PRTHomo sapiens 80Ala Arg Glu Lys His Val Lys Glu Val Ile1
5 108110PRTHomo sapiens 81Ser Arg Ile Glu Ala
Val Lys Leu Gln Met1 5 108210PRTHomo
sapiens 82Ser Gln Trp Met Pro Trp Ile Leu Pro Phe1 5
10839PRTHomo sapiens 83Cys Tyr Tyr Val Asn Gln Ser Gly Ile1
5849PRTHomo sapiens 84Phe Tyr Tyr Lys Leu Ser Gln Glu Leu1
5859PRTHomo sapiens 85Thr Tyr Thr Thr Asn Ser Gln Cys Ile1
5869PRTHomo sapiens 86Ser Phe Leu Val Pro Pro Met Thr Ile1
5879PRTHomo sapiens 87Tyr Tyr Val Asn Gln Ser Gly Ile Val1
5889PRTHomo sapiens 88Leu Phe Asn Thr Thr Leu Thr Gly Leu1
5899PRTHomo sapiens 89Leu Phe Gly Pro Cys Ile Phe Asn Leu1
5909PRTHomo sapiens 90Arg Trp Val Thr Pro Pro Thr Gln Ile1
59110PRTHomo sapiens 91Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile1
5 109210PRTHomo sapiens 92Leu Pro Tyr His Ile
Phe Leu Phe Thr Val1 5 109310PRTHomo
sapiens 93Gly Ala Leu Gly Thr Gly Ile Gly Gly Ile1 5
109410PRTHomo sapiens 94Leu Pro Phe Val Ile Gly Ala Gly Val
Leu1 5 10959PRTHomo sapiens 95Arg Arg Pro
Leu Asp Arg Pro Ala Ser1 5969PRTHomo sapiens 96Phe Arg Pro
Tyr Val Ser Ile Pro Val1 5979PRTHomo sapiens 97Arg Arg Ala
Leu Asp Leu Leu Thr Ala1 5989PRTHomo sapiens 98Trp Arg Met
Gln Arg Pro Gly Asn Ile1 59910PRTHomo sapiens 99Asp Arg Ile
Gln Arg Arg Ala Glu Glu Leu1 5
1010010PRTHomo sapiens 100Leu Arg Thr His Thr Arg Leu Val Ser Leu1
5 1010110PRTHomo sapiens 101Glu Arg Val Ala Asp
Ser Leu Val Thr Leu1 5 1010210PRTHomo
sapiens 102Leu Phe Gly Pro Cys Ile Phe Asn Leu Leu1 5
1010310PRTHomo sapiens 103Gln Phe Tyr Tyr Lys Leu Ser Gln Glu
Leu1 5 1010410PRTHomo sapiens 104Gln Trp
Met Pro Trp Ile Leu Pro Phe Leu1 5
1010510PRTHomo sapiens 105Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val1
5 1010610PRTHomo sapiens 106Asn Phe Val Ser Ser
Arg Ile Glu Ala Val1 5 101079PRTHomo
sapiens 107Gly Pro Leu Val Ser Asn Leu Glu Ile1
51089PRTHomo sapiens 108Leu Pro Leu Asn Phe Arg Pro Tyr Val1
510910PRTHomo sapiens 109Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile1
5 1011010PRTHomo sapiens 110Glu Pro Lys Met Gln
Ser Lys Thr Lys Ile1 5 1011110PRTHomo
sapiens 111Leu Pro Tyr His Ile Phe Leu Phe Thr Val1 5
101129PRTHomo sapiens 112Arg Glu Lys His Val Lys Glu Val Ile1
511310PRTHomo sapiens 113Lys Pro Arg Asn Lys Arg Val Pro
Ile Leu1 5 101149PRTHomo sapiens 114Val
Val Leu Gln Asn Arg Arg Ala Leu1 511510PRTHomo sapiens
115Ala Val Val Leu Gln Asn Arg Arg Ala Leu1 5
101169PRTHomo sapiens 116Leu Pro Phe Val Ile Gly Ala Gly Val1
51179PRTHomo sapiens 117Asp Leu Tyr Ser Tyr Val Ile Ser Lys1
511810PRTHomo sapiens 118Thr Glu Gln Asp Leu Tyr Ser Tyr Val Ile1
5 101192615DNAHomo sapiens 119gaattccggg
aagccagacg gttaacacag acaaagtgct gccgtgacac tcggccctcc 60agtgttgcgg
agaggcaaga gcagcgaccg cgcacctgtc cgcccggagc tgggacgcgc 120gcccgggcgg
ccggacgaag cgaggaggga ccgccgaggc tgcccccaag tgtaactcca 180gcactgtgag
gtttcaggga ttggcagagg ggaccaaggg gacatgaaaa tggacatgga 240ggatgcggat
atgactctgt ggacagaggc tgagtttgaa gagaagtgta catacattgt 300gaacgaccac
ccctgggatt ctggtgctga tggcggtact tcggttcagg cggaggcatc 360cttaccaagg
aatctgcttt tcaagtatgc caccaacagt gaagaggtta ttggagtgat 420gagtaaagaa
tacataccaa agggcacacg ttttggaccc ctaataggtg aaatctacac 480caatgacaca
gttcctaaga acgccaacag gaaatatttt tggaggatct attccagagg 540ggagcttcac
cacttcattg acggctttaa tgaagagaaa agcaactgga tgcgctatgt 600gaatccagca
cactctcccc gggagcaaaa cctggctgcg tgtcagaacg ggatgaacat 660ctacttctac
accattaagc ccatccctgc caaccaggaa cttcttgtgt ggtattgtcg 720ggactttgca
gaaaggcttc actaccctta tcccggagag ctgacaatga tgaatctcac 780acaaacacag
agcagtctaa agcaaccgag cactgagaaa aatgaactct gcccaaagaa 840tgtcccaaag
agagagtaca gcgtgaaaga aatcctaaaa ttggactcca acccctccaa 900aggaaaggac
ctctaccgtt ctaacatttc acccctcaca tcagaaaagg acctcgatga 960ctttagaaga
cgtgggagcc ccgaaatgcc cttctaccct cgggtcgttt accccatccg 1020ggcccctctg
ccagaagact ttttgaaagc ttccctggcc tacgggatcg agagacccac 1080gtacatcact
cgctccccca ttccatcctc caccactcca agcccctctg caagaagcag 1140ccccgaccaa
agcctcaaga gctccagccc tcacagcagc cctgggaata cggtgtcccc 1200tgtgggcccc
ggctctcaag agcaccggga ctcctacgct tacttgaacg cgtcctacgg 1260cacggaaggt
ttgggctcct accctggcta cgcacccctg ccccacctcc cgccagcttt 1320catcccctcg
tacaacgctc actaccccaa gttcctcttg cccccctacg gcatgaattg 1380taatggcctg
agcgctgtga gcagcatgaa tggcatcaac aactttggcc tcttcccgag 1440gctgtgccct
gtctacagca atctcctcgg tgggggcagc ctgccccacc ccatgctcaa 1500ccccacttct
ctcccgagct cgctgccctc agatggagcc cggaggttgc tccagccgga 1560gcatcccagg
gaggtgcttg tcccggcgcc ccacagtgcc ttctccttta ccggggccgc 1620cgccagcatg
aaggacaagg cctgtagccc cacaagcggg tctcccacgg cgggaacagc 1680cgccacggca
gaacatgtgg tgcagcccaa agctacctca gcagcgatgg cagcccccag 1740cagcgacgaa
gccatgaatc tcattaaaaa caaaagaaac atgaccggct acaagaccct 1800tccctacccg
ctgaagaagc agaacggcaa gatcaagtac gaatgcaacg tttgcgccaa 1860gactttcggc
cagctctcca atctgaaggt ccacctgaga gtgcacagtg gagaacggcc 1920tttcaaatgt
cagacttgca acaagggctt tactcagctc gcccacctgc agaaacacta 1980cctggtacac
acgggagaaa agccacatga atgccaggtc tgccacaaga gatttagcag 2040caccagcaat
ctcaagaccc acctgcgact ccattctgga gagaaaccat accaatgcaa 2100ggtgtgccct
gccaagttca cccagtttgt gcacctgaaa ctgcacaagc gtctgcacac 2160ccgggagcgg
ccccacaagt gctcccagtg ccacaagaac tacatccatc tctgtagcct 2220caaggttcac
ctgaaaggga actgcgctgc ggccccggcg cctgggctgc ccttggaaga 2280tctgacccga
atcaatgaag aaatcgagaa gtttgacatc agtgacaatg ctgaccggct 2340cgaggacgtg
gaggatgaca tcagtgtgat ctctgtagtg gagaaggaaa ttctggccgt 2400ggtcagaaaa
gagaaagaag aaactggcct gaaagtgtct ttgcaaagaa acatggggaa 2460tggactcctc
tcctcagggt gcagccttta tgagtcatca gatctacccc tcatgaagtt 2520gcctcccagc
aacccactac ctctggtacc tgtaaaggtc aaacaagaaa cagttgaacc 2580aatggatcct
taagattttc agaaaacact tattt 261512029PRTHomo
sapiens 120Leu Gln Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly
Gly1 5 10 15Thr Cys Leu
Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val 20
25
121 21 DNA Homo sapiens
121 cttcaaacaa caaccaggag g 21
122 20 DNA Homo sapiens
122 ttggggaggt tggccgacga 20
123 32 PRT Homo sapiens
123 Asn
Ser Leu Ala Ala Val Val Leu Gln Asn Arg Arg Ala Leu Asp Leu1
5 10 15Leu Thr Ala Glu Ser Gly Gly
Thr Phe Leu Phe Leu Glu Glu Lys Cys 20 25
3012432PRTHomo sapiens 124Asn Ser Leu Ala Ala Val Val Leu
Gln Asn Arg Arg Ala Leu Asp Leu1 5 10
15Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu Phe Leu Gly Glu
Glu Cys 20 25 3012532PRTHomo
sapiens 125Asp Ser Leu Ala Ala Val Thr Leu Gln Asn His Gln Gly Leu Asp
Leu1 5 10 15Leu Thr Ala
Glu Lys Gly Gly Leu Cys Tyr Phe Leu Gly Glu Asp Cys 20
25 3012632PRTHomo sapiens 126Asp Ser Leu Ala
Ala Val Thr Leu Gln Asn His Gln Gly Leu Asp Leu1 5
10 15Leu Ile Ala Glu Lys Gly Gly Leu Cys Thr
Phe Leu Gly Glu Glu Cys 20 25
3012732PRTHomo sapiens 127Asp Ser Leu Ala Ala Val Thr Leu Gln Asn Cys
Arg Gly Leu Asp Leu1 5 10
15Leu Thr Ala Glu Lys Gly Gly His Tyr Thr Phe Leu Gly Glu Glu Cys
20 25 3012818PRTHomo sapiens 128Leu
Gln Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly1
5 10 15Leu Cys12932PRTType D
retrovirus 129Asp Ser Leu Ala Lys Val Val Leu Gln Asn Arg Arg Gly Leu Asp
Leu1 5 10 15Leu Thr Ala
Glu Gln Gly Gly Ile Cys Leu Ala Leu Gln Glu Lys Cys 20
25 30130571DNAArtificial SequenceSynthetic DNA
130ttcctgagtt cttgcactaa cctcaaatga gagaagtgcc gccataactg caacccaaga
60gtttggcgat ccctggtatc tcagtcaggt caatgacagg atgacaacag aggaaagata
120atgattcccc acaggccagc aggcagttcc cagtgtagac cctcattagg acacagaatc
180agaacatgga gattggtgcc gcagacattt gctaacttgc gtgctagaag gactaaggaa
240aactaggaag atatgaatta ttcaatgatg tccactataa cacaggggaa aggaagaaaa
300tcctactgcc tttctggaga gactaaggga ggcattgagg aagcatacca ggcaagtgga
360cattggaggc tctggaaaag ggaaaagttg ggaaaagtat atgtctaata gggcttgctt
420ccagtgtggt ctacaaggac actttaaaaa agattgtcca atagaaataa gccaccacct
480cgtccatgcc ccttatgtca agggaatcac tggaaggccc actgccccag gggatgaagg
540tcctctgagt cagaagccac taaccagatg a
571131359DNAArtificial SequenceSynthetic DNA 131aacttgcgtg ctagaaggac
taaggaaaac taggaagact atgaattatt caatgatgtc 60cactataaca caggggaaag
gaagaaaatc ctactgcctt tctggagaga ctaagggagg 120cattgaggaa gcataccagg
caagtggaca ttggaggctc tggaaaaggg aaaagttggg 180caaattgaat gcctaatagg
gcttgcttcc agtgcagtct acaaggacgc tttagaaaag 240attgtccaag tagaaataag
ccgcccctcg tccatgcccc ttatgtcaag ggaatcactg 300gaaggcctac tgccccaggg
gacgaaggtc ctctgagtca gaagccacta acctgatga 359132694DNAArtificial
SequenceSynthetic DNA 132cttccccaac taataaggac ccccctttca acccaaacag
tccaaaagga catagacaaa 60ggagtaaaca atgaaccaaa gagtgccaat attccctggt
tatgcaccct ccaagcggtg 120ggagaagaat tcggcccagc cagagtgcat gtaccttttt
ctctctcaca cttgaagcaa 180attaaaaata gacntaggtn aattntcaga tagccctgat
ggytatattg atgttttaca 240aggattagga caatcctttg atctgacatg gagagatata
atattactgc taaatcagac 300gctaacctca aatgagagaa gtgctgccat aactggagcc
cgagagtttg gcaatctctg 360gtatctcagt caggtcaatg ataggatgac aacggaggaa
agagaacgat tccccacagg 420gcagcaggca gttcccagtg tagctcctca ttgggacaca
gaatcagaac atggagattg 480gtgccgcaga catttactaa cttgcgtgct agaaggacta
aggaaaacta ggaagactat 540gaattattca atgatgtcca ctataacaca ggggaaagga
agaaaatcct actgcctttc 600tggagagact aagggaggca ttgaggaagc ataccaggca
agtggacatt ggaggctctg 660gaaaagggaa aagttgggca aattgaatgc ctaa
694133494DNAArtificial SequenceSynthetic DNA
133cttccccaac taataaggac ccccctttca acccaaacag tccaaaagga catagacaaa
60ggagtaaaca atgaaccaaa gagtgccaat attccctggt tatgcaccct ccaagcggtg
120ggagaagaat tcggcccagc cagagtgcat gtaccttttt ctctctcaca cttgaagcaa
180attaaaatag acntaggtna attntcagat agccctgatg gntatattga tgttttacaa
240ggattaggac aatcctttga tctgacatgg agagatataa tattactgct aaatcagacg
300ctaacctcaa atgagagaag tgctgccata actggagccc gagagtttgg caatctctgg
360tatctcagtc aggtcaatga taggatgaca acggaggaaa gagaacgatt ccccacaggg
420cagcaggcag ttcccagtgt agctcctcat tgggacacag aatcagaaca tggagattgg
480tgccgcagac attt
494134184PRTArtificial SequenceSynthetic Peptide 134Arg Thr Pro Leu Ser
Thr Gln Thr Val Gln Lys Asp Ile Asp Lys Gly1 5
10 15Val Asn Asn Glu Pro Lys Ser Ala Asn Ile Pro
Trp Leu Cys Thr Leu 20 25
30Gln Ala Val Gly Glu Glu Phe Gly Pro Ala Arg Val His Val Pro Phe
35 40 45Ser Leu Ser His Leu Lys Gln Ile
Lys Ile Asp Gly Ser Asp Ser Pro 50 55
60Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu Gly Gln Ser Phe Asp Leu65
70 75 80Thr Trp Arg Asp Ile
Ile Leu Leu Leu Asn Gln Thr Leu Thr Ser Asn 85
90 95Glu Arg Ser Ala Ala Ile Thr Gly Ala Arg Glu
Phe Gly Asn Leu Trp 100 105
110Tyr Leu Ser Gln Val Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Arg
115 120 125Phe Pro Thr Gly Gln Gln Ala
Val Pro Ser Val Ala Pro His Trp Asp 130 135
140Thr Glu Ser Glu His Gly Asp Trp Cys Arg Arg His Leu Leu Thr
Cys145 150 155 160Val Leu
Glu Gly Leu Arg Lys Thr Arg Lys Thr Met Asn Tyr Ser Met
165 170 175Met Ser Thr Ile Thr Gln Gly
Lys 180135891DNAArtificial SequenceSynthetic DNA 135gtctacctag
ccaaggcata ttcttcttat gtggaacatc aacctatatc tgcctcccca 60ctaactggac
aggcacctga accttagtct ttctaagtcc caacattaac attgccccag 120gaaatcagac
cctattggta cctgtcaaag ctaaagtccc gtcagtgcag agccatacaa 180ctaatatccc
tatttatagg gttaggaatg gctactgcta caggaactgg aatagccggt 240ttatctactt
cattatccta ctaccataca ctctcaaaga atttctcaga cagtttgcaa 300gaaataatga
aatctattct tactttacaa tcccaattag actctttggc agcaatgact 360ctccaaaacc
gccgaggccc acacctcctc actgctgaga aaggaggact ctgcaccttc 420ttaggggaag
agtgttgttt ttacactaac cagtcaggga tagtacgaga tgccacctgg 480catttacagg
aaagggcttc tgatatcaga caatgccttt caaactctta taccaacctc 540tggagttggg
caacatggct tcttccattt ctaggtccca tggcagccat cttgctgtta 600ctcacctttg
ggccctgtat ttttaagctt cttgtcaaat ttgtttcctc taggatcgaa 660gccatcaagc
tacagatggt cttacaaatg gaaccccaaa tgagttcaac taacaacttc 720taccaaggac
ccctggaacg atccactggc acttccacta gcctagagat tcccctctgg 780aagacactac
aactgcaggg ccccttcttt gcccctatcc agcaggaagt agctagagcg 840gtcatcggcc
aaattcccaa cagcagttgg ggtgtcctgt ttagaggggg g
891136568DNAHomo sapiens 136actttacaat cccaaataag actctttggc agcagtgact
ctccaaaacc gctgaggcct 60agatctcctc actgctgaaa aaggaggact ctgcaccttc
ttaggggaag agtgttgttt 120ttacactaac cagtcaggga tagcatgaga tgccacccag
cgtttacagg aaaaggcttc 180tgaaatcaga cgcctttcaa attcttatac caacctctgg
agttgggcaa catggcttct 240cccctttcta ggtcccgtgg cagccatctt gctgttactc
gcctttgggc cccgtatttt 300taaccttctt gtcaaatttg tttggtctag aatcgaggcc
atcaagctac agatggtctt 360acaaatcgaa ccccaaatga gttcaactaa caacttctac
cgaggacccc tggactgacc 420agctggcact tcccctggcc tagagagttc ccctctgaag
gacactacaa ctgcaaagcc 480ccttcttcgc ccctatccag caggaagtag ctagagcagt
catcggccaa attcccaaca 540gcagttgggg tgtcctgttg attgaggg
5681373273DNAHomo sapiens 137agttgcaatt ccttgcctcc
actgtgagac aaaccccaga cacatctcca gcacacaaga 60acttcgaaat gcctcaacct
caggtgccag gggttcctcc agaaccttct cccccaggag 120cttgctacaa gtgccagaaa
tctggccact gggccaagga atgcccacag accaggattc 180ctcctaagct gtatcccatc
tctgtgggac cccactaaaa atcagactgt tcaactcacc 240tggcagccac ttccagagcc
cctggaactc tagcccaagg ctctctgact gaccccttct 300gagatcttct tggcttagca
gctgaagact gacactgcca gatcgcctcg gaagcctaca 360ggaccatcac agatgctcca
ggtaactctc acagtagagg gtaagtctgt ccccttctta 420atcaatatgg aggctaccca
ctgcacatta ccttcttttc aagggcctgt ttcctttgcc 480tccataactg ttgtgggtat
tgacggccag gcttctaaac ctcttaaaac tccccaactc 540tagtaccaac ttagacaata
ctcttttaag cactcctttt tagttatccc cacttgccca 600gttcccttat gaggccgaga
cacttcaact aaattatctg cttccctgac tattcctgga 660ctacagctac atctcattgc
tgcccttctt cccaatccaa agcctccttt gcatcttctt 720gtatccccca accttaaccc
acaagtataa gatacctcta ttccctcctt ggtgaccaat 780catgcacccc ttaccatctc
attaaaacct aatcactctt acccggctca atgccaagat 840cccatcccac agcatgcttt
aaaaggatta aaacctgtta tcactcgcct gctagagcat 900ggccttttaa agcctataaa
ctctccttac aattccccca ttttacctgt cctagaacca 960gacaagcctt acaggttcag
gatctgtgtc ttatcaatga aattgttttc cctatccacc 1020ctgtggtgct gaacccatat
actctcctat cctcaatacc tccctctaca acccattatt 1080ctgttctaga tctcaaacat
gctttcttta ctatcccttt acacccttca acccagcctc 1140tcttcgtttt cacctggact
gaccctgaca cccatcagtc ccagcagctt acctgggctg 1200taatgctgca aggtttcagg
ggcagccctt attatttcag ccaagctctt tctcatgatt 1260tactttcttt ccacccctcc
acttctcacc ttattcaata tattggtgat gttcttcttt 1320gtagcccctc ctttgaatct
tctcaacaag acacacttct gctccttcag catttattct 1380ccaaaggata tccccctcca
aagctcaaat gtcttctcca tccgttacct accttggcat 1440aattcttcat aaaaacacac
gtgccctccc tgctgatagt gtctgactga tctctcaaac 1500cccaacccct tctacaaaac
aacaactctt ttccatccta ggcatggttg gatactttcg 1560tgttaggata cctggttttg
ccatcctaac aaaaccatta tataaactca caaaaggaaa 1620cctagttgac cccatagatc
ctaaatcgtt tccccactcc tctttccatt ccttgaagac 1680agctttagag actgtctcca
ctctagctct ccctgactca tcccaacact tttcattaca 1740cacagctgaa gtgcagggct
gtgcagtcag aattcttaca caaggaccgg gatcgcatcc 1800tgtagccttt ttgtccaaac
aacttgacct tactgtttta ggctggccat catgtctcca 1860tgcagcgtct gctgccaccc
taatactttt agaggccctc aaaatcacaa actatgctca 1920actcattctc tacagctctc
ataatttcca aaatctattt tcttcctcac acctgacaca 1980tatactttct gctccccggc
tccttcagat atactcactc catttattct cccacaatta 2040ccattattcc tggcctggac
ttcaatccgg cctcccacat tattctggat accatacctg 2100accctcatga ctgcatctct
ctgatccacc tgacgttcac cccatttccc cacatttcct 2160tctgccctgt ttctcaccct
gatcacactt ggtttattga tggcagttcc accaggccta 2220atcgccactc accagcaaag
gcaggatatg ctatgaacta gttgccttaa ttcaagccct 2280cactcttgca aaaggactac
gtgtcaatat ctatactgat tctaaatatg cctttcatat 2340tctgcaccac catgcggtca
tatgggctga aagaggtttc ctcactacac aagtgtcctc 2400catcattaat gcctctttaa
gaaaactctg ctcaaggctg ctttacttcc aaaggaagct 2460ggggtcattc actgcaaggg
gcatcaaaag acttcagatc ccattgctct aggcaatgct 2520tatgctgata aggtggctag
acaagcagct agctctccaa cttttgtccc tcatggccag 2580tttttctcct tcacatccgt
cactcccacc tactccacag ctgaaacttc cacctatcaa 2640gctcttcccc cgcaaggtaa
atggttctta gaccaaggaa aatatctcct tccagcctca 2700caggcccatt ctattctgtc
gtcatttcat aaccttttcc atgtaggtta caagccacta 2760gcctgtctct taggacctct
catttccttt ccatcatgga aatctatcct caaggagatc 2820acttctcagt gttccatctg
ctattctgct acccctcagg gattgttcag gcctcctccc 2880tttcctacac ataaagctcg
gggatttgcc cctgcccagg actggcaaat tgactttact 2940cacatgcctc gggtcagaaa
actaaaatat ctcttagtct gggtagacac tttcactggg 3000tgggtagagg cctttcccat
agagtctgag aaggccaccg cggtcatttc ttcccttctg 3060tcagacataa ttccttggtt
tggccttccc ttctctatac agtctgataa cggaccagcc 3120tttactagtt aaatcaccca
agcagtttct caggctcttg gtattcagtg gaaccttcat 3180atcccttaac atcctcaatc
ttcaggaaag gtaaaaccga ctaatggtct tttaaagaca 3240cacctcacca agctcagcct
ccaacttaaa aag 3273138823PRTHomo sapiens
138Ser Ser Phe Arg Arg Thr Lys Asn Arg Pro Leu Pro Trp Arg Arg Leu1
5 10 15Ala Thr Asp Phe Thr His
Lys Pro Lys Pro Gln Gly Phe Gln Tyr Leu 20 25
30Leu Val Trp Val Asp Thr Phe Thr Gly Trp Ala Glu Ala
Phe Pro Cys 35 40 45Arg Thr Glu
Lys Ala Gln Glu Val Ile Lys Ala Leu Val His Glu Ile 50
55 60Ile Pro Arg Phe Gly Leu Pro Arg Gly Leu Gln Ser
Asp Asn Ser Pro65 70 75
80Ala Phe Gln Ala Thr Val Thr Gln Gly Val Ser Gln Ala Leu Gly Ile
85 90 95Arg Tyr His Leu His Cys
Ala Arg Pro Gln Ser Ser Gly Lys Val Glu 100
105 110Lys Met Asn Glu Thr Leu Lys Gly His Leu Lys Lys
Gln Thr Gln Glu 115 120 125Thr His
Leu Thr Trp Pro Ala Leu Leu Pro Ile Ala Leu Lys Arg Ile 130
135 140Cys Asn Phe Pro Gln Lys Ala Gly Leu Ser Pro
Tyr Glu Met Leu Tyr145 150 155
160Gly Arg Pro Phe Ile Thr Asn Asp Leu Val Leu Asp Pro Arg Gln Pro
165 170 175Thr Leu Gln Thr
Ser Pro Pro Pro Asn Ile Asn Lys Phe Leu Lys His 180
185 190Tyr Lys Glu Pro Ile Pro Glu Lys Arg Glu Lys
Asn Tyr Ser Thr Leu 195 200 205Val
Thr Trp Tyr Ser Ser Pro Phe Pro Leu Ile Pro His Pro Ile His 210
215 220Pro Gly Lys Asp Pro Thr Gln Ser Phe Tyr
Leu Pro Gln Leu Arg Leu225 230 235
240Lys Trp Leu Glu Trp Ser Leu Gly Tyr Ile Thr Leu Glu Ser Asn
Pro 245 250 255Gly Tyr Cys
Gln Arg Asn Leu Lys Ile Gln Glu Thr Thr Leu Ala Ile 260
265 270Pro Val Asn Leu Arg Ile Cys Ala Cys Ser
Ser Asn Asn Asn Gln Glu 275 280
285Glu Ser Asn Asn His Lys Ser Pro Trp Pro Ser Leu Ile Ile Phe Phe 290
295 300Ser Leu Leu Phe Phe Tyr Pro Leu
Ser Leu Ser Leu His Pro Leu His305 310
315 320Ala Ala Val Pro Val Ala Pro Leu Thr Lys Ser Phe
Tyr Gly Glu Cys 325 330
335Ser Val Pro Glu Ile Leu Met Pro His Arg Ile Gly Val Phe Leu Arg
340 345 350Glu Pro Pro Pro Ser Leu
Pro Thr Pro Ile Cys Pro Ala Thr Ala Ile 355 360
365Thr Leu Pro Leu Phe Ala Cys Met Gln Ile Leu Ile Ile Gly
Gln Glu 370 375 380Lys Leu Ile Leu Val
Val Leu Glu Asp Leu Glu Ser Leu Ser Val Gly385 390
395 400Leu Thr Ser Pro Lys Leu Val Cys Leu Met
Gly Val Glu Phe Lys Ile 405 410
415Arg Gln Glu Lys Asn Met Lys Lys Ser Pro Asn Ser Pro Gly Tyr Met
420 425 430Ala Pro Leu Ala Pro
Thr Lys Asp Ile Ser Gln Asn Tyr Met Lys Pro 435
440 445Ser Val Pro Ile Leu Ala Trp Ala Tyr Leu Ile Pro
Pro Ser Leu Gly 450 455 460Ser Met Arg
Ser Arg Pro Lys Thr Leu Leu Thr Val Gly Tyr Ala Ser465
470 475 480Pro Thr Ser Gly His Met Phe
Gln Ser Leu Tyr Leu Asn Asn Gly Thr 485
490 495Thr Ser Ala Gln Lys Thr Pro Leu Pro Phe Asp Leu
Leu Phe Pro Ile 500 505 510Trp
Lys Pro Ile Pro Gln Thr Ser Pro Val Asn Leu Ala Ile Leu His 515
520 525Thr Gln Pro Thr Pro Asn Ala Ser Gly
Gly Leu Leu Pro His Lys Ser 530 535
540Ala Tyr Pro Gln Glu Tyr Phe Leu Ser Val Val Pro Gln Pro Ile Val545
550 555 560Val Met Ala Leu
Gln Asn Leu Cys Ala Ser Ser His Ser Cys Pro Leu 565
570 575Pro Ser Thr Leu Asn Lys Ile Tyr Thr Val
Met Ser Tyr Leu Ser Pro 580 585
590Ala Thr Lys Glu Tyr Pro Phe Phe Leu Leu Leu Glu Gln Glu Cys Val
595 600 605His Val Leu Ala Leu Ala Val
Ser Gln Pro Leu Leu Ser Ser Thr Thr 610 615
620Asn Tyr Leu Lys Asn Met Gly Thr Trp Asn Gly Ser Pro Thr Pro
Trp625 630 635 640Ser Pro
Cys Lys Ile Asn Leu Thr Pro Gln Gln Ser Phe Lys Ile Glu
645 650 655Glu Leu Thr Cys Pro Leu Lys
Glu Gly Glu Pro Val Tyr Phe Gly Lys 660 665
670Asn Ala Val Ile Met Leu Ile Asn Pro Glu Ser Ser Leu Arg
Lys Leu 675 680 685Lys Lys Phe Glu
Ile Glu Tyr Asn Val Glu Gln Arg Ser Phe Glu Thr 690
695 700Leu Asp Pro Gly Ala Ser Ser Ala Asn Gly Cys Pro
Gly Phe Ser Pro705 710 715
720Ser Asp Leu Gln Leu Tyr Cys Tyr Ser Ser Leu Asp Pro Val Ser Leu
725 730 735Thr Ser Leu Leu Thr
Leu Ser Leu Pro Glu Ser Lys Leu Asn Tyr Lys 740
745 750Trp Ser Pro Arg Cys Ser Pro Arg Leu Arg Ser Thr
Ala Asp Pro Trp 755 760 765Thr Gly
Leu Leu Ala His Asp Leu Met Leu Met Thr Ser Lys Ala Pro 770
775 780Leu Leu Arg Lys Ser Gln Leu His Asn Leu Tyr
Tyr Ala Pro Ile Gln785 790 795
800Gln Glu Ala Val Arg Ala Val Val Gly Gln Pro Pro Gln Gln His Leu
805 810 815Gly Phe Pro Val
Glu Met Gly 820139809PRTHomo sapiens 139Ala Pro Ser Gly Glu
Gln Arg Thr Gly His Tyr Pro Gly Glu Asp Trp1 5
10 15Gln Leu Ile Leu Pro Thr Ser Pro Asn Leu Arg
Asp Phe Ser Ile Tyr 20 25
30Ser Gly Ile Leu Ser Arg Val Gly Gln Arg Pro Ser Pro Val Gly Gln
35 40 45Lys Arg Pro Lys Arg Arg His Phe
Met Lys Phe Pro Asp Ser Asp Phe 50 55
60Pro Glu Ala Tyr Arg Val Thr Ile Ala Leu Leu Ser Arg Pro Gln Pro65
70 75 80Arg Glu Tyr Pro Arg
Arg Val Tyr Asp Ile Thr Tyr Thr Ala Pro Glu 85
90 95Gly His Ser Pro Gln Gly Arg Ser Arg Lys Met
Lys His Ser Lys Asp 100 105
110Ile Lys Ser Lys Pro Arg Lys Pro Thr Ser His Gly Leu Leu Cys Cys
115 120 125Leu Pro Lys Glu Ser Ala Thr
Phe Pro Lys Lys Gln Asp Leu Ala His 130 135
140Thr Lys Cys Cys Met Glu Gly Pro Ser Pro Met Thr Leu Cys Leu
Thr145 150 155 160Gln Asp
Ser Gln Leu Ser Cys Arg His His Leu Leu Ser Gln Ile Ser
165 170 175Thr Ser Ser Asn Ile Thr Arg
Asn Leu Ser Leu Arg Arg Gly Lys Arg 180 185
190Thr Ile Pro Pro Leu His Gly Ile Ser Gln Val Pro Ser Leu
Phe Pro 195 200 205Ile Pro Arg Tyr
Ile Leu Gly Arg Thr Leu Pro Ser His Phe Ile Tyr 210
215 220Pro Asn Cys Gly Ser Gly Trp Ser Gly Val Leu Asp
Thr Ser His Leu225 230 235
240Ser Gln Ile Leu Asp Thr Ala Lys Gly Thr Lys Ser Arg Arg Gln Arg
245 250 255Leu Phe Leu Thr Ser
Arg Gly Phe Ala Pro Ala Leu Gln Thr Thr Thr 260
265 270Arg Arg Lys Val Thr Lys Ile Ile Asn Pro His Gly
Pro Pro Leu Ser 275 280 285Tyr Phe
Ser Leu Tyr Cys Ser Phe Thr Leu Phe His Ser His Cys Thr 290
295 300Pro Ser Met Pro Leu Tyr Asp Gln Leu Pro Leu
Pro Arg Val Ser Met305 310 315
320Glu Asn Ala Ala Ser Arg Lys Tyr Cys Pro Ile Val Glu Ser Phe Gly
325 330 335Asn Pro His Leu
His Cys Pro His Pro Tyr Ala Pro Gln Leu Leu Ser 340
345 350Leu Cys His Ser Leu His Ala Cys Lys Tyr Ser
Leu Leu Asp Arg Lys 355 360 365Asn
Asp Ser Leu Ser Trp Arg Thr Trp Ser His Cys Leu Leu Asp Leu 370
375 380Leu His Pro Asn Trp Tyr Val Trp Gly Trp
Ser Ser Arg Ser Gly Lys385 390 395
400Arg Lys Thr Cys Lys Arg Ser Asn Leu Pro Thr His Pro Gly Thr
Trp 405 410 415His Leu Pro
Leu Gln Arg Thr Arg Ser Leu Lys Thr Thr Asn Pro Pro 420
425 430Tyr Pro Tyr Ser Pro Gly Lys Pro Ile Tyr
His Pro His Trp Ala Pro 435 440
445Gly Leu Gly Pro Lys Pro Tyr Leu Leu Asp Met Pro Pro Pro Glu Leu 450
455 460Gln Ala Ile Cys Phe Asn Pro Cys
Thr Thr Met Glu Gln Leu Gln His465 470
475 480Arg Asn Lys His His Phe Arg Phe Ser Arg Thr Ser
Cys Phe Gln Ser 485 490
495Gly Asn Asn Pro Tyr Leu Lys Pro His Leu Cys Lys Ile Gln Tyr Tyr
500 505 510Ile His Asn Gln Leu Pro
Met His Gln Val Gly Asn Ser Ser His Thr 515 520
525Asn Ser Leu Pro Thr Leu Arg Asn Ile Phe Cys Leu Trp Tyr
Leu Ser 530 535 540Leu Ser Leu Phe Glu
Trp Leu Phe Arg Ile Tyr Val Leu Pro Leu Ile545 550
555 560Leu Ser Ala Pro Tyr Asp His Leu His Thr
Arg Phe Ile Gln Leu Cys 565 570
575His Ile Ala Pro Gln Gln Lys Ser Thr His Ser Ser Phe Cys Tyr Arg
580 585 590Ser Arg Ser Ala Arg
Cys Thr Arg Tyr Trp His Trp Arg Tyr His Asn 595
600 605Leu Tyr Ser Val Leu Leu Gln Thr Ile Ser Arg Thr
Lys Trp Gly His 610 615 620Gly Thr Gly
Arg Arg Leu Pro Gly His Leu Ala Arg Ser Thr Leu Pro625
630 635 640Ser Ser Ser Ser Pro Ser Lys
Ser Lys Ser Phe Arg Leu Ala Asn Arg 645
650 655Lys Arg Gly Asn Leu Phe Ile Phe Arg Gly Arg Met
Leu Leu Leu Cys 660 665 670Ser
Ile Arg Asn Arg His Glu Ser Arg Asn Ser Arg Ser Asn Thr Thr 675
680 685Ser Arg Gly Ala Ser Lys His Trp Thr
Leu Gly Pro Pro Gln Pro Met 690 695
700Asp Ala Leu Asp Ser Pro Leu Leu Arg Thr Ser Ser Ser Tyr Asn Ile705
710 715 720Ala Thr Pro Leu
Trp Thr Leu Tyr Leu Pro Pro Cys Leu Cys Leu Phe 725
730 735Gln Asn Arg Ser Cys Lys Thr Thr Asn Gly
Ala Gln Asp Ala Val Gln 740 745
750Asp Asp Leu Pro Gln Thr Pro Gly Pro Ala Cys Pro Thr Ile Cys His
755 760 765Gln Arg His Pro Ser Gly Asn
Leu Ser Cys Thr Thr Ser Thr Thr Pro 770 775
780Gln Phe Ser Arg Lys Gln Leu Glu Arg Ser Ser Ala Asn Leu Pro
Asn785 790 795 800Ser Thr
Val Phe Leu Leu Arg Trp Gly 805
140 7140 DNA Homo sapiens 140
ttggtcttaa gaacacaaat gatatggctc caatgactgg aggaacacca gggtccttgg
60tctcacgctg atttagataa aacgactgtc aggcctctga gcccaagcta agccatcctc
120ccctgtgacc tgcacgtata catccagatg gcctgaagta accaaagaat cacaaaagca
180gtgaaaatgg cctgttcctg ccttaactga tgacattcca ccattgtgat ttgttcctgc
240cccatcttaa ctgagcgatt aaccttgtga aattccttct cctggctcaa aacctccccc
300actgagcacc ttgtgacccc cgcccctgcc cctaagagaa aacccccttt gattataatt
360ttccactacc cacccaaatc ctataaaatg gccccacccc tatctccctt cgctgactcc
420tttttcggac tcagcccgcc tgcacccagg tgaaataaac agccttgttg ctcacacaaa
480gcctgtttgg tggactctct tcacacggac gctcatgaca tttggtgcca aaacctggga
540taggaggact ccttcaggag accagtcccc tgtccttgcc ctcactctgt gaggacatcc
600acctacaacc ttgggtcctc agaccaacca gcccaaggaa cagctcacca atttcaaatc
660aggtaagcag tcttttcact ctcttctcca gcctctcttg ctacccttca aactccctct
720ctcactaccc ttcaatctcc ctgtccttcc aattccagtt ctttttcatc tctagtagag
780acaaaggaga cacattttat ccatggaccc aaaactccag caccagtcac ggacttggga
840agacagtctt cccttggtgt ttaatcactg cggggacgcc tgcctgatta ttcacccaca
900ctccattggt gtctgatcac ggtggggaca cctgccttgg tcactcaccc acattccctt
960ggtggtacgt caactgcaaa agcaggggac gcctgctttg gctgctcacc cacccccttc
1020tctgtgtctc tacctttctc tttaaactta cctccttcac tatgggcaaa cttctgccct
1080ccattccccc ttcttctccc ttagcctgtg ttcttaaaaa cctaaaacct cttcaactca
1140cacctgacct aaaacctaaa tgccttattt tcttctgcaa cactgcgtgg ctgcagtaca
1200aacttgataa tagctttaaa tggccagaat atggcacttt caatttctcc atcctacaag
1260atctagataa tttttgtgga aaaatggaaa aatggtctga gatgcctgac gtccaggcat
1320tcttttacac attggtccct ccctagtctc tgctcccaat gcgactcatc ccaaatcttt
1380cttctttctc tcctgtctgt tccttcagtc tccaccccaa gctctgagtc ctttgaatcc
1440tcctttgcta cagacccatc tgaactctcc cctcctcccc aggctgctcc tcaccaggcc
1500gagccaggtc ccaattcttc ctcagcctct gctcccccac cctataatcc ttttatcacc
1560tcctctcctc acactcagtc cggcttacag tttcgttctg tgactagccc tcccccatct
1620gcccaacaat ttcctcttaa agaggtggct ggagctaaag gcatagtcaa ggttaatgct
1680cctttttctt tatctgacct ctcccaaatc agttagcgtt tacgctcttt ttcatcaaat
1740ataaaaaccc agccagttca tggcccatct ggcaacaacc cttacaggct ttacagccct
1800agaccctgaa gggtcagaag gccgtcttat tctcaatatg cattttatta cccaatccgc
1860tcccaacatt aaataaagct ccaaaaatta aattctggcc ctcaaacccc acaacaggac
1920ttaattaacc tcacttcaag gtgtacaaga atagagtaga ggcagccaag tagcaacgta
1980tttgagttgc aattccttgc ctcaactctg agagaaaccc cagccacatc tccagcaaac
2040aagaacttca aaacacctga actgcagcag ccaggcgttc ctccaggacc acctccccca
2100ggatcttgct tcaagtgccg gaaatctgac cattgggcca aggaatgcct gcagcccagg
2160attcctccta agccacgtcc catttgtgca ggaccccact ggaaatcgga ctgtccaact
2220cacccggcag ccaatcccag agcccctgga actctggccc aaggctctct gactgactcc
2280ttcccagatc ttctcggctt agcagctgaa gactgacact gcccgatcac ttcagaagtc
2340ccctggacca tcacggatac tgagcttcag gtaactctca cagtggaggc taagtccatc
2400ccctgtttaa tcgatacagg ggctacccac tccacatcac cttcttttca agggcctgtt
2460tccctttccc ccataactgt tgtgggtatt gacggccaag cttcaaaacc ccttaaaact
2520cccccactct ggtgccaact tggacaacat tcttttatgc actctttttc agttatcctc
2580acctgcccag ttcccttatt aggccgagac attttaacca aattatctgc ttccccgact
2640attcctgggc tacagccaca tctccttgcc gcccttcttc ccaacccaaa gcctccttca
2700tatcttcctc tcatatcccc ccaccttaac ccacaagtat gggacacctc tactccctcc
2760ctggcaaccg atcacacgcc cattactatc ccattaaaac ctaatcaccc ttaccctgct
2820caatgccagt atcccatacc acaacaggct ttaaagggat tgaagcctgt tatcacttgc
2880ctgctacagc acgggcttct aaaacctata aactctccat acaattcccc cattttacct
2940gtctaaaaac cagataagtc ttacaggtta gttcagaatc tgcaccttat caaccaaatt
3000gttttgccta tccaccctgt agcacccaac tcgtacactc ttttgtcctc aatgccttcc
3060cccacaactc actattccgt tcttgatctt aaagatgctt ttttcactat tcccctgcac
3120ccctcatccc agcctctctt tgcttttacc tggactgacc ctgacaccca tcagtcccag
3180cagcttacct gggctgtact gccgcaaggc ttcagggaca gccctcatta cttcagccaa
3240gctctttctc atgatttact ttctttccac ctctctgctt ctcaccttat tcaatatatt
3300gatgaccttc tactttgtag cccctccttt aaatcttctc aacaagacac cctcctgctc
3360cttcaacatt tgttctccaa aggatatcgg gtatccccct ccaaagctca aatttcttct
3420ccatctgtta catacctcgg cataattctt catgaaaaca catgtgctct ccctgccaat
3480tgcgtctcca actgatctct caaatcccaa cctcttctac aaaacaacaa ctcctttccc
3540tcctaggcat ggttggatac ttttgccttt ggatacctgg ttttgccatc ctaacaaaat
3600cattatataa actcacaaaa ggaaacctag ctgaccccat agattctaaa tcctttcccc
3660actcctcttt ccattccttg aagacagctt tagagactgc tcccacacta gctctccctg
3720tctcatccca acccttttca ttacacacag ccgaagtgca gggctgtgca gtcggaattc
3780ttacacaagg accgggacca tgccctgtag cctttttgtc caaacaactt gaccttactg
3840ttttaggctc gccatcatgt ctccatgcgg tagcttccgc tgccctaata cttttagagg
3900ccctcaaaat cacaaactat gctcaactca ctctctacag ctctcacaac ttccaaaatc
3960tattttcttt ctcacacctg acgcatatac tttctgctcc ccggctcctt cagctgtatt
4020cactctttgt tgagtctccc acaattacca ttcttcctgg cccagacttc aatctggcct
4080cccacattat tctggatacc acacctgacc ctgatgattg tatgtctctg atctacctga
4140cattcacccc atttccccat atttccttct tttctgttcc tcatgttgat cacatttggt
4200ttactgacgg cagttccacc aggcctgatc gccactcacc agcaaaggca ggctatgcta
4260tagaatcttc cacatccatc attgaggcta ctgctctgcc cccctccact acctctcagc
4320aagccgaact gattgcctta actcgggcct tcactcttgc aaagggacta cacgtcaata
4380tttatactga ctctaaatat gccttccata tcttgcacca ccatgctgtt atatgggctg
4440aaagaggttt cctcactacg caagggtcct ccatcattaa tgcctcttta ataaaaactc
4500ttctcaaggc tgctttactt ccaaaggaag ctggagtcac acactgcaag ggccaccaaa
4560aggcgtcaga tcccattact ctaggaaatg cttatgctga taaggtagct aaagaagcac
4620ctagcgttcc aacttctgtc cctcatggcc agtttttctc cttcccatca gtcattccca
4680cctactcccc cattgaaact tccgcctatc aatctcttct cacacaaggc aaatggttct
4740tagaccaagg aaaatatctc cttccagcct cacaggccca ttctattctg tcatcatttc
4800ataacctctt ccatgtaggt tacaagccac tagtccacct cttagaacct ctcatttcct
4860tccatcgtgg aaacatatcc tcaaggaaat cacttctcag tgttccatct gctattctac
4920tacccctcag ggattgttca ggccccctcc cctccctaca catcaagctc ggggatttgc
4980ccctgcccag gactggcaaa ttgactttac tcacatgccc tgagtcagga aactaaaata
5040cctcttggtc tgggtagaca ctgtcactgg atgggtagag gcctttccca cagggtctga
5100gaaggccact gcagtcattt cttcccttct gtcagacata attccttggg ttggccttcc
5160cacctctata cagtccaata acggagcagc ctttattagt caaatcacct gagcagtttt
5220tcaggctctt ggtattcagt ggaaccttcg taccccttac tgtcctcaat cttcaggaaa
5280ggtagaatgg actaatggtc ttttaaaaac acaccccacc aaactcagcc tccaacttaa
5340aaaggaggat agagcccaaa aactcgcaac caagctagta attatgctga acccccttgg
5400gcactctcta attggatgtc ttaggtcctc ccaaatctta gtcctttaat atctgttttt
5460ctccttctct tattcggacc ttgtgtcttc cgtttagttt ttcaattcat acaaaaccgc
5520atccaggcca tcaccaatcg ttctatacaa taaatgctcc ttctaacaac cccacaatat
5580cgccccttac cacaaaatct tccttcagct taatctctcc cactctaggt tcccatgccg
5640cccataatcc ctctcgaagc agccctgaga aacatagccc attatctctc cataccaccc
5700ccaaaatttt tgctgcccca acacttcaac actattttac attatttttc ttattaatat
5760aagaagacag caatgtcagg cctctgagcc caagccatca tatcccctgt gacctgcaca
5820tatacatcca gatggcctga agtaactgaa gaatcacaaa agaagtgaaa atggcctgtt
5880cctgccttaa ccgatgacat tccaccactg tgatttgttc ctgccccacc ttaactgagc
5940aattaacctt gggaaattcc ttctcctggc tcaaaacctc ccccactgag caccttgtga
6000cccctgcccc tccactaccc acccaaatcc tataaaatgg ccccacccca tctcccttag
6060ctgactcctt ttttggactc agcccgcctg cacccaggtg aaataaacag ccttgttgct
6120cacacaaagc ctgtttggtg gactctcttc acagggacgg gggtgacaac aacacggaca
6180cacatggagt ggttttaagg agcagagagt ttaatacgca aaaaagaagg aagaggctcc
6240cctgtacaga cacagaggga gggggctcca agccgagaga aggaaacccc atgtgcagtg
6300gaaaagtggt tgattatact gggaggctgg aggaggcggt gtctgatttg cacagggccc
6360aggggattgg gttgaccagg tgtatcattc atgtaccccg caaaaaacct ggccctccca
6420cctcagccct ttaatatgca aatgtgggtt gccatgatgt tctgaaaaca catgaattat
6480ctggaggggg ccatgacact tggtacatgt gctgacaaga agagggtggg aatcgccatg
6540gtggccatgt tgggtggacc tagtttttaa tagcctgcat ttgcatatca aagtttgctg
6600gcctggctct ttaagctgtc ttttctgtta gaaaaggaat ggtttggaat gggtgagggt
6660tgcttcttat tacaagaaaa tttccaaaaa cctttactct ttctagctgc caaaaaacta
6720tttcttaata acttatgtat taccataatt aggcagcacc aaagatccct gcaggtcaga
6780ccactgcaat taacatgctg gctttactgc tgattatggt agctgcatcc acctagcctc
6840tcatattgca actgcctgac ctctgccacc ccacgagcca cttatcccca cttataatca
6900gcccatttcg attgtaacat ctgccactta ttcccgacgt tgtggtatat cctatagatg
6960aattcattca acatccattc caacaccacc tctcttgcct tcctatactc tctggagagt
7020gaattactga gtcacatgat cttcactgca gtcatttgtg gctatgtgac atagttctgg
7080acagtgaaca tagacagaag tccctggggc gggcttcctt tctgggatga gggcaaaacg
7140
141 15000 DNA Homo sapiens 141
gatctcttga tcccaggagg tcaaggctgc
aatgagctaa gatcaagcca ctgcattcca 60gcctgagtga tagtgggaga ccttgtcttt
aaaacacaca cacacacaca cacacacacg 120agggcctttg accactcttg agtagaagac
tcgagaagaa caaagtagaa ggccagagaa 180gaacaaagtt acttgaaaga tctcttatta
aagagaatgt acaagctatg aaaaaaaaaa 240aacacacaca cacacacaaa cctcatctgg
aatgaaaaaa acataatgca tttggtttct 300ggttccttag gctgttatgg aacaaccaaa
gaacattatt ttggtttctg aggtcagaac 360tattttattc ccctcaagca cactatgctt
atggtttgag ggagaatgag aaataggaaa 420ctaggaacag gctgaaatgg tctaatcttg
accatctaat tctgcagtgt cttattctca 480ttctaaaaga gaatggttat attcgctgtt
ctagcataaa aagtaatgat aaaaataaaa 540gatcccgtat taccagacaa taatccccta
gactgtttta atgcttggtt gagtatttgc 600ttatgatctc agactttaaa agatggtctc
cccctatggt gaagcttgtt aattatgtag 660gcatcattaa tgtctgttta cttatcaaaa
ttttatcatt gttagttgta ttactacttg 720acagtccaat ttatttaatt gaaaagattg
gttaacattt tatagtcaaa gtaattgttt 780cctgtgtttt ttcctgttta ggttattgga
gtgatgagta aagaatacat accaaagggc 840acacgttttg gacccctaat aggtgaaatc
tacaccaatg acacagttcc taagaacgcc 900aacaggaaat atttttggag ggtaagtaag
ggaaatttct tcagacccat taaatgttag 960gaaaaaatgg agctaaaaga gctgggtggc
tcacctttct catcctgtgc tgagaaatgc 1020tggggctcac ccataagtat ccagcatccc
catggacaca gggaattctg aacaaatgtg 1080atgaaaccga tgaaatgtct ggcctgtagg
tggttagtga tggagatacg ggctatatgt 1140gaatcttgat ttttgcaatt cattagagct
ttgtaatgaa aggaaacagt ttgttgcttg 1200ctttaaggat aggttcattt gcatttctcc
gcaaggaagt agtaatgagt taccaagcct 1260tagatttcac ccctttttga tttcttgctg
acttaacttt aattgaatgg aagagttatc 1320acaaatgaat tatctttttg gttttttttt
ttttgagatg gagtctcact ctgtcaccag 1380gctggagtgc aatggcatga tctcggctca
ctgcaacctc cgcctcccag gttcaagcaa 1440ttgtcctgcc tcagcctccc gagtagctgg
gactaaggtg cgcgccacca tgcccagtta 1500atttttgtat ttttagtaga gacggggttc
cactatgttg gccatgatgg tctcgatctc 1560tggacctcgt gatccgccca ccttggcctc
ccaaagtgct ggaattacag gcaagagcca 1620ccgcgcccag ccaggaatga caaatgaatt
accttataag taaatgccat taaggaagga 1680tagctggaag atgggttgag gggaatggag
gaccacagaa ctagtcctat ttaaatacat 1740gtgcatggta aaatgattcc atttgacaat
aggttaatta tctcatagca taaggaaaat 1800gcttaacagt catatgcaag atgataagct
ttcctatagc atccaaccaa aagatctagc 1860cagtacaatt tcctttgcta tattagggtt
agaaaggccc ccagaggtga accaattaga 1920tggaatcctt gaataaaaca ctggattagc
agtgaacaga aaaaagtcag attgctttcc 1980ttcttcccat agatgtctca gggatattta
gtttcctcag aagataaaga atttagtaag 2040cgtttttttg tgcatactta catgaaatgt
acattatttg aattctttaa aaagaaacag 2100ctgcatgata acaaaaattg tgttatgctt
gctttagctg gtatttttgc ctagaacgat 2160tatatcgttc ggacaagaag ctattcctaa
gaaacaatat ttttaatcca ggaagttttt 2220catttttaga aatttatctt actatttccc
aagcaaaaga gggtagttac agattcacta 2280agaatcatgt gctcacaatt tttatttaat
aattattcct ccttaaaata tattaatcac 2340ctgacttaca atggtggaac catgagtgca
tttttgcctt tattgtcaat aacgtcttct 2400cagaagtgag ccacaaaggt gcatagttct
tggagttaaa ggtctgaatt aagacaatcc 2460agcataagtc tcattaatgt gtgattattt
tgagaaaagg caagaagtac ctaagaatct 2520ccccctcact gtccagttcc ctgtttcatt
taaagattca ctgtaagtaa ctgaaaggct 2580ttccttggga ggatttattt gaatcagtct
ttcacatgca aaggatattg tagaacatct 2640cgtttttgct ggcaggaata tgaacatctg
ttgtgaggaa agaaaaagtt tcatgcaaat 2700tacactgcca aagaagggat gttcaagttg
agaaaccagt gacatttctt gtaactgtac 2760tatgaatcag cgcattttaa tcttctagat
aatatatgga agtgcaggaa ggtggtagga 2820aacggtgttc attttacata tgcgttattt
tattctgtgt gagtgacttc atggcaccga 2880cattgctgtt tttaaatgag gatacagtaa
attgcagtcc gaggaaggct aactggaatc 2940aacatacccg tagctttaga aagcagtttc
cgcaccagcg aagagtacaa gagcgatgga 3000accccatgtt cctggaagtt tgcacatcag
agtaaacaaa cttgaaaacc cctcttgata 3060gcagaattca cccagccttg ttccattttc
tcttaacaaa acacaccgca aaagctctca 3120caagctgctt tgatgaagcc acatgtattt
cccccttcac aatttacagg aagttactct 3180taaaagaaag tgattctggt gtttaccgcc
tgtgttaaag ggacagagtt cctttttatt 3240tctgataacg tttgagcgaa atacagaaac
tatctgtaga ctagcatagt cggtacgtga 3300gtaaggaaaa gcaataacct gctgtccggt
gagcacaaaa ttcctgctac gaacagtgcc 3360ttactgctgc ttggagactg caagtcgcag
atcacactag gtattgactg attgtataag 3420gaaatttctt aaagtctaaa gtaaaggtgg
tacctcctaa aaagagggga agagagaaaa 3480ctttgtgtgg aaggataagg agtgtgttta
tagtttcagt aagagtgtac gttttaattt 3540ttcttcttcc tctgcctctt tgccaagtag
cctgagtgca tctgttatcc agaagtagta 3600ttactctagg acaaacttca aattcttcat
tctgcgttgc ctttaaggaa caacatactt 3660tcttcctgtt ctttttccaa aaacacacgc
ctatggctct gtgtgtggtg ttttagccag 3720cctcctccca gataaggggt tcccttccct
cctttgcatt gaaaggaaag tgcaagtctg 3780gacatgttta tcaagaggaa aagtgacttc
tcagtaatag actgtcaaat tcgggctgct 3840gcccgagtgt tcgctttgtt atggcaggtg
aagttcacct ttgccccacc cagtgtttcc 3900acaaaaaggc aaggttccaa gtattcatat
gaacaagtgt tactttagga cttggagggt 3960tgggggtgga ggatgtttgc atagttgaag
ccttgggcgg gggtgtagga aacggcgagt 4020acagaggcca tagaaaaagc taagactcag
tttgacgtcg tcagccggct tggtcttcta 4080cccagtgact caaagcacta aaagtcagca
taatcggaac tgaagtcagt agcatcgccc 4140atttgccatt cactgcagta gcaaaagtag
tactctgtgg tgggttaatc ggtttgaggc 4200agctccttaa atgaacattt gtgtttcatt
tttctgttat tttcccgaac atgaaaagac 4260gataaaactg aaatggaaaa ggtaactgac
aaaagtgtgc cttacctgtt tccgccctga 4320tttctgctga ttcaagacta ttctggctaa
actgattgga ttctttttct aactaggcag 4380taggggatca gaaatcacac acggtaccgg
ctgtgtttat tctgagaggt gctggggagc 4440tttgggtctg acttcctttt acatgcctgt
cttctctttt ggacagatct attccagagg 4500ggagcttcac cacttcattg acggctttaa
tgaagagaaa agcaactgga tgcgctatgt 4560gaatccagca cactctcccc gggagcaaaa
cctggctgcg tgtcagaacg ggatgaacat 4620ctacttctac accattaagc ccatccctgc
caaccaggaa cttcttgtgt ggtattgtcg 4680ggactttgca gaaaggcttc actaccctta
tcccggagag ctgacaatga tgaatctcag 4740taagtggatt acagaacaaa aaaataaaaa
atgccagtaa tgtcggttct gcccctttga 4800actaataaca tgttgtttaa ttatacggct
ttgtcatgtg ttggatgaag taggtggctt 4860aagctaggga ctaggaagag gaaaaacatt
ttttgagtcc ctattaacta ttaggaaact 4920tgatcattta aaagtatata tatatatgag
gagctacctt gagttttgaa ttcaggatgt 4980tacaggaaga aatatatgtc caattctaat
ttatccaaaa gcagttggga gaattacagg 5040gattggtcca gacatgctgc gtatgcaagg
tatagccctc atctgtggta ctttggcagg 5100gcttagactg catcaaaata tttatagatg
tacatttgag tgtacagtta ggatctgatg 5160tggaacattg taagatcatt gctagaaaaa
ctttgtcata atttttcaat attattctaa 5220gtgaataacc gtaaagattt tacatcttag
cttccttcct tacagtaaaa aaactatctg 5280atctcttgat cagtattata gtagccacct
atcactttat cttaacaaat tctcaattcc 5340ttaggtttat gtgcttttac ttcttttatt
tgattaaaat tgctgtcatg acctctctct 5400gcagagggct gcatcatttt ggtcattctc
aagtgatctc tttgagcaat ttaagaattg 5460ccataagatt ctaacctctg ctgtaactat
ggttgtgtgt tcttggttag accactaaat 5520cttattagca gttttaaaaa ttattccttt
tggtttagaa gttaagacta aatgctgaag 5580tttttgtaac ttttggtttt gatatcattt
caaacttaag aaaacatttg aagaaaagga 5640caaagaattt ccacttaccc tttacccagg
tttaccagtt attgataagt atatccattt 5700gctttaccag aaggctaact tgttttagtt
ctcattttca cctttgagac atttggaata 5760aatatcaatg ttaacataaa ttggaatttt
gactttgatt ttaggaccaa tgaacaagcc 5820aagtacttac cctagtcata tataatccaa
ctgtatggtt atttggtatt cattccacac 5880ttcattttac ttgatctccc ttaagattgc
aagattgtgt ttgcagtttt tctgaaaatc 5940tggggctata aaagcatcag gacctccccc
gtaggggagg tcgtgtgttt ggggtcctta 6000cacaacaggt tacccttgag cttcaggaaa
agaactggct ctcagttccc cagttccagc 6060ttaatgggtc taattaggtc ctgaccaaaa
aggtggcagt tcttttccct catgtctctt 6120cagcgctccc cgagactctg gagactctgt
catatcccta gggctgagcc tcccaggaac 6180cattcggctg ttgtggcatc tgtgtatgcc
atgcccagtg ctgaggacct agtaacaaac 6240gacaaatgca caggcacagt ggcatttttg
tggaactcgt attccagctg tgcgtctcag 6300aagaagcgca cagctccctc ctggctttct
taacatagtg agccacttcc acttaagggt 6360ctccttacat tccttgagtt taatcattca
tggattcaga ggaaagtctt ttgatttttg 6420cttttcttta aacagttcat ttgaggtgac
ctaccccagt gactttgcac caaccaccaa 6480gaaacttttt tgcatgcttc ccgcaccctg
tgccaatcaa gggaagggtt taaaggcctg 6540gcgtttttat tcctcaaaga aaggttttgc
acagtatttt aaggttcaag tgcttctact 6600ttgtgttcag aagcaactgt catatatact
gtgaaatgac accttttatt tatccctttt 6660tatttatgca gtatgtcccc ttttattttg
gcagaatttt ttctaaatgg tggtttaaca 6720ttttcaagca catttcattg tccaatattc
atagtaaaga atgagagtta acaataacca 6780gtcacattaa aacaagattc ctgctgccag
ttgtgaaacc ggttgtctta ggcgtggcag 6840ctgatgattg agactgtgat caggaaaatt
tccactattt catcaggcct aataggtaga 6900ttgtgtctcc aaatgaactg tgttgggttt
ccatgcttaa agcacaatag aggtggtgca 6960agaatctcca tgagggctta aatggcagtg
atggttcagg cggtagagtt tggagaagaa 7020gggatttgaa acaaaccaaa ggaaagaaaa
gtaagtagcc agaaatcaca aaatggcatt 7080tttctaaaaa caaaggaaaa ggaataaaag
aactaataag tttgaaaccc ctacccctcc 7140caaatttggc agggggggag gtattttttt
tctatctatc taactaaccc atctagaaaa 7200cagttgacca aattatagac ttctaaatgt
taatctgctt tctcagtttc agttgaaaag 7260agactttgtt ttgcctactg cagaacttct
aggttctttc ttatagtctt ggggttctta 7320ttatagatcg aaaatgtgag tcggcataat
taagccattc ggagtcttca gaagcagttc 7380actcttgaaa tgactccgtc cgcctacagc
catttaagat ttcagaacaa aaacagatct 7440tgattttctt tttcatgtta actcaagctg
ttgctgagtg ggagagtcag aaatgacacc 7500agctccactg attactcagc tgctgaagga
tgatttttta aaatgcacct ttactgtata 7560tggacttcct aatttccacc tgtagagcat
cttagggagg ctaacatgtc actctggatg 7620ttcttttaga ataagatgca aatctatttt
tctgaaggca ttagagatag caaacattta 7680ttgtgagttt actatatact aggcactgtg
ctaagtgttt tgcatagaaa gtttaaaatt 7740ctggcttttt tgttggccca atcataagtt
tcatatcagt tcaacattca aattatatta 7800aggtacttaa gaagaatccc tggctaaatg
tgaggggcag tgccacagat ggactgaaac 7860tttatgctta ttgcacattt atgctattat
tatttgttga attatagaac caagggagtg 7920tggaagccac tggaaaaaat atgagactta
gatacataat ttgagtaaaa atggctcaaa 7980gtcatgaggg taaagttttt tgtatttcca
ttttattcga gcggcatcgt ttttaaaaat 8040cattatgaat ttgaccctat atagatgttt
ccaaataatt ctttttcacc ttcataaaat 8100tccttcctgt ggctgtgaga tgccttgcct
atcagttttc aagcttagtt gtctttctca 8160tcctttacca ttttagcttt aaaaaacaaa
agtgacaatt agaacttcct gcctgctggg 8220cctcactgaa agaccgatat tggcctgata
aggagatatt tattttgttt tagtggcttc 8280agaaatccct ctccctcagc aagctttcca
tcacggcccc cccgtcagca tcttccctga 8340tagcgttctt ctctgtgttt attctggggc
ttcaggctcg cccaggagga actgataacc 8400gctggcagga gataacattc tctaaggggc
tctcaaattg gaatcgaatc cctcaagcca 8460gtcagcctag agaatacatt taaagggttc
agttctggag tttcacagag ttcatttcta 8520gacctatcag atagcaagtg tggagttctt
tctcaactaa attcaagcag agacattttt 8580tagacgatga aggatatttg cacaaaggct
tcagcatgat cccccaaacc tgctgcctct 8640gaaggcatct ccacacattg acagccaatg
ccttcagtgc gttcctaggg caggtgtcct 8700ggcttgagtg actgtcctcc aataatcaga
gctcaaacta aacatcgtat gttttacttt 8760tggtttccag gcaaggctga gcagggaatt
ttcagttttc cctgcccaga tgggtgtttt 8820ttcctgaagg catcatttat tgtgtagcga
ggagacaggg ctggctgtgg cagggatagt 8880ctagaactgt cctcattgct gctgttccta
aatagtatct ttaccaagta ataacgtgcc 8940gtctttggga ataagtgctt tcctcttagc
ctgttctgtt ttcttgggtg cgctaagtaa 9000ttgaactggc tcaggaagta cctattgtgg
tttggcagag gtgactgtca cgccttgtga 9060ctccaggggc cagcactgct gggatcctgg
ctagaccaga cagagccttg gtgaagtgct 9120taggctgtct gcacatcgcg aggaaggtgg
tattcacttc gctaagctcc ttggcatagg 9180cagtttgaac agggctttat caaattcgta
ttcaacaaga gtagaagcga aaattgatga 9240ctgtgtatta cttgaaatga gtcttaatct
ttcacattta gttctcaggg tatgctgatt 9300tcctttaggt aaaccatgaa catcagaaag
acttttatta acctatgaca gggtccccac 9360cccagtattt ttccactcca ttaaaatgga
agtttttttt ttttttttct tttttgagac 9420agagttttgc tcttgttgcc cagtctggag
tgcaatggca caatctcggc tcaccacaac 9480ctccacctcc cagattcaag cgattcttct
gcctcagcct cccaagtagc tgggattaca 9540ggtgtgcgcc accacgccca gctaattttg
tatttttagt agagatgggg tttctccatg 9600ttggtcaggc tggtctcgaa cttccgacct
caggtgatcc gcccacctcg gcctcccaaa 9660gtgctgggat tacaggcaag agccactgca
tccagcttag gctatcttac tccagcctaa 9720acagcaattt tctatcataa ggtctgtact
aatgaaaaca gaatcaccca aggctgctgt 9780ttgttctgtc tgtgctgcca ttgtccgcat
tttgctgagg aggaaacgga actgcacttt 9840tgagtgagtg gcccagagcc ttctagaatg
agagtgcgtt ggaagccaga tatgtggcga 9900ttgtgtcgcc agctgttact caggttttct
caagaaggag gagcaacttt ggcagttttg 9960cttcagttct ctctagccct ctgtgtaatc
gccccttttt ctttatttca gcacaaacac 10020agagcagtct aaagcaaccg agcactgaga
aaaatgaact ctgcccaaag aatgtcccaa 10080agagagagta cagcgtgaaa gaaatcctaa
aattggactc caacccctcc aaaggaaagg 10140acctctaccg ttctaacatt tcacccctca
catcagaaaa ggacctcgat gactttagaa 10200gacgtgggag ccccgaaatg cccttctacc
ctcgggtcgt ttaccccatc cgggcccctc 10260tgccagaaga ctttttgaaa gcttccctgg
cctacgggat cgagagaccc acgtacatca 10320ctcgctcccc cattccatcc tccaccactc
caagcccctc tgcaagaagc agccccgacc 10380aaagcctcaa gagctccagc cctcacagca
gccctgggaa tacggtgtcc cctgtgggcc 10440ccggctctca agagcaccgg gactcctacg
cttacttgaa cgcgtcctac ggcacggaag 10500gtttgggctc ctaccctggc tacgcacccc
tgccccacct cccgccagct ttcatcccct 10560cgtacaacgc tcactacccc aagttcctct
tgccccccta cggcatgaat tgtaatggcc 10620tgagcgctgt gagcagcatg aatggcatca
acaactttgg cctcttcccg aggctgtgcc 10680ctgtctacag caatctcctc ggtgggggca
gcctgcccca ccccatgctc aaccccactt 10740ctctcccgag ctcgctgccc tcagatggag
cccggaggtt gctccagccg gagcatccca 10800gggaggtgct tgtcccggcg ccccacagtg
ccttctcctt taccggggcc gccgccagca 10860tgaaggacaa ggcctgtagc cccacaagcg
ggtctcccac ggcgggaaca gccgccacgg 10920cagaacatgt ggtgcagccc aaagctacct
cagcagcgat ggcagccccc agcagcgacg 10980aagccatgaa tctcattaaa aacaaaagaa
acatgaccgg ctacaagacc cttccctacc 11040cgctgaagaa gcagaacggc aagatcaagt
acgaatgcaa cgtttgcgcc aagactttcg 11100gccagctctc caatctgaag gtaggccttg
agagagagca gtccaagggg ctgtgagtgc 11160atgcttgtgt ttgtatttag cttgctttcc
atggggtatc gattgcattt gcagtagtat 11220gagcccccgg ttggggatag tgggtatgga
ttccgcctgg cttttgccac ttctagctct 11280ttgactttgg acaagtgact tcccttctcc
tgattttctt ctgaataata aaaaaattag 11340gggtttggac tagaagatta ggtgaaactc
cctgctagcc tgtgattttt gtgcttttaa 11400gaaaaacacc attctgaaaa catgaagatt
tcttcttttt aagactgtct tgatgctttt 11460cttaagatat ttgcatcaac acttgagtct
tggagcagaa atgttaggtc tcagagccag 11520cttgagagca gagctaacac atgtggcttc
ttcccaggtc cacctgagag tgcacagtgg 11580agaacggcct ttcaaatgtc agacttgcaa
caagggcttt actcagctcg cccacctgca 11640gaaacactac ctggtacaca cgggagaaaa
gccacatgaa tgccaggtgc gcagtatttt 11700ctgggtagac cttctgacct ttgtagaaaa
tgtctgtgag tcaccctccc atgtcctata 11760tagcccgtag ttaaagccaa caccagattc
tgcgttgtcc catcctggac tgatggcact 11820atggtccttc ccagtacttt gtatctgctg
atgacttgag atggcacagc cagcttccag 11880tgggtgggaa aatggtaggg gaaataaaca
gcccctcgtg tgctgtgtgc ccacatcccc 11940ccgtttgctt aataccacac tggaggtgcc
acaaggaggc ttctcacctc ctaggttgct 12000gggcgttggc cggtaagcct gcccctcccg
ttggcaactc ttaatcttct ggccttcctg 12060tctcccttcc ctgctgtctc tctcccctac
actgtaggtc tgccacaaga gatttagcag 12120caccagcaat ctcaagaccc acctgcgact
ccattctgga gagaaaccat accaatgcaa 12180ggtgtgccct gccaagttca cccagtttgt
gcacctgaaa ctgcacaagc gtctgcacac 12240ccgggagcgg ccccacaagt gctcccagtg
ccacaagaac tacatccatc tctgtagcct 12300caaggttcac ctgaaaggga actgcgctgc
ggccccggcg cctgggctgc ccttggaaga 12360tctgacccga atcaatgaag aaatcgagaa
gtttgacatc agtgacaatg ctgaccggct 12420cgaggacgtg gaggatgaca tcagtgtgat
ctctgtagtg gagaaggaaa ttctggccgt 12480ggtcagaaaa gagaaagaag aaactggcct
gaaagtgtct ttgcaaagaa acatggggaa 12540tggactcctc tcctcagggt gcagccttta
tgagtcatca gatctacccc tcatgaagtt 12600gcctcccagc aacccactac ctctggtacc
tgtaaaggtc aaacaagaaa cagttgaacc 12660aatggatcct taagattttc agaaaacact
tattttgttt cttaagttat gacttggtga 12720gtcagggtgc ctgtaggaag tggcttgtac
ataatcccag ctctgcaaag ctctctcgac 12780agcaaatggt ttcccctcac ctctggaatt
aaagaaggaa ctccaaagtt actgaaatct 12840cagggcatga acaaggcaaa ggccatatat
atatatatat atatatctgt atacatatta 12900tatatactta tttacacctg tgtctatata
tttgcccctg tgtattttga atatttgtgt 12960ggacatgttt gcatagcctt cccattacta
agactattac ctagtcataa ttattttttc 13020aatgataatc cttcataatt tattatacaa
tttatcattc agaaagcaat aattaaaaaa 13080gtttacaatg actggaaaga ttccttgtaa
tttgagtata aatgtatttt tgtcttgtgg 13140ccattctttg tagataattt ctgcacatct
gtataagtac ctaagattta gttaaacaaa 13200tatatgactt cagtcaacct ctctctctaa
taatggtttg aaaatgaggt ttgggtaatt 13260gccaatgttg gacagttgat gtgttcattc
ctgggatcct atcatttgaa cagcattgta 13320cataacttgg gggtatgtgt gcaggattac
ccaagaataa cttaagtaga agaaacaaga 13380aagggaatct tgtatatttt tgttgatagt
tcatgttttt cccccagcca caattttacc 13440ggaagggtga caggaaggct ttaccaacct
gtctctccct ccaaaagagc agaatcctcc 13500caccgccctg ccctccccac cgagtcctgt
ggccattcag agcggccaca tgacttttgc 13560atccattgta ttatcagaaa atgtgaagaa
gaaaaaaatg ccatgtttta aaaccactgc 13620gaaaatttcc ccaaagcata ggtggctttg
tgtgtgtgcg atttgggggc ttgagtctgg 13680gtggtgtttt gttgttggtt tttgttgctt
tttttttttt ttttttttta atgtcaaaat 13740tgcacaaaca tggtgctcta ccaggaagga
ttcgaggtag ataggctcag gccacacttt 13800aaaaacaaac acacaaacaa caaaaaacgg
gtattctagt catcttgggg taaaagcggg 13860taatgaacat tcctatcccc aacacatcaa
ttgtattttt tctgtaaaac tcagattttc 13920ctcagtattt gtgtttttac attttatggt
taatttaatg gaagatgaaa gggcattgca 13980aagttgttca acaacagtta cctcattgag
tgtgtccagt agtgcaggaa atgatgtctt 14040atctaatgat ttgcttctct agaggagaaa
ccgagtaaat gtgctccagc aagatagact 14100ttgtgttatt ctatctttta ttctgctaag
cccaaagatt acatgttggt gttcaaagtg 14160tagcaaaaaa tgatgtatat ttataaatct
atttatacca ctatatcata tgtatatata 14220tttataacca cttaaattgt gagccaagcc
atgtaaaaga tctacttttt ctaagggcaa 14280aaaaaaaaaa aaaaaaaaaa gaacactcct
ttctgagact ttgcttaata cttggtgacc 14340tcacaatcac gtcggtatga ttgggcaccc
ttgcctactg taagagaccc taaaaccttg 14400gtgcagtggt ggggaccaca aaacaaccag
ggaggaagag atacatcatt ttttagtatt 14460aaggaccatc taagacagct ctattttttt
tttgccactt tatgattatg tggtcacacc 14520caagtcacag aaataaaaaa ctgactttac
cgctgcaatt tttctgtttt cctccttact 14580aaatactgat acattactcc aatctatttt
ataattatat ttgacatttt gttcacatca 14640actaatgttc acctgtagaa gagaacaaat
ttcgaataat ccagggaaac ccaagagcct 14700tactggtctt ctgtaacttc caagactgac
agctttttat gtatcagtgt ttgataaaca 14760cagtccttaa ctgaaggtaa accaaagcat
cacgttgaca ttagaccaaa tacttttgat 14820tcccaactac tcgtttgttc tttttctcct
tttgtgcttt cccatagtga gaatttttat 14880aaagacttct tgcttctctc accatccatc
cttctctttt ctgcctctta catgtgaatg 14940ttgagcccac aatcaacagt ggttttattt
tttcctctac tcaaagttaa aactgaccaa 15000
142 3242 DNA Homo sapiens 142
gtctggactt gtggtgcgct gccagggatc cgcagcgttg ccggttgtat tcgctggata
60ccagagggcg gaagtgcagc agggttcagc tccgacctcc gcgccggtgc tttttgcggc
120tgcgcgggct tcctggagtc ctgctaccgc gtccccgcag gacagtgtgt caggcgggca
180gcttgccccg ccgccccacc ggagcgcgga atctgggcgt ccccaccagt gcggggagcc
240ggaaggagga gccatagctt ggagtaggtt tggctttggt tgaaataaga atttagcctg
300tatgtactgc tttaactcct ggaagaatga cagatgacaa agatgtgctt cgagatgtgt
360ggtttggacg aattccaact tgtttcacgc tatatcagga tgagataact gaaagggaag
420cagaaccata ctatttgctt ttgccaagag taagttattt gacgttggta actgacaaag
480tgaaaaagca ctttcagaag gttatgagac aagaagacat tagtgagata tggtttgaat
540atgaaggcac accactgaaa tggcattatc caattggttt gctatttgat cttcttgcat
600caagttcagc tcttccttgg aacatcacag tacattttaa gagttttcca gaaaaagacc
660ttctgcactg tccatctaag gatgcaattg aagctcattt tatgtcatgt atgaaagaag
720ctgatgcttt aaaacataaa agtcaagtaa tcaatgaaat gcagaaaaaa gatcacaagc
780aactctggat gggattgcaa aatgacagat ttgaccagtt ttgggccatc aatcggaaac
840tcatggaata tcctgcagaa gaaaatggat ttcgttatat cccctttaga atatatcaga
900caacgactga aagacctttc attcagaagc tgtttcgtcc tgtggctgca gatggacagt
960tgcacacact aggagatctc ctcaaagaag tttgtccttc tgctattgat cctgaagatg
1020gggaaaaaaa gaatcaagtg atgattcatg gaattgagcc aatgttggaa acacctctgc
1080agtggctgag tgaacatctg agctacccgg ataattttct tcatattagt atcatcccac
1140agccaacaga ttgaaggatc aactatttgc ctgaacagaa tcatccttaa atgggattta
1200tcagagcatg tcaccctttt gcttcaatca ggtttggtgg aggcaacctg accagaaaca
1260cttcgctgct gcaagccaga caggaaaaag attccatgtc agataaggca actgggctgg
1320tcttactttg catcacctct gctttcctcc actgccatca ttaaacctca gctgtgacat
1380gaaagactta ccggaccact gaaggtcttc tgtaaaatat aatgaagctg aaacctttgg
1440cctaagaaga aaatggaagt atgtgccact cgatttgtat ttctgattaa caaataaaca
1500ggggtatttc ctaaggtgac catggttgaa ctttagctca tgaaagtgga aacattggtt
1560taattttcaa gagaattaag aaagtaaaag agaaattctg ttatcaataa cttgcaagta
1620attttttgta aaagattgaa ttacagtaaa cccatctttc cttaacgaaa atttcctatg
1680tttacagtct gtctattggt atgcaatctt gtaactttga taatgaacag tgagagattt
1740ttaaataaag cctctaaata tgttttgtca tttaataaca tacagttttg tcacttttca
1800agtactttct gactcacata cagtagatca ctttttactc tgtgttacca ttttgactgg
1860tcgtcattgg catggggtgg atatagggca taggattact tgtctcagaa gctgtcatag
1920aatttcttgc tgccaattaa aaaacctgtg ttctttacac actacacgta taaatattgt
1980aactgttcat ctttgttgtt ttatcactgt aagcctgtca aatcatagta tcctaagcat
2040ctgtaaatgc taattttgca tttttggaaa aacccattcc ttccaagcta gtgtttttca
2100ttggctccag gtctaatttt tcactgtggt ccctggcagc cagtcttttg aagtttaaag
2160attacctgtc tcttgactgc agtacctttt ctttaatttt taccaaaaat atccagaggt
2220tactggagtt cttattcaat ataaggaaag tttgctgcac tttattacca agcctctggg
2280attttaccag tcaaacatat ttgtgcatta catttcattt cttgtgagct agctggctgt
2340ccatattgaa tgttgaccca tttgagtacg ctaaaaggct tacagtatca gacacgatca
2400tggttttaga tcccataata aaaatgratg tttttcttat aaaaaattat acaaatgctg
2460aagtgagatt ctactattgt tcattgcttc cttttctttt tccttttgcg attttcactg
2520attaatagca catttcttca caaaattaga taaagttggt caaagaccag atattctgga
2580atggaaattg taaagcttaa tcaaaaagaa tagccagtac agcatacaat ctcagaaact
2640tagaagcaag tagaaaataa ttggttgatg taaacgaaag tgccatttta gtaaaggcag
2700gaaaaaaata gcaatatttg agttatgtaa ggataaaaaa tccactgact tgtatttttg
2760cacaagaggc tggtctgaat atgattgttc acattaagag tgtttattcg tcggttcatt
2820ttggggattt tcccccttga tgttttgaca gattgaagtg agctttagtg agcaaaagga
2880tcagaatgca gggaacacta agctgtgatg aagaaagtgt ggtaaaaagc cagagtagtt
2940ttatacagac aaaaccagtg tcaggccttt gcagtaggct tgagtgaact tctgatctag
3000atttgaaagt aaattttatg aagacattgc ccatttttac ttcctcattc attattgtac
3060cagcatcata gctttattac tctaatccca ggtaagtcaa gcctacaatg ccctagagga
3120agagtaaaac cagaaattca tgctggctta aataatctat ttttgtttct tttcatttga
3180atatttaaat tttatggttt attaaaaaat taaataaaaa agaaaaaaaa aaaaaaaaaa
3240aa
3242
SEQ ID NO: 143; length: 13695; type: DNA; organism: Homo sapiens
cgccgcctgt gcagccgctg ccgccgccgc
cgccgccgcc gccgccgccg ccgccgccgc 60cgccgctgcc gccccggctg ccgcgccgcg
ccgctgcctc tgccccggcc gcccccgccg 120ccgctgccgc cgccggcccg cagccagcca
ggcgggcggc ccagcccgcc tgagcccgca 180gcggctgccg ccgcagcgtc gggtcgctgg
gtgcgcgggc taccgcggac cgagcggacc 240cgagtgggcg accaggcgct tgcccgccca
gtgccactgc cgccgcttcc tcgccggagc 300acaggaccag acacctccag cgcccgctgc
tgctgccgat gcggcccgga cacttttagc 360tgggcgggag ggctggagag ccgggggccg
ccgagaaccg ccagcgagct gtgccgagag 420ccgcgccgac ccgctgcgat cagggacagg
cgcccgcccg ccgccgccgc ctggccgcta 480tggatctatt cgactttttc agagactggg
acttggagca gcagtgtcac tatgaacaag 540accgtagtgc acttaaaaaa agggaatggg
agcggaggaa tcaagaagtc cagcaagaag 600acgatctctt ttcttcaggc tttgatcttt
ttggggagcc atacaaggta gctgaatata 660caaacaaagg tgatgcactt gccaaccgag
tccagaacac gcttggaaac tatgatgaaa 720tgaagaattt gctaactaac cattctaatc
agaatcacct agtgggaatt ccaaagaatt 780ctgtgcccca gaatcccaac aacaaaaatg
aaccaagctt ttttccagaa caaaagaaca 840gaataattcc acctcaccag gataataccc
atccttcagc accaatgcct ccaccttctg 900ttgtgatact gaattcaact ctaatacaca
gcaacagaaa atcaaaacct gagtggtcac 960gtgatagtca taaccctagc actgtactgg
caagccaggc cagtggtcag ccaaacaaga 1020tgcagacttt gacacaggac cagtctcaag
ccaaactgga agacttcttt gtctacccag 1080ctgaacagcc ccagattgga gaagttgaag
agtcaaaccc atctgcaaag gaagacagta 1140accctaattc tagtggagaa gatgctttca
aagaaatctt tcaatccaat tcaccggaag 1200aatctgaatt cgccgtgcaa gcgcctgggt
ctcccctagt ggcttcctct ttattagctc 1260ctagcagtgg cctttcagtt caaaacttcc
caccagggct ttactgcaaa acaagcatgg 1320ggcagcaaaa gccaactgca tacgtcagac
ccatggatgg ccaggaccag gcaccggaca 1380tctcaccaac actgaaacct tcaattgaat
ttgagaacag ctttgggaat ctgtcatttg 1440gaacactctt ggatggaaaa cccagtgcag
ccagttcaaa gactaaactg ccaaagttca 1500ccatcctcca aacaagtgaa gtaagccttc
ccagtgatcc aagctgtgtt gaagaaatct 1560tgcgggagat gacccattcc tggcctactc
ctctcacttc catgcatact gctggacact 1620ctgagcagag caccttttcc atcccaggac
aggaatcgca gcatctgacc ccaggattca 1680ccttacaaaa gtggaatgac ccaaccacca
gagcttctac aaagtcagtg tctttcaaat 1740cgatgcttga ggatgacctg aagctgagca
gtgatgaaga tgaccttgag cctgtgaaga 1800ccttgaccac tcagtgcact gccactgagc
tctaccaggc tgttgaaaag gcaaaaccta 1860ggaataatcc tgtgaaccca cccttggcca
ctccccagcc cccacctgca gtgcaagcca 1920gcgggggttc tggcagctcc agcgaatcgg
agagcagctc tgagtcggat tcagacactg 1980aaagtagcac cactgacagc gaatctaatg
aggcacctcg tgtggcaact ccagagcctg 2040agccaccctc aaccaacaag tggcaactgg
ataaatggct taacaaagtg acatcccaga 2100acaagtcttt tatttgtggc ccaaatgaaa
cacccatgga gactatttct ctgcctcctc 2160caatcatcca accaatggaa gtccagatga
aagtgaagac gaatgccagt caggtcccag 2220ctgaacccaa agaaaggcct ctcctcagtc
tcattaggga gaaagcccgt ccacggccca 2280ctcagaaaat tccagaaaca aaggctttga
agcataagtt gtcaacaact agtgagacag 2340tgtctcaaag gacaattggg aaaaaacagc
ccaaaaaagt tgagaagaac accagcactg 2400acgagtttac ctggcccaaa ccaaatatta
ccagcagcac tcccaaagaa aaagaaagtg 2460tggagcttca tgacccacca agaggccgca
acaaagccac tgcccacaaa ccagccccta 2520ggaaagaacc aagacctaac atccctttgg
ctcccgagaa gaagaagtac agagggcctg 2580gcaagattgt gccaaagtct cgggaattca
ttgaaacaga ttcatctaca tctgactcca 2640acacagatca ggaagagacc ctgcaaatca
aagtcctgcc tccgtgcatt atttctggag 2700gtaatactgc caaatccaag gaaatctgtg
gtgccagcct gaccctcagc accttaatga 2760gtagcagtgg cagcaacaac aacttatcca
tcagtaatga agagccaaca ttttcaccta 2820ttcctgtcat gcaaactgaa atcctgtccc
ctctgcgaga tcatgagaac ctgaaaaacc 2880tctgggtgaa gattgacctt gacttactct
ctagagtacc tggccacagc tcactccatg 2940cagcacctgc caagccagac cacaaggaga
ctgccacaaa acccaagcgt cagacagctg 3000tcacagctgt ggagaaacca gcccctaagg
gcaaacgtaa gcacaagcca atagaagttg 3060cagagaagat ccctgagaag aagcagcgcc
tggaggaggc cacaactatc tgcttgctcc 3120ctccttgcat ctcaccagcc ccaccccaca
agcctcccaa cactagagaa aataattcat 3180ccaggagagc aaatagaaga aaggaagaaa
aactatttcc tcctccactt tccccactgc 3240cagaggaccc tccacgccgc agaaatgtca
gtggcaataa tggtcccttt ggtcaagaca 3300aaaacatcgc catgactgga caaatcacat
ctaccaaacc taagagaact gaaggcaaat 3360tctgtgctac tttcaaaggg atatcggtaa
atgagggaga cactccaaaa aaggcatcct 3420ctgccaccat cactgtcacc aatactgcta
ttgccactgc tactgtcact gctactgcca 3480ttgtcaccac cactgtcaca gctactgcca
ccgccacggc caccaccaca actactacca 3540ctaccatttc caccatcacc tctaccatca
ctactggcct catggatagc agtcacctgg 3600agatgacgtc ctgggcggct ctgccccttc
tatccagcag cagcactaat gtccggagac 3660ccaagctcac ttttgatgac tcggttcaca
atgctgatta ttacatgcaa gaagctaaga 3720agctgaagca caaagctgat gcactgttcg
agaaatttgg caaagctgtg aattatgctg 3780atgccgccct ctccttcact gaatgtggca
atgccatgga acgcgaccct ctggaagcaa 3840agtccccata caccatgtac tctgagactg
tggagctcct caggtatgca atgaggctga 3900agaactttgc aagtcccttg gcttcggatg
gggacaaaaa gctagcagta ctatgctacc 3960gatgtttatc actcctctat ttgagaatgt
ttaagctgaa gaaggaccat gctatgaagt 4020actccagatc actgatggaa tattttaagc
aaaatgcttc aaaagtcgca cagataccct 4080ctccatgggt aagcaatgga aagaacactc
catccccagt gtctctcaac aacgtctccc 4140ccatcaacgc aatggggaac tgtaacaatg
gcccagtcac cattccccag cgcattcacc 4200acatggctgc cagccacgtc aacatcacta
gcaatgtgtt acggggctat gaacactggg 4260atatggccga caaactgaca agagaaaaca
aagaattctt tggtgatctg gacacgctga 4320tggggcctct gacccagcac agcagcatga
ccaatcttgt ccgctacgtt cgccaaggac 4380tgtgttggct gcgcatcgat gcccacttgt
tgtagtgggt gttctcagat ctctagcatc 4440acgacccatc actctacctc taccagcgca
ctgatggtca ctggtggaac tccactcact 4500ggggaacgtt ctctttggtt atgtttgttt
ttatgcttct tttgttatct gtaaaaaaca 4560gaagtcattg taagttgaca ctacaactta
agggcagtgt acgttttatt acttagtcat 4620tttttttctt ttagcatttg atatgcattt
ctcagattcc accatctttt tgtgctttat 4680ggaatgacag tccctacaat attgttttaa
gcccacacta cccaaaacaa agaatgggaa 4740gcacttgtga taaagacagg ctcctgagaa
atgcaacaag tggtcttaca tatacatgag 4800aacttagaca caagggacca tcccccaaac
tctactctta tacccagaaa agaacatatt 4860tcagaatctg tcaaactttt gtgtatccca
cagattcaat cttcaggtga gaattttcat 4920tgtcaaaacc cactggttag atgttgtagc
aacatcataa aatcaagagt atcaagaaaa 4980taaatgagca tagcaatgct actcttaaaa
agatgctatg ccacacaacc agaggacttt 5040cttgttagca tccctttcct gattccctat
tttgttaatt ttaatgataa gaagaaaggg 5100tgacatttat tttgacaagt tttaggcatc
agctggcatc agtgtttttc aactccatta 5160tttgaagtgt aaatcctcac ctggggttct
ctgtgtgcaa agctgtcctt ttgaagaaca 5220gtttggttga tgcatgcctt agtagccaaa
atgctacact ctagacttac aagtgggagt 5280taagagaggt ctggaaagtg tccaacaagg
aattcacacc tctgcctcct ttgcaacaac 5340aacatttaca cagttggtaa gtgggtccat
aactggcagg atttttaaat tgtattttgc 5400tcaaatctat gggaacaaaa gtcaaggtat
cactacctag aagtaatgat atacagtttt 5460cttcctagtg gcttgaaaat ctggacttcc
tcaattatta ttcacatttt ctctcttata 5520ggttttctgt tttctacttt cttttttctc
ttatctgtgt ttccctttcc tttgtttggc 5580tcattaactt ttgactgaat tacaattact
ccttttatta aagtccatat tattgtgaat 5640catttccatg aaaatttcta agaaaactcc
aaactctcta aatagtagct aacttttatt 5700tttttaaaat gagtcgtggg gtagtgcttc
accttgagat gctttgaaag agccctaaac 5760attgggaacc attcacctaa tttggagaca
tttctcactg gttgtgacta cccccttatg 5820atccttcaca ttcattttat gtccctaaac
atcacaatgt aaatatcatt tttgatgttc 5880cagctcacca gaagattctt acacttgggg
taaacactat ccatgcatta cttactggta 5940attacctgct ggtatataat tccatgtagc
ctttaatatg ctgggttatc aaattctgtt 6000cactgagtta tgaccagata aataatagat
atgcacatga aagatgcaaa cttgtgtgat 6060tattaaagcc agccatgcag gtccatgata
gaaacagcag gtgatgactc tgcactctca 6120ttgtcaaggt tagctatatc cccagttgca
aaacagccag acttgagctg tgctctggtc 6180atctttgagt ttaaggcctt ttgttgtata
aggctgtgga agttgtactc caatggctga 6240agccatgttg ttaatatggc tgatgggagc
atccctgcag ctgaacccag cactttttat 6300gctcccactg tggttgagct ttatgtttac
agtctcagca acaacactta tgcatccaaa 6360cactcacaaa tgaaacctga aagaatcttt
tctgagcctc ttaaaagagg aaaatgatga 6420taacattaaa gactctgaac acccaaggtt
ggtgtcacat ataaaaatta agctgatgac 6480tttgcagtga ctcaagttgt ctctttatca
tggtttacca ggtagagtgc ctggctatta 6540ctatataatg aagcccactg gcttgacttg
taagttcaac ctaaaccaca atcctagacc 6600atcatggatt taggagtaga ttcttcttga
aatcccacat ccagaaacta gacattagaa 6660tgttgaggca gtttcccaga gaaacaagca
tattgcctca tggatgaaag acttgtagtt 6720ctagtttcag tgacttgtta tatctactta
catacaacag ggaggcaaga ggattctctg 6780tcatctctgg tgactgagtg taaaatatgt
gccaagtctg cagcacagtg accaaatctg 6840acaatcgagc tctggatcac cacttgatta
tgtagtagac tcatttataa agcagcttag 6900gaactaatta aacatggagg atgaattacc
ttcctatccc ttgagataag acatctttca 6960gtttcatgat taaggattgt tgctgtttta
tagttactct gttcatcaca gtgtaaatgg 7020tgatgcgtgt cgtaggtgtg cagctatttg
agggactaag ggatggagat attctgtcaa 7080atgaatctct tcagtatacc agtttgtggg
agggatatga gacatgtgga tggcagtgag 7140agatcgtgcc tctagatctt gatggaggct
tggtgagaca cacttaaata agcacgtgga 7200ggttagaata gagggcagag taaaaggaag
ctccatctga gcaagtacac caaatgatct 7260cagccctgca acttgaccca ggtagggcca
ccactacgcc ttcacttgtc acccaagctc 7320caaccacaga gagtttgaca agtttgtgtt
atgatgttgg cttggctttg tatttttaat 7380taactttgga tttttagtgg ttttgtcata
taactgtctg agtttggtag gtaggattac 7440tttgaaaagg gtttactagt gtggtcctcc
gggtagaatt tagctgtaac atgttgttag 7500ccagcctgta gactgttaat tacttaataa
tctcattggg aaaatactag tagttttata 7560tttggatgac ataattggaa aaagcagatt
agctgctact acttttaaaa gacttaaggt 7620cgggatgcct ttttttccat gtaaggaaat
gaaaagaccc aaaatcttca ggcaaaaagc 7680aagttgcaaa attagaaacc attggctaaa
aatgtgtttt gttgagtttc caaatggatg 7740aattttcatt tggacattac atcactaaat
tcattagatt ttgtctgcat tggaaagata 7800ctcttctagc atatctttcc caaagatatc
taatttggat tctgtttcat gcaaatttgc 7860atcccggagg ttgaagttgg agtttgaggt
tggaaaatat ctttgaaggc agaatcagtt 7920gagttgtgag ggtgaagcct cacatacttc
tcaacagaca tgataaaatt cacctgcatg 7980agttggcagg tgggagaacc aaactggatc
actgggtaag actactcagt aaagcaatga 8040actgcttgct tagagaagca tcactatccc
cattgagaaa aatgtgtggc aagatgatac 8100agctacacag tatcaaatga atgggtcaat
tcagcacccc caaatttaat tctgtgggga 8160aaaattattg agccagttgt cagtgttctg
ttacatgact ggcagactaa attcttcatc 8220gttgttgtta ttgttgttgt tgtttctcat
tttcactcgc acggccttat tctcataatt 8280aaaatctaat tcattttctc tttagtgtta
gtagactcca acaacagaag tggcatctgt 8340gtattcataa tcagcattta ccctggcagg
agactaatca gataggccgg tctcagacat 8400taatcctacc atctgatatt tttggtgaag
gaaaaagtat taattctctt tccatcctcc 8460tcctcagaaa tatagaagcc ctctttacca
aaatcatcac attttactct gtaatctacc 8520agctaaaaga aaattgcatt gaagccccac
aaagccagat tgcagttctt gccccttttt 8580gcgtctgaca tgagatgtta aagaattatt
cattgtgctc acattgggtt aggggacact 8640gaactgcttt ttagatccat gatcagtcat
cattcttcta agagattgga gctttgctgt 8700ttcattaact gtgcagtgta gactaatggt
gtttaataaa aatcattcaa aatttcaaac 8760tcttttgcca gtgacctcaa ttttgttggc
tctgtgattt gtatcagact ttgaggaggg 8820aagggggaag tgaaggaagc ctacgtccag
gcccctgaca ggatgctgca gtagcaagct 8880caagctcgcc tgcctgccag cagttgctgg
tgagcagcag catgcagacc agctgtggga 8940agcctcctga agaatgcccc agctgatgct
ttcagctggg aatagtttgt tcctattggg 9000gaactcattg ttctccagtc tctgcagcag
gaagccagct gtcatattcg gagggaattt 9060cagatgcttt acctttttgg ttttgtcctg
catcactcat gtggctacga aagtgtctct 9120gagaatagag cccaatgtgg tgacaatggg
tagtcaaatg caccccagat gctcaagccc 9180tgttgtggtt ctgcagtgtt tatgaaattg
ggaggaagga gaccctggac agtaagcaaa 9240attggagaca ctccaacgag gctaagttaa
tgccgtgttg cccagaacaa gatctagctt 9300ctcatttggt cagcctagca tgcaaccagt
ggtgtgctgg taaaatgttt aacaaccagc 9360tcgctgagaa tagaaagcac ctggtttgca
ccatttgcca atttccatgg cataaatact 9420accactttag atgattttaa gctaccaact
gtgatgtcac tgaacacatg gttggaaaga 9480gatgcacgca gttggctctt gcaagcctgg
gcaaaaatgc ttcaacacgc cactggatgc 9540agccagtcag agggttcata tttaatatat
gtgttcatgt ggacacacac agacacacac 9600acacaaactc acccttacac acacacttcg
atgactaaaa caattacata gttttaagat 9660atgaatcaat gtgtgaatgt agaaagctta
tgataaggcc ctagaggtat gggttgccct 9720ggaagcctag gttttaagca ggagaatagc
tgagaagaat gaagccctcc tgagctgaaa 9780ggagagatgg atcaatggag atggttccat
catctccttc catatctcac aggtaaaatg 9840ggcactcaga aaaccctcac gattgatttt
ttaaaaagat aagtgagtgt tttttatttt 9900attattattg tcatcattat tttgatttac
aaatgctatt tgtaactttt acatgtaact 9960aggataaagt atttacggga actctatgga
gaatagcaca atccagaatt tactgtgttt 10020ttcttttatg tgacgtggaa actcagtaat
tctcccacct tcacattgtt gttcataaga 10080attttacttt agttattagg gaatctaagt
tttttgttaa catttgtttt tagttaaaag 10140tatctactta ctgttttagc tctgaactca
aaccagaata tctctgtatc aattgcatga 10200ctattcagaa acaataatcc aaaccaaaat
aattcttttt ccacccagta cgaagaaaac 10260taagctcagt aacaagaagg cataaactaa
agtatataat gaggctttca ttaaatacac 10320acacacacac actcacacac acacacatac
actttttaaa tttttaaatt aggcctccac 10380acataaatca ttttgaaagt agaatagaaa
atctcaaaga attcattctc ctggtcctgt 10440gcatcttctg cagttaataa gaggtttgta
tctggaaaga tggaagaact tgttctaaaa 10500tcttattttt caaaaaaaaa tttccatttt
ctctctgggc ctgtatccat ggttgaatgt 10560tagccctgga ggagatccat gtcttactcg
ctctttctgg cccttctgtc ttttgcctct 10620gcaattcttt ttgtagctgg cacgatagca
gggactgggg gtctatcctt tcatggtatt 10680gctacaatat ttgtccttac tggaaaatgg
taacatccgg gtctgattta attggcatta 10740cacttacaca gggactctga gcacccccgt
caccacacca gacagtggac cagttttcac 10800agctacaaag agctagaaat gtgtttaaca
tcatccagtg catcccctaa ttcaaaacca 10860tcctcactaa tcaatcatat tcacccataa
atattacaaa tgagattgat tccatctcaa 10920gacaatttgt caaatactta attttcttcc
tggatgattc tacttactgg atattttaga 10980aagagaaatg tctgagataa aatccctcac
atttactcaa tataacaaat tactgtttct 11040actcctattc tgagtagtgc ttctgaagat
tgtttgctgt agtgttgtct ttgataaaat 11100gaatgtcagt agtgagcctt ttagagatac
catgctcaga catcctcttt gggatcagaa 11160gatacctaaa attctcccct tttgcccact
tggttagatg agtgatatat tctttggatc 11220ctgcaaagaa gagattggtt tcttttcttt
tctggtggtg gtagtggttg tatctgtggc 11280tgtgatggtt gttgttactt gtctctctct
ctctctggct ctggcttttg ctttcctgct 11340agtgttcttt ctctttccaa acaaatagtt
aaattaaacg tgagcttctg aattgtactt 11400gttcatactt tcaaaacata acagattaat
aaaaatagat gtgtcctgat ttaaaacatg 11460ccccctggaa aggcatgctg tattatgaaa
tcgtgataat ataactgcat tattacatgg 11520cagtataaat attagtctgt tgaattcatt
tgtccaattg tataactttg tggagcagtg 11580ttttgacctt tgatacataa ttctggagca
agtggagtgg ttgcaggcag atgagacagt 11640gttatatcag gatttttcaa tcaactttag
ttggaggcct ggcaattaca aacatcttca 11700gatgtttctg taaccattat aaatatgaaa
aaaacctctt caaaaaattt cccatagtac 11760ttcagtcaag actttttagg tttatctttt
ttttttcatt tctccttttc cttttccatt 11820atttttcgat gggggggttg ttatcattga
ctgaagaaat attttgattg caatggtctc 11880tctctctctc cccctctctc tctctctcct
ctattctttc ctccttccct ctgtccatca 11940cccctcatta aaatattgaa atctggagtc
tttgataaat ctgcattaga ccaggctata 12000tgctaggaat gaaatctggg caaatatcga
tgggttttca aagaatgctc catgttcatt 12060gggccctttc acaccccaca gtgataaatg
aaaaggatag aggtagtttt ttcaaaagag 12120cactttaata atatcctctg agacctaatg
cagtttaaca aatgactcca cctatttttc 12180cagtaggtaa attgactgag acttgcaaaa
tacccctgag agttgtcagg ggtgtcttct 12240gcctggtcta tagcgtgtgt gtttgctttg
tatctaacag gcacattcac gtctcgtgta 12300ctcatatgaa gtatttccta acattcccat
tagcctgtat ataagaatca gaaagataat 12360cccaacatgt tgtaaatgaa gatgtgactc
tataaccttt ctcttcttcc tggaaaaaaa 12420aggacatttt catgcatatt ttaaacagaa
attttgtata tttaagtgtc atagaaaata 12480tttattgagt aactgggaca caaatgggaa
tttaattgtc atcatatgct ttgtgtgtgg 12540ggatgcttac caacaccatg tcgctggacc
attgtggcaa gccataactg cacaaagagt 12600acacatcgtc agtgtgtgtg tgtgtgtgtg
tgtgcgcgca cgcacgtgcg tgtgtgtgtc 12660cctgcatgtg caacatgtct agcttgctgt
ccttcatggg attttagctt tcccttcttg 12720aaaaacatta ttttacagtt ccaggaggcc
ctggttacat tactatatga aggcagtgat 12780ttgaaatgaa aattcctttc ctcttggaag
ctttggtcat aatatcatgg ttcaattaaa 12840cggattccac cggactttgt gatgaaaaag
gctctgttaa aatccaattg agtttccaag 12900aggaaattgt agtaggtcaa gatgcatgag
agggaagatg gaggccacct cagctggaga 12960acatgagctg agttgagccc tcagtgttga
agttgacttg ctccaagctg cagtctaaaa 13020ccctggggcc cgtgcctggc ctatgctccc
tcccaagtaa gtagaggagc agaaccatca 13080ggaacagcct gcctggctcc tatgaagaaa
acttcctgac gtcctgtccc caaaggaaga 13140ccctttcccc aagggcaccc caggtggcca
ttaaattgtg atgatcattc agaaagtgcc 13200cccttggctt tatgagaatc caattagtct
tctgaaccac cttttcttgg gtgcagattt 13260ccaacattca tgctcattgc agatccacca
actgtcactg ttcttaacaa gcatgctcgt 13320cttgtcagaa tttcagtaag ttccaatttc
ctgtacagac cagggtaaac tgttctaaaa 13380tcaatcaatt aatgaaatgt tatctggttt
ttaaaagctg gtttcatgtg ctttatgtgt 13440ataaaactat atctgcctgt gtggctttgc
atttcaaatg tgtggcgcac aagcgttttg 13500ttggtgcttt gttctcagta cagtaactct
gtgtacaaac attttaatgt ggttttgttg 13560ttttccaaca agatgtctct gtaaaaatga
tattggctga gctggtgcgt tggtttctct 13620catagaggca ttaactatac tgccaatgca
ttgaattatt taaaaatgca aaataaaatt 13680tttatgaaaa tctca
13695