Patent application title: BIVM (BASIC, IMMUNOGLOBULIN-LIKE VARIABLE MOTIF-CONTAINING) GENE, TRANSCRIPTIONAL PRODUCTS, AND USES THEREOF
Inventors:
Gary W. Litman (Gulfport, FL, US)
Noel A. Hawke (Durham, NC, US)
Jeffrey A. Yoder (St. Petersburg, FL, US)
Donna D. Eason (Bradenton, FL, US)
Assignees:
University of South Florida
IPC8 Class: AA61K39395FI
USPC Class:
4241391
Class name: Drug, bio-affecting and body treating compositions immunoglobulin, antiserum, antibody, or antibody fragment, except conjugate or complex of the same with nonimmunoglobulin material binds antigen or epitope whose amino acid sequence is disclosed in whole or in part (e.g., binds specifically-identified amino acid sequence, etc.)
Publication date: 2009-09-17
Patent application number: 20090232824
Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
Patent application title: BIVM (BASIC, IMMUNOGLOBULIN-LIKE VARIABLE MOTIF-CONTAINING) GENE, TRANSCRIPTIONAL PRODUCTS, AND USES THEREOF
Inventors:
GARY W. LITMAN
Noel A. Hawke
Jeffrey A. Yoder
Donna D. Eason
Agents:
SALIWANCHIK LLOYD & SALIWANCHIK;A PROFESSIONAL ASSOCIATION
Assignees:
University of South Florida
Origin: GAINESVILLE, FL US
IPC8 Class: AA61K39395FI
USPC Class:
4241391
Abstract:
The subject invention provides polynucleotide sequences, designated BIVM,
and transcriptional/translational products obtained from the
polynucleotide sequences of the invention. The subject invention also
provides polynucleotide and polypeptide sequences provided by SEQ ID NOs:
1-28. Also provided are methods of detecting the presence of BIVM nucleic
acids or polypeptides in samples suspected of containing BIVM genes, BIVM
transcriptional products, or BIVM translational products. These methods
are also useful for the detection of BIVM orthologs. Other embodiments
provide polypeptide and/or nucleic acid vaccines for the induction of an
immune response to in an individual. Kits for detecting the presence of
BIVM genes, orthologs thereof, BIVM polypeptides, or BIVM transcriptional
products are also provided.Claims:
1. A method of inducing an immune response in an individual comprising
administering a composition comprising a carrier and a polypeptide
comprising SEQ ID NO: 2 or a biologically active fragment of SEQ ID NO: 2
to an individual in an amount sufficient to induce an immune response.
2. The method according to claim 1, wherein said biologically active fragment comprises at least five consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
3. The method according to claim 1, wherein said biologically active fragment comprises at least ten consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
4. The method according to claim 1, wherein said biologically active fragment comprises at least fifteen consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
5. The method according to claim 1, wherein said composition comprises SEQ ID NO: 2 and a carrier.
6. The method according to claim 1, wherein said biologically active fragment comprises at least twenty consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
7. The method according to claim 1, wherein said biologically active fragment comprises at least twenty-five consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
8. The method according to claim 1, wherein said biologically active fragment comprises at least thirty consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
9. An isolated polypeptide comprising SEQ ID NO: 2 or a biologically active fragment thereof, said biologically active fragment comprising at least five consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
10. The isolated polypeptide according to claim 9, wherein said biologically active fragment comprises at least ten consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
11. The isolated polypeptide according to claim 9, wherein said biologically active fragment comprises at least fifteen consecutive amino acids of SEQ ID NO: 2 fused to a heterologous protein sequence.
12. A composition comprising a carrier and a polypeptide according to claim 9.
13. An isolated antibody that specifically binds to SEQ ID NO: 2.
14. A composition comprising a carrier and an antibody according to claim 13.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001]The application is a continuation of U.S. application Ser. No. 11/415,536, filed May 2, 2006, now allowed, which is a divisional of U.S. application Ser. No. 10/417,476, filed Apr. 16, 2003, now U.S. Pat. No. 7,038,030, which claims priority to U.S. Provisional Application Ser. No. 60/373,146, filed Apr. 16, 2002, the disclosures of which are hereby incorporated by reference in their entireties, including all figures, nucleic acid sequences, amino acid sequences, and tables.
[0003]The Sequence Listing for this application is labeled "Seq-List.txt, was created on Apr. 14, 2009, and is 251 KB. The entire contents of the sequence listing is incorporated herein by reference in its entirety.
BACKGROUND OF INVENTION
[0004]Considerable uncertainty remains with regards to the total number of human genes. Initial interpretations of genomic sequences resulted in estimates that placed the numbers of genes in man in the range of 30,000 to 40,000 (Lander, E. S., et al. [2001] "Initial Sequencing and Analysis of the Human Genome," Nature, 409:860-921; Ventner, J. C., et al. [2001] "The Sequence of the Human Genome," Science, 291:1304-51). Subsequent re-examination of the sequence data suggests the number of genes in the human genome is likely to be between 65,000 and 75,000 (Wright, F. A., et al. [2001] "A Draft Annotation and Overview of the Human Genome," Genome Biology 2:1.1-1.39). Predictions of 35,000 to 120,000 genes have been projected on the basis of EST sequences (Ewing, B., et al. [2000] "Analysis of Expressed Sequence Tags Indicates 35,000 Human Genes," Nature Genet. 25:232-234; Liang, F., et al. [2000], "Gene Index Analysis of the Human Genome Estimates Approximately 120,000 Genes," Nature Genet. 25:239-240). New genes continue to be recognized through inspection of genomic sequences as well as through a variety of different biochemical, immunological and other directed approaches.
[0005]The immunoglobulin superfamily (IgSF) represents a particularly large and extensively diversified family of genes (Barclay, A. N., et al. [1997] The Leucocyte Antigen FactsBook, Academic Press, San Diego). Each IgSF member encodes at least one Ig that consists of ˜100 amino acid residues that are arranged in two β sheets, which are comprised of anti-parallel β strands that are linked by an intrachain disulfide. Although the majority of genes in the IgSF function in the immune response, other IgSF genes are involved with cell-adhesion or growth factor recognition. IgSF domains are the most abundant domain type found in leukocyte membrane proteins.
[0006]In the course of an electronic EST database search for novel human genes encoding Ig domains, we identified an anonymous EST (IMAGE 785450; GenBank AA449273) (Hawke, N. A., et al. [1999] "Expanding Our Understanding of Immunoglobulin, T-cell Antigen Receptor, and Novel Immune-Type Receptor Genes: a Subset of the Immunoglobulin Gene Superfamily," Immunogenetics 50:124-133) and cloned the corresponding full-length cDNA. The predicted structure of the protein encoded by this gene, which is termed BIVM (basic, immunoglobulin-like variable motif-containing), includes short peptide motifs characteristic of an Ig variable (V) region, one of the subtypes of Ig domains. However, it lacks significant sequence identity to any group of proteins heretofore described.
[0007]We have determined the sequence of BIVM cDNA in species representative of critical points in phylogeny, examined the intracellular distribution of a recombinant form of BIVM, characterized its expression patterns in various tissues at different times in development, and defined other features of the gene that further emphasize its unique character. In addition, we have identified a BIVM-like gene in the protozoan parasite, Giardia lamblia.
BRIEF SUMMARY
[0008]The subject invention provides polynucleotide sequences, designated BIVM, and transcriptional/translational products obtained from the polynucleotide sequences of the invention (SEQ ID Nos:1-28). The subject invention also provides methods of detecting the presence of BIVM nucleic acids, transcriptional products, or polypeptides in samples suspected of containing BIVM genes. These methods are also useful for the detection of BIVM orthologs. Other embodiments provide polypeptide and/or nucleic acid vaccines for the induction of an immune response. Kits for detecting the presence of BIVM genes, orthologs thereof, BIVM polypeptides, or BIVM transcriptional products obtained from the polynucleotide sequences are also provided.
BRIEF DESCRIPTION OF THE TABLES AND DRAWINGS
[0009]The file of this patent contains at least one drawing executed in color. Copies of this patent with color drawings will be provided by the Patent and Trademark Office upon request and payment of the necessary fee.
[0010]Table I. Exon-intron organization of human BIVM. Three alternative splice donors in the 5' untranslated region are designated A1, A2, and A3. Nucleotide positions are relative to FIG. 1, intron length and splice donor/acceptor sequences are shown. Coding sequence is in upper case.
[0011]FIGS. 1A-1C. Human BIVM. The nucleotide sequence (SEQ ID NO: 1) and predicted amino acid translation product (SEQ ID NO: 2) of a human BIVM transcript. Translational start and stop codons are in reverse text. RNA splice junctions are underlined (see Table I). Nucleotides at 5' ends, defined by analyses of RACE products, are boxed. Nucleotide numbering is on the right; amino acid numbering is on the left. The M1(GX6C), M2 (WFRQ), M3a and M3b (YFC and YHC) motifs are shaded. The Alu sequence in the 3' untranslated region is in lower case.
[0012]FIG. 2. Predicted genomic organization of human BIVM. BIVM consists of nine coding exons (exons 1-9) and two 5' untranslated region exons (A and B). Alternative splice donor sites are present within exon A (see Table I); transcripts have been identified that include exon A, but not exon B. The CpG island is denoted by a solid bar, the Alu sequence is denoted by a hatched bar, and the location of the sequence-tagged site (STS) marker, WI-5740, is indicated (see also FIG. 1A).
[0013]FIGS. 3A-3B. BIVM is well conserved among deuterostomes. ClustalW alignment of the human BIVM peptide sequence (BIVM.Hs; (SEQ ID NO: 2)) with orthologous sequences from mouse (BIVM.Mm; (SEQ ID NO: 27)), chicken (BIVM.Gg; (SEQ ID NO: 8)), Xenopus (XBIVM; (SEQ ID NO: 5)), zebrafish (BIVM.Dr; (SEQ ID NO: 11)), and sea urchin (SpBIVM; (SEQ ID NO: 13)). The sea urchin sequence lacks a stop codon and therefore is predicted to encode a longer polypeptide (indicated by . . . ). The M1, M2, M3a and M3b motifs are indicated. The highly conserved domain within BIVM is indicated with arrowheads. Identical residues are shown in reverse text (black), similarities are shaded (gray). Gaps introduced to maintain/maximize alignment are indicated with (-).
[0014]FIG. 4. Syntenic relationship between the human BIVM region and the mouse genome. The relative locations of human BIVM and flanking genes on chromosome 13q32-33; known corresponding chromosomal map positions are indicated for mouse. Transcription direction is indicated with arrows. Approximate distances between genes (if known) are indicated.
[0015]FIGS. 5A-5F. Expression of BIVM. RNA blots of BIVM expression from (A) human tissues and cell lines, (B) mouse embryos and somatic tissues, (C) Xenopus embryos and kidney, and (D) sea urchin embryos. Approximately 2 μg of polyA+ RNA/track was analyzed in human and mouse; ˜10 μg of total RNA/track was analyzed in Xenopus and sea urchin. Actin is used as a loading control with human and mouse blots; 18S ribosomal RNA is used as a loading control with Xenopus and sea urchin blots. Real time PCR analysis of BIVM expression in (E), developing zebrafish embryos and adult tissues, and in (F) chicken bursa at various stages of embryonic development. The quantity of BIVM (designated on the left) is relative and normalized (see Methods). Note that the level of zebrafish BIVM expression in the 0 hpf embryo is approximately 10 times the level detected at 6 hpf. Time points in the analysis of bursa are days of embryonic life (e.g. E12) and chicken embryonic fibroblasts (CEFs) were included as a control. Days post coitus=dpc, stage=st., hour post fertilization=hpf, days post fertilization=dpf and intestine=intest.
[0016]FIGS. 6A-6J. BIVM localizes to the nucleus and the cytoplasm. (A) Western analysis of whole cell lysates from pIRES2-EGFP (EGFP), pBIVM-N2/EGFP (N2/EGFP) and pBIVM-K1/EGFP (K1/EGFP) transfected Cos-7 cells. Recombinant BIVM is detected with an anti-V5 antibody. EGFP is shown as a transfection and loading control. Note that only a single protein corresponding to the 5' ATG is generated from the endogenous transcript (pBIVM-N2); protein synthesis is increased by modification of the translational start site (pBIVMK1). Size standards are indicated. (B) Western analysis of nuclear and cytoplasmic fractions from pBIVM-K1/EGFP transfected Cos-7 cells. OCT-1 (Pombo, A., et al. [1998] "Regional and Temporal Specialization in the Nucleus: A Transcriptionally-Active Nuclear Domain Rich in PTF, Oct1 and PIKA Antigens Associated with Specific Chromosomes Early in the Cell Cycle," EMBO J. 1768) and HSP90 (Perdew, G. H., et al. [1991] "Evidence that the 90-kDa Heatshock Protein (HSP90) Exists in Cytosol in Heteromeric Complexes Containing HSP70 and Three Other Proteins with Mr 63,000, 56,000, and 50,000," J Biol Chem 6708) are nuclear and cytoplasmic markers, respectively. (C-J) Immunocytochemical localization of BIVM. Cos7 cells transiently transfected with pBIVM-K1 were analyzed by conventional fluorescent microscopy. Recombinant BIVM (green), actin (red), nuclei (blue), and overlayered images are shown. Note that levels of nuclear BIVM vary (compare C to G).
[0017]FIGS. 7A-7C. Giardia BIVM-like sequence. (A) The nucleotide sequence (SEQ ID NO: 14) and predicted amino acid translation product (SEQ ID NO: 15) of a Giardia lamblia BIVM-like (BIVML) transcript. Translational start and stop codons are in reverse text. Numbering is as in FIG. 1. Grey shading indicates conserved motifs. A sequence resembling predicted giardial initiator regions is boxed. A classic giardial polyadenylation signal sequence is underlined. (B) Alignment of the predicted BIVML protein (SEQ ID NO: 15) with the C-terminal region of human BIVM (SEQ ID NO: 2). Labeling is as in FIG. 3. (C) RNA blot (10 μg/track) probed for BIVML in vegetative-stage (veg) and 21 hr encysting Giardia. Calmodulin is shown as loading control.
[0018]FIG. 8. RT-PCR analysis of extracts from BIVM expressing and non-expressing human cell lines indicated that EP58/MGC5302 was expressed in all cell lines that express BIVM but not in a BIVM non-expressing cell line.
[0019]FIG. 9. Potential binding sites contained in the 41 bp region separating the BIVM and EP58/MGC5302 genes revealed sites for cell type specific factors such as the myeloid zinc finger-1 (MZF-1), the hematopoietic-expressed Ikaros-2 (IK2) factor, and the ubiquitously expressed transcription factors NF1, USF, NFκB, and NMYC.
[0020]FIG. 10. Detection of bands representing NFκB-specific binding constitutively present in nuclear extracts.
[0021]FIG. 11. Binding of the 41 bp intergenic region by NFκB complexes containing c-Rel and RelB factors, which are constitutively present in the nuclear extracts from the BIVM expressing K562 cell line.
[0022]FIG. 12. TNF-α activated NFκB increases the expression of BIVM in the BIVM-expressing HeLa cell line (DNS). A cell line devoid of basal BIVM expression, the Raji Burkitt's lymphoma line, is induced to express BIVM by TNF-a.
[0023]FIGS. 13A-13C. Flow cytometer analyses of cells stained with propidium iodide.
BRIEF DESCRIPTION OF THE SEQUENCES
[0024]SEQ ID NO: 1--human BIVM cDNA
[0025]SEQ ID NO: 2--human BIVM amino acid sequence
[0026]SEQ ID NO: 3--human BIVM gemonic sequence with upstream partial sequence of MGC5302 gene and downstream partial sequence of ERCC5 gene
[0027]SEQ ID NO: 4--Xenopus BIVM open reading frame
[0028]SEQ ID NO: 5--Xenopus BIVM amino acid sequence
[0029]SEQ ID NO: 6--Chicken BIVM open reading frame
[0030]SEQ ID NO: 7--Alternatively spliced chicken BIVM open reading frame
[0031]SEQ ID NO: 8--Chicken BIVM amino acid sequence
[0032]SEQ ID NO: 9--Alternatively splice chicken BIVM amino acid sequence
[0033]SEQ ID NO: 10--Zebrafish BIVM open reading frame
[0034]SEQ ID NO: 11--Zebrafish BIVM amino acid sequence
[0035]SEQ ID NO: 12--Sea urchin BIVM partial coding sequence
[0036]SEQ ID NO: 13--Sea urchin BIVM partial amino acid sequence
[0037]SEQ ID NO: 14--Giardia BIVM-like open reading frame
[0038]SEQ ID NO: 15--Giardia BIVM-like amino acid sequence
[0039]SEQ ID NO: 16--Lancelet BIVM partial coding sequence
[0040]SEQ ID NO: 17--Lancelet BIVM partial amino acid sequence
[0041]SEQ ID NO: 18--Mouse BIVM exon A nucleotide sequence
[0042]SEQ ID NO: 19--Mouse BIVM exon B nucleotide sequence
[0043]SEQ ID NO: 20--Mouse BIVM exon C nucleotide sequence
[0044]SEQ ID NO: 21--Mouse BIVM exon 1 nucleotide sequence
[0045]SEQ ID NO: 22--Alternative mouse BIVM 5' end clone (6359)
[0046]SEQ ID NO: 23--Alternative mouse BIVM 5' end clone (6358)
[0047]SEQ ID NO: 24--Alternative mouse BIVM 5' end clone (6356)
[0048]SEQ ID NO: 25--Alternative mouse BIVM 5' end clone (cDNA)
[0049]SEQ ID NO: 26--Mouse BIVM cDNA with clone 6359 5' end
[0050]SEQ ID NO: 27--Mouse BIVM amino acid sequence
[0051]SEQ ID NO: 28--Mouse BIVM genomic sequence with upstream partial sequence of KDEL gene
[0052]SEQ ID NO: 29--Human BIVM exon A1 splice donor sequence
[0053]SEQ ID NO: 30--Human BIVM exon A2 splice donor sequence
[0054]SEQ ID NO: 31--Human BIVM exon A3 splice donor sequence
[0055]SEQ ID NO: 32--Human BIVM exon B splice acceptor sequence
[0056]SEQ ID NO: 33--Human BIVM exon B splice donor sequence
[0057]SEQ ID NO: 34--Human BIVM exon 1 splice acceptor sequence
[0058]SEQ ID NO: 35--Human BIVM exon 1 splice donor sequence
[0059]SEQ ID NO: 36--Human BIVM exon 2 splice acceptor sequence
[0060]SEQ ID NO: 37--Human BIVM exon 2 splice donor sequence
[0061]SEQ ID NO: 38--Human BIVM exon 3 splice acceptor sequence
[0062]SEQ ID NO: 39--Human BIVM exon 3 splice donor sequence
[0063]SEQ ID NO: 40--Human BIVM exon 4 splice acceptor sequence
[0064]SEQ ID NO: 41--Human BIVM exon 4 splice donor sequence
[0065]SEQ ID NO: 42--Human BIVM exon 5 splice acceptor sequence
[0066]SEQ ID NO: 43--Human BIVM exon 5 splice donor sequence
[0067]SEQ ID NO: 44--Human BIVM exon 6 splice acceptor sequence
[0068]SEQ ID NO: 45--Human BIVM exon 6 splice donor sequence
[0069]SEQ ID NO: 46--Human BIVM exon 7 splice acceptor sequence
[0070]SEQ ID NO: 47--Human BIVM exon 7 splice donor sequence
[0071]SEQ ID NO: 48--Human BIVM exon 8 splice acceptor sequence
[0072]SEQ ID NO: 49--Human BIVM exon 8 splice donor sequence
[0073]SEQ ID NO: 50--Human BIVM exon 9 splice acceptor sequence
[0074]SEQ ID NO: 51--HSMAP5 primer
[0075]SEQ ID NO: 52--HSMAP6 primer
[0076]SEQ ID NO: 53--xfbivmMAPF1 primer
[0077]SEQ ID NO: 54--xfbivmMAPR1 primer
[0078]SEQ ID NO: 55--M1 amino acid motif
[0079]SEQ ID NO: 56--M2 amino acid motif
[0080]SEQ ID NO: 57--M3a amino acid motif
[0081]SEQ ID NO: 58--M3b amino acid motif
[0082]SEQ ID NO: 59--BIVM N-terminus region of homology
[0083]SEQ ID NO: 60--BIVM C-terminus region of homology
[0084]SEQ ID NO: 61--BIVM amino acid motif 1
[0085]SEQ ID NO: 62--BIVM amino acid motif 2
[0086]SEQ ID NO: 63--BIVM amino acid motif 3
[0087]SEQ ID NO: 64--BIVM amino acid motif 4
[0088]SEQ ID NO: 65--BIVM peptide domain (N-terminus of region of interspecific homology)
DETAILED DISCLOSURE OF THE INVENTION
[0089]The subject invention provides isolated and/or purified nucleotide sequences comprising: a) a polynucleotide sequence, or fragment thereof, or a polynucleotide encoding an amino acid sequence, or fragment of said amino acid sequence, of a sequence selected from the group consisting of SEQ ID NOs: 1-64 (or the complements of said polynucleotide sequences or fragments thereof); b) a polynucleotide sequence, or fragment thereof, comprising a sequence having at least about 20% to 99.99% identity to a polynucleotide selected from the group consisting of SEQ ID NOs: 1-28; c) a polynucleotide sequence encoding a polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs: 2, 5, 7, 8, 9, 11, 13, 15, 17, or 27; d) splice variants of SEQ ID NOs: 1-3 or 6-9; or e) a polynucleotide sequence encoding a polypeptide fragment of SEQ ID NOs: 2, 5, 7, 8, 9, 11, 13, 15, 17, or 27, wherein said fragment has substantially the same biological or serologic activity as the native (or intact) polypeptide.
[0090]Nucleotide, polynucleotide, or nucleic acid sequence(s) are understood to mean. according to the present invention, either a double-stranded DNA, a single-stranded DNA, or products of transcription of the said DNAs (e.g., RNA molecules). It should also be understood that the present invention does not relate to the genomic nucleotide sequences encoding BIVM in their natural/native environment or natural/native state. The nucleic acid, polynucleotide, or nucleotide sequences of the invention have been isolated, purified (or partially purified), by separation methods including, but not limited to, ion-exchange chromatography, molecular size exclusion chromatography, affinity chromatography, or by genetic engineering methods such as amplification, cloning, or subcloning.
[0091]Optionally, the polynucleotide sequences of the instant invention can also contain one or more polynucleotides encoding heterologous polypeptide sequences (e.g., tags that facilitate purification of the polypeptides of the invention (see, for example, U.S. Pat. No. 6,342,362, hereby incorporated by reference in its entirety; Altendorf et al. [1999-WWW, 2000] "Structure and Function of the Fo Complex of the ATP Synthase from Escherichia Coli," J. of Experimental Biology 203:19-28, The Co. of Biologists, Ltd., G. B.; Baneyx [1999] "Recombinant Protein Expression in Escherichia coli," Biotechnology 10:411-21, Elsevier Science Ltd.; Eihauer et al. [2001] "The FLAG® Peptide, a Versatile Fusion Tag for the Purification of Recombinant Proteins," J. Biochem Biophys Methods 49:455-65; Jones et al. [1995] J. Chromatography 707:3-22; Jones et al. [1995] "Current Trends in Molecular Recognition and Bioseparation," J. of Chromatography A. 707:3-22, Elsevier Science B. V.; Margolin [2000] "Green Fluorescent Protein as a Reporter for Macromolecular Localization in Bacterial Cells," Methods 20:62-72, Academic Press; Puig et al. [2001] "The Tandem Affinity Purification (TAP) Method: A General Procedure of Protein Complex Purification," Methods 24:218-29, Academic Press; Sassenfeld [1990] "Engineering Proteins for Purification," TibTech 8:88-93; Sheibani [1999] "Prokaryotic Gene Fusion Expression Systems and Their Use in Structural and Functional Studies of Proteins," Prep. Biochem. & Biotechnol. 29(1):77-90, Marcel Dekker, Inc.; Skerra et al. [1999] "Applications of a Peptide Ligand for Streptavidin: the Strep-tag", Biomolecular Engineering 16:79-86, Elsevier Science, B. V.; Smith [1998] "Cookbook for Eukaryotic Protein Expression: Yeast, Insect, and Plant Expression Systems," The Scientist 12(22):20; Smyth et al. [2000] "Eukaryotic Expression and Purification of Recombinant Extracellular Matrix Proteins Carrying the Strep II Tag", Methods in Molecular Biology, 139:49-57; Unger [1997] "Show Me the Money: Prokaryotic Expression Vectors and Purification Systems," The Scientist 11(17):20, each of which is hereby incorporated by reference in their entireties), or commercially available tags from vendors such as such as STRATAGENE (La Jolla, Calif.), NOVAGEN (Madison, Wis.), QIAGEN, Inc., (Valencia, Calif.), or InVitrogen (San Diego, Calif.).
TABLE-US-00001 TABLE I Splice variants of BIVM (SEQ ID NOs: 29-50) Seq ID Intron No. Exon Splice Donor Splice Acceptor Position (bp) 29 A1 CGGCCCCAGGgtaac -- 1-415 -- 30 A2 TGTGATCCAGgtccg -- 1-365 -- 31 A3 CAGGCCAGAGgtacc -- 1-473 -- 33/32 B TTTCTGTCAGgtgat ttccctaaagGAATC 474-557 5785 35/34 1 CACAAATCAGgtaag ttcctcttagGAGCT 558-1157 1754 37/36 2 TCAGACGATGgtgat tgtattctagGCAAT 1158-1284 8682 39/38 3 GAGCTGCAAAgtaag gtgttctcagGTACT 1285-1380 4481 41/40 4 CACTTATGAGgtatg tcttttgtagCCTTC 1381-1485 609 43/42 5 GGAGAAACTGgtagg ttactttcagGTGGT 1486-1580 216 45/44 6 AAGCATTCAGgtaag tttttaatagCTTCA 1581-1713 9405 49/48 7 AACAAAGAAGgtaag ttaactatagATGGG 1714-1800 2768 50 8 -- ttcttctcagGTTGG 1801-1897 4089 50 9 -- ttcttctcagGTTGG 1898-3029 832
[0092]Other aspects of the invention provide vectors containing one or more of the polynucleotides of the invention. The vectors can be vaccine, replication, or amplification vectors. In some embodiments of this aspect of the invention, the polynucleotides are operably associated with regulatory elements capable of causing the expression of the polynucleotide sequences. Such vectors include, among others, chromosomal, episomal and virus-derived vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations of the aforementioned vector sources, such as those derived from plasmid and bacteriophage genetic elements (e.g., cosmids and phagemids).
[0093]As indicated above, vectors of this invention can also comprise elements necessary to provide for the expression and/or the secretion of a polypeptide encoded by the nucleotide sequences of the invention in a given host cell. The vector can contain one or more elements selected from the group consisting of a promoter, signals for initiation of translation, signals for termination of translation, and appropriate regions for regulation of transcription. In certain embodiments, the vectors can be stably maintained in the host cell and can, optionally, contain signal sequences directing the secretion of translated protein. Other embodiments provide vectors that are not stable in transformed host cells. Vectors can integrate into the host genome or be autonomously-replicating vectors.
[0094]In a specific embodiment, a vector comprises a promoter operably linked to a protein or peptide-encoding nucleic acid sequence, one or more origins of replication, and, optionally, one or more selectable markers (e.g., an antibiotic resistance gene). Non-limiting exemplary vectors for the expression of the polypeptides of the invention include pBr-type vectors, pET-type plasmid vectors (Promega), pBAD plasmid vectors (Invitrogen) or those provided in the examples below. Furthermore, vectors according to the invention are useful for transforming host cells for the cloning or expression of the nucleotide sequences of the invention.
[0095]Promoters which may be used to control expression include, but are not limited to, the CMV promoter, the SV40 early promoter region (Bernoist and Chambon [1981] Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et al. [1980] Cell 22:787-797), the herpes thymidine kinase promoter (Wagner et al. Proc. Natl. Acad. Sci. USA 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster et al. [1982] Nature 296:39-42); prokaryotic vectors containing promoters such as the β-lactamase promoter (Villa-Kamaroff, et al. [1978] Proc. Natl. Acad. Sci. USA 75:3727-3731), or the tac promoter (DeBoer, et al. [1983] Proc. Natl. Acad. Sci. USA 80:21-25); see also, "Useful Proteins from Recombinant Bacteria" in Scientific American, 1980, 242:74-94; plant expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et al. [1983] Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner, et al. [1981] Nucl. Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella et al. [1984] Nature 310:115-120); promoter elements from yeast or fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, and/or the alkaline phosphatase promoter.
[0096]The subject invention also provides for "homologous" or "modified" nucleotide sequences. Modified nucleic acid sequences will be understood to mean any nucleotide sequence obtained by mutagenesis according to techniques well known to persons skilled in the art, and exhibiting modifications in relation to the normal sequences. For example, mutations in the regulatory and/or promoter sequences for the expression of a polypeptide that result in a modification of the level of expression of a polypeptide according to the invention provide for a "modified nucleotide sequence". Likewise, substitutions, deletions, or additions of nucleic acid to the polynucleotides of the invention provide for "homologous" or "modified" nucleotide sequences. In various embodiments, "homologous" or "modified" nucleic acid sequences have substantially the same biological or serological activity as the native (naturally occurring) BIVM polypeptides. A "homologous" or "modified" nucleotide sequence will also be understood to mean a splice variant of the polynucleotides of the instant invention (see Table I) or any nucleotide sequence encoding a "modified polypeptide" as defined below.
[0097]A homologous nucleotide sequence, for the purposes of the present invention, encompasses a nucleotide sequence having a percentage identity with the bases of the nucleotide sequences of between at least (or at least about) 20.00% to 99.99% or higher. The aforementioned range of percent identity is to be taken as including, and providing written description and support for, any fractional percentage, in intervals of 0.01%, between 20.00% and 99.99% or higher. These percentages are purely statistical and differences between two nucleic acid sequences can be distributed randomly and over the entire sequence length.
[0098]In various embodiments, homologous sequences exhibiting a percentage identity with the bases of the nucleotide sequences of the present invention can have 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent identity with the polynucleotide sequences of the instant invention.
[0099]Both protein and nucleic acid sequence homologies may be evaluated using any of the variety of sequence comparison algorithms and programs known in the art. Such algorithms and programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman [1988] Proc. Natl. Acad. Sci. USA 85(8):2444-2448; Altschul et al. [1990] J. Mol. Biol. 215(3):403-410; Thompson et al. [1994] Nucleic Acids Res. 22(2):4673-4680; Higgins et al. [1996] Methods Enzymol. 266:383-402; Altschul et al. [1990] J. Mol. Biol. 215(3):403-410; Altschul et al. [1993] Nature Genetics 3:266-272).
[0100]The subject invention also provides nucleotide sequences complementary to any of the polynucleotide sequences disclosed herein. Thus, the invention is understood to include any DNA whose nucleotides are complementary to those of the sequence of the invention, and whose orientation is reversed (e.g., an antisense sequence).
[0101]The present invention further provides fragments of the polynucleotide sequences provided herein. Representative fragments of the polynucleotide sequences according to the invention will be understood to mean any nucleotide fragment having at least 8 or 9 successive nucleotides, preferably at least 12 successive nucleotides, and still more preferably at least 15 or at least 20 successive nucleotides of the sequence from which it is derived. In other embodiments, fragments contain from one nucleotide less than the full length polynucleotide sequence to fragments comprising up to, and including 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, or 255 consecutive nucleotides of a particular sequence disclosed herein. Yet other embodiments provide fragments (or detection probes) comprising nucleotides 1446 to 1697 or 1447 to 1698 of FIG. 1 (SEQ ID NO:1). It is to be understood that such fragments refer only to portions of the disclosed polynucleotide sequences that are not listed in a publicly available database or prior art references.
[0102]Among these representative fragments, those capable of hybridizing under stringent conditions with a nucleotide sequence according to the invention are preferred. Conditions of high or intermediate stringency are provided infra and are chosen to allow for hybridization between two complementary DNA fragments. Hybridization conditions for a polynucleotide of about 300 bases in size will be adapted by persons skilled in the art for larger- or smaller-sized oligonucleotides, according to methods well known in the art (see, for example, Sambrook et al. [1989]).
[0103]The subject invention also provides detection probes (e.g., fragments of the disclosed polynucleotide sequences) for hybridization with a target sequence or an amplicon generated from the target sequence. Such a detection probe will advantageously have as sequence a sequence of at least 9, 12, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 nucleotides. Alternatively, detection probes can comprise 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, or 255 consecutive nucleotides of the disclosed nucleic acids. The detection probes can also be used as labeled probe or primer in the subject invention. Labeled probes or primers are labeled with a radioactive compound or with another type of label. Alternatively, non-labeled nucleotide sequences may be used directly as probes or primers; however, the sequences are generally labeled with a radioactive element (32P, 35S, 3H, 125I) or with a molecule such as biotin, acetylaminofluorene, digoxigenin, 5-bromo-deoxyuridine, or fluorescein to provide probes that can be used in numerous applications.
[0104]The nucleotide sequences according to the invention may also be used in analytical systems, such as DNA chips. DNA chips and their uses are well known in the art and (see for example, U.S. Pat. Nos. 5,561,071; 5,753,439; 6,214,545; Schena et al. [1996] BioEssays 18:427-431; Bianchi et al. [1997] Clin. Diagn. Virol. 8:199-208; each of which is hereby incorporated by reference in their entireties) and/or are provided by commercial vendors such as Affymetrix, Inc. (Santa Clara, Calif.).
[0105]Various degrees of stringency of hybridization can be employed. The more severe the conditions, the greater the complementarity that is required for duplex formation. Severity of conditions can be controlled by temperature, probe concentration, probe length, ionic strength, time, and the like. Preferably, hybridization is conducted under moderate to high stringency conditions by techniques well known in the art, as described, for example, in Keller, G. H., M. M. Manak [1987] DNA Probes, Stockton Press, New York, N.Y., pp. 169-170.
[0106]By way of example, hybridization of immobilized DNA on Southern blots with 32P-labeled gene-specific probes can be performed by standard methods (Maniatis et al. [1982] Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York). In general, hybridization and subsequent washes can be carried out under moderate to high stringency conditions that allow for detection of target sequences with homology to the exemplified polynucleotide sequence. For double-stranded DNA gene probes, hybridization can be carried out overnight at 20-25° C. below the melting temperature (Tm) of the DNA hybrid in 6×SSPE, 5× Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. The melting temperature is described by the following formula (Beltz et al. [1983] Methods of Enzymology, R. Wu, L. Grossman and K. Moldave [eds.] Academic Press, New York 100:266-285).
[0107]Tm=81.5° C.+16.6 Log[Na+]+0.41(% G+C)-0.61(% formamide)-600/length of duplex in base pairs.
[0108]Washes are typically carried out as follows:
(1) twice at room temperature for 15 minutes in 1×SSPE, 0.1% SDS (low stringency wash);(2) once at Tm-20° C. for 15 minutes in 0.2×SSPE, 0.1% SDS (moderate stringency wash).
[0109]For oligonucleotide probes, hybridization can be carried out overnight at 10-20° C. below the melting temperature (Tm) of the hybrid in 6×SSPE, 5× Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. Tm for oligonucleotide probes can be determined by the following formula:
[0110]Tm(° C.)=2(number T/A base pairs)+4(number G/C base pairs) (Suggs et al. [1981] ICN-UCLA Symp. Dev. Biol. Using Purified Genes, D. D. Brown [ed.], Academic Press, New York, 23:683-693).
[0111]Washes can be carried out as follows:
(1) twice at room temperature for 15 minutes 1×SSPE, 0.1% SDS (low stringency wash;2) once at the hybridization temperature for 15 minutes in 1×SSPE, 0.1% SDS (moderate stringency wash).
[0112]In general, salt and/or temperature can be altered to change stringency. With a labeled DNA fragment >70 or so bases in length, the following conditions can be used: [0113]Low: 1 or 2×SSPE, room temperature [0114]Low: 1 or 2×SSPE, 42° C. [0115]Moderate: 0.2× or 1×SSPE, 65° C. [0116]High: 0.1×SSPE, 65° C.
[0117]By way of another non-limiting example, procedures using conditions of high stringency can also be performed as follows: Pre-hybridization of filters containing DNA is carried out for 8 h to overnight at 65° C. in buffer composed of 6×SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 μg/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65° C., the preferred hybridization temperature, in pre-hybridization mixture containing 100 μg/ml denatured salmon sperm DNA and 5-20×106 cpm of 32P-labeled probe. Alternatively, the hybridization step can be performed at 65° C. in the presence of SSC buffer, 1×SSC corresponding to 0.15M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37° C. for 1 h in a solution containing 2×SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1×SSC at 50° C. for 45 min. Alternatively, filter washes can be performed in a solution containing 2×SSC and 0.1% SDS, or 0.5×SSC and 0.1% SDS, or 0.1×SSC and 0.1% SDS at 68° C. for 15 minute intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high stringency which may be used are well known in the art (see, for example, Sambrook et al. [1989] Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al. [1989] Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, New York, each incorporated herein in its entirety).
[0118]A further non-limiting example of procedures using conditions of intermediate stringency are as follows: Filters containing DNA are pre-hybridized, and then hybridized at a temperature of 60° C. in the presence of a 5×SSC buffer and labeled probe. Subsequently, filters washes are performed in a solution containing 2×SSC at 50° C. and the hybridized probes are detectable by autoradiography. Other conditions of intermediate stringency which may be used are well known in the art (see, for example, Sambrook et al. [1989] Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al. [1989] Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, New York, each of which is incorporated herein in its entirety).
[0119]Duplex formation and stability depend on substantial complementarity between the two strands of a hybrid and, as noted above, a certain degree of mismatch can be tolerated. Therefore, the probe sequences of the subject invention include mutations (both single and multiple), deletions, insertions of the described sequences, and combinations thereof, wherein said mutations, insertions and deletions permit formation of stable hybrids with the target polynucleotide of interest. Mutations, insertions and deletions can be produced in a given polynucleotide sequence in many ways, and these methods are known to an ordinarily skilled artisan. Other methods may become known in the future.
[0120]It is also well known in the art that restriction enzymes can be used to obtain functional fragments of the subject DNA sequences. For example, Bal31 exonuclease can be conveniently used for time-controlled limited digestion of DNA (commonly referred to as "erase-a-base" procedures). See, for example, Maniatis et al. [1982] Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York; Wei et al. [1983] J. Biol. Chem. 258:13006-13512. The nucleic acid sequences of the subject invention can also be used as molecular weight markers in nucleic acid analysis procedures.
[0121]The invention also provides host cells transformed by a polynucleotide according to the invention and the production of BIVM (or BIVM ortholog) polypeptides by the transformed host cells. In some embodiments, transformed cells comprise an expression vector containing BIVM, or BIVM ortholog, polynucleotide sequences. Other embodiments provide for host cells transformed with nucleic acids. Yet other embodiments provide transformed cells comprising an expression vector containing fragments of BIVM, or BIVM ortholog, polynucleotide sequences. Transformed host cells according to the invention are cultured under conditions allowing the replication and/or the expression of the nucleotide sequences of the invention. Expressed polypeptides are recovered from culture media and purified, for further use, according to methods known in the art.
[0122]The host cell may be chosen from eukaryotic or prokaryotic systems, for example bacterial cells (Gram negative or Gram positive), yeast cells, animal cells, plant cells, and/or insect cells using baculovirus vectors. In some embodiments, the host cell for expression of the polypeptides include, and are not limited to, those taught in U.S. Pat. Nos. 6,319,691; 6,277,375; 5,643,570; 5,565,335; Unger [1997] The Scientist 11(17):20; or Smith [1998] The Scientist 12(22):20, each of which is incorporated by reference in its entirety, including all references cited within each respective patent or reference. Other exemplary, and non-limiting, host cells include Staphylococcus spp., Enterococcus spp., E. coli, and Bacillus subtilis; fungal cells, such as Streptomyces spp., Aspergillus spp., S. cerevisiae, Schizosaccharomyces pombe, Pichia pastoris, Hansela polymorpha, Kluveromyces lactis, and Yarrowia lipolytica; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and Bowes melanoma cells; and plant cells. A great variety of expression systems can be used to produce the polypeptides of the invention and polynucleotides can be modified according to methods known in the art to provide optimal codon usage for expression in a particular expression system.
[0123]Furthermore, a host cell strain may be chosen that modulates the expression of the inserted sequences, modifies the gene product, and/or processes the gene product in the specific fashion. Expression from certain promoters can be elevated in the presence of certain inducers; thus, expression of the genetically engineered polypeptide may be controlled. Furthermore, different host cells have characteristic and specific mechanisms for the translational and post-translational processing and modification (e.g., glycosylation, phosphorylation) of proteins. Appropriate cell lines or host systems can be chosen to ensure the desired modification and processing of the foreign protein expressed. For example, expression in a bacterial system can be used to produce an unglycosylated core protein product whereas expression in yeast will produce a glycosylated product. Expression in mammalian cells can be used to provide "native" glycosylation of a heterologous protein. Furthermore, different vector/host expression systems may effect processing reactions to different extents.
[0124]Nucleic acids and/or vectors can be introduced into host cells by well-known methods, such as, calcium phosphate transfection, DEAE-dextran mediated transfection, transfection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic introduction and infection (see, for example, Sambrook et al. [1989] Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).
[0125]The subject invention also provides for the expression of a polypeptide, derivative, or a variant (e.g., a splice variant) encoded by a polynucleotide sequence disclosed herein. Alternatively, the invention provides for the expression of a polypeptide fragment obtained from a polypeptide, derivative, or a variant encoded by a polynucleotide fragment derived from the polynucleotide sequences disclosed herein. In either embodiment, the disclosed sequences can be regulated by a second nucleic acid sequence so that the polypeptide or fragment is expressed in a host transformed with a recombinant DNA molecule according to the subject invention. For example, expression of a protein or peptide may be controlled by any promoter/enhancer element known in the art.
[0126]The subject invention also provides nucleic acid based methods for the identification of the presence of the BIVM gene, or orthologs thereof, in a sample. These methods can utilize the nucleic acids of the subject invention and are well known to those skilled in the art (see, for example, Sambrook et al. [1989] or Abbaszadega [2001] "Advanced Detection of Viruses and Protozoan Parasites in Water," Reviews in Biology and Biotechnology, 1(2):21-26). Among the techniques useful in such methods are enzymatic gene amplification (or PCR), Southern blots, Northern blots, or other techniques utilizing nucleic acid hybridization for the identification of polynucleotide sequences in a sample. Thus, the subject invention can provide nucleic acid based methodologies for the identification of G. lamblia in environmental or biological samples and provides sensitive assays for the diagnosis of G. lamblia infections. Alternatively, the nucleic acids can be used to screen individuals for cancers, tumors, or malignancies associated with dysregulation of the BIVM gene or its transcriptional products.
[0127]The subject invention also provides polypeptides encoded by nucleotide sequences of the invention. The subject invention also provides fragments of at least 5 amino acids of a polypeptide encoded by the polynucleotides of the instant invention. In some embodiments, the polypeptide fragments are reactive with antibodies found in the serum of an individual infected with G. lamblia.
[0128]In the context of the instant invention, the terms polypeptide, peptide and protein are used interchangeably. Likewise, the terms variant and homologous are also used interchangeably. It should be understood that the invention does not relate to the polypeptides in natural form or native environment. Peptides and polypeptides according to the invention have been isolated or obtained by purification from natural sources (or their native environment), chemically synthesized, or obtained from host cells prepared by genetic manipulation (e.g., the polypeptides, or fragments thereof, are recombinantly produced by host cells). Polypeptides according to the instant invention may also contain non-natural amino acids, as will be described below.
[0129]"Variant" or "homologous" polypeptides will be understood to designate the polypeptides containing, in relation to the native polypeptide, modifications such as deletion, addition, or substitution of at least one amino acid, truncation, extension, or the addition of chimeric heterologous polypeptides. Optionally, "variant" or "homologous" polypeptides can contain a mutation or post-translational modifications. Among the "variant" or "homologous" polypeptides, those whose amino acid sequence exhibits 20.00% to 99.99% (inclusive) identity to the native polypeptide sequence are preferred. The aforementioned range of percent identity is to be taken as including, and providing written description and support for, any fractional percentage, in intervals of 0.01%, between 50.00% and, up to, including 99.99%. These percentages are purely statistical and differences between two polypeptide sequences can be distributed randomly and over the entire sequence length.
[0130]"Variant" or "homologous" polypeptide sequences exhibiting a percentage identity with the polypeptides of the present invention can, alternatively, have 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 91, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent identity with the polypeptide sequences of the instant invention. The expression equivalent amino acid is intended here to designate any amino acid capable of being substituted for one of the amino acids in the basic structure without, however, essentially modifying the biological activities of the corresponding peptides and as provided below.
[0131]By way of example, amino acid substitutions can be carried out without resulting in a substantial modification of the biological activity of the corresponding modified polypeptides; for example, the replacement of leucine with valine or isoleucine; aspartic acid with glutamic acid; glutamine with asparagine; arginine with lysine; and the reverse substitutions can be performed without substantial modification of the biological activity of the polypeptides.
[0132]In other embodiments, homologous polypeptides according to the subject invention also include various splice variants identified within the BIVM coding sequence (see Table I).
[0133]The subject invention also provides biologically active fragments of a polypeptide according to the invention and includes those peptides capable of eliciting an immune response. In one embodiment, an immune response directed against G. lamblia is provided. The immune response can provide components (either antibodies or components of the cellular immune response (e.g., B-cells, helper, cytotoxic, and/or suppressor T-cells)) reactive with the biologically active fragment of a polypeptide, the intact, full length, unmodified polypeptide disclosed herein, or both the biologically active fragment of a polypeptide and the intact, full length, unmodified polypeptides disclosed herein. Biologically active fragments according to the invention comprise from five (5) amino acids to one amino acid less than the full length of any polypeptide sequence provided herein. Alternatively, fragments comprising 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, or 255 consecutive amino acids of a disclosed polypeptide sequence are provided herein.
[0134]Fragments, as described herein, can be obtained by cleaving the polypeptides of the invention with a proteolytic enzyme (such as trypsin, chymotrypsin, or collagenase) or with a chemical reagent, such as cyanogen bromide (CNBr). Alternatively, polypeptide fragments can be generated in a highly acidic environment, for example at pH 2.5. Such polypeptide fragments may be equally well prepared by chemical synthesis or using hosts transformed with an expression vector containing nucleic acids encoding polypeptide fragments according to the invention. The transformed host cells contain a nucleic acid and are cultured according to well-known methods; thus, the invention allows for the expression of these fragments, under the control of appropriate elements for regulation and/or expression of the polypeptide fragments.
[0135]Modified polypeptides according to the invention are understood to designate a polypeptide obtained by variation in the splicing of transcriptional products of the BIVM gene, genetic recombination, or by chemical synthesis as described below. Modified polypeptides contain at least one modification in relation to the normal polypeptide sequence. These modifications can include the addition, substitution, deletion of amino acids contained within the polypeptides of the invention.
[0136]In order to extend the life of the polypeptides of the invention, it may be advantageous to use non-natural amino acids, for example in the D form, or alternatively amino acid analogs, such as sulfur-containing forms of amino acids. Alternative means for increasing the life of polypeptides can also be used in the practice of the instant invention. For example, polypeptides of the invention, and fragments thereof, can be recombinantly modified to include elements that increase the plasma, or serum half-life of the polypeptides of the invention. These elements include, and are not limited to, antibody constant regions (see for example, U.S. Pat. No. 5,565,335, hereby incorporated by reference in its entirety, including all references cited therein), or other elements such as those disclosed in U.S. Pat. Nos. 6,319,691; 6,277,375; or 5,643,570, each of which is incorporated by reference in its entirety, including all references cited within each respective patent. Alternatively, the polynucleotides and genes of the instant invention can be recombinantly fused to elements that are useful in the preparation of immunogenic constructs for the purposes of vaccine formulation or elements useful for the isolation of the polypeptides of the invention.
[0137]The polypeptides, fragments, and immunogenic fragments of the invention may further contain linkers that facilitate the attachment of the fragments to a carrier molecule for the stimulation of an immune response or diagnostic purposes. The linkers can also be used to attach fragments according to the invention to solid support matrices for use in affinity purification protocols. In this aspect of the invention, the linkers specifically exclude, and are not to be considered anticipated, where the fragment is a subsequence of another peptide, polypeptide, or protein as identified in a search of protein sequence databases as indicated in the preceding paragraph. In other words, the non-identical portions of the other peptide, polypeptide, of protein is not considered to be a "linker" in this aspect of the invention. Non-limiting examples of "linkers" suitable for the practice of the invention include chemical linkers (such as those sold by Pierce, Rockford, Ill.), peptides which allow for the connection of the immunogenic fragment to a carrier molecule (see, for example, linkers disclosed in U.S. Pat. Nos. 6,121,424; 5,843,464; 5,750,352; and 5,990,275, hereby incorporated by reference in their entirety). In various embodiments, the linkers can be up to 50 amino acids in length, up to 40 amino acids in length, up to 30 amino acids in length, up to 20 amino acids in length, up to 10 amino acids in length, or up to 5 amino acids in length.
[0138]In other specific embodiments, the polypeptides, peptides, derivatives, or analogs thereof may be expressed as a fusion, or chimeric protein product (comprising the protein, fragment, analog, or derivative joined via a peptide bond to a heterologous protein sequence (e.g., a different protein)). Such a chimeric product can be made by ligating the appropriate nucleic acid sequences encoding the desired amino acid sequences to each other by methods known in the art, in the proper coding frame, and expressing the chimeric product by methods commonly known in the art (see, for example, U.S. Pat. No. 6,342,362, hereby incorporated by reference in its entirety; Altendorf et al. [1999-WWW, 2000] "Structure and Function of the Fo Complex of the ATP Synthase from Escherichia Coli," J. of Experimental Biology 203:19-28, The Co. of Biologists, Ltd., G. B.; Baneyx [1999] "Recombinant Protein Expression in Escherichia coli," Biotechnology 10:411-21, Elsevier Science Ltd.; Eihauer et al.
[0139]"The FLAG® Peptide, a Versatile Fusion Tag for the Purification of Recombinant Proteins," J. Biochem Biophys Methods 49:455-65; Jones et al. [1995] J. Chromatography 707:3-22; Jones et al. [1995] "Current Trends in Molecular Recognition and Bioseparation," J. Chromatography A. 707:3-22, Elsevier Science B. V.; Margolin [2000] "Green Fluorescent Protein as a Reporter for Macromolecular Localization in Bacterial Cells," Methods 20:62-72, Academic Press; Puig et al. [2001] "The Tandem Affinity Purification (TAP) Method: A General Procedure of Protein Complex Purification," Methods 24:218-29, Academic Press; Sassenfeld [1990] "Engineering Proteins for Purification," TibTech 8:88-93; Sheibani [1999] "Prokaryotic Gene Fusion Expression Systems and Their Use in Structural and Functional Studies of Proteins," Prep. Biochem. & Biotechnol. 29(1):77-90, Marcel Dekker, Inc.; Skerra et al. [1999] "Applications of a Peptide Ligand for Streptavidin: The Strep-tag", Biomolecular Engineering 16:79-86, Elsevier Science, B. V.; Smith [1998] "Cookbook for Eukaryotic Protein Expression: Yeast, Insect, and Plant Expression Systems," The Scientist 12(22):20; Smyth et al. [2000] "Eukaryotic Expression and Purification of Recombinant Extracellular Matrix Proteins Carrying the Strep II Tag", Methods in Molecular Biology, 139:49-57; Unger [1997] "Show Me the Money: Prokaryotic Expression Vectors and Purification Systems," The Scientist 11(17):20, each of which is hereby incorporated by reference in their entireties). Alternatively, such a chimeric product may be made by protein synthetic techniques, e.g., by use of a peptide synthesizer.
[0140]Another embodiment of the subject invention provides for the use of polypeptides encoded by the polynucleotides of the subject invention for the induction of an immune response or protective immunity in a subject to which the polypeptides are administered. In this aspect of the invention, compositions containing polypeptide are administered to a subject in amounts sufficient to induce an immune response, and/or induce protective immunity. The composition administered to the subject may, optionally, contain an adjuvant and may be delivered to the subject in any manner known in the art for the delivery of immunogen to a subject. Compositions may be formulated in any carriers, including for example, carriers described in E. W. Martin's Remington's Pharmaceutical Science, Mack Publishing Company, Easton, Pa.
[0141]The expression of the BIVM gene or BIVM gene product (e.g., DNA, RNA, or polypeptide) is dysregulated in a variety of cancers, tumors, and/or malignancies. Non-limiting examples of such cancers, tumors, and/or malignancies include prostate cancer, breast cancer, melanoma, chronic myelogenous leukemia, cervical cancer, adenocarcinomas, lymphoblastic leukemia, colorectal cancer, and lung carcinoma. Accordingly, the present invention provides a method for screening, or aiding in the diagnosis of, an individual suspected of having a malignancy or cancer. The subject invention provides methods comprising the steps of determining the amount of BIVM in a biological sample obtained from said individual and comparing the measured amount of BIVM to the amount of BIVM found in the normal population. The presence of a significantly increased amount of BIVM is associated with an indication of a malignancy or cancer. BIVM gene product can be detected by well-known methodologies including, and not limited to, Western blots, enzyme linked immunoassays (ELISAs), radioimmunoassays (RIAs), Northern blots, Southern blots, PCR-based assays, or other assays for the quantification of gene product known to the skilled artisan. This information, in conjunction with other information available to the skilled practitioner, assists in making a diagnosis.
[0142]The terms "comprising", "consisting of" and "consisting essentially of" are defined according to their standard meaning and may be substituted for one another throughout the instant application in order to attach the specific meaning associated with each term.
[0143]Following examples illustrate procedures for practicing the invention. These examples should not be construed as limiting. All percentages are by weight and all solvent mixture proportions are by volume unless otherwise noted.
EXAMPLE 1--IDENTIFICATION OF BIVM
[0144]Human BIVM was identified originally as an EST (IMAGE #785450; GenBank AA449273) that encodes the two short motifs WFRQ (motif 2 [M2]) and YFC (motif 3a [M3a]), which correspond to framework region 2 (FR2) and FR3 of an Ig V domain, respectively (Barclay, A. N., et al. [1997] The Leucocyte Antigen FactsBook, Academic Press, San Diego). The W in M2 and C in M3a correspond to W41 and C104 of the IMGT numbering system. Complete sequencing of this EST, overlapping ESTs (IMAGE #2184889, GenBank AI538125; IMAGE 136117, GenBank R33273; IMAGE 1060823, GenBank AA568610; and IMAGE 785450, GenBank AA449273) and RACE strategies were used to resolve the complete mRNA sequence.
[0145]Human BIVM mRNA is 3857 nucleotides and encodes a 503 amino acid protein (FIG. 1). No proteins with significant identities (E<0.01) to BIVM have been identified using BLAST analyses. Searches of current motif databases (BLOCKS, PRINTS, Conserved Domain Database, Domain Architecture Retrieval Tool, Simple Modular Architecture Research Tool) also failed to identify any additional significant motifs within the BIVM protein.
[0146]In addition to the shared M2 and M3a motifs, a second V domain FR3 motif, YHC (M3b), is located several residues amino terminal of M3a. Furthermore, a putative FR1 motif (M1), encoding the conserved V domain residues G16 and C23 (IMGT amino acid numbering), was identified by visual inspection of BIVM peptide sequences (FIG. 1).
[0147]The 42 amino acids between M1 and M2 in BIVM are inconsistent with the sequence relationship in a V region in which the corresponding motifs would be separated by no more than 12 residues. This increased distance between C23 and C104 of M1 and M3a (or M3b), which normally form a disulfide bridge and stabilize the Ig domain architecture, strongly suggests that BIVM is not a member of the IgSF. Since these peptide motifs are extremely short, it could be argued that their presence in BIVM may be a random occurrence. However, it should be emphasized that in the original search of the EST database, only 17 sequences were identified that encode W(Y/F)R(Q/H) and YFC that are correctly spaced and maintain an open reading frame. Of these 17 sequences, 16 were TCR cDNAs (encoding WYRQ) and one was BIVM (encoding WFRQ) (Hawke, N. A., et al. [1999] "Expanding Our Understanding of Immunoglobulin, T-cell Antigen Receptor, and Novel Immune-Type Receptor Genes: a Subset of the Immunoglobulin Gene Superfamily," Immunogenetics 50:124-133).
EXAMPLE 2--GENOMIC ORGANIZATION OF BIVM
[0148]GeneBridge 4 radiation hybrid panel mapping (Gyapay, G., et al. [1996] "A Radiation Hybrid Map of the Human Genome," Hum Mol Genet 5:339-346) localized BIVM on chromosome 13q32-33 (data not shown). Examination of the publicly available Human Genome Project database revealed the exon-intron structure of BIVM. A 5' truncated BIVM sequence (hypothetical protein FLJ20159) was initially placed on the publicly available human genome map at 13q14-q21. The 5' untranslated region of BIVM consists of two separate exons (designated exons A and B), followed by the coding region consisting of nine exons; the exon/intron boundaries are indicated in Table I.
[0149]Inspection of genomic sequence localizes BIVM between ERCC5 and "hypothetical protein" MGC5302, a human ortholog of the gene encoding the mouse protein Kdel1/EP58 (Kimata, Y., et al. [2000] "Identification of a Novel Mammalian Endoplasmicreticulum-Resident KDEL Protein Using an EST Database Motif Search," Gene 261:321-327). A CpG island is located in the 5' untranslated region of BIVM; the 3' untranslated region contains an Alu sequence (FIGS. 1 and 2). The Alu polyA sequence in the 3' untranslated region leads to the spurious production of 3' truncated cDNAs including many that are represented as ESTs.
[0150]Multiple 5' untranslated region splice variants were observed in analysis of 5' RACE products. Specifically, exon A has at least 3 splice donor sequences and exon B, which has a poor splice acceptor sequence, can be absent from the mature transcript (FIGS. 1 and 2; Table I). In addition, it is likely that multiple transcriptional start sites are present (FIG. 1).
EXAMPLE 3--BIVM IS HIGHLY CONSERVED WITHIN DEUTEROSTOME SPECIES
[0151]BIVM orthologs were identified in: mouse, chicken, Xenopus and zebrafish in order to address its potential phylogenetic conservation, as well as to define conserved motifs potentially relevant to function. In addition, a partial sequence for a BIVM ortholog was identified in sea urchin. The identity of the human BIVM protein to these orthologs ranges from 35-87% overall and is consistent with the phylogenetic relationships of the species considered (FIG. 3; see below). The C-terminal region of BIVM shares the highest degree of interspecific sequence identity. The N-terminus of this peptide domain is RK(V/C)LD (SEQ ID NO: 65) and the C-terminus is GGNLHC (SEQ ID NO: 60). This region includes all of the V domain motifs, and is 220 amino acids in human (indicated by arrowheads in FIG. 3).
[0152]The corresponding domains in mouse, chicken, Xenopus, zebrafish and sea urchin are 97%, 91%, 91%, 87% and 64% identical to the human domain, respectively. In addition, BIVM ESTs have been identified from an ascidian, sea squirt (Halocynthia roretzi) (e.g., GenBank AV385966), and a BIVM cDNA fragment has been isolated from a protochordate (cephalochordate), lancelet (Branchiostoma floridae), using an RT-PCR strategy (Yoder and Litman, GenBank AF411393). Their sequences within this domain are highly conserved.
EXAMPLE 4--CLOSE PHYSICAL LINKAGE OF BIVM AND EP58/MGC5302
[0153]Human BIVM maps between EP58/MGC5302 and ERCC5 on 13q. The human EP58 EST (that extends most 5'), places the transcriptional start sites of EP58 and BIVM only 41 bp apart. We identified a mouse BIVM genomic clone (from a e FixII library), which also encodes the 5' end of Ep58/Kdel1 (FIG. 4). The mapping position of Ep58/Kdel1 and BIVM in mouse has not yet been determined. The tight physical linkage of the EP58 to BIVM (41 bp in human and 224 bp in mouse) is consistent with a shared regulatory control system that functions in opposite directions (FIG. 4). Notably, both Ep58 and BIVM appear to be ubiquitously expressed (FIG. 5) (Kimata, Y., et al. [2000] "Identification of a Novel Mammalian Endoplasmicreticulum-Resident KDEL Protein Using an EST Database Motif Search," Gene 261:321-327). Finally, zebrafish BIVM has been mapped to linkage group 6 (LG6); however, its linkage relationship to kdel1 is unknown.
EXAMPLE 5--EXPRESSION OF HUMAN BIVM
[0154]The human BIVM transcript is ˜3.8 kb and appears to be expressed ubiquitously; the highest relative levels of expression are in spleen, ovary, small intestine, colon, peripheral leukocytes and liver (FIG. 5A). Additional RNA dot blot analyses indicate expression of BIVM in human testes, ovary, aorta, appendix, trachea, pituitary gland, bladder, uterus, spinal cord, salivary gland, stomach, mammary gland and bone marrow as well as in several fetal tissues (data not shown). Notably, BIVM expression was not detected in fetal spleen, adult thymus and certain cancer cell lines (e.g., promylocytic leukemia, HL-60, and Burkitt's lymphoma Raji) while significant expression was evident in other lines (e.g., HeLa, S3, and colorectal adenocarcinoma, SW480).
EXAMPLE 6--EXPRESSION OF BIVM IN OTHER SPECIES
[0155]The predominant mouse BIVM transcript also is ˜3.8 kb (FIG. 5B), of which ˜3.3 kb have been sequenced. Comparisons of 5' mouse BIVM RACE products indicate that the 5' untranslated region undergoes alternative RNA splicing, which, like in the human gene, does not affect the coding sequences. The highest levels of expression of mouse BIVM are in heart, brain, liver and kidney (FIG. 5B).
[0156]A major difference between the expression of human and mouse BIVM is observed in the spleen, in which expression is high in the human but appears to be minimal in the mouse. In the developing mouse embryo, BIVM expression is detected at a uniform level after gastrulation (FIG. 5B). An ˜2.1 kb XBIVM cDNA was identified in Xenopus that is consistent with the length of the predominant transcript observed in RNA blotting (FIG. 5C). The broad, diffuse nature of the principal hybridizing band could reflect sequence heterogeneity. The nature of the larger transcript (˜4.4 kb) is unknown. Northern blot analysis of sea urchin RNA detects two SpBIVM transcripts of ˜7.4 and 8 kb (FIG. 5D), which are notably longer than the human and mouse forms. The additional sequence in these transcripts might be a result of additional 5' or 3' untranslated regions and/or could reflect polyadenylation effects. Extended 3' untranslated regions are encountered frequently with sea urchin mRNA.
[0157]Real-time PCR was used to analyze BIVM expression levels throughout development in zebrafish (FIG. 5E). As observed in Xenopus and sea urchin, there is a large maternal store of BIVM transcript in the 1-cell embryo (0 hpf in zebrafish) which appears to be quickly lost after the initial cellular division(s). In zebrafish, the level of BIVM expression drops by ˜90% within the first 6 hours of life (midgastrula stage) and is comparatively undetectable by 12 hpf (post-gastrula stage). Although comparable stages of development were not examined in mouse (see above), it is likely that this early embryonic regulation of BIVM expression will be conserved.
[0158]We noted BIVM expression in chicken bursa, which serves as the primary site of B lymphocyte differentiation. BIVM expression in chicken bursa decreases slightly between embryonic day 12 and day 14, increases significantly at day 19, and is the highest in the 4 month old chicken bursa, in which levels are 6-fold greater than observed in embryonic fibroblasts (CEFs; FIG. 5F). Expression of BIVM in other tissues in chicken has not been characterized.
EXAMPLE 7--BIVM ENCODES A NUCLEAR/CYTOPLASMIC PROTEIN
[0159]The relatively high predicted pI of BIVM (9.1) suggests that it may bind other proteins and/or DNA (or other nucleic acids). The levels of BIVM produced from the native pBIVM-N2 construct and modified pBIVM-K1 construct (see Methods) were compared in whole cells lysates from transiently transfected Cos7 cells. BIVM levels are higher in cells transfected with the modified pBIVM-K1 (FIG. 6A), which was used in all subsequent transfection experiments. It should be noted that the size of this recombinant protein (with C-terminal epitope tags) is ˜61 KDa, whereas the native protein (without post-translational modifications) is predicted to be ˜57 kDa. The observation that a single protein is generated from this transcript argues that translation does not begin at a more 3' ATG as suggested by the "hypothetical protein" FLJ20159 GenBank entries (which are predicted to encode a ˜27 kDa protein). Western analysis using antibodies that recognize the V5 peptide sequence indicate that the epitope-tagged BIVM is present both in cytoplasmic and nuclear fractions (FIG. 6B). These results were confirmed by direct immunohistochemical localization of BIVM in the cytoplasm and nucleus (FIG. 6C-J). Variation in the relative amounts of BIVM in the nucleus was observed in individual cells. Thus, it is possible that the BIVM protein enters and exits the nucleus in a regulated or cell-cycle-dependent manner.
EXAMPLE 8--GIARDIA MAY HAVE ACQUIRED A BIVM ORTHOLOG BY HORIZONTAL GENE TRANSFER
[0160]A tBLASTn search identified a BIVM-like gene (named BIVML) in the genome of the primitive protozoan parasite, Giardia lamblia (McArthur, A. G., et al. [2000] "The Giardia Genome Project Database," FEMS Microbiol Lett 189:271-273). The 2045 nucleotide BIVML cDNA is predicted to encode a 270 amino acid protein (predicted MW ˜30 kDa; Pi=7.56) with no predicted signal peptide, membrane spanning regions or nuclear localization signal; thus, it is likely to be cytosolic. BIVML contains 17 cysteine residues (6.2%) throughout the protein (FIG. 7A). Known giardial proteins that are secreted to the trophozite surface or the cyst wall are also highly cysteine rich. This sequence is 22-25% identical and 46-49% similar to the carboxyl-terminal region of all deuterostome BIVM peptides described here, correlates directly with the conserved domain described above, and includes the M2 and M3b motifs (FIG. 7B). Northern analysis detects an ˜2.0 kb BIVML transcript as well as a larger transcript of unknown identity in both vegetatively growing and encysting cells (FIG. 7C).
[0161]BIVML is unusual in having long untranslated regions consistent with the size of the transcript. The 5' and 3' untranslated regions were determined by RACE and are 229 nucleotides and 983 nucleotides, respectively (FIG. 7A), most transcripts of giardial chromosomal genes characterized to date have very short (<20 nucleotides) untranslated regions, although exceptions are being noted.
[0162]The identification of a BIVM ortholog in such an early branching eukaryote was unexpected since tBLASTn searches of the currently available S. cerevisiae and Drosophila as well as S. pombe and C. elegans genome databases failed to identify any sequences exhibiting significant identity to BIVM. Furthermore, it has not been possible to identify BIVM-like sequences in the complete genomes of Campylobacter jejuni (Parkhill, J., et al. [2000] "Complete DNA Sequence of a Serogroup A Strain of Neisseria Meningitidis Z2491," Nature 404:502-506), Mycobacterium leprae (Cole, S. T., et al. [2001] "Massive Gene Decay in the Leprosy Bacillus," Nature 409:1007-11), Mycobacterium tuberculosis (Cole, S. T., et al. [1998] "Deciphering the Biology of Mycobacterium Tuberculosis from the Complete Genome Sequence," Nature 393:537-544), or Neisseria menigitidis (Parkhill, J., et al. [2000] "The Genome Sequence of the Food-Borne Pathogen Campylobacter Jejuni Reveals Hypervariable Sequences," Nature 403:665-668). In DNA hybridization studies, a Giardia BIVML probe failed to cross-hybridize to Trichomonas foetus, Trichomonas vaginalis or Entamoeba histolytica genomic DNA (data not shown).
[0163]The identification of a BIVM-like gene in the Giardia genome, but not in other similar proteostome genomes, taken together with the fact that Giardia is parasitic, suggests that BIVML may have been acquired via horizontal gene transfer from a higher eukaryotic host.
EXAMPLE 9--PHYSICAL LINKAGE OF HUMAN AND MOUSE BIVM TO THE EP58/MGC5302-EP58/KDEL1 GENE
[0164]The transcriptional start site of the human EP58/MGC5302 sequence (GenBank XM--015844) is only 41 bp from that of BIVM: BIVM and EP58 genes are in a head-to-head orientation, in opposite transcriptional orientation. The mouse EP58/Kdel1 and BIVM genes share the same physical orientation separated by 224 bp. This exceedingly tight physical linkage and close spacing of BIVM and EP58 suggests that common regulatory elements located in or near the intergenic region potentially control the expression of both genes. RT-PCR analysis of extracts from BIVM expressing and non-expressing human cell lines indicated that EP58/MGC5302 was expressed in all cell lines that express BIVM but not in the BIVM non-expressing cell line, Raji (FIG. 8). Based on these results, it is possible that these genes are co-regulated and that the transacting factors associated with the 41 bp intergenic region linking these genes control their expression.
EXAMPLE 10--DNA BINDING ACTIVITY ON THE BIVM-EP58/MGC5302 41 BP INTERGENIC REGION
[0165]A MatInspector V2.2 search for potential binding sites contained in the 41 bp region separating the BIVM and EP58/MGC5302 genes revealed sites for cell type specific factors such as the myeloid zinc finger-1 (MZF-1), the hematopoietic-expressed Ikaros-2 (IK2) factor, and the ubiquitously expressed transcription factors NF1, USF, NFκB, and NMYC (FIG. 9). Nearly identical sites also were predicted for the mouse 224 bp Bivm-Kdel1 intergenic region. MZF-1 and IK2 are expressed in the K562 human erythroleukemia cell line and IK2 is expressed in the Raji Burkitt's lymphoma cell line. Based on this information, electrophoretic mobility shift assays (EMSAs) were performed to compare protein binding to the 41 bp region in nuclear extracts from BIVM expressing and non-expressing cells (FIG. 10).
[0166]MZF-1 and IK2-specific binding would be expected to produce unique bands in the K562 and Raji nuclear extracts that are not observed in nuclear extracts from non-lymphoid cell lines. In addition, an NFκB consensus sequence was used as probe and competitor (Santa Cruz Biotechnology) to detect bands representing NFκB-specific binding that may be constitutively present in the nuclear extracts (FIG. 10). Significant DNA binding activity was observed with the 41 bp BIVM-specific probe in all extracts assayed, producing 1 minor band and two major bands (FIG. 10; Lanes 4-10), one of which was competed by the addition of cold NFκB-specific probe, indicating that NFκB complexes may be present (FIG. 10; Lane 3). One major band was detected with the NFκB consensus probe in the nuclear extracts from BIVM expressing lines (FIG. 10; Lanes 13-17) that was competed by the BIVM-specific probe (FIG. 10; Lane 11). An additional complex also was observed bound to the NFκB-specific probe in the extracts from a BIVM non-expressing line (FIG. 10; Lane 18). Together these results show that the 41 bp BIVM-EP58/MGC5302 intergenic region supports DNA binding activity and that the bound complexes include factors that also bind the NFκB consensus probe. Similar DNA binding activity was observed in the BIVM non-expressing Raji cell line as in the BIVM expressing cells and may result from constitutive nuclear NFκB factors and suggests either that additional flanking regions function in BIVM gene regulation or that protein co-factors or other mechanisms, such as methylation-dependent promoter silencing, could play a role in BIVM expression. The presence of a CpG island 5' of the BIVM gene, together with the lack of both BIVM and EP581MGC5302 expression in the Raji cell lines, supports the latter hypothesis.
EXAMPLE 11--REGULATION OF BIVM EXPRESSION BY TNF-A OR OTHER INDUCING AGENTS
[0167]As described above, the 41 bp intergenic region contains putative sites for ubiquitous transacting factors and an NFκB site that appears to be bound by NFκB complexes containing c-Rel and RelB factors, which are constitutively present in the nuclear extracts from the BIVM expressing K562 cell line (FIG. 1). NFκB comprises a large family of transcription factors, most of which are sequestered in the cytoplasm through inhibitor binding. Activation of the cell by various agents, such as the proinflammatory cytokine TNF-a, leads to phosphorylation-induced degradation of the inhibitor and nuclear translocation of additional NFκB transacting factors. Although constitutive factors may drive basal BIVM expression, TNF-α activated NFκB increases the expression of BIVM in the BIVM-expressing HeLa cell line (DNS). Furthermore, a cell line devoid of basal BIVM expression, the Raji Burkitt's lymphoma line, is induced to express BIVM by TNF-a (FIG. 12). The specific TNF-a activated factors associated with the BIVM promoter can be defined using antibody shift assays.
EXAMPLE 12--CHARACTERISTICS OF RECOMBINANT BIVM PROTEIN
[0168]The BIVM encoded protein has a high proportion of lysine and arginine residues and a predicted isoelectric point (pI) of 9.1. The net positive charge under physiological conditions suggests that BIVM may interact with other proteins and/or DNA. Western blot analysis and cytoimmunofluorescence studies utilizing transfected, epitope-tagged BIVM expression constructs revealed that BIVM is present in both cytoplasmic and nuclear fractions. Variation in the relative amounts of nuclear recombinant BIVM was observed in individual cells and may reflect regulated or cell cycle-dependent BIVM nuclear import/export. The Cos7 cells that have been transformed stably with BIVM exhibit a decreased cell doubling time compared to untransformed Cos7 cells, suggesting the potential role for BIVM in cell cycle regulation. Furthermore, preliminary studies of Cos7 BIVM stable transformants stained with a nuclear stain (DAPI) reveal a high proportion of cells containing multiple nuclei compared to untransformed cells. Flow cytometer analyses of these cells stained with propidium iodide indicate that ˜90% of the cells contain tetraploid or greater DNA content, consistent with the presence of multiple nuclei (FIG. 13; Panel 3). This phenomenon was not observed in a G418-resistant, BIVM-revertant cell line, which has lost expression of recombinant BIVM and exhibits both a nuclear morphology and a diploid DNA content similar to that of the untransformed parental line (FIG. 13; Panels 1 & 2).
EXAMPLE 13--IDENTIFICATION OF BIVM PROTEIN BINDING PARTNERS
[0169]The high proportion of lysine and arginine residues and the net charge of the protein (pI 9.1) suggest that BIVM may interact with proteins and/or DNA (or other nucleosides). Specifically, protein-protein interactions are being assayed using the BacterioMatch two hybrid system (Stratagene). This system provides a rapid, selective approach to identify BIVM-specific protein interactions in vivo. Mouse Bivm has been utilized initially as we can take advantage of mouse cDNA libraries that are commercially available for this system (Stratagene) and because the results obtained can be used to complement concurrent BIVM knock out mice studies now underway in our laboratory. Although it is possible that BIVM may function differently in human and mouse, the 87% sequence conservation between human and mouse BIVM protein, strong synteny in BIVM flanking genes, and the tight physical linkage observed between the BIVM and EP58 genes, is consistent with functional equivalence.
EXAMPLE 14--MATERIALS AND METHODS
Example 14A--General Methods
[0170]RNA was isolated with RNAzol B (Teltest, Friendswood, Tex.) or Trizol (Gibco BRL, Rockville, Md.). Mouse genomic DNA (e FixII) and liver cDNA (e ZAPII) libraries were screened using standard procedures (Strong, S. J., et al. [1999] "A Novel Multigene Family Encodes Diversified Variable Regions," Proc Natl Acad Sci USA 96:15080-15085). DNA sequencing and the analysis of DNA sequences were carried out as described previously (Rast, J. P. et al. [1994] "T Cell Receptor Gene Homologs are Present in the Most Primitive Jawed Vertebrates," Proc. Natl Acad. Sci. USA 91:9248-9252). Alignments were constructed using ClustalW 1.8. Identity relationships were examined using BLAST and ALIGN software. Rapid amplification of cDNA ends (RACE) utilized a standard protocol (Mertineit, C., et al. [1998] "Sex-Specific Exons Control DNA Methyltransferase in Mammalian Germ Cells," Development 125:889-897) or the GeneRacer kit (Invitrogen, Carlsbad, Calif.). The RNA sources for RACE were: human HeLa cells, mouse liver, chicken bursa, Xenopus laevis liver, zebrafish (Danio rerio) liver, 15 hpf sea urchin (Strongylocentotus purpuratus) embryos, and vegetative-stage Giardia lamblia.
Example 14B--Genomic Mapping
[0171]Human BIVM was mapped using HSMAP5 (CCATGCCTCTCTACTACTCACTCCCAACAC) (SEQ ID NO: 51) and HSMAP6 (GGTAAGAAGAACACCATTGTGTTTGAAGGC) (SEQ ID NO: 52) intronic primers (which correspond to sequence between exon 8 and 9) and the GeneBridge 4 radiation hybrid (RH) panel (Gyapay, G., et al. [1996] "A Radiation Hybrid Map of the Human Genome," Hum Mol Genet 5:339-346) (Research Genetics, Huntsville, Ala.). Zebrafish BIVM (see below) was mapped using the zfBIVMMAPF1 (CAATGCCTAACACTGTGGAAAGTGAAGGCG) (SEQ ID NO: 53) and zfBIVMMAPR1 (GATAACTGTCGAGCTCGGTTGAGCAGGGC) (SEQ ID NO: 54) primers and the T51 RH panel (Glusman, G., et al. [1996] "Sequence Analysis in the Olfactory Receptor Gene Cluster on Human Chromosome 17: Recombinatorial Events Affecting Receptor Diversity," Genomics 37:147-160) (Research Genetics). Additional gene mapping data were derived from the Human-Mouse Homology Map and the Mouse Genome Informatics Database (Blake, J. A., et al. and Mouse Genome Database Group [2001] "The Mouse Genome Database (MGD): Integration Nexus for the Laboratory Mouse," Nucleic Acids Res 29:91-94).
Example 14C--Identification of BIVM Orthologs
[0172]Mouse BIVM Partial sequence of the mouse BIVM gene was obtained by screening a mouse genomic library with a human BIVM cDNA probe. A mouse BIVM cDNA was recovered by screening a liver cDNA library with a probe corresponding to mouse exon 6.
[0173]Chicken BIVM tBLASTn searches using the human BIVM sequence identified a chicken (Gallus gallus) bursal EST (GenBank AJ399198) encoding an avian ortholog (BIVM). RACE strategies identified a complete open reading frame cDNA. A single RNA-splicing variant, which encodes an additional 23 amino acids, also has been sequenced (GenBank AF411388; data not shown).
[0174]Xenopus XBIVM Partial Xenopus laevis XBIVM sequence was identified as an oocyte EST (GenBank BF047666) using tBLASTn searches with the human BIVM sequence. RACE strategies resolved a complete open reading frame cDNA.
[0175]Zebrafish BIVM Touchdown PCR (Don, R. H., et al. [1991] "`Touchdown` PCR to Circumvent Spurious Priming During Gene Amplification," Nucleic Acids Res 19:4008) and nested degenerate primers, designed with CODE-HOP software (Rose, T. M., et al. [1998] "Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Distantly Related Sequences," Nucleic Acids Res 26:1628-35), were used to amplify BIVM cDNA fragments from zebrafish liver. Primers for the primary PCR were designed to amplify the coding sequence between the amino acid motifs GNTTLMWRF and YFCPIGFEA; primers for the nested PCR were designed to amplify the sequence between motifs WFRQINDHF and YRHQNHYFCP. PCR products of the expected size were gel purified, cloned and sequenced. Full-length clones were derived by RACE.
[0176]Sea urchin SpBIVM A fragment of the sea urchin SpBIVM cDNA was recovered from 20 hpf embryo cDNA using nested PCR as described for zebrafish. RACE strategies identified a 1,899 nucleotide coding region that corresponds to the complete open reading frame of BIVM from other species; as of yet it has not been possible to identify a stop codon.
[0177]Giardia lamblia BIVM-like The Giardia lamblia BIVML sequence was initially identified with a tBLASTn search of the High Throughput Genomic (HTGS) database with the human BIVM sequence. BIVML is encoded in four overlapping genomic clones (clone KJ1556, GenBank #AC049185; clone MJ4898, GenBank AC083097; clone EJ2770, GenBank #AC038625; and clone KI0613, GenBank #AC046875). RACE was used to identify the complete, 2,045 nucleotide cDNA.
Example 14D--Transient Transfections
[0178]The coding region of human BIVM was cloned into pcDNA3.1/V5-His TOPO (Invitrogen) in order to generate pBIVM-N2, which encodes a BIVM-V5 fusion protein (the VS epitope is at the C terminus). A similar construct, pBIVM-K1, was generated in which the translational start sequence was modified in order to increase protein synthesis, as described in Kozak, M. [1996], "Interpreting cDNA Sequences: Some Insights from Studies on Translation," Mamm. Genome 7:563-574. Both of these constructs were then subcloned into pIRES2-EGFP (Clontech, Palo Alto, Calif.) to create pBIVM-N2/EGFP and pBIVM-K1/EGFP, which produced recombinant BIVM and EGFP from the same plasmid. Cos7 cells (˜60% confluent) were transiently transfected with expression constructs using the GENEJAMMER® transfection reagent according to manufacturer's instructions (Stratagene, La Jolla, Calif.).
Example 14E--Western Blots
[0179]Whole cell lysates were prepared from transfected cells in the presence of 1× Protease Inhibitor Cocktail Set III (Calbiochem, San Diego, Calif.) essentially as recommended by Santa Cruz Biotechnology. Nuclear and cytoplasmic extracts were prepared from transfected cells essentially as described in Yu, C. L., et al. [1995] "Enhanced DNA-Binding Activity of a Stat3-Related Protein in Cells Transformed by the Src Onco Protein," Science 269:81-83. Protein concentrations were determined using Protein Assay Reagent (Bio-Rad, Hercules, Calif.). Whole cell, nuclear, and cytoplasmic extracts were separated by SDS-polyacrylamide gel electrophoresis (10% polyacrylamide), transferred to Immobilon P filters (Millipore, Bedford, Mass.) and blocked prior to incubation with mouse anti-V5 monoclonal antibody (Invitrogen), anti-OCT1 polyclonal antibody (Santa Cruz) or anti-HSP90 monoclonal antibody (StressGen Biotechnologies Corp, Victoria, BC, Canada). Following incubation with alkaline phosphatase-conjugated secondary antibodies, reactive proteins were detected using Western Blue Stabilizer Substrate (Promega, Madison, Wis.).
Example 14F--Immunohistochemistry
[0180]Transfected Cos7 cells were fixed for 15 minutes with 3% paraformaldehyde, permeabilized in 1% Triton-X 100, incubated with primary antibodies, washed and incubated with secondary antibodies and 2 μg/ml Hoechst 33258. Primary antibodies included a mouse anti-V5 monoclonal antibody and an anti-actin polyclonal antibody (ICN Pharmaceuticals, Inc., Costa Mesa, Calif.) that were detected with a Cy2-conjugated, anti-mouse antibody (Jackson Immuno Research Laboratories, West Grove, Pa.) and a Cy3-conjugated, anti-rabbit antibody (Sigma, St. Louis, Mo.), respectively.
Example 14G--RNA Blots
[0181]Multiple Tissue Northern (MTN®) blots (human and mouse) were obtained from Clontech. In addition, 10 μg of Xenopus, sea urchin and Giardia lamblia total RNA were subjected to electrophoresis through 1.2% agarose, 2.2 M formaldehyde gels and transfer to nylon membranes (Zetaprobe®-GT; BioRad). RNA blots were hybridized with radiolabeled probes in Expresshyb® (Clontech). The Giardia RNA blot was hybridized with single strand-specific probes as described in Knodler, L. A., et al. [1999] "Developmental Gene Regulation in Giardia Lamblia: First Evidence for an Encystation-Specific Promoter and Differential 5' mRNA Processing," Mol Microbiol 2:327-340. Blots were stripped and reprobed with actin, 18S rRNA or Calmodulin probes.
Example 14H--Quantitative PCR
[0182]Real time PCR analysis detected BIVM expression from chicken bursa and zebrafish embryos and tissues using a GeneAmp 5700 Sequence Detection System (PE Biosystems, Foster City, Calif.) with SYBR Green detection. Each PCR series was done in triplicate. The relative expression levels were determined for each transcript from plasmid standards that were included in each experiment and normalized to the expression of S17 rRNA (chicken bursa) or S26 rRNA (zebrafish) levels.
[0183]All patents, patent applications, provisional applications, and publications referred to or cited herein are incorporated by reference in their entirety, including all figures and tables, to the extent they are not inconsistent with the explicit teachings of this specification.
[0184]It should be understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application.
Sequence CWU
1
6513857DNAHomo sapiensmisc_feature(1)..(473)Exon A - untranslated
1agtaacgcct tctccaagtg gatggcgggg tggacacgcg tcccggcgcc ccgggctccc
60tgggatatgt agttcgcgac aggacgagcg gaaatactgc caggatttta ccacctctcg
120cccatttatt tacttctcgg tcaccgcttt cgggggacag ataaacacca cagatgccca
180tcaaaggggc gcacgggtct ggaggcgcag ctcaggtttt tgcgttggtc accctgccct
240ccgcacgtgg agagggcagg cataaagcac cttgaaagga aggtgctgtc aatgctatcc
300gacgacctgt cgccgggcac cgcagcatcc tcgctcgctc cgatgggacg agggacgccg
360gccccagggt aacaggaggc gcctcgccgg ccgcgcgctg gatgctgtga tccaggtccg
420gagccgggtt ccgccgcggc cgcagcgacc cgaccccacc cgacaggcca gaggaatcag
480tttagacttg aaattcagtt tttcctgaaa ctgatcagaa gttagtgaca ccttgattgg
540atccgttttt ctgtcaggag ctcattttgc agctctcaag cttttatagc atgctgtaaa
600caattgtcaa agttgtttat caagaaacag atagagttgc aacttgtttc tagtaataga
660aacttttaca ctgcattcaa tgcctaacgt tgcagaaaca gaaaggtcaa atgattctgg
720aaatggtgag cacaaatctg agagaaagtc acctgaagag aatctacaag gtgctgtaaa
780atctttctgc acaagtgcct caggagcacc cttgggtccc aaaggagatg gtcattatcc
840atggagttgt ccagtgactc atacacggga aaaaatttat gccatctgtt cggactatgc
900ctttctcaac caggcgacct caatctataa aactccaaat ccatcccgct ctccttgcct
960ccctgatagt acctctttat ctgctggaaa taattcatca agatacattg gtatcccgac
1020tagtacatcg gaaattatct acaatgaaga aaatagcttg gaaaacttat ccaacagcct
1080gggcaagcta cctctcgcat gggaaattga taaatctgaa tttgatgggg tgaccacaaa
1140ttcgaaacac aaatcaggca atgcaaagaa acaagtttcc aagagaaaaa cttcagataa
1200aaagggaaga tatcagaagg aatgtcctca gcattctcct cttgaagata ttaaacagcg
1260gaaagtatta gacctcagac gatggtactg cataagccga ccacagtata agacttcttg
1320tggcatctct tcattaattt cttgttggaa tttcttatac agcacaatgg gagctggaaa
1380ccttccacct attacccaag aagaagcttt acatattctg ggctttcaac ctccatttga
1440agatattagg tttggtcctt tcacggggaa tacaacactt atgaggtggt ttagacaaat
1500taatgaccac ttccatgtaa aaggatgctc ttatgttcta tataagcctc atgggaagaa
1560taaaacagca ggagaaactg cttcaggggc cctgtcaaag ttaacccgtg gattgaaaga
1620tgaatcgctg gcttatatct atcattgcca aaatcattat ttttgtccaa ttggcttcga
1680agcaacccct gttaaagcta ataaagcatt cagcagggga cctctctcac cacaggaagt
1740tgaatattgg atcttaattg gagaatcaag tagaaaacat cctgccattc actgtaaaaa
1800atgggcagat attgttactg atctaaacac tcaaaatcca gaatacctgg atatccggca
1860cttagagagg ggactgcagt atagaaaaac aaagaaggtt gggggaaatt tgcattgcat
1920catagcattc cagagactta actggcaaag atttggcctt tggaactttc catttggaac
1980cattagacaa gaatcacaac ctccaacaca tgcccaggga attgccaaat ctgagagtga
2040agacaatatt tccaagaagc agcatgggcg tctgggccgg tctttcagtg ctagtttcca
2100tcaggactcg gcatggaaaa agatgtctag tatccatgag agaaggaaca gtggttacca
2160gggttacagt gattacgatg ggaatgattg actatgcttg ctactgaaca gctggcatta
2220tatatgaaac tgctatatac aggactgtat aaagacagta gaagatttta gtaagcctac
2280attaaatagg agcagatctt gtggtataaa aaataacctt gtagttctcc agatactaag
2340cttgtatatg attatggtgg gtgatttcag atatataagc agataagcac agattattgt
2400cctttcaagt taagagtata taatctggac agaaaatttc acaaaattca ataaaattac
2460aactgttgtc taaataagtg aaacacaaat tcacttaata gcatcaagat ttgaaatact
2520taagcatgaa gtgactttta taatgactcg atccctagac atttgttaca gatagtttta
2580tgcctaagac caagatgtaa agtaccatct gcccttaaaa aaaattgggg ctgtcaattt
2640ctagttttca ctcatggtta acacgcattt aaaattattt catgagtcta gtagttcttt
2700gatttatagc aggatcttgc ttgcctcatt tgtttcctgg ttatgttctt aggattctga
2760ctaagaggca aaagagaaaa gactcaagaa actgatcctg gagatcgaga ccatcctggc
2820taacatggtg aaaccccgtc tctactaaac atacaaaaaa ttagccgggt gtagtggtgg
2880gcacctgtag tcctagctac tcgagaggct gaggcaggag aatggcgtga acccgggagg
2940tggagcttgc agtgagcgga gatcgcgcca ctgcactcca gcctgggcga cagggcaaga
3000ctctgtctca aaaaaaaaaa aaaaaaaaaa gacggatcct tttttttggt gcaaatgggt
3060gacttagtgc attgattcag atttttaaaa tttcttgatg tggtttgtaa taatcaaata
3120ttgacaagaa ccttaggtct cgaaagactt ttataagtct agatgacgtt tgccttaggg
3180gtaaagtaaa agaacaattg gcaccttaag tttctatacc caaggttatc tgtgaaatga
3240gatctcctga tatttgattg ctttctcagt atggagtcat atgttgataa cagtactgaa
3300gatgcataag aaatgcccaa gtcactcaga ggacaactac ccatattcca gactctgagc
3360tgtttccttt ttaaaaatca tatagacaat tagctgtttg aagtgagtat taaatatttc
3420agaagtgtga atttcatgta tttgagctcc tctagttgct gttggttttt cttctgctgc
3480caacctgtga ctcacaaatg actaggatct cttgttcttt aattttaggg tcttgttcca
3540ggactcaaat cagtaacttg gtgattacaa ggtgctgaat gtgttggtaa ccatatcgca
3600atacacctca aggaaaaggt tcagattttt atttttaaaa tattttcatt tttttcttga
3660attttatatc cgtttgttca ctcgtacatg cctagcctac agaaggggat atatattatg
3720aaatggtcat ttttctgaag agaatatttt gcttgaaatg caaaggactg aaagagattt
3780gtaggttgtt gattttgtta cttcatactg gaacttttaa aaagatttca tcaaataaag
3840ttttgttttc tactttt
38572503PRTHomo sapiens 2Met Pro Asn Val Ala Glu Thr Glu Arg Ser Asn Asp
Ser Gly Asn Gly1 5 10
15Glu His Lys Ser Glu Arg Lys Ser Pro Glu Glu Asn Leu Gln Gly Ala
20 25 30Val Lys Ser Phe Cys Thr Ser
Ala Ser Gly Ala Pro Leu Gly Pro Lys 35 40
45Gly Asp Gly His Tyr Pro Trp Ser Cys Pro Val Thr His Thr Arg
Glu 50 55 60Lys Ile Tyr Ala Ile Cys
Ser Asp Tyr Ala Phe Leu Asn Gln Ala Thr65 70
75 80Ser Ile Tyr Lys Thr Pro Asn Pro Ser Arg Ser
Pro Cys Leu Pro Asp 85 90
95Ser Thr Ser Leu Ser Ala Gly Asn Asn Ser Ser Arg Tyr Ile Gly Ile
100 105 110Pro Thr Ser Thr Ser Glu
Ile Ile Tyr Asn Glu Glu Asn Ser Leu Glu 115 120
125Asn Leu Ser Asn Ser Leu Gly Lys Leu Pro Leu Ala Trp Glu
Ile Asp 130 135 140Lys Ser Glu Phe Asp
Gly Val Thr Thr Asn Ser Lys His Lys Ser Gly145 150
155 160Asn Ala Lys Lys Gln Val Ser Lys Arg Lys
Thr Ser Asp Lys Lys Gly 165 170
175Arg Tyr Gln Lys Glu Cys Pro Gln His Ser Pro Leu Glu Asp Ile Lys
180 185 190Gln Arg Lys Val Leu
Asp Leu Arg Arg Trp Tyr Cys Ile Ser Arg Pro 195
200 205Gln Tyr Lys Thr Ser Cys Gly Ile Ser Ser Leu Ile
Ser Cys Trp Asn 210 215 220Phe Leu Tyr
Ser Thr Met Gly Ala Gly Asn Leu Pro Pro Ile Thr Gln225
230 235 240Glu Glu Ala Leu His Ile Leu
Gly Phe Gln Pro Pro Phe Glu Asp Ile 245
250 255Arg Phe Gly Pro Phe Thr Gly Asn Thr Thr Leu Met
Arg Trp Phe Arg 260 265 270Gln
Ile Asn Asp His Phe His Val Lys Gly Cys Ser Tyr Val Leu Tyr 275
280 285Lys Pro His Gly Lys Asn Lys Thr Ala
Gly Glu Thr Ala Ser Gly Ala 290 295
300Leu Ser Lys Leu Thr Arg Gly Leu Lys Asp Glu Ser Leu Ala Tyr Ile305
310 315 320Tyr His Cys Gln
Asn His Tyr Phe Cys Pro Ile Gly Phe Glu Ala Thr 325
330 335Pro Val Lys Ala Asn Lys Ala Phe Ser Arg
Gly Pro Leu Ser Pro Gln 340 345
350Glu Val Glu Tyr Trp Ile Leu Ile Gly Glu Ser Ser Arg Lys His Pro
355 360 365Ala Ile His Cys Lys Lys Trp
Ala Asp Ile Val Thr Asp Leu Asn Thr 370 375
380Gln Asn Pro Glu Tyr Leu Asp Ile Arg His Leu Glu Arg Gly Leu
Gln385 390 395 400Tyr Arg
Lys Thr Lys Lys Val Gly Gly Asn Leu His Cys Ile Ile Ala
405 410 415Phe Gln Arg Leu Asn Trp Gln
Arg Phe Gly Leu Trp Asn Phe Pro Phe 420 425
430Gly Thr Ile Arg Gln Glu Ser Gln Pro Pro Thr His Ala Gln
Gly Ile 435 440 445Ala Lys Ser Glu
Ser Glu Asp Asn Ile Ser Lys Lys Gln His Gly Arg 450
455 460Leu Gly Arg Ser Phe Ser Ala Ser Phe His Gln Asp
Ser Ala Trp Lys465 470 475
480Lys Met Ser Ser Ile His Glu Arg Arg Asn Ser Gly Tyr Gln Gly Tyr
485 490 495Ser Asp Tyr Asp Gly
Asn Asp 500396898DNAHomo
sapiensmisc_feature(466)..(702)Inverse complement of MGC5302 Exon 6
3ctagacagag aaagcaaacc ctaattccca gacagaacat ttggatgagt ttcacgtcat
60tccccaccag aaggcctggt aagaaacaag tagtttatat cttccttatt ctgttgtgat
120aacatggctc tgggaaaaag tatttgtaaa atgacttaca tgtaagtatt aataaaaaca
180tatttggagg tgacatgtgc ttatctcaag ggttagctgg aaaaaaatta tagctctatc
240caggctccaa actcccttga tgcgcattta atatcaagac tgggcagtga gggcagaccc
300tggttgccaa aacagtcccc agcaccccca tgtctcaata ttcgctcaat tttgtgctaa
360cttctcccac ctcttgaaat ttgcaggcct taatttccct tcccaaagca ctatgtacaa
420atacattaga aaaacaaaaa agattagcta ctgattaagt cataccttga agaaatcaaa
480aaatgaaata tgtttcacaa tgggaccata caggttttca tcgtgtttaa agaagaaaaa
540gttggtgaaa gcagcgtcta tgagttctgg gtgttttcta ctgagtttaa ccagctcgag
600tctctctttg cggctgtctc gccctctcca gacggcagtg gaatttttgc tttcccaggg
660aggacccgtg ttagcttgca cggacatcat atccagactt accctagaag acaaagtgca
720acagattttc ctcccaaatc atcatatcac aaaggttgtt gcaaaagaac tcaccaaaga
780aagaaaacca aggctctggg ccaaccctga cttatctctc accggcccat ggtttccaga
840acagaatcag tcaaatcgta cgtaggcatc acgatatcct tggaatctgt ggagccacac
900caggaaaaga tcggatggat gtttgaattg gatttctttt tttccaaagg ccagtctccc
960aaattaacaa agagctccac atctggcatc ttcacctaga aaaaacaaaa cgaggagctt
1020attcacattt cctgcattgg ataatacatt gtatttatca atttaaattt gaaattgcat
1080taatgaaagg ccaatgttac atgtttcagg taataataat taccttggtt tgaatagtgg
1140gcaatgtttc agaatatggt cacatgtata agtattctat gctccctatg aaagggatga
1200aactgcaaaa gtaatctctg ttctaatact gcgaacagcc tctaagatga tcaaagagaa
1260ggagaattat tggaaaataa tttaaaaaat agcacttggg actctaaccc tagtttcctg
1320ctagtgagcc agatggttcc attttaaaaa acatacacca ggtaggtggt gggaccagga
1380tttgaactca ggaagcccaa caccaacagc aacgtagtaa gtttcaagct atgcttcctt
1440cctctactgg caaatggcat gaatatgtaa agaggatatg ttttatctag tcacagaaaa
1500tgtttagagt atttacaaaa caacagatat acatttttaa ggccagaaaa gcactgacca
1560gtctgagaag catcttgaga aaccgaagtc ctgaagaaca ttctcatttt cccatggatt
1620gacaaagagg agcaagagaa gtatgacggt actctgatgt cttagtttaa aggaggctta
1680atattgatgc tatataacta cctacattca gaattaacag actgtaagtg ctttgaaatc
1740ttgaaaaaaa ggcattatga ttttccatga gtagtttaat caagatatac atgcaaatta
1800ttcaacaaat aaatataata gattacaaat aaatatacat ttggtaaaaa tgttggtaga
1860gtctttcaca tccaccattt taacattact ttcaaaccat tccactagaa cccaacaaaa
1920gcccattatc ccaaggaatg ggattcagta caaggcaagt catctttgga ctcagagtta
1980gtttagtacc aaaataattt tatgataagg catttttctt tcctctataa aatattgtgt
2040ccctgcaaaa gatatgctga agtcttaaat cccagtgtct cagaatgacc ttatttggaa
2100atagggttgt tgcagatatt agttagcatg aggtcatact ggagtaggct gggccattaa
2160tccactatgc tgtgtcctta taagaagaga cccagagaca tgaggggaga agcccacgtg
2220atgacagagg cagagatggg aggatgcagc tgcaaaccaa ggaacaccaa agatggaggg
2280ccactgacag cagccaggaa gaggcaggaa aggaaaggaa aggatcatct cagagggagc
2340tggcctcctg acactttaat tttttttggg gggggacagg atctcactct tgtcacccag
2400gccttgggag tgcagtggcg tgatctcggc tcatgcagcc tcgacctcct aggttcaagc
2460gatcttcctg cctcagctcc ccattagctg aaactactac aggctaattt ttgtattttt
2520tgtagagaca ggtcttcacc atgttgccca gcctggtctc aagctcctgg gctcaagtga
2580tccacccgct ttgacctcct aaagtgctag gattacaggc gtgagccatg gcacctagct
2640gacactttga ttttggacat ctggcctcca taactgccag agaatacatt tctgttattt
2700taaggcttcc agtttgtagc actttgttac ggcagcccta ggaaacaaac gcaggtacct
2760gaaataacag aaagcttcag aatagtatta tcataaggct ccagaagaga ataccctgat
2820agtgtataat ttgtattttg aaaaattatc ctgtaaaatc tgggcttaaa ttatctatta
2880cagctaagag aaaatatata cttacctttc tagtcaaaga aagtagtatg gcatccatga
2940aaattctaaa acctacatgt tcaccatgag tcttgatata aacctaaaag agaatttacc
3000atttattatg ttagtctaaa gacgctggca gacttcactg aggaaaagct tgtcacagtg
3060ctcattcgaa tgatgtttat aaaatgattt agctaattgt agccaaatgt tcaaaacaag
3120aaaaaaaatc actaaaacaa acaagcaaaa aatcatgtgt tgtaccttgt tatcctttaa
3180ggtgtagtga cataggctct gcctctgtcc aaatcttttt gggatttcta ctgcaatctt
3240ttctggatcc acagcaggga aatgtgccag atctctctga atctgagcaa tggtttcagg
3300gcagttcatc tcccgtagcc aggctgcact atcttgcaga ggacagtcac agttctcatg
3360gtaaaccggc cctatttaca taagaataac aaagtacaac agtgatcatt tcactcagca
3420ttcgaaactt gtaataaaca tctataatgt ggcttcatga attttgctta gttacatagg
3480aagagcattt tcattgaaaa cacattttaa gagaattaca ggccagcaag gttggctcat
3540gtcggtaatc ccagcacttt gggaggctga ggcaggaaga cagcctgagc ccaggagttc
3600gggagcagcc tgggcaacac agggagacca tgtctctaca aaaaataaaa aattagctgg
3660gtgcgttggc atgcacctgt ggtttcagct acttgggagg ctgacgtggg aggatcactt
3720gagcccagga ggttgaagct gcagtgagcc atgtttgcgc cactccatcc tgggcaagac
3780agtgagagcc cgtttcaaag aaaaaaaaag taatacattt ttactatact ggggcattgc
3840aagtaagtca gccttctact ctgacagctc ttcacagtga agttttgtga tattttcata
3900ataataaggt ctactttgca tgagttcaaa agaaaggaaa tagaggctgg gtgcagtggc
3960tcacgtctgt aataccagca ctttgggagg ctgaggtggg tggataactt gaggtcagga
4020gtttgagacc agcctggcca acatgttgaa accccatctc tactaaaaat acaaaaaagt
4080ttagctgggt atggtggtgt gtgcctgaaa tctcagctac tcaggaggct gaggcatgac
4140aatagcttga atctgggagg tgaaggttgc agtgagcgga gattgtgcca ctgtgctcca
4200gcttggggga cagagtgaga ctcttgtttc aaaaaataaa agaaagacaa cagatacttg
4260gtttatgtct atggtaatat tacagctcat tattccttat gttttgcaga ctgaagaata
4320accaaatact ggtaaaaggg tatcagagga gcatacctat tttaatgaaa gacaaaagtg
4380atatgattta ctattttgat ggtctgttat acaaacaggt tcttaaagtg tctaaggtac
4440ttttctcaat aagtaaatca tcaataacag caaaaataaa caaggaactg ctattctttt
4500tttttttttt tttaagaatt attaaggagg aaaagtaggc tgggttcttg gctcatgcct
4560gtaatcccag cactttggga ggctgaggcg ggcggatcac gaggtcagga gatcgagacc
4620atcctggtca acatggtgaa accccgtctc tactaaaaat acaaaaatta gccaggtgtt
4680gtggcgcgtc cctgtagtct cagctatttg ggtggctgag gcagcagaat cgcttgaacc
4740cgggaggcag agcttgcagt gagccgagat tgcaccactg cactctaggc tagtgacaga
4800gcgagacatc gtctcaaaaa aaaaaaaaaa aaaaaaagca aaagtagctg ggtacagtgg
4860ctcacgcctg taatcctagc actttgggag actgaggtgg gcagatcact tgagctccag
4920ggttcgggac cagtctgagc aacatggcaa aaccctgttt ctacaaaaaa tacaaaaatt
4980agtggggcat ggtggtgagt gcctgtagtc ccagctactc atgcagctga ggtgggagga
5040ttgcttgaac ccaggaggtc gaggctgcag taagcagtga ttgcaccact gcactccagc
5100ctgggcaaca gagcgagacc ctgtctcaac aacaacggca acaaaaagaa gcaaaagtaa
5160ttctcaaaac agtccacttc actaatttta taacaaatta attacagtct gcactgaggt
5220ttttactgtt attccttttt ataattctca gatcccacct aacccaggca gtggctgaca
5280atggaatatc tttttaaggt ttagtgggtg atactgtacc aggctgtact ggcagaatgt
5340aggaaaggaa cctagacact cttgaaaagt gtttaccttt tcttacttct ctgcagagtt
5400cacaaaaata aaaaaaaaaa agtttacttt tcttggggtt gttaaggggg ggacaagatt
5460tctgcctttg tatatacact gcttccctac tgtcttgtgg tactgtcgcc tgtaagaggg
5520aaggagatgg ctctaggtaa taaaactgta ctctcatcct atataagaaa catcagaatg
5580gccgagcatg gtggctcaca cctgtaatcc cagcactttg ggagactgag gcaggcagat
5640catgaggtca agagatcgaa accatcctgg tcaacatggt gaaatcctgt ctctactaaa
5700aatacaaaaa ttagcctggc atggtggtgc gtgcctgtaa tcccagctac atgggaggct
5760aaggcaggat aatcgcttga acccggaagg cggagcttgc agtgagccaa gactgcgcca
5820ttgcactcca gccttgtgac agagcgagac tctgcctcaa aaaaaaaaaa aaaaaaataa
5880ataaataaat aaataaataa atgtcagaga atttctctgg gaaaatggca ttggaacaaa
5940gacaaaaaaa caaccaccag tgcgctcctt gtctttcgag ctatctccct tcctgaagtg
6000atctacctaa tttcaataac acttatcagt tacttaagtc atatatttcc aattaagaaa
6060gtatcatata tgagcatgaa cacagcttga ttattcttgc taatgtatgt cttcgcggag
6120taaattctct actaatatgt tccttgctct ttagcaaatc agaatttccg ttcaaatctt
6180gaatgtcttt ttaccaggac tcataactta gctttcaagt agaaagcctt tatttttctt
6240ctttcagtaa aaaaaaaaaa aaaaatatat atatatatat atatatcatg ccaagttgtt
6300tttgttgatg gaataacata tattatttta ttcaaatact gctgttataa aattatattc
6360caaattacct tttaaaatat atggggattt ggccacatgt tgcccttgga atttaatttc
6420caccttcaga tttttgtagc ttgcatacat tctgtatctt actatgaagg acccatcttt
6480tcggtctaaa acctggactc caactctagt gaattgctcc tctggtgctg agactttcac
6540ctggaagacc ttttcgcctg gagaagatgt gaatctgtga tgaaaaggca atgggggaga
6600taaacagatt ttaacgaaca aacacacaaa ggttatgctc ttcagtctgg aaacttgatt
6660attctcctcc acattcttcc tctaacatat gattgcttct tcttctctta tgcttttagt
6720accagaatga atatgatgga tcagagattt aaaaaaatga ggatgacaat ggccaagata
6780acatgaaaca gtttcattaa acagtcacaa tggagtatgt taagactatt aatctatata
6840agcacagaac atactcatgc attccatatg gaaaaatatt cttactaact tatttcatgc
6900tatcacatct ggatatttgt gatcaaatga tgctcccaaa gataatgatc ttattttgaa
6960aaatctctgc caatgaatct ttacaatatc agtattataa atcacgctca ggcctggcac
7020tgtggcccat gcctactatc tcaacatctt gggaggccaa ggagggaaga ctgcttgagg
7080ccaggagttc gagatcagcc tgggcaacaa ggtgagatcc tgtctctaca aataataata
7140ataataataa aaaacaaata gccaggcgag gtggcacacg cctgtagtcc caactactag
7200ggaggctgag gtgggaggat agcttcagcc tgggaggttg agggtacagt gagctgtgat
7260tgcaccactg cactccagac taggctagga agtgaaaccc tgtcttaaaa aaaaaaaact
7320cacgctcagc catgggcaaa agcaaagtag taggtagtgt atcatctttt gccaaataac
7380aaaatttaca acctgccttt tacttgacaa ccttttattc cagtattttt ttctgtatga
7440aatacacaca gtcagcatcc accaataata gagacaatca tttcagaagt ataggaatta
7500taaaaataag ttttaagtat gatgttaaag tattaaattc attggaacaa aatatttgtt
7560acaatcatat aggagattaa tacttcgata ctttccttcc gaactctgaa gtttcaaata
7620gtacttcatt tttctttttc agtttcaaaa catgcttacc tgcagatgtc tccttgaaaa
7680ttgagggccg caccagagaa agatgaagcc catcaagtct ctaacaacta ggtttaaatc
7740ccttataata gcttatgcga aaagtagtta gaaacagtat ttccagcaaa gtggagggac
7800tttgtcttgc cctgataatg tgggaatctg aatcaaaatt gctaggagat tattttatat
7860gttatataca atacattata ttgcatatat tataattcgg aacaatttct acagctacag
7920agaaaaatgg gtaaccaaat atcctggaat ataaactgct gtggagaaac gagttcttgg
7980aaagaaaaat cttaaaggga gaaacagatt taaccgccat ccaaaatagg taaggagccg
8040tttctcgact tacttattcc ctgatgtatc cactgcctga atatagaaat agcgggcggg
8100aaggacgacg tctgctttta gcccgggtcc ccatatttcg ctcttctccg ggctcagctg
8160cctttctccg ccggtctcgg cgagtgctgg aactgtcgcc agaaagaagc aataaagtag
8220caaagtgcca aacatttaca agacggactc tcgaaatgat ccaccgataa atgaagaagt
8280gtaagaggtg gacagaagca gccgagcctt cggcagggac ccgccggccg atcgcagcag
8340ccaacgcgac tgcaaaggtt gccgcccggc gtgcagggca ggcgcgcggg tctccgcgac
8400cccaggacaa tcaaagcccg tgccccggcg cgcccaggtg agggtcccct ggcgttctgc
8460tgtcccggcc gagaaccgcg ctgctcctct ctctcaggac aatgatgaac ttgagttgct
8520cctgccactg gagcatcatt tgggagcgaa tccgtctcag gttccagcca aggtgtaggg
8580cgaggggatt ggcccgtgcg tcgggccagg ctcagtaacg ccttctccaa gtggatggcg
8640gggtggacac gcgtcccggc gccccgggct ccctgggata tgtagttcgc gacaggacga
8700gcggaaatac tgccaggatt ttaccacctc tcgcccattt atttacttct cggtcaccgc
8760tttcggggga cagataaaca ccacagatgc ccatcaaagg ggcgcacggg tctggaggcg
8820cagctcaggt ttttgcgttg gtcaccctgc cctccgcacg tggagagggc aggcataaag
8880caccttgaaa ggaaggtgct gtcaatgcta tccgacgacc tgtcgccggg caccgcagca
8940tcctcgctcg ctccgatggg acgagggacg ccggccccag ggtaacagga ggcgcctcgc
9000cggccgcgcg ctggatgctg tgatccaggt ccggagccgg gttccgccgc ggccgcagcg
9060acccgacccc acccgacagg ccagaggtac cccggggcgg ggggcagggg ccgaggtggc
9120ggccggctgt gcgctctgag cgcctggccc tccgctgggc acctgggcgc cgccagcccg
9180gcctgctgcc gctctacgcg cagccacctg ggcattcaaa atttttactt aattcgatac
9240cggcctgggc tgccaggggt catcgcctct ccgagccccg tggcgtccag atggaggcca
9300ctgcatgggt gcgcgctctc ccgggaagga gtagggggaa gagctgtgtc gcggggggaa
9360gagaagcgcc agggaaagaa gggcctagcc ctctggtgaa caaagctcga ttaggaggtg
9420tccatgtgga taccggtgac ccctgtgcgg ccgtcggtct ccacgccacg ctgggcaggg
9480tccgggaacc agcgcgcagc ggccgtcgcc ttcccctgca cgccagcacc cgggtctggg
9540ccgccgccag gagtcacggg ctcaccgccc tggtcagctt ggcagtcgga cccggagccg
9600cctcctctgc ctgcctccct cttgccagct gccccgaaaa cccagaagag ccggtggctc
9660cgagccaagg cgggcctggt tcggcgccag gaaagggggt ttctttccta ttttcttttg
9720tgcaattgtc attattaatg ataccgactc gtttactcaa acagtcgaat cggagcccca
9780gctcttagcc cggatgcagc aattgcccgt gggccccctt taacaccaac agcgtccccg
9840gggcccgggg caagcatgtt cgaggcggtc acccccgggc ctcggcgcgc tccccctggc
9900ggagagcctc gtcttccggc cggtgaggga aggtagagga gggagtaggg gcgaggaggc
9960ctcggcggcc cttgggctct gcgggctggg gactcggggt gcccgcgaca cgcgcggagg
10020cgcgggctgg gttggccacg ggcagggagc gcagccgcgc tccttcctct ctgcccgcgt
10080cgcctccgcg cgcactggtt ctgcgcggcg gggcttggcc tgcgcgactg tctactccgt
10140cccggcggcc tcggagcccg gccgagcggc gagcttgtca gaggacggtg gtggaaacgc
10200tcccggcctc cccaggggcg cgggctggag gctggcgcca ggcgcggagg actcccggta
10260tcttttgaca ggctggcgcc tcggctctgg ggacccgcag gtctgaaggg gaggaagggg
10320cctggagggc gcgggaggac accgggtggg aagggtggca ttagctcggc cgggggctat
10380gcgcctctgg tttcgccctc ccgcgcatat tcgaccctta cgaggtcacc ggaatgcccc
10440tgctcctcag ttgccttcta tacaggatat cgatcagggt attttgttat acgaaaaggc
10500tttactgaag aggtttttag agatgtttgg ttctctcata aacttgatac ttgagaatac
10560agacaaaata taacctgaaa agactacaac ctaggcgatg aagattggct ttacaaatgg
10620acgtttattt tacagaacac ttcgttcagt gactttgaac aatcatgact ctggcggtgc
10680tttttaaact tgccatttta taaatttttg ctttgcatac gagcaaacca tatttctatt
10740gcttatgaca tgattttatg agtaagctat tagttgagcc tgaggtcctg cagtcattct
10800tagtagtaaa tttttttttt tttttttttg agacggagtt ttgctctgat cgcccaggct
10860ggagtgcaat ggtgcaactg taacctccac ctctggtgtt cagcgattct ccagccttgc
10920ctcctgagta gctgggccta cacgcatgcg ccaccatgca cggctaattt tgtatttttt
10980tattttttat ttttttttag tagaaacggg tctcaccatg ttggccaggc tggcctcaaa
11040ctcctgacct cagatgatcc acccgcctca gcctcccaaa gtgctgggat tacaggtgtg
11100agccaccacg ccggcttagt aaatcttaat atagcaacac ctcacttgcc tggaagaggg
11160aaccgcaatc aatcaaaatg agggcctaca gtaatgcctg gcatgatgca aacacttaaa
11220aattatctgt tgaatgagac gtctacaaat cctaggccct ggggatacaa taatctggaa
11280aaccagactt gcaggatgca gacgttgatc atatgaacag atatgcacag aagtaagtgt
11340aaaattgcca cctggtaaga cctgtgtgag gaaggtacta gagtctgtac cgggtcacct
11400ggcctagttt gagaagccaa gaaggtttcc cccagaaagt gacatttgag ctgacattgg
11460aaagatgaat gggaatgagc taagtaaggg agacagtatt gggagaacca aaaagtaatg
11520tggagcttgg ggggtggggg aaaggaatga gatggagcta accagataga tctagggacc
11580atcaggagtt ggccttttgt tctaagagca gaggctttca ggcagaggag ttctgtgatc
11640atatatatgt agcaaacttt attttctaat atctctgcac aggtcaggtt agaagtgtca
11700actcactggg aatagtttat aaatagagaa acaaaaggag ataaaagatg agtcaaaaca
11760gtgacagctg cagcctatta agtggaggga caaactgcct ctctatagct cagtattgtc
11820tataatgatt ctgttattag tattatcagt aataaattgt gcttagtgta ctttaagaaa
11880gctagaatct gagcatgcaa taatagaagc ccccttggcc tcttggggct ctcactatag
11940tggagagaat agacgtgaga cagtgtggaa agaaagtaaa cactagcagt gtttgggtcg
12000tgggatttgg gtaatttcca ttttcctgta atatcttttg gtactttgca tttttttgta
12060atgttttact tataaaatct atgaatatta cattttcaaa gagaaattta catatagttt
12120ccaatgagaa tgtttcatgc ccttggattt tagtgacagt caatataaaa tgcatcctta
12180tattgatgat cttcattttt tttttgctaa aacttcgacc aaataaatca tcttgttccg
12240tgaccattat ttaaaagcaa acaaactaaa aacacaaaca aaccagactg ttactttttt
12300ctctctttcc tttttttttt ttttttttga gacagagtct tgctttgttg cccaggctag
12360agtgcagtgg tgtgatcata gctcactgca gcctcaaact cctggcctca agtgattctt
12420ttgcctcagc ctcccaaagc atagatatta caggtatgag ccactgtgcc tggctcacac
12480tgttactctt tttattaatc tagtgctgtg ttctatcttt agcgtccagg aagcttaccc
12540ccaacttttg tgcttaaatg cagtcatttc cctttgccta tgtttttgat aagaatattc
12600tccatggctg ggcatggtgg cttgtgcctg taatgccagc aatttgggag gctgaggcgg
12660gaggattgct tgaagccaag aggtcaagac cagcctaggc aacatagcaa gaccctgttt
12720cttaaaaaaa aaaaaaaaaa aggttattct atatatgttc caaatgagca tacttttaca
12780atccctgcca ggtgcagtgg ctcattccta taatcccagc actttggggg gccgaggcca
12840gcagatcacc tgaggtcagg agttccagac cagcctggcc aacgtggtga aaccccatct
12900ctactaaaaa tacaaaaatt agccaggcat ggtggcacct gcctgtaatt ccagctactc
12960cagaggctga ggcaggagaa tctcttgaat ccaggagaca gaggttgcag tgagctgaga
13020tccggccatt gcactccagc ctgggcaaca gagtgagatt ccatttaaaa aaatcaaatc
13080aaatccctac actgtcacac agagagctgg tcccacaggc aaaattccat tcagtgtgag
13140gaaggaagcc ctgggaaagt ggaagccaag tctgagatga ggatataaaa ggggcagggc
13200ctggaacatt tccgtctcgc caccaaactc actctaataa cctttgtcta ttgcctctca
13260ccgagactat atgctctttc attcctcacc tcgcacagcc cacccccacg accccaatac
13320cacaaatacc tacctctctg tccaccacac tgatgtagag aaaggcatga aggtcacaga
13380tgagaagtag aaaatgctat gttaggacat ctgctgagaa tcagagcaac tctgtcttcc
13440aaaaagacaa gagtttggtc tgaacaacgc caggtactga gcttccctct gccatcaccg
13500ttgcaccacc agatgaataa ggagagagca ccacttccac ttgaggaccc actacaacta
13560ctccaagaat tttttttacc aaaagaaagt gaaagttttc aaagtgaaac cacaggaggt
13620tccacctttc gtggtaatat tcctatccaa ctgaccctct tgcaaacaac tataaactct
13680gcacaaatta ttttaaaact gaagagtttt ttgtttgttt gttcgtttcg agacaagatg
13740gagtgcaggt cacccaggct ggagtgcagt ggcgcgatct tggctcaacg caacctccac
13800cttcgagcct caagcaattc tgcctcagcc tcccgagtag ctgggattac aggcatgcgt
13860cactaccgcc tggctaattt ttatcttttt tttgagatgg agtttcactc tgtcacccag
13920gctggagtgc aatggcagga tctcagctca ctgcaacctc cacctcctgg gttcaggccg
13980ttctcctgcc tcagcctcgc aagtagctgt gattacaggt gcgcgccatc atgcccagct
14040aatttttttg tgttttttag tagagacagg gtttcaccct gtgtgccagg ctggtctcga
14100actctgacct catgatccac ccgcctcggc ctcctacagt gctgggatta caggtgtgag
14160ccactgcacc cggcccttac tgtctggcta atttttaaat tttcagtcga gatggggttt
14220caccatattg gccaggctag tcttgaactc ctgacctcaa atgatccacc cgcctcagcc
14280tcccaaagtg ctgggattac aggcatgagc caccacacca ggcctaaaaa ctgaaggttt
14340gaatagagaa aaagcatgct ttaaaagtaa agaaaatgga attttgccta gcatatgtgg
14400agtcctaata tgcagctctg tttccttaaa ttccatgaaa gccatgcagt acctttgcta
14460gtttctcctc acagatcagg ataacctagg gggctcttgt gtgaatcgtc ttctatttct
14520tgcagcctaa ctcataggct ttcgttgttc aatatttgta tgatggtttt gatactattt
14580ttggtaaccc atgacagtta tttttatttc taatttttta agtaagcaaa tgggcagaga
14640tattaactgg taaaagtcca actgatcacc cagggtggac tgaatctctc aactgatgct
14700ctgttgctgg agccctgaga aacccgcata ccctgcccgg gcacctgcct ggggttgtct
14760gctgcgtgtc ctgggatggt tcaattcacc aaggacttcc tctggtataa atcttcagct
14820tccttgcatg ccctcagttg ctatttaagc tttctgtttt cttccctaaa ggaatcagtt
14880tagacttgaa attcagtttt tcctgaaact gatcagaagt tagtgacacc ttgattggat
14940ccgtttttct gtcaggtgat gaatctttgg aaaatttact ttctgtattc tgtgtttatt
15000taaatctgtg gccgttattc atcatgtatc ctttatgcct atgtacgtaa aaaatcttgc
15060taatacatta ttttttagac aactttatgg aggtataatt cacacaccat ataatttacc
15120cattaagtta tgcaattcat tggcttttag ggtatttaca agttgtgtgt tcattgccac
15180aatcatttat agaatattat cattattaca aaaagaaact ccatcacccc caaaccccaa
15240gccctaggaa accatgaatc tactttctgt ctgcatagat ttgcctgttc tggacatgtt
15300atataaatag aataatacac tatctggtcc tttgtgacga tcttctttta ctcactataa
15360tgttttcggg gttcatccat gttgtagcat gggtcagtac ttcatttctt tttattgcca
15420aataatattc cattgtatgg atataccaca ttttatttat acattcctca gttggtggac
15480atttgggttg tttccatttt ttggccatta tgaataatgc tgctattaac atttgtgtga
15540agatgtattt tcatctgtca cggatatata ccttggcatg aaattgctgg accatatggt
15600aactctgttt aattgttgga agaactgttt tccaaagcaa ctggaccatt ttacatttcc
15660attagcaatg tatgagggtt atgatttctc cacgtcctca ccaacatttt tgattatagc
15720cattctagcg tgtgtgaggt gttaatctca ttgtggtttt gatttgtatt tccatgatgg
15780ctaatgatac tgagcatctt ttcatgtgct tattggccat ttatttttat ttttgataca
15840gtctcgctct gttgcccagg ctagagtgta gtggcgcgat ctcggctcac tgcaacctct
15900acctcccagg ttcaagtgag tcttatgtct cagcctcctg agtagctggg actacaggca
15960tgtgccacca tgcctggcta atttttgtat ttttagaagg gacggggttt caccatgttg
16020gccaggctgg tctcgaactc ctgaccccaa gtgatccacc tgccttggcc tcccaaagtg
16080ctggattaca ggtgtgagtg actgcgcctg gccttattgg caatttctgt actgattttg
16140gagaagacac tattcagata ctttgcccat ttttaaaaat tgggctattt gctggccggg
16200catggtggct cacatctgta attccagcac tttgggaggc agaggtgggc agatcacttg
16260aggtcgggat tttgagacca gcctgaccaa gatggagaaa aaacatctnt actaaaaata
16320caaaattagc cgggcgtggt ggcgcatgcc tgtaatccca gctacttggg aggctgaggc
16380aggagaatca cttgaacccg ggaggcggag gttgcggtga gccgagattg tgccattgca
16440ctccagcctg ggcaacaaga gtgaaactcc atctcaaaaa aaaaaaaatt gggctatttg
16500ctttttaatt attttttaat tatttgaaaa taatttaatg catattttag actaatttaa
16560aaaataagat agtgattgtg actccagtca tatagtagtt gtaaaattaa tatagaatga
16620aggcatatgt atgcataaaa cttgctatgc tttttagtgg ctctttgtgt atctggtgga
16680ttgttgatca ttctttttcc ttcctcttag gagctcattt tgcagctctc aagcttttat
16740agcatgctgt aaacaattgt caaagttgtt tatcaagaaa cagatagagt tgcaacttgt
16800ttctagtaat agaaactttt acactgcatt caatgcctaa cgttgcagaa acagaaaggt
16860caaatgattc tggaaatggt gagcacaaat ctgagagaaa gtcacctgaa gagaatctac
16920aaggtgctgt aaaatctttc tgcacaagtg cctcaggagc acccttgggt cccaaaggag
16980atggtcatta tccatggagt tgtccagtga ctcatacacg ggaaaaaatt tatgccatct
17040gttcggacta tgcctttctc aaccaggcga cctcaatcta taaaactcca aatccatccc
17100gctctccttg cctccctgat agtacctctt tatctgctgg aaataattca tcaagataca
17160ttggtatccc gactagtaca tcggaaatta tctacaatga agaaaatagc ttggaaaact
17220tatccaacag cctgggcaag ctacctctcg catgggaaat tgataaatct gaatttgatg
17280gggtgaccac aaattcgaaa cacaaatcag gtaaggaggg agccatgaag ttcatatgtg
17340aaaataatga gaaaacaaac actatgtctt gtttaatctt gccattacac atagtttcct
17400tgtataatac tagataagga acatggctat catcttgtct gtcaatgtag ttttaggaaa
17460gtaaccttga cgtagggcat gtagttcatt gcggggcttc cactggaaac ttcaagcata
17520agctttgtat caaatatttt gagagatttg aaaatctaat aatgtaaaat attataaaca
17580gatggtagct tagaaaatga aatgttaata acatggctag aataacttac tactgtttca
17640tagttttata ggcacatgaa gttgtatttc ctgaccaaac atcttttttt cctgctatat
17700aatgttttag ctttttttgt tgttaaaatt ttttaggcac ccagcaaagc ctccatgtac
17760caccaagtgg gttgtgtact gctcaattta agaggatgct gtttaccgag gttgtgcata
17820actttcacag ttgcaatggg gtggtcctat ggcagataga aatactttac acttctttct
17880tttgaattca aagtaataca ggaattttat gaggcaggta ctgttagccc catttgtagg
17940tgaagaaatt gaggcttaga agggataaat ctctagaccg aagttgcaga gttaataaga
18000ggaggtcaaa ctaggatttg tatttacttc tagtcttctc tgatgatttt ataaaacctt
18060aatgcttctg cttgtttatc tgcaaaatca atttgtttca tagaatttta ggtaaatttg
18120taattcttaa gggtaggagg gggattttgc tttttttgtc taaatttgta ggtaatgagt
18180ctgcaatttt ctttttgtct aatgcctttg gccagttttg gaattagggc tatgcaaatc
18240ttctaagaaa agttgggaag tgttttttcc tcctgttttc tgaggatttt cagagagata
18300ttaaaatctg tatttataat tatggagttg tcttgatttc tgttaatttc tgacaaattt
18360taatttgtcc atgtcattta agttgtcaaa tttgggggct taaagttaat gttatccctc
18420taacatctaa ggaatctttg ttgacaactc cttttgcatc ctaatactgg taattaattt
18480attttctcta ttttttttaa aactgatcaa tctagctagg agtttatcaa ttttgttaat
18540cttttcaaat aactagcttt tgttaatttt ctcaattgtt tgttttctat ttcattggtt
18600tctgctcttt attatttgct tcctctgact tactttgggt ttattttgtt cttcattttc
18660tgacttctta agatggaagc ttagatcatt gattttagac ttttctacca taagcataga
18720atactccaaa ttgctaagta ctgctttagc tgcagctcac aaattttgat atgcttatta
18780ttattactta attggaaata ttttctaatt tcctttgcaa tttattcttt gatacatgta
18840ttacttagat gtatgctgtt taatttctag atattaatag tttttctaaa tattgatttc
18900tagtttagtt ccattgtgga cagagggcat atgctctcag tctttttaaa tttactgaga
18960cttgttttat gacccaacat atggtctatc ttggtgaatg tgccatgtgc acttgaaaag
19020aatgtgcatt ctgcagtcat tgggagtatc tataaatatt aattatgtcg aagtgtttga
19080aagtgtcatt cacatctttt gtgtctccgc ttaacttgtg tcttgttcta tcaattacca
19140aaagaagggt gttaaaaatc ttcaactatg attgtgaagt tgtcttttct ccatttaatt
19200tcttttttaa aaactaatac atgtctaata acagaaaatt tactgtctta accattttta
19260cgtgtacagt ttagtggcat taagtacatt tacgttgttg tgcaaccatc atcactattc
19320atctccaaat ctctttcctt ttcttttctt tctttttttt ttttttctga gacagagtct
19380cgctctgtcg cccaggctgg agtgcagtgg cgcgatctcg gctcactgca agctccgcct
19440cccgggttca caccattctc ctgcctcagc ctcccgagta cctgggacta caggcgcctg
19500tcaccgtgcc cggctacttt tttgtatttt tagtagggtc ggggtttcac cgtggtctcg
19560acctcctgac ctcgtgatcc gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg
19620agccaccccg cccagccctt cttttcatct tacaaaataa actttgtacc cattaaacaa
19680taaatcctca ttccttctcc tcacagtccc tggaaaccac aattttactt tttgtcactg
19740tgaatttgac tattctaggt accttatgta agtgaaatca tatagtattt gtctttttat
19800gactggttta tttatagcca cccctgctct cttctgggtg ctatttgctt tggaatactt
19860atttccatcc ttttactttc agcctatttg tatctctaga tctaaagtga gtgtttcaga
19920gacagcatat agttagctaa ttttttctgt ttctgttttt gtgtgtgaac ccacgtatac
19980aaaaacatat ttttaaaaaa atgcattctg ccagtctcta tgttttgagt ggagaattta
20040atccatttac atttgaagta atcactgata aggagagact tgtcatttta gtacttgtta
20100tttatatgta aacactctgt tcctatactg cttcgtccac acttaacttt cagttattga
20160tgttatcaaa ttacaaattt atatattgtg tgtctgaaaa cataaactaa taatttttat
20220gtattaatct cttaaataat gtggaaaaca aaatgtggag ttacaaacca aagttattat
20280aataatagct tttttatttt tattttttat tttgaaatag ggtcttgctc tcttgcccag
20340gctggagtgc agtggcacga ccatggctta ctgcagcctt gacctcaggc ttaagcaatc
20400ctcccacttc agccttctga gtagctggga ctacaggtgc acatcaccac gactggctaa
20460tttttaatat ttttttgtag agatagagtc tcactatgtt gccaggggtg gtcttgaact
20520cctgggctca agcaatcctc ctgcctcggc cttccaaagt gctgggaata caggcatgag
20580ccactgctcc tggcctacta ctagctttta agctaataat aatttctcaa aaatgtatta
20640gtctcataaa tcatatagaa tacaaaaact ggagttgcaa actaacaaaa taacactggc
20700ttttataatt gttcgtgtat taccttcact gaggtaaatt tatttcttcc tatggctttg
20760aggtactagc taatgtcctt tcatttcaac cagcaggaat ccttttagca ttccttacag
20820ggcaagtcta gtggtaataa actccttcag tttttgtttg tctgggaatg tctgagtttc
20880ttcttcactt ttgaaggaca gttttgctgg atttagaatt cttgtttgaa tttttttttt
20940atttcagcac tttgaatata tcagcccact tccttctggc ttccaaagtt tctgatgaga
21000aatctgttga tattcttatt aacagtccat tgtatgtgat gagttgcttc tcttgtgttg
21060cttttaagag tctttgtctt tttgcaattt ttaaattttt tatttgatac ggagttttgt
21120tcttgttgcc caggttggag tgcaatggca cgatcttggc tcaccacaac ctccaccttc
21180tgggttcaag caattctcct gcctcagcct cccaagtagc tgggattaca ggcgtgcgcc
21240accatgcctg gctaattttg tatttttagt agagacggag tttctccatg ttggtcaggc
21300tggtcttgaa ctcctgacct caggtcatcc gcctgcctca gcctcccaaa gtgctgggat
21360tacaggcgtg agccaccgtg cccagccttg caattttatt ataacatata ttggtgtggg
21420tctttctaag ttcatcctat ttggaatttg ttgagtttct tggatgttta tattcatgtt
21480ttttaaatca aatttggaaa gtttttaggc attatttgtt tggataatct cctacccctt
21540tttcagtgtc ttctccttct ggaacttctg caatgtatag gttggtccac tcgatggtgt
21600ccccaggtcc cttaaggtct gtttactttt cttcagacct ttttttttct gttcctcata
21660ctcaattatt tcagttttcc catcttcaag gtcactaatt tttctgcctg ctcaaaactg
21720cctttgaatc actctagtga gtttttcatt tcagttatta tatttttcag cccaagaatt
21780tatttttggt tttaagtttt ctatctcttc attgatattt ccatgttctt catacataca
21840tctttcgtta gttcttttgg catctttaag atagtggttt taaagtcctt gtctagcaag
21900tctgccattt ggtctttcca ggatggtttc tgttggttta tatatttttt tgtttctttg
21960aatgggcctt attctgtttc ttgtgatttt ttgttgttgt tgttgttgga aactagacat
22020tcaaatatta tttaataatg cagtaactct ggaaatcaat acccatcttt cccagagttt
22080gttgatttta gtttttgttt tttgattgct ataggctgtt tctatgctgg taatcaacct
22140gaggtataca tttaagggct tctcagggaa aagcccctgc ctttctctga gtgtgtacag
22200tgattttcta catttccctg tataagtgat tgcttttgaa tatactgttt ttgaaatgtc
22260tggctcccca aacagaaaat ggagaggaaa aaaaacagaa caaaaaaacc acaccaaagg
22320tgctggcccc ttaagtcctc tggaagttgc ttctgttgta gtgggagggg cttgcaacat
22380tggagggaga gttgcaacaa tggctgcccc cctgtatctg tgctttcaag atcagaagca
22440gcagtcagca atcagaacac agatttccaa ttttcatagg acatggacct ttttgcccac
22500catggttccc acaaactgcc tgaagcttct ccaggagcat atgcacagtt tcttggactt
22560ggggataggt agttgtacct tgctatgtgc tgaaattgac caaaatttgc atttactgtt
22620caggacttcc tctgaaagtt gcaatccttt gaatagactc tggaattcca aaataattac
22680atcagacaga ttctgccagt gcagttgtct aggtggggag acagattcct ggtgcctcct
22740acactgcact taattctgtt aattttttta aaaatgtaat ttgaagctct gttattaagt
22800tcacatacat ttatgagatt taggccttct tcctgaattg acccttttat cattatgaaa
22860tatatttcgt tattcctgga agtatttctt gttctgaaag ttactttgtt tgatattaat
22920aaagctcctc agctttattt tgattatatt tgcatggtat atatttttcc atctatttat
22980tttagttcta gaatttacat ttttatagct tctatttctc atctgagatt tgctatattt
23040ttcatttatt acaagagcat tttatctatg tcataatgca gagttataat atctacttta
23100atttcttatc tgcaaatttc aaaatctggc taatctcaca gtaggcctta tcttttttct
23160tgagaatgag tcatattttt cactttcctt atatgttcta taattttgca ttgtatccta
23220gagagtggga ctattatgtt atggcaactc tggtatattc cttcaaagtg ttgatttttt
23280tttggttttg ggaggcagtt aatttggttg aactaaaact gtaaattctg tctccttggt
23340acagctccaa ccatctcagt tcagctcttt tagctgtgtt gcttgaattt gctccttaca
23400tgcatggtcc aggggtcagt ccaaggtttg ggccaaattt atacacagaa tttgggcccc
23460ctcttctctg tggctctctt ctttctgggt ttcctctcac ctcactttca gtgatggtga
23520ttgcacagga ctctgtcctt tggttcttcc agtaaaaaaa gacagtggat tttctgcttt
23580tgtttgctct tctcatacct ttttctaaaa gtcagtgtgg tgaatacata tatttgtgtg
23640tgtatgtgtc tgtgtgtgtc tgtctgtctg tgtgtgtgtt tggtgttgag gctgtgtgtg
23700tgtgctgtgg gtagggtgag gaatggaaga aggaactcag taatgagaag gagcagtggg
23760cgaaggttct tgaagctgat gtgatggaag gcagaaggtg caggtgttgc tcttctatgt
23820tcccatctca aacggcgctt tcactttgat gtggcttctt tccctcaagt caaatgaact
23880tttgcctgtg ggaccagctc tttctgtggc agtgcgggca ttttatgaag tacgctcatt
23940tgggccgtat atgctaattg atcccagggg caatccgaga ttcaacaaaa agagaatagg
24000gggcaaggtc atggaaccct agaggaaagt gtgagtacag actgttggca gagcctatca
24060aggtcacaca tttctatgtt tcattagctt taacatttta agtggctggg aaaactactt
24120agggaattaa gtcattatta ggcagtgaag acatctcaag aaatgttaga aactaattct
24180tctgttatct gactctgtaa gagtcatttt accagtcaag gaaatttggc acaatgagat
24240ttggcacagt tgctcacctt tgccagtgat gttcagctct gtgaaaagtg tgttctctgt
24300aaaaacttag aaaaaataat taaaagggtc acgcatgatg gctcacactt gtaatcccag
24360cattttgtga ggctgaggca ggtggattgc ttgagctcag gagtttgaga ccaacctggg
24420caacatagtg agaccctgac tttgttaaaa atacaaaaat tagctgggca tggtcgcacg
24480cacttgtggt cccaggtact cctgaggctg atgtaggagg attgcttgag ccccggaggc
24540agaggttgca atgagtggag attgcgtcac tgcactcttg tctgggcaac aaagcaagac
24600cttgtctcaa aacaaacaaa caaacaaaca aacaaaaaga ataattaaaa ggaaggcgaa
24660acattgtttc ttgactttat agtcttcatt attattacat ttttacagaa attccctgtg
24720taataatagt tcctgagttc cagctgttcg taggtgtcaa atggtttctc tgtatagtat
24780cttgaaggaa taaaactgat ctctttccat gtttgcttac taggcatata tgtatataat
24840ctattttata atttatgaat gactcataaa aatgaaatat tagccttcag ttaattttta
24900taacagaact gttttaaaat agaatatgtg tgaaatatta aagtatttga gcatagctat
24960ctgaaatctt aatagtattt taatgaaatg aggcttggat gttatttatt gatattttac
25020tttatataat tttttactag atacttcact aacgatttag aaataaaact tataaaaata
25080aaaagtatag ggatggatac cccattctcc atgatgtggt tatttcacat tgcatgcttg
25140tatcaaaaca tcttatgtac ccctaagcat gtacacctac tatgtactct caaaaattaa
25200aaaaaattaa aaagtacaaa ttctcactct tacttttggc ttccttttgt ctagctccca
25260attctccctt ccccagtagt aaccactgtt gtttgccttg agggtctttt ggagaatgtt
25320tgtatgtgtg tgcatgtatg agtgtgtgtg cacatgcata catagatgca tacatgcaca
25380caggaggctt gcttttcatt ctagtttctg tgctagagtt gcattttagt aatgtatatg
25440tgtgcatcat tcttcattca ttctgtcgtc tctgtcggtc cagagatatt tttaaacatc
25500acatcatcac attaatggat ctggtgcttt tactttctta acagagttgc tttaatttgg
25560ttttattgtt ttatagaaag ccttcatttt ctggaaagtg gtctagttaa atttttttac
25620tgtcgataat ttgattttcc ggatgtggct aaagatttag tgattgactg ttccaccagc
25680taagtgacta gctctggaat tggagtataa gaacctgaca cgaaatagga ttcatggttg
25740tttaaatttg tttttattat ttctctttaa ggtaggggac tattattgaa aatacatgta
25800ttacttcatt gtgtttgcaa atatcacttt tagaacctgt ggcagttatt tatttgttat
25860tattttttct ttgcactttc tcagcagaat tattaaggca gttacttctt aattttggtg
25920cataattaat ttttgtcatt tcagtttact tcatcttaag cattttgctt attttatact
25980cttgtattct aggcaatgca aagaaacaag tttccaagag aaaaacttca gataaaaagg
26040gaagatatca gaaggaatgt cctcagcatt ctcctcttga agatattaaa cagcggaaag
26100tattagacct cagacgatgg tgatgttatc agtttttatt tttttcattt ttgaaatatg
26160taatatatgt tggcaataac gtagcagttt acaaattaat acattatatg tatcataaat
26220atttttattt ctaaaattat taccaggatt tctgaatatt taaagtcacc acaaatgttt
26280gaagtagatt aataatgagt tatgctggtc tttcccaaac tcggcttatt attagagtca
26340tctgtgtgac tccacgtaac tgccatgtag atttaacaat gattaacctt ttgatgtagt
26400taaatactta gcttttcatt gctgaagaat tttcaagtaa gtgactgatt tttattatat
26460ttcatctcaa gttacctaag tatgcatctc taaaaaatga cttttttcta cttaaccaca
26520atacttttgt catacttaat aagattaacg atatttccct agtgtctaat tcccaggcta
26580aattcacgtt tcccagattg ttctgaaaag gtcttcacat ctgtttgttc aaagtgggac
26640ccagttgtga accacatgtt tgcttgtcat gacccataat tctctcaatc taggtcagtc
26700tccaccccac tttccccctt gtcttttaaa aattctatta ttttaatcga caaatgagaa
26760taataaagat tgacttattg aagagacttg gccagatatt ttatagaatg tcctacattc
26820cagatttgta ttttcctttt agtttcctaa attatgcatc taacctcaat atttcctgta
26880aactggaagt tgaatacaaa gacttgatta aattccagta caacatttta tgcaagaaca
26940ctgcttaagt gggccctcat acttcaaagg tttcacatca gaaggcacat ggtatctgat
27000tatccagtat tagcaattct aagtgattac tgggttaagg tggtgacagc tgtattctac
27060attgacgagt tctttctttc tcctctggta ccctcaagta atcagtagag tgacagagtg
27120gcattaggta atagccaggt tccatcagct ttttacctta tggttttagc caaatgtatt
27180gacaacccaa acccattgat gatccttgtc tgaatatttg ttgcaaaatg gttatatttc
27240aaaattccaa tttttttttt tttttacttt tattacccag ccttcttctg tagctttcac
27300tgatcaactg gggctatttg gttatcttaa aatacagttc ctagtctaat gaaaacaagt
27360ttaattcatg acaactttta tgtattactg accctgattt gggtttttaa aattcctttg
27420tatgacattt tagaaatata tttttatcac ttatttaata aatattggac cctttgctgt
27480aggctgagaa acagagatgc acacaacatt gtctttgttt tcccaaggtt tacaatccag
27540ttgaggtgag gggactaaac agaaaaatgt taaatatctt aggcagtttg tcatgtgatg
27600ggtatgaatc actaattaac catgaaagga ggtaatgatc accagaatgc tttactaagg
27660tcctgggtct taagggagga aatatgaagg gttggagggt gaagggtctt tgaaagaatt
27720acagcaaagt gtctctggaa tgtgggtaat gaagagtaga gtagcttgtc taaagtgagg
27780ctcagagagg ccaagaaacc tgccagtgtc acatggctaa taagtagaag atcttgtatt
27840gaagctgaag agtcagactc aacattcatg ctgtttgcgt ataatacatg tcatgatctg
27900tttcttagat atcaggctca atatttagat tttttttctg agttaaagat aagccatgga
27960cgatttctta acaaaaggaa cttttatcaa gataaatttg gaatagttag tatgtaggga
28020ataatggaga agagacatta aagtcacaaa cattcatttt agtttctttg aaaagaaggt
28080tttgctcgtt tgtgcttcta atcacttaga tttaagcttt gtttcaacaa caaatctgtg
28140taactacata ttaatagtat aacatgtgag gccatgaagg ctaggactag gaaggataga
28200tgggagaaat gccctgaggg gaaaacatca ccaggactat aagcctgttt tccactgaga
28260atattacaag attaggttag aatggaaagt cccgtagaag atagtagaat gttggtaagc
28320attttaacct ggtcataaac aagactagga aacaagttta gattgatgaa aaagtatgaa
28380aaatcttaag agaaattccc ctttccctct cctttttaaa aattctttca ggccaaattt
28440agtgttatga gaaaaactat atacatgcat gcaatacttt gttagtaaat atatgtaaag
28500ataaatgttg taaaatataa aacactgtta gtattggact tgtaaaataa tccccaaaat
28560tgaactctgg caaaaatatt cacagcagaa aaaaaaaatc agtaaatgaa tgagttgctt
28620gtgttttatc agcaagggcc atatagctta tgatgacaga actgttaata atcacttgtc
28680cacaaaggtc tcttaggttg gagttcacac tcatatttaa acaactaggg cagattactt
28740gaataaggac attagtaaat tgctcatgtt tccagggata ggggaaaggg gtaaaaggga
28800gttgtgttgt tcagtgggta aagagtttca tttttgcaag atgaaaaagt tctgttgcct
28860aacaatttgc atatagttga cactactgac ctgtacactt aaaaatagtg tacatgataa
28920atttgatgtt gtatgttttt atcacaatta aagaacaaag caaaactgag ggaggactca
28980atagatttga cagtaaggtg ttgaagacag cacttagtat gttaagtgat agttattagc
29040ttttgtgtgt cattttgtgt agtctacaga tttagggctc gtggcacacg ttctcactga
29100taagtgagag ccgaacaatg tgaacacatg gacgtaggga gggaacaaca cacactgggc
29160cctgtcgcgg ggtgggaacc ggggacagga gagtatcagg aaaaatagct aatgcacaca
29220gagtgtaata cctaggtaat gggttgatag atgcagcaaa ccaccatgac acacatttat
29280ctatgtaata aagctgcaca tcctgcatgt ataccctgga acttaaaaaa aaatattgaa
29340gggtcatggt taacttgtat gtgtgccaat atgcatagta tatatacata tacaaatgca
29400tatatgtaca catgcatatg tctaattttt cattttgtaa ttaattttta gaccacacta
29460aagttgctct tagtgatgat agggcttgtt tctttgtttc attgtggcta cttcttagca
29520ccttctttag aaagcagtta ggagaagatc atttgaaggc caacgagtgt tgtgggttat
29580tgttaagctg atttaacatt atctcccccc acaaccacgc ttgactagct tcacatttgg
29640ccaggtgcag tggctcacgt ctgtaatcca gcactttggg aggctgaggt ggaagatcac
29700aaggtcaaga gattgagacc atcctggcca acatagtgaa accctgtctc tactaaaaat
29760acaaaaatta gctggttgtg gtggcgcacg cctgtcgtcc cagctactcg ggaggctgag
29820gcaggagaat tgcttgaaca cgggaggtgg aggttgcagt gagccaagat tgcaccacag
29880cactccagcc tgggcgacag agtgagactc tgtctcaaaa aaaaaaaaaa aaaaaaaaaa
29940gagaaagcaa acgtatttct tcttaaaaca gaataaataa atagtctgtc tctttctcct
30000tctgttcaca tttgccccag tttcttctct tgaatcatga cagtttggaa aattgtctgg
30060attgcttagt gccactgaat catgccatgg aaggattttg tatttcactt ttaaacttct
30120ctgtgacagg agaagcactg cttcatggct tcttgcccaa ggattttaga tggacacagt
30180gggtaataaa tggatgaatt tttgtttggg ttgaagaatc tctctgagaa gttgacacgt
30240gggggcaatg gtttgtttct cttgtatttc tgaagttgca aataatcatg taagcagttc
30300aaccaggagt ttacaccaaa cttttaatag gcgatatatc attatttttt ttcccattgg
30360tttggataac atccacttta actggcagtt agtcatactt agctattttt gttaaagcag
30420gtgatttatt gttattttat atttatgaca tgattaataa gtgaatatgg aagattttac
30480attgacttag gggatcaaag ttttcattat attaacacct ttaattgcca tgagttttct
30540atttctagca tgcatatttt gtgttcattc aagtgaagaa aacagtcttt tgtgttctca
30600ggtactgcat aagccgacca cagtataaga cttcttgtgg catctcttca ttaatttctt
30660gttggaattt cttatacagc acaatgggag ctggaaagta agtatgtcaa tttatcagta
30720cccccaaact ccaaagtaat ttgatgttgc tttttctata acagaaaaaa atttaagaat
30780agatttttta taaatttaac aaaaccctgc tgtattttag tgtaagtctt ttagcttaaa
30840atatatacat gtattatttt cagtgaaata aaaatgggct gggtacagtg gcccacactt
30900gtaatgtcag cactttggaa ggccaaggcg ggagtattgc ttgaagccaa ggagtttgag
30960accagcctgg gcaacaaagc aagaccccac ctctacaaag taaaaaaata aaaacaacta
31020gccaggcaca atggcatgta cttgtatttc tagtctctta ggagactgag gcaagaggat
31080cacttgagcc caggagttta aggctgcagt gagctatgat cacgccactg cactccagcc
31140tgggccacag agtgagaacc tgtctctaaa aaaataaata aaataaaata atgaaaaata
31200ttactaatat tatttgcaaa tcagacaagc atattaacat tgagacaggc tgtatttgcg
31260tatgactgga attgaaaaat gaaaggcaat gaatgtttct tttgtagcct tccacctatt
31320acccaagaag aagctttaca tattctgggc tttcaacctc catttgaaga tattaggttt
31380ggtcctttca cggggaatac aacacttatg aggtatgaag accctcttag aggcaatatc
31440gtgtttctag ttttgcaaaa taataatgat gtagatgtgt gttgatagta aaccatgtat
31500ccagctgctg ggcttcaacc tctcataggt ataccaactt tgggctatgc ctgaatttct
31560ttagaattgg aataatgcca tctttgttgt agaattgttg ccatgaacaa atcttcttgt
31620tactttcagg tggtttagac aaattaatga ccacttccat gtaaaaggat gctcttatgt
31680tctatataag cctcatggga agaataaaac agcaggagaa actggtaggt aaacatatag
31740aagatttaca tacacacata cacacatgca cacacacaca tacacacaca cggtttagag
31800ttcgtctcaa agactgcctg tttagcttcc tgggacatta tgtaaaccct cagcacagtc
31860tgttacttgc atgtaatctg tagggcactt tttaataagc agtttaatat tttacctctt
31920ctaacctttt tatgtagaac ctttagaact ttaaggtata tcacaaacac tactctaaca
31980cattgtattg tttgggtccg tggtgtgtgt gtgtgtttgt gtgtgtatgt atgtgcgtgt
32040gtacgcgcac atgctagtaa aactctaagg aagcagcatg gagttagcag ttttcttcct
32100aagaaggaat gtaaggattt gagaatattt gatgttagac ctgtgtggtg acatataatg
32160gcaggaaaat agactgttga agtgacaaag ttctgctata aaatagtctt gtttataaca
32220gaaatattaa tggagttgat ctcaaggatt attttagtta tgtgctaatt aaataatttg
32280tactatttag caatttccat ttttaaaatt gttacaatct tttgactgtg taaatcagaa
32340aattgtaata cattgttaaa gaaaaaatta ttctgacact tgttaatagg gtgagaagac
32400tttattttaa acttgtgaaa ggcctattgt agtaggaatg agagatcagg ctcagctctg
32460aatacagcaa agacaacaaa gacaactggg actttatagc caagggacac agcgaggggt
32520ttgggggtgg gatattacta agagggtcaa gggtatgggg attcttgcta aagcctaaca
32580gacttcttgc taaaggcagg tcaagggctg agacatcagg agttggcagg ggacgtccgt
32640aaggaacttg attagatatt aagggttgtg ggttctctct aaactgactt agcaggattc
32700atgctaaaac tggacacaga agtctaaggt ccaggcctgt ttgagaagag gactcaagga
32760ttaatcagga agagaattgt tatcaataga gggtatatgg tttaccaagg taaaagactg
32820agcacaaagt cagatatatt tctgttgacc tatattgggt atatcaggtt ggttacttgg
32880aggcaggctt ggagattaca tgttgctttc tacacatact agttttgtga ccttgaacaa
32940gtcacttgct gagccttgat gtagaccttt tatctttcta tatttgtata gggtcagtgt
33000gaggataaaa ggagacatac ttgtaatgta cttagcacgc tgtgttacag agtaaggact
33060cagtaagtgc tagcatttat tcctactgac taaattctat gcattcacat ggggcctgtc
33120tatccttctg gaaaacacag ctttaaatgc tgtgttgcta tttatttctg tttctagtga
33180gactgtcacc acactactct cagtaagtga tcaggcaacg aaaaactaag aaaggatata
33240ttatcctgtt tggtaatttt ctcttcagct gtgtctagtt tgctttttat actgcctatt
33300gagtttgagt tttttaaaag aggttttcaa tcatttttat tctaaaatat gtagttcttt
33360taaaaatctg cttttaaaaa taacctcatt ccttactcat attttcaaac ccttttattt
33420tttaaatcta ttgaaacttt ttcctctgta tttggtgtaa tgtaggaaat gtgtatatat
33480ctgattttgc tgtctcttgt ttcttctgga tcttatttga ggtgtcttcg tgtgccatga
33540aaagaaaatg cataatttgg cattgtcttt tatagttctt gaatgtttac ttccacagga
33600tgtgccaaac ctttttctgg tccaaaagat ttttatggtc attctccagt tattagaagg
33660tataataatg attctgtcag ttgtgttcat taaaagtttt ataaaatgtc tgccaatgac
33720attttgcacc ctgtaaactg aatacatagc agttcagtga aagcaagtga gagggtaagg
33780ggtgtcaagg cacagtaatt agcaacctga gctggcacca ccatgtggag ttcctgcccc
33840tctctcagga tagcacatgt gtgacctggg acctttggat gtagggttgt atagtgtaga
33900cagcatccac actcaatcca cagaataaat actagtaaag cttaatttct ttttaaaata
33960tgagggtcac tataaataaa tgttctaata ttttgtttct gtgcccacaa agggggtcat
34020cttgtataac cacctggatg tgtcggtact gaaaatcatt aacctagagg agcagtagtt
34080ggcaaatgtt tttacataaa aggcttgata gtaaatattc ttggctttga gagacacatg
34140gttctggggc agtcacagct gcttgatgca gctgcttgga tcttgaaggc agccatagac
34200aacatgtaaa caaatggata tggtggtgtc ctgggaaaac cttactgatg gacctgaaat
34260ttgaatatca cataattttc acatgtcaca aaatattatt tttcttttga acttttaaaa
34320attttattat ttatttttta gagatgaggg tctcattccg tccactaggc tggaatgcag
34380tggctgtgat catggcttac tgcagccttg aactcctagg ctcaagcgat cctttcccct
34440tagcctccca agtagctagg actacaggtt catgccactg cacctggcta atttttaatt
34500taattaatta attaattttt tttttgagac ggagtctcgc tctgttgccc aggctggagt
34560tcagtggcgc gatcttggct cactgcaagc tccacctgct gggttcacgc tattctcctg
34620cctcagcctc ccgagtagct gggactacag gcgcccacca ccatgcccgg ctaatttttt
34680gtattttttt tagtagagat ggggtttcac catgttagcc aggatggtct cgatctcctg
34740accttgtgat ccgcccgcct cggcctccca aagtgctggg attacaggtg tgaaccactg
34800tgcccagact aattttattt tttaattttt ttcaattaaa tttttagaga tgggggctca
34860ctatattgcc caggctagtc ttgaactcct ggccttaagt gatcctccta cctcggcctc
34920ccaaagtgct gggattacag gcatgagcta ctgtgcctgg cccaattttc ttttgatttt
34980taacaatgta aaaaccattt tagcttgctg cccatttcaa cacaggctgc agtctggaga
35040ctggatttac ctgttggtgg gagtttgcca aaccctaacc tagaggatga agcctaaatt
35100cctgagcatg gtgtttgagg gactggcttt cgaatatcct gcttcagcct acctcccctg
35160gtgttcagcc atgctgggct cctcgatgtt tccctgttct tcacaagata caggtcatcc
35220ttaggcctat gtgcctcact gatgggatgc cctgctgtgg agcatgtgga atgtgggatt
35280tcatatcttt gtgattggtg gattcctatt tatttgtcca tcctggactt cagcgtaaca
35340aatggatatg gtggtgtccc gtgaaaacct ggggggatgg ctttttcttc tgcttgccag
35400gcaggattca atcccttcgt ctcctgggtt gtcacagcac catgtgtgcc tctggtagaa
35460ctttgatttc attgcgaggt aattcttggt gcaccgtatg gtttcatgca gaaaaactag
35520agttgccttt cttgaggatt aggccaagtt cttactcatt tgtgcatcac agtgcctgat
35580acttaggtgc tcagtaggag ttaatttaat gaattcgcaa ttgaggatat gagtcaggag
35640gaattcagga gattggggtt acgaaacttc taagagagaa ttttgttgtc cattccaaag
35700ctggcattct tcctttgcaa gagctctctc ttgtgatgtg gaaaggaaaa tgttttctct
35760ttccctttct gtagctcatc tttccttctg gctcctacct gctctcccct ccccacactc
35820ccagacacac actcacctgc acctgtgtgc tagatctgcc ctgctttcca tcagaaccca
35880gtgcgtcctc cttctgtctt ctggagtctc tcagttgaca gtttttcgtg tgttcctaag
35940gacaggaaat gagccaactc attgaatctg aagttttcaa ctttacatct tagcagtgcc
36000ttgaacattg ccttttacat tataaatagt aagagattaa gattgtctca gaatcacttc
36060ctaactgatt catagggtgg ctttatatct cttttaatgg ggtgtggacc aaaatgcatg
36120tctgggagta tagaaataat cactggaacg aatgtcatac ttacgtgtcg gatataatgg
36180aaagctatta caatacagtt ctcagtcttg tgtcccaaga ctgcattatg agtaaggccc
36240tgggagacac cagaacaagt ctcataattc cttttacttg ttttcttttt gcctgctgat
36300tagaagagga cttcatttga gtagttaatg gctttttcta ttaatatcag cagtatctag
36360atggagtgag aggatagatg aagagatggg tcagtcaaag taaaggaaat taagggaagc
36420aggaagaccc agagaaatta gaataagaga gaaaatctat caaagaaact ctgggtatcg
36480aggaaacatc agctatatac atttaatcaa attcccaaag taagaacata tacttttata
36540attaatggaa tggaataaaa cactaaaagg gccatggaga gagttgactt aggaatcact
36600attctggaat tttaaaagat ttttgatcaa ctgtcttgtc caactttaag attttatact
36660ataattttgc tagaaattct catttgtaga tcttttgaat atctgtactt tatttatttt
36720attttccaat agtctggtac ctcccaagtt ctttgtgact tatttaaatg taatgatttg
36780gtgcatgtgt taaccagttc ccttaataat ctaggatgtg tagcattggg tcccaatgac
36840tttttttttt tttttaacct atttgtttaa atttcctcca gcatacatcc aaacttttca
36900ttgaggaatg ggataaacta cctattttcc ttaaatttgt tttcttttgt tagggggttg
36960aggtgcttac tgtaatttga gtaagtttta atatattttt caagttttat tcctactttc
37020aatttttttt ttccagaata ttcgactttt tactgctttt ctgtgtgcat ttcctgaagg
37080ataattaatc tgcgtacgcc tcctggaata tttttgtctt tacctttgct aattggggga
37140cccagttaag ccctgcttca tctctgatat gttccttggc caccttagcc ctggattcct
37200ctaaattcct ctagtactga tttttcacac ccatcatttt ccatttgcta tgtgctatgt
37260tttattatgt ttcacctgtt caatgtgagt gtccagctcc ttgaccatag ggactgttct
37320gagtagccct tacatgctag atatgatgcc cacgacatta cacgttaccc tatgaattct
37380gatggtattc aggcggtagg gtggttcatg tgtaaagcag acgtctcttc cgttctctcc
37440cattgtggaa gaaagagttg cagagatagg ctttctgatt tctctctggg atgttcactg
37500gtgaaaacca ttgtcagtgg cgctatttgt ttaaaaacac ttttaaatct atcttctctt
37560tgtgcttaaa ttgtttttta aggaccacta agcctgtttt tcacaatggg ctttacgttt
37620tcctggggac cacgctgttg tgattcaggg actccccggt gcttatgtcc agtgggctct
37680tagtaactgt aaactaagca cactaatgga ataaaagtga agtccaggga ttttgataag
37740acttagctgt taatatcagt atttttttca gagatgtcac tagtcacaat gaaaatttat
37800tctaggagtt gccaacatta ggtctctgtt cagataactt ccgagtatgg gttcacttag
37860ccaccaacat ttgcaccagg caagtccaga ctcatttttt taatcagcta gcttgaaaat
37920tgcatttata ttcatgtatt tagttttttc tagtgggtat tgcagttcat aacttttgct
37980taacagagaa gacccgtctc tggcctcatt cctgtataac tcacacatcg cttcaccatg
38040attgttgtta ccagcttact gaagcacctc agtacatctc cggtggacaa tacattagta
38100ttctgttttc ctcatgtact ggttttctcc aattaatatt agacacaaat aataaatctc
38160ttgaacatgc agtgctaaca tacccctttc ctctttagga gctgtttggt gcttgcttac
38220agcataaact gcttagtgtc attccaagct gacacaaatt ggccttcccc tgctgtcctc
38280catgtaccag ttgtatttaa gatgtcaggg gaagctggca tagtggctca cgcctgtaat
38340cccagcaggt tgggaggccg aggtgggtgg attccttgag gccaggagtt tgagactatc
38400ctggccaaca tagtgaaacc ttttctctac taagaataca aaaaattacc tgggcatggt
38460ggcgtgcacc tgtaatccca gctacttggg aggctgaggc acgagaattg cttgaacccg
38520ggaggtggag gttgcagtga gctgagattg tgccatgcac tccagcctgg gcatcttttt
38580tttttttttt aaaataaaaa agatgtcagg ggaataagga ccttgcagag taatgttaga
38640ttgctggatt acctatcaga gtttcagaag ctattcttgt ctttgaaggt gtaaaaatat
38700actctataag ttcttatgga gttatctagc attcctctat agatgatcag ggtctacagg
38760acaagctcat actaagcaaa ctcagaatgt cagaatgtta acagagtaac ttgaagtaag
38820aaacataatc tcaaagggac cagaattctc ttattggctg ctcagaatac ctgctcttct
38880ccatgcagca ggccagtgga ttcctgtagt ttttttctcc tttctctatg ccgtgcagat
38940catggctctg catttggtgg tggttgcttt gttgcttaag tttcagattt taaagctata
39000ctgaagtaca gactcaactt cattctgaat ttagcttccc tcagtaatac atttgtgtct
39060tctcatacac agaattactt tcttggacaa ctaccttaat gtgttgaatg ctgtatttac
39120tggctcggca cagtggctca cacctgtaat cccagcactt tgggaggctg agaggggcgg
39180gacacttgat tttgggaatt tgagaccagc ctggccaaca tagtgaaacc ccgtctctac
39240taaaaataca aaaattagct gggcgtggtg gtgagtgcat ggaatcccaa ttacttggga
39300ggctgaggca tgaggattgt ttgaaccggg gaggtagagg ttgcagtgag ccgagatcat
39360gccactgcac tccagcctcg gcgacagagc aaggccctgt ctcaaaaaat aaaataaaat
39420aaaataaata attaaataaa tgctgtattt actatctttt atttcagagg gtgatctttt
39480tcttattttg gcctgaaatt gttggattgt cccacagtag gctcctctat accatcctac
39540cctgtattct ataaactaga tcacgggctg ttccctgagt atattccttt tccccagtgg
39600ttttgttttc tcctaatgct ctttcctctg tctagattgt tgaaatcctg acacttcatt
39660aagtttcatc ttgaatgcta cttcttcttg actcctcaaa cctgatgtgc tctttctcct
39720ttgaattcta tgcccctttt gtggcactat cttattccac cttatataaa agttacttgt
39780tccatcttcc ctactaatgc acttgaggga ccgtgcatca ctccttggct ctcttctgcg
39840gccgtggacc tgctgctctt agtggtcttc agctatttca gtggccagaa agtcgatcac
39900tgaaactatg acttgcaagc tttcatctgc ttcatctatt tttaggattt ctagttttaa
39960tgggctgttt atgaataaat aatttagctt cttctaggac atgccagtta atgaaacctc
40020agttaatggc tttgagtact aagtgctggt gaaaagtcat ttcttatggt aatacttttg
40080aaattgacca ctattatagt tatggcaaat gtccagggaa gagtgggcat aagaggttgt
40140ggtggtacta ccagcctgtc tctgtagaga tgttatcttg tcacttggct tgtgaagact
40200actcctctag cctccataag ttttgtgtaa agattaaagt gtgtgtagaa aaatctgggt
40260tcttatcaca caacaaggaa agattaggct cacagatact ttgaagggtg aaggggaaca
40320gaatttattg ggtgaaaagg aaaaaactca gcaaagcgag aagggttcct gctaacaggc
40380ccccatctca cagattgaat cccaggttgc cacacaggga caggagaggc caaggtcctc
40440ccctctgcaa acagcgggca cttcctgagg cccaacccca tcctcccagc acgcaggcca
40500gctggagatt ctccaggaag ccgtttttac ttggctgtct cattagtagg agaagtttgc
40560atttttagaa cttgattatt acctggagtt tatttttatt catatacatc agtagttttt
40620taagttggga tgtgtgtact gcatatatat tagaacttct atattttaac ctaattttaa
40680attccaactt tttgtatatc ttttataaag acatattagt acagtggtat atgtatatac
40740tttataaatg catgtgctaa cattgggcat gcttgttcag aattttaagt gacagctgtg
40800tacaaccaga aaagttgaaa ccattgctga atacactagg taaatttcat agacaagtat
40860gtgtgttatt agaaggtttt ggtaagcaag gacatatttt agttctttag tttgctaatg
40920agcttacctg aatttcagtt taaagagtaa aacagaatac atttattaac attttttgta
40980cctatcaata gcttcttgtt tttccccagc agtgtcagtg tattccaagt aatgtctttt
41040ctgtccttaa gcccttaacc ggtcctttgt gtaaagcatg gacactgctt tatgggcttt
41100cagtttgtgt gtttgtttgt ttttaatagc ttcaggggcc ctgtcaaagt taacccgtgg
41160attgaaagat gaatcgctgg cttatatcta tcattgccaa aatcattatt tttgtccaat
41220tggcttcgaa gcaacccctg ttaaagctaa taaagcattc aggtaagcat tgacgtgttt
41280tagaaagtgc attttaagaa atattaaaaa atagatgggt gcggtagctc acgcctgtaa
41340tcctagcact ttgggaggcc gaggtgggcg gatcacgagg tcagaagatc gagaccatcc
41400tggctaacac ggtgaaaccc cgtctctact aaaaatacaa aaaaaaaaaa aaaaaaaaaa
41460ttagccgggc gtggtggcag gcgcctgtag tcccagctac tcgggaggct gagggaggag
41520aatggcgtga acccgggagg cagaacttgc aaaaaaaaaa ttaaaaaaat taaaaaatat
41580tcaaagtctg tgatattggg aggcttggcc atctgctttc ctgacatcaa gttagactat
41640tctttaaaca ttatgactta ttcttctgca gaattgcatt tagttaattg tgctgttgaa
41700aatatccatt tagatactgt tgttcagtca ttataggaaa aagtcatttg aaaagtcact
41760tgtttttctc ttagagacag ggtctcactc tgtcatccag gctggagtgc aatggtgtaa
41820ttataggtca ctgctgcctc aaactcctgg cctcaagcaa tccttctacc tcagcctcct
41880gagtggctag aactacaggc atgcaccacc atgcccagct cataaaattt tttaatttgt
41940gttttctata gagacggggt cttgctatgt tgtgcaggct ggtctcaaac tcctggcctc
42000aagtggtcct tattcctggg cctcccaaag tgctgggatt acaggtgtga gccaccacac
42060ccttcctgaa aagtaatttt tacatttatt ataaaacagt tgcaaaggat tttatagact
42120atgtcacagt gcctcatcca acgtcccaca tgaacaatat attacagaat ctagtgtcaa
42180gaagattgtc acaactgagt aaattaaaaa gcatcttctt ttcttaaagt gaagaatatg
42240gaatttgtct gggttccccc tttgtcctga ttgcaaggaa gctctgagcc aatttttatt
42300tccaatgtca gaaactacat gactttcatg gttatatttc ttttttgcct aatctagagt
42360ttctgtttta ttaactatat aactttactg gtaaacttgc atctttgtgg aagagatgat
42420ccctaagaaa gagaaggtga ctaagtttta atggtttttt tcctgtttag ccacttttcc
42480ttatgatggg ttgagataat accaggatgg ttgaaagcca agggagaagt atctagttaa
42540gaattggtgg gtggggtgca gtggtgtaat cccagcattt tgggaggctg aggcaggcag
42600attgcttgca cccaggagtt caagaccagc ctgggcaaca tagtgagacc ctgtctctta
42660aaaaaataca aaagttagct gagtgtgttg gtatgcacct gtagtcccag ctacttggga
42720ggctgagacg ggaggatcgc ctgagctgag aacatgccac tgcactccag cctgggcaac
42780agagcgagat cctttctgta aaaaaagaaa gaaagaaaga aagaattggt gaaaacacaa
42840gccatgagta ctttgagcaa agggattggc ttggtcatgg ggataggatg ggtttttttt
42900tttttttttt gagacaaggc cttgctgtgt tgcccagact ggagtgcagt ggtgagatca
42960tagctcactg cagcctcgac ctcctgggct taagtgatcc tcccacctca gcctcctgag
43020tagctggaac tacaggtaca caccaccata cttggctaat tttgagatgg ggtttcaggg
43080tttcaccatg ttacccaggc tggtcttgaa ctcctgggct ccagtgatct gaccatgtca
43140gtctcccaaa gtgcttggat tatagacgtg agccaccaca cccagccagg atgggttatt
43200tggggctgga ggccatgtgg ctttggagtc ccactcaaag gcccctcggg tttggggaag
43260agagggattc tgagacaggt tggaatgacc tgggagaacc aggaagctgg aagttgtgtt
43320ttgtgaataa ggagcaagag ggaactctct tgttctctcc caatgctgaa atgctttatt
43380catctttctg gaggtggaat ttcttgtaag tgaggtatac tgtatgtttc ttttatgatt
43440tcatcctgca gcttcacctt atttctgaca tttatcaaat ataaatcatg gttttaaacc
43500aactttgtac cctttaaaag agagtccctc acttctgcct gtacctaaga atcatgtgag
43560attttcttaa aaaactatat aaatccccag gctctactcc agacctattt gaattagaat
43620gtccagggat gggtttagga catcagtgtg tttaaagact tcattgcttt agggtttcat
43680ttttaggatg cgttttgaga tgtcttgaat tcatagtgaa aggagaagag tagtggaaaa
43740atgcagatag cattccagga tttttttttc ctcctagaat tttagaggtg gtatattaaa
43800acattttata gccctagatt ttgaatagag gattaaggct gaaattttaa tttgtgagtt
43860ctctttaatg gcctgtagaa atttgccctc acctaagctc taatgtctgt tcttgatgaa
43920tgaatgccag gtttgaatac actctagtat gccttttatt aatatatatt tgaagcaata
43980tactcaaggg agtaactatt aaacgtactg tgaatgtatt ttatatttag caggggacct
44040ctctcaccac aggaagttga atattggatc ttaattggag aatcaagtag aaaacatcct
44100gccattcact gtaaaaagta tgttaacttc cctttatttt ctttaattga ggtaacattt
44160agatacagtg aaatgcacag atcttagttg tatataatta gtttgataaa tgaatgcacc
44220tgtataatca ccacccaaac aggatatggt acacattcat tgccatggaa aatactctca
44280ctctctactc ctatcaatct ccatagaaag cccctcttct gatttctgtc actacagatt
44340tgctttgcct cttcttgaat tccatgtaaa gagaatcaga caatattatt tttgcatcaa
44400tatcttaagt aacatttttg agattcctcc attgtgtcat gtgtatcaag agtttattat
44460ttttttattg ctgagtagta ttctgttgca taagcatgct gcaatttgtt ttccattttc
44520ctgttgatgg gcatttggat tgctttgagg tttggtctct tgtcaatgaa gctcttggga
44580acattcatgt acaagccttg aaacaaatta tataaaaatt ttattaaaat ataattcaca
44640taccataaag ttcactgcac ccaccatgcg cggtggctca tgcctgtaat cccagcactt
44700tgggaggctg aggtgggagg atcgcttgag cccaggagtt caaggccaac ctgggcaaca
44760tagtgagacc ctgtctctac aaaaaaatta aaaaatgagg caagaggatt gcttgagcct
44820aggagattga ggctgtaata agctgtgatc gtatcgctgc actctcatct gggtgacaga
44880acgagatcct gcctcaaaaa aagggaaaaa agtgtacaat tccatagatt ttattatagt
44940tacagggttc tgcaccaatc accactatat aattgcagaa cattttcatc actccagaaa
45000gaaaccccat accccttggc agtcactccg tattccctga gccctggcaa ccactgatct
45060accttctgtc tctagggatt tgcctattct tgtttgtttg tttgtttggt ttttttaaga
45120cagagtctct ttctgtcact taggtgggag tgcagtggtg tgatgtcagc tcactgaacc
45180ttcatctccc aggttcaaga gattctcctg ccgtagcctc ctgagtagtt gggcttacag
45240gcgcccacta tcacgcccag ctaatttttg tgtttttcgt agagacgggg tttcgccatg
45300ttggccaggc tggtctcaaa ctactgatat caagtgatcc accctcctca gcctcccaaa
45360gtgttgggat tacaggcgtg agccaccacg cctggctggg atttgcctgt tcttgacatt
45420tcgtataaat ggtgtcatac agtatgtagc attttgtacc tggcttcttt cacttaatac
45480agtgttttca atgtcatcca tgttttagca tggattagaa cttcattcct ttatatggcc
45540aaatcatttc ctttgtatgg agaagtcact tttgttgtac aagtctttta gaggacatat
45600gttttgttct cctggaagac acctaggact agaattgctg attcatagta tagatgtata
45660tttaagaaac tgtgagaaaa ttctccaaag tggttgtaaa ccttctgatg aacagaagtt
45720ctgattctaa ttaatcttaa ttaaatttat tttatcaatt ttcttctttt atatgttttg
45780tgcctctata agaaattttt gcctacccca cgacctcaaa gatgctttcc tgtttttttc
45840tagaagcttt atagatttta tgtttagatt tctgatccct ttctattatt gttttgagac
45900agggtctctc tctatcaccc aggccagaga acagcggtgt gatcatggct cactaaagcc
45960tcgacctcct gggttcaagt gatcctccca tctcagtctc ctgagtagct gagaccacag
46020atgtgtgccc ccatgcccta ctaattttaa aacttttttg tagagacaga gtcttgctgt
46080gttgtccagg ctggtttcaa actcctggac tcaagtgatc ttacttcctt ggcctcccaa
46140agggccggga ttacagatgt gagccaccat gcctggcctt aattaccttt tgtgtgtggt
46200atgtggtaga gctcaacttt gattttttcc cccagtgttt atctagttat tctagcacta
46260tttgtttaca aagattttcc tttccgtatt taacttcttt ggtgcctctg ctgaaaactg
46320tatgtgtggg ctgtttctgg accctgtcct actaatctgt ctgtcctttt accgatgcca
46380cactctcaat tgttgtttat actaaatcct gaaattagat agcacgaatt ctctgaattt
46440ttctttttct taaagattgt tttggctctt ctaggacctt ggcttttcca taaaaatttt
46500agaatttgtc cttttaatta gagtttgtaa agattttgat gaggatagaa tctatagaat
46560tcataatttt ggctgggcgc ggtggctcat gcttgtaatc ccagcgctta ggtgggagga
46620tcacttcagc tcaggagttc gagaccagcc ttggcactat ggcgaaaccc catctctact
46680aaaacgacaa aaattagcca ggcatggtgg catgtgcgtg tagtttcagc tactcgggag
46740gctgaggggg gaggatcaac tgagcctggg aggtcgaggc tgcagtgagc tgagattgca
46800ccactgcact cctccctaag tgataaggtg agaccctgtc tcataaagaa agaaagaatt
46860catagtttag ttttggaaga attgacatca taacaacatg gactatatct ttacttaatt
46920agagctttaa tttcaacttt ttaaaaaaat tttttcagta tggagtttta tgtcttttat
46980taaacatttt ttctctatgt gttctattac actgctggtg ggaatgtaaa ttagtacaat
47040cactatgaca aacagtatgg agattcctta aagaactaaa agtagatcta ccatttgatc
47100cagcaatccc actactgagt atctgcccaa aggaaaataa gtcatatgaa aaacacacag
47160gcacacacgt ttatagtggt acaattcgca gttgcaaaga tggagaatca acctaagtgc
47220ccatcaacca atgagtggat aaagaaaatg tggtatgtat attaccacgg aatactactc
47280agccataaaa aggaatgaaa tgatgtcttt tgcaacaact tgggtggagc cggaggccgt
47340tattctaagt gaagtaactc aggaatggaa agccaaatat catatgtttt cactttaagt
47400gggagctaag ctatgcggat gcaaaggcat aagaatgata taatggactt tggggactcc
47460tgagggcagt ctgggaggcg ggtgagagat aaaagactac atattggatt cagtgtacac
47520tgcttggatg atgggtgcac taaaatctca gaaatcacca ccgaagaact tattggtata
47580accaaaaccc agctgtaccc caaaaacaat tgaagtaaaa cttaaaaaag atattgcaat
47640aaaaaggttt tcctcagtgt gttatacttt ttgatgctat tgtaaattga tttttttaaa
47700atttcatttc ttcattattt gctggttgaa tacaaaaata caattgattt tcctatgtag
47760accttgaatc ttccatcttt accaaatcct tttagttcta gtagttgttt tgtaattccc
47820ttggattttc tgcacaattg tgttgtctcc taatagagac atattaactt tacattcttt
47880taaaagtgga cttgactgta attgaaaagg aaatatagca tattgctgct ttagcatgca
47940ggatgtcatc taacatgttg gtatttattt attctgtagg aagagtaacc aattgtacac
48000tgaaaacctt ggcaacggtt ggataaacag aattgtcata aaactagaag ttattacagt
48060ttagtagatg aaacaatgtt gagataagag aattactgcc aattattgct gtgataatat
48120tgcagcaact tgctcatgaa aaattccatt gcaatgatac tttagaccta aagaaaattg
48180tgctttatcc tttcttctta actatagatg ggcagatatt gttactgatc taaacactca
48240aaatccagaa tacctggata tccggcactt agagagggga ctgcagtata gaaaaacaaa
48300gaaggtaaga agaacaccat tgtgtttgaa ggcatttccc agctgaccaa aatgtggtgt
48360tttacttagc actctttagg ttgcaagtaa cagaaaccta tatttgatag aggagtgaaa
48420attattttaa ggacacacgg ctgtctttaa aacccaaggg caaggggtat aacctggagt
48480cataaaggaa tggtagatgg tagcaggtat tggaatgaca gcaagggcca agtagctggt
48540agtctgcttc actcttttgg gggctatata ttgttcttag ctttcctttt gtctgattac
48600tttcttcgta agatcatcat ctttccattt atgtgtcgtt tttagatgaa cagtctagac
48660tgagactgac ctgtcttggg ttcagttccg tagtcttagc ccagcttggg ccagattaac
48720tgtggccttg tggcagagtc atgtgtacaa acaaggcttt gggagaggct ctcaggaaat
48780gatatcttat tgatgcaatc ttactcatat aactatatga agaataggag agagaggaaa
48840aggaatgatc ttggtgtttg aattaggcag atggggaagt tctctagtat gtgttgggag
48900tgagtagtag agaggcatgg ggaggaaggt gttttggctc cggtgggcat tgttatattt
48960tgtaaaggca gctgttagaa aaggtgcagg atgaggtgct ctgatgatgc ctggaaggaa
49020agtaccattc tgaggccggt gaagtaaaaa aagaaagaag ggagttcata gggacctaca
49080aattagtaca attaatttcc ctttattttc ttcagccaac atttttttct tctcaggttg
49140ggggaaattt gcattgcatc atagcattcc agagacttaa ctggcaaaga tttggccttt
49200ggaactttcc atttggaacc attagacaag aatcacaacc tccaacacat gcccagggaa
49260ttgccaaatc tgagagtgaa gacaatattt ccaagaagca gcatgggcgt ctgggccggt
49320ctttcagtgc tagtttccat caggactcgg catggaaaaa gatgtctagt atccatgaga
49380gaaggaacag tggttaccag ggttacagtg attacgatgg gaatgattga ctatgcttgc
49440tactgaacag ctggcattat atatgaaact gctatataca ggactgtata aagacagtag
49500aagattttag taagcctaca ttaaatagga gcagatcttg tggtataaaa aataaccttg
49560tagttctcca gatactaagc ttgtatatga ttatggtggg tgatttcaga tatataagca
49620gataagcaca gattattgtc ctttcaagtt aagagtatat aatctggaca gaaaatttca
49680caaaattcaa taaaattaca actgttgtct aaataagtga aacacaaatt cacttaatag
49740catcaagatt tgaaatactt aagcatgaag tgacttttat aatgactcga tccctagaca
49800tttgttacag atagttttat gcctaagacc aagatgtaaa gtaccatctg cccttaaaaa
49860aaattggggc tgtcaatttc tagttttcac tcatggttaa cacgcattta aaattatttc
49920atgagtctag tagttctttg atttatagca ggatcttgct tgcctcattt gtttcctggt
49980tatgttctta ggattctgac taagaggcaa aagagaaaag actcaagaaa ctgatcctgg
50040agatcgagac catcctggct aacatggtga aaccccgtct ctactaaaca tacaaaaaat
50100tagccgggtg tagtggtggg cacctgtagt cctagctact cgagaggctg aggcaggaga
50160atggcgtgaa cccgggaggt ggagcttgca gtgagcggag atcgcgccac tgcactccag
50220cctgggcgac agggcaagac tctgtctcaa aaaaaaaaaa aaaaaaaaag acggatcctt
50280ttttttggtg caaatgggtg acttagtgca ttgattcaga tttttaaaat ttcttgatgt
50340ggtttgtaat aatcaaatat tgacaagaac cttaggtctc gaaagacttt tataagtcta
50400gatgacgttt gccttagggg taaagtaaaa gaacaattgg caccttaagt ttctataccc
50460aaggttatct gtgaaatgag atctcctgat atttgattgc tttctcagta tggagtcata
50520tgttgataac agtactgaag atgcataaga aatgcccaag tcactcagag gacaactacc
50580catattccag actctgagct gtttcctttt taaaaatcat atagacaatt agctgtttga
50640agtgagtatt aaatatttca gaagtgtgaa tttcatgtat ttgagctcct ctagttgctg
50700ttggtttttc ttctgctgcc aacctgtgac tcacaaatga ctaggatctc ttgttcttta
50760attttagggt cttgttccag gactcaaatc agtaacttgg tgattacaag gtgctgaatg
50820tgttggtaac catatcgcaa tacacctcaa ggaaaaggtt cagattttta tttttaaaat
50880attttcattt ttttcttgaa ttttatatcc gtttgttcac tcgtacatgc ctagcctaca
50940gaaggggata tatattatga aatggtcatt tttctgaaga gaatattttg cttgaaatgc
51000aaaggactga aagagatttg taggttgttg attttgttac ttcatactgg aacttttaaa
51060aagatttcat caaataaagt tttgttttct acttttaatt atatgaatgt ttttaaacct
51120ttgttttagg tagaaggtac cattgtgtct ttgaagtaca tgataatttg tcaattctgc
51180tcaactggta cattgtaaga aaccatcttg agccttttat aattaatgaa acaatatgca
51240ttatgatgac tgtaatttta gatttctaat ttaataggaa atagaaaatt tgattctttt
51300attagcttga accaaatgaa gttgtcatct ttgtaggtca aaaatggttg aatattagca
51360atttcataca gttcaactga atatttcaga gaagacgtag gtaggaagaa ccaagaggaa
51420aggagaaaaa tgcaaaataa aataagaaat taaaggatag ggagacacct agaaaaacag
51480gatgagttat tcagactatt gaccataatc aatttttata aaagtcttga tctgttctaa
51540gtttggcttc caagtttggc tttccagtta ttaagagcac aatgaggttt gagtttagtg
51600agattatctt tcccgcagaa gctgtaagca agagttactg catacttctc ttagaagatt
51660agtaaaattc ccttgatatt tgatttcttg gtcagctctt aggaatccta tagatacagt
51720gaaagttcaa atactggctc tgcacttaca agctatatca cctatagtga tataggcaag
51780ttaaggttaa ttttttttct gtgcctattt cctcattagt aaagtggggg taatagtatc
51840tactttataa atgaatttga agaataagct aatacatgta gtgtttagaa cagtgctttg
51900tagataggaa gtgctattta agagcttgct attattccaa aagatgtgaa ttttactatt
51960cagagtcttt agagagagcc ctttagatag catcttaagg agctaattcc ttttaaatca
52020catatgcacc ccttagttgc tgtttcttca aaagaatatt tcatattcaa gaatgttgct
52080ttattttttt gagacagagt ctcgctctat tgccaaggct ggagtgcagt ggtgctctct
52140gcaacctccg cctcccgggt tcaagtggtt ctcctgcttc agcctcccga gtagctggga
52200ttacaggcat gcaccaccat gcccggctaa tttttgtatt tttagtagag actgggtttc
52260accatgttgg ccagggtggt ctcgagctcc cggcctcaag tgatctgcct gcctcggcct
52320cccaaagtgc tgggattaca ggcgtgagcc accatgccca gcctttgctt tggttttaaa
52380agtacttaca atactcaaat gcctatgttg gctatttatt tttacccagc tcacaggcag
52440aaaaaaaaaa gtttatttag atcaaattct gcagcaattc ctttccctac tgctattact
52500gttaaagaac tgtgtgccat cattaggcca agttggtgca gcacccaaaa tacttgccaa
52560cttgtcttct cccagactgg aggggctcag gcagctcttc taggatccat cacattctgc
52620atctcaactt taccaataac actcccaccc ctcctgtacc agcgattcac tcaacaaact
52680aaattataca ccattaccag gtcagttctt aaatagctca gcagcaacaa gaacaacaag
52740atgttggtcg cattctaatt ttactggcag aaactgaggg tttcattggt gaagaaacct
52800gcctaggatc acagtgtctt agtttctcag ggctaccata acaaaacacc acagacaggg
52860tggcgtaaac aacagaaatt tgttttctca tggttctgga aactagaagt ccaggagcag
52920gcaggtttgg tttcttctga ggcctgtctc cttcacctgc agagggccgc cttctcactg
52980tgtcctcaca aggcctttcc tctgtgccta tcatccctgg agtctctccc tgtggcccaa
53040tttcctcttt ctacaaagac accagtcaaa ttggatttag ggcctatcct aatggcctcg
53100tttcaattta atcattttta aacgctgtgt cttcagatag tcacattctg aggtactcag
53160gcttccacag atgaattttg gaggacacaa ttcagaccat aacacaaggc aacatatggt
53220tcagatgaga tctgaatgaa tgatcagcct aacctccaag cagattcttt cagcagactg
53280caaggtgcac tggagagctt tagactagag gcttaagagg tcatttaggc aatatttaca
53340gaactgctaa gtgccaggat tgggggatgt agcagtcacg aaaatcgttc ttcgtttcag
53400taagtttaca gtttaccatg gggaagagac aaatactgaa caggcagtta tattactctc
53460agtaaaataa caactgggaa cagtttctgg ggattacttt acatatggag gaaatgcaaa
53520aaacactttg tcaggattat tcctgtagca aattagatgt gactaggtca accaaacatg
53580gccactgtga accatcttat tgagcataga agtggttttg ctaaaaatgg atttctccac
53640ggagcacggt ggctcatgtc tgtaactcca gcactttggg agactgaggc aggaggattg
53700tttgagacca gtttgggcaa caaagcaaga ccctgtctca aaaaaaaaaa aatcagggcg
53760tggtggtggg ggcctgtaat ctcaactact cgggaggctg aggcaagagc atcacttgag
53820cctgagaggt ggaagttgca gtaagctgag actgcactac tgcactctag cctgggcgac
53880agagtgagac cctgtcaaaa aaataaaaaa taaaaaaata aaaaaaatca atttcccgta
53940tacttctgtt ataaagaagt ctttaaaaaa ttgcctgctc ccttaagtca gccttttcac
54000tgagtttaaa tttcgttcca atttgaacaa atatggatgc taatactatc ctttatagtt
54060actagtgcta agtgcgttgc atgtaacacc ttgaatactt actgaagtct gcaaggttgg
54120ttttatgccc tgattactga tggagacaca aattctgaac aactgtgact tcagggatgc
54180taaacaccat agtgaggcac gatgcgggga tttgaatctt ggcctgaggg ttccagagct
54240gtggcctttt ctggggttac tctgttaatt gatttctagt cctttctgat ctacaagccg
54300cggcattata acttttagat gctgaagaaa actaaactat atgtcaagga ttaaggcttg
54360tgaaccccca aaatttggga caggtctcag ttaatttaga aagtttattt tgccaacgtt
54420aaggacgcgc agctgtgaca cagccccagg aagtccagat gacatgtgcc caaggtggtt
54480ggggcacaga ttggttttat acattttagg gagacaggag acatcaatca acatatgtaa
54540gtacactggt tccttccaga aaggtgggga caactcggaa gcaggaaggg cttctaggtc
54600acaggtagat gagagacaaa aggctgcata cgagtttctg ataagccttt ccaaaggaga
54660caatcagaat atgcatctat ctcagtgagc agaaggatga ctgactagaa tgggaggcag
54720gttttgccct gagcagttcc cagcttgact tttccctttt gcttagtaat tttgggaccc
54780taacattttc acaggcttta aattttatta ttctttagtt actacgtgct agcatataaa
54840taaatagtac aaaaccaaga aggcatccac cttttggttg tctcttcacg tgtaaaacaa
54900cactttgtgt taagtatctt cacacacggc ggcgcaaagg tagaaaccga tactaaaaaa
54960gcgtgtagaa aatagttccc agcctgggca acacagggag acctcatttc tacaaaaata
55020attcgccaag catagtggtg cgcacctgcg gtcccagcta cttgagaggc tgagatggga
55080aagttgcttg agctcgggca gcaggagttc caggctgcag tgagctaaga atgcgccact
55140ggactccagt atgggcgaca gcgtgagacc ctgtctcaaa caaaaacaaa agcccgttac
55200tccaccaaga aggcgctttt gcacattgtt ttaatgctta acgccttcag gatgccagcg
55260tgacggaagc aagtaaccac caaggcatca ccactggcgc taaacttctc acttccggag
55320tgctgcaagc gcagaaaata tacgtcatgt gcggaggcgg agcttccgcc ctgcgcgtcg
55380tattagacgg aaaccgagcg ggcccatttt tcatgggttt gcggacccac cagcgaaggc
55440gggaggtgtc gcagggacat cttctggctg tttccgtcgc ctgcgtggcc cttgcacccc
55500ggtcttccat tagcggcgca gacgtttggg cctaagcgct gggcgaggcg aggccctgcc
55560cctccccgcc aacggccatt ctctggacct gtctttcttc cgggaggcgg tgacagctgc
55620tgagacgtgt tgcagccaga gtctctccgc tttaatgcgc tcccattagt gccgtccccc
55680actggaaaac cgtggcttct gtattatttg ccatctttgt tgtgtaggag cagggagggc
55740ttcctcccgg ggtcctaggc ggcggtgcag tccgtcgtag aagaattaga gtagaagttg
55800tcggggtccg ctcttaggac gcagccgcct catgggggtc caggggctct ggaagctgct
55860ggagtgctcc gggcggcagg tcagccccga agcgctggaa gggaagatcc tggctgttgg
55920tatccttaac gccgcgttgg gacttggggt gcagggattc ggggctggat tcctcgcggg
55980gctctgcctt gggcacagtg gcatctgcag gatgatggtc ttgggtcggg gtcggggtcg
56040ctatagaatc tctgtcacta ggttttctaa gtacagtcgt ccctcggtat ccccggggct
56100ttggttccag cccctcctcc gtataccacg atgttcaagt tccttcaact cccttatata
56160atggcgtggt atttgcatat aaactaccca cttccgtaat cttttaaatc gtttctagct
56220tacttgtaat gccgaatgca atgtaagtga tctgtaaata gttgttatac tgtattttaa
56280aattttttgt agtttttatt ggtatgtttt atttatttat ttttttccat cgcaaatatt
56340tttgatccgt ggtaggttga ttgcggaatc gggtgatgag gcgggccgcc ctgtctgctt
56400tccccagctt tgcagtctta gcggcctgtg catcctggtt tgtcactttg tggcagtgct
56460ttatgttcct ctctgcctta gttctctcat ctgcaaagta gaggtggtga tagtatctac
56520acacaggatt ggaatgagga ataaagaaat tgcctacatg agagaagttt agtgcgatta
56580atacagtaaa tcttaaagtt attatcctgt ccgggaggtc agtaaggaga gcagagtaga
56640cttcgacgat tagttttgct tgagtcttgc cccatttatg tttcttagag gaaggatagt
56700gtggacaggt gttttaccca ttttttaaat tgacttttta aggactattg tttctgtaca
56760tgtttggctg gttttgtttg tcattggaat taaattcttt tttcattagc aaaacgtgat
56820actgcttttg aaatttttat ctttttcttt gttatgtagt catttttttt ttttttttga
56880gtatctattc tggctcagaa tctggtgata ccttacacca atttcctaaa tggggatcta
56940tgagtctagg ggctcgtggg cttgtggaaa gagtcctgta aatgattgcg gggggattgg
57000ggacgggaaa tggagtgcaa gagtgtgtgg gcgttcagca aaggaatccc tgactgtgga
57060gccctaattc ttcagtaggt acaaaactta taaataaaac gacattgata agttttaaaa
57120cataatgaga ttatcatttt ttgcaagtga ataatcttta aataaatctg taattgattt
57180ttggcttaaa agtttttata cagggtactt tgaattgggg aaagccaaag gatttttgtt
57240ttgttttgtt ttgttttttg ttttttagag cacatgcgct gttgctgccc tcaagctgtt
57300gcttagtatc attgcagtct aaagattttc tcagaaataa agggtaaagg ttagttttca
57360gtgacaagaa cccttaaaac ttcagcaaag atttagatca ttttatgtag cagcccttgt
57420gaagaattac taaagaggac tgtggccagg ctcagtggct tacgcctgtc atcccagcac
57480tttgggaggt caaggcagga ggatcatttg agcccaggag ttcaagacca gcctgggcaa
57540catagtgaac ccctgtctac acacagtcaa aaaattagct ggacgtggtg gcctgcacct
57600ttggtcccag ctacacggaa ggctgaggca ggaagattgc ttgggcccag gcggtcaagg
57660ctgcaatgag ccgtgttcat gtctgtctct gaataaataa ataaaaagga ctgtgaaaga
57720cattctagtc atgacaacct catatattta taataaactc gtttatgtca aaggagtata
57780tctgattact tgttctacaa tcggttttgc tttatttcat tttcacggga attcagacac
57840ctgaaattat ttagatttta ttgtattgtt tgttaatgac aaacttaccc gttattatac
57900tagtttcata atttactctt gtatctctat actaaattgt ttggtcttta aaccatctat
57960atagcttagt gtcttattta tacagtctaa atgtctgtgt caaataatgc agaagtaagt
58020taaacttttg tctcttatag attttaatgt ttgtaatatg cgtttaagtg tattagcaag
58080atattaccat ctctttaggc tatgtatact aatggagatt tttaagtcac ctaactaaag
58140aactaaagaa acttattttt tgtattgaaa tgttattggt ctttgggtct tataccgaaa
58200gtgatttttg gctttgctgc agattttaac ttttctgtcc gcagaaattt aattttgcgt
58260atataaactt actgaaatta gacaagtcaa attatacaaa tattttcaga ttgtctcatt
58320tttcatattt cttgtctaac aatttatgtg aatattttca ggtcgtctaa ttttacatat
58380ttcatctcta agaattcaag ccaaaattct caaccctagc tgagaattgg cattacctgt
58440agttaattga aaaataaaaa ataaaagaag tcctacctca gtctctatga tttagaattt
58500taggcccatg gggtttaata atccgtatgt ttcagaagct tcatatccta ttctaatgga
58560tagcagggat gagagccact gatctgaaac tagatttcct cactgaagtt aaagtttaaa
58620gtagtgggcc aagtagaaac taatttaatc ctatgtaata gtgagcttca tggcttttcc
58680agactatact tgtagcaagc agaaacatcg atctaataca agacagatgc acatcaacac
58740caagttctac ctctgagatg tatcttttat ccgatctctt ctcgtggtct ccaccactac
58800tgtctgcgtt aacttgggct attgtaactc attttagaac tggtttcctt ataatctgtt
58860cttcacatgt tcatcaaagt tatccttaaa tcatgttcta gaaacttccc agtggccgta
58920tcacctcctg ccattccttt tgtcctgctg cactggcttc ctggagatag ggaccgtctc
58980tgttttgttc atggctgaat tgtttggtgt atatccctag ggtcttggat atgcctggcc
59040catggtattg ctaaatatta tagaaataag catagcagta gcttgctttc cactaggtat
59100tttgttacat agtgttttgt aaattaattg ttgatggacc tacatttttt tttatgttag
59160taaaagttag gcagtgtaca ttcatttaaa tactaaactg ttcaacttat ttaatagcag
59220atatttattg tgtataagag tccctgtgaa agggagtgca gtgcacaaat tgcccgggca
59280agccatggag cacacaggct gggttttttc cctgctagat tataaactcc atagggccaa
59340tactgtgttt tgttcagtgc actggatatg ccagtgcata gaaaaaatgc tgccacataa
59400taaaaacaga aataacgtga taggtggccc cttcaatgtt cttttcctat gttagaaatt
59460tacctgcaaa tattagcatc agtccaggtt tctctctgtt ttttttttct ttttgttttt
59520ttttgagaca gagtctccct ctgttacgca ggctcgagtg cagtggcgtg atcttggctc
59580actgcaacct tcgcctccca ggttcaatcg attctcctgc ctcagcctta taggcacacg
59640ccaccatgcc tggctaattt ttgtattttt agtagagaca ggatttcacc attttggcca
59700ggatggtctc gatctcctga cctcaggtgg tctgcccgcc ttggcctccc aaagtgctgg
59760gattacaagt gtgagctacc gcactcggcc tcaatatatt ttttaaatag gaatgtatac
59820aaggaggatg tcttcatcta gactgaggag gggacccctg gactgagacc ttagagtagg
59880agagaactag gcggagagag ggagaaactg tttcaggcag aggaagcagc atgtgagaag
59940aacctggagg caggaaggag cttagtgatt tctgggtttg aggaaagccc agtagcagga
60000agtgagcaga ggccagatca ttcaaggctt cctagaccag gagaagaagt tttagatagt
60060atcttgaagg cagtgggaag ctttggaggg ttttcaccaa agatacgttg gtgaaatcta
60120gtatatacct tcaagataca ttagaagata gaattgatag acctggtaat agctataggt
60180ggtgatgccc agcattctgg ctcaggtgaa tggcagggac attcactgag atgggggcat
60240gagcagctga gtggagaaga catggagttc agttttgatc atgttaagtt tgaaatgtct
60300atgaggtatg gggtagagac gtcatctggg ctctctgttc tggggtcagg agcgtgctct
60360gggccagaca tatacatgga gtcagggcgt ggtgtatgtg gaggggagtg gaggttgata
60420gttcagacag gaggaaagaa gagggcctga gaaagatcac tgatgaagtg aatactttaa
60480cagtgttaca ggagctggca aagagtctga gagcaggaaa accagcagag cgcggtgtca
60540ctctttcttt tggacaaata taacattttt tgagaagcac tttttagaaa agtttaaaaa
60600tataagaaaa cttcaagatg aaaatataag ttatctttag tctcagcaga cataatatct
60660actttaatat tttggtatgt gaactagttt tttatattta acatatatac tgtatcagaa
60720aatggagtta tactttatat ggttttatag cctgttttac tatctttaat aatatttaat
60780gactatcttc cttatcacac atgctcctgc aacttccatt tttaagggct ttttggtatt
60840ccagcaaata ggtctgccat aattagtttt tcttttttga gacggagtct tgctctgtca
60900ccaggctgga gtgcagtggc gcgatctcgg ctcactgcaa cctctgcctc ctgggttcaa
60960gcgattctcc tgcctcagcc tcctgagtag ctgggactac aggtgcgcac caccatgccc
61020agccaatttt tgtattttta gtagagatgg ggtttcacca tgttggccag gatggtctcg
61080atctcttgat ctcgtgatcc acctgcctcg gcctcccaaa gtgctgggat tacaggtgtg
61140agccaccgcg cccggcctat gattagtttt ttaaaaagac ctgctggaaa acttttaagt
61200tgatttttat attttcatta ttataaacaa gcattctgta aataactctc attatatatc
61260catgatgatt ttctgaggat caatgtctgg aagtggtaaa atagactgca aagttttaaa
61320aagttttttt cactatgaat tgtcaaattg tcctccagaa aatttgtgct aattcatgaa
61380aattctttag caatcactta ttttttttta aagaagcaat tctttgcaat gagacttctt
61440aaaaggatat gtctgtcttt gtatgatttt aaaatgcagt atgtgaatag gggtaacaag
61500agttcaacta aaagttaact gactttaggt agatcccatg agagctaaat gtttttcaat
61560tttaaatgaa tagtgataag tatttagtgt tcaacgtttg gataatatca gttattagga
61620aattgaagtt gtgaggatga agagaaaaat cccggagttt tttccattaa caattctccc
61680agatattagc atttggttaa accaagcact taaaggagtc cgggatcgcc atgggaactc
61740aatagaaaat cctcatcttc tcactttgtt tcatcggctc tgcaaactct tattttttcg
61800aattcgtcct atttttgtgt ttgatgggga tgctccacta ttgaagaaac agactttggt
61860aagtgtcgta tagtttttag taagtgtcaa ataatttttt tctttctgca ttcttagaaa
61920aattcacata aaatttttgt tttctcttta gaattttaga aacagactta ttttgacaca
61980tacttaatta catctacttt tttatcttga acaattaatt tttcttttaa aaagttttat
62040gagagttcat catgggtaca atgataaaat ttaactttta aataaaacta actactaaaa
62100accttgctgt tgagagtttt cccttcagag gatcttgtta ggtgttttat atttatttca
62160aggatgcggc catcactcag aacactgtgg aaaccctctt ttgggaagtg cctccctgaa
62220tctgagatca agtatcctcc gtgatagcaa gccttctttt aaggggaatt ggattttagg
62280agtaagcaat gttatgtggt gccaagtggt gactaaagtg gattatctag gaaggaaatt
62340gaccatgata tagaagagtc agaatgagtt gctgctttat tgttctggaa gtagttttaa
62400aaaatgtttt tggaaggccg ggggcaggag gattgcttga gcccaagtcc agcctgggca
62460acacagggag accccatctc taaaaaaatg acataaaata atgtttaaac agtccaaatg
62520acagcaccca tttggatata tgacttttat ttgttgaaaa taccaaagta atgaaatgat
62580gactttgtaa tcataccaga tactctctaa atgaggatat aggataaaag taaggatata
62640ggataaaaat aaaaagcatt tataaatgct gggggacctt tgttgcctca gtaagagtgt
62700ctagtcttga atgttgcagt tttcagttac ccaaggcata ctgtaaatca ggtactttgt
62760ggaaggatta gaaaggaaaa gagaatatct tggttatttt tatccgatat tcagccataa
62820gcacagggac tatccatttg tcactacttg aacatttctt acaagatatt tctatacatt
62880ttagaatttc tcatgatata taatcttgaa tattgaaggg cagaattaca tgtaagtaat
62940gcatgtttat aaaaagcaat attagaaatt gtttttggtg tcactgtgac attttataga
63000ataaataaat aatttctttt gggaagcatt taagctttcc acttcttggg catattaaaa
63060gacatttaaa agaaactgaa agtaaattaa gtttcccaaa tatattgtgt agtagcccat
63120taagtaatag atggaaactc tgggtgtcct ttacgtagca gcaacctgaa gatatacact
63180gatatggcaa ttaggaggaa atgctaaagc agccatagtc tgaaaactca gacaaaccaa
63240attctctgga aaataaatca cagcaatgtt tctagtggtc taatatcctg aagtgagatc
63300ttacatcctt tcttctcata ggtgaagaga aggcagagaa aggacttagc gtccagtgac
63360tccaggaaaa cgacagagaa gcttctgaaa acatttttga aaagacaagc catcaaaact
63420gccttcagaa gcaaaaggca agaggaaaat tatagtcgtg ttagagatga agttttaaaa
63480aagtgatttt tgtcttgatt tcctgcgatt ctctttccct atctaatttt gactctcaac
63540agaaaataga gagtgaaatg agacaagtag gctgccattt tgacctggta atttggagtt
63600gtggcaattc tccgttctgt gagaatcaac tttgctaatg agaaaaaaaa gctgtcgtgt
63660tgcgtcatgt acacttttta ctttgattat ggtcttcttg actctaggtg agcagccccg
63720ccaaggttcc ttcctttctc tcggctgcat ttattttcca cagcagtggc ctgagagcag
63780ccaggtcagg tccctgttca ccatcctgag cagggtctgc ataatctgtt taaagatttg
63840tgtactttcc agagatgaag cactacccag tcttacccaa gttcgaagag aaaacgacct
63900ctatgttttg cctcctttac aagaggaaga aaaacacagg taaatgttta actatttaag
63960aatattattt tagtcattgc tacattcaga cacatttaaa ccttgatgtg ttatctacat
64020gataaggcat gtgaacattt cttaatgcat ctgaaatagg catgctctat acctttcaga
64080atatttttca aagactaaat tttttattta ctattctttg tgttttggtg attcatgatt
64140cttattcctc ttcattctaa agaacttctg ccagggctgt ggattctaaa ttctccaagg
64200catgaatttt agttttgctt acactttgct tacacatgct gtctctagct gcttttcaat
64260ccagtatggt ctctctcacc atgatgctaa aataccacaa cccatgacat tcttgttgct
64320aaatctagca gacgctttgc acgtcttagt gtgctcgact tctctgcact aaacaccatg
64380ccacactgcc ccgctccccc ggcatctctc tggcgctttc tgctcagcct ctctgtggac
64440tcctcttcct ctgtctctct cttaatggtt ggtggtttct gtgtttgaag tgaaggtgga
64500gaaccggaag cttggcagtt tgcctttgtc ccctgtcctt ccctccacct cttcctttcc
64560caggagttcc caagagacaa cttagctttt gtatcagctc agttctattt gtctccattg
64620ctaccactgt cattttttct ccaggttaca ttattagctt ttaaacttct ttctgtcttt
64680gtctttgatc ataccctttt tgtaatagtt cctcccacca agcccaatct ctgttttgtg
64740ttatagtctg tttaaaatcc aaatctgata ttgtcatttc cttacttaaa acactttcta
64800gcttctcctt ctcatttttt tctttctttt ttttttgaga tggagtctct gttgcccaga
64860ctggagtgca gtggcacaat ctcagctcac tgcaacccct gcctcctggg tttaagcgat
64920tctcctgcct cagcctccgg agtagctggg attacaggca cgtgccacca cacctggcta
64980attttttgta tttttagtag agacgaggtt tcaccatgtt gtccaggctg gtctcgaact
65040cctgacctca agtgatccac ctgcctcggc ctcccaaagt gctgggattg caggcgtgag
65100ccaccacacg cagcttagct tctcatttct ttaggataaa actttaactc cttaattaca
65160ctttccagtc ttatccctgc ctttcttcac atcatccaca ggcaggcttt cttttccttg
65220tttctatatg gtcatacaac ttagttcttt tgcctggaac actcttctcc atgaaccttc
65280ccgctcccta caactacact cattaattcg tggctaactc caaccagtgt aattgcagga
65340gatcctggaa agcttttgtg actcctaaat tgggtgaggt gactccctat tgcatctcta
65400gtgacctgcc cttcccttgt ggtggaagtt attagatccc actctaattg cttacttaca
65460cgtgtccccc accagaccat gagaccagta gtggcacaga tcttgctgtt ttattcacca
65520ctgtagcccg tataacgagc agagccttgc atacaagtat ttttgtaagg ggtccttaaa
65580aatcatagat atcgtaaaag tatgtttgac tttcagttca gaagaggaag atgaaaaaga
65640atggcaagaa agaatgaatc aaaaacaagc attacaggta tttagatcat ttttgaattc
65700agaatgtatt ctgttatttg aaatgaatga catgaaaatg aatattaatg aggtatatca
65760aactgtgaaa gttcctgata aaaagtaaag acagatggct ttttggttgt gcatatatat
65820gtgtacatgt atgtatttaa aacacactca cctacacacg tgtatatata tatatatgga
65880atttgccatt atgcacatct atatattcaa aacaagctat ttttctttta caaccaacca
65940accaatagta atatgtgctt acatagaaaa tatagaaaac atatcattca taaataactt
66000gtaataacag tttgatatat tttcatccat tctttgcctg ccccatatag ttgaatatct
66060gtgtgtgtat ttgtatatat gtatcaaatg tgcatgtgta catatatatg tgtacacata
66120tgtgtgtatg ttctagttat ctgtcgctgt gtaacttagt ggttaaaaca tcatttgtta
66180tcgtcatctg tctgggttcc aggggtagat gagctcagac aggcagttct ctctggggtc
66240tcttgtgtag atgcagccag tctgtagctg gggctggagt gattaacgag gtttccttgc
66300tgccatgggc ggtggttgat gctggctgtc agccagtctt ccatgcggcc agcaccagag
66360cacctacttg tggcctctcc acgtggcctg ggcattttcc cagcatgatg actgtattcc
66420cagtgtgggc atcccaagag aaagttctaa gcagaagccc tgttgccttt ctgacctagc
66480tttggaagtt gtgcagcatc cttcccacta catcctgttc ctcagaagct agttactgcc
66540ctatggaagg ggcagagagt tagacaagaa gaatgtccca cacatatttt catgaataac
66600atttttacaa attgagatta tgctgtatat atacattttt ttcataaaca tttttcaagt
66660ttcttagtgt cacagatagc attttttgat tcccaaattc actgtttctt gtttgttcat
66720atggttttgt tctatcaggt agaaactttt gagctaaaac ctcgagatcc ttcaggttgc
66780ggaggtgctg tgctattaat gcagctagtg agagcaatct tagttaagac catgggcctc
66840gtggtcaaaa agacctgcgt ttcagtccag gctccactgt tcactagctg tgtgatctta
66900cacagattaa attctccaag gctttatttt tttttgtaaa acgaaattga tatcaatgcc
66960ttcctgatgg gggtgatggg agcattagtg agggaaagca caggaaatgc tgagcgcagt
67020gacaggtctg tcctacgtgt ttgctggttg gtacgtacac atgtaacata cagacatgca
67080ggacatcagg ttgtattagc ttatcttttt tactaatgaa ggattttaaa gtacttttgg
67140taatgagtta aagttgagaa aagttttaca cttttccatg tcttctgtga atttctttta
67200atttcatttc ttatgttata agcaaaataa aactaactta attatgggaa tgatatatat
67260tgcttgaatt cctaaggcac attctgtctt gggttcttgc attttctgtt ctctcagcct
67320gaaatgctct atcctcagtg aggctctttg atatcttcca agtccctgct cacacatcct
67380ctttttagtg gggtcttctt tatctatcaa atagatcttt ttttccacgt tctgttcatg
67440taatcactct catatcagat aaatctgctt tattttacca ttatcttgtt catttattta
67500tttatttatt gtcttctctc ctctgctcct aagagcaggg accttatctg gcaccttatc
67560cgatctcaca gtaccttagc acctagaaca gtgcctggtg ccttagaggt acctgctaag
67620tatttttttc ttttgaatga attaaagagt gaatggctac attccctaat ttgcctactc
67680actttgttgc ctgtcacaga ttatatgcaa ctgtgtttag ccaattgttg attatgtaga
67740actgtgttta ttatataaac ataatacata tccttaatgt tgaatagaac taagtgtatg
67800aaatgtaaat ttcatggtgc tgtgatttta tctttacagg aagagttctt tcataatcct
67860caagcgatag atattgagtc tgaggacttc agcagcctgc cccctgaagt aaagcatgaa
67920atcttgactg atatgaaaga gttcaccaag cgcagaagaa cattatttga agcaatgcca
67980gaggtgaaat atgcaacagt acattcatgc ttagaattaa gaacttcagc aaaacttttt
68040attagaaaga agagaaaatt gataagcaat acttacacga tatctcagtt aacagtaaac
68100agcatttcta catctcagat tctaagaagc atcgtatatt tatacgtttg agcctataga
68160catttactct aagaagtttt tcttgacttt tgacccgaga ctaggtcttt tttcctggtc
68220tttgttctca cagcaccctg taatatcact tcatagttct tagttccaaa acacgcttat
68280cttgctcacc tctgtatttt cagtgtctag ctcagtattt ttcacatggt atgtgtccag
68340tagatgctta ctgactcaat tcttaggtta ggtcataaaa gttattgtaa cctataatat
68400acattgtcta taaaaactaa tagtcatata gaatctaatc acaatggaaa aataagttct
68460aaattgaaat tccaggtata tcttcctctg ctgcagccct agagatgcca ttggctctcc
68520acattccctt gccctcttcc tggacagtgc gaatggggct tcttcacctt ggaacatctt
68580gtagcttggc aggcccagaa agctagagtg gaggtggtat gtgcagttgg gtgctagcaa
68640atgtgtctcc tgatcatgct gccattgata cttaattcat gttactattg atgactccct
68700gtcttagttg ccagtgagtg agttcttttt ctctttctgt tgctgctacc tgttatttct
68760accgtagttc tccattcacc cactatagga cagaatcgaa attttgcagc atcatcgacc
68820ttagtgcata gatggagtgt ttttttattt tctacaattt ttgaatattg cttaaattga
68880tagcagaaat atgaaaaagg aagggtaaat ttcttttctc atcactccct gttttttcca
68940caaagaatgt gcagtagcac acactaaggt gcacagaagt gacattcttg ggtctttgga
69000tatacaaagg acagaagtaa attgattttt atttcaggag aaaaatccag gctcagtctg
69060tctatcaggc attttatttc ttgagtatga aaggatctct ggctggcagt tgaggaagta
69120gaattttggt tgtgtaaaca ataacaggaa gaaatgggag aaagagagac agtccctaat
69180gatttactgt tctttatttt ctttctgcaa ccatgaagtc tctggaagtg gtggactgta
69240gggtggtgtg gagtagcagc ttactggatc tgtaattttg atagagatgt tctaagtcat
69300ccatgttggg cctttgtgtg atctgtatgt cgtgcaaatg taatattgat aatagtagtg
69360atggtaggta ataatagcag tagtaataat cataatacca tagttccact ttactcacgg
69420tttgcagttt tcagtgacct gtggtaaact gtggtctgaa aatattaaat ggaaactttc
69480agaaataaac aattcataag ttttaagttg cacaccattc tgagtagagt gatgaaatct
69540cacaccctcc tgctccatct tgcctggaac gtgaatcctc cctttgtcta gcatctccgt
69600gctgtagatg cttcctgcct gttaatcact gagtagctgt cgcggtgatc agatcaactg
69660tcgcgatatt gcagtgcttc tcctcaagtc actcttattt gacttaatga tggcacaaca
69720gtgcaagagt atgatgctgg caatttgaat atgccaaaga gaagctgtaa agtgcctcct
69780ttaaatgaaa aggtgaaagt tcttgaatta ataaggaaag aaaaaaatcg tattctggct
69840gaggttgctt agatctgcat aaaaatgact tttctatctg tgatattgtg aagaagcgaa
69900aagaaattgg tgctagtttt gctgccgtac cataaactgc aaaagtcatg acctcagcgt
69960gtgataagtg ctcagttagg atggaaaagc cattacattt tggggtggaa gacatgaaga
70020gaaacatgtt ctgattgatg acaatcaggt ttggtacttc tgcagtttca ggcatcctct
70080gggggtcttg gaacataccc caaggatgag ggggctgtct actatgttaa tagaatcaat
70140tgtagtaaat tgacatgctt ttgatcccag atctaccact tattagccct gtgactctag
70200ggaggttacc taacctattt aagtcccaat ttcttcattt ataaaatgga ggtgatatct
70260gtttcatagg atgattgtga gaataaaatg aggtattata tgtaaaagca cttagaaaaa
70320tgccctccat gggaaatgcc ttataatgtt aagtattact gttaataact gtgattactg
70380tgatttattg tgtcttttat gggataaggt tgtgcaggac acttcacttg catatttacc
70440tacattctag aagattgtta agccataatc agatgtcata gtgactgcta tgcattacat
70500gctcaataca tgtttattga ataatgatta aatcataaac agtattcatg attttttttt
70560tttttttttt gaggggaagt ctcgctcttg tccctcaggc tggagtgcaa tggcacaatc
70620tcggctcact gcaacctctg cctccccggt tcaagtgatt ctcctgcctc agtctcctgg
70680gtagctggga ttacaggtgc ctgccatcac acccggctaa tttttgtatt tttagtagag
70740acagggtttc accacgttgg ccaggctggt ctcgaactcc tgacctccgg tggtccaccc
70800atctcggcct cccaaagtgc taggattaca ggcatgagcc accgcacctg gccatagtat
70860tcatgatttt tttttgccca actctttcga agattatttt tttaaaagga agctgtagtt
70920tttcttgtta ttcacctttt ataatatgaa actaccatca atgaaaaaag ccaattgttc
70980tttgttccct gttggggaaa gggtggaaat atggtaatat tatctgtatt taatataaaa
71040cagtaatttt gtttgtttat tttgccttta ggagtctgat gacttttcac agtaccaact
71100caaaggcttg cttaaaaaga actatctgaa ccagcatata gaacatgtcc aaaaggaaat
71160gaatcagcaa cattcaggac acatccgaag gcagtatgaa gatgaagggg gctttctgaa
71220ggaggtagag tcaaggagag tggtctctga agacacttca cattacatct tgataaaagg
71280tatcaggcac catcatttat atatttacat taaaaaatca aagatatatc atgactctga
71340attctataaa ctagcacccc tggataatat taatgaaatt ctatttatgt aataactgta
71400tactgctatt aatggattaa ctactatagt gccaaaccac tttaaaatta gctaatgaat
71460taactcctag ttgccgatta aatgaaaatg tatatactta tttatgagaa ccagtgttct
71520cttatccatc ttactagaag cgtattgtca cactgtaaaa ctgaatggtg agaagtgttt
71580taattcttct taaggtattc aagctaagac agttgcagaa gtggattcag agtctcttcc
71640ttcttccagc aaaatgcacg gcatgtcttt tgacgtgaag tcatctccat gtgaaaaact
71700gaagacagag aaagagcctg atgctacccc tccttctcca agaactttac tagctatgca
71760agctgccctg ctgggaagta gctcagaaga ggagctggag agtgaaaatc gaaggcaggc
71820ccgtgggagg aacgcacctg ctgctgtaga cgaaggctcc atatcacccc ggactctttc
71880agccattaag agagctcttg acgatgacga agatgtaaaa gtgtgtgctg gggatgatgt
71940gcagacggga gggccaggag cagaagaaat gcgtataaac agctccaccg agaacagtga
72000tgaaggactt aaagtgagag atggaaaagg aataccgttt actgcaacac ttgcgtcatc
72060tagtgtgaac tctgcagagg agcacgtagc cagcactaat gaggggagag agcccacaga
72120ctcagttcca aaagaacaaa tgtcacttgt tcacgtgggg actgaagcct ttccgataag
72180tgatgagtct atgattaagg acagaaaaga tcggctgcct ctggagagtg cagtggttag
72240acatagtgac gcacctgggc tcccgaatgg aagggaactg acaccggcat ctccaacttg
72300tacaaattct gtgtcaaaga atgaaacaca tgctgaagtg cttgagcagc agaacgaact
72360ttgcccatat gagagtaaat tcgattcttc tcttctttca agtgatgatg aaacaaaatg
72420taaaccgaat tctgcttctg aagtcattgg ccctgtcagt ttgcaagaaa caagtagcat
72480agtaagtgtc ccttcagagg cagtagataa tgtggaaaat gtggtgtcat ttaatgctaa
72540agagcatgag aattttctgg aaaccatcca agaacagcag accactgaat ctgcaggcca
72600ggatttaatt tccattccaa aggccgtgga accaatggaa attgactcgg aagaaagtga
72660atctgatggt acgtgtctgt gcttttgtag aaatctggaa cggtaggatt tcccctctgt
72720aggaattcag agatcggtta gtgtagtccc gttttaactt tttacagata aggaacgaga
72780gacgtagaaa gaaagatgaa atgactttcc cagggagtca cagctggtca tggaatcttg
72840accttccctg tgttgctctg ctttttgtta tcatttttaa aggcatgaag tgccctattt
72900ggggaaggta aagttgagtt tccctctagt ttttaaaaac tttttatttt gaaataatta
72960tgaactttaa aaagttggaa gaatattata aaacactggt tccttcatcc agtacctcag
73020tggctagcat gttaccacat tagcttagaa tttctcttgg tctctctgtg gccctctata
73080tgtatatcat atatctccaa atctgtatac atatgtatac cattgatcct cattatttgt
73140agattccata tttgcaaaat tgcctgatca ctaaaattta tttataactc caaaatcagt
73200actcaccgca atgtctttgt ggtcatttgt ggacatttgc agagttgggg aaaagcttga
73260gttgccacac tgtcccctgc tgaggttaag caaggtgaca ctctgcctgg ttcccgtgtt
73320ctgagagaga tgaccagagg gtggggacag taggggatta tgcaatggag agagagcaag
73380aagctccggc cccaggccag ttggaccaga tttgaatccc tattctggca cctgttagcg
73440tggcagcttc acacaggtca ctaatttgtt tcttgaactt tgtttcttgt ttataaaata
73500aatggaatct attaagatgg tggtttttta ggatttaaga taatatatat gaaatgtgtt
73560catatatatg ttatatatgc atatgtgtgt atatgcacat agatatgttt aggagcaatg
73620actcggtatt ggctaattta gtgttcacag agacttcata cgtgatggcc actttgaata
73680agagaatcaa ccacacacac acacacacac acacacacac acacacacac aatttgttcc
73740tggtatctgc tagttttctt cattcaaatg ttactatttc ccttttgtaa ttaataagta
73800ttttgtggaa aaggaatttt tggagctata taaatatgct gttcctgaac aaacttccac
73860ccacttgtta gcatccattg atgtttacct gaataatttg ttactacgtt ggttgccaaa
73920tgatggtttt tctaactcca tcattcctta tatattatta cttgacatcc tcctatgagg
73980aagatctttt ccttctcccc atttattttt attattttta atcagtgtag actcctgtat
74040tcctatttag tgagttataa tccaatactg tcataattta ctttgttact caaattatca
74100cagctttggc cattggggct ccttctaatg gctttcagca gttttttcat tatattttga
74160gcttttcctt gctttctggc caagctgttt caggcatatg ttgtactttc tctgccctgg
74220tcctggaacc agccatttca ccagggagct ctggttcctt tcagtggagc atggggttta
74280gacaccacaa gctggatgtg agtgtgctta tggatcctga ggtataactg tctcaggcct
74340tttcagcaac agtgccagga agtatattta tgtatacata tacatgcaca cacacatcta
74400tatttatttc tatgtctatc tgtactaaaa tccatgagtt tatactgaca tctgcaattc
74460catggggttc agtctagcct cctgcttctt tatagtttcc ctaacaatga gaaacattgc
74520tccccttatc ctcaatacat ttacatctgc ttattctccc tggatatgta accatctccc
74580cctcccactg gcctcctcct tggccctgct ctcttctttg cttcagctgt gtccttggtg
74640ccagctccca gtccctgaga gccccctcct ctgttctgat tgtctcctta aacccagctg
74700gacaggcctt gccagccctc tccacctaca gggaaggaag gcaaccatta aatatatttt
74760aaggagaagg aaagacagta agacagtaag agaggagaga agggaagtgg aagaggaaga
74820actatttctt agtcacagct ttattctgtg ctgtgtaaat agcataaaaa catactgagc
74880aacttccatg attgtttata tactttgata atcctccttt ttgaattttt aaaacaatgt
74940cagttaactt agaacatatt tatataaagc gaatatacaa atcttaagag agtttcctac
75000tttcaaagac agtgccagtt tacctaattg aaaaggcttg ttttgaagtt acaggcattt
75060gtgattacat ttatttatta ataacgctac tattacatgt attctgttat agtcatatct
75120ttccttttta ggatgtagca tttttcaggt tcctccagaa agctcttgat gattgcagga
75180tcattttaat gttttgattg tagatgaagt gaccttttaa ttttggtaca ggaagtttca
75240ttgaagtgca aagtgtgatt agtgatgagg aacttcaagc agaattccct gaaacttcca
75300aacctccctc agaacaaggc gaagaggaac tggtaggaac tagggaggga gaagcccctg
75360ctgagtccga gagcctcctg agggacaact ctgagaggga cgacgtggat ggtgagccac
75420aggaagctga gaaagatgcg gaagattcgc tccatgaatg gcaagatatt aatttggtaa
75480taccgtaaca ttgtgtttcg acttcttgct gaggaagcca ggttaagtag gttttgagtt
75540ttaaggagtt ggtggatgag tatttagtag ctatttgcag tacatcttgt ggttgctgat
75600ggcttcattt ttgtgtaggt tactggctgg gatagactcc gttttccatg tggtttagtg
75660atgaatctct aaagatatta cagagtcttg gttagacatc cagtggagta cttcctaagg
75720agaaagagct tattggtaat ttcagtcaga ctaaatgcag gctttttgta aacaaaactc
75780atttggatta ttaatataaa tctataaatg aaaaaacatt ttataggagg agttggaaac
75840tctggagagc aacctcttag cacagcagaa ttcactgaaa gctcaaaaac agcagcaaga
75900acggatcgct gctactgtca ccggacagat gttcctggaa agccaggtgg gtgcaggcag
75960cttgggtttc ctttaccacc ttcttcagac ccctggggga atgcactgca tgaagggggt
76020atgcactgtg ccccctggtg ctcagggcct ggtgatgccg ttccctgggg gtcactgtgt
76080gtccctaact ctgcaggaat gaatgcatta catgaagtgg taggcactgc tccccctgtg
76140ctcagggcct ggcggtgccc ttccctgggc gtcactgtgt acccctcact ctgcaggaac
76200tcctgcgcct gttcggcatt ccctacatcc aggctcccat ggaagcagag gcgcagtgcg
76260ccatcctgga cctgactgat cagacttccg gaaccatcac tgatgacagt gatatctggc
76320tgtttggagc gcggcatgtc tatagaaact tttttaataa aaacaagttt gtagaatatt
76380atcaatatgt ggactttcac aatcaattgg gtaagacttc agagtctttt tgattacttt
76440ctgacattta ccttcagagt ttgtcctagg aagttttctt tccaaggaac tagtttgatg
76500cattgatgga aattgcaggt ctatgcaaat ttttatatga gtgatctttg gcttatatag
76560aggaatagga ttttaaacat ttgaattaag gaattaaagt cctagtatgt ttaggtagtt
76620aatcaactga cttagttaaa ctttgactag ttacccgaga tctccacagt gaacaaaagg
76680tggtggagag gggaagcagg ccgcgcctgg gcctgtattc gggtttctgg cactgatctt
76740cttctgttca tccagcaaat atatgtttgg tgactcctat atgccagttg ttcttaacaa
76800gagggaagga acagagagta aaatagtagg agaaacagat gataagcaga tacataaatc
76860ataatttgac agttggtggt aagtgccaaa tagaaaaata taacagtaaa ggagaggaga
76920gagtgaactt caggcatcga gagtgcaggt gctgtttcag agagttgttg ggaaaggctt
76980tactcgtgta acttcaaggc agggacctgc aggaaatagg aagcaagcac tgtagataga
77040tacctcggga ctagtgagtg aaaggggctg aggcagaagc ttgtgggctg tgttctaggg
77100aagcaaggag gccagtgtga gaggaggagg gaggaaatgg caagggttaa tcttagagaa
77160taggaagaag ctgaaaccca ccagggtctg gtgtgctagg gtatgcaggc agattgaatg
77220tggggtggga gaagaatgaa gtcgaggagg attccagata ctgagcagct ggtcgagtgg
77280acctgccttc agtaagacaa ggagggagca cagtggagga ggagaaacga agtgttcagg
77340tttggatatg ataaattggg gatgctcatt ggatacccag tggaggtgtt gagttgatag
77400ttgaatatat gtgactgtgg ttcagagaga ctcaggctag atagctaact ttgggggtca
77460gcgtatagag ggtatttaaa gctatgagaa tggatgaaac tttaaaaata ttaacagaat
77520gccattgaat aaaataattt attttcaaat aataagatat ttttggtggt tggatataga
77580tatagatata cacacgtaca tgatttatat aataaaatgt ttataaatgt catataagaa
77640atcttgataa aaattaaaaa atattgttac tctttaggat tggaccggaa taagttaata
77700aatttggctt atttgcttgg aagtgattat accgaaggaa taccaactgt gggttgtgta
77760accgccatgg aaattctcaa tgaattccct gggcatggcc tggaacctct cctaaaattc
77820tcgtaaggtc ttttatttct ttaatttgga taattgtgta aatacccaaa taagcaaata
77880gaactattat ttacagcatg aactgtcatg ctgtaacatg tgaacaatgg ttcactgaga
77940aagcagcaga aagtattggt tgttttccat tttctagaga tgatgaaatc agagtcagtt
78000cttagtggtg ctgggcttat cctagttcaa gggtacaaag ccagtcctgt ggatttcaca
78060ggaatgtaga agttgccttt tcatccattg acatacttat agagcagcta tgatgtgtca
78120gacactgtgc tggccctggg gaaaggagag atgagtaagg cataggccat accctcaagg
78180actcccccca tatacttgga ggtgatggag aagcaaagca aattgtactt gtctgtgtgt
78240ttggtgtcct gagagaggtc agccaggaag cgacttggaa atataatgcc aggaatgtta
78300catccagcac tcctgtcctg tttctcacca tgtgaatccc gtgacgtgtt cagtggtgaa
78360gtcttgctcc atccagaacc cagagtcgtg ctatactcgg ggtacattta tcattttgaa
78420gattctgaga tggtaggagt tatgagtatg tctgggaaag tagttttttc tttgtccttg
78480atggcatctt tttaaaaaat tgaaatatag tcacatacca cagagttcac cgatttaaag
78540tgtacagttc attggttttt agtgtattca caagattgtg gtggcatctt ttaaccgtca
78600ctatgtcgtg agatgctgtt tggtggtggc tgatttaatg atccttgagt gttctctcca
78660gttacatatc ctgtgtttga aagatgctag gatgctctga cctgaagagt cagtgactgg
78720aatcactgaa atgaacggcg aaactgttgt agtgatcatg gtcttccagt catagcacac
78780tcaccaatgt gttgtatttt atactttctt catttatctt tttctgattt tcttattgtt
78840gaggaaattt atataccggg aatgagttat gaggtagaca tacgtcatgg taggtaaata
78900acttgaaaac ctacaaataa ctttattata caaataaagt aaatacatca tagaaaatta
78960taaatatgta ttaaaaagat gattacagtt tacaatttgt ttatgtaatt ttagacttca
79020cacatatata tgcatataca ttgcatatat atgtatactc atttgagtgt gtgtatgatt
79080ttttttataa gaatgggatg atgctatata ctttttattt tacctgtaac ccatttggta
79140ttagaacaag acctattatc aacgatgata tagatttata tcgtttttaa tggctacagg
79200tagtatcttt ttgcgtctat cagggctagt ttaatcaagt gtcttttgtt atttctattt
79260ttagaaaggt attaacagac attgatttga gtgtccttaa tcttcgggtt cttatctaat
79320gattatttta gaataagtga aatagtatct gacttttcca tttggtttct tgataactat
79380tgagagtgaa ttttttatgt tcatagacgg tttgtgtttc tgatttagtg aatgacctgt
79440ttatgctgtt ccacttttct ttgttcagca gtttttgctc gtttgttgcc tgctgaatca
79500ttttcatatt tgagcagttt ctgcctttgg tcctccagct cgttgatgcc agataggagc
79560agccttcgct ctgcctggct caggacattt aggatgaata cagagtaaag agcaggaagg
79620atggtgggaa accagtaaaa tcagaatgtg gcctcacttt gttttctgtt ctcctcttct
79680gagggcaccc tggctggtga atcattctta gagtggggct gcattgggcc tgcaggttga
79740attgagctgt agtcacattt ggtttggact acacggtgta ttcttctctt gagctttaat
79800tttcaattgg ctaccaacat ttacctttgg gagattttac acaaaattat ggattctgtt
79860gggggaaaaa ggagagctga aaatactggg cctgaatttc tggatagccc caaaatagct
79920ggcatggagc aacagcagtt cccctcccct accccaaggc cttggactgg gcatgaatcc
79980ttcagtttca tcacagttgc ctccactccc ccagctcagc tgattgacag acaccttcct
80040cctttcatct ttatgtgaag tgccaggccc tcctgcttga aggagtaacc atagctttgg
80100cttgatgagc tcacaagtgc actgattcta actgccagtc atgattacag tttcaaaaac
80160gaaaatctgc aagtaaagta gaggagaatt atcggtattc tctcagtgca ctcctggaaa
80220gaatgttgta aaagtaatga agaaaaatta ttttttcctt tttgcatttt gttgttataa
80280tactttgaag accgcaatta gtaatttcca acgctagatg gcactcctgc atcggtgagg
80340cggaccctgg ccctgacagc cggtctgagt gggttcctaa tgctgcacta gcaagcttgt
80400gaagcggggg agcctgagct gcgggcgttt tttcaggttc cattgagtac atagattggt
80460gaggattatg ttagtaactt tgatggcatt tattttttcc taattgtgta aatgtttctg
80520gtcagtcgtc actgcattac aaactacctt cagccttagt ggcttcacac agcagtagtc
80580actgattttg ctcatgcctc tgaagctcgc tcaggcctgg tgggggtgtc ttgtctctgt
80640tccatgtcgg ctaggacagc tcagctgggc taggacccct taagatggct cgcggggctg
80700atgaggtggc ttcggcctcg gggcctgggt ggacacttgg tgagcttggc ttcctcatag
80760catgttggag ccaagagcag gaggtccagg ggacaggaag tggcgagggc ctcccctagg
80820cctggcacgg cttgagtcac actggtcagc catggccgcc ctgctcaaga ggaggccaca
80880gggaccccct tcgtgatggg aggagggtcc aagagtgtgt cgctgtcttt atccggccga
80940tttaggctta gaatggaaaa cattgaaaaa tgaataaaat ataaagaaga aaatgaatgg
81000ctgtgtgctc acgtagaata ttgcctagca cacagcgaaa taatagttgt tagtgttatt
81060tattttaagt atctgtttac ttttctcata tatgtgtttt ttccaaaact ggaatcatac
81120tttaagtaga gttctattat ctctgttttt gtatatattt atttgtttta catacatatg
81180tatttatata tatataaaat gttaaatttt aaaaacagat acatgtatct gtttttacag
81240attttttaaa caaatttaac attttcttgg catatttaca ttttattaaa tattcttcga
81300tccattattg tatttgattt gaggtaaata atacagatac attacttcca ttttactgct
81360gagaaagaga cccagagaag ttgtaacttc tgtggtataa ttatgagtgt gggctctaga
81420ggaagactgt accaattcca gctgtgaaat cttgagccag ttacttattt ttgagttcag
81480ttccctcctc tgtgaggtga gggtaacgat agggcatggc ttgtgtgatg attgggcatg
81540tgaaatattt aacacagtgc caagcacaga ggaagacttc cataaatgtt tgccatcatt
81600atacattgtg gctaatagta ctaaaattaa taaaatattc tatatagcat acaatttgag
81660ttagaacttg agtttacatg ttctaagttt agttcttcat ccatagtgta tgaactataa
81720tgtctcattg ctgtgtaagt aattgtttcc tttattttac agagaatggt ggcatgaagc
81780tcaaaaaaat ccaaagataa gacctaatcc tcatgacacc aaagtgaaaa aaaaattacg
81840gacattgcaa ctcacccctg gctttcctaa cccagctgtt gccgaggcct acctcaaacc
81900cgtggtggat gactcgaagg gatcctttct gtgggggaaa cctgatctcg acaaaattag
81960agaatatcct ttgcttctta aaagagaagg aaacaccttg tcaaatatgt gtttggttta
82020aaacttatgg aagagaaact tggatcattt ttttcttata tcccgttaat cccattttct
82080taatatcccg ctctcctttt cactctttct tccctctcct cagttgttgt tttttgtttg
82140tttgtttgtt ttaaaggaac agtgtatcac tctgtcaccc agcctggagt gcagtggtgc
82200aatcatagct cactgcagcc accacttcct gggctcaagg aatcctcctg cctcagcctc
82260ctaagtagca caagccactg catctggccc ctctccttag ttttaaagag ggtctttgtt
82320tgaataagca gtgataagca ttcatagata taaccacata cttaaatatt attgattctg
82380aaagaaggaa agatgactaa tcttgtcctt tttattactg gttgtccttt ccgtgtatat
82440ttaagtaatc atttggatag atgtttcaaa acttgcacta gaattaatcc aactacgttt
82500gattttttct cttaatgaaa taggcaagat atctgaaact tgtttttatt ccagttaacc
82560ttttttcata gtcacaagtc tttgatgtcc taaagtgaga gctaaatatg ttttatttta
82620tagaattgaa aagaacatag tgccagatga ttatgctgga gtcatttatt ttgttagtac
82680tttcttttag cactctaatc tttataaata ggaaaatctt aggagataca gggaatggaa
82740tcaagaatgg gttctttgga cctttttatt tgtcacttgt ttaaatatct ttcaaaatat
82800ttataagtct taactgcatg catattttgt cagcggtatt tcggctggaa cagaacgaag
82860acagatgaat ctctgtttcc tgtattaaag caactcgatg cccagcaggt aatcatggtg
82920gacccttctc ctaagttcag gatgaagggt aggctgtggt tgacagctgt taaagaccag
82980ttaactctta ttttggggca tagagcagac attttgaaat taggataatt tagatgagag
83040aatagaagaa aatagaaaaa gaaaagttag actttggctt cattttctat gatcactctt
83100agggctgaac tttggagtcc tttcatattc atttttttct ttgaaaaaac cagtctaata
83160actgatttca cactaaggtg tttctgatta aatacattac cccttgggat ttactttcat
83220tttttaacga aaggaagtct tcaaagcaat tctgatcatt taagttttta taagtactaa
83280tatttttaaa ttctataaac gaatcttgaa aggtagaagt tcagtcatta ttgtgtatca
83340gtaggagagg tttttgtggg aaggagccct tttgtatgac ctttagttcc tctaggtacc
83400tacttcttgc tttatctgtg agtatgttct gactcacaca aaacaagctt tcgcccacac
83460tagaaacaac ccacaaaaca tacatacaaa taaaccactt agcagaaatg agaatgtagc
83520ttttgtgata atatatttaa attaaaccca ttttcattag gcttggataa ttttgaaaaa
83580catgaacaaa actaagacag catttctatg agtttctctt tgttttttgg atgactatct
83640actggaaaca ttataaacca accaaaaagg aaagctattt gtcttagtgt ctggtgattt
83700cttaatttca gataaatttt gttgtaatat ttatctcctt tttcattcta aaacttaatg
83760tgtttatttt tccaaaatta gaaaatgaag aaaatgaatt aaaactactc attagttacc
83820tcctctaata cagttgttaa catttaggtt aaaaaatttt cctgcattta tttgtggcca
83880tactaaatgt aacttaatag atctaatcct cccccccaac ttattagcat attttcttct
83940atgtattgtg tggttttctt aattccagat gactagatta tatttcatga aagatattta
84000aataattagt tcagtttact tgttagacct ttaattttca agtttgttat agtaatgctg
84060tgatgaacat taggcatata gctttaccca tatatttttt ttttgagacg gagtctcact
84120ctgtctccca ggctggagtg cagtggtgag aactcggctc actgcaagct ccgcctcccg
84180ggttcacgcc attctcctgc ctcagcctcc cgagtagctg ggactacagg cgcccgccac
84240cacgcctggc taatgttttg tattttttta gtagagacgg ggtttcaccg tgtaagccag
84300aatggtctcg atctgacctc gtgatccacc cacctcggcc tcccagagtg ctggggttac
84360aggcgtgagc caccatgccc ggccaaagaa tatcttctta aaagtgaatt tactgaataa
84420aaggcatgaa tatttcttac agttgctaat atatactgtg aacttgcctc tcaaaggtat
84480tgtatgatga taatgttttt aaaagaaaga tatagtagga cttagaaaca ggccccatga
84540agtcgtgttg ctcctgagga agatgaaatg tttagctaca gaaaaatatt taacgctctt
84600tgaatatctt aggaagagat ttctcatttg agatgtggac taaagactta gttgacagag
84660atggtacaag tactgcttta ggtatgattt agaaagtgaa aattactgtc agtaattcac
84720tgggagagaa ctgggttttg ggagataatg aattaatatt ctctgacata gtaatccaat
84780gtgagtgatc aaggttgagc ttgttgattt ggtttagaaa cttgacttac ttgtctgatt
84840tattattatt attcttttgt tattttttta gacacagctc cgaattgatt ccttctttag
84900attagcacaa caggagaaag aagatgctaa acgtattaag agccagagac taaacagagc
84960tgtgacatgt atgctaagga aagagaaaga agcagcagcc agcgaaatag aagcagtttc
85020tgttgccatg gagaaagaat ttgagctact tgataaggca aaaggaaaaa cccagaagag
85080aggcataaca aataccttag aagagtcatc aagcctgaaa agaaagaggc tttcagattc
85140taaaggaaag aatacatgcg gtggattttt gggggagacc tgcctctcag aatcatctga
85200tggatcttca agtgaagatg ctgaaagttc atctttaatg aatgtacaaa ggagaacagc
85260tgcgaaagag ccaaaaacca gtgcttcaga ttcgcagaac tcagtgaagg aagctcccgt
85320gaagaatgga ggtgcgacca ccagcagctc tagtgatagt gatgacgatg gagggaaaga
85380gaagatggtc ctcgtgaccg ccagatctgt gtttgggaag aaaagaagga aactaagacg
85440tgcgagggga agaaaaagga aaacctaatt aaaaaatatg tatcctctat aattagttat
85500gacagccatt tgtaatgaat ttgtcgcaaa gacgtaataa aattaactgg tggcacggtc
85560tttgtattta gtgtgtggtt cctaaaaaca aatgctaaat ctgacatttg ttttttaatg
85620ttttactttt ctagtatttt ttagctgaat atttcaagta tcattggata ttatcttgta
85680ttcacaggct ttgtcttttc atgttttcat tatcttaaca atgtctgatc cttcctggtc
85740acatgttaaa aaagcgaaaa agatttctat tgatcagcac tcactctcaa taggctttcc
85800ctctgacatt cagacgtagc tgagaagaaa tacgtgcatg tttctaattc cacaatagtg
85860gcagttttac acaactgttt agccctgctg cccacggctt tgcattttcc ctcaggttcc
85920acttaaaagc atagcaggag ggagcctcac tgctggaaca tatttcaata tgtttgctgt
85980ggttttagca aacaattagg aaacctaaat ggggtgcatc attctacctg tgaacattaa
86040gtgtatggga acctctgtac cgttatgttt ggcttttaaa ccagacttca cttattagaa
86100gctgacttct gtgtaaatgg atttggaggc tgggggctgg agttgaactg gtgtaggtgg
86160gtcagcttta ggagtggcct gcaggggatg attgttgttg acacagtgtt tgttcagagt
86220ggacaaagaa ggttatttta agactgctcc tgtggagacg tctcccaaga acaaccccca
86280agctcctatt tgcttcgagt taagaatgat tggaggagag tactaccaat tactttgagc
86340gtgggtcctc tgccttcaga gctaccttcc caagtctgtg tccttttggc atctttagta
86400tttccatctt tcctgacttt ttccttcagc cttcaaattg aaaatcttca ccaatgaaaa
86460caaaatccaa atagataatc tccagcagtc tttgaacgac tcctagatgt gttgtttttt
86520aggctctcat ttatattgac ttggccagcc aaacttctcc agtacttgga ctacagcagc
86580tgcttttctt tgcacactat tttttcctaa tttcaaatac gtcttctgca tccatcattc
86640ttctctagca gctccttcta aagtcaacaa tgatttccac ataaactcaa atcttttctt
86700catgcttctt ttctctggag gtacttggag cagaacattt ttggaggcaa ggggagggga
86760aaattaactt tttcgtgttt ttctcatttc tcagaattac ttgtcactgg gagttttctg
86820catatatatt tgcaggtaga ataattgtct tgctaacaaa tttaatttga tgcctttaat
86880agatatttac atttgttttt cactgcgtta gctttgggaa aggataagct actgtaacaa
86940aagatgctca tatacagtgg tctgaataag ctagaacatt acttctctct tactcaacag
87000tctggccagt ctgagctggt gaagcagttg tgcaccatga ggtcatgcag tcctgtgttc
87060agccaacccc tagggtgcta tcttgcctga gtggtctata ctgggtgatt acttcatctg
87120cattgcatcc tgccagaagg ggaaagaacc aaagtccagg gcaagcagaa ggggaaatgg
87180atggtcagga ccatgagaaa cctgttctgt aacactattt atatgagagt ttgatattgt
87240atcttctttt tctcttaacc ctgtgaccaa gacataatgc ttttacaagg ggagaaacac
87300gtttagagac attaaccagt tttgccaaat tcacccatgt agtaggtaaa acttctcacc
87360tgaaccaaaa tcttctaact caaaattcca gggtgctatt ttacctatga aagaggttgc
87420ctgtttaaaa taatcagcag ctgaacaaat tctttattat tcacatcaag ttggtgtacc
87480tagagcttat ttaggatgat gaccatcata cgccttttcg attatcttcc agctcattaa
87540tattttgttt agctttatct atacttgctg ttaaacccat atattgagta ttttatttca
87600ttgattgtta tttttcagtt ttatttgact cttttttttt ttcaatttga ctcattttta
87660tagattccag ttccctggtg tacttcttgt catctgtttt tgagtatctt aatcacagtt
87720actttaaagt ctgtcgaata actccaatat ctaagtcacc tatgagttta tttctattgt
87780ctgtttcttg gtatgcatgg gtaaatttta actgaatgtt ggacagtcta tgggaaaaat
87840tataggtacc ctgtaccttc ctctagagag gattccagaa tgtatttgga caggcagcag
87900aggggaagat cacctcagtg cagccagcga ggacaaagta gattgaaggc tagtttgcag
87960ttttttctgt aagactccag cttctagttt cacccgtatt ccaggaaagt cactttccag
88020gagtgccctg atgtttcttc ttttgcgctt ggatctgctt ttgccactga atgcttcaaa
88080aattagcctc ttgccctaaa cagcttaatc agtcaattct tcgactgaat agtcagcact
88140aagcgtcagg cttctctctc ccttttttct gggatcttgg gcccacaaat ctgactgtct
88200tggcagccac actcttcttt tgtctccaca gttctgtgag attgcctgat gctccgctgc
88260tatccttaca tgcggtcttc tctttggctt cctcaggaaa cagcaatgcc tggaattctt
88320ggctgccttg acagctctct gatactttat aacagaggca tgtttgtgtg gggtgtgtgt
88380gtgtgtttaa aaatctattt ttctcattgt taatagaagc tgaggtttgc tataagctac
88440ttacagctgg aggcagaagt ctgtatgtat ttgttaaaca gtcatgttgg ttaagatgct
88500tgattgaggg attctgcctc ttaagagatc taacagtgag gattgcatga tctttcaaaa
88560aaatccttaa aaaagacaat ggtttttatt cgtcatttat tttgttttta aaaaagcccc
88620aaagaaagag tatcctctgc cccaagttaa ctgcctatga ataatagttc acattgctta
88680ggattttgaa ctttccagaa tgtttttacc tataatattc accccacaaa tatatgttgt
88740atgacttttg tatcattttt aaaagacatt tatatttttt tgagataggg tcttgctctg
88800ctacccaggc gggagtttgg tggtgtgacc acggctcact gcagcctcca tttcctgggc
88860tgagtggtcc tcctgccttg gtctcccaaa ggctaaggca ccctgccgat atatacttgt
88920tctatagatg atgaaaccaa gaaacatact aagtaaatgg cttaaaatca caatctaatt
88980ataggcagga ataactaaaa tcaggcttcc caactcctag ccctttccaa tacattaaag
89040tgacctcgat tcatctgtgt atacattatg gacctgtagt cacactccta agtttaactt
89100tccaatattg ctttatctta actctgtaac tttctatcag ggatctttcc agtggcatgg
89160gaaatgtgta tgtcctctga gaggcacaca tacatgcact tgggcaggtg cacagatgga
89220tctatgttcg gagcctgccc attcagtctg aataaaaagg aatcatggta gtttgatttc
89280tgggtgcaga gcaatgctac agacattcta gtatggttgt tggagtctaa agacagtgtt
89340cccatgcaga taggtgacat gaaaaatatg tagcgttcat ttcctagaat gtcatagcca
89400cggagtctta gaataactct tggcatccaa gttaaaagct ggatcacttc ctccaaatag
89460tcttcactca cttccgggag tgggttgtat tcacctttag ctctgttcac gagacactcg
89520tgggatattc ctagccttct gcccccttag ttacgggcat tgccgtctca ctgctgtata
89580accagaggcg tgggagctgc agctgtcaga atcctaatcc cttctgcctg gactcgttac
89640cagtggtagc gagtgtgtat agtagtaaac tgtgctgcga tgctagcctt tcagtcacta
89700ccagaggtcc cagcagcaaa aacaaaaatt gtggatgata cattttaacc ttctagaaac
89760tcagctaact cctttaggta tgtccaggct ttccatttaa atatcagcct taaggcagac
89820atgtctttct ttaatggaag agatctaaat tgggagcaaa acatgtgacc acttttgcta
89880tcgttttagc tgtgtagcct caatgacata atttaacctt ggttttcctg atttgcaaac
89940tgtagggaaa ataatgtctt ctaacttgct gaagtatgct gtaaaagaaa atgtggtggc
90000tgtaaaatgt gctttgggat tctcggaaga aaggagtgct atattaattc atggtactgc
90060aattgttatt ttaccccaga ttgagaagta aaatatcaaa tcgttatttt ctaaagtttg
90120aataactgtt aaatttattc agataatctg ctctattcaa gaggcataaa tagtttcttc
90180agttttaaat atatcggctg tgatggtatg gactccttaa ggtagacgct gctcaaaaga
90240ttatctggca atgaggctgt ttgacattgc ataggtgcta atgatgcatt gatggtaaca
90300agtgaacttt caatgtttta atcgcctctt aaattttagg attgcagtct tactagtcaa
90360tatgaagatg atgcctttct ttgttacaac tgaggtagtc tctggtgaag aatatgtttt
90420ctttctgtag tagcttttta aaatttctga ctcagaagct tttgttctga ttttaatttg
90480ttctagatag tgtttagatg gtctttctta agtcacgctt cgttaatata tgatatttta
90540atataatttt gtctgctgta tttgaagtta agctttatta agaaactata aaagcactcc
90600atacatattt tgaattatta ccatttgata ttttaatgta ttatctatct agttttcaag
90660aaataatgaa gaaaacatca atatatcacc taccatcatt tcattacttg atggattcag
90720aagctcaaga aggtaagtca ttttatgttt atgtagctta ccaattctaa gaagacaaat
90780aactaataga atataaagtg tcacctttgt agtattcatt aatgtcttaa atggagctga
90840cacttttatc aatggcatat tatgtcttgg tttatatttg tgagcctgca aacatcttga
90900atcatgttta agtctattac tataccatga tcttgatgat atgttaaaga tacttttgac
90960agattaaatt aaatatatag acttactaag catgtacagg gtttttgatc ttcaaataag
91020gaatacattt ttttctcaag tgctgtacac taaatgtatg atgtgatttc taagcaaatt
91080ttatttcaga taagaagctt gcaagaaaga taaaaatagt tttatgtgaa aactataagt
91140ttttactggt ggttggatga aaggtgaatt taggctatac agacatctgt agcctgagtt
91200tatggttcct ctaactattt attagaggca aatacataat gtaatttgac ctctggatgt
91260aaagagatta agtttgaatt ctactatcag tgtggttatt tgaaattgaa ccatcatttt
91320tatatagtct taagtatcaa gcgtgacttt gcatgggtta tgtaatataa ctgtccccat
91380tgtcttgaga tcaaaagtac agacatacac tcttgacaga ctagaaaaat taccatattt
91440ttgtagagca gttgtaatac aaatctaaat atcttgattt cttttgttct ctccttatct
91500gcacacactg tagatgtttt ctgaacacta tagttctggt ttttctgttt ttggccattc
91560caggtggctt ctacctagga gaacacatct ttaggccaag aaagctgggt gtatgcccag
91620tgttggcagc ctacctggct taaacacaaa tagttttacc aagcgattct tgcaaaccta
91680accatttaag tcgatcactt agagaaccca gccatttaga tgatttgaat gaagacgcaa
91740tttctgtatg gtttatgaat aaaattatat cattatcatt accattagta acaattcttg
91800gtttcaactg tgtagaagat aatttggcaa attacaaagg aagtggaaga agttcaaggt
91860aaggttgaga cggaactagc attacaatgt tttacaacac aaacaaacac actctgtaat
91920catctatata gccaaatgaa gctttaaaaa tgtgttcaca cacacagagc taagggaatt
91980aaggatgtgc ccttggagca gatcaggtgg catttagttg gaaagccatc accaagtaga
92040ggaacactca gttctcaagt actgaaggat actgactttt cctgccctct tttatactta
92100gaatcctgtg cttggaaagg atggagcaaa cccatagctc agtgctcata tttcaacact
92160ggcatgaaga tccaagaact ataggatgtc acttatttat acacagatta aaattagggc
92220tatggtgggt gctaaaaaat tcttctgtat gtgtttctaa ggctatattg taatctaagg
92280tgtctaaaac cttacccaca ttaactattc taaatatcct atgttgcctg cctcatgcca
92340gcagcctctc tccagctgtt agtacacgta gcaatctgtt atagaatagt agggtcaccg
92400gaggacctgc agccttgggg cagattcatc cttgcccaag gctcagtgat tctggctctc
92460aatgaccctc gagtctgtca cgtcagctgc ttacttgaca tcctacttac tacctcttgt
92520cagtttaggg ccttcaagac atagatgcca aaatggggct aagcatgtaa gggatttact
92580gggggaaatg tttgtggagg ataaggaaga aaacagaagt aggctagggg agccttcaga
92640gcccaacatg ggtccagtgc ccgtgagagg agacagaaga cagattggat gggaagagtc
92700tccaattgca acacaattct gagacagtct caggtgggct gatgaaaatc cctgagcagg
92760agttgcccac tagaggagtg tcctatatgg aacaggaact ggctggcttg agtaggcagc
92820ctcaggagga acatggtctc ccaggagcca caggtgcagc aatgggcagt gttgggcaac
92880tctgctcctt gtggcaggtt ctcttaatct gagtggcaca ccagggtaat atgtatttaa
92940aataattttc ctttacaggt ttgcccttcc acctatattt tcgatctcac tcaatggcaa
93000catcactccc caggctaccc agaaacctgg aatcatcctg attttctccc tttccttcac
93060acccttcatc attacatctc attgatttta cattcctgat atgtttccct acgccttgtt
93120ccatttccac catcacttac cttgattctt cattatctct gctcctgact gctgttaacc
93180accttctaat tggctgtctc gctctgggcc aactcccttt ctttaccctc tctagttctt
93240ttcccacact gtaacctgca ctgcatgatc ttttaaaaat ccaaatcttg tcacatttgt
93300cttctgcttg aaattcttca gtgcctcatt atttcttaca ggataaaatc aaggcttctt
93360agtatggtgt aatgtgtttt accttgtgct actatcctca tttatcctat attctggttc
93420tcagggtttc caaaattggt caggagactt tttgccagtg ggcttttgca catgtcattt
93480cctgtatatg gcaggtcttc ccttctccta tttgacaaac tcataattag gaaggctcag
93540ataattcttt cctgagctct ctgagaggtg gaactttcca tttttgcatt gcctactgaa
93600gtctctgtta tagcactttt actgaacctc tcttatatag cacttggtct gccttctaat
93660tcatagtgcc caggtactta ataagtgagt gaatgaatga atgaatgaat tagctataac
93720ataaccgtat tggctgactt aaatttgctc atgagtgatg atcttgtatt tctgatcata
93780agtcagttga ctcccttcag gacctaatat aatgttctcc tcaaggcatc acattaaaaa
93840ttttcaggag catcttcata aatctttgca gaattggatt aatgggtaat atggttttta
93900taaacagtat tgcattaata tatatggaaa tttatgtaat tctaccactg ttttaagaaa
93960cctatcaaac tacagtctga ggcaatatat ttaatatact agttggagta ctatagcaca
94020tataattact aaagtgagaa taatttatta acacatttta aacattttac aagtaacatg
94080aataattatc ttccagataa gactaataaa ttttctacat gttgaataaa aataactctt
94140gaatatacca gtctacttca gatgtcttca aaactcagat ttgaacgtca agtttgaatg
94200tcaattgtcc acccctaggg gaggcatacc accactttga aagcacacac gttcatactt
94260ggctccttga cccaaggcct tcatttagac tcttgtcatt tttcatctag actgttccaa
94320taatcttcta attagcttgt aaggtcacca gttcttcagc tcctgctgtc agatgtagtt
94380cttaaatatc aaatctgatc atgtcaattc ctgttctaat aatctttatt agctcctcct
94440caccttatag aacaaaggtc aatgttgcac ataaatttaa tcccaaccta cttcttcagg
94500cttattttct atagttcccc tccatgcacc ttgtgtaata ttcattataa tccctaaaga
94560cacaatccaa aatgccatga tcctgaatgt tgaaatcccc aaagaacaaa tgcccttaca
94620gctaattgaa ttcccaaacc ataatgatag atttggaatt aggtgccatc aaaacttcta
94680aaggtggatt tcaaggtgtt agcaacagtt tctttttcca ttcagcccaa tacatttgcc
94740ggaaaattca gatgaatgga ttgcctgtag aatatagtaa catggaaaac ttcagtttat
94800aacgcattat ttgtctgcat tggcattcct tctagctgat gaaattctag gaggtcttaa
94860tgagttaaag ctgaatttgc ctgatgaagc cagcaaagtt actgactcta aaataactat
94920gtgcatggta ggataagaaa gcacacaatg atgttgctgt tcgatcacca gtattgtttc
94980agccaaatct gtggtctgta tatgagtgca tgagcaaagg atttccatgt acccaaaaca
95040acacagaagc atgacacaga gataggaaat ttaatagaaa atgctcatgt cagtgtgtat
95100tgaataaaac tagaatttca aaaagagcag catcatgtag aaaatcaatg tgaatgtatt
95160cttcaagaag agccatgtcc taaaagaaaa gaaacagtta ctcatctcca cgcaagactt
95220cagaatacag ttaatgattg tggaagtcgg ccagctctta tgggctacct ccgtgcagtt
95280gcccataatt tatccctgta atgcgctttt tcatatgttg aattttcttt ttagtttctt
95340tgggaaggtt ttgtttttct ttttcctttt ttaaagtttt tttcctcacc attttaaatc
95400atcagcatta ttttttacaa ctcactatgg tatgctacgt gtttcatctt tacatttcca
95460aaactagagg cataaattgt atgagctttt agagagttct aatttgtttt atgcactttt
95520tgttttttgc aaaattgact ccatggaagt gcattttaat aacactgacc ttgtgtgtaa
95580gcattgtgca tatatatgta aaaaccttaa aacttcctca ataaataaag ggatatcctt
95640tttctgcatc tgcatttgtg aaagataaca tttttcaata tctcagctct ttaggcaact
95700gtatatgact gtatatgtgg tggtgaccca ttgttggtat ttaattttaa tttttttagc
95760aaggggcgtc tcactatgtt gcccaggctg ggctctaacg atccttccac ctcagcttcc
95820tgagtagcgg ggactacagg cgtgagccac tgcattcagc tccactgtag gtttggatct
95880gtttcatcag aagacttagg ttgttcatca ggatatttca agtgaccaaa ctatataagt
95940gatttcttta tgaatacaat ttctctgctc ataactgtta tgcctgtgta actgtgggta
96000gcacacctga gtgcttatca ttgcaaaatt atgtatatac aattttattg tataaagtag
96060cccacaaagt gttctgttgt gtttttatgt ttctaaactc ttttaaaaat gtaaatatgt
96120tttaaagaac ttttagaatt atttttatca gaattatgta tttggggttt tgatctttca
96180ggattcaaca ctggggatta tggcatcaga aactatcttt tgggattctg gcccaaacct
96240ctagccttgt accctcgctc catagatctt ctaacagctc ctgggcttaa atatctcctc
96300ttttatgatg cttctcctga ttcaaccaac cagtggtgct tcccatagca ttttagaaag
96360acctgtttta tagcatttat caaggtattg acatttgttc agttgtctgc cccttccgtg
96420atctcagtag caaacttgta gtacaactaa tttaggttct ttatttttcc acttgcatcc
96480tgagcaacct caataaaagg tgcattttct ttcttttgta aacacaaagt gtagcattgt
96540ttggtctact tgattagagt tacaacctct gtacattctt gaaactaaga agaccagcag
96600acagagaata tctgtagaat cttctcagga attggtttca acataagggt ccaagacctt
96660ggcattgctg aggttttcag aattctatct tttaagttaa gcctgtgact atttactaat
96720ctgaaaatat ctttaacttc ttttttttaa tctgtaggca tttctaagaa gcacatgaca
96780ttcagctttg aatagaagat tcatttgaat gacagtttct ttcatttagt agaggggatt
96840ttactttaaa ttttgctgac aggttttcgt gacagccctc aaatggaagc ttgaattc
9689842129DNAXenopus sp.misc_feature(74)..(74)n = a, c, g, or t.
4gaagaagagg agaatggtag gacggagcca gcagctttat gacaaaatag gttgaaaatt
60gttgattgag gcanccgctg tgggatgtgc aaagattgta agctgttgtg agtggcagag
120atctctggcc ccatttggat ccccgtacct atgtttgtta cgtaattgtt ttactggcga
180acaaggtctc agttaaagga tgaagctcat tcggtgagac ttttattgac caaaacatag
240tcaatatttt tgttgtgatt tgcaccgaac tgcattgttc atcttcattc aatgcctaac
300atagaagcgg acagagttac aagcgttccc gaaaataacg actgcaaatc caagagtcag
360cctctgagga ataatctcca tgaaactgta aaatcctaca gtatacaagg tgctgccaca
420gcaaatatcg agcctcctgc agaaagacct tatccctggg gatgtcctgt gacacacaca
480aaggaaaagt tctataccat ctgcgcagac tacgcctttt tgaatcaagc aacatctctt
540tgcaaatcgt ctagttctgt ctgcagctca agctcagagg acaaatctgc tctgagcaat
600acaataaatt acattgatct tcagaccagc gaatcagatt ctgtatacaa cgaggatgca
660agcttggagt ctttatctag caatcttggt acacgtccgc ttgcctggga aattgataaa
720tcagacttca gcacaatgac ttccaagtta aaaagatcag gtgtaaaaaa acaaacacct
780aagaagaaac ctgacaggaa ggcaaaacca ttaagggact gtcctcaaca cttaatcctg
840gatgatgtta aacagcggaa agttctagac ctcagacgat ggtattgtat cagtcgaccc
900cagtacaaga cctcgtgtgg catttcttct ctggtatcat gctggaactt tctttacagt
960actctgggag caggaagcct cccaccaatt actcaagagg aagctttaca tattttgggc
1020tttcaaccac cctttgaaga gattaggttt ggtcctttta ctgggaacac aactttgatg
1080agatggttca gacaaatcaa tgatcatttc catgtaaaag gatgctccta tgttctgtat
1140aagccgcatg gcaagaacaa gacagcagga gaaactgctg ttggggcact atcaaagtta
1200acacaagggt taaaagaaga ctcaacagcc tacgtctatc attgtcagaa tcattatttt
1260tgcccaattg gttttgaggc aacccctgtg aaggcatcca aagcatacag gggccaactt
1320ttcccgcatg aagtggagta ctggatttta attggtgagc caagcagaaa acaccctaca
1380attcactgca aaaagtgggc agatattgtt actgacttaa atactcaaaa tccagaatat
1440tttgatatta gacacactga aagaggcctt cagtacagga aaacaaaaaa ggttggagga
1500aaccttcact gccttctggc atttcagaga ctcagctggc aaagatttgg tccatggccc
1560ttacagcttg gaacccttag gccagaaccc cagccacccg tacaaggaag aagaatccct
1620aaatctgaaa gtgaggataa tgtctccaag aaacagcatg ggcgtctggg gaggtcattt
1680agtgctggat ttcagcaaga gcttgcatgg aaaagaatgt gtaatatacg tgaacgcagg
1740ggcagtggct cacctgaaag tgatacggac tgagaaggaa atgattaaat tatacaaagt
1800cagtgttact tgtagttttg ggttcatggc actacgatta aactaaacat tagtcatata
1860atgctggaca tggttggcag acattatttg tagttttggg ttcatggcac taccattaag
1920ctaaacatta gtcatataat gctggacatg gttggtagac atctatagtg ctctcccatt
1980aatcataaaa cctttgcaaa ctttttacaa tcatttatga acttattgct caaatgccat
2040tcctgatcta cagtatactg ggtattgtat actgccatgt caggagtatt ttcattaatt
2100aataaaattg gagtttaaaa tcaaaaaaa
21295493PRTXenopus sp. 5Met Pro Asn Ile Glu Ala Asp Arg Val Thr Ser Val
Pro Glu Asn Asn1 5 10
15Asp Cys Lys Ser Lys Ser Gln Pro Leu Arg Asn Asn Leu His Glu Thr
20 25 30Val Lys Ser Tyr Ser Ile Gln
Gly Ala Ala Thr Ala Asn Ile Glu Pro 35 40
45Pro Ala Glu Arg Pro Tyr Pro Trp Gly Cys Pro Val Thr His Thr
Lys 50 55 60Glu Lys Phe Tyr Thr Ile
Cys Ala Asp Tyr Ala Phe Leu Asn Gln Ala65 70
75 80Thr Ser Leu Cys Lys Ser Ser Ser Ser Val Cys
Ser Ser Ser Ser Glu 85 90
95Asp Lys Ser Ala Leu Ser Asn Thr Ile Asn Tyr Ile Asp Leu Gln Thr
100 105 110Ser Glu Ser Asp Ser Val
Tyr Asn Glu Asp Ala Ser Leu Glu Ser Leu 115 120
125Ser Ser Asn Leu Gly Thr Arg Pro Leu Ala Trp Glu Ile Asp
Lys Ser 130 135 140Asp Phe Ser Thr Met
Thr Ser Lys Leu Lys Arg Ser Gly Val Lys Lys145 150
155 160Gln Thr Pro Lys Lys Lys Pro Asp Arg Lys
Ala Lys Pro Leu Arg Asp 165 170
175Cys Pro Gln His Leu Ile Leu Asp Asp Val Lys Gln Arg Lys Val Leu
180 185 190Asp Leu Arg Arg Trp
Tyr Cys Ile Ser Arg Pro Gln Tyr Lys Thr Ser 195
200 205Cys Gly Ile Ser Ser Leu Val Ser Cys Trp Asn Phe
Leu Tyr Ser Thr 210 215 220Leu Gly Ala
Gly Ser Leu Pro Pro Ile Thr Gln Glu Glu Ala Leu His225
230 235 240Ile Leu Gly Phe Gln Pro Pro
Phe Glu Glu Ile Arg Phe Gly Pro Phe 245
250 255Thr Gly Asn Thr Thr Leu Met Arg Trp Phe Arg Gln
Ile Asn Asp His 260 265 270Phe
His Val Lys Gly Cys Ser Tyr Val Leu Tyr Lys Pro His Gly Lys 275
280 285Asn Lys Thr Ala Gly Glu Thr Ala Val
Gly Ala Leu Ser Lys Leu Thr 290 295
300Gln Gly Leu Lys Glu Asp Ser Thr Ala Tyr Val Tyr His Cys Gln Asn305
310 315 320His Tyr Phe Cys
Pro Ile Gly Phe Glu Ala Thr Pro Val Lys Ala Ser 325
330 335Lys Ala Tyr Arg Gly Gln Leu Phe Pro His
Glu Val Glu Tyr Trp Ile 340 345
350Leu Ile Gly Glu Pro Ser Arg Lys His Pro Thr Ile His Cys Lys Lys
355 360 365Trp Ala Asp Ile Val Thr Asp
Leu Asn Thr Gln Asn Pro Glu Tyr Phe 370 375
380Asp Ile Arg His Thr Glu Arg Gly Leu Gln Tyr Arg Lys Thr Lys
Lys385 390 395 400Val Gly
Gly Asn Leu His Cys Leu Leu Ala Phe Gln Arg Leu Ser Trp
405 410 415Gln Arg Phe Gly Pro Trp Pro
Leu Gln Leu Gly Thr Leu Arg Pro Glu 420 425
430Pro Gln Pro Pro Val Gln Gly Arg Arg Ile Pro Lys Ser Glu
Ser Glu 435 440 445Asp Asn Val Ser
Lys Lys Gln His Gly Arg Leu Gly Arg Ser Phe Ser 450
455 460Ala Gly Phe Gln Gln Glu Leu Ala Trp Lys Arg Met
Cys Asn Ile Arg465 470 475
480Glu Arg Arg Gly Ser Gly Ser Pro Glu Ser Asp Thr Asp
485 49062841DNAGallus
sp.misc_feature(235)..(237)Translation initiation codon (ATG) 6ggacactgac
atggactgaa ggagtagaaa gcaggtgagc gctcgtcgtg gcttctcccc 60cccctgcgtc
gcgcactgcg tctgtttccg gcgcgggcac attccccgct ccgccgcggg 120cccgcgcagg
tacctcacct tgcagtaaca atggaggcag cgatgcaaac tatgtggtaa 180ttaaaaataa
gcaaaaccat tcctcgcagt acaagacgct tccaatatta ttcaatgcct 240cacatctcag
aagatgaaaa ggagaatggt tctggaaaca atggaaacac tgaaaagaaa 300cctgggaaag
aatcctcaga agcttctctt cgtgatccta taaagtcgta ctgcatctca 360gatgcctcca
ctgtgtcttt ggtgtccagg ggagatggac attacccatg gggatgtcct 420gtgactcaca
cacgagagaa attttatacc atttgctcag actatgcttt tttaaacaga 480gtaacatcta
tttgtaaaag cccaagtgct tcagttaacg cctgcctgtc aggcagtgct 540gccttaaacg
ttggaaataa cacacctagc ttactgggca ttcaaactgg tgcttcggag 600ataatctaca
gtgaagatgc taacttggaa accttgtctg gcagccttgg aaagcttcca 660ctggcatggg
aaattgacaa atcagaattc aacagcgtga ctgcgaatca taaaaacaaa 720gcaggcaaca
tgaagaaaca agtggcaaag aaaaagtcct cagacaaaaa aagcaaacag 780tacaaggagt
gtcctcagct gtctgctctt gaagatgtga aggagaggaa agtgttggac 840ctccgaagat
ggtactgtat tagccgacct cagtacaaga cttcttgtgg aatttcttca 900ttagtgtctt
gctggaattt cttatatagt acgctgggag ctggcagttt accacctatt 960actcaagaag
aagctttgca tatattgggt tttcaacccc catttgagga gatcaggttt 1020ggtcccttca
ctggaaatac gactttaatg agatggttta ggcaaataaa tgatcacttc 1080catatcaagg
gttgctcata tgttctgtat aaacctcatg gaaagaacaa gacagctgga 1140gaaactgctg
tgggggccct tgcaaagcta acacgtggac tgaaagatga atcaatggcc 1200tacatctacc
attgccaaaa ccattatttt tgcccaattg gatttgaagc aactccagta 1260aaagctagta
aagcgtatag aggtcgtgtt ttgcagcaag aagtagaata ctggatctta 1320attggagagc
cgagcagaaa acatccaacg atacactgta aaaggtggac agatattgtc 1380actgacctaa
acacccaaaa tccagagtac ctagatattc ggcacctaga gagaggactg 1440cagcatcgga
aaacaaagaa ggttggagga aatcttcatt gcatcatcgc ctttcagaga 1500cttaactggc
aaagatttgg tccttggaat attccatttg gaagtgtcag acaggataaa 1560caatcccaaa
cacaaggaca aggtattgcc aaatctgaga gtgaagacaa tatctctaaa 1620aaacaacatg
gacgactggg tcgatctttc agtgctggtt tccatcaaga atctacatgg 1680aaaaagtcta
gtcttcgtga gaggaggaac agcgggtatc agagctataa tgattatgat 1740ggagatgatt
agaattaact ttaggtaata gagtttatat atcaaagtta gttttaatca 1800acacagaata
ggggtttatt agtcctagga tacatgtgaa tagaaaatat ggcataagat 1860acagctttgt
aatccttaaa tcaattatga attatatggt tgcagtggat aaaagagcag 1920attgaaatta
gccaatgtaa taaacagatt tcattgaaaa tacttgatat tcagaagcat 1980gaaaatgtat
tatatgactt tataaaaagg gttatactgc atatggtgta aggataaaag 2040taaacatttg
ccttcctttt tagcactcca ttttgttaag gctgctgata tccagtgaga 2100agaaagaaat
tgaataggtt agaaaacctt gtcagattaa caaaattgaa tgtatattct 2160caatctagtt
gtcagtagaa ttctgtgagt cagataatcc tgttttgtag gtagatccca 2220gttatttttc
ccatagctag atacctgttt taaactgaga agaattgctg gtggcaagga 2280aggtttgaag
atggacattt actgcttttg ctctgtggat atggtagcag attttctatc 2340ctgtgagctc
tggtgagcag tgactgcata acacaggctt gtgaaaatca tttttataaa 2400gctgcattta
acctgagccc aatgaactgg ctgaacagtg tgttctgctg gcaattcttt 2460tccttgttca
gtctcaaaac tcctgttgtt tttgtgctgc tctcttgatt ttgtatgaag 2520gtgatgcaag
tgccgacaac tgctggcagc ccttatgata tacctctatg ccagcaaaca 2580atccaagtct
tttcaggtgt ccatgtgcag tttttttttt ttcctttctg gtttattcag 2640ttgtttgccc
aaatgcatct cgacagttgt aactttgtgt gcgaatgtcc acacctgctc 2700aaggattttt
ttttttttac ataaaacaat ttgtcatgta atgcagggtt tttgtaggtt 2760gatgctgttg
ttaaccaaaa atggagggag acttttggac tttcgttcat tcaataaaat 2820ttgttttatt
taaaaaaaaa a
284173038DNAGallus sp.misc_feature(235)..(237)Translation initiation
codon (ATG) 7ggacactgac atggactgaa ggagtagaaa gcaggtgagc gctcgtcgtg
gcttctcccc 60cccctgcgtc gcgcactgcg tctgtttccg gcgcgggcac attccccgct
ccgccgcggg 120cccgcgcagg tacctcacct tgcagtaaca atggaggcag cgatgcaaac
tatgtggtaa 180ttaaaaataa gcaaaaccat tcctcgcagt acaagacgct tccaatatta
ttcaatgcct 240cacatctcag aagatgaaaa ggagaatggt tctggaaaca atggaaacac
tgaaaagaaa 300cctgggaaag aatcctcaga agcttctctt cgtgatccta taaagtcgta
ctgcatctca 360gatgcctcca ctgtgtcttt ggtgtccagg ggagatggac attacccatg
gggatgtcct 420gtgactcaca cacgagagaa attttatacc atttgctcag actatgcttt
tttaaacaga 480gtaacatcta tttgtaaaag cccaagtgct tcagttaacg cctgcctgtc
aggcagtgct 540gccttaaacg ttggaaataa cacacctagc ttactgggca ttcaaactgg
tgcttcggag 600ataatctaca gtgaagatgc taacttggaa accttgtctg gcagccttgg
aaagcttcca 660ctggcatggg aaattgacaa atcagaattc aacagcgtga ctgcgaatca
taaaaacaaa 720gcaggcaaca tgaagaaaca agtggcaaag aaaaagtcct cagacaaaaa
aagcaaacag 780tacaaggagt gtcctcagct gtctgctctt gaagatgtga aggagaggaa
agtgttggac 840ctccgaagat ggtactgtat tagccgacct cagtacaaga cttcttgtgg
aatttcttca 900ttagtgtctt gctggaattt cttatatagt acgctgggag ctggcagttt
accacctatt 960actcaagaag aagctttgca tatattgggt tttcaacccc catttgagga
gatcaggttt 1020ggtcccttca ctggaaatac gactttaatg agatggttta ggcaaataaa
tgatcacttc 1080catatcaagg gttgctcata tgttctgtat aaacctcatg gaaagaacaa
gacagctgga 1140gaaactgctg tgggggccct tgcaaagcta acacgtggac tgaaagatga
atcaatggcc 1200tacatctacc attgccaaaa ccattatttt tgcccaattg gatttgaagc
aactccagta 1260aaagctagta aagcgtatag gttgctggat ttggactcgg gagacctggg
ttcggttccc 1320agttcaaccg cagacttcca ttgtgatttt agaggtcgtg ttttgcagca
agaagtagaa 1380tactggatct taattggaga gccgagcaga aaacatccaa cgatacactg
taaaaggtgg 1440acagatattg tcactgacct aaacacccaa aatccagagt acctagatat
tcggcaccta 1500gagagaggac tgcagcatcg gaaaacaaag aaggttggag gaaatcttca
ttgcatcatc 1560gcctttcaga gacttaactg gcaaagattt ggtccttgga atattccatt
tggaagtgtc 1620agacaggata aacaatccca aacacaagga caaggtattg ccaaatctga
gagtgaagac 1680aatatctcta aaaaacaaca tggacgactg ggtcgatctt tcagtgctgg
tttccatcaa 1740gaatctacat ggaaaaagtc tagtcttcgt gagaggagga acagcgggta
tcagagctat 1800aatgattatg atggagatga ttagaattaa ctttaggtaa tagagtttat
atatcaaagt 1860tagttttaat caacacagaa taggggttta ttagtcctag gatacatgtg
aatagaaaat 1920atggcataag atacagcttt gtaatcctta aatcaattat gaattatatg
gttgcagtgg 1980atgacatctg atacatgaac tgacagataa gcacagatta ttgtactttt
gtaatcaaaa 2040gcagatatga cagctaaatc aatcacttat tttgaagtta ctatactata
tcctgatctg 2100tgagaataaa agagcagatt gaaattagcc aatgtaataa acagatttca
ttgaaaatac 2160ttgatattca gaagcatgaa aatgtattat atgactttat aaaaagggtt
atactgcata 2220tggtgtaagg ataaaagtaa acatttgcct tcctttttag cactccattt
tgttaaggct 2280gctgatatcc agtgagaaga aagaaattga ataggttaga aaaccttgtc
agattaacaa 2340aattgaatgt atattctcaa tctagttgtc agtagaattc tgtgagtcag
ataatcctgt 2400tttgtaggta gatcccagtt atttttccca tagctagata cctgttttaa
actgagaaga 2460attgctggtg gcaaggaagg tttgaagatg gacatttact gcttttgctc
tgtggatatg 2520gtagcagatt ttctatcctg tgagctctgg tgagcagtga ctgcataaca
caggcttgtg 2580aaaatcattt ttataaagct gcatttaacc tgagcccaat gaactggctg
aacagtgtgt 2640tctgctggca attcttttcc ttgttcagtc tcaaaactcc tgttgttttt
gtgctgctct 2700cttgattttg tatgaaggtg atgcaagtgc cgacaactgc tggcagccct
tatgatatac 2760ctctatgcca gcaaacaatc caagtctttt caggtgtcca tgtgcagttt
tttttttttc 2820ctttctggtt tattcagttg tttgcccaaa tgcatctcga cagttgtaac
tttgtgtgcg 2880aatgtccaca cctgctcaag gatttttttt tttttacata aaacaatttg
tcatgtaatg 2940cagggttttt gtaggttgat gctgttgtta accaaaaatg gagggagact
tttggacttt 3000cgttcattca ataaaatttg ttttatttaa aaaaaaaa
30388505PRTGallus sp. 8Met Pro His Ile Ser Glu Asp Glu Lys Glu
Asn Gly Ser Gly Asn Asn1 5 10
15Gly Asn Thr Glu Lys Lys Pro Gly Lys Glu Ser Ser Glu Ala Ser Leu
20 25 30Arg Asp Pro Ile Lys Ser
Tyr Cys Ile Ser Asp Ala Ser Thr Val Ser 35 40
45Leu Val Ser Arg Gly Asp Gly His Tyr Pro Trp Gly Cys Pro
Val Thr 50 55 60His Thr Arg Glu Lys
Phe Tyr Thr Ile Cys Ser Asp Tyr Ala Phe Leu65 70
75 80Asn Arg Val Thr Ser Ile Cys Lys Ser Pro
Ser Ala Ser Val Asn Ala 85 90
95Cys Leu Ser Gly Ser Ala Ala Leu Asn Val Gly Asn Asn Thr Pro Ser
100 105 110Leu Leu Gly Ile Gln
Thr Gly Ala Ser Glu Ile Ile Tyr Ser Glu Asp 115
120 125Ala Asn Leu Glu Thr Leu Ser Gly Ser Leu Gly Lys
Leu Pro Leu Ala 130 135 140Trp Glu Ile
Asp Lys Ser Glu Phe Asn Ser Val Thr Ala Asn His Lys145
150 155 160Asn Lys Ala Gly Asn Met Lys
Lys Gln Val Ala Lys Lys Lys Ser Ser 165
170 175Asp Lys Lys Ser Lys Gln Tyr Lys Glu Cys Pro Gln
Leu Ser Ala Leu 180 185 190Glu
Asp Val Lys Glu Arg Lys Val Leu Asp Leu Arg Arg Trp Tyr Cys 195
200 205Ile Ser Arg Pro Gln Tyr Lys Thr Ser
Cys Gly Ile Ser Ser Leu Val 210 215
220Ser Cys Trp Asn Phe Leu Tyr Ser Thr Leu Gly Ala Gly Ser Leu Pro225
230 235 240Pro Ile Thr Gln
Glu Glu Ala Leu His Ile Leu Gly Phe Gln Pro Pro 245
250 255Phe Glu Glu Ile Arg Phe Gly Pro Phe Thr
Gly Asn Thr Thr Leu Met 260 265
270Arg Trp Phe Arg Gln Ile Asn Asp His Phe His Ile Lys Gly Cys Ser
275 280 285Tyr Val Leu Tyr Lys Pro His
Gly Lys Asn Lys Thr Ala Gly Glu Thr 290 295
300Ala Val Gly Ala Leu Ala Lys Leu Thr Arg Gly Leu Lys Asp Glu
Ser305 310 315 320Met Ala
Tyr Ile Tyr His Cys Gln Asn His Tyr Phe Cys Pro Ile Gly
325 330 335Phe Glu Ala Thr Pro Val Lys
Ala Ser Lys Ala Tyr Arg Gly Arg Val 340 345
350Leu Gln Gln Glu Val Glu Tyr Trp Ile Leu Ile Gly Glu Pro
Ser Arg 355 360 365Lys His Pro Thr
Ile His Cys Lys Arg Trp Thr Asp Ile Val Thr Asp 370
375 380Leu Asn Thr Gln Asn Pro Glu Tyr Leu Asp Ile Arg
His Leu Glu Arg385 390 395
400Gly Leu Gln His Arg Lys Thr Lys Lys Val Gly Gly Asn Leu His Cys
405 410 415Ile Ile Ala Phe Gln
Arg Leu Asn Trp Gln Arg Phe Gly Pro Trp Asn 420
425 430Ile Pro Phe Gly Ser Val Arg Gln Asp Lys Gln Ser
Gln Thr Gln Gly 435 440 445Gln Gly
Ile Ala Lys Ser Glu Ser Glu Asp Asn Ile Ser Lys Lys Gln 450
455 460His Gly Arg Leu Gly Arg Ser Phe Ser Ala Gly
Phe His Gln Glu Ser465 470 475
480Thr Trp Lys Lys Ser Ser Leu Arg Glu Arg Arg Asn Ser Gly Tyr Gln
485 490 495Ser Tyr Asn Asp
Tyr Asp Gly Asp Asp 500 5059529PRTGallus sp.
9Met Pro His Ile Ser Glu Asp Glu Lys Glu Asn Gly Ser Gly Asn Asn1
5 10 15Gly Asn Thr Glu Lys Lys
Pro Gly Lys Glu Ser Ser Glu Ala Ser Leu 20 25
30Arg Asp Pro Ile Lys Ser Tyr Cys Ile Ser Asp Ala Ser
Thr Val Ser 35 40 45Leu Val Ser
Arg Gly Asp Gly His Tyr Pro Trp Gly Cys Pro Val Thr 50
55 60His Thr Arg Glu Lys Phe Tyr Thr Ile Cys Ser Asp
Tyr Ala Phe Leu65 70 75
80Asn Arg Val Thr Ser Ile Cys Lys Ser Pro Ser Ala Ser Val Asn Ala
85 90 95Cys Leu Ser Gly Ser Ala
Ala Leu Asn Val Gly Asn Asn Thr Pro Ser 100
105 110Leu Leu Gly Ile Gln Thr Gly Ala Ser Glu Ile Ile
Tyr Ser Glu Asp 115 120 125Ala Asn
Leu Glu Thr Leu Ser Gly Ser Leu Gly Lys Leu Pro Leu Ala 130
135 140Trp Glu Ile Asp Lys Ser Glu Phe Asn Ser Val
Thr Ala Asn His Lys145 150 155
160Asn Lys Ala Gly Asn Met Lys Lys Gln Val Ala Lys Lys Lys Ser Ser
165 170 175Asp Lys Lys Ser
Lys Gln Tyr Lys Glu Cys Pro Gln Leu Ser Ala Leu 180
185 190Glu Asp Val Lys Glu Arg Lys Val Leu Asp Leu
Arg Arg Trp Tyr Cys 195 200 205Ile
Ser Arg Pro Gln Tyr Lys Thr Ser Cys Gly Ile Ser Ser Leu Val 210
215 220Ser Cys Trp Asn Phe Leu Tyr Ser Thr Leu
Gly Ala Gly Ser Leu Pro225 230 235
240Pro Ile Thr Gln Glu Glu Ala Leu His Ile Leu Gly Phe Gln Pro
Pro 245 250 255Phe Glu Glu
Ile Arg Phe Gly Pro Phe Thr Gly Asn Thr Thr Leu Met 260
265 270Arg Trp Phe Arg Gln Ile Asn Asp His Phe
His Ile Lys Gly Cys Ser 275 280
285Tyr Val Leu Tyr Lys Pro His Gly Lys Asn Lys Thr Ala Gly Glu Thr 290
295 300Ala Val Gly Ala Leu Ala Lys Leu
Thr Arg Gly Leu Lys Asp Glu Ser305 310
315 320Met Ala Tyr Ile Tyr His Cys Gln Asn His Tyr Phe
Cys Pro Ile Gly 325 330
335Phe Glu Ala Thr Pro Val Lys Ala Ser Lys Ala Tyr Arg Leu Leu Asp
340 345 350Leu Asp Ser Gly Asp Leu
Gly Ser Val Pro Ser Ser Thr Ala Asp Phe 355 360
365His Cys Asp Phe Arg Gly Arg Val Leu Gln Gln Glu Val Glu
Tyr Trp 370 375 380Ile Leu Ile Gly Glu
Pro Ser Arg Lys His Pro Thr Ile His Cys Lys385 390
395 400Arg Trp Thr Asp Ile Val Thr Asp Leu Asn
Thr Gln Asn Pro Glu Tyr 405 410
415Leu Asp Ile Arg His Leu Glu Arg Gly Leu Gln His Arg Lys Thr Lys
420 425 430Lys Val Gly Gly Asn
Leu His Cys Ile Ile Ala Phe Gln Arg Leu Asn 435
440 445Trp Gln Arg Phe Gly Pro Trp Asn Ile Pro Phe Gly
Ser Val Arg Gln 450 455 460Asp Lys Gln
Ser Gln Thr Gln Gly Gln Gly Ile Ala Lys Ser Glu Ser465
470 475 480Glu Asp Asn Ile Ser Lys Lys
Gln His Gly Arg Leu Gly Arg Ser Phe 485
490 495Ser Ala Gly Phe His Gln Glu Ser Thr Trp Lys Lys
Ser Ser Leu Arg 500 505 510Glu
Arg Arg Asn Ser Gly Tyr Gln Ser Tyr Asn Asp Tyr Asp Gly Asp 515
520 525Asp 101946DNADanio
reriomisc_feature(165)..(167)Translation initiation codon (ATG)
10cgttgaaaca gctaggcagt cagagaggtc gcgcagcgca ttttatataa aatattcttg
60ctgaacttgc gagtcggtgg attattttgg gactgagtca tcatatctgc ggtacacctg
120acctgttggt tattctctta accatcatcc cagttctcca ttcaatgcct aacactgtgg
180aaagtgaagg cgccaaggta tccgctagta cagatcagga ggccccatca cgggccccgg
240gacgagagga tgaacgtgag cgcagcttcc tgagccccat gatgcgagat gctctgcggg
300tacgacgggc ctccagcgca gagctccagc ttccatggac gtgccctgta acccactcca
360gggagaagtt ctacaccgtc tgctcggact atgccctgct caaccgagct cgaccagtta
420tcacatccga agatgcatca cagaccaatc ctgacagcgg gacatcatta gccaagagca
480acacagcaac atcttctcag agtcactcag ggggaataag cgtatcttta gatgggaact
540gtgatatgga ggttgtgtcc tccagcaaca agcctgtgct ggcctgggag attgacacct
600cagatttcga tgccgtttta acccggaaag ccagaacaag taatttgaag aaattcaaca
660ctaagaaaat gaaatcatct gacaggccaa gcagaaacct gcaagatgtc ccgccacaag
720cctctctaga tgaaatcaaa cagagaaaag tgctggacct ccgtagatgg tactgcatca
780gccgaccaca gtataaaaca tcatgtggaa tctcttcact tgtttcttgc tggaactttc
840tctacagtac tctcggagca ggcagtctcc cacctatttc tcaagaagaa gctctgcata
900tacttggatt tcagcctccg tttgaagata tcaaatttgg accatttact ggcaatgcca
960ctttaatgag atggttcaga caaatcaatg ataattttcg tgttcggggt tgctcatata
1020ttctgtacaa gcctcatggg aagcacaaga cagcaggaga gacagccgag ggggcgctca
1080tgaagcttac acagggtctt aaagacgaat ccatggccta catttatcac tgtcagaatc
1140actacttctg tcctgtgggc tatgaagcta ctccactgaa agcagccaaa gcatacaggg
1200gaccactgcc tcttaatgag atggagcact ggattctcat tggtgaacca agccggaaac
1260atcctgcaat ccactgtaaa aaatgggcag acatcgtgac ggacctaaat actcagaacc
1320cagaatactt agacattcgc catattgaga gaggcataca gtatcgcaaa accaagaagg
1380ttggaggcaa tctgcattgc atcatggcct tccagagagt gaactggcaa aaattgggac
1440catgggcgct gaatctggaa aacctgaggc atgatctcca tcatcaggct ccagaacaca
1500gaggccaagc ttcaacagag gacagttctg aggagcgaac ggtgaaacgc ctgggtaggt
1560ctctcagcac ggggaacaag cctgaaaatg cctggaagcg tttgtccaac acagccgagt
1620acaggcacag aggctctcca gacagtgacc tggatgaaga catcactgac taaatatgaa
1680gggccaggtg ggtttcgaca cttttattca agattattaa ccttccaggt tattagctat
1740agttaaaggt tacaatccgg tatgaggttg tgatgtaaga gttagtgctc agactggtaa
1800acttaaaaat ggaagtttga cgccaataag aatatgggaa agagctcttg tggaggacat
1860ctgtgtaata ctgacagcaa tgtgaattaa gttacactgg ctttggtgat gtgccgataa
1920ataaaggttt aaaatactaa aaaaaa
194611502PRTDanio rerio 11Met Pro Asn Thr Val Glu Ser Glu Gly Ala Lys Val
Ser Ala Ser Thr1 5 10
15Asp Gln Glu Ala Pro Ser Arg Ala Pro Gly Arg Glu Asp Glu Arg Glu
20 25 30Arg Ser Phe Leu Ser Pro Met
Met Arg Asp Ala Leu Arg Val Arg Arg 35 40
45Ala Ser Ser Ala Glu Leu Gln Leu Pro Trp Thr Cys Pro Val Thr
His 50 55 60Ser Arg Glu Lys Phe Tyr
Thr Val Cys Ser Asp Tyr Ala Leu Leu Asn65 70
75 80Arg Ala Arg Pro Val Ile Thr Ser Glu Asp Ala
Ser Gln Thr Asn Pro 85 90
95Asp Ser Gly Thr Ser Leu Ala Lys Ser Asn Thr Ala Thr Ser Ser Gln
100 105 110Ser His Ser Gly Gly Ile
Ser Val Ser Leu Asp Gly Asn Cys Asp Met 115 120
125Glu Val Val Ser Ser Ser Asn Lys Pro Val Leu Ala Trp Glu
Ile Asp 130 135 140Thr Ser Asp Phe Asp
Ala Val Leu Thr Arg Lys Ala Arg Thr Ser Asn145 150
155 160Leu Lys Lys Phe Asn Thr Lys Lys Met Lys
Ser Ser Asp Arg Pro Ser 165 170
175Arg Asn Leu Gln Asp Val Pro Pro Gln Ala Ser Leu Asp Glu Ile Lys
180 185 190Gln Arg Lys Val Leu
Asp Leu Arg Arg Trp Tyr Cys Ile Ser Arg Pro 195
200 205Gln Tyr Lys Thr Ser Cys Gly Ile Ser Ser Leu Val
Ser Cys Trp Asn 210 215 220Phe Leu Tyr
Ser Thr Leu Gly Ala Gly Ser Leu Pro Pro Ile Ser Gln225
230 235 240Glu Glu Ala Leu His Ile Leu
Gly Phe Gln Pro Pro Phe Glu Asp Ile 245
250 255Lys Phe Gly Pro Phe Thr Gly Asn Ala Thr Leu Met
Arg Trp Phe Arg 260 265 270Gln
Ile Asn Asp Asn Phe Arg Val Arg Gly Cys Ser Tyr Ile Leu Tyr 275
280 285Lys Pro His Gly Lys His Lys Thr Ala
Gly Glu Thr Ala Glu Gly Ala 290 295
300Leu Met Lys Leu Thr Gln Gly Leu Lys Asp Glu Ser Met Ala Tyr Ile305
310 315 320Tyr His Cys Gln
Asn His Tyr Phe Cys Pro Val Gly Tyr Glu Ala Thr 325
330 335Pro Leu Lys Ala Ala Lys Ala Tyr Arg Gly
Pro Leu Pro Leu Asn Glu 340 345
350Met Glu His Trp Ile Leu Ile Gly Glu Pro Ser Arg Lys His Pro Ala
355 360 365Ile His Cys Lys Lys Trp Ala
Asp Ile Val Thr Asp Leu Asn Thr Gln 370 375
380Asn Pro Glu Tyr Leu Asp Ile Arg His Ile Glu Arg Gly Ile Gln
Tyr385 390 395 400Arg Lys
Thr Lys Lys Val Gly Gly Asn Leu His Cys Ile Met Ala Phe
405 410 415Gln Arg Val Asn Trp Gln Lys
Leu Gly Pro Trp Ala Leu Asn Leu Glu 420 425
430Asn Leu Arg His Asp Leu His His Gln Ala Pro Glu His Arg
Gly Gln 435 440 445Ala Ser Thr Glu
Asp Ser Ser Glu Glu Arg Thr Val Lys Arg Leu Gly 450
455 460Arg Ser Leu Ser Thr Gly Asn Lys Pro Glu Asn Ala
Trp Lys Arg Leu465 470 475
480Ser Asn Thr Ala Glu Tyr Arg His Arg Gly Ser Pro Asp Ser Asp Leu
485 490 495Asp Glu Asp Ile Thr
Asp 500122062DNAStrongylocentrotus
purpuratusmisc_feature(164)..(166)translation initiation codon (ATG)
12gcagtttgtg tgtgattctc aatctcattg tgcgcattat aggcctatag ctttcgggaa
60aacagacaga ttagtgcttt gcaaaccttg catacattga ggcaaccaga aagttgggct
120caactttcag ataacctttg acctttcctg ggcctgaggt ataatgggta actggccttc
180agttctctct ggtgagggaa gtgaggacag cagcagcgag agcaacaacg aaagcaacaa
240ccaggaaacc agtgatcagg aaaacacaag acatcatctc tgtggctcag aggagagcta
300cttctccgag gaggaactcc ttcccattgt ctaccctgat gatgatgatg atgctgctgc
360tcgtgatgac gtgttgggag acttcttgtc cgttaaagaa gatggagagt ttacaactga
420cgaggttgat gggtctcgat atgacctagc acccgagtat taccccacct ctcttcatga
480agacgtcact gcgagattct cagatcttgc ctcacctgta gatcgcaaag aaagcagcta
540cagcagcact gacgactatg atgacaatga cagtgatgat gaggaggagg aggaggatga
600ccactattac caaagaagga ggaatgataa atattcccta atgaaggaag acgatgatga
660taatgagctc tccagcattc cactgccacc tccctcatca ctgtatgaag ttgcatcagc
720tgagcagatg caaggggtca cagcttacct gaatgctgac cgacctgaca cactccaaga
780aaccatcgtc ccttttgaga gtcgtgcaga agagtgcagt gcccctgaga gggtggttgc
840atgggagata gacgtcagcg acatgacggg atccaagaag actaagaaga gaccacccaa
900taaactttca aaggcaaaat caaggaaaag ttcatcgaaa ggtagcatgg atagtgccta
960tatcccgcca actgtatcaa caacacctga gctcctagca cagagaaagt gcttggacca
1020aaagagatgg ttttgtgtga gtagacccca gtacagcaag tcatgtggcc tatcgtcctt
1080ggtttcttgc tggaactacc tgttcagtac cctaggaggg ggcaccatgc cccccatcac
1140ccaggagcaa gcccttaacg tcctggggtt ccaaccaccc ttcggtgaga tccgttttgg
1200gcctttcaca gggaatgcca ccctcatgag gtggttcaag cagctgaatg atcactacag
1260agtgagagga agggcatact tccagtacaa accccatggc aggagtagaa cagtgggaag
1320aacatctgcc caaggtttac atctgttacg acaagggttg aaggatccta acatggcttt
1380catataccac tgccataacc actacttctg ccccattgga tacgaagatg tgcctctgaa
1440ggctgtagat gcatacaggg atcctttaaa ccttgatgag gtagagacat ggatactgat
1500cggtgatcct agtagaaagc aaccaggaat ccactgtttc aaatgggaag acatcagcac
1560agatctgaac tgccagaacc ctgactatct caacatccgc aagctacggc ttggagtgca
1620gcagaggagg acaaagagaa ccggtggcaa cttgcactgc atcatggcct tctgtcgcag
1680tgcaggcttt ctcaccagac caaccaagag caagaaagag ggtgcaatga aggacacttc
1740tagtaacagc aagagtagga agtctggctc cgttcggatg tcaggacgta aggttggcga
1800gagtaagagt gaggggatgg tggggcgtcc agctccagga gggagtgtgc catgtctgca
1860gactggcaaa gcggacagta gcgatatcat cgagcacttt gcttttgaga ctgtgagttg
1920cgaccatagc agtgagggcc gaagctgtag atcagaagtt gttaaaaaga ctaaaagtga
1980atctcaggtt ggcagacgaa gggcaaaggc atctgttgta aagcaggagg ataaggagat
2040cagagtgaag agttctgagg ca
206213633PRTStrongylocentrotus purpuratus 13Met Gly Asn Trp Pro Ser Val
Leu Ser Gly Glu Gly Ser Glu Asp Ser1 5 10
15Ser Ser Glu Ser Asn Asn Glu Ser Asn Asn Gln Glu Thr
Ser Asp Gln 20 25 30Glu Asn
Thr Arg His His Leu Cys Gly Ser Glu Glu Ser Tyr Phe Ser 35
40 45Glu Glu Glu Leu Leu Pro Ile Val Tyr Pro
Asp Asp Asp Asp Asp Ala 50 55 60Ala
Ala Arg Asp Asp Val Leu Gly Asp Phe Leu Ser Val Lys Glu Asp65
70 75 80Gly Glu Phe Thr Thr Asp
Glu Val Asp Gly Ser Arg Tyr Asp Leu Ala 85
90 95Pro Glu Tyr Tyr Pro Thr Ser Leu His Glu Asp Val
Thr Ala Arg Phe 100 105 110Ser
Asp Leu Ala Ser Pro Val Asp Arg Lys Glu Ser Ser Tyr Ser Ser 115
120 125Thr Asp Asp Tyr Asp Asp Asn Asp Ser
Asp Asp Glu Glu Glu Glu Glu 130 135
140Asp Asp His Tyr Tyr Gln Arg Arg Arg Asn Asp Lys Tyr Ser Leu Met145
150 155 160Lys Glu Asp Asp
Asp Asp Asn Glu Leu Ser Ser Ile Pro Leu Pro Pro 165
170 175Pro Ser Ser Leu Tyr Glu Val Ala Ser Ala
Glu Gln Met Gln Gly Val 180 185
190Thr Ala Tyr Leu Asn Ala Asp Arg Pro Asp Thr Leu Gln Glu Thr Ile
195 200 205Val Pro Phe Glu Ser Arg Ala
Glu Glu Cys Ser Ala Pro Glu Arg Val 210 215
220Val Ala Trp Glu Ile Asp Val Ser Asp Met Thr Gly Ser Lys Lys
Thr225 230 235 240Lys Lys
Arg Pro Pro Asn Lys Leu Ser Lys Ala Lys Ser Arg Lys Ser
245 250 255Ser Ser Lys Gly Ser Met Asp
Ser Ala Tyr Ile Pro Pro Thr Val Ser 260 265
270Thr Thr Pro Glu Leu Leu Ala Gln Arg Lys Cys Leu Asp Gln
Lys Arg 275 280 285Trp Phe Cys Val
Ser Arg Pro Gln Tyr Ser Lys Ser Cys Gly Leu Ser 290
295 300Ser Leu Val Ser Cys Trp Asn Tyr Leu Phe Ser Thr
Leu Gly Gly Gly305 310 315
320Thr Met Pro Pro Ile Thr Gln Glu Gln Ala Leu Asn Val Leu Gly Phe
325 330 335Gln Pro Pro Phe Gly
Glu Ile Arg Phe Gly Pro Phe Thr Gly Asn Ala 340
345 350Thr Leu Met Arg Trp Phe Lys Gln Leu Asn Asp His
Tyr Arg Val Arg 355 360 365Gly Arg
Ala Tyr Phe Gln Tyr Lys Pro His Gly Arg Ser Arg Thr Val 370
375 380Gly Arg Thr Ser Ala Gln Gly Leu His Leu Leu
Arg Gln Gly Leu Lys385 390 395
400Asp Pro Asn Met Ala Phe Ile Tyr His Cys His Asn His Tyr Phe Cys
405 410 415Pro Ile Gly Tyr
Glu Asp Val Pro Leu Lys Ala Val Asp Ala Tyr Arg 420
425 430Asp Pro Leu Asn Leu Asp Glu Val Glu Thr Trp
Ile Leu Ile Gly Asp 435 440 445Pro
Ser Arg Lys Gln Pro Gly Ile His Cys Phe Lys Trp Glu Asp Ile 450
455 460Ser Thr Asp Leu Asn Cys Gln Asn Pro Asp
Tyr Leu Asn Ile Arg Lys465 470 475
480Leu Arg Leu Gly Val Gln Gln Arg Arg Thr Lys Arg Thr Gly Gly
Asn 485 490 495Leu His Cys
Ile Met Ala Phe Cys Arg Ser Ala Gly Phe Leu Thr Arg 500
505 510Pro Thr Lys Ser Lys Lys Glu Gly Ala Met
Lys Asp Thr Ser Ser Asn 515 520
525Ser Lys Ser Arg Lys Ser Gly Ser Val Arg Met Ser Gly Arg Lys Val 530
535 540Gly Glu Ser Lys Ser Glu Gly Met
Val Gly Arg Pro Ala Pro Gly Gly545 550
555 560Ser Val Pro Cys Leu Gln Thr Gly Lys Ala Asp Ser
Ser Asp Ile Ile 565 570
575Glu His Phe Ala Phe Glu Thr Val Ser Cys Asp His Ser Ser Glu Gly
580 585 590Arg Ser Cys Arg Ser Glu
Val Val Lys Lys Thr Lys Ser Glu Ser Gln 595 600
605Val Gly Arg Arg Arg Ala Lys Ala Ser Val Val Lys Gln Glu
Asp Lys 610 615 620Glu Ile Arg Val Lys
Ser Ser Glu Ala625 630142031DNAGiardia
lambliamisc_feature(230)..(232)Translation initiation codon (ATG)
14gcacatcttg caggtcaaaa cgaacacccc ctccttcgat atcctctcag accctacact
60ctcaattgtg ttacagaccg ggcatgggaa gaacttgcta cggccggctt tcttaggggg
120cgccgccctt gtcctcttct tctttcccat cctcctgtcc tctttttgtg actgtttgtg
180actagacgcc gtttctaaca aaattgccaa gcatgtatgc aaaattaaaa tggaaagata
240ccccagacaa cggttagacg acggcaggtg gcagtgcgtg gcagcgcagt acagatactc
300ctgcgccatc tcatgccttg tgagcatatt caatcatctc ttcaacagag acatgaccct
360ggacgagtgt attgctattc tctttccaga cctgaaagaa gacccacgac actatgattt
420tggacctcag gcttctaaca gtgctgttca aagctggttc aagaccctct gcatgcacta
480tggcctttct ggcacctctt gcacgatata caaggagcag ggcagaacga gaactgcgtg
540tagcaagcaa gaggcactta agaatatcat cactgctttg aatacgccaa gatgtgcgtt
600actgtatcac tgcttgaacc attactgcat aatcgtaggc tatataataa gtccatctac
660gcctaataga ccaagtaatc attgcgtctt cagcggggat gatggatgca ccctcaagct
720cctgtgtgca gacggcacag aagccgagga cgtggacgat agtaatattt ggttaatagt
780ggcagactgt gggaaaggaa ctgctcccct taggtcactg acctgggaat ttgtacataa
840agatatatct acccgacctc cgtatgcata taacgctagg tgccctgaga gaggactgct
900aaggaaaaca gaatcaaagg gatatatacc agttgagata gactcagtgc ttgttaacag
960cacgggagta tccacctgtg ttagatctgg tggcgtcatc aagggatcgt cgcactgcat
1020cattggattt gttagtgact agagccccgt ttattactcc cggacgaaag tataactatt
1080aacaccacaa gcacaacgat agctccagta gagcagagcc gaagcacttg aggcagcgag
1140gcctccaaat acccacatag aacgtcacag atgatagctg tccatgtcgc aattgacaag
1200gttaacggga aggttgaaac aggcgagggc gtccatctgg tacgttgtac tttggttgtt
1260gaatattgaa ctgttgtaag tgttgatttg ctgggtatat ctattgctta tgtaccgaaa
1320aagggcattg caaacgtcat atattgcatc tatctgatga acacagaccc cagttttttg
1380aagatttgca agtcttcttt gtggtggggc attcatatat gaataagagc agacttctcc
1440gcaggcaaag gacatggact gaatggcatg ctcgtaacca gttaggtcca gtgctttggt
1500tcgtgcatag tatttaaaga ccttctgaag aaggatggtt tgaaataggg tcgtcctgtc
1560cacacagtcc aggcagttta tccgcggata gcacttctga acaaagtcag gaagagcaac
1620tccgacatca ccgctaggaa ctagaactgt gcttgtggct atgtcatctg ctaactggtg
1680atactctgtg ttgctgtgtc tacgtatgtt gtagttcatc aacttaacgt tgagggagtt
1740cttgcggcga gaatcagcag tttttctcat agactcggta aagaacgccg tcagagccgc
1800tcatcggcgg tctcaaggct tttcttttca ctggcagcaa tggagtcatc caaaagatcg
1860acttcatttt tgaggaggtt gacgataagt atctctgcgt ctgcagtcac taagttaccc
1920aatagaaggc ttatatgcct ttgcaagaga ctactaaact gagcgaggcc ctgctcttca
1980tgagccccat ctgggaagcg tatggcagga gtgaacttgt aagtaaaaaa a
203115270PRTGiardia lamblia 15Met Glu Arg Tyr Pro Arg Gln Arg Leu Asp Asp
Gly Arg Trp Gln Cys1 5 10
15Val Ala Ala Gln Tyr Arg Tyr Ser Cys Ala Ile Ser Cys Leu Val Ser
20 25 30Ile Phe Asn His Leu Phe Asn
Arg Asp Met Thr Leu Asp Glu Cys Ile 35 40
45Ala Ile Leu Phe Pro Asp Leu Lys Glu Asp Pro Arg His Tyr Asp
Phe 50 55 60Gly Pro Gln Ala Ser Asn
Ser Ala Val Gln Ser Trp Phe Lys Thr Leu65 70
75 80Cys Met His Tyr Gly Leu Ser Gly Thr Ser Cys
Thr Ile Tyr Lys Glu 85 90
95Gln Gly Arg Thr Arg Thr Ala Cys Ser Lys Gln Glu Ala Leu Lys Asn
100 105 110Ile Ile Thr Ala Leu Asn
Thr Pro Arg Cys Ala Leu Leu Tyr His Cys 115 120
125Leu Asn His Tyr Cys Ile Ile Val Gly Tyr Ile Ile Ser Pro
Ser Thr 130 135 140Pro Asn Arg Pro Ser
Asn His Cys Val Phe Ser Gly Asp Asp Gly Cys145 150
155 160Thr Leu Lys Leu Leu Cys Ala Asp Gly Thr
Glu Ala Glu Asp Val Asp 165 170
175Asp Ser Asn Ile Trp Leu Ile Val Ala Asp Cys Gly Lys Gly Thr Ala
180 185 190Pro Leu Arg Ser Leu
Thr Trp Glu Phe Val His Lys Asp Ile Ser Thr 195
200 205Arg Pro Pro Tyr Ala Tyr Asn Ala Arg Cys Pro Glu
Arg Gly Leu Leu 210 215 220Arg Lys Thr
Glu Ser Lys Gly Tyr Ile Pro Val Glu Ile Asp Ser Val225
230 235 240Leu Val Asn Ser Thr Gly Val
Ser Thr Cys Val Arg Ser Gly Gly Val 245
250 255Ile Lys Gly Ser Ser His Cys Ile Ile Gly Phe Val
Ser Asp 260 265
27016196DNABranchiostoma floridae 16tgatttgtgg ttccggcaga tcaatgatca
tttccatgta aaaggatgct cctatgttct 60gtataagccg catggcaaga acaagacagc
aggagaaact gctgttgggg cactatcaga 120gttaacacaa gggttaaaag aagacccaac
agcctacgtc tatcattgcc agaaccacta 180cttctgcccc aatccc
1961765PRTBranchiostoma floridae 17Asp
Leu Trp Phe Arg Gln Ile Asn Asp His Phe His Val Lys Gly Cys1
5 10 15Ser Tyr Val Leu Tyr Lys Pro
His Gly Lys Asn Lys Thr Ala Gly Glu 20 25
30Thr Ala Val Gly Ala Leu Ser Glu Leu Thr Gln Gly Leu Lys
Glu Asp 35 40 45Pro Thr Ala Tyr
Val Tyr His Cys Gln Asn His Tyr Phe Cys Pro Asn 50 55
60Pro6518382DNAMus musculus5'UTR(1)..(382)Exon A -
untranslated 18ccccaactac tttcgtccct tccctccgtc cctcactctc cctcctcctt
tctccccccc 60taccttcctt tctacttctt ttttcaactt tggagcacgg ctttctggca
accttaaata 120ctacagttgc gcaactagca tgtctggagt cacagcaaag atttcccaac
ttatattttg 180ttcaaggtat ccaccgcaaa tggcaggtat atagtaaacg ctgaaaggga
ggctaggtgt 240tatcaatgat acccagtcac tcggtgctat tcttgtgcgc tcaatgggac
gaaagattct 300gggccttggg taggagactt ggagatgcaa gatctggtgt tgccttccag
caccagagtt 360ccgggaccca acaggaacag ag
3821942DNAMus musculus5'UTR(1)..(42)Exon B - untranslated
19ccctggaagg atctgggtcg agctgagtct ctgaggagag at
4220311DNAMus musculus5'UTR(1)..(311)Exon C - untranslated 20ttttcttccg
gctgggagtg agggagcagg ccgggaggag gttacaaggc tttagatctg 60gtcttggcca
gtggggacta gggacgcctg gcactgggtt ggccaccgca ggacagtagt 120gggaacccgg
cacagtagcg ctgcagcagt tgcacttgca acatccctgc tctcccggtt 180ctcctccacc
tgcacctttg tcaccttcag gtgcttcgga gcctcaaaga gggggcagtg 240ggaagtctcc
tggctcctca gagtctgaac tccagagggc atcatgtgct gcatgaatct 300catactcaca g
31121601DNAMus
musculus5'UTR(1)..(125)Exon 1 - untranslated 21gatcccattt gtcagctctc
aagccttttt agaatcctgt gaacatttgc caaagttgct 60tttttttttt ttaaagagag
ggttgcggct tcttcctagg aacagagaca tctgcatttg 120ctctcatgcc taacgccact
gaagctggaa aagccactga tcctggacat ggtgagcaca 180catctgagaa caagtcacca
gaagagggtc tacaaggtgc tgtaccatct ttctacacaa 240gtgcctcaga agcacccata
gcgcccagag gagatgggca ttatccatcg agttgtccag 300tgactcacac tcgagagaaa
atttatgcga tctgctcaga ttatgccttc ctcaaccagg 360caacatcagt ctacaaaact
cctagcctaa cccgctctgc ttgcctccct gataacacct 420ctctttctgc tggaaatact
acaagatata ttggaatttc aactagtaca tcagaaataa 480tctataatga aggaaaataa
cttggaaaac ttgtccactg gcatgggcaa gctacctctt 540gcatgggaga ttgataaatc
tgaatttgat ggggtgacta caaatttgat acataagtca 600g
60122912DNAMus
musculusmisc_feature(1)..(912)Alternative BIVM 5' end clone (6359)
22atattttgtt caaggtatcc accgcaaatg gcaggtatat agtaaacgct gaaagggagg
60ctaggtgtta tcaatgatac ccagtcactc ggtgctattc ttgtgcgctc aatgggacga
120aagattctgg gccttgggta ggagacttgg agatgcaaga tctggtgttg ccttccagca
180ccagagttcc gggacccaac aggaacagag gtgcttcgga gcctcaaaga gggggcagtg
240ggaagtctcc tggctcctca gagtctgaac tccagagggc atcatgtgct gcatgaatct
300catactcaca ggatcccatt tgtcagctct caagcctttt tagaatcctg tgaacatttg
360ccaaagttgc tttttttttt tttaaagaga gggttgcggc ttcttcctag gaacagagac
420atctgcattt gctctcatgc ctaacgccac tgaagctgga aaagccactg atcctggaca
480tggtgagcac acatctgaga acaagtcacc agaagagggt ctacaaggtg ctgtaccatc
540tttctacaca agtgcctcag aagcacccat agcgcccaga ggagatgggc attatccatc
600gagttgtcca gtgactcaca ctcgagagaa aatttatgcg atctgctcag attatgcctt
660cctcaaccag gcaacatcag tctacaaaac tcctagccta acccgctctg cttgcctccc
720tgataacacc tctctttctg ctggaaatac tacaagatat attggaattt caactagtac
780atcagaaata atctataatg aaggaaaata acttggaaaa cttgtccact ggcatgggca
840agctacctct tgcatgggag attgataaat ctgaatttga tggggtgact acaaatttga
900tacataagtc ag
91223912DNAMus musculusmisc_feature(1)..(912)Alternative BIVM 5' end
clone (6358) 23ttttcttccg gctgggagtg agggagcagg ccgggaggag gttacaaggc
tttagatctg 60gtcttggcca gtggggacta gggacgcctg gcactgggtt ggccaccgca
ggacagtagt 120gggaacccgg cacagtagcg ctgcagcagt tgcacttgca acatccctgc
tctcccggtt 180ctcctccacc tgcacctttg tcaccttcag gtgcttcgga gcctcaaaga
gggggcagtg 240ggaagtctcc tggctcctca gagtctgaac tccagagggc atcatgtgct
gcatgaatct 300catactcaca ggatcccatt tgtcagctct caagcctttt tagaatcctg
tgaacatttg 360ccaaagttgc tttttttttt tttaaagaga gggttgcggc ttcttcctag
gaacagagac 420atctgcattt gctctcatgc ctaacgccac tgaagctgga aaagccactg
atcctggaca 480tggtgagcac acatctgaga acaagtcacc agaagagggt ctacaaggtg
ctgtaccatc 540tttctacaca agtgcctcag aagcacccat agcgcccaga ggagatgggc
attatccatc 600gagttgtcca gtgactcaca ctcgagagaa aatttatgcg atctgctcag
attatgcctt 660cctcaaccag gcaacatcag tctacaaaac tcctagccta acccgctctg
cttgcctccc 720tgataacacc tctctttctg ctggaaatac tacaagatat attggaattt
caactagtac 780atcagaaata atctataatg aaggaaaata acttggaaaa cttgtccact
ggcatgggca 840agctacctct tgcatgggag attgataaat ctgaatttga tggggtgact
acaaatttga 900tacataagtc ag
91224888DNAMus musculusmisc_feature(1)..(888)Alternative BIVM
5' end clone (6356) 24ccccaactac tttcgtccct tccctccgtc cctcactctc
cctcctcctt tctccccccc 60taccttcctt tctacttctt ttttcaactt tggagcacgg
ctttctggca accttaaata 120ctacagttgc gcaactagca tgtctggagt cacagcaaag
atttcccaac ttatattttg 180ttcaaggtgc ttcggagcct caaagagggg gcagtgggaa
gtctcctggc tcctcagagt 240ctgaactcca gagggcatca tgtgctgcat gaatctcata
ctcacaggat cccatttgtc 300agctctcaag cctttttaga atcctgtgaa catttgccaa
agttgctttt ttttttttta 360aagagagggt tgcggcttct tcctaggaac agagacatct
gcatttgctc tcatgcctaa 420cgccactgaa gctggaaaag ccactgatcc tggacatggt
gagcacacat ctgagaacaa 480gtcaccagaa gagggtctac aaggtgctgt accatctttc
tacacaagtg cctcagaagc 540acccatagcg cccagaggag atgggcatta tccatcgagt
tgtccagtga ctcacactcg 600agagaaaatt tatgcgatct gctcagatta tgccttcctc
aaccaggcaa catcagtcta 660caaaactcct agcctaaccc gctctgcttg cctccctgat
aacacctctc tttctgctgg 720aaatactaca agatatattg gaatttcaac tagtacatca
gaaataatct ataatgaagg 780aaaataactt ggaaaacttg tccactggca tgggcaagct
acctcttgca tgggagattg 840ataaatctga atttgatggg gtgactacaa atttgataca
taagtcag 88825668DNAMus
musculusmisc_feature(1)..(668)Alternative BIVM 5' end clone (cDNA)
25ccctggaagg atctgggtcg agctgagtct ctgaggagag atgtgcttcg gagcctcaaa
60gagggggcag tgggaagtct cctggctcct cagagtctga actccagagg gcatcatgtg
120ctgcatgaat ctcatactca cagagagggt tgcggcttct tcctaggaac agagacatct
180gcatttgctc tcatgcctaa cgccactgaa gctggaaaag ccactgatcc tggacatggt
240gagcacacat ctgagaacaa gtcaccagaa gagggtctac aaggtgctgt accatctttc
300tacacaagtg cctcagaagc acccatagcg cccagaggag atgggcatta tccatcgagt
360tgtccagtga ctcacactcg agagaaaatt tatgcgatct gctcagatta tgccttcctc
420aaccaggcaa catcagtcta caaaactcct agcctaaccc gctctgcttg cctccctgat
480aacacctctc tttctgctgg aaatactaca agatatattg gaatttcaac tagtacatca
540gaaataatct ataatgaagg aaaataactt ggaaaacttg tccactggca tgggcaagct
600acctcttgca tgggagattg ataaatctga atttgatggg gtgactacaa atttgataca
660taagtcag
668263312DNAMus musculusmisc_feature(1)..(209)Exon A 26atattttgtt
caaggtatcc accgcaaatg gcaggtatat agtaaacgct gaaagggagg 60ctaggtgtta
tcaatgatac ccagtcactc ggtgctattc ttgtgcgctc aatgggacga 120aagattctgg
gccttgggta ggagacttga ggatgcagat ctggtgttgc cttccagcac 180cagagttccg
ggacccaaca ggaacagagg tgcttcggag cctcaaagag gggcagtggg 240aagtctcctg
gctcctcaga gtctgaactc cagagggcat catgtgctgc atgaatctca 300tactcacagg
atcccatttg tcagctctca agccttttta gaatcctgtg aacatttgcc 360aaagttgctt
tttttttttt tttaaagaga gggttgcggc ttcttcctag gaacagagac 420atctgcattt
gctctcatgc ctaacgccac tgaagctgga aaagccactg atcctggaca 480tggtgagcac
acatctgaga acaagtcacc agaagagggt ctacaaggtg ctgtaccatc 540tttctataca
agtgcctcag aagcacccat agcgcccaga ggagatgggc attatccatc 600gagttgtcca
gtgactcaca ctcgagagaa aatttatgcg atctgctcag attatgcctt 660cctcaaccag
gcaacatcag tctacaaaac tcctagccta acccgctctg cttgcctccc 720tgataacacc
tctctttctg ctggaaatac tacaagatat attggaattt caactagtac 780atcagaaata
atctataatg aagaaaataa cttggaaaac ttgtccactg gcatgggcaa 840gctacctctt
gcatgggaga ttgataaatc tgaatttgat ggggtgacta caaatttgat 900acataagtca
ggcaatgtaa agaaacaatt ttccaagaag aaaacgtcgg ataaaaaagg 960gcggcatcag
agggagtgtc tccactattc tcctcttgat gatgttaaac aacgcaaagt 1020gttagacctt
aggcgatggt actgcataag ccgaccacag tacaagactt catgtggtat 1080ctcctcattg
atttcttgtt ggaatttctt atacagcata atgggagctg ggaatctccc 1140acctattacc
caagaagagg cattacatat tttgggcttc caacccccat ttgaagatat 1200taggtttggc
cctttcactg gaaatacaac actcatgaga tggtttagac aaattaatga 1260ccactttcat
gtgaaaggat gctcttatgt tctatataag ccccatggga agaacaaaac 1320agctggagaa
actgctccag gggccttatc aaagttgacc cgaggattga aagatgagtc 1380actggcttat
atctatcatt gccaaaatca ctatttctgt ccaattggct ttgaagcaac 1440ccctgtgaaa
gctaataaag cattcagcag ggggcccctc tcttcacaag aagtagaata 1500ctggatttta
attggagagt caagtagaaa acatcctgcc attcactgta aaagatgggc 1560agatattgtc
actgatctaa acactcaaaa tccagaattc ttagatatcc gacatctaga 1620gagggggctg
cagttccgga aaataaagaa ggttggagga aatttgcatt gcatcatagc 1680attccagaga
ctcagttggc agagatttgg cttttggaac tttccatttg gaaccattac 1740acaagaatca
caacatccca cacatgtccc gggaattgcc aaatctgaga gtgaggacaa 1800tatctctaag
aagcagcatg ggcgcctggg caggtccttc agtgcgagtt tccatcagga 1860ctcggcatgg
aagaacatgt ctagcatcca cgagaggagg aacagtggct accacagctt 1920tagagattat
aatggcaatg actgaccatg ccaaaactta gccactggtg ttacccacac 1980agctgttatg
tacaggactg cattaggaca tcagctggtt ttattaagtc tgtcaatagg 2040aacagatttt
gtggtacaaa acacaccctg tagttctcta gtaaaaaagc ctacatagga 2100ttactatggt
tggcttcaaa tatacaggca ggtaagcaca gaaccccgcc cttctaaagt 2160taaaagtaga
taagcaatct ggacaaaggg tttcacaaaa tccaatacaa tcaaaacggc 2220ttcaaagcaa
aaacacaaat gcatttaatt tgaaaagcat cgaaacttga actacttaag 2280catgaagcga
cttattgata cttgatccct agcatttatt acaacacttt aattcctaag 2340gcatcatctg
tccttaaaaa atgggggcag tcaaggtcta gtttttgctc atggttaaaa 2400ctaatttaaa
attatctttc tagtctagtt gttctttcag tgctaacagt atccacctcc 2460catcgttgct
ttcctgaata actctcagga ttctccaaaa agcagcagaa actactccag 2520gaactgacct
tttctctagg tgcagatagg tgacttaggt cattgatcct gatactcttg 2580acttggcacg
tggttgtgaa atagctacaa gaagaatata ggtctggagc gaagtctgat 2640gttctagaac
aaaccttgtt tcagggatat agttagagag cacttggcat ccaaagtttc 2700cttatccacg
gtaacatgtg ctgtgagatg tcacatttga cttgtctctt aatggagtca 2760tgtgttaaca
acagcactga tgtcatgttg gcaatgtcca gctcactctg aggaagactt 2820tgtattttca
actctgagcc gtttcctttt gtgaaacctc caagcaatta ggtgttggaa 2880gtgtgagtta
catattctgg aagtgtgagt tcaatacttg agctcctctt tagcggctct 2940tgttttcctt
ttgctgccaa ggtgtgactc atagccgtct atgatgctgc tctttcacgt 3000cgtaggttta
ttccaggatt caaatcagta acttggtgat tacaaggtgc tgagtatgtt 3060ggaaccattg
caatacacct caaagggagg tgtcggattt tgacttttta aaaaaaattt 3120tcatttttct
cttgaatttc atatccatct atccactcat atatgtttag cctacagaat 3180tacaaactag
tcctgtttct gaagaggttc tttagcttga aatgtaaagg actgaaagat 3240ttgtaggtgt
tcttttgtta cttcacactg gaactttgaa aatgttttca tcaaataaag 3300ttttgttttc
ta 331227502PRTMus
musculus 27Met Pro Asn Ala Thr Glu Ala Gly Lys Ala Thr Asp Pro Gly His
Gly1 5 10 15Glu His Thr
Ser Glu Asn Lys Ser Pro Glu Glu Gly Leu Gln Gly Ala 20
25 30Val Pro Ser Phe Tyr Thr Ser Ala Ser Glu
Ala Pro Ile Ala Pro Arg 35 40
45Gly Asp Gly His Tyr Pro Ser Ser Cys Pro Val Thr His Thr Arg Glu 50
55 60Lys Ile Tyr Ala Ile Cys Ser Asp Tyr
Ala Phe Leu Asn Gln Ala Thr65 70 75
80Ser Val Tyr Lys Thr Pro Ser Leu Thr Arg Ser Ala Cys Leu
Pro Asp 85 90 95Asn Thr
Ser Leu Ser Ala Gly Asn Thr Thr Arg Tyr Ile Gly Ile Ser 100
105 110Thr Ser Thr Ser Glu Ile Ile Tyr Asn
Glu Glu Asn Asn Leu Glu Asn 115 120
125Leu Ser Thr Gly Met Gly Lys Leu Pro Leu Ala Trp Glu Ile Asp Lys
130 135 140Ser Glu Phe Asp Gly Val Thr
Thr Asn Leu Ile His Lys Ser Gly Asn145 150
155 160Val Lys Lys Gln Phe Ser Lys Lys Lys Thr Ser Asp
Lys Lys Gly Arg 165 170
175His Gln Arg Glu Cys Leu His Tyr Ser Pro Leu Asp Asp Val Lys Gln
180 185 190Arg Lys Val Leu Asp Leu
Arg Arg Trp Tyr Cys Ile Ser Arg Pro Gln 195 200
205Tyr Lys Thr Ser Cys Gly Ile Ser Ser Leu Ile Ser Cys Trp
Asn Phe 210 215 220Leu Tyr Ser Ile Met
Gly Ala Gly Asn Leu Pro Pro Ile Thr Gln Glu225 230
235 240Glu Ala Leu His Ile Leu Gly Phe Gln Pro
Pro Phe Glu Asp Ile Arg 245 250
255Phe Gly Pro Phe Thr Gly Asn Thr Thr Leu Met Arg Trp Phe Arg Gln
260 265 270Ile Asn Asp His Phe
His Val Lys Gly Cys Ser Tyr Val Leu Tyr Lys 275
280 285Pro His Gly Lys Asn Lys Thr Ala Gly Glu Thr Ala
Pro Gly Ala Leu 290 295 300Ser Lys Leu
Thr Arg Gly Leu Lys Asp Glu Ser Leu Ala Tyr Ile Tyr305
310 315 320His Cys Gln Asn His Tyr Phe
Cys Pro Ile Gly Phe Glu Ala Thr Pro 325
330 335Val Lys Ala Asn Lys Ala Phe Ser Arg Gly Pro Leu
Ser Ser Gln Glu 340 345 350Val
Glu Tyr Trp Ile Leu Ile Gly Glu Ser Ser Arg Lys His Pro Ala 355
360 365Ile His Cys Lys Arg Trp Ala Asp Ile
Val Thr Asp Leu Asn Thr Gln 370 375
380Asn Pro Glu Phe Leu Asp Ile Arg His Leu Glu Arg Gly Leu Gln Phe385
390 395 400Arg Lys Ile Lys
Lys Val Gly Gly Asn Leu His Cys Ile Ile Ala Phe 405
410 415Gln Arg Leu Ser Trp Gln Arg Phe Gly Phe
Trp Asn Phe Pro Phe Gly 420 425
430Thr Ile Thr Gln Glu Ser Gln His Pro Thr His Val Pro Gly Ile Ala
435 440 445Lys Ser Glu Ser Glu Asp Asn
Ile Ser Lys Lys Gln His Gly Arg Leu 450 455
460Gly Arg Ser Phe Ser Ala Ser Phe His Gln Asp Ser Ala Trp Lys
Asn465 470 475 480Met Ser
Ser Ile His Glu Arg Arg Asn Ser Gly Tyr His Ser Phe Arg
485 490 495Asp Tyr Asn Gly Asn Asp
5002834562DNAMus musculusmisc_feature(349)..(554)KDEL exon
28ccagacttgg tcctttgcaa gtgcttttca ctgccgagcc atctctctag cttttttttt
60tttttttttt ctctctagct ttttaagttg attattctca caggtgtgtc ctcaaggagc
120aacttttcta gtaatgtttc ttggtcttta gtgaattata atccccatgt aaataatcaa
180tatctctctt ttgccaagtt gcctaattta gttttcaatt agaaaaacat ctagtttttt
240tttccattta gtgaaaaaag tatgctaagt ttttgataga attcttattg atggaatgac
300atgccttttt gtatacaaat cctaccataa atttttgttt caaaatacct cttaaaacat
360aaggagactc ggcaacatgt tgcccatggt gtttaacctc tatcttcaga tttctgtagc
420tggcatacat tctgtatctt actatgaagg aaccgtcctt tcgatctaaa acctggacac
480caactctcgt gaactgctca tcgggtgctg agattttaac ctggaacacc ttttcacctg
540gagaagatgt aaacctgtta tggaaaacaa tgagagatga gggttatttt taagatcaaa
600tgcacgctac ttattctctt taatgtgtaa aggttggacc ttctgcttta tcctgatttt
660aagatattgt tgttcgatta ctctcttagg cttttaggac cagaaccaat atgatagaac
720aaagataaaa aagaaaacaa taataacata aaataatttc attagatagt tgccaggaga
780taacacttaa gactagcaat ctatttagca taccaaatag agcactatca gtgtactatt
840attaattcat aaccaacaca gctggcaatt tgtcatttgt aacatgttta cttcaaggga
900tgaccttatt tgaaaaattg ttcttaatgg cttaaaatac aatccagtac tgtggggcta
960gggagatagc tgaatggttt agagcattta ccattatgta aaagacttag ttcagttcct
1020agtatccatt gtccttaagg ctcacaacta tctgcagttt cagctcaagg agattcaagg
1080ccctcttctg gcttcctcca gcacttgggt gtgagtatac accctaccca catcatgtaa
1140ttaaaatatt actactaata aaagtctaac aaatatacca gtactctaaa ttatgtttag
1200ttatggggga aatcaaaata actggtcatc atgttagtaa cacaagtgtt ttggactgaa
1260tattcctgca ttttgaatgc atttagtttc tagtgtttag ttttcccatt taaaaataac
1320ggagatttgt ttagcctcaa ttcccatcta aacagccaag aagagagctg ttattttagc
1380aatattattt acactataca tatttcagaa ttattaagtc acagagaaaa gttgattttg
1440aaggaaaagt ctattataac cagatgtaga cagcactcga gaattctgga atgtaaatgg
1500acagcagtaa agacgagtcg gggcatataa agatgaagga caaaaccaaa ataaacaaca
1560ggaacaaaaa cccctcgagt tattttaaac cgcgatctca aatagttgag gagtcatttc
1620tccacttact gttcccctga ggtatccacc gcccgaatgt acgaaatagc gggcaggaag
1680gaccacgtgt gcttttagcc cgggtcccat atttcgctct tctctgggct cagctcctct
1740ctccgccggt ctgggcaagt gctggcacag tccccagaaa aaggcaggaa agcagcgaaa
1800tgctgaacat ttacaagacc gagtcttgga agatccacga ataaatgcaa acgtgctaag
1860tggacggagc tagcctgggt aggtaggagg acggtggtca gtcctagcag cagtcaaagt
1920ccctaaagtg tcgcttgacg tgcggggtcc gccgtgggac tatgctgtgc ccaagaatgt
1980cacaacccga gcggtggcat gtccaggaga aggatgccac acgttctgct gaacctggtt
2040aagaaccgca ggctccactt tctccggaca atgacaaact tgtgttactc ttgccactag
2100agcatcactt tgggaacgaa tccgtcgcaa gttctagcca aggtgagagg agaaggattg
2160gcctgcgctg tgaccagtct tagtaaagtc ttctgctagt ggatgagtgg gtcagagtgg
2220aaaacgcgtc ccgggcccct tagcttcctg ggatatgtag tccgccatag gactagcgga
2280aatcctgcca ggagttcacc cccaactact ttcgtccctt ccctccgtcc ctcactctcc
2340ctcctccttt ctccccccct accttccttt ctacttcttt tttcaacttt ggagcacggc
2400tttctggcaa ccttaaatac tacagttgcg caactagcat gtctggagtc acagcaaaga
2460tttcccaact tatattttgt tcaaggtatc caccgcaaat ggcaggtata tagtaaacgc
2520tgaaagggag gctaggtgtt atcaatgata cccagtcact cggtgctatt cttgtgcgct
2580caatgggacg aaagattctg ggccttgggt aggagacttg gagatgcaag atctggtgtt
2640gccttccagc accagagttc cgggacccaa caggaacaga ggtattcccc agcggagggc
2700ctagcccagg cactgcggtg cgctccgcct ctccgtagcg ccgtcactgc ctcgttgcag
2760acaaccccca cccccacccc caccccgccg cgcccctgga aggatctggg tcgagctgag
2820tctctgagga gagatgtaag ggatagaaat caccagaaga aagctcggcc tgaggggggt
2880gcatccgtgg gtatccctgt actctcttca ggagcgggtc ttcccvtgcg ctagcggatc
2940ccaggcaccc ggcctgcagt ggccatcctc tctttgcacc ctcgcttctc ggccagtgag
3000gtcaagagga gtagcaggct ttgtcctcct ccacggggta aggggcgtgg aaaacataga
3060cggcctggtt gtgagcccaa ggcaagggga ttctttttcc ccctgccccc cccccccgct
3120attgttatta ttggtgatat aatcattcat cgcggcctcc cctcccagct ctcaggctgt
3180cactgtctgc gtgccaccag cctctccagt ccctagcgcg taggaagcgg cccccttcag
3240gcctgcgcgc tccccctcct ggcgagcctt ttcttccggc tgggagtgag ggagcaggcc
3300gggaggaggt tacaaggctt tagatctggt cttggccagt ggggactagg gacgcctggc
3360actgggttgg ccaccgcagg acagtagtgg gaacccggca cagtagcgct gcagcagttg
3420cacttgcaac atccctgctc tcccggttct cctccacctg cacctttgtc accttcaggt
3480gcttcggagc ctcaaagagg gggcagtggg aagtctcctg gctcctcaga gtctgaactc
3540cagagggcat catgtgctgc atgaatctca tactcacagg taggcctccc aggtctggtc
3600tggtagtcta ggcaagggcg ctgatagaaa aggggggcgg tgggggcacc tggctgctgt
3660cttagcagct cttatcaatc ctcagcaaaa cacttttctg ggtctcagtt gctttctatg
3720cagaatattg attatactgt ttgtgatatg tggaaggctt tactgagatt ttattggatg
3780tctagctctc tactgaatgt gatacttcag catgtagaca cgaaaggcac ataaatggaa
3840aagaataaac tcaggcaatc gatatcggtt ttgcaaacag tatgtttatt tgacagaatt
3900gttcattcca aagactttca taagtcattc ctctagtgcc gtcctagaag cttggggttt
3960atattgctta tgagatactg tgaataaatt gagtggtatt ccgcaagtcc taatatgtta
4020atcaaaacga gggagaacaa tttcaataat gcctagtgca ttgtagacac ctaaaaatgt
4080ttgttagacc tggcttctac aagactaggc ctggagatgc aatttcagga aaactagact
4140ttcaggagtt aaccatttgt gtacacacac acacacacac acacacacac acacacattt
4200agaaagctta atcatattct agaaagaaaa gttcctcact atggataggg tatctggcct
4260agttagaaca gcaaaaaaaa aaaaaaaaat tcttcagaaa gtgacatttg agtataactt
4320tgaaagatgg atgagaatga ataattagta aggggctaca ggaagacaac attcagaaaa
4380ccaaaaagta atgtggaagg acagaagaag gtatgtaggg taaggagttg atcttttgct
4440tcaatagctg agggccttag gcagaaaatt tctatgagga tatatatgta gcagacttta
4500ctctttgatg tctatgcagg gatcagacta gaaagaacag ctcaaagggt gggtggcatg
4560aagttataag cataaagata aaagcagata taaaccccag atagctgcag atacagtgct
4620ggacaatcaa tcagccttca taactccgtt ttccctgaga ctaccattgc atcttactag
4680aataatctat gcctgtaatg gtcttgtttc cagtagttgt ggctttcttc attctaagaa
4740agcaggctct gagtgaagtc ctaaggatac catagcagaa gcccttcaga gtttaactac
4800aaagggtgga gatgcagatc agtcacggga tgcaaagaag tagagtgcca gcatgggttc
4860tgggtcattc tttttcattt tctgtgaata tcttctgata tttcataatt ttctacaata
4920tactttgtgt catcaaacct ataaatatat gtattttcag agatattcac ataaaacagt
4980ggttcaaatg aggaaatgtt gatatctttc attttcagtg acaactatgt aaaatcttca
5040ggtagatggc aggtttgttt gttttttttt aagataaaat gagagcaaag agattatttt
5100tattcagtga ccattgctta aagcatacaa tccaaaacca gatggttgca tgttttataa
5160atcaagtgtt ttactcttaa cagcttcatc tcctagcaat ttccattgta ttttttcatt
5220cataaatgaa cctgtttccc ttggcaatgt tatttggtta gctttggtta tttgagctag
5280ggtcttgccc ggtgtgtact gggctagatc agaagtagct gtgtagttta gtctagagta
5340gatcttgcag cagctctctt gtatctgcat cttgagtctt ggaattacag aaatgggcta
5400ccacgttcta ttttgaccat actttttttt ctttccatat gttccaaatg agcataaaac
5460catcaaactc tggtacattg tactagtccc ataggcaaaa ttatattcaa ggtgagcgag
5520gcagtgatgg caaggtgagg ttaatgtcgg gttgtgatct acaaaaggta gagcctgggg
5580tgcctgggat gtactgaatc aagctttccc actgccccag ctgaccatct gttgtttcct
5640atcgttccac acccaaccac agacacatta tacccctcta ccacacagct tttaaagaag
5700aacgtgagca tcacagacag gaagtgggca atggagaagt ggagaaactc agtgctcaga
5760ggaaaggaag cagtgtggtc tgcaagcctc atgcactgta catcctttga gtcttcttcc
5820actgtcacca tcacaccgca cacagaatga ggacaaagga gagagctcag acactacttg
5880tacacacact aagacctttt taccaaacag tatttgaaag gaaagcactg cagatgccag
5940ctttggtgat aatgtacttg catcctgtga gccactgtta actgaagatg agaaaaaagc
6000ggacgggaag ttgtgagaag gtgcatctca taaggggagc tcactctgca cagataggag
6060ttccttccta ggactgtttc cccttagatt ccataatggt gaaagtgcct tcaccccttc
6120cattcagaag atactggaga ttccggttta tcatcacatc tgttcgttag agactcagac
6180aggcttgtgc cattcaattc ttttataatg gttattaaac aaatcataat ctgtgattat
6240tttcatttct gatgaggaga gagattaata atggatgggt ttagtctctc atgcactgcg
6300cggtcactgg agcctggaaa ggcagatctc gccctgcatt tggtggttgg tgaaggtgat
6360ttgaactgtg gaagcctggc tctagaggtt tcagagaaga attttaatat gtagccataa
6420tattttggtg aagaatgtgg ctgctttttt gcccttgtct gaagagtcta ccggaggcta
6480aggtgaagag atttagatta attcctttgt taaaggaaat ctcaagacat cctgatataa
6540attctgttga gtggttacta aacttctctc ttacaaagac tgttttaatg aaaagaagca
6600agctgagaaa ggaataatat aaaatatatg gttgcagtat taaaaggggc accaggaagt
6660gaaatggagc tgaatcctgt gttcatggat attaaattga attaatgggg tggtgacttt
6720gaggcaagag tccatccagc taaatttagg tccaggcatg gtagtataca agcctttaat
6780cccaggagtc aaggcaagca ggtctctgag tttaaggcca gcctaggaca gagcaagttc
6840taggtgaaga aaatcttaag tgcaggcatg gtggcatata cctttaattt caggagacat
6900ctctgagttc aaaatcaatc tacagaacaa gttccaggac agccaagctt aggaagtgaa
6960ggagttggaa aacaaaaagc tggtaataga atatgggggg agggggggaa gggggccatg
7020ttctaatccc agcatgcagc agaactcagc agctttgacc atgtggctct ggctttagag
7080tccagaatag aagggactac tggaacaatt gatgctcgtt agctggagct aagacattag
7140tgatgattaa gaagagaccg gtatcactga ggtgtaatct ggcgttttct gagagcacaa
7200agaagctgtg ttccagagat agccaatgtt gtccctcgtg ctgcagctgg acttggtagt
7260gtgtaagaac cacccaagtt gtactggttt tgaaggtatg gaggagtcat ggagagcagc
7320cgaggctgct gtgagaggcc atgggaagcc actggtgaag gtgcagcatt agttgtagtt
7380gatggcccag gactgaaggg gccatgcaaa gaagttgagg cttgactcca tgaagagagc
7440ctacgagagg ctattggtga atcctagttg cagtggaaga ccccagggta ttggagatgc
7500cactaccatg gggatgatca ccaagaacag cagcagcagt ggagtagagt caaccagagc
7560ctagagtgct acagagagca gagctggaga tatgacccaa gccctttgaa ggagtccaga
7620agatcatgtg tggatcccag acattggaaa gagaagctgt aatgttgaag tggccttgga
7680taccctaaga tgttcgagat tgcagagctg tggaatacct gtcaaggaaa gctgctaaca
7740gggagtggaa tcacccagga gaaaaaacct tgttgcagtc agtaaagatg aaaagggagt
7800agagatatga agacagcttt gacatagaca tggagatgaa gagtttggat tttgcccagc
7860tggtgtcctg tcttgttttg gggattacag ttaagtgatt ggatgaatgt cagaagagac
7920tttgaccttt ggacttttaa cattgttgag actgctgtag actatgggga ctttggaagt
7980tggagtaagt gtagtttttt attatgctgt atttaggtat ggccctatag actcatatgt
8040ttgaacgagc ctatgggtcc atggagtaga atgtagtggt ttgagtattc ctagctcagg
8100gagtggcact attaggaggt gtggtcttgt tggagtaact ttgtcactgt gggtgtgggc
8160tttaataccc tagtcctagt tgcctagaag ccagtcttct cctagcagct ttcagatgaa
8220tatgtagaac tctcagcttc tcctgcacca tgcctgcctg gatgctgcca tgctcttgcc
8280ttgatcataa tgcattgaaa cttggaacct gtaagccagc ctcatttaaa tgtcatcttt
8340ttaacagttt cctttgttat ggtgtctgtt cgcagcagta aaaccctaag taagacacac
8400gctataaatc actcattaca atgtataatg tacaaaaagc tcatcttgtt gaacctttca
8460tcccaaggca actgctaagg tgtattgtag acttgtccct cctggaggtc ctgggctgtt
8520aggggaccag ctcctttgtg attggcatgt ttccaaagtc cattcacgtt gtatcatgac
8580cagtagttct ttttattgca aagtagaatt ttaccaaatg catgtggcac ttttatccat
8640aattcagttg ctaggctttg ggttgtttcc actttttgcc attcatgggc atggtttaat
8700gttttctgta tgtatctaga acagcaattt ccatattaca tgggaagtgt tgaaatgtat
8760gaaaactgct ctgtaaccct atatcatttt atatccttga aagcagctta tgaaggatct
8820gattcgccac acattttcta atgctctttg ttatcgatgt cttttttttt ttttttttaa
8880agatagggtt tctctgtgta gccctggctg tcctggaact cactctgtag accaggctgg
8940cctcgaactc agagatctgc ctgcctctgc ttcccgagtg ctggtattaa aggcatgtgt
9000caccactgcc cggctttatg tctttttatt atagacatct tggtgggtat gagttggcaa
9060accattgagg ttcggattgt gtttccatag caacttatga tgaggaactt cttttcatgc
9120aatcattgtt catctctgta gcatctttac agatgtgact tcccctttta actaggttat
9180gtatattttc tccttataat ttgttttaaa ctaatttttg tgtgtatgtg tttatgtgta
9240tatgtggtgt gtgttttcct gcctctcttt acccagagct ggcattatgg tttacacagc
9300ttcttattga tctattgttt aatgtgggtg ctggagatcc agttcagatc ctcatatttg
9360catgacaccc ccttaaccaa ctaagccata ttccccatcc gcaagctcat ctttaatgta
9420agatagatgg ttagtaatca taactcagga tgtatggtca taaaatcaat atgcattgag
9480tgcaggtata tgaatcatac aataatgttt attgccattc atcatgcctg ctggatagtc
9540aatcagtctt tcttctctgt cttttcttag gatcccattt gtcagctctc aagccttttt
9600agaatcctgt gaacatttgc caaagttgct tttttttttt ttaaagagag ggttgcggct
9660tcttcctagg aacagagaca tctgcatttg ctctcatgcc taacgccact gaagctggaa
9720aagccactga tcctggacat ggtgagcaca catctgagaa caagtcacca gaagagggtc
9780tacaaggtgc tgtaccatct ttctacacaa gtgcctcaga agcacccata gcgcccagag
9840gagatgggca ttatccatcg agttgtccag tgactcacac tcgagagaaa atttatgcga
9900tctgctcaga ttatgccttc ctcaaccagg caacatcagt ctacaaaact cctagcctaa
9960cccgctctgc ttgcctccct gataacacct ctctttctgc tggaaatact acaagatata
10020ttggaatttc aactagtaca tcagaaataa tctataatga aggaaaataa cttggaaaac
10080ttgtccactg gcatgggcaa gctacctctt gcatgggaga ttgataaatc tgaatttgat
10140ggggtgacta caaatttgat acataagtca ggtaagaagg agctatgaag tttacaggta
10200acaacaatca gaaacgaatg ctatctattg ctaagtcttc caatgaaatg ttttttgttg
10260ctaagccagc agcatcattg ccatcttatc tgtcattgca gtttttggtt ttgttttgtt
10320tctttcgtaa gtaaccctag atatggttag tctctgactg tgttgcccat ggaaacttct
10380aatatcatat gtgcatttga gcagctttga aaatcaaaaa gaacaaaata taagtattaa
10440agataataca gtagcttcaa aaaggctact gacataacta gaatattacc attatcttac
10500agttttgcag agatgtgata atattttcta attcaggagg tattaagaca tttttgtttt
10560gaaaaaattt gagttaaaaa agaacattca ttttgatcaa agtcttgatt ttatttaaag
10620ctacaattat gtggctctct tttctaaacc atattctaaa gtccatttta tttctcatgt
10680tattttaacc cgctctaaga gtctagctct ggacttggat acaatcttga taggaatacc
10740gtttctgatg gttcaaattg ttttaaattc tcttcctgct ttctctctag agaagagtag
10800tattctagaa agcacaggta ttactttgag acatttgcag ataccatttt cagaatgcat
10860ggccagtcct ctaattctgt tgtaacttct ggcacgggtt ttactttatc tgaacatttc
10920ttgtattact tctttgttct gtaggcaatg taaagaaaca attttccaag aagaaaacgt
10980cagataaaaa agggcggcat cagagggagt gtctccatta ttctcctctt gatgatgtta
11040aacaacgcaa agtgttagac cttaggcgat ggtatgtggc catgtcagtt tttacttttt
11100ccaatcttaa aaatatgtaa tttgacatta attttcctga gtataggtta ataaattata
11160ttaactataa acactgttag ttccaaaatt atgtctagat actttaggta tatctccgat
11220tttggaagta gtctagttta gctcagcctg gctgtcagtc tcattcagtg atccctctta
11280atgttaagcc acattggctg acacttaaag tcggaatagt cattcacctg gttgttcttt
11340gtgactaatt ctataggcag tgatggtagc ttacagctat tttaattatt gccataccta
11400gtaaaatgaa caatatttcc tgtatgtata ctttcagact aaattgacac tttcctttct
11460agattgttct aaaagttcat catatgcgtg cttgtttgtc atggcccgtg agtttcaatt
11520tagagtttcc agctttcttt ctttgtctcc gtctctccct ggaggatttg tgttttcctt
11580ttaggtcccc agtgacagtt ctgttctggg gatttctgcc caggggttca tgggatgtgc
11640ttagcctgag gagaatctct gaatctcttg acaaaaggag agcggtttgc tttgcttctc
11700cagtatttca gaagctgcag aggatgcctg gcccactcaa atgagaattc acatagacat
11760ttgagtcgtt gccatcaaaa tttcttggtt tgaataagac atactttagt aggttgctgg
11820ccataatgta gctcatcttg ataaaacatg atttttatgc tcatgcattt tttatgtgtt
11880taataggtag gggatacttt aaacaaaaaa tgagttacac tgcattgaag tttgtattat
11940ggtcacgttt taaatcaccc ataaactata tatttctgtt ctgcatgttt tgtattcatt
12000cgagtgaaga atactgtttg atttcccagg tactgcataa gccgaccaca gtacaagact
12060tcatgtggta tctcctcatt gatttcttgt tggaatttct tatacagcat aatgggagct
12120gggaagtaag tatgccagtt tactgctgac accaaactcc atctttgaaa gtagtgtcaa
12180ggaaactgag tatagtgatg cacaccttta atcccagcac tcgggaggca gagtgaggca
12240gatcactgtg agttggaaat cagactggtc tacagaatag attctaggac atctagggct
12300acatagagac cctgtctaat tttaaggtac ctgtgttata atggaaagtc tctgagagtt
12360gactttgtac aaactgagaa aaattgtgtg tattgttaca agcctttcag gttaaaatat
12420ttgcatgtat tcttttcaat aaaataaatg aaaaatattg ctaaaatgtt tctaagctaa
12480gcctatatat tactaacact ggggcatatt ttatttgcat atgactggaa ctgtgaaatg
12540aaaggaatgt ttcttttata gtctcccacc tattacccaa gaagaggcat tacatatttt
12600gggcttccaa cccccatttg aagatattag gtttggccct ttcactggaa atacaacact
12660catgaggtac ggagctgcca cttagggatt acatacgcct tcctttaatt ctgtgaagtg
12720ataactatgt agtgtttggg gacggtacaa gatagacttt gctgctgggt cgcaggctcc
12780ctaagatatg ccatgtgtgg gctgtgcttg ggtttctgtg caattaggac agtagcctgt
12840gttatgaaac tattgctatg agcaaatctt cttcttaatt tcacatggtt tagacaaatt
12900aatgaccact ttcatgtgaa aggatgctct tatgttctat ataagcccca tgggaagaac
12960aaaacagctg gagaaactgg taggtgaaaa tacacacaaa cacacaaaca cacacacaca
13020cacacacaca cacacacaca cacacacaca caccccaaac tttcaggttg agtacatcgc
13080agaaattggc cacgttgttt cctgggatat caaattaact cttaatagtc tgatgttttc
13140atacctcttt aagaggaaaa tcatgatagt aatatataga gcactttcaa tattcaattc
13200cattttcacc cctttctgat attttctatg taaatgttac aaatttaaag cattgtctat
13260aaacactgtc tcttaacaca ttgttttgtc tgggtttagg acttgttata tttgcggtgc
13320gcatgttgtt tgtgcgtgtg tccaccttct acgcaaaaac cttgaattta cagtttattt
13380taaaacagag tgagagcact tgaggacatg tgatgttgga ctggcatcaa gtgaatacac
13440agaacagcag agtaacaaac tgggaagtga caacactctg ttggttttaa catactcatt
13500aatgaagttg aatttaagga ttatttgtta tctattagtt gaattatttg tattgtttaa
13560tgatttcact tataaatggt cacaataaac tttgaaattt atggagcagg ctgctgtaag
13620gtcctgctgt taaagaagaa acttttcgga ctatcctgga aacgggagac tttatctaag
13680gttgttacag tggggtgttg tgataaggat taaagattaa cctcggcttc aaaggtgata
13740aaggtggctg taaaggcaga tggataattg atgcgaaaga atacagggta ttgagaaact
13800ataggtagta atgcaaattt aacaacattc tcaaagtata aaaatataaa acaattaata
13860aaatgtagta aaacaacaaa agggctcctg agtgttagct taggattagt attcagaaat
13920tttatgattt ttttttttga tcaattgtgc tgtccagtat gttttggagc ataatttgtt
13980agagatttta gtgtatggat ctttagaaca acagtacttg gtttactttc aaatggtcct
14040ggttcgtaca attctccctg agttatttat aaaatgagtg gaaggatatg aatgctgtta
14100tccagtttcc taacggacta ggatatgcta ggctgcttgg caccgacagg cacactctga
14160tttcgtttac ctctgtttaa acatccttca acatatgcac agtcttttac ttatagaatg
14220gaataaagtg ttttccttta caaggtttaa aatgacatat atgcatgtgt ttatttcatg
14280tgtgatcaaa ggacagcctg agggatttgg ttctctcctt tcaccatttg ggccctgggg
14340atcaagcttg ggcaagaggg taccgttact tccaaagcca tcttcctaag cccctccctt
14400tttatctctc tttttatgca aaggcaagat ggtagtttct tcagatttct tcaaatacag
14460tttaatgaca tcttttacca attgtggtct ccccaacccc cttccacctc cacagcctgg
14520caccatttct gaaatgtccc tggacagtct agccccagga tcctattatc tcatgtactt
14580ctatccattg gccccagaat cctattagct catgtactac ttactatcca ttggcctctg
14640tcaggtgggg tttgggatgg actatcatga cttttatctt tttaatgtaa atgtcttgct
14700cagtgcccat agtcaccatc tgaagtagcg ctttgatgct ttgattctgt gtatgataca
14760catcagcgat agttacactc aggtcggcag agcactgata ttctgcttcc tgtcctcatt
14820ttgtccatac tctctcttat agaagagctg ccaagaggag ctttcttcaa tggtttcgga
14880tcttcaatgg ttaaaaccat ggccagctgg tattgcttat ttagtaacta ctcggaaact
14940tccctcccca ccccccaccc ccctttgagg tagatctgat tttcacagtg tgctttcagt
15000gttcttggta cttgcacaac ctggatttag taactcaggt tcttgaatgg cagtgccatg
15060gccctcaata gtcactgtaa atgaagcagc tcatagagtt taaaaaaaca aacaaacctg
15120tgcacatctc aggtgtacag cacactggta ctgttttcct catctccagt gcttttcgag
15180ttcatattga gccgttttct ttaacctgca gtgtggttat gcccctttcc acctcgggag
15240ctctcgaggt ttgcgtacag accaagcttc atagtgttga tagcatgtgg agccaacatc
15300aactggactc atgtgcctgc tgtatttaaa aggtatcaaa tggatagggg ccttgcagga
15360caatgccatg ttgcttgact acttaccata gttccagaat cttcccttga ctttcaatgt
15420ataaagatgt gctctcagtt ctctaaggct cttctgcaga aggactggtt aatactaaac
15480catcttgaag aatgtacggt gctgacagtg tattggagtg agagctataa tgccaaaggt
15540ggccgaaccc attgacagct caagttgtcc actcctttca actctgaacc taattcttat
15600cgctttttcc acatccatcc tgtctgtttg ctgtacagct gtggaggtag tggctttgtg
15660acttggcttt taggtttcct ctgttattgg gtgaactata ctctgaacct cgtttctctt
15720agcaacacat ttgtcttctt atcacagatc atttcctttg gtaactgcct tctgagtaat
15780gctgcattgt cttttatttt agaaggtgat cttttctctg attttgtcct gaaattgtgg
15840aggctttctc tgttttactc tacattctta cctagtgtct tgatgtatgt tcccctgtgg
15900atttgcctct ccttatactc cttcctctgt ctagattgct aaaaccctcg ctcctcatta
15960agttccatct tgaatgccat cccttcttga ttcctctgat ttctataccc atttgtgatg
16020ccttaacatt agctctttca tctttcacgc tagagcttgt cttgcttcca tatggtatct
16080gacagagtta ctaggtactc atgagttaaa tcaaaatctc catttaatct aaactagctg
16140ctttcaaaat actgtccctg gatgcatcac ctgataactt gttaacaatg caaattctta
16200gttcccaacc acagacctat aaggccagcc actcagactg ggctctagta agactttatt
16260gaagtcttaa caacgagatg gtatgtagcc tctttaccag gagtttgatt ttgcattgag
16320agaagatgca gtcattttct aattgtgttc agatgaaacc tgaggattcc ttttaaaggc
16380ctgaaaggga ggtgatagaa gagaactgtg cacacactta ttttagctac agcagctctg
16440ctttaagaat aagtttgctg atcttgaaaa catatgaaga atgttatgta ccttggagaa
16500acaagagcaa acatcccatt actagaaatc atgtgtatag atgttatata tcgtatttaa
16560caatgctaga agcattactt catagaagaa aacccatgtc atggcatttg gaaaactagg
16620acatggtaaa agagtgtttg cagcagacaa aattatgtga ttgccaagga gggcagactt
16680tgttatccat ttcatctgcc tctagaatcc ttacttctcc tcccctatgt gattgccaag
16740gagggcagac tttgttatcc atttcatctg cctctagaat ccttannnnn nnnnnnnnnn
16800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16860nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
16920nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncatgatgt gattcaggct
16980agcctgaact tgtctatggc cttacatctc tgttttttta aaatttaatt taatttttaa
17040tttgtttttt ccactccata ttccattcct ctybbyyycc cccaccccca tccaccctct
17100gactgctcca catcccacac ctcctcccca ctccaccctg tctccatgtg ggtactccta
17160ccccccaccc cacctgatct ttaaactccc tggggccgcc agtctcttga ggttaggtgc
17220atcatcttaa aattaattta atttttaatt tgttttttcc actccatatt ccattcctct
17280cgcccccccc ccccatccac cctctgactg ctccacatcc cacacctcct ccccactcca
17340ccctgtctcc atgtgggtac tcctaccccc caccccacct gatctttaaa ctccctgggg
17400cctccagtct cttgagggtt aggtgcatca tttctgaatg tacacagacg gaagtcctct
17460actgtatgtg tgttgggggc cgcatatcag atggtgtgtg ctgtctgttt ggtggtccag
17520tgtttgagag atctcggagg tccacattaa ttgagactgc tggtcctcct acaggatcac
17580ccttctcctc agcttctttt agccttctct aattcaacca cagggtcagc tgcttctgtc
17640cattgattgg gtgcaaatat ctgcatctga ctctttcagc tgcttgttgg atctttcaga
17700gggcagtcat gatagatccc tttttgtgag cgctccatag cctcagtaat agtgtcagac
17760cttgggacct ccccttgagc tggatcccac tttgggcctg tctttggacc ttgtttggat
17820caggctctct ccatttccat ccctgtaatt ctttcagaca ggaacaatta tctgtcagag
17880atgtgactgt ggggtggaac cccatccctc acttgatgtc ctgtcttcct gctggaggtg
17940ggctctatac gttccctctt cctactgtca gccttacatc tctgaatgag ttcaatcttg
18000cctgatccta aaaggtctag cctgattaat acttgggaga aagagtgaga ttctgtcttc
18060aagagtcaat tactggaact tacactgtat tcagtggtag agcttgcctg acataactaa
18120aggctttgag tacaagtatt attgcaaaac agtattttca gttatctgag aaagcttctc
18180attgattaat ttaaatttaa agagatttca ggagtttttc tatggcaatg gtggtaatag
18240tcatctttta caatgttttc atctacagag tacagttaac tgaatccacc atctggaatc
18300tatttacagt catgctcttt tatggaaatt aaagactgtt ctgaatttat tcctaaaaat
18360tctttatcta aactgccaag gtgggggtca gtttttaaga tgtgaaggaa tacaactcaa
18420tcctggggag tagctagtca tgactggtct ttgtgaagta gagattttag tagagaggac
18480tccctgtctg tgctgatcta cctggaagtg aaagcttatg tggaatgcct tgatggaact
18540tcagagtaga gaccagacac attaacttat gcataggcag agctcccacc cccaaaccct
18600tgtgaggcag tcaggttgcc tctgtggagc tgctgcacac actgactagc agatctcaat
18660ggtcatcagc tatttcagtg gtcagaaaat cgatcgctgt gaacatacca tgcaggcttt
18720cagcttgctt tttttagggt tcccagtttt aatggactac ttgttaatga ataatttaag
18780ttcttccagg aatggaagtt aatgaaatct cagttaatgg ctttgagtac tgagtgctga
18840taaaattgtg tctcttacgc taatactttg gaaatcattt gacagctatt acaatgatgg
18900gcaacagatg aggagaaaca gttgtgtaac tccagcccga ctgcagagct gatgcctttg
18960atgtgtgtga tgcaggtgta gaaagacaat tttcttagtg tttctattgc tgtgatgaaa
19020accatgacca aagcaactta gggaggcatt tggctggtgt gtcctgaatc ccagtccatt
19080gagcaaagtc aaagcaggaa ctcgacaggg taggaacctg gaggcaggac ctgatgtaga
19140gaccatagag aaatgctgct tactggcctg ctccacatgg gttgctcaga ctacttttta
19200atggaatcca gaaccagtag cccagagatg gcactgccca caatgggctg agtcctccca
19260catcaattac taattaagga aatgctctac aggctcccct acagccaaat cttatgaaag
19320tattttctaa atcaatgttt cctcctttca aatgactgta gcttgtggga gttgacacaa
19380aaatgagcca atataatgat aatatttgtt atgtggcagg tgtgagtgtt attgtgtggc
19440aggtacactc aagtgcatgc atgtgtgtgt gtgtgtacga gtgtgtgtgt gtatgcatgc
19500acaccattgt tgaggtttgt tgttatttaa ggtttgtgtt tattcatatg cattaataaa
19560ctttaagttg gatgtagatg ttggatgtta tttaaggttt atgtttattc atatgcatta
19620ataaacttta agttggatgt agatccagac attagaatgt ctataattta acctaattta
19680aggttctttt gttaattctt ttgtagactt agtttatatg tgagcgttta tagacacatg
19740tttcacttgg gcatgcttat tcagacgctg gtagtgaggg atgcccacag cagaaaagct
19800aagacggtga ttgtgcaatt actgggtttt acacaatcat tgaatatctt agacaacttg
19860gtatattctt agaaagttct gataagcaag gacatttgtt ttactactga gtttatctaa
19920aactaaaagt aaattaattc atgtttttgc ccacattggt agcgtctcat tcattccctc
19980gatggaactc agaatttcag tgtattccaa attctgtctt ttagtcctta actagtcctc
20040tgcattaaat gtggacatcg ctttgtgttc ctcaggttgt ctggttttgt ctttgtagct
20100ccaggggcct tatcaaagtt gacccgagga ttgaaagatg agtcactggc ttatatctat
20160cattgccaaa atcactattt ctgtccaatt ggctttgaag caacccctgt gaaagccaat
20220aaagcattca ggtaagcatt ccatacttga taaaggcaca gcactaggaa aatttgatat
20280acaaagtaat tttgggaatt aaacaaattc tagtctctta gaaattggga agcatggtaa
20340tgtgttttat tcaaaccaag tgtgcatatg ttttgcagga ctgattgcac ttcagagctg
20400ttgctcagtc attgtaagaa aaagccatct gatgtaccac ttttatgatt agtatacaca
20460gatgcagggg ccttcccatg ttattccctg tgtcctctcc gttgccatac atgtccttgt
20520gtcaagagtg ctacaactaa gtacattaag aagcaccttc tatgctcata gtgaagaact
20580aaggaattgt taggattttt ctccaaggga gctcaggacc agtttcatct ctaatcttct
20640tggttatatt tgcctcttgc ttttcttaat ccagtttctg tctttttgaa caacgtgctt
20700ttacacacac ataaacttgg gtctctttga aggagatgct tatgtgctga gaggttgccg
20760agtttgaatg ttattctctt taccaagttc ctcataaggg acctgaaaac agaggttagg
20820tttaagaaca ggtggcgtga ctgtcttacc ccgctgctct gctggttgat gcacccttgg
20880attgctgaga ttggtgatta gggaggatga aaagattggg aggctcttgg aagttatctc
20940ttaggtgaag gcccagattc tggctgaaga acaggcttgg cctttaggac tggagggttt
21000ttatgtactg gaggtcatgg atccctggag tcccactcaa aggtccttgg atgtggtgcc
21060taggggaacc tggaacagag gagcagaagt tggaggactt aggaggctgg atactacttg
21120tgtagaagga gcaagagtaa cactgctttc tctccaactg ctgatgtgct cggttcactt
21180tctggagtga ggggcacttt tgcacttctt tgatggctcc aacctctgcc tttaccttcc
21240ttcaggtacc tatcaaatgt aaaacatgcg ttgacttcta ctgctctgct cctttaatag
21300ttaggcttcc ccagtgaatg cctgtgccta tgaaccatct gatttttttt cttaaaaatt
21360atgtaaatct ccaggctcct ctccagactt gaattagagt ttctggggat gggaccgaaa
21420cattgtgtgt ttaaaatcat catcatctcc tctgatccaa gggggtgata tgttaaaatg
21480cttataaccc tagattttga atatggaagt aataaagctc aaacctaatt caaggatttc
21540tagaatggga ccaaacctaa gtgctcatat ccatcctaga ttaacaaagc catctttgag
21600tatcagtgta ttttagttca aacagcattc tctgggggag tgaatgccat ttatcaagct
21660gattgtgaac gtattttctt tttagcaggg ggcccctctc ttcacaagaa gtagaatact
21720ggattttaat tggagagtca agtagaaaac accctgccat tcactgtaaa aggtatgttt
21780tagtagcctt tatgttttct ttaattggta atagttccgt caccgaaaga acagatcaaa
21840attgtgtaac tagatgcatg tagataaatg tgcacacttg tgtggtctgt actctgtaca
21900gtcttgccac agaaagttct gcattctact tctctccaat ctcacagcta accggttatg
21960tttcccatta ccacagaagt cattttcttc tctctctctc tctctctctc tctgctttcc
22020catataagta acatattaag aactttttct ctaaaaataa ttagaattgg ttcatttctg
22080ctaaaagtcc tgatgcatac ttgattaaaa taaaatcttc aggactggtg tggtggctta
22140ccaggtacag gtgcttgcat gcagtcccga tgatctgagt tagattccag gagtcaacat
22200ggtagaatga cagaaccaat agttctcttc caaactccag acatacagaa tggcatgtgc
22260atgcatgcgc ctgtctgtcc tccctccctc cctgttagtc tctttctctc ccctactctg
22320tttccccaca cctgctgaga gctcctggcc cagagagggt agagacaggg catagagtag
22380gactctggtg tggctttaaa gactgcagat ctccagacag tctagctctc taactgtact
22440aaaatgctgg agataccttc tgaagtccta caagctttct tgctaattct gagagtaaaa
22500agctgttggc ttaggtgatt gacacctgta gcctgcagtg ggggattagg taaagtggcc
22560ttttcctatc tcagcaggaa ctgcacagct ctaactttca gcctgctagt agaagtccca
22620ggaagaaaaa gggactcttt gaaaagtgtt tatagcattt attactaagt tgtaaaaagt
22680tacccatgaa agctatctag gagaatgggg ggtagcagga agatcctaag tggggcaggg
22740aataattttc cctttggatc aataaacttc tattgaacca ggagacaaga ataatgcttg
22800cagaaggaaa aactttacat aagcaaatat gcataaatgc acatgaattc atatgcagat
22860tcacacatac acatttatac atatccattc aaaccagtta gatccagctt gcattcattc
22920ccacaattac acacacacac acacacacac acacacacac acacacacac acacagatcc
22980atacttacat atgcatttga aacaaaagac caagcaccag tccagaaaga actcacttat
23040tctggatgaa aaatgagctc caatctttat tgcataggga gagaaaaggc ttctaccatt
23100tgaatgccaa atgaattaaa ctcatgtttg taagaagaaa gctttattcc ttttaatagt
23160actaagcttg ccttaagagt agacctgttc tgttgcatgc taagaagcct gcctgcttct
23220tctgctctct cttaagtttt gtttatatct gactaaaaaa tgttttgtct aaaaagattc
23280tcctcagttc agttatgact cttctctttg acctcagctc tgaaacacct ttcaggatac
23340atggctacac gattaaaagt tcaccacaag tttacacaaa ttcaaatcat aactagaata
23400ggaagtttac gacagagaat gtttacatgt atatccatta ggagtaatta tctggctaac
23460catttgccac ctgtcacagc tccacaggtt cactggaagt taaaaactgt aactgtgaag
23520gtttgtatag atttaccctg tcaaaatgtt atcttctctc ctaccaccta taataaattg
23580ttagttccct ttttatgacc gtagttaatg gttgtagaaa gtctggttac tatctagctg
23640gcgacacaat ggaagactgg cagagttctc attgcagttt tgacaatcag aagagaccta
23700atagcacttc cactataaaa gagcttagta aatactgata taattttagg aattcttata
23760gggtcatcat taagaattaa gaagtcattt atttgtctat atagcattac tacaagacag
23820tacatctttg tagatcggca gaaagctact caaaagggtg ggctaatatc tagtgattgt
23880tgtatatgtt taataataac aggagaaata tattaatagc aggaatcttt cctaaaatga
23940ttttcccctg ggccttgcct atgggatcaa atctgtcatg gattacaatc cgaggcagac
24000cgtctcagga agatcacctg ctagttatta gcttgtccta tgatggctcc tgacacacac
24060ccttgctctt tttcactatt gcctgtgcat acacacagac tgacacaatc tgaggattca
24120tggtgtagtt aagggagtac tgacatcata atgacatgga ctcttacaag ctaacatcaa
24180ggactctatc accacatact cagaacttta atttccttta gcactgattt tgatttttca
24240aaaaaaaact cataatgtcc tttaaatgat ttttcctatt gtaaattaat ttaaaaattt
24300cattttatgt tattgctggt aatataaaga aaaagaactc cccgatttta ctatatagac
24360cttgattttc acatctttgc caaatttact tattatttgt ataaaatgtg ctttgatttt
24420tctccacaag cattttatta gtatattgtt tattagtaat tctgacagaa tgtggcacat
24480agtggcacac tggtaaatga agcaaaagga tcctgtaaac caaggggttc aaggccagct
24540tatacaacat acaaaaccat accccattgc tgcccccccc ataagaaaat aaattaacga
24600gtataacaga aagtgtagta tgccactgct ttagcctgcc aggtattatc tggtatgttg
24660acatataaga ataggaccag ttataaactg agaacattga cagtaattgg ataatcataa
24720ctgtcactta cacacaaaaa agaagctttt aaagtctagt tgatggaaca atgctgacct
24780aagagaaaaa tcaccaatta ttgtggcagc ttgctcatgg aaaaagctta ctaaagtggt
24840actttgaacc tgaagaaaac tgtccttttt cttactatag atgggcagat attgtcactg
24900atctaaacac tcaaaatcca gaattcttag atatccgaca tctagagagg gggctgcagt
24960tccggaaaac aaagaaggta agaatactat tacattggaa gtaacttgtc gaatgaccag
25020attgcaatat tttattaata atcttgtctt ggtaaataac agaaaccaag ttcaaactac
25080cttagtcagg aaagtaagat tattttaagg attcaggcta actttaagct tccagctaag
25140gatagtcata aggaaatgac acacagcact gcagtaatgg tagaatggct tatgtaggct
25200gtgctgtact aagtgcttta tctgaatgta aatttcttga gcaataactt tcaacttttt
25260attctcaggt gtagattgag acttaacatg ccttggttga gctccatagt cttggcctaa
25320gttgggtaaa ttatggtcac aagacagagc caggtgttca agcatgacta cagaaagaag
25380ctttcagaaa cagatatagt atagattgag taggcaagct tttgctataa atgaagctgg
25440gagtggtagc acatgcttgt aatctcagct cttaggcgag aggattttca tgctatctta
25500aactattgag attattatct caaaagtctt gtgtaaacat acacacacac acacacacac
25560acacacacac acacacacac acatgcgtgc gtgtgcacgt gggagggaat aagataggat
25620tttagtagaa tggaggagag gagactacac acacagcaaa ggaaaagcat gattttaatt
25680tgtgagtcag gaatatgtgg atatgactat gccgactcta ggctgtggag gggcatatgg
25740gacacagtgc tctggtccct aagggcttta ttataatgta aaggcaggtg ttgggaaagg
25800tgagtatgtt aaagtgttct gataaagcag agtagtagtg ttttcagaca tgtggaagaa
25860taccaccatc acctcaaggg agtcaataag agccctaaaa acttacaaac ttattttctc
25920tttattgtct tcatcttttt tcccttctca ggttggagga aatttgcatt gcatcatagc
25980attccagaga ctcagttggc agagatttgg cttttggaac tttccatttg gaaccattac
26040acaagaatca caacatccca cacatgtccc gggaattgcc aaatctgaga gtgaggacaa
26100tatctctaag aagcagcatg ggcgcctggg caggtccttc agtgcgagtt tccatcagga
26160ctcggcatgg aagaacatgt ctagcatcca cgagaggagg aacagtggct accacagctt
26220tagagattat aatggcaatg actgaccatg ccaaaactta gccactggtg ttacccacac
26280agctgttatg tacaggactg cattaggaca tcagctggtt ttattaagtc tgtcaatagg
26340aacagatttt gtggtacaaa acacaccctg tagttctcta gtaaaaaagc ctacatagga
26400ttactatggt tggcttcaaa tatacaggca ggtaagcaca gaaccccgcc cttctaaagt
26460taaaagtaga taagcaatct ggacaaaggg tttcacaaaa tccaatacaa tcaaaacggc
26520ttcaaagcaa aaacacaaat gcatttaatt tgaaaagcat cgaaacttga actacttaag
26580catgaagcga cttattgata cttgatccct agcatttatt acaacacttt aattcctaag
26640gcatcatctg tccttaaaaa atgggggcag tcaaggtcta gtttttgctc atggttaaaa
26700ctaatttaaa attatctttc tagtctagtt gttctttcag tgctaacagt atccacctcc
26760catcgttgct ttcctgaata actctcagga ttctccaaaa agcagcagaa actactccag
26820aaactgacct tttctctagg tgcagatagg tgacttaggt cattgatcct gatactcttg
26880acttggcacg tggttgtgaa atagctacaa gaagaatata ggtctggagc gaagtctgat
26940gttctagaac aaaccttgtt tcagggatat agttagagag cacttggcat ccaaagtttc
27000cttatccacg gtaacatgtg ctgtgagatg tcacatttga cttgtctctt aatggagtca
27060tgtgttaaca acagcactga tgtcatgttg gcaatgtcca gctcactctg aggaagactt
27120tgtattttca actctgagcc gtttcctttt gtgaaacctc caagcaatta ggtgttggaa
27180gtgtgagtta catattctgg aagtgtgagt tcaatacttg agctcctctt tagcggctct
27240tgttttcctt ttgctgccaa ggtgtgactc atagccgtct atgatgctgc tctttcacgt
27300cgtaggttta ttccaggatt caaatcagta acttggtgat tacaaggtgc tgagtatgtt
27360ggaaccattg caatacacct caaagggagg tgtcggattt tgacttttta aaaaaaattt
27420tcatttttct cttgaatttc atatccatct atccactcat atatgtttag cctacagaat
27480tacaaactag tcctgtttct gaagaggttc tttagcttga aatgtaaagg actgaaagat
27540ttgtaggtgt tgcttttgtt acttcacact ggaactttga aaatgttttc atcaaataaa
27600gttttgtttt ctacttttaa tcctatgaat tttaatgtct atgtttaagt tagcgtgtat
27660tcttgtaact gtgtgaagca gatgataatt tgctaattcc atgtaatcag tgttataaga
27720aacatcttac aatttttata atcaccggaa caatgtgaaa agccaataac ttccattcca
27780tgcttgctac ttttcaagta cttgagcact gattctctaa tccttcacaa cgtgatttag
27840gaattttctt atattaatta aaaaaaaaag gttggaattt ttgtggttca ggcatatttt
27900caatataaac tgcttttaat agttcaattg agtatttcaa agcaataggt ttgaagagct
27960aagaggaaag aatacaaatg caaaataaga tgtaaacctt aagaacaaag tgacagctgg
28020gaaaataaga tgagtttatt tcctgtacac taatcatatg ctttattgaa atcactgaca
28080gacatctgac cacttaaaac ttaagcttac agatttaaag atgacttaga gcacagaatg
28140tttgaattca gtgggagttt tttttttttt tttttttttt tttttcaggg tatgtaggta
28200agaggtcctg caaagtccct tttgaaaact agtaatattc ctatttgatt tgttagtcaa
28260cctgtcttag aaattgtcat agatcttcta aggaaggata ctgattggct cagaagttac
28320aagttgtctc acctgtgaga ttatgtcctc tttggtagtg atgtggaaga acggttacgc
28380taatgagcat ggtatttata atagtagttg aaaagctatt taaatgcttg ctataattat
28440tattttgaaa atgtgttctt aattgattag tctgttatca gcatggcaac ctgcaggcag
28500gcatggcact ggagaaggag ctgagagtcc tacatcctga tcggaaggca gccaggagag
28560gacagtcttc cacaggcatc caggaagaag gttcttttcc acacgaggag gagcttgaga
28620ataggacctc aaagctcacc cctggtgatg aacttcctct gacaagggca cacctcctaa
28680gagtaccact tcccatgggc caagtatttt gaaaccacca catttataga acctgttaat
28740cctttatgca ctactacagt tacaggggta tatcccatca tagaaagcag gtatagcaac
28800tgaagtgttt ctcaagttgc tttgttacag ccatggagtg agcctaggca accactccag
28860aacctatggc attatttaac tcagtttcac tcttagcatt tctaccctgc caatctttca
28920cttaaaaaaa aaaataaaga caaaacaaaa gccctgatct ctatgccatc actttctagg
28980tgctttcata ggaaaatcag aacagcaccc cagctgttgg tggtagtgac gtgtgccagc
29040taaaactgag gtttgcattg gtggagcatc ttgcctacgt ttatagagat tcttagtttg
29100ctagagcttt cataataaag tactcaggtt gcttaaaaac aagtttgttt ttgtttcgtt
29160ttttttttcc ctcacagtta catagaccaa gacaaggtct ggcacatttg gtttcttatc
29220ttttggactt gaagctggtc accctttttg tcctcccatg gcctttcctg tatttgtggg
29280catactttat gatgttccca ctccttctta ttagtataac cgttatactg gattagaggc
29340caccctgata gtctctctgt atatcaatgg tgaggtagaa atcacatcca tggacttaga
29400cgtatcttca tgtttgtgat ctattactga ccatgactct gtcagacaca gaaatggttt
29460tgttaagttc tggttcatgt gcccttaggt gcctaatggc cttattgttt tgacaatcat
29520tttctaaaaa tcactttgaa tacaggcaca ttgtgcagta cttatattca acacgtgaat
29580tttgaattct acatggatac tgacctgaat acatagtaat tccgcggtcc agccaagatc
29640tgaactatca tctgcatgac ctctactcca aatattttcc acaggatgca aaatgtcctg
29700gggagcatca gacctgagac ttgagaggtc acttaagcaa cattgacgaa ctcccgctat
29760gcactactaa ggaaggggtg taacactcat caagattgat accagtttca acatgtttac
29820agtctactgc tggaagtgat aaaaattaaa caggcaatta aattgccttc agtaaaatac
29880gactttacta aaatgcaaaa gtccgtaatt ttacacattg agggaatgct aaactgtaaa
29940ccacctgttt ggacatagat acgtttttgc taaaatttga cttctgatag acttcagctg
30000taaatgaatt tttttcccca gaaattatct acttctatcc cttccttttt tttttttttc
30060ttagacaagg ccatgtgagt cttgagcttg tgatattcct gcccatctcc cagatatttg
30120tgccaccatg tcgggctatc tattgtttct tatcaatctt tttactgggt ttgaataatt
30180acaataccgt ccatcaaaat tgagccattg tatgctaagt gctttgccca tattcactct
30240gctcggtaaa agattgttct tttgtactgc ttacagatga cggagaagaa attctgaaga
30300aacgtaatct ccacagagct aactctatag taaggcctca aagggctgca ttcgatttag
30360aatgtgcata tcgtgtcttt tcctggctct actctgcccc aagtcatttt tcccaattgt
30420ggaactcaaa gtttcagatg ctggaggaaa ttcaaagtta agcactgcgt tgtactgcca
30480tccagttact gcacgttgtg aaaatatttg ccttcaccaa agtcaagatg ctcatctttc
30540cgttcttttc ttgtcgaacc caatacttgg cgccaaatcg gcaaagctgg ggaggcaatg
30600gggagactga tactgaaaat gcacaaagga gacctcactc cattaagaaa gccaatatgt
30660gcaaacactt tttgcgccta acagtcagcc gaacagaggc acagattcaa aaccgaatcg
30720ccacacacac tgacttcggg agtgctcggg cgtgggattt acgtgtcaag tgtcaacgcg
30780ggaatcccaa ccctcgcgtt cacttggaag ccaagaggtt tcatcgtttg gtcagcggga
30840gaaatcactc gcgaccggag gaaggctcgg cgctggcgtc accggtgctg cttctcagta
30900cttttaggcg tgccagtact ggggctgaaa cctttctgct cctcccacca catttaaatt
30960ccgctgttct ttcgggagac cgggtccgct ccggaagtgc gtcagcgccg tgttcctcct
31020tcaagccctg gtgacagcgg gttccaggcg gagagcggtg ggctctgcgt gagttggtgg
31080cgctgcgtgc tggtgcgggg cgcgtccttt atcctaacgg gaaacaaagg ccccgcgagc
31140ccggccgaga gagagcgcct gcgccatggg agtgcagggg ctgtggaagc tgctggagtg
31200ctcgggccac cgggtcagcc cggaggcgct ggagggcaag gtgctggccg tgggtatcct
31260tcaaggcggc tccggagcgc cggatgagcg cctgctcggc gactgggctt cgtccgggac
31320ttggtgctgg gggtcgattg ggtgtctgtc tatctagagt gcaggcctgc gctccttagg
31380ccggcccatt ttgtgccccc tatgctttag ttctcacagc agagtggcta cagagttgtc
31440gcggagaata aggagattta aatgataaaa tttggtatag cgtcaggctc cgtagtactg
31500tacatctgag aagtcattct gtcagagtag agggggggaa aaaaaagaag agttccgaac
31560cctcccaccc tcctttttga atagttacct cttgtttgtt ggaataaaaa aagttcatac
31620ggtgtatagt gaattagttt cctccagctt ctgtgatggg tgtgtgcatt ttagttttgt
31680gtcattggag ttttgttttt ctcccacttt gaaaatgtga taccaatttt gaagttttgt
31740ttacgtttat ctgttccacg gcccgtttca cagtgcctct tttatcttag aaggtacttt
31800tacatagttt tcttaatgaa aactagaaat tcctgtaaat agttgctgaa tgtgcgtaca
31860tgcctggacg tgtgtccgta ggtggtggca tgccatgcca ttctgtaaac tatgtcagtt
31920cttcacgatg tgcccagctt ataattaatg gagaacttgc agttttgaaa aatcacattt
31980tatgccaaat acatcaatga caaaaattaa aaatttgtct tacaagtctt tgcacagggt
32040accataaacc ctagtggttt ctcggcgtca ctgctgttta aggatttctt cattatttat
32100gatggaagaa caatttttag tgagggggtt cttaaattct catcaaaggg ttttagtagt
32160ggtagagcac tttcttagta tgtatgataa accagagttc cattcacatc tttgtgagaa
32220agtattgctg acaacaccat caaggagtta cgttattttc tcttccagcc cttataaaca
32280attaggaagt taagcatcct agcattcagg gttgtcaaat gaaaagtaat ccatgtagat
32340tattcataaa tggctgtgtt ccatttgggg tgtgaagtta tgtttatgag gttattaagc
32400tcttctgaaa taatggatcg tttaaagaaa atcgtgaggg attttgccta cactcacatg
32460taatctggtt acttctgtaa tttaattttc tgtgtttcat catgagaatt ctgatatctg
32520aaatttaggt gttagaaatt tattcggttt gctaataaat aaatcatcca caattatctc
32580aggtttgtaa tttacatact tttccaaata aactgtttgg gttttagacc atgtgaatag
32640ctgagtgtat tatttaaaca gtttatgtct gtgtcaaaaa aatgttagga aataacagat
32700tagtttttcc tcattattga ccttaacatc tgttagtgca ttggcagtat taagcagata
32760gtaccctcac tggacacatg aacaacatgg attaacttag tctccagtca cctaattcca
32820ggactaagga aaattcccat ttattttcag atgttattgg ttctttgagt ttaaattcta
32880aagcaagttt tgactttgcc tttgcttcat tctttaattc ttctaaccct tgaggagttt
32940agttttgcca catttgtgtt gaagcttgac taatcaaact gtgtaaattc tttcagccta
33000tctgatttta tctgtttttt tttttgtttt ttgttttttg ttttttttta actaagagtt
33060tcatgttgga actctcggcc ttccatgaga agtggtaagc tgtggttaat ttaaatgtga
33120aaaggaatta tttcagtgtg atttagtgtt ttaggaacat ggggtttaat cttttctcag
33180aagctgtagg ccctagactc atatatatga gttaagaggt atcttaactc tgaaattgag
33240ttttctcaat tctaaagtca aaagtgattg attgtataga cacttttgat ggcagtgtag
33300caagcagaaa tggaccaagt atgagtagct gtccagagta gaactgagac tgcagaaggc
33360ttgctctaag ctggatcctg gacctgatca ctcatgtctg agacctacca tcactatcat
33420ctcctttcat gtgggctgct ggaacattct tttttttttt tttttgttat atggattttc
33480atacttcttc tttcttgtat tagatatttt ctttatatac atttcaaatg ctatcctgaa
33540agttccctat accctcccct actgctgtaa cattcttaaa actactctcc ttgcagttga
33600tttttacatg tctcctaagg cccgaagtca ttctggaaac tgctcagatg tgtatattgt
33660atatcagctt ccatacccaa gtgccccacc cctgcgccca ccacagggaa ctgtgtagac
33720gtggctctgc cctaggcgtt tgtatttgct gcgtctcgtc tcccttctgt aactagagtg
33780ctaaactcat aactactctc ctgacctatt atttcttggc acttaatgtg tcaattcttt
33840taggatatgt gcatctcaga cataggggtg catttgtctt gctcacaaat acaatgttta
33900atgtatctcc gtagggtctt ggatttactt ttacatggga gctctttaac atatagggat
33960ggagatacac acacacacac acacacacac acacacacac acacacacac acatattgtc
34020aacttggcat ggcatgagag tcatcaagag tcaattgaga aaattccttt ggaagaccaa
34080gcaagactat agggcttttc ctaattagcc attaatggga gagggcccag ccaattgtgg
34140atggtgccac ccctgggctt gtgtcctggg ttctataagc aggcaagctg agtaagccat
34200gaggagcaag cagctaagca gctcctgtac ccaagttcct gctctcttgg agtcgctgcc
34260ttgattttcc tcagtgatgg actgtgatgt ggatgtgtaa gccaaataaa ctccccagtt
34320tgcttttggt tatggtgttt tatcatagca agtagaaatc ctaagatatt ggcttaaaac
34380acaaaataca ctagcaactt ttgcagtagt aaatgaataa ctgtacatta atttttattt
34440atttattttc ccttaatttt tttattattt aaatgcattt tatacatcaa ccatattaat
34500aatattgagt atttttataa tacataaaaa tgttcaactt ttatattcat atcctttcag
34560ac
345622915DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon A1 splice donor
sequence 29cggccccagg gtaac
153015DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon A2 splice
donor sequence 30tgtgatccag gtccg
153115DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon A3
splice donor sequence 31caggccagag gtacc
153215DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon B
splice acceptor sequence 32ttccctaaag gaatc
153315DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon B splice donor sequence 33tttctgtcag gtgat
153415DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon 1 splice acceptor sequence 34ttcctcttag gagct
153515DNAHomo
sapiensmisc_feature(1)..(15)BIVM Exon 1 splice donor sequence
35cacaaatcag gtaag
153615DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 2 splice acceptor
sequence 36tgtattctag gcaat
153715DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 2 splice donor
sequence 37tcagacgatg gtgat
153815DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 3 splice
acceptor sequence 38gtgttctcag gtact
153915DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 3
splice donor sequence 39gagctggaaa gtaag
154015DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 4
splice acceptor sequence 40tcttttgtag ccttc
154115DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon 4 splice donor sequence 41cacttatgag gtatg
154215DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon 5 splice acceptor sequence 42ttactttcag gtggt
154315DNAHomo
sapiensmisc_feature(1)..(15)BIVM Exon 5 splice donor sequence
43ggagaaactg gtagg
154415DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 6 splice acceptor
sequence 44tttttaatag cttca
154515DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 6 splice donor
sequence 45aagcattcag gtaag
154615DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 7 splice
acceptor sequence 46ttatatttag caggg
154715DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 7
splice donor sequence 47actgtaaaaa gtatg
154815DNAHomo sapiensmisc_feature(1)..(15)BIVM Exon 8
splice acceptor sequence 48ttaactatag atggg
154915DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon 8 splice donor sequence 49aacaaagaag gtaag
155015DNAHomo sapiensmisc_feature(1)..(15)BIVM
Exon 9 splice acceptor sequence 50ttcttctcag gttgg
155130DNAArtificial SequenceHSMAP5 primer
51ccatgcctct ctactactca ctcccaacac
305230DNAArtificial SequenceHSMAP6 primer 52ggtaagaaga acaccattgt
gtttgaaggc 305330DNAArtificial
SequencezfbivmMAPF1 primer 53caatgcctaa cactgtggaa agtgaaggcg
305429DNAArtificial SequencezfbivmMAPR1 primer
54gataactgtc gagctcggtt gagcagggc
29558PRTArtificial SequenceM1 amino acid motif 55Gly Xaa Xaa Xaa Xaa Xaa
Xaa Cys1 5564PRTArtificial SequenceM2 amino acid motif
56Trp Xaa Arg Xaa1573PRTArtificial SequenceM3a amino acid motif 57Tyr Phe
Cys1583PRTArtificial SequenceM3b amino acid motif 58Tyr His
Cys1595PRTArtificial SequenceBIVM N-terminus region of homology 59Arg Lys
Xaa Leu Asp1 5606PRTArtificial SequenceBIVM C-terminus
region of homology 60Gly Gly Asn Leu His Cys1
5619PRTArtificial SequenceBIVM amino acid motif 1 61Gly Asn Thr Thr Leu
Met Trp Arg Phe1 5629PRTArtificial SequenceBIVM amino acid
motif 2 62Tyr Phe Cys Pro Ile Gly Phe Glu Ala1
5639PRTArtificial SequenceBIVM amino acid motif 3 63Trp Phe Arg Gln Ile
Asn Asp His Phe1 56410PRTArtificial SequenceBIVM amino acid
motif 4 64Tyr Arg His Gln Asn His Tyr Phe Cys Pro1 5
10655PRTArtificial SequenceBIVM N-terminus region of homology
65Arg Lys Xaa Leu Asp1 5
User Contributions:
comments("1"); ?> comment_form("1"); ?>Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
User Contributions:
Comment about this patent or add new information about this topic: